gnunet-svn
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[GNUnet-SVN] r9949 - Extractor-docs/WWW


From: gnunet
Subject: [GNUnet-SVN] r9949 - Extractor-docs/WWW
Date: Fri, 1 Jan 2010 14:24:36 +0100

Author: grothoff
Date: 2010-01-01 14:24:36 +0100 (Fri, 01 Jan 2010)
New Revision: 9949

Added:
   Extractor-docs/WWW/index.html
Removed:
   Extractor-docs/WWW/libextractor.html
Log:
rename

Copied: Extractor-docs/WWW/index.html (from rev 9944, 
Extractor-docs/WWW/libextractor.html)
===================================================================
--- Extractor-docs/WWW/index.html                               (rev 0)
+++ Extractor-docs/WWW/index.html       2010-01-01 13:24:36 UTC (rev 9949)
@@ -0,0 +1,205 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" 
"http://www.w3.org/TR/html4/loose.dtd";>
+<html><head>
+<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+<title>GNU libextractor - GNU Project - Free Software Foundation</title>
+<meta name="content-language" content="en"><meta name="language" 
content="en"><meta name="description" content="a simple library for keyword 
extraction"><meta name="author" content="Vids Samanta and Christian Grothoff">
+<meta name="rights" content="(C) 2002,2003,2004,2005,2006,2007,2009 by Vids 
Samanta and Christian Grothoff">
+<meta name="keywords" content="keyword, extraction, mp3, html, pdf, images, 
jpeg, gif, ps, mime, real, qt, asf, mpeg, avi, riff, tiff, summary, summaries, 
kbps, format, mime-type, zip, elf, doc, ppt, xls, sha-1, md5, open office, sxw, 
dvi, id3, id3v2, id3v2.3, id3v2.4, thumbnails, exiv2, nsf, sid, flv, flac">
+<meta name="robots" content="index,follow">
+<meta name="revisit-after" content="28 days">
+<meta name="content-language" content="en">
+<meta name="language" content="en">
+<meta http-equiv="expires" content="43200">
+<meta http-equiv="content-type" content="text/html; charset=UTF-8">
+<link rel="SHORTCUT ICON" href="favicon.ico">
+</head>
+<body>
+<table width="99%" border="0" cellpadding="0" cellspacing="0">
+<tbody>
+<tr><td colspan="2" width="99%" bgcolor="#99bbff" align="center">GNU 
libextractor - a simple library for keyword extraction</td></tr>
+<tr><td valign="top"><table width="15%" border="0" cellpadding="2" 
cellspacing="3">
+<tbody>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="http://www.gnu.org/software/libextractor/";>Home</a></th></tr>
+<tr><td bgcolor="efefef"><a href="#about">About</a></td></tr>
+<tr><td bgcolor="efefef"><a href="#news">Recent News</a></td></tr>
+<tr><td bgcolor="efefef"><a href="#contact">Contact</a></td></tr>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="download.html">Download</a></th></tr>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="documentation.html">Documentation</a></th></tr>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="http://freshmeat.net/projects/libextractor/";>Freshmeat Page</a></th></tr>
+</tbody>
+</table>
+</td>
+<td valign="top"><a name="about"></a>
+<h2>GNU libextractor</h2>
+<img src="extractor_logo.png" alt="libextractor" vspace="0" width="136" 
border="0" height="94" hspace="0" align="right">
+<p>
+GNU libextractor is a library used to extract meta-data from files of
+arbitrary type.
+It is designed to use helper-libraries to perform the actual
+extraction, and to be trivially extendable by linking against external
+extractors for additional file types.
+libextractor is a <a href="http://www.gnu.org/";>GNU</a> package.
+Our official GNU website can be found at <a 
href="http://www.gnu.org/software/libextractor/";>http://www.gnu.org/software/libextractor/</a>.
+libextractor can be downloaded from this site or the <a 
href="http://www.gnu.org/prep/ftp.html";>GNU mirrors</a>.
+</p>
+<p>
+The goal is to provide developers of file-sharing networks or
+WWW-indexing bots with a universal library to obtain simple keywords to
+match against queries.
+libextractor contains a shell-command <tt>extract</tt> that, similar to the
+well-known <tt>file</tt> command, can extract meta-data from a file an print
+the results to stdout.
+</p>
+<p>
+Currently, libextractor supports the following formats:
+HTML, 
+PDF, 
+PS, 
+OLE2 (DOC, XLS, PPT),
+OpenOffice (sxw),
+StarOffice (sdw),
+DVI,
+MAN,
+FLAC, 
+MP3 (ID3v1 and ID3v2), 
+NSF(E) (NES music),
+SID (C64 music),
+OGG, 
+WAV,
+EXIV2,
+JPEG, 
+GIF,
+PNG, 
+TIFF,
+DEB,
+RPM, 
+TAR(.GZ),
+ZIP, 
+ELF,
+S3M (Scream Tracker 3),
+XM (eXtended Module),
+IT (Impulse Tracker),
+FLV,
+REAL,
+RIFF (AVI),
+MPEG,
+QT 
+and 
+ASF.
+<br>
+Also, various additional MIME types are detected.
+</p>
+<p>
+libextractor is free software; you can redistribute it and/or modify
+it under the terms of the GNU General Public License as published by
+the Free Software Foundation; either version 2 of the License, or (at
+your option) any later version.
+</p>
+<a name="news"></a><h2>Recent News</h2>
+<dl>
+<dt>Sat Oct 24 21:09:18 CEST 2009 | libextractor binding for Mono updated.</dt>
+<dd>You can find the updated binding for Mono in the download section.</dd>
+<dt>Sat Jul  4 11:45:08 CET 2009 | libextractor v0.5.23 released.</dt>
+<dd>This release makes the RPM extractor work with the latest librpm
+library and links against an external version of libexiv2 (instead of
+using an internal, outdated version of the code).</dd>
+<dt>Fri Feb 20 11:24:50 MST 2009 | libextractor v0.5.22 released.</dt>
+<dd>This release fixes various minor bugs in various plugins and the
+build system. We now use libtool 2.x which helps fix some issues with
+multiple threads loading and unloading certain plugins concurrently.</dd>
+</dl>
+<p>
+<a href="oldnews.html">Older news archive</a>
+</p>
+<a name="links"></a><h2>Links</h2>
+<p>
+Related work:
+<ul>
+<li><a href="http://www.wotsit.org/";>File format database</a></li>
+<li><a href="http://getid3.sf.net/";>getid3, similar project for PHP</a></li>
+<li><a 
href="http://blog.thinkphp.de/archives/12-My-first-PHP-Extension..html";>PHP 
wrapper for libextractor</a></li>
+<li><a href="http://dublincore.org/documents/dcmi-terms/";>Meta-data 
categorization standard</a></li>
+<li><a href="http://hul.harvard.edu/jhove/";>JHOVE, Harvard Object Validation 
Environment</a></li>
+<li><a href="http://hachoir.org/";>Hachoir binary file parser</a></li>
+<li><a href="http://meta-extractor.sourceforge.net/";>Metadata Extraction Tool 
developed by the National Libary of New Zealand</a></li>
+</ul>
+Articles related to libextractor:
+<ul>
+<li><a href="http://www.linuxjournal.com/article/7552";>Reading File Metadata 
with extract and libextractor</a></li>
+<li><a href="http://servers.linux.com/servers/06/08/21/1558230.shtml";>How to 
recover lost files after you accidentally wipe your hard drive</a></li>
+<li><a 
href="http://www.gnucitizen.org/blog/all-your-metadata-are-belong-to-us";>All 
your Metadata are belong to Us</a></li>
+</ul>
+Projects that use libextractor:
+<ul>
+<li><a href="http://mediatomb.cc/";>MediaTomb, UPnP AV Mediaserver</a></li>
+<li><a href="http://witme.sourceforge.net/libferris.web/";>libferris, a virtual 
file system</a></li>
+<li><a href="http://evidence.sf.net/";>Evidence, enlightened file 
manager</a></li>
+<li><a href="http://fossology.org/";>FOSSology</a></li>
+<li><a href="http://gnunet.org/";>GNUnet, secure P2P file sharing</a></li>
+<li><a href="http://gnunet.org/doodle/";>doodle, index your disk</a></li>
+<li><a href="http://www.tracker-project.org/";>File indexer, uses embedded 
MySQL database (doodle uses home-grown suffix tree)</a></li>
+<li><a href="http://lobotomy-project.org/";>Lobotomy, experimental desktop 
environment</a></li>
+<li><a href="http://www.edge-security.com/metagoofil.php";>Metagoofil, Metadata 
analyzer for information gathering</a></li>
+<li><a href="http://launchpad.net/basenji";>Portable volume indexer</a></li>
+</ul>
+</p>
+
+
+<a name="contact"></a><h2>Contact</h2>
+<p>
+GNU libextractor is developed by <a 
href="http://grothoff.org/christian/";>Christian Grothoff</a> and <a 
href="http://compilers.cs.ucla.edu/~vids/";>Vids Samanta</a>.
+For questions about libextractor send email to <a 
href="mailto:address@hidden";>address@hidden</a>.
+</p>
+
+<p>
+Please send general FSF &amp; GNU inquiries to
+<a href="mailto:address@hidden";>&lt;address@hidden&gt;</a>.
+There are also <a href="/contact/">other ways to contact</a>
+the FSF.<br />
+Please send broken links and other corrections or suggestions to
+<a href="mailto:address@hidden";>&lt;address@hidden&gt;</a>.</p>
+
+<p>Please see the <a 
href="/server/standards/README.translations.html">Translations
+README</a> for information on coordinating and submitting translations
+of this article.</p>
+
+<p>Copyright &copy; 2009 Free Software Foundation, Inc.</p>
+
+<p>Verbatim copying and distribution of this entire article are
+permitted worldwide, without royalty, in any medium, provided this
+notice, and the copyright notice, are preserved.</p>
+
+</td>
+</tr>
+</tbody>
+</table>
+<hr>
+<a href="mailto:address@hidden";>address@hidden</a>
+
+
+<div id="translations">
+<h4>Translations of this page</h4>
+
+<!-- Please keep this list alphabetical by language code.
+     Comment what the language is for each type, i.e. de is German.
+     Write the language name in its own language (Deutsch) in the text.
+     If you add a new language here, please
+     advise address@hidden and add it to
+      - /home/www/html/server/standards/README.translations.html
+      - one of the lists under the section "Translations Underway"
+      - if there is a translation team, you also have to add an alias
+      to mail.gnu.org:/com/mailer/aliases
+     Please also check you have the language code right; see:
+     http://www.loc.gov/standards/iso639-2/php/code_list.php
+     If the 2-letter ISO 639-1 code is not available,
+     use the 3-letter ISO 639-2.
+     Please use W3C normative character entities. -->
+
+<ul class="translations-list">
+<!-- English -->
+<li><a 
href="/software/libextractor/libextractor.html">English</a>&nbsp;[en]</li>
+</ul>
+</div>
+
+</body>
+</html>

Deleted: Extractor-docs/WWW/libextractor.html
===================================================================
--- Extractor-docs/WWW/libextractor.html        2010-01-01 00:31:20 UTC (rev 
9948)
+++ Extractor-docs/WWW/libextractor.html        2010-01-01 13:24:36 UTC (rev 
9949)
@@ -1,205 +0,0 @@
-<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" 
"http://www.w3.org/TR/html4/loose.dtd";>
-<html><head>
-<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
-<title>GNU libextractor - GNU Project - Free Software Foundation</title>
-<meta name="content-language" content="en"><meta name="language" 
content="en"><meta name="description" content="a simple library for keyword 
extraction"><meta name="author" content="Vids Samanta and Christian Grothoff">
-<meta name="rights" content="(C) 2002,2003,2004,2005,2006,2007,2009 by Vids 
Samanta and Christian Grothoff">
-<meta name="keywords" content="keyword, extraction, mp3, html, pdf, images, 
jpeg, gif, ps, mime, real, qt, asf, mpeg, avi, riff, tiff, summary, summaries, 
kbps, format, mime-type, zip, elf, doc, ppt, xls, sha-1, md5, open office, sxw, 
dvi, id3, id3v2, id3v2.3, id3v2.4, thumbnails, exiv2, nsf, sid, flv, flac">
-<meta name="robots" content="index,follow">
-<meta name="revisit-after" content="28 days">
-<meta name="content-language" content="en">
-<meta name="language" content="en">
-<meta http-equiv="expires" content="43200">
-<meta http-equiv="content-type" content="text/html; charset=UTF-8">
-<link rel="SHORTCUT ICON" href="favicon.ico">
-</head>
-<body>
-<table width="99%" border="0" cellpadding="0" cellspacing="0">
-<tbody>
-<tr><td colspan="2" width="99%" bgcolor="#99bbff" align="center">GNU 
libextractor - a simple library for keyword extraction</td></tr>
-<tr><td valign="top"><table width="15%" border="0" cellpadding="2" 
cellspacing="3">
-<tbody>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="http://www.gnu.org/software/libextractor/";>Home</a></th></tr>
-<tr><td bgcolor="efefef"><a href="#about">About</a></td></tr>
-<tr><td bgcolor="efefef"><a href="#news">Recent News</a></td></tr>
-<tr><td bgcolor="efefef"><a href="#contact">Contact</a></td></tr>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="download.html">Download</a></th></tr>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="documentation.html">Documentation</a></th></tr>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a 
href="http://freshmeat.net/projects/libextractor/";>Freshmeat Page</a></th></tr>
-</tbody>
-</table>
-</td>
-<td valign="top"><a name="about"></a>
-<h2>GNU libextractor</h2>
-<img src="extractor_logo.png" alt="libextractor" vspace="0" width="136" 
border="0" height="94" hspace="0" align="right">
-<p>
-GNU libextractor is a library used to extract meta-data from files of
-arbitrary type.
-It is designed to use helper-libraries to perform the actual
-extraction, and to be trivially extendable by linking against external
-extractors for additional file types.
-libextractor is a <a href="http://www.gnu.org/";>GNU</a> package.
-Our official GNU website can be found at <a 
href="http://www.gnu.org/software/libextractor/";>http://www.gnu.org/software/libextractor/</a>.
-libextractor can be downloaded from this site or the <a 
href="http://www.gnu.org/prep/ftp.html";>GNU mirrors</a>.
-</p>
-<p>
-The goal is to provide developers of file-sharing networks or
-WWW-indexing bots with a universal library to obtain simple keywords to
-match against queries.
-libextractor contains a shell-command <tt>extract</tt> that, similar to the
-well-known <tt>file</tt> command, can extract meta-data from a file an print
-the results to stdout.
-</p>
-<p>
-Currently, libextractor supports the following formats:
-HTML, 
-PDF, 
-PS, 
-OLE2 (DOC, XLS, PPT),
-OpenOffice (sxw),
-StarOffice (sdw),
-DVI,
-MAN,
-FLAC, 
-MP3 (ID3v1 and ID3v2), 
-NSF(E) (NES music),
-SID (C64 music),
-OGG, 
-WAV,
-EXIV2,
-JPEG, 
-GIF,
-PNG, 
-TIFF,
-DEB,
-RPM, 
-TAR(.GZ),
-ZIP, 
-ELF,
-S3M (Scream Tracker 3),
-XM (eXtended Module),
-IT (Impulse Tracker),
-FLV,
-REAL,
-RIFF (AVI),
-MPEG,
-QT 
-and 
-ASF.
-<br>
-Also, various additional MIME types are detected.
-</p>
-<p>
-libextractor is free software; you can redistribute it and/or modify
-it under the terms of the GNU General Public License as published by
-the Free Software Foundation; either version 2 of the License, or (at
-your option) any later version.
-</p>
-<a name="news"></a><h2>Recent News</h2>
-<dl>
-<dt>Sat Oct 24 21:09:18 CEST 2009 | libextractor binding for Mono updated.</dt>
-<dd>You can find the updated binding for Mono in the download section.</dd>
-<dt>Sat Jul  4 11:45:08 CET 2009 | libextractor v0.5.23 released.</dt>
-<dd>This release makes the RPM extractor work with the latest librpm
-library and links against an external version of libexiv2 (instead of
-using an internal, outdated version of the code).</dd>
-<dt>Fri Feb 20 11:24:50 MST 2009 | libextractor v0.5.22 released.</dt>
-<dd>This release fixes various minor bugs in various plugins and the
-build system. We now use libtool 2.x which helps fix some issues with
-multiple threads loading and unloading certain plugins concurrently.</dd>
-</dl>
-<p>
-<a href="oldnews.html">Older news archive</a>
-</p>
-<a name="links"></a><h2>Links</h2>
-<p>
-Related work:
-<ul>
-<li><a href="http://www.wotsit.org/";>File format database</a></li>
-<li><a href="http://getid3.sf.net/";>getid3, similar project for PHP</a></li>
-<li><a 
href="http://blog.thinkphp.de/archives/12-My-first-PHP-Extension..html";>PHP 
wrapper for libextractor</a></li>
-<li><a href="http://dublincore.org/documents/dcmi-terms/";>Meta-data 
categorization standard</a></li>
-<li><a href="http://hul.harvard.edu/jhove/";>JHOVE, Harvard Object Validation 
Environment</a></li>
-<li><a href="http://hachoir.org/";>Hachoir binary file parser</a></li>
-<li><a href="http://meta-extractor.sourceforge.net/";>Metadata Extraction Tool 
developed by the National Libary of New Zealand</a></li>
-</ul>
-Articles related to libextractor:
-<ul>
-<li><a href="http://www.linuxjournal.com/article/7552";>Reading File Metadata 
with extract and libextractor</a></li>
-<li><a href="http://servers.linux.com/servers/06/08/21/1558230.shtml";>How to 
recover lost files after you accidentally wipe your hard drive</a></li>
-<li><a 
href="http://www.gnucitizen.org/blog/all-your-metadata-are-belong-to-us";>All 
your Metadata are belong to Us</a></li>
-</ul>
-Projects that use libextractor:
-<ul>
-<li><a href="http://mediatomb.cc/";>MediaTomb, UPnP AV Mediaserver</a></li>
-<li><a href="http://witme.sourceforge.net/libferris.web/";>libferris, a virtual 
file system</a></li>
-<li><a href="http://evidence.sf.net/";>Evidence, enlightened file 
manager</a></li>
-<li><a href="http://fossology.org/";>FOSSology</a></li>
-<li><a href="http://gnunet.org/";>GNUnet, secure P2P file sharing</a></li>
-<li><a href="http://gnunet.org/doodle/";>doodle, index your disk</a></li>
-<li><a href="http://www.tracker-project.org/";>File indexer, uses embedded 
MySQL database (doodle uses home-grown suffix tree)</a></li>
-<li><a href="http://lobotomy-project.org/";>Lobotomy, experimental desktop 
environment</a></li>
-<li><a href="http://www.edge-security.com/metagoofil.php";>Metagoofil, Metadata 
analyzer for information gathering</a></li>
-<li><a href="http://launchpad.net/basenji";>Portable volume indexer</a></li>
-</ul>
-</p>
-
-
-<a name="contact"></a><h2>Contact</h2>
-<p>
-GNU libextractor is developed by <a 
href="http://grothoff.org/christian/";>Christian Grothoff</a> and <a 
href="http://compilers.cs.ucla.edu/~vids/";>Vids Samanta</a>.
-For questions about libextractor send email to <a 
href="mailto:address@hidden";>address@hidden</a>.
-</p>
-
-<p>
-Please send general FSF &amp; GNU inquiries to
-<a href="mailto:address@hidden";>&lt;address@hidden&gt;</a>.
-There are also <a href="/contact/">other ways to contact</a>
-the FSF.<br />
-Please send broken links and other corrections or suggestions to
-<a href="mailto:address@hidden";>&lt;address@hidden&gt;</a>.</p>
-
-<p>Please see the <a 
href="/server/standards/README.translations.html">Translations
-README</a> for information on coordinating and submitting translations
-of this article.</p>
-
-<p>Copyright &copy; 2009 Free Software Foundation, Inc.</p>
-
-<p>Verbatim copying and distribution of this entire article are
-permitted worldwide, without royalty, in any medium, provided this
-notice, and the copyright notice, are preserved.</p>
-
-</td>
-</tr>
-</tbody>
-</table>
-<hr>
-<a href="mailto:address@hidden";>address@hidden</a>
-
-
-<div id="translations">
-<h4>Translations of this page</h4>
-
-<!-- Please keep this list alphabetical by language code.
-     Comment what the language is for each type, i.e. de is German.
-     Write the language name in its own language (Deutsch) in the text.
-     If you add a new language here, please
-     advise address@hidden and add it to
-      - /home/www/html/server/standards/README.translations.html
-      - one of the lists under the section "Translations Underway"
-      - if there is a translation team, you also have to add an alias
-      to mail.gnu.org:/com/mailer/aliases
-     Please also check you have the language code right; see:
-     http://www.loc.gov/standards/iso639-2/php/code_list.php
-     If the 2-letter ISO 639-1 code is not available,
-     use the 3-letter ISO 639-2.
-     Please use W3C normative character entities. -->
-
-<ul class="translations-list">
-<!-- English -->
-<li><a 
href="/software/libextractor/libextractor.html">English</a>&nbsp;[en]</li>
-</ul>
-</div>
-
-</body>
-</html>





reply via email to

[Prev in Thread] Current Thread [Next in Thread]