[GNUnet-SVN] r9949 - Extractor-docs/WWW
From: gnunet
Subject: [GNUnet-SVN] r9949 - Extractor-docs/WWW
Date: Fri, 1 Jan 2010 14:24:36 +0100
Author: grothoff
Date: 2010-01-01 14:24:36 +0100 (Fri, 01 Jan 2010)
New Revision: 9949
Added:
Extractor-docs/WWW/index.html
Removed:
Extractor-docs/WWW/libextractor.html
Log:
rename
Copied: Extractor-docs/WWW/index.html (from rev 9944, Extractor-docs/WWW/libextractor.html)
===================================================================
--- Extractor-docs/WWW/index.html (rev 0)
+++ Extractor-docs/WWW/index.html 2010-01-01 13:24:36 UTC (rev 9949)
@@ -0,0 +1,205 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html><head>
+<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+<title>GNU libextractor - GNU Project - Free Software Foundation</title>
+<meta name="content-language" content="en"><meta name="language"
content="en"><meta name="description" content="a simple library for keyword
extraction"><meta name="author" content="Vids Samanta and Christian Grothoff">
+<meta name="rights" content="(C) 2002,2003,2004,2005,2006,2007,2009 by Vids
Samanta and Christian Grothoff">
+<meta name="keywords" content="keyword, extraction, mp3, html, pdf, images,
jpeg, gif, ps, mime, real, qt, asf, mpeg, avi, riff, tiff, summary, summaries,
kbps, format, mime-type, zip, elf, doc, ppt, xls, sha-1, md5, open office, sxw,
dvi, id3, id3v2, id3v2.3, id3v2.4, thumbnails, exiv2, nsf, sid, flv, flac">
+<meta name="robots" content="index,follow">
+<meta name="revisit-after" content="28 days">
+<meta name="content-language" content="en">
+<meta name="language" content="en">
+<meta http-equiv="expires" content="43200">
+<meta http-equiv="content-type" content="text/html; charset=UTF-8">
+<link rel="SHORTCUT ICON" href="favicon.ico">
+</head>
+<body>
+<table width="99%" border="0" cellpadding="0" cellspacing="0">
+<tbody>
+<tr><td colspan="2" width="99%" bgcolor="#99bbff" align="center">GNU
libextractor - a simple library for keyword extraction</td></tr>
+<tr><td valign="top"><table width="15%" border="0" cellpadding="2"
cellspacing="3">
+<tbody>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="http://www.gnu.org/software/libextractor/">Home</a></th></tr>
+<tr><td bgcolor="efefef"><a href="#about">About</a></td></tr>
+<tr><td bgcolor="efefef"><a href="#news">Recent News</a></td></tr>
+<tr><td bgcolor="efefef"><a href="#contact">Contact</a></td></tr>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="download.html">Download</a></th></tr>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="documentation.html">Documentation</a></th></tr>
+<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="http://freshmeat.net/projects/libextractor/">Freshmeat Page</a></th></tr>
+</tbody>
+</table>
+</td>
+<td valign="top"><a name="about"></a>
+<h2>GNU libextractor</h2>
+<img src="extractor_logo.png" alt="libextractor" vspace="0" width="136"
border="0" height="94" hspace="0" align="right">
+<p>
+GNU libextractor is a library used to extract meta-data from files of
+arbitrary type.
+It is designed to use helper-libraries to perform the actual
+extraction, and to be trivially extendable by linking against external
+extractors for additional file types.
+libextractor is a <a href="http://www.gnu.org/">GNU</a> package.
+Our official GNU website can be found at <a
href="http://www.gnu.org/software/libextractor/">http://www.gnu.org/software/libextractor/</a>.
+libextractor can be downloaded from this site or the <a
href="http://www.gnu.org/prep/ftp.html">GNU mirrors</a>.
+</p>
+<p>
+The goal is to provide developers of file-sharing networks or
+WWW-indexing bots with a universal library to obtain simple keywords to
+match against queries.
+libextractor contains a shell-command <tt>extract</tt> that, similar to the
+well-known <tt>file</tt> command, can extract meta-data from a file and print
+the results to stdout.
+</p>
+<p>
+Currently, libextractor supports the following formats:
+HTML,
+PDF,
+PS,
+OLE2 (DOC, XLS, PPT),
+OpenOffice (sxw),
+StarOffice (sdw),
+DVI,
+MAN,
+FLAC,
+MP3 (ID3v1 and ID3v2),
+NSF(E) (NES music),
+SID (C64 music),
+OGG,
+WAV,
+EXIV2,
+JPEG,
+GIF,
+PNG,
+TIFF,
+DEB,
+RPM,
+TAR(.GZ),
+ZIP,
+ELF,
+S3M (Scream Tracker 3),
+XM (eXtended Module),
+IT (Impulse Tracker),
+FLV,
+REAL,
+RIFF (AVI),
+MPEG,
+QT
+and
+ASF.
+<br>
+Also, various additional MIME types are detected.
+</p>
+<p>
+libextractor is free software; you can redistribute it and/or modify
+it under the terms of the GNU General Public License as published by
+the Free Software Foundation; either version 2 of the License, or (at
+your option) any later version.
+</p>
+<a name="news"></a><h2>Recent News</h2>
+<dl>
+<dt>Sat Oct 24 21:09:18 CEST 2009 | libextractor binding for Mono updated.</dt>
+<dd>You can find the updated binding for Mono in the download section.</dd>
+<dt>Sat Jul 4 11:45:08 CET 2009 | libextractor v0.5.23 released.</dt>
+<dd>This release makes the RPM extractor work with the latest librpm
+library and links against an external version of libexiv2 (instead of
+using an internal, outdated version of the code).</dd>
+<dt>Fri Feb 20 11:24:50 MST 2009 | libextractor v0.5.22 released.</dt>
+<dd>This release fixes various minor bugs in various plugins and the
+build system. We now use libtool 2.x which helps fix some issues with
+multiple threads loading and unloading certain plugins concurrently.</dd>
+</dl>
+<p>
+<a href="oldnews.html">Older news archive</a>
+</p>
+<a name="links"></a><h2>Links</h2>
+<p>
+Related work:
+<ul>
+<li><a href="http://www.wotsit.org/">File format database</a></li>
+<li><a href="http://getid3.sf.net/">getid3, similar project for PHP</a></li>
+<li><a
href="http://blog.thinkphp.de/archives/12-My-first-PHP-Extension..html">PHP
wrapper for libextractor</a></li>
+<li><a href="http://dublincore.org/documents/dcmi-terms/">Meta-data
categorization standard</a></li>
+<li><a href="http://hul.harvard.edu/jhove/">JHOVE, Harvard Object Validation
Environment</a></li>
+<li><a href="http://hachoir.org/">Hachoir binary file parser</a></li>
+<li><a href="http://meta-extractor.sourceforge.net/">Metadata Extraction Tool
developed by the National Library of New Zealand</a></li>
+</ul>
+Articles related to libextractor:
+<ul>
+<li><a href="http://www.linuxjournal.com/article/7552">Reading File Metadata
with extract and libextractor</a></li>
+<li><a href="http://servers.linux.com/servers/06/08/21/1558230.shtml">How to
recover lost files after you accidentally wipe your hard drive</a></li>
+<li><a
href="http://www.gnucitizen.org/blog/all-your-metadata-are-belong-to-us">All
your Metadata are belong to Us</a></li>
+</ul>
+Projects that use libextractor:
+<ul>
+<li><a href="http://mediatomb.cc/">MediaTomb, UPnP AV Mediaserver</a></li>
+<li><a href="http://witme.sourceforge.net/libferris.web/">libferris, a virtual
file system</a></li>
+<li><a href="http://evidence.sf.net/">Evidence, enlightened file
manager</a></li>
+<li><a href="http://fossology.org/">FOSSology</a></li>
+<li><a href="http://gnunet.org/">GNUnet, secure P2P file sharing</a></li>
+<li><a href="http://gnunet.org/doodle/">doodle, index your disk</a></li>
+<li><a href="http://www.tracker-project.org/">File indexer, uses embedded
MySQL database (doodle uses home-grown suffix tree)</a></li>
+<li><a href="http://lobotomy-project.org/">Lobotomy, experimental desktop
environment</a></li>
+<li><a href="http://www.edge-security.com/metagoofil.php">Metagoofil, Metadata
analyzer for information gathering</a></li>
+<li><a href="http://launchpad.net/basenji">Portable volume indexer</a></li>
+</ul>
+</p>
+
+
+<a name="contact"></a><h2>Contact</h2>
+<p>
+GNU libextractor is developed by <a
href="http://grothoff.org/christian/">Christian Grothoff</a> and <a
href="http://compilers.cs.ucla.edu/~vids/">Vids Samanta</a>.
+For questions about libextractor send email to <a
href="mailto:address@hidden">address@hidden</a>.
+</p>
+
+<p>
+Please send general FSF &amp; GNU inquiries to
+<a href="mailto:address@hidden">&lt;address@hidden&gt;</a>.
+There are also <a href="/contact/">other ways to contact</a>
+the FSF.<br />
+Please send broken links and other corrections or suggestions to
+<a href="mailto:address@hidden">&lt;address@hidden&gt;</a>.</p>
+
+<p>Please see the <a
href="/server/standards/README.translations.html">Translations
+README</a> for information on coordinating and submitting translations
+of this article.</p>
+
+<p>Copyright © 2009 Free Software Foundation, Inc.</p>
+
+<p>Verbatim copying and distribution of this entire article are
+permitted worldwide, without royalty, in any medium, provided this
+notice, and the copyright notice, are preserved.</p>
+
+</td>
+</tr>
+</tbody>
+</table>
+<hr>
+<a href="mailto:address@hidden">address@hidden</a>
+
+
+<div id="translations">
+<h4>Translations of this page</h4>
+
+<!-- Please keep this list alphabetical by language code.
+ Comment what the language is for each type, i.e. de is German.
+ Write the language name in its own language (Deutsch) in the text.
+ If you add a new language here, please
+ advise address@hidden and add it to
+ - /home/www/html/server/standards/README.translations.html
+ - one of the lists under the section "Translations Underway"
+ - if there is a translation team, you also have to add an alias
+ to mail.gnu.org:/com/mailer/aliases
+ Please also check you have the language code right; see:
+ http://www.loc.gov/standards/iso639-2/php/code_list.php
+ If the 2-letter ISO 639-1 code is not available,
+ use the 3-letter ISO 639-2.
+ Please use W3C normative character entities. -->
+
+<ul class="translations-list">
+<!-- English -->
+<li><a
href="/software/libextractor/libextractor.html">English</a> [en]</li>
+</ul>
+</div>
+
+</body>
+</html>
Deleted: Extractor-docs/WWW/libextractor.html
===================================================================
--- Extractor-docs/WWW/libextractor.html 2010-01-01 00:31:20 UTC (rev 9948)
+++ Extractor-docs/WWW/libextractor.html 2010-01-01 13:24:36 UTC (rev 9949)
@@ -1,205 +0,0 @@
-<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
-<html><head>
-<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
-<title>GNU libextractor - GNU Project - Free Software Foundation</title>
-<meta name="content-language" content="en"><meta name="language"
content="en"><meta name="description" content="a simple library for keyword
extraction"><meta name="author" content="Vids Samanta and Christian Grothoff">
-<meta name="rights" content="(C) 2002,2003,2004,2005,2006,2007,2009 by Vids
Samanta and Christian Grothoff">
-<meta name="keywords" content="keyword, extraction, mp3, html, pdf, images,
jpeg, gif, ps, mime, real, qt, asf, mpeg, avi, riff, tiff, summary, summaries,
kbps, format, mime-type, zip, elf, doc, ppt, xls, sha-1, md5, open office, sxw,
dvi, id3, id3v2, id3v2.3, id3v2.4, thumbnails, exiv2, nsf, sid, flv, flac">
-<meta name="robots" content="index,follow">
-<meta name="revisit-after" content="28 days">
-<meta name="content-language" content="en">
-<meta name="language" content="en">
-<meta http-equiv="expires" content="43200">
-<meta http-equiv="content-type" content="text/html; charset=UTF-8">
-<link rel="SHORTCUT ICON" href="favicon.ico">
-</head>
-<body>
-<table width="99%" border="0" cellpadding="0" cellspacing="0">
-<tbody>
-<tr><td colspan="2" width="99%" bgcolor="#99bbff" align="center">GNU
libextractor - a simple library for keyword extraction</td></tr>
-<tr><td valign="top"><table width="15%" border="0" cellpadding="2"
cellspacing="3">
-<tbody>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="http://www.gnu.org/software/libextractor/">Home</a></th></tr>
-<tr><td bgcolor="efefef"><a href="#about">About</a></td></tr>
-<tr><td bgcolor="efefef"><a href="#news">Recent News</a></td></tr>
-<tr><td bgcolor="efefef"><a href="#contact">Contact</a></td></tr>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="download.html">Download</a></th></tr>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="documentation.html">Documentation</a></th></tr>
-<tr><th nowrap="nowrap" bgcolor="99BBFF"><a
href="http://freshmeat.net/projects/libextractor/">Freshmeat Page</a></th></tr>
-</tbody>
-</table>
-</td>
-<td valign="top"><a name="about"></a>
-<h2>GNU libextractor</h2>
-<img src="extractor_logo.png" alt="libextractor" vspace="0" width="136"
border="0" height="94" hspace="0" align="right">
-<p>
-GNU libextractor is a library used to extract meta-data from files of
-arbitrary type.
-It is designed to use helper-libraries to perform the actual
-extraction, and to be trivially extendable by linking against external
-extractors for additional file types.
-libextractor is a <a href="http://www.gnu.org/">GNU</a> package.
-Our official GNU website can be found at <a
href="http://www.gnu.org/software/libextractor/">http://www.gnu.org/software/libextractor/</a>.
-libextractor can be downloaded from this site or the <a
href="http://www.gnu.org/prep/ftp.html">GNU mirrors</a>.
-</p>
-<p>
-The goal is to provide developers of file-sharing networks or
-WWW-indexing bots with a universal library to obtain simple keywords to
-match against queries.
-libextractor contains a shell-command <tt>extract</tt> that, similar to the
-well-known <tt>file</tt> command, can extract meta-data from a file and print
-the results to stdout.
-</p>
-<p>
-Currently, libextractor supports the following formats:
-HTML,
-PDF,
-PS,
-OLE2 (DOC, XLS, PPT),
-OpenOffice (sxw),
-StarOffice (sdw),
-DVI,
-MAN,
-FLAC,
-MP3 (ID3v1 and ID3v2),
-NSF(E) (NES music),
-SID (C64 music),
-OGG,
-WAV,
-EXIV2,
-JPEG,
-GIF,
-PNG,
-TIFF,
-DEB,
-RPM,
-TAR(.GZ),
-ZIP,
-ELF,
-S3M (Scream Tracker 3),
-XM (eXtended Module),
-IT (Impulse Tracker),
-FLV,
-REAL,
-RIFF (AVI),
-MPEG,
-QT
-and
-ASF.
-<br>
-Also, various additional MIME types are detected.
-</p>
-<p>
-libextractor is free software; you can redistribute it and/or modify
-it under the terms of the GNU General Public License as published by
-the Free Software Foundation; either version 2 of the License, or (at
-your option) any later version.
-</p>
-<a name="news"></a><h2>Recent News</h2>
-<dl>
-<dt>Sat Oct 24 21:09:18 CEST 2009 | libextractor binding for Mono updated.</dt>
-<dd>You can find the updated binding for Mono in the download section.</dd>
-<dt>Sat Jul 4 11:45:08 CET 2009 | libextractor v0.5.23 released.</dt>
-<dd>This release makes the RPM extractor work with the latest librpm
-library and links against an external version of libexiv2 (instead of
-using an internal, outdated version of the code).</dd>
-<dt>Fri Feb 20 11:24:50 MST 2009 | libextractor v0.5.22 released.</dt>
-<dd>This release fixes various minor bugs in various plugins and the
-build system. We now use libtool 2.x which helps fix some issues with
-multiple threads loading and unloading certain plugins concurrently.</dd>
-</dl>
-<p>
-<a href="oldnews.html">Older news archive</a>
-</p>
-<a name="links"></a><h2>Links</h2>
-<p>
-Related work:
-<ul>
-<li><a href="http://www.wotsit.org/">File format database</a></li>
-<li><a href="http://getid3.sf.net/">getid3, similar project for PHP</a></li>
-<li><a
href="http://blog.thinkphp.de/archives/12-My-first-PHP-Extension..html">PHP
wrapper for libextractor</a></li>
-<li><a href="http://dublincore.org/documents/dcmi-terms/">Meta-data
categorization standard</a></li>
-<li><a href="http://hul.harvard.edu/jhove/">JHOVE, Harvard Object Validation
Environment</a></li>
-<li><a href="http://hachoir.org/">Hachoir binary file parser</a></li>
-<li><a href="http://meta-extractor.sourceforge.net/">Metadata Extraction Tool
developed by the National Library of New Zealand</a></li>
-</ul>
-Articles related to libextractor:
-<ul>
-<li><a href="http://www.linuxjournal.com/article/7552">Reading File Metadata
with extract and libextractor</a></li>
-<li><a href="http://servers.linux.com/servers/06/08/21/1558230.shtml">How to
recover lost files after you accidentally wipe your hard drive</a></li>
-<li><a
href="http://www.gnucitizen.org/blog/all-your-metadata-are-belong-to-us">All
your Metadata are belong to Us</a></li>
-</ul>
-Projects that use libextractor:
-<ul>
-<li><a href="http://mediatomb.cc/">MediaTomb, UPnP AV Mediaserver</a></li>
-<li><a href="http://witme.sourceforge.net/libferris.web/">libferris, a virtual
file system</a></li>
-<li><a href="http://evidence.sf.net/">Evidence, enlightened file
manager</a></li>
-<li><a href="http://fossology.org/">FOSSology</a></li>
-<li><a href="http://gnunet.org/">GNUnet, secure P2P file sharing</a></li>
-<li><a href="http://gnunet.org/doodle/">doodle, index your disk</a></li>
-<li><a href="http://www.tracker-project.org/">File indexer, uses embedded
MySQL database (doodle uses home-grown suffix tree)</a></li>
-<li><a href="http://lobotomy-project.org/">Lobotomy, experimental desktop
environment</a></li>
-<li><a href="http://www.edge-security.com/metagoofil.php">Metagoofil, Metadata
analyzer for information gathering</a></li>
-<li><a href="http://launchpad.net/basenji">Portable volume indexer</a></li>
-</ul>
-</p>
-
-
-<a name="contact"></a><h2>Contact</h2>
-<p>
-GNU libextractor is developed by <a
href="http://grothoff.org/christian/">Christian Grothoff</a> and <a
href="http://compilers.cs.ucla.edu/~vids/">Vids Samanta</a>.
-For questions about libextractor send email to <a
href="mailto:address@hidden">address@hidden</a>.
-</p>
-
-<p>
-Please send general FSF &amp; GNU inquiries to
-<a href="mailto:address@hidden">&lt;address@hidden&gt;</a>.
-There are also <a href="/contact/">other ways to contact</a>
-the FSF.<br />
-Please send broken links and other corrections or suggestions to
-<a href="mailto:address@hidden">&lt;address@hidden&gt;</a>.</p>
-
-<p>Please see the <a
href="/server/standards/README.translations.html">Translations
-README</a> for information on coordinating and submitting translations
-of this article.</p>
-
-<p>Copyright © 2009 Free Software Foundation, Inc.</p>
-
-<p>Verbatim copying and distribution of this entire article are
-permitted worldwide, without royalty, in any medium, provided this
-notice, and the copyright notice, are preserved.</p>
-
-</td>
-</tr>
-</tbody>
-</table>
-<hr>
-<a href="mailto:address@hidden">address@hidden</a>
-
-
-<div id="translations">
-<h4>Translations of this page</h4>
-
-<!-- Please keep this list alphabetical by language code.
- Comment what the language is for each type, i.e. de is German.
- Write the language name in its own language (Deutsch) in the text.
- If you add a new language here, please
- advise address@hidden and add it to
- - /home/www/html/server/standards/README.translations.html
- - one of the lists under the section "Translations Underway"
- - if there is a translation team, you also have to add an alias
- to mail.gnu.org:/com/mailer/aliases
- Please also check you have the language code right; see:
- http://www.loc.gov/standards/iso639-2/php/code_list.php
- If the 2-letter ISO 639-1 code is not available,
- use the 3-letter ISO 639-2.
- Please use W3C normative character entities. -->
-
-<ul class="translations-list">
-<!-- English -->
-<li><a
href="/software/libextractor/libextractor.html">English</a> [en]</li>
-</ul>
-</div>
-
-</body>
-</html>