https://github.com/DigitalPebble/behemoth
Tip revision: 4e33e29ae81aa6d06786295fdc9a9664060df4a3 authored by Julien Nioche on 25 April 2018, 10:57:53 UTC
WARC converter to allow custom metadata, fixes #63
WARC converter to allow custom metadata, fixes #63
Tip revision: 4e33e29
File | Mode | Size |
---|---|---|
core | ||
gate | ||
io | ||
language-id | ||
mahout | ||
solr | ||
tika | ||
uima | ||
.gitignore | -rw-r--r-- | 107 bytes |
.travis.yml | -rw-r--r-- | 16 bytes |
LICENSE.txt | -rw-r--r-- | 555 bytes |
README.md | -rw-r--r-- | 1.2 KB |
behemoth | -rwxr-xr-x | 2.0 KB |
behemoth-site.xml | -rw-r--r-- | 2.5 KB |
eclipse-format.xml | -rw-r--r-- | 31.2 KB |
hadoop-job.xml | -rw-r--r-- | 1.3 KB |
pom.xml | -rw-r--r-- | 3.6 KB |
script.sh | -rw-r--r-- | 4.9 KB |