fix for NUTCH-1749 contributed by steeveb972
#285 opened Feb 12, 2018 by steeveb972 • Changes requested

fix for NUTCH-2455 more efficient usage of hostdb in generate
#254 opened Dec 8, 2017 by okedoki • Changes requested

NUTCH-2449: Replace Tika LanguageIdentifier in language-identifier
#233 opened Oct 24, 2017 by YossiTamari

NUTCH-2429 Fix Plugin System to allow protocol plugins to bundle their URLStreamHandlers
#222 opened Sep 22, 2017 by HiranChaudhuri • Changes requested

NUTCH-2202 Integration of Anthelion (Focused Crawling Module) into Nutch
#97 opened Mar 9, 2016 by lewismc

