A collection of Apache Pig scripts and user-defined functions (UDFs) for web archive analysis tasks. The scripts are written in Pig Latin, rely on Apache Hadoop, and use the archive-metadata-extractor library.
rschmidt13/spades