webarchive-commons-1.1.10
ato
released this
15 Oct 08:46
·
41 commits
to master
since this release
Fixes
- WAT extractor: do not fail on missing WARC-Filename in warcinfo record
- ExtractingParseObserver: extract rel, hreflang and type attributes
- ExtractingParseObserver: extract links from onClick attributes
Dependency Upgrades
- commons-collections 3.2.2
- commons-io 2.14.0
- dsiutils 2.2.8
- guava 33.3.0-jre
- hadoop 3.4.0 (now optional)
- pig 0.17.0
- org.json 20231013
Dependency Removals
- joda-time (was unused)