Skip to content

Dump scraped HTML from MongoDB to a bzip2 compressed tar file archive.

License

Notifications You must be signed in to change notification settings

opented/htmldump

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

htmldump

Dump scraped HTML from MongoDB to a bzip2-compressed tar file archive.

Requirements

  • Python 2.7
  • Dependencies from requirements.txt

Usage

htmldump.py -H opented.org -u USERNAME -p PASSWORD OUTPUT [DOC-RE]

See htmldump.py -h for details.

License

Copyright 2012 Joost Cassee / OpenTED

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.

About

Dump scraped HTML from MongoDB to a bzip2 compressed tar file archive.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages