We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
here I want to split warc file to small chunks and then use multiprocessing in python
multiprocessing
for text file, we can use seeks, but how to seek in warc module or .gz warc files ?? any advices ?
seeks
The text was updated successfully, but these errors were encountered:
You can open it as gzip file and perform seek, then from there you can pass the file pointer to WARCReader
Sorry, something went wrong.
No branches or pull requests
here I want to split warc file to small chunks and then use
multiprocessing
in pythonfor text file, we can use
seeks
, but how to seek in warc module or .gz warc files ??any advices ?
The text was updated successfully, but these errors were encountered: