-
-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Slash (/) missing in warcit output #26
Comments
Hey @DHKaplan, you need to enter the exact URL prefix you want when running warcit. For instance warcit http://www.wticalumni.com/ my-local-folder The prefix could be anything, for instance something like: warcit 'http://mydomain.com/query?q=' my-local-folder This flexibility of the tool makes it necessary that you give the exact URL prefix. |
@despens The folder that contains my html is www.wticalumni.com I get no pages found. When I edit the gz file with an ASCII editor I get:
Note the the Source-URI line is I really appreciate your reply, but I can't see what I am doing wrong. |
Hi @DHKaplan, you just need to use the desired
|
I needed a small warc file for testing, so I took a regular wget download and picked a few files that interconnected and used warcit to create the warc file. When I looked at it in Replayweb.page there were no pages visible. I edited the warc file in an ASCII editor and found that the "/" was not being inserted after the domain name. Please see https://forum.webrecorder.net/t/warcit-not-putting-a-before-the-file-name/413 for more information.
The text was updated successfully, but these errors were encountered: