Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

added scrape-meta for fullerton_pd #161

Open
wants to merge 3 commits into
base: dev
Choose a base branch
from

Conversation

naumansharifwork
Copy link
Contributor

@naumansharifwork naumansharifwork commented Oct 24, 2024

Added Scrape meta for fullerton pd

This is a laserfiche site, so basically it uses Post requests to get the response in Json format.

The code first downloads the Index Page by using folder Post Requests it gives the insights of the page and provide the ids of the other folders present within the page.
We iterate the folders one by one and if there are files we save them as assets in meta-file and if there is another folder we repeat the step recursively until a folder has only the files not the next folder.

@newsroomdev
Copy link
Member

@naumansharifwork could you update the pull request description and explain how this code handling laserfiche files? i am reading the code, but it would be good to make sure laserfiche is getting documented due to its complexity. adding inline comments are helpful, too.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

clean.platforms.laserfich.fetch_laserfiche
2 participants