Skip to content

Latest commit

 

History

History
34 lines (26 loc) · 1.36 KB

README.md

File metadata and controls

34 lines (26 loc) · 1.36 KB

Web Fetcher

This is a web scraper app built with Python and Flask. It scrapes data from a website using Selenium and displays the results on a web page.

Installation

To run this app, you need to have Docker installed on your system. Clone this repository and run the following command:

docker build -t web-fetcher .

This will build a Docker image for the app. Once the image is built, run the following command to start the app:

 docker run -p 5000:5000 --dns 8.8.8.8 --name web-scraper  webfetcher:latest

This will start the app on port 5000. You can then access the app by navigating to http://localhost:5000 in your web browser.

Alternatively, is possible to use it locally by running install_chromedriver.sh which will download locally a chromedriver in the root project folder. Dependencies may be installed by running pip install -r requirements.txt It's not recommended as Chrome version installed on the system may not be compatible with the version used by the driver

Usage

Once the app is running, you can enter the URL of the website you want to scrape in the url field in the request

curl --location --request POST 'localhost:5000/retrieve' \
--header 'Content-Type: application/json' \
--data-raw '{
    "url" : "https://www.google.com"
}

License

This work is distributed under MIT license.