Scrape with Screenshot

A web scraper that uses AI to extract structured data from any section of a website you screenshot. Capture part of a webpage that contains repeated information (such as product listings, search results, or tables), and the AI will identify the repeated pattern and extract the relevant data.

Features

  • AI-powered data extraction from webpage screenshots
  • Automatic pattern recognition for similar data elements
  • Structured JSON output
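
To illustrate what "structured JSON output" for repeated elements could look like, here is a minimal sketch. It is not taken from this repository: the function name `to_structured_json` and the fields `title` and `price` are hypothetical, and the real output schema depends on what the AI finds in your screenshot.

```python
import json


def to_structured_json(records):
    """Serialize extracted items so that every item carries the same keys.

    Hypothetical helper for illustration only; not part of this project.
    """
    # Collect the union of keys in first-seen order.
    keys = []
    for record in records:
        for key in record:
            if key not in keys:
                keys.append(key)
    # Fill fields that are missing from an item with None (null in JSON).
    normalized = [{key: record.get(key) for key in keys} for record in records]
    return json.dumps(normalized, indent=2, ensure_ascii=False)


# Made-up items standing in for rows extracted from a product listing.
items = [
    {"title": "Example item A", "price": "9.99"},
    {"title": "Example item B"},
]
print(to_structured_json(items))
```

Giving every item the same keys keeps the output easy to load into a table or dataframe later.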

Usage

Installation

  1. Clone this repository, copy .env.example to .env, and fill in the required API keys in the .env file:
    git clone https://github.com/TheRayFitzgerald/scrape-with-screenshot.git
    cd scrape-with-screenshot
    cp .env.example .env
  2. Set your system's screenshot destination to the ./screenshots/ directory in the root of the project (guides: Mac | Windows).
  3. Install the dependencies and run the script:
    python3 -m venv venv
    source venv/bin/activate
    pip install -r requirements.txt
    python3 main.py
  4. A browser window will open. Take a screenshot of the section of the website you want to scrape. The script will then scrape the website and write the results to a JSON file. 🎉
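
The steps above rely on the script finding your screenshot in ./screenshots/. How main.py actually does this is not shown here, so the following is only a sketch under assumptions: the function `newest_screenshot`, the set of image suffixes, and the idea of picking the most recently modified file are all hypothetical.

```python
from pathlib import Path

# Assumed image extensions; your OS may save screenshots in another format.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def newest_screenshot(directory, seen=frozenset()):
    """Return the most recently modified image in `directory`, or None.

    File names listed in `seen` are skipped, so a caller can ignore
    screenshots it has already processed. Hypothetical helper, not
    part of this project.
    """
    folder = Path(directory)
    if not folder.is_dir():
        return None
    candidates = [
        path
        for path in folder.iterdir()
        if path.is_file()
        and path.suffix.lower() in IMAGE_SUFFIXES
        and path.name not in seen
    ]
    if not candidates:
        return None
    # Choose by modification time rather than by file name.
    return max(candidates, key=lambda path: path.stat().st_mtime)


if __name__ == "__main__":
    print(newest_screenshot("./screenshots"))
```

If this prints `None` after you have taken a screenshot, the screenshot destination from step 2 is probably not pointing at ./screenshots/.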
