This project automates the process of downloading, summarizing, and converting daily papers from Hugging Face into easily readable formats.
- Download daily papers from Hugging Face API
- Extract abstracts and generate markdown summaries
- Handle empty files, which can occur on weekends and holidays
- Avoid reprocessing existing files
hf_daily_papers/
│
├── data/
│   ├── input/     # Downloaded JSON files
│   └── output/    # Generated markdown files
│
├── src/
│   ├── download_daily_papers.py
│   └── daily_papers_abstract_extractor.py
│
└── README.md
- Clone this repository:
  git clone https://github.com/elsatch/daily_hf_papers_abstracts.git hf_daily_papers
  cd hf_daily_papers
- Install the required dependencies:
  pip install requests
- Download daily papers:
  python src/download_daily_papers.py [YYYYMMDD]
  If no date is provided, the script downloads papers for the current date.
- Process JSON files and generate markdown summaries:
  python src/daily_papers_abstract_extractor.py
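
The two steps above can be sketched roughly as follows. This is an illustrative sketch, not the code in `src/`: the function names `resolve_date` and `papers_to_markdown` are hypothetical, and the entry layout (`{"paper": {"title": ..., "summary": ...}}`) is an assumption about the downloaded JSON that should be checked against a real file in `data/input/`.

```python
from datetime import date, datetime


def resolve_date(argv, today=None):
    """Return the date given as an optional YYYYMMDD argument, else today.

    `argv` is expected to look like sys.argv: program name first.
    """
    if len(argv) > 1:
        # strptime raises ValueError if the text does not match the format,
        # for example "2024-01-05".
        return datetime.strptime(argv[1], "%Y%m%d").date()
    return today or date.today()


def papers_to_markdown(entries, day):
    """Render one markdown document from a list of paper entries.

    Assumed (unverified) entry shape:
    {"paper": {"title": "...", "summary": "..."}}
    """
    lines = [f"# Daily papers for {day:%Y-%m-%d}", ""]
    for entry in entries:
        paper = entry.get("paper", {})
        # Collapse line breaks and repeated spaces inside titles and abstracts.
        title = " ".join(paper.get("title", "Untitled").split())
        abstract = " ".join(paper.get("summary", "").split())
        lines += [f"## {title}", "", abstract, ""]
    return "\n".join(lines)
```

Calling `resolve_date(sys.argv)` in a script gives the same behavior as the optional `[YYYYMMDD]` argument described above, and the string returned by `papers_to_markdown` can be written to a file under `data/output/`.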
- The scripts handle empty files that may occur during weekends or holidays.
- Existing processed files are not overwritten to avoid unnecessary reprocessing.
- You can run these scripts daily to keep up with the latest papers.
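
The first two notes can be illustrated with a minimal sketch of how empty inputs and already-processed files might be skipped. The helper names `load_entries` and `pending_files` are hypothetical, and the naming scheme (`<name>.json` in the input directory mapping to `<name>.md` in the output directory) is an assumption, not something the scripts are confirmed to use.

```python
import json
from pathlib import Path


def load_entries(json_path):
    """Return the list of entries in json_path, or [] for an empty file."""
    text = Path(json_path).read_text(encoding="utf-8").strip()
    if not text:
        return []  # zero-byte or whitespace-only download
    data = json.loads(text)
    # Anything other than a JSON list is treated as "no papers" here.
    return data if isinstance(data, list) else []


def pending_files(input_dir, output_dir):
    """Yield (json_path, md_path) pairs that still need processing."""
    for json_path in sorted(Path(input_dir).glob("*.json")):
        md_path = Path(output_dir) / (json_path.stem + ".md")
        if md_path.exists():
            continue  # already processed; never overwrite
        if not load_entries(json_path):
            continue  # weekend or holiday: nothing was published
        yield json_path, md_path
```

Because both checks happen before any output is written, running the sketch repeatedly over the same directories only touches files that are new since the last run.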
Contributions are welcome! Please feel free to submit a Pull Request.
This project is open source and available under the MIT License.