Skip to content

Latest commit

 

History

History

Dataset (Section 3): We are making the News-articles-dataset public. (Note: PM25 data is publicly available in CPCB and OpenAQ)

File/Folder Decription
Figure-1_Number-of-cities-for-pm25-data-available.ipynb To plot number of cities along with time for which PM25 data is available.
Figure-2_TOI-vs-Hindu-articles_over_time.ipynb To plot articles of TOI and The Hindu over time (2010-21).
TOI_Articles_Scrapper.ipynb to scrape articles from TOI archives, articles will be saved in ArticlesData/TOI
Hindu_Articles_Scrapper.ipynb to scrape articles from The Hindu archives, articles will be saved in ArticlesData/Hindu
News_articles_dataet contains well organized 17.4K news-articles of TOI and The Hindu (2010-21)
airpollution_keywords.csv Queries used to scrape AQ articles