Dataset (Section 3): We are making the News-articles-dataset public. (Note: PM2.5 data is publicly available from CPCB and OpenAQ.)
File/Folder | Description |
---|---|
Figure-1_Number-of-cities-for-pm25-data-available.ipynb | Plots the number of cities for which PM2.5 data is available over time. |
Figure-2_TOI-vs-Hindu-articles_over_time.ipynb | Plots the number of TOI and The Hindu articles over time (2010-21). |
TOI_Articles_Scrapper.ipynb | Scrapes articles from the TOI archives; articles are saved in ArticlesData/TOI. |
Hindu_Articles_Scrapper.ipynb | Scrapes articles from The Hindu archives; articles are saved in ArticlesData/Hindu. |
News_articles_dataet | Contains 17.4K well-organized news articles from TOI and The Hindu (2010-21). |
airpollution_keywords.csv | Queries used to scrape air-quality (AQ) articles. |
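
As a rough illustration of how a keyword list such as airpollution_keywords.csv could be used to pick out air-quality articles, here is a minimal sketch. It is not the code from the scraper notebooks. It assumes one query per row in the first column, which may not match the real file layout, and the helper names `load_keywords` and `matches_any` as well as the sample keywords and headlines are made up for the example.

```python
import csv
import io


def load_keywords(csv_file):
    """Read keywords from the first column of a CSV, skipping blank rows."""
    reader = csv.reader(csv_file)
    return [row[0].strip().lower() for row in reader if row and row[0].strip()]


def matches_any(text, keywords):
    """Return True if any keyword occurs in the text (case-insensitive)."""
    lowered = text.lower()
    return any(kw in lowered for kw in keywords)


# Hypothetical stand-in for airpollution_keywords.csv; the real layout may differ.
sample_csv = io.StringIO("air pollution\nPM2.5\nsmog\n")
keywords = load_keywords(sample_csv)

# Made-up headlines, used only to exercise the filter.
articles = [
    "Delhi wakes up to thick smog as winter sets in",
    "City council approves new metro line",
]
relevant = [a for a in articles if matches_any(a, keywords)]
print(relevant)
```

To run this against the real file, replace `sample_csv` with `open("airpollution_keywords.csv", newline="")` and adjust the column index if the queries are not in the first column.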