Skip to content

priya-mane/research-paper-parser

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Research paper parser

Parser Google Colaboratory Link

The script reads latex files for research paper from the given directory and extracts essential information from the latex format.

The script purges unwanted items like -

  • Images
  • Tables
  • Equations

The script returns a json object containing following items for each research paper -

  • Title
  • Author
  • Abstract
  • Introduction
  • Conclusions
  • Results
  • Acknowledgments

Run the script using the following command

python get_details.py -p <directory_containing_papers> -o <output_file_path>

Example :

python get_details.py -p ./papers -o op_json.json

Output

Developed with ❤️ by Priya Mane & Pratik Merchant.

About

Extracts essential data from research papers.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published