Parser Google Colaboratory Link
The script reads latex files for research paper from the given directory and extracts essential information from the latex format.
The script purges unwanted items like -
- Images
- Tables
- Equations
The script returns a json object containing following items for each research paper -
- Title
- Author
- Abstract
- Introduction
- Conclusions
- Results
- Acknowledgments
python get_details.py -p <directory_containing_papers> -o <output_file_path>
Example :
python get_details.py -p ./papers -o op_json.json