L2-news-simplifier (Final course project for L2326 - Machine learning for statistical NLP: Advanced) - Anni Nieminen
This repository holds my code (two Jupyter notebooks), data, and report for the final course project of LT2326.
Note that due to memory issues, the fine-tuned models are not available in this repository. Also note that the data (D-wikipedia dataset) used for the fine-tuning in Notebook 1 is compressed in this repository. Both the models as well as extracted data (as well as all the files that are have been uploaded in this repository) can be found on the mltgpu at:
/home/[email protected]/L2-Swe-news-simplifier-L2326
This notebook holds the code for fine-tuning the base model (mT5-small) for the task. The resulting fine-tuned model is named:
final-task-fine-tuned-model-40k-traindata
and it is available at:
/home/[email protected]/L2-Swe-news-simplifier-L2326/final-task-fine-tuned-model-40k-traindata
This notebook holds the code for the second fine-tuning of the model. Meaning that this code fine-tunes the model that is saved in task-fine-tuning.ipynb. The resulting fine-tuned model is named:
final-language-fine-tuned-model
and it is available at:
/home/[email protected]/L2-Swe-news-simplifier-L2326/final-language-fine-tuned-model
Holds the 50 Swedish news article pairs used for the fine-tuning in notebook 2.
My project report discussing project background, methodolody, data, results, and limitations.