- We used Mistral 7B (https://huggingface.co/mistralai/Mistral-7B-v0.1) as our base model
- 1x RTX 4090 GPU
- 4-bit QLoRA (see the loading sketch after this list)
- Weights & Biases for experiment tracking
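A minimal sketch of how such a setup can be loaded with `transformers`, `bitsandbytes`, and `peft`. The LoRA rank, alpha, dropout, and target modules below are illustrative assumptions, not the exact hyperparameters used for our submissions.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit NF4 quantization so the 7B model fits on a single RTX 4090
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1",
    quantization_config=bnb_config,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("mistralai/Mistral-7B-v0.1")

# LoRA adapter on top of the frozen 4-bit base model (values are assumptions)
lora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```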
We use three kinds of filters:
- We run the base model on the open-source training data and compute the ROUGE score between the model's output and the expected output.
- Using a cutoff threshold, we filter out the data points with a high ROUGE score, i.e. examples the base model already handles well (see the sketch after this list).
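A minimal sketch of this ROUGE filter, assuming the `rouge_score` package and ROUGE-L F1 as the metric; the cutoff value is an illustrative placeholder, not the threshold we actually used.

```python
from rouge_score import rouge_scorer

scorer = rouge_scorer.RougeScorer(["rougeL"], use_stemmer=True)
ROUGE_CUTOFF = 0.8  # assumed threshold, not the exact value used

def keep_example(base_output: str, expected_output: str) -> bool:
    """Keep an example only if the base model's output scores below the cutoff."""
    score = scorer.score(expected_output, base_output)["rougeL"].fmeasure
    return score < ROUGE_CUTOFF

def filter_dataset(examples, base_outputs):
    """examples: list of dicts with an 'output' field; base_outputs: base-model generations."""
    return [
        ex for ex, out in zip(examples, base_outputs)
        if keep_example(out, ex["output"])
    ]
```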
We used a Platypus-style (https://arxiv.org/abs/2308.07317) embedding filter on the same data as in the paper, but we discarded the LLM-generated data.
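A minimal sketch of such an embedding-similarity filter, assuming `sentence-transformers` with an off-the-shelf embedding model and a cosine-similarity cutoff; both the model name and the threshold are illustrative assumptions, not necessarily what the Platypus authors or we used.

```python
import torch
from sentence_transformers import SentenceTransformer, util

embedder = SentenceTransformer("all-MiniLM-L6-v2")  # assumed embedding model
SIM_THRESHOLD = 0.8  # assumed cosine-similarity cutoff

def dedup(instructions):
    """Drop any instruction that is too similar to one already kept."""
    kept, kept_embs = [], []
    embs = embedder.encode(
        instructions, convert_to_tensor=True, normalize_embeddings=True
    )
    for text, emb in zip(instructions, embs):
        if kept_embs and util.cos_sim(emb, torch.stack(kept_embs)).max() > SIM_THRESHOLD:
            continue  # near-duplicate of an example we already kept
        kept.append(text)
        kept_embs.append(emb)
    return kept

# Example: deduped = dedup([ex["instruction"] for ex in examples])
```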
We extracted random examples from some tasks using this filter. The following table depicts our exact settings for the different model versions.

### Submission 1