Added other dependencies and clarification about HF models #11

randerzander · 2023-05-25T01:39:49Z

When trying to run the fine tuning example, I noticed this library needs some additional dependencies not mentioned in the README

…ix cases where non-standard llama model path names gets bypassed in tokenizer check. The tokenizer is init with use_fast=True and qlora requires >4.29.2 transformers so the only possible tokenizer is LlamaTokenizerFast.

Fixes a copy paste error where per_device_train_batch_size was set twice.

Check for LlamaTokenizerFast rather than infer type from path name.

Fix link to inference notebook

Set per_device_eval_batch_size in finetune.sh

artidoro

Thank you for contributing! A small suggestion but otherwise this would be very helpful

artidoro · 2023-05-28T03:34:01Z

README.md

@@ -37,11 +37,12 @@ pip install -q -U bitsandbytes
 pip install -q -U git+https://github.com/huggingface/transformers.git
 pip install -q -U git+https://github.com/huggingface/peft.git
 pip install -q -U git+https://github.com/huggingface/accelerate.git
+pip install -U datasets evaluate scipy nltk


I think these might change. It would be better to put these in a requirements.txt file. Also, note that we removed the nltk dependency.

Suppress pad_token warning message

randerzander and others added 3 commits May 24, 2023 21:38

Added other dependencies and clarification about HF models

7e11d07

Update README.md

073a485

Shevilll approved these changes May 26, 2023

View reviewed changes

pmysl and others added 6 commits May 27, 2023 10:32

Fix link to inference notebook

530382b

Update finetune.sh

3926ee5

Fixes a copy paste error where per_device_train_batch_size was set twice.

Adding guanaco openassistant dataset

a380962

Merge pull request artidoro#20 from Qubitium/check-llama

af55cf0

Check for LlamaTokenizerFast rather than infer type from path name.

Merge pull request artidoro#51 from pmysl/main

204dda8

Fix link to inference notebook

Merge pull request artidoro#58 from muelletm/patch-1

ce5e5be

Set per_device_eval_batch_size in finetune.sh

artidoro requested changes May 28, 2023

View reviewed changes

pmysl and others added 4 commits May 28, 2023 14:00

Suppress pad_token warning message

e31aedd

Merge pull request artidoro#63 from pmysl/main

f96eec1

Suppress pad_token warning message

Merge branch 'main' of github.com:randerzander/qlora

e591859

moved added requirements to requirements.txt

a1c807f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added other dependencies and clarification about HF models #11

Added other dependencies and clarification about HF models #11

randerzander commented May 25, 2023 •

edited

Loading

artidoro left a comment

artidoro May 28, 2023

Added other dependencies and clarification about HF models #11

Are you sure you want to change the base?

Added other dependencies and clarification about HF models #11

Conversation

randerzander commented May 25, 2023 • edited Loading

artidoro left a comment

Choose a reason for hiding this comment

artidoro May 28, 2023

Choose a reason for hiding this comment

randerzander commented May 25, 2023 •

edited

Loading