Skip to content

Commit

Permalink
Fix starcoder2 missing bias in nemo2 config (#11809)
Browse files Browse the repository at this point in the history
  • Loading branch information
meatybobby authored Jan 14, 2025
1 parent c1e46ea commit dc08edd
Showing 1 changed file with 3 additions and 0 deletions.
3 changes: 3 additions & 0 deletions nemo/export/trt_llm/nemo_ckpt_loader/nemo_file.py
Original file line number Diff line number Diff line change
Expand Up @@ -553,6 +553,9 @@ def load_nemo_model(nemo_ckpt: Union[str, Path], nemo_export_dir: Union[str, Pat
elif nemo_model_config["activation"] == "squared_relu":
nemo_model_config["activation"] = "squared-relu"

if nemo_model_config.get("add_bias_linear"):
nemo_model_config["bias"] = True

nemo_model_config["mcore_gpt"] = True
nemo_model_config["max_position_embeddings"] = nemo_model_config.get("seq_length", 4096)
nemo_model_config["rotary_percentage"] = nemo_model_config.get("rotary_percent", 1.0)
Expand Down

0 comments on commit dc08edd

Please sign in to comment.