Fix emb model export and load with trfrs #756

JingyaHuang · 2025-01-07T14:16:58Z

What does this PR do?

Fixes #744

With the PR, we should be once again able to export embedding model via transformers library or sentence transformer library depending on the class called:

With Transformers

import torch
from optimum.neuron import NeuronModelForFeatureExtraction
from transformers import AutoConfig, AutoTokenizer

compiler_args = {"auto_cast": "matmul", "auto_cast_type": "fp16"}
input_shapes = {"batch_size": 4, "sequence_length": 512}
model = NeuronModelForFeatureExtraction.from_pretrained(
    model_id="TaylorAI/bge-micro-v2", # BERT SMALL
    export=True,
    disable_neuron_cache=True,
    **compiler_args,
    **input_shapes,
)

With Sentence Transformers

import torch
from optimum.neuron import NeuronModelForSentenceTransformers
from transformers import AutoConfig, AutoTokenizer

compiler_args = {"auto_cast": "matmul", "auto_cast_type": "fp16"}
input_shapes = {"batch_size": 4, "sequence_length": 512}
model = NeuronModelForSentenceTransformers.from_pretrained(
    model_id="TaylorAI/bge-micro-v2", # BERT SMALL
    export=True,
    disable_neuron_cache=True,
    **compiler_args,
    **input_shapes,
)

HuggingFaceDocBuilderDev · 2025-01-07T14:19:59Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

dacorvo

LGTM, Thanks ! We got hub errors on some of the CI jobs, but it is completely unrelated.

JingyaHuang added 2 commits January 7, 2025 14:11

fix

f6202aa

Merge branch 'main' into fix-emb-model-with-trfrs

659ee3d

JingyaHuang mentioned this pull request Jan 7, 2025

Loading compiled fails: model_type=bert -> transformers being used in compiled config. #744

Closed

fix test

500ef33

JingyaHuang requested a review from dacorvo January 9, 2025 12:26

dacorvo approved these changes Jan 9, 2025

View reviewed changes

JingyaHuang merged commit fe71b3c into main Jan 9, 2025
7 of 10 checks passed

JingyaHuang deleted the fix-emb-model-with-trfrs branch January 9, 2025 12:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix emb model export and load with trfrs #756

Fix emb model export and load with trfrs #756

JingyaHuang commented Jan 7, 2025

HuggingFaceDocBuilderDev commented Jan 7, 2025

dacorvo left a comment

Fix emb model export and load with trfrs #756

Fix emb model export and load with trfrs #756

Conversation

JingyaHuang commented Jan 7, 2025

What does this PR do?

HuggingFaceDocBuilderDev commented Jan 7, 2025

dacorvo left a comment

Choose a reason for hiding this comment