text_embedding comes from this call: img_features, text_features = self.extract_global_feature(features)
Here there is only a linear layer mapping.
The paper expresses this as a formula involving clip_text_embedding, but I can't find clip_text_embedding in the code. Can you help me find where the code uses it?
Thanks for this interesting work.
The paper uses cos_sim to compute the similarity between the Learned Text Embeddings and the CLIP Text Embeddings, but I can't find where the code does this.
There doesn't seem to be a calculation going on here.
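For concreteness, this is roughly the kind of computation I expected to find. It is only a minimal NumPy sketch, not code from the repository: the function name cosine_similarity_loss, the argument names, and the choice of averaging 1 - cos_sim as the loss are all my assumptions, and the paper's exact formulation may differ.

```python
import numpy as np


def cosine_similarity_loss(learned_text_emb, clip_text_emb, eps=1e-8):
    """Hypothetical helper (not from the repository).

    Takes two arrays of shape (num_texts, dim) whose rows are aligned,
    and returns the mean of (1 - cosine similarity) over the rows.
    """
    learned = np.asarray(learned_text_emb, dtype=np.float64)
    clip = np.asarray(clip_text_emb, dtype=np.float64)

    # L2-normalise each row; eps guards against division by zero.
    learned_norm = np.linalg.norm(learned, axis=-1, keepdims=True)
    clip_norm = np.linalg.norm(clip, axis=-1, keepdims=True)
    learned_unit = learned / np.maximum(learned_norm, eps)
    clip_unit = clip / np.maximum(clip_norm, eps)

    # Row-wise dot product of unit vectors is the cosine similarity.
    cos_sim = np.sum(learned_unit * clip_unit, axis=-1)

    # Assumed loss form: 0 when embeddings point the same way, 2 when opposite.
    return float(np.mean(1.0 - cos_sim))
```

With something like this, the learned text embeddings (the output of the linear layer) would be pulled toward frozen CLIP text embeddings, which is why I was looking for a clip_text_embedding tensor in the code.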