-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
您好,请问怎么把这个用于视频分类?比如5分钟的短视频 #6
Comments
Thanks for your question. We haven't applied the model to video classification. However, you could use the ViT as a base model to encode each frame of your video. |
when there has hundreds of frames ,How to deal with it? |
Thank you for your question. The problem of how to efficiently process videos is interesting, but not the focus of this repo. We are happy to discuss this potential application with you, but maybe at some other venue. I would suggest you to check related literature first. My previous response was mainly to convey the idea that ViT can be used for processing images in general. |
Directly using VIT to process video may result in bad results. |
No description provided.
The text was updated successfully, but these errors were encountered: