Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

您好,请问怎么把这个用于视频分类?比如5分钟的短视频 #6

Open
dotsonliu opened this issue Feb 19, 2021 · 4 comments

Comments

@dotsonliu
Copy link

No description provided.

@christy-yuan-li
Copy link
Collaborator

Thanks for your question. We haven't applied the model to video classification. However, you could use the ViT as a base model to encode each frame of your video.

@dotsonliu
Copy link
Author

when there has hundreds of frames ,How to deal with it?

@christy-yuan-li
Copy link
Collaborator

christy-yuan-li commented Feb 23, 2021

Thank you for your question. The problem of how to efficiently process videos is interesting, but not the focus of this repo. We are happy to discuss this potential application with you, but maybe at some other venue. I would suggest you to check related literature first. My previous response was mainly to convey the idea that ViT can be used for processing images in general.

@runningJ
Copy link

Directly using VIT to process video may result in bad results.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants