About the training of the classifier #13

Open
Smiling-Weeping-zhr opened this issue Jul 30, 2024 · 3 comments

Comments

@Smiling-Weeping-zhr

Hello authors, we have reproduced your code. When we load the provided ImageNette classifier, the accuracy reaches 0.99. When we train from scratch without loading any weights, however, the accuracy is only 0.73, and the other efficient network architectures we tried for classification reach only 0.85. Could you tell us how you trained the classifier? Thank you very much.

@chanwkimlab
Collaborator

chanwkimlab commented Aug 6, 2024

Hi, thanks for your interest in our work! As the ImageNette dataset contains only 10 easily distinguishable classes, achieving an accuracy over 0.9 is typically easy. If your model's accuracy is only 0.73, there might be something wrong. For more context, you can check the ImageNette leaderboard here. It's difficult to debug the exact issues based on the information provided, but one possible explanation could be that you're using a version of the ImageNette dataset with label noise.
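One way to test the label-noise explanation is to measure how many training labels differ from the clean ones. The sketch below is not from the authors; it assumes the dataset ships a CSV laid out like `noisy_imagenette.csv`, with a `path` column, an `is_valid` column, and label columns named `noisy_labels_0` (clean) through `noisy_labels_50`. The function name and the sample rows are made up for illustration, so adjust the column names to match your copy of the data.

```python
import csv
import io


def label_noise_rate(csv_file, label_column, clean_column="noisy_labels_0"):
    """Return the fraction of training rows whose label differs from the clean label.

    Column names are assumptions about the CSV layout, not confirmed by the thread.
    """
    total = 0
    mismatched = 0
    for row in csv.DictReader(csv_file):
        if row.get("is_valid", "False") == "True":
            continue  # skip validation rows, only training labels matter here
        total += 1
        if row[label_column] != row[clean_column]:
            mismatched += 1
    return mismatched / total if total else 0.0


# Made-up sample rows in the assumed layout (file names are hypothetical).
SAMPLE = """path,noisy_labels_0,noisy_labels_50,is_valid
train/n01440764/a.JPEG,n01440764,n01440764,False
train/n01440764/b.JPEG,n01440764,n02102040,False
train/n02102040/c.JPEG,n02102040,n02102040,False
train/n02102040/d.JPEG,n02102040,n02102040,False
val/n01440764/e.JPEG,n01440764,n03000684,True
"""

rate = label_noise_rate(io.StringIO(SAMPLE), "noisy_labels_50")
print(f"training label noise: {rate:.2%}")
```

On a real dataset, pass an open file handle instead of the in-memory sample. If the column your data loader reads shows a non-zero rate, the model is being trained on corrupted labels.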

@iancovert
Collaborator

Another possibility is that your performance is worse due to training from a random initialization (I'm guessing this is what you mean by training "from scratch"). ViTs are more difficult to train due to less inductive bias (see this paper for example), so it's common to use pre-trained weights (e.g., from ImageNet classification) unless you have a lot of data.

@Smiling-Weeping-zhr
Author

Thanks very much
