Ensemble Task Implementation #50

Open
sdsawtelle opened this issue May 22, 2022 · 2 comments

Comments

@sdsawtelle

@HobbitLong Thank you very much for taking the time to clean up and post your code for these benchmarks! I'm sure you don't have time to post code for the ensemble distillation task, but I am going to try reproducing that benchmark. If you can remember any tricks or different hyperparameter settings for that particular task off the top of your head, we can document them in this issue.

@sdsawtelle
Author

For Figure 4 in the paper, I'm wondering exactly how a single point in those plots is generated. For example, for the point that is ResNet distillation from four teachers, is that an average over multiple trials? If so, are four new teachers trained from scratch for each trial? Or was there a fixed pool of, e.g., 8 teachers, where each 4-teacher trial randomly selects 4 from among those 8, each 6-teacher trial randomly selects 6 from among those 8, and so on?
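
The two candidate protocols in this question can be sketched as follows. This is only an illustration of the question, not the paper's actual procedure: the function names, the integer teacher IDs standing in for trained models, and the pool size of 8 are all assumptions.

```python
import random


def fresh_teachers_protocol(num_teachers, num_trials):
    """Hypothetical protocol A: every trial trains its own new teachers.

    Teacher IDs are placeholders for independently trained models,
    so no ID is ever reused across trials.
    """
    trials = []
    next_id = 0
    for _ in range(num_trials):
        trials.append(list(range(next_id, next_id + num_teachers)))
        next_id += num_teachers
    return trials


def shared_pool_protocol(num_teachers, num_trials, pool_size=8, seed=0):
    """Hypothetical protocol B: teachers are trained once into a fixed pool.

    Each trial draws num_teachers distinct teachers from that pool,
    so the same teacher may appear in several trials.
    """
    rng = random.Random(seed)
    pool = list(range(pool_size))
    return [sorted(rng.sample(pool, num_teachers)) for _ in range(num_trials)]


if __name__ == "__main__":
    print("fresh teachers:", fresh_teachers_protocol(num_teachers=4, num_trials=3))
    print("shared pool:   ", shared_pool_protocol(num_teachers=4, num_trials=3))
```

Under protocol A, three 4-teacher trials need 12 trained teachers; under protocol B they need only the 8 in the pool, but the trials are no longer independent because teachers are shared.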

@ShristiDasBiswas

Hi, were you able to reproduce the ensemble distillation task?


2 participants