Ensemble Task Implementation #50

sdsawtelle · 2022-05-22T21:04:04Z

@HobbitLong Thank you very much for making the effort to clean and post your code for these benchmarks! I'm sure that you don't have time to post code for the ensemble distillation task, but I am going to try reproducing that benchmark so perhaps if there are any tricks or different hyperparameters settings that you can remember for that particular task off the top of your head then we can document them in this issue.

sdsawtelle · 2022-05-29T20:56:31Z

For Figure 4 in the paper, I'm wondering exactly how a single point is generated in those plots. For example, for the point that is ResNet distillation from four teachers, is that an average over multiple trials? And if so, for each trial are four new teachers trained from scratch for that trial? Or was there a pool of e.g. 8 teachers and each 4-teacher trial randomly selects four from among those 8, each 6-teacher trial randomly selects 6 from among those 8 etc?

ShristiDasBiswas · 2024-03-27T19:21:33Z

Hi, were you able to reproduce the ensemble distillation task?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ensemble Task Implementation #50

Ensemble Task Implementation #50

sdsawtelle commented May 22, 2022

sdsawtelle commented May 29, 2022

ShristiDasBiswas commented Mar 27, 2024

Ensemble Task Implementation #50

Ensemble Task Implementation #50

Comments

sdsawtelle commented May 22, 2022

sdsawtelle commented May 29, 2022

ShristiDasBiswas commented Mar 27, 2024