# Core idea: the distillation loss

### Teacher
Target labels are one-hot. The teacher model is trained first, with a standard softmax.

### Student
The student model is trained against the trained teacher: each batch is also fed through the teacher network, and the teacher's outputs are softened with a temperature softmax (softmax_t). That is what the temperature t in distillation does: the logits are divided by t before the softmax, x = x / t.
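A minimal sketch of the temperature softmax, assuming t = 5 and made-up logits for one sample:

import torch
import torch.nn.functional as F

logits = torch.tensor([[2.0, 1.0, 0.1]])  # hypothetical teacher logits for one sample

hard = F.softmax(logits, dim=1)        # standard softmax (t = 1)
soft = F.softmax(logits / 5.0, dim=1)  # temperature softmax: divide logits by t first

print(hard)  # ~[0.659, 0.242, 0.099], peaked on the top class
print(soft)  # ~[0.400, 0.327, 0.273], flatter; relative class similarities stay visible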
The student's predictions then enter the combined loss:

loss = distillation(output, target, teacher_output, temp=5.0, alpha=0.7)

The loss has two parts: a soft term that compares the teacher's and the student's temperature-softened distributions (softmax_t) with KL divergence, and a hard term that is the ordinary cross-entropy between the student's raw logits and the true labels. They are combined as a weighted sum:

loss = 2 * alpha * t^2 * KL(teacher_soft || student_soft) + (1 - alpha) * CE(student_logits, target)

The t^2 factor compensates for the 1/t^2 scaling that the temperature introduces into the soft-target gradients. After training, the student model is used on its own; it is much more lightweight than the teacher.
import torch.nn as nn
import torch.nn.functional as F


def distillation(y, labels, teacher_scores, temp, alpha):
    """
    y: student model outputs (raw logits, no softmax)
    labels: true labels
    teacher_scores: teacher model outputs (raw logits, no softmax)
    temp: temperature
    alpha: weight of the soft (distillation) term
    """
    # Soft term: KL divergence between the temperature-softened student and
    # teacher distributions, rescaled by temp^2 so gradient magnitudes stay
    # comparable across temperatures.
    soft_loss = nn.KLDivLoss()(F.log_softmax(y / temp, dim=1),
                               F.softmax(teacher_scores / temp, dim=1)) * (temp * temp * 2.0 * alpha)
    # Hard term: ordinary cross-entropy against the true labels.
    hard_loss = F.cross_entropy(y, labels) * (1. - alpha)
    return soft_loss + hard_loss
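A minimal usage sketch; teacher, student, train_loader, and optimizer are assumed placeholders, not from the original notes. The teacher is frozen in eval mode and only the student is updated:

import torch

teacher.eval()   # teacher is already trained, keep it frozen
student.train()

for data, target in train_loader:
    with torch.no_grad():
        teacher_output = teacher(data)  # soft targets, no gradient through the teacher
    output = student(data)
    loss = distillation(output, target, teacher_output, temp=5.0, alpha=0.7)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()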