This repository contains the source code for the ACL 2024 demo of EmpathyEar: An Open-source Avatar Multimodal Empathetic Chatbot. EmpathyEar is a pioneering open-source, avatar-based multimodal empathetic chatbot that fills the gap left by traditional text-only empathetic response generation (ERG) systems.
Demo video: resized-video-demo-trimmed.mp4
- Download the ChatGLM3 checkpoints from https://huggingface.co/THUDM/chatglm3-6b/tree/main and place them in the ChatGLM-6B folder.
- Download the pre-trained ChatGLM3 LoRA checkpoints from https://pan.baidu.com/s/14zzdxyRZL3dqBmI2hJPlIw?pwd=qj4w and place them in the ChatGLM-6B folder. Alternatively, you can fine-tune ChatGLM3 yourself with the commands below (a loading sketch follows them):
```bash
cd chatglm
./scripts/finetune_lora.sh
```
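If you want to sanity-check the LoRA weights before running the full pipeline, the sketch below loads ChatGLM3 together with an adapter and generates one reply. It is only a minimal example under stated assumptions: the adapter path `ChatGLM-6B/lora_checkpoint` and the prompt are placeholders, not paths defined by this repository.

```python
# Minimal sketch: load ChatGLM3 with a LoRA adapter via PEFT and chat once.
# The adapter directory below is hypothetical; point it at wherever you placed
# the downloaded or fine-tuned LoRA weights.
from transformers import AutoTokenizer, AutoModel
from peft import PeftModel

base_id = "THUDM/chatglm3-6b"
tokenizer = AutoTokenizer.from_pretrained(base_id, trust_remote_code=True)
base_model = AutoModel.from_pretrained(base_id, trust_remote_code=True).half().cuda()

# Attach the empathetic-dialogue LoRA weights on top of the base model.
model = PeftModel.from_pretrained(base_model, "ChatGLM-6B/lora_checkpoint").eval()

# ChatGLM3 exposes a chat() helper through its trust_remote_code model class.
response, history = model.chat(tokenizer, "I failed my exam and I feel awful.", history=[])
print(response)
```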
- Download the StyleTTS2 model pre-trained on LibriTTS from https://huggingface.co/yl4579/StyleTTS2-LibriTTS/tree/main and place it in the StyleTTS2-LibriTTS folder.
- Download the pretrained EAT models and place them in the ckpt and Utils folders, respectively, using the following commands:
```bash
gdown --id 1KK15n2fOdfLECWN5wvX54mVyDt18IZCo && unzip -q ckpt.zip -d ckpt
gdown --id 1HGVzckXh-vYGZEUUKMntY1muIbkbnRcd && unzip -q Utils.zip -d Utils
```
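Before building the environment, it can help to confirm that every checkpoint ended up where the steps above place it. The folder names in this sketch are simply the ones mentioned in this README; adjust them if your local layout differs.

```python
# Quick sanity check that the checkpoint folders from the steps above exist
# and are non-empty. Folder names are taken from this README and may need
# adjusting to your local layout.
from pathlib import Path

expected = ["ChatGLM-6B", "StyleTTS2-LibriTTS", "ckpt", "Utils"]
missing = [name for name in expected if not any(Path(name).glob("*"))]
if missing:
    raise FileNotFoundError(f"Missing or empty checkpoint folders: {missing}")
print("All checkpoint folders are in place.")
```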
With the checkpoints in place, create the conda environment and run inference:

```bash
conda env create -f environment.yml
conda activate empathyear
python inference.py
```
The generated TTS wav files will be saved in TTS_audio, and the generated talking face videos will be saved in MP4_video.
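As a quick check after a run, you can list what was produced. This sketch only enumerates the two output folders named above and assumes inference.py has already been executed from the repository root.

```python
# List the outputs written by inference.py, using the folder names given above.
from pathlib import Path

for wav in sorted(Path("TTS_audio").glob("*.wav")):
    print("audio:", wav.name)
for mp4 in sorted(Path("MP4_video").glob("*.mp4")):
    print("video:", mp4.name)
```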
We acknowledge the following works for their publicly released code: ChatGLM3, ImageBind, StyleTTS2, and EAT.
This repository is released under the BSD 3-Clause License. EmpathyEar is a research project intended for non-commercial use only. The code must NOT be used for any illegal, harmful, violent, racist, or sexual purposes, and users are strictly prohibited from engaging in any activity that may violate these guidelines. Any potential commercial use of this code must be approved by the authors.