With the advancement of A.I. technology in recent years, natural language processing technology has been able to solve so many problems. While working as an NLP engineer, I encountered various tasks, and I thought it would be nice to gather and organize the natural language processing tasks I have dealt with in one place. Borrowing Kyubyong's project format, I organized natural language processing tasks with references and example code.
WIKI
Automated Essay ScoringDATA
The Hewlett Foundation: Automated Essay ScoringMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's AES
WIKI
Speech RecognitionDATA
LibriSpeechDATA
AISHELL-1DATA
KsponSpeechMODEL
Deep Speech2MODEL
Listen, Attend and SpellMODEL
Wav2vec 2.0OFF-THE-SHELF
Pororo's ASRCODE
Example with KsponSpeech
WIKI
Dialogue SystemDATA
Persona ChatDATA
Korean SNS CorpusMODEL
Dialogue GPTCODE
Example with Korean SNS Corpus
WIKI
Dialogue SystemDATA
Persona Chat
DATA
Ubuntu Dialogue CorpusDATA
Korean SNS CorpusMODEL
Poly EncoderCODE
Example with Ubuntu Dialogue Corpus
WIKI
Cloze TestINFO
Masked-Language-Modeling with BERTMODEL
BERTMODEL
RoBERTaOFF-THE-SHELF
Pororo's Fill in the BlankCODE
Example with WikiCorpus
WIKI
AutocorrectionDATA
NUS Non-commercial research/trial corpus licenseDATA
Cornell Movie--Dialogs CorpusOFF-THE-SHELF
Pororo's GEC
WIKI
GraphemeWIKI
PhonemeREPRESENTATIVE-DATA
Multilingual Pronunciation DataOFF-THE-SHELF-MODEL
Pororo's G2P
PAPER
Wizard of Wikipedia: Knowledge-Powered Conversational agentsDATA
Wizard of WikipediaCODE
Example with Wizard of Wikipedia
WIKI
Language ModelINFO
A beginner’s guide to language modelsMODEL
GPT3MODEL
GPT2MODEL
Ken-LMMODEL
RNN-LMCODE
Example with OpenWebText
WIKI
Reading ComprehensionINFO
Machine Reading Comprehension with BERTDATA
SQuADDATA
KorQuadMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's MRCCODE
Example with SQuAD & KorQuad
WIKI
TranslationDATA
WMT 2014 English-to-FrenchDATA
Korean-English translation corpusMODEL
TransformerOFF-THE-SHELF
Pororo's TranslationCODE
Example with Korean-English translation corpus
PAPER-WITH-CODE
Math Word Problem SolvingDATA
DeepMind Mathmatics DatasetDATA
KMWP (Korean Math Word Problems)CODE
Example with KMWP
WIKI
Textual EntailmentDATA
GLUE-MNLIDATA
KorNLIMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's NLICODE
Example with GLUE-MNLI
WIKI
Named Entity RecognitionDATA
CoNLL-2002 NER corpusDATA
CoNLL-2003 NER corpusDATA
Naver NERMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's NERCODE
Example with Naver NER
WIKI
ParaphraseOFF-THE-SHELF
Pororo's Paraphrase Generation
OFF-THE-SHELF
Pororo's P2G
WIKI
Sentiment AnalysisDATA
GLUE-SSTDATA
NSMCMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's Sentiment AnalysisCODE
Example with NSMC
WIKI
Semantic SimilarityDATA
GLUE-STSDATA
KorSTSMODEL
BERTMODEL
RoBERTaMODEL
ElectraOFF-THE-SHELF
Pororo's STSCODE
Example with SQuAD
WIKI
Speech SynthesisDATA
LJ SpeechDATA
CSS10DATA
KSSMODEL
Tacotron2MODEL
FastSpeech2MODEL
WaveNetMODEL
Hifi-GANOFF-THE-SHELF
Pororo's TTSCODE
Example with LJ-SpeechCODE
Example with KSS
WIKI
Automatic SummarizationDATA
XSumDATA
Korean Summarization CorpusMODEL
BARTOFF-THE-SHELF
Pororo's SummarizationCODE
Example with XSum
- Soohwan Kim @sooftware
- Contacts: [email protected]