TurkuNLP/finnish-instructions

finnish-instructions

Centralized repo for Finnish instruction data. License?

Format

instruction: mandatory. May include all the required context.

input: optional. Additional context for the instruction; may be an empty string.

output: Generated response.

Examples:

instruction: "Laadi kysymys annetusta tekstistä."
input: "Teksti: Sandra siirtyi toimistoon. Sandra meni kylpyhuoneeseen. Mary meni makuuhuoneeseen. Daniel siirtyi eteiseen."
output: "Missä on Daniel?"

or

instruction: "Kumpi on parempaa suomenkieltä, vaihtoehto a) {a} vai b) {b}"
input: ""
output: "a"
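The three fields above can be checked mechanically. The following is a minimal sketch, not the repository's own tooling: the function names `validate_record` and `to_jsonl_line` are hypothetical, and both the JSON Lines storage and the requirement that `output` be a string are assumptions, since the format description only states that `instruction` is mandatory and `input` is optional.

```python
import json


def validate_record(record):
    """Return a list of problems; an empty list means the record fits the format."""
    problems = []
    instruction = record.get("instruction")
    # "instruction" is the only field the format description marks as mandatory.
    if not isinstance(instruction, str) or not instruction.strip():
        problems.append("instruction is mandatory and must be a non-empty string")
    # "input" is optional; a missing value is treated like the empty string.
    if not isinstance(record.get("input", ""), str):
        problems.append("input, when present, must be a string")
    # Assumption: every record carries its response as a string.
    if not isinstance(record.get("output"), str):
        problems.append("output must be a string")
    return problems


def to_jsonl_line(record):
    """Serialize a record as one JSON line (assumed storage format)."""
    normalized = {
        "instruction": record["instruction"],
        "input": record.get("input", ""),
        "output": record["output"],
    }
    # ensure_ascii=False keeps Finnish characters such as "ä" readable.
    return json.dumps(normalized, ensure_ascii=False)


example = {
    "instruction": "Laadi kysymys annetusta tekstistä.",
    "input": "Teksti: Sandra siirtyi toimistoon. Sandra meni kylpyhuoneeseen. "
             "Mary meni makuuhuoneeseen. Daniel siirtyi eteiseen.",
    "output": "Missä on Daniel?",
}
```

Calling `validate_record(example)` returns an empty list, and `to_jsonl_line(example)` yields a single line that can be appended to a dataset file.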

Datasets:

TODOs

Ready / contains already usable material

  1. Machine translated from the original English using DeepL
  1. Synthetic datasets

TODOs

  • OCR-correction
  • Masked word prediction
  • Recognize text from space-separated characters
  • Recognize text from characters without space
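Two of the planned synthetic tasks above could be generated along these lines. This is a sketch under stated assumptions, not the planned implementation: the function names, the `[MASK]` token, and the English placeholder instruction strings are all hypothetical, and real data would use Finnish instruction wording.

```python
def make_spaced_characters_example(word, instruction):
    """Build a 'recognize text from space-separated characters' record."""
    # Restricting the input to a single word keeps the task unambiguous:
    # every space in the input is then a separator, never a word boundary.
    return {"instruction": instruction, "input": " ".join(word), "output": word}


def make_masked_word_example(sentence, index, instruction, mask="[MASK]"):
    """Build a 'masked word prediction' record by hiding the word at `index`."""
    words = sentence.split()
    target = words[index]
    words[index] = mask  # hypothetical mask token
    return {"instruction": instruction, "input": " ".join(words), "output": target}


# Placeholder instruction texts, assumed for illustration only.
spaced = make_spaced_characters_example(
    "kylpyhuoneeseen", "Join the characters into a word."
)
masked = make_masked_word_example(
    "Sandra siirtyi toimistoon.", 0, "Fill in the masked word."
)
```

Both helpers return records in the same instruction/input/output format described earlier, so synthetic and native data can be mixed without conversion.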

Ready

  1. Native Finnish datasets:

Ready

  • Paraphrase
  • Question Answering

Processing
