mesolitica/embedding-benchmarks

embedding-benchmarks

Benchmarking RAG embedding models for the Malaysian context. The Hugging Face Space is at https://huggingface.co/spaces/mesolitica/Malaysian-Embedding-Leaderboard

📈 We evaluate models on 2 datasets:

  1. Research papers found on Crossref with the keyword "melayu", https://huggingface.co/datasets/mesolitica/malaysian-ultrachat/resolve/main/ultrachat-crossref-melayu-malay.jsonl
  2. PDF files from lom.agc.gov.my, https://huggingface.co/datasets/mesolitica/malaysian-ultrachat/resolve/main/ultrachat-lom-agc.jsonl
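This README does not state the scoring metric or the record layout of the JSONL files, so the following is only a minimal sketch of how a retrieval benchmark over such files could be scored. It assumes each query is paired with exactly one relevant passage at the same index and uses top-k accuracy with cosine similarity. The helper names `read_jsonl`, `cosine`, and `top_k_accuracy` are hypothetical and not taken from this repository.

```python
import json
import math


def read_jsonl(path):
    """Read a JSON Lines file into a list of dicts, skipping blank lines."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def top_k_accuracy(query_vecs, doc_vecs, k=1):
    """Fraction of queries whose paired document ranks in the top k.

    Assumption: query i is paired with document i. The metric actually
    used by the leaderboard is not stated in this README.
    """
    hits = 0
    for i, query in enumerate(query_vecs):
        # Rank all document indices by similarity to this query, best first.
        ranked = sorted(
            range(len(doc_vecs)),
            key=lambda j: cosine(query, doc_vecs[j]),
            reverse=True,
        )
        if i in ranked[:k]:
            hits += 1
    return hits / len(query_vecs)
```

To use it, the downloaded JSONL rows would first be turned into query and passage embeddings by the model under test; which fields hold the query and the passage has to be checked against the files themselves.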

