Change the repository type filter
All
Repositories list
82 repositories
SWE-agent
Public[NeurIPS 2024] SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges.SWE-bench
Public[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?HELMET
PublicSimPO
Public[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free RewardEdge-Pruning
Public- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
LESS
Public- [EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
ALCE
Public[EMNLP 2023] Enabling Large Language Models to Generate Text with Citations. Paper: https://arxiv.org/abs/2305.14627NLProofS
PublicEMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443MQuAKE
PublicAutoCompressors
PublicWebShop
Public[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agentsil-scaling-in-games
PublicLM-Science-Tutor
PublicUSACO
PublicELIZA-Transformer
PublicLitSearch
PublicPTP
PublicImproving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073LLMBar
PublicCopyCat
Publictree-of-thought-llm
Public[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language ModelsQuRating
PublicCEPE
PublicHeuristic-Core
Public[ACL 2024] The Heuristic Core: Understanding Subnetwork Generalization in Pretrained Language Models - https://arxiv.org/abs/2403.03942c-sts
PublicTransformerPrograms
Public