
OLMo 2 (WIP) #1897

Open · wants to merge 10 commits into main
Conversation

ysjprojects (Contributor) commented:

https://huggingface.co/collections/allenai/olmo-2-674117b93ab84e98afc72edc
https://arxiv.org/abs/2501.00656

Version 2 of OLMo released by Ai2.

Comes in 7B and 13B sizes, with Base and Instruct variants as well as additional SFT and DPO models.

First, we find that OLMo 2 7B and 13B are the best fully open models to date, often outperforming open-weight models of equivalent size. Not only do we observe a dramatic improvement in performance across all tasks compared to our earlier OLMo 0424 model, but, notably, OLMo 2 7B outperforms Llama-3.1 8B and OLMo 2 13B outperforms Qwen 2.5 7B despite lower total training FLOPs. The OLMo 2 models sit at the Pareto frontier of training FLOPs vs. model average performance.
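For reference, here's a minimal sketch of how the new checkpoints could be exercised through LitGPT's existing Python API once this PR is merged. The repo ID used below (allenai/OLMo-2-1124-7B-Instruct) is taken from the linked Hugging Face collection and is an assumption until the configs are finalized.

```python
# Minimal sketch (not part of this PR): loading an OLMo 2 checkpoint through
# LitGPT's Python API once the configs land. The repo ID below is assumed from
# the linked Hugging Face collection and may differ until the PR is finalized.
from litgpt import LLM

# Download/convert the checkpoint and load it for inference.
llm = LLM.load("allenai/OLMo-2-1124-7B-Instruct")

# Quick sanity check that the converted weights produce sensible text.
print(llm.generate("What is the capital of France?", max_new_tokens=50))
```

The same checkpoint should also be usable from the CLI (e.g. `litgpt chat <repo_id>`) once the config is registered; treat that as an assumption as well until the PR is done.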

ysjprojects changed the title from "OLMo 2" to "OLMo 2 (WIP)" on Jan 4, 2025.
rasbt (Collaborator) commented on Jan 8, 2025:

Hi there,
just wanted to say thanks for taking on this PR (I know this is a lot of work)! The OLMo models are awesome, and it'd be great to have OLMo 2 in LitGPT.

ysjprojects (Contributor, Author) commented:

> Hi there, just wanted to say thanks for taking on this PR (I know this is a lot of work)! The OLMo models are awesome, and it'd be great to have OLMo 2 in LitGPT.

Thanks mate!

Currently on vacation, will resume working on this PR once I'm back.
