feat: add a deep thinking reasoner model (o1-preview/mini) #68

michaelneale · 2024-09-17T08:17:11Z

This introduces the concept of a "slower" reasoner model (alongside accelerator and processor) and uses it via a toolkit for enhanced debugging, understanding, and code authoring.

It will use the o1 models where needed, to complement gpt-4o and gpt-4o-mini in the OpenAI case. The "reasoner" model doesn't do tool calling or planning directly; it is only consulted.
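To illustrate the consulting pattern described above, here is a minimal sketch. It is not the code in `src/goose/toolkit/reasoner.py`: `ReasonerSketch`, `deep_reason`, `call_model`, and `fake_call` are hypothetical names, and the model call is a plain function passed in so that the example runs without any provider. The default of `o1-mini` is an assumption taken from the "use mini as recommended for now" commit.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical signature: (model_name, prompt) -> text reply.
# A real integration would route this through a provider client instead.
ModelCall = Callable[[str, str], str]


@dataclass
class ReasonerSketch:
    """Illustrative toolkit: exposes one tool that consults a slower model."""

    call_model: ModelCall
    reasoner_model: str = "o1-mini"  # assumption, see lead-in

    def deep_reason(self, problem: str) -> str:
        """Tool the main model can invoke for hard problems.

        The reasoner gets only a text prompt and returns advice as text;
        it is not handed any tools and does no planning of its own.
        """
        prompt = (
            "Think carefully about the following problem and explain "
            "your reasoning:\n\n" + problem
        )
        return self.call_model(self.reasoner_model, prompt)


def fake_call(model: str, prompt: str) -> str:
    """Stand-in for a real model call; echoes the last line of the prompt."""
    return f"[{model}] advice for: {prompt.splitlines()[-1]}"


if __name__ == "__main__":
    toolkit = ReasonerSketch(call_model=fake_call)
    print(toolkit.deep_reason("why does the test hang?"))
```

Because the reasoner sits behind a single tool, the main model decides when to consult it, so the slower model is not involved in every step.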

src/goose/toolkit/reasoner.py

tests/curves/p256_tests.rs

wesrblock

Looks great!

michaelneale · 2024-09-19T07:51:31Z

I have been using this all week to great success.

codefromthecrypt · 2024-09-30T06:45:28Z

@michaelneale curious: what specific sorts of things do you feel were more effective or possible with this vs. without?

michaelneale · 2024-10-01T08:46:43Z

@codefromthecrypt mostly harder problems - for example the "interactive" fix for goose came from it initially (goose wasn't able to solve things that deep before). It also seems to help avoid goose prematurely jumping to a solution by misunderstanding the nuances of a problem and doing a "too obvious" thing. Well worth it IMO (and as a toolkit - so it doesn't kick in all the time).

michaelneale · 2024-10-07T22:30:56Z

@baxen I think this is ready for review, if there is interest in including it as something people can opt into or not (as a toolkit, no extra deps)

* main:
  feat: add groq provider (#134)
  feat: add a deep thinking reasoner model (o1-preview/mini) (#68)
  fix: use concrete SessionNotifier (#135)
  feat: add guards to session management (#101)
  fix: Set default model configuration for the Google provider. (#131)
  test: convert Google Gemini tests to VCR (#118)
  chore: Add goose providers list command (#116)
  docs: working ollama for desktop (#125)
  docs: format and clean up warnings/errors (#120)
  docs: update deploy workflow (#124)
  feat: Implement a goose run command (#121)

* main: (23 commits)
  feat: Run with resume session (#153)
  refactor: move langfuse wrapper to a module in exchange instead of a package (#138)
  docs: add subheaders to the 'Other ways to run Goose' section (#155)
  fix: Remove tools from exchange when summarizing files (#157)
  chore: use primitives instead of typing imports and fixes completion … (#149)
  chore: make vcr tests pretty-print JSON (#146)
  chore(release): goose 0.9.5 (#159)
  chore(release): exchange 0.9.5 (#158)
  chore: updates ollama default model from mistral-nemo to qwen2.5 (#150)
  feat: add vision support for Google (#141)
  fix: session resume with arg handled incorrectly (#145)
  docs: add release instructions to CONTRIBUTING.md (#143)
  docs: add link to action, IDE words (#140)
  docs: goosehints doc fix only (#142)
  chore(release): release 0.9.4 (#136)
  revert: "feat: add local langfuse tracing option (#106)" (#137)
  feat: add local langfuse tracing option (#106)
  feat: add groq provider (#134)
  feat: add a deep thinking reasoner model (o1-preview/mini) (#68)
  fix: use concrete SessionNotifier (#135)
  ...

michaelneale added 5 commits September 17, 2024 15:48

some progress

9e58dc8

progress

d8eb939

more progress

7bbc92f

check point

6ef3b03

seems to be working ok now

4b66341

michaelneale changed the title ~~O1 reasoner~~ feat: add a deep thinking reasoner model Sep 17, 2024

michaelneale mentioned this pull request Sep 17, 2024

feat: O1 support in goose for new models square/exchange#47

Closed

michaelneale changed the title ~~feat: add a deep thinking reasoner model~~ feat: add a deep thinking reasoner model (o1-preview/mini) Sep 17, 2024

wesrblock reviewed Sep 17, 2024

src/goose/toolkit/reasoner.py (outdated, resolved)

michaelneale added 4 commits September 18, 2024 11:25

typo

a83717a

tidy

3e97233

use mini as recommended for now

62f894a

Merge remote-tracking branch 'origin/main' into o1-reasoner

cd5b77a

michaelneale requested a review from wesrblock September 18, 2024 05:53

michaelneale added the enhancement New feature or request label Sep 18, 2024

michaelneale added 4 commits September 18, 2024 16:25

checkpoint

3d366c2

using mainstream provider

f63a411

removing unneeded stuff

28e6e4f

formatting

c8cdcb7

codefromthecrypt reviewed Sep 18, 2024

tests/curves/p256_tests.rs (outdated, resolved)

removing test junk

843b4d8

wesrblock approved these changes Sep 18, 2024

Merge remote-tracking branch 'origin/main' into o1-reasoner

f42d885

michaelneale marked this pull request as ready for review September 19, 2024 07:33

michaelneale requested review from codefromthecrypt and baxen and removed request for codefromthecrypt September 19, 2024 07:33

updated to main

7504678

michaelneale added the work-in-progress label Sep 25, 2024

michaelneale added 2 commits September 27, 2024 16:17

Merge remote-tracking branch 'origin/main' into o1-reasoner

bf31ee9

tuning the prompts for tools

71ea8fd

michaelneale removed the work-in-progress label Oct 7, 2024

salman1993 approved these changes Oct 10, 2024

salman1993 merged commit 8706e9e into main Oct 10, 2024
2 checks passed

lamchau deleted the o1-reasoner branch October 24, 2024 11:45
