Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add a deep thinking reasoner model (o1-preview/mini) #68

Merged
merged 18 commits into from
Oct 10, 2024

Conversation

michaelneale
Copy link
Collaborator

@michaelneale michaelneale commented Sep 17, 2024

This starts to introduce the concept of a "slower" reasoner model (alongside accelerator and processor) - and uses it via a toolkit for enhanced debugging/understanding and code authoring.

It will use the o1 models in some cases as needed, to compliment gpt4-o and gpt-4o-mini in the openai case. The "reasoner" model doesn't do tool calling or planning directly, but is consulting.

@michaelneale michaelneale changed the title O1 reasoner feat: add a deep thinking reasoner model Sep 17, 2024
@michaelneale michaelneale changed the title feat: add a deep thinking reasoner model feat: add a deep thinking reasoner model (o1-preview/mini) Sep 17, 2024
@michaelneale michaelneale added the enhancement New feature or request label Sep 18, 2024
Copy link

@wesrblock wesrblock left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks great!

@michaelneale michaelneale marked this pull request as ready for review September 19, 2024 07:33
@michaelneale michaelneale requested review from codefromthecrypt and baxen and removed request for codefromthecrypt September 19, 2024 07:33
@michaelneale
Copy link
Collaborator Author

I have been using this all week to great success.

@codefromthecrypt
Copy link
Collaborator

@michaelneale curious what specific sort of thing do you feel was more effective or possible with this vs without?

@michaelneale
Copy link
Collaborator Author

@codefromthecrypt mostly harder problems - for example the "interactive" fix for goose came from it initially (goose wasn't able to solve things that deep before). It also seems to help avoid goose prematurely jumping to a solution by misunderstanding the nuances of a problem and doing a "too obvious" thing. Well worth it IMO (and as a toolkit - so it doesn't kick in all the time).

@michaelneale
Copy link
Collaborator Author

@baxen I think this is ready for review - if interested to include it as something people can opt in or not (as toolkit, no extra deps)

@salman1993 salman1993 merged commit 8706e9e into main Oct 10, 2024
2 checks passed
ahau-square added a commit that referenced this pull request Oct 10, 2024
* main:
  feat: add groq provider (#134)
  feat: add a deep thinking reasoner model (o1-preview/mini) (#68)
  fix: use concrete SessionNotifier (#135)
  feat: add guards to session management (#101)
  fix: Set default model configuration for the Google provider. (#131)
  test: convert Google Gemini tests to VCR (#118)
  chore: Add goose providers list command (#116)
  docs: working ollama for desktop (#125)
  docs: format and clean up warnings/errors (#120)
  docs: update deploy workflow (#124)
  feat: Implement a goose run command (#121)
lukealvoeiro added a commit that referenced this pull request Oct 17, 2024
* main: (23 commits)
  feat: Run with resume session (#153)
  refactor: move langfuse wrapper to a module in exchange instead of a package (#138)
  docs: add subheaders to the 'Other ways to run Goose' section (#155)
  fix: Remove tools from exchange when summarizing files (#157)
  chore: use primitives instead of typing imports and fixes completion … (#149)
  chore: make vcr tests pretty-print JSON (#146)
  chore(release): goose 0.9.5 (#159)
  chore(release): exchange 0.9.5 (#158)
  chore: updates ollama default model from mistral-nemo to qwen2.5 (#150)
  feat: add vision support for Google (#141)
  fix: session resume with arg handled incorrectly (#145)
  docs: add release instructions to CONTRIBUTING.md (#143)
  docs: add link to action, IDE words (#140)
  docs: goosehints doc fix only (#142)
  chore(release): release 0.9.4 (#136)
  revert: "feat: add local langfuse tracing option  (#106)" (#137)
  feat: add local langfuse tracing option  (#106)
  feat: add groq provider (#134)
  feat: add a deep thinking reasoner model (o1-preview/mini) (#68)
  fix: use concrete SessionNotifier (#135)
  ...
@lamchau lamchau deleted the o1-reasoner branch October 24, 2024 11:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants