feat: add a deep thinking reasoner model (o1-preview/mini) #68
Looks great!
I have been using this all week to great success.
@michaelneale curious: what specific sorts of things did you feel were more effective or possible with this than without?
@codefromthecrypt mostly harder problems - for example, the "interactive" fix for goose came from it initially (goose wasn't able to solve things that deep before). It also seems to help keep goose from prematurely jumping to a solution by misunderstanding the nuances of a problem and doing a "too obvious" thing. Well worth it IMO (and as a toolkit - so it doesn't kick in all the time).
@baxen I think this is ready for review - if you're interested in including it as something people can opt in to or not (as a toolkit, no extra deps).
* main:
  feat: add groq provider (#134)
  feat: add a deep thinking reasoner model (o1-preview/mini) (#68)
  fix: use concrete SessionNotifier (#135)
  feat: add guards to session management (#101)
  fix: Set default model configuration for the Google provider. (#131)
  test: convert Google Gemini tests to VCR (#118)
  chore: Add goose providers list command (#116)
  docs: working ollama for desktop (#125)
  docs: format and clean up warnings/errors (#120)
  docs: update deploy workflow (#124)
  feat: Implement a goose run command (#121)
* main: (23 commits)
  feat: Run with resume session (#153)
  refactor: move langfuse wrapper to a module in exchange instead of a package (#138)
  docs: add subheaders to the 'Other ways to run Goose' section (#155)
  fix: Remove tools from exchange when summarizing files (#157)
  chore: use primitives instead of typing imports and fixes completion … (#149)
  chore: make vcr tests pretty-print JSON (#146)
  chore(release): goose 0.9.5 (#159)
  chore(release): exchange 0.9.5 (#158)
  chore: updates ollama default model from mistral-nemo to qwen2.5 (#150)
  feat: add vision support for Google (#141)
  fix: session resume with arg handled incorrectly (#145)
  docs: add release instructions to CONTRIBUTING.md (#143)
  docs: add link to action, IDE words (#140)
  docs: goosehints doc fix only (#142)
  chore(release): release 0.9.4 (#136)
  revert: "feat: add local langfuse tracing option (#106)" (#137)
  feat: add local langfuse tracing option (#106)
  feat: add groq provider (#134)
  feat: add a deep thinking reasoner model (o1-preview/mini) (#68)
  fix: use concrete SessionNotifier (#135)
  ...
This starts to introduce the concept of a "slower" reasoner model (alongside accelerator and processor) and uses it via a toolkit for enhanced debugging/understanding and code authoring.
It will use the o1 models in some cases, as needed, to complement gpt-4o and gpt-4o-mini in the OpenAI case. The "reasoner" model doesn't do tool calling or planning directly; it is only consulted.
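
To make the "consulted, not in control" idea concrete, here is a minimal sketch of how a toolkit might wrap a slower reasoner model. It is not the code in this PR: `ReasonerToolkit`, `deep_think`, and `fake_reasoner` are hypothetical names, and the model is stubbed out as a plain callable so the example runs without any API access.

```python
from dataclasses import dataclass
from typing import Callable

# Assumption: a "model" is any callable that maps a prompt string to a reply string.
Model = Callable[[str], str]


@dataclass
class ReasonerToolkit:
    """Hypothetical opt-in toolkit exposing a slower reasoner model as a tool."""

    reasoner: Model

    def deep_think(self, problem: str) -> str:
        """Ask the reasoner for advice on a hard problem.

        The reasoner only returns text. It performs no tool calls and no
        planning, so the calling agent stays in charge of what happens next.
        """
        prompt = f"Think carefully and advise on this problem:\n{problem}"
        return self.reasoner(prompt)


# Records every prompt the stub receives, so the call can be inspected.
seen_prompts: list[str] = []


def fake_reasoner(prompt: str) -> str:
    # Stand-in for a call to a slow model such as o1-preview (hypothetical stub).
    seen_prompts.append(prompt)
    return "ADVICE: reproduce the hang with a minimal script first."


toolkit = ReasonerToolkit(reasoner=fake_reasoner)
advice = toolkit.deep_think("Why does the interactive command hang?")
print(advice)
```

Because the reasoner sits behind a single tool method, the faster models decide when to call it, which matches the point above that it does not kick in all the time.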