Babyai text #454

Open
wants to merge 7 commits into
base: master

BartekCupial

Description

Before I work through the checklist, I wanted to ask for feedback: does the community think this change is useful, and should I continue?

Added BabyAI-Text environments to Minigrid, which provide textual descriptions of observations instead of symbolic representations. This extends the BabyAI platform to enable training language models as agent policies in interactive environments. The new environments maintain the same underlying mechanics as original BabyAI tasks but return text observations describing the agent's partial view of the environment.

Code is copied and adapted from https://github.com/flowersteam/Grounding_LLMs_with_online_RL

Key additions:

  • text description of observation in the info
  • new babyai environments
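
As a rough illustration of how an agent loop might consume text descriptions carried in `info`, here is a minimal sketch. It uses a stand-in environment rather than the classes added in this PR; the class name `StubTextEnv`, the `"descriptions"` key, and the `build_prompt` helper are all assumptions made for the example. Only the `(obs, info)` return shape of `reset()` and the `"mission"` field of the observation follow the existing Gymnasium/Minigrid conventions.

```python
from typing import Any


class StubTextEnv:
    """Stand-in for a text-enabled environment; NOT the class added by this PR."""

    def reset(self) -> tuple[dict[str, Any], dict[str, Any]]:
        # Minigrid observations carry a "mission" string.
        obs = {"mission": "go to the red ball"}
        # Assumption: the text description lives under a "descriptions" key.
        info = {"descriptions": ["You see a red ball 1 step left and 1 step forward"]}
        return obs, info


def build_prompt(mission: str, descriptions: list[str]) -> str:
    """Join the mission and the per-object descriptions into one prompt string."""
    lines = [f"Goal: {mission}", "Observation:"]
    lines += [f"- {d}" for d in descriptions]
    return "\n".join(lines)


env = StubTextEnv()
obs, info = env.reset()
prompt = build_prompt(obs["mission"], info["descriptions"])
print(prompt)
```

Keeping the text in `info` leaves the symbolic observation untouched, so existing agents that only read `obs` would be unaffected.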

This addition enables research on grounding language models in interactive environments while maintaining sample efficiency.

Type of change

  • New feature (non-breaking change which adds functionality)
  • This change requires a documentation update

Screenshots

Example observation (image: Screenshot_20241031_190026)
Corresponding description:

[   
    'You see a wall 2 steps right', 
    'You see a blue key 3 steps left and 1 step forward', 
    'You see a red box 3 steps left', 
    'You see a blue box 2 steps left and 1 step forward', 
    'You see a red ball 1 step left and 1 step forward', 
    'You see a grey ball 1 step forward', 
    'You see a blue box 1 step right and 1 step forward'
]
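
For readers curious how sentences in this format could be produced, the sketch below builds one description from an object's offset relative to the agent. It is an illustration only, not the code adapted from the linked repository: the function name `describe`, its parameters, and the sign convention (negative lateral offset means left) are assumptions.

```python
def _steps(n: int, direction: str) -> str:
    """Format a step count, e.g. '1 step left' or '3 steps left'."""
    return f"{n} step{'s' if n != 1 else ''} {direction}"


def describe(name: str, lateral: int, forward: int) -> str:
    """Describe an object at an offset from the agent.

    Assumed convention: lateral < 0 is left, lateral > 0 is right,
    forward >= 0 counts cells ahead of the agent.
    """
    parts = []
    if lateral < 0:
        parts.append(_steps(-lateral, "left"))
    elif lateral > 0:
        parts.append(_steps(lateral, "right"))
    if forward > 0:
        parts.append(_steps(forward, "forward"))
    if not parts:
        # Assumption: the agent's own cell is not described in this sketch.
        raise ValueError("object is on the agent's own cell")
    return f"You see a {name} " + " and ".join(parts)


descriptions = [
    describe("wall", 2, 0),
    describe("blue key", -3, 1),
    describe("grey ball", 0, 1),
]
print(descriptions)
```

The three calls reproduce the first, second, and sixth sentences of the example description above.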

Checklist:

  • [ ] I have run the pre-commit checks with pre-commit run --all-files
  • [ ] I have commented my code, particularly in hard-to-understand areas
  • [ ] I have made corresponding changes to the documentation
  • [ ] My changes generate no new warnings
  • [ ] I have added tests that prove my feature works
  • [ ] New and existing unit tests pass locally with my changes
