Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

AnimalAI Multi Arena Episodes #49

Closed
alhasacademy96 opened this issue Mar 13, 2024 · 2 comments
Closed

AnimalAI Multi Arena Episodes #49

alhasacademy96 opened this issue Mar 13, 2024 · 2 comments
Assignees
Labels
enhancement New feature or request good first issue Good for newcomers stale

Comments

@alhasacademy96
Copy link
Member

AAI’s episodic structure is currently tied to arenas: agents spawn into an arena and interact with it until it completes (e.g. because the goal is reached, timeout, etc.). Then the agent is reset and the next arena is loaded.

Animals do not interact with their environment in an episodic way:
Learning happens continuously
Animals remember events from arbitrarily far in the past and use this knowledge to shape their behaviour
Context switches are not well defined and instead happen gradually or imperceptibly

Tests in Cognitive Science can make use of these features. For example an episodic memory task may involve allowing an agent to learn a route through a maze to a goal, and then presenting it with the same maze but with a path blocked (Sara and Seraphina are working on tests of this type in babies and AI, respectively).

In AAI currently the only way an agent can learn about the structure of the arena and use that knowledge elsewhere is through training on that arena. This is a problem because training can destroy previous capabilities (catastrophic forgetting), and so we couldn’t analyse a “generalist agent’s” ability to learn the structure of new mazes in this way.

A more valid way to explore this would be to have an agent learn about the structure of the maze in context (i.e. during an episode) and then have that episode continue when the maze is switched to a new configuration.

To allow this, users should be able to specify that multiple arenas can be grouped together into one episode.

@alhasacademy96 alhasacademy96 added enhancement New feature or request good first issue Good for newcomers labels Mar 13, 2024
Copy link

This issue is stale because it has been open for 45 days with no activity.

@github-actions github-actions bot added the stale label Apr 27, 2024
@alhasacademy96
Copy link
Member Author

Closing as completed in PR #35.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers stale
Projects
None yet
Development

No branches or pull requests

2 participants