Motivation
Some multi-agent environments, like VmasEnv, stack the tensors for observations, rewards, etc. of the different agents when those agents have identical specs. For instance, in one of these stacked environments, if there are 2 agents that each have 8 observations, the observation spec might look like this:
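A minimal sketch of that stacked layout, assuming a recent torchrl where the spec classes are exposed as `Composite` and `Unbounded` (older releases call them `CompositeSpec` and `UnboundedContinuousTensorSpec`); the `"agents"` group key mirrors what VmasEnv uses, but the exact nesting may differ per environment:

```python
import torch
from torchrl.data import Composite, Unbounded

# Both agents live under a single "agents" group with a leading agent
# dimension of size 2 and 8 observations each.
stacked_obs_spec = Composite(
    agents=Composite(
        observation=Unbounded(shape=torch.Size([2, 8]), dtype=torch.float32),
        shape=torch.Size([2]),
    ),
)
print(stacked_obs_spec)
```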
In contrast, other environments, like UnityMLAgentsEnv, have separate keys for each agent, even if the agents' specs are identical. For instance, with 2 agents that each have 8 observations, the observation spec might look like this:
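A sketch of the per-agent layout, with hypothetical agent key names (`"agent_0"`, `"agent_1"`); UnityMLAgentsEnv derives its keys from the Unity behavior/agent ids, so the real names will differ:

```python
import torch
from torchrl.data import Composite, Unbounded

# Each agent gets its own top-level key, even though the specs are identical.
unstacked_obs_spec = Composite(
    agent_0=Composite(
        observation=Unbounded(shape=torch.Size([8]), dtype=torch.float32),
    ),
    agent_1=Composite(
        observation=Unbounded(shape=torch.Size([8]), dtype=torch.float32),
    ),
)
print(unstacked_obs_spec)
```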
It is not easy to apply the same training script to two environments that use these two different formats. For instance, applying the multi-agent PPO tutorial to a Unity env is not straightforward.
Solution
If we had an environment transform that could stack all the data from different keys, we could convert an environment that uses the unstacked format into an environment that uses the stacked format. Then it should be straightforward to use the same (or almost the same) training script on the two different environments.
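A hedged sketch of what such a transform could look like, built on torchrl's `Transform` base class and tensordict stacking. The key names, the stacking dimension (which assumes an unbatched env), and the class name are placeholders, and spec remapping and reset handling are omitted:

```python
import torch
from tensordict import TensorDictBase
from torchrl.envs.transforms import Transform


class StackAgentGroups(Transform):
    """Stack identical per-agent sub-tensordicts (e.g. "agent_0", "agent_1")
    into one batched entry (e.g. "agents"), emulating the stacked layout."""

    def __init__(self, agent_keys: list, out_key: str = "agents"):
        super().__init__(in_keys=agent_keys, out_keys=[out_key])
        self.agent_keys = agent_keys
        self.out_key = out_key

    def _call(self, tensordict: TensorDictBase) -> TensorDictBase:
        # Pop each agent's sub-tensordict and stack them along a new leading
        # "agent" dimension (valid for an unbatched env; a batched env would
        # need to stack after the batch dims instead).
        stacked = torch.stack(
            [tensordict.pop(key) for key in self.agent_keys], dim=0
        )
        tensordict.set(self.out_key, stacked)
        return tensordict

    # A complete transform would also implement transform_observation_spec,
    # transform_reward_spec, _reset, etc. so that the specs and the reset
    # path match the new stacked layout; omitted here for brevity.
```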
Alternatives
Additional context
Checklist
I have checked that there is no similar issue in the repo (required)
Have you taken a look at the group_map argument? When set to MarlGroupMapType.ALL_IN_ONE_GROUP the environment should return all agents in a single group (when possible, otherwise in more than one group).
If you are using this setting, then imo there's an issue in the implementation of the grouping of agents in UnityMLAgentsEnv.
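For reference, this is roughly how that setting would be requested. `MarlGroupMapType.ALL_IN_ONE_GROUP` is a real torchrl enum value, but the constructor arguments shown here (e.g. `file_name`) are illustrative, and UnityMLAgentsEnv may expose `group_map` differently:

```python
from torchrl.envs import UnityMLAgentsEnv
from torchrl.envs.utils import MarlGroupMapType

env = UnityMLAgentsEnv(
    file_name="path/to/unity_build",  # hypothetical Unity build path
    group_map=MarlGroupMapType.ALL_IN_ONE_GROUP,
)
# With a single group, all agents should appear under one stacked group key.
print(env.observation_spec)
```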