v0.3.1
This release provides a bunch of bug fixes and speedups.
What's Changed
[BugFix] Fix broken gym tests (#1980)
[BugFix,CI] Fix Windows CI (#1983)
[Minor] Cleanup
[CI] Install stable torch and tensordict for release tests (#1978)
[Refactor] Remove remnant legacy functional calls (#1973)
[Minor] Use the main branch for the M1 build wheels (#1965)
[BugFix] Fixed import for importlib (#1914)
[BugFix] Fix offline CatFrames for pixels (#1964)
[BugFix] Fix offline CatFrames (#1953)
[BugFix] Fix batch-size expansion in functionalization (#1959)
[BugFix] Update iql docstring example (#1950)
[BugFix] Update cql docstring example (#1951)
[BugFix] Fix examples (#1945)
[BugFix] Remove reset on last step of a rollout (#1936)
[BugFix] Vmap randomness for value estimator (#1942)
[BugFix] Fix multiple context syntax in multiagent examples (#1943)
[BugFix] Fix habitat (#1941)
[BugFix] Fix env.shape regex matches (#1940)
[Minor] Add env.shape attribute (#1938)
[BugFix] Fix replay buffer extension with lists (#1937)
[BugFix] No grad on collector reset (#1927)
[BugFix] fix trunc normal device (#1931)
[BugFix, Performance] Fewer imports at root (#1930)
[BugFix] Fix OOB TruncatedNormal LP (#1924)
[BugFix] Fix KLPENPPOLoss KL computation (#1922)
[Doc] Fix onw typo (#1917)
[BugFix] Make sure ParallelEnv does not overflow mem when policy requires grad (#1909)
[BugFix] Non exclusive terminated and truncated (#1911)
[BugFix] Use setdefault in _cache_values (#1910)
[BugFix] Fix Ray collector example error (#1908)
[BugFix] Make KL-controllers independent of the model (#1903)
[Minor] Remove warnings in test_cost (#1902)
[BugFix] Adaptable non-blocking for mps and non cuda device in batched-envs (#1900)
[BugFix] Fix flaky rb tests (#1901)
[BugFix] Fix exploration in losses (#1898)
[BugFix] Solve recursion issue in losses hook (#1897)
[Doc] Update getting-started-5.py (#1894)
[Doc] Getting started tutos (#1886)
[BugFix] Use traj_terminated in SliceSampler (#1884)
[Doc] Improve PrioritizedSampler doc and get rid of np dependency as much as possible (#1881)
[BugFix] Fix _reset data passing in parallel env (#1880)
[BugFix] state typo in RNG control module (#1878)
[BugFix] Fix a bug in SliceSampler, indexes outside sampler lengths were produced (#1874)
[BugFix] check_env_specs seeding logic (#1872)
[BugFix] Fix update in serial / parallel env (#1866)
[Doc] Installation instructions in API ref (#1871)
[BugFix] better device consistency in EGreedy (#1867)
[BugFix] Fix load_state_dict and is_empty td bugfix impact (#1869)
[Doc] Fix tutos (#1863)
Full Changelog: v0.3.0...v0.3.1