[rllib] Improve `test_single_agent_env_runner` to prevent flaky tests #58397

pseudo-rnd-thoughts · 2025-11-04T14:22:53Z

Description

While improving `SingleAgentEnvRunner.make_env`, I found that some of the tests could be flaky.
This PR improves the tests, in particular those for `sample`, so that they no longer fail occasionally, and updates the documentation to reflect this.

The primary source of flakiness I found is that `sample(num_timesteps=X)` will not always return exactly X timesteps; rather, it returns at least X timesteps and up to the number of environments more.
I've updated the documentation to clarify this for users.
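
The overshoot follows from vectorized stepping: every step of a vector environment advances all sub-environments at once, so the timestep count grows in increments of the number of environments. Here is a minimal sketch of that arithmetic, using a hypothetical `count_sampled_timesteps` helper that is a toy model and not RLlib's actual sampling loop:

```python
def count_sampled_timesteps(num_timesteps: int, num_envs: int) -> int:
    """Toy model (not RLlib code): step all envs together until the target is met."""
    assert num_timesteps >= 0
    assert num_envs >= 1
    total = 0
    while total < num_timesteps:
        # One vectorized step adds one timestep per sub-environment.
        total += num_envs
    return total


for num_envs in (1, 3, 4):
    for requested in (0, 10, 11):
        total = count_sampled_timesteps(requested, num_envs)
        # At least the requested amount, and fewer than num_envs extra.
        assert requested <= total < requested + num_envs
```

Under this model, a range assertion holds for any environment count, whereas an equality check only holds when the request is a multiple of it.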

In addition, I've added tests for the case where neither the number of timesteps nor the number of episodes is given, and for the `force_reset` argument.

Signed-off-by: Mark Towers <[email protected]>

gemini-code-assist

Code Review

This pull request improves the tests for SingleAgentEnvRunner.sample to prevent flakiness and updates the documentation to reflect the correct behavior. The changes are generally good and address the stated goals. I've identified a couple of areas in the tests where the assertions could be more precise to better validate the functionality. My review includes suggestions to tighten these assertions.

rllib/env/tests/test_single_agent_env_runner.py

Signed-off-by: Mark Towers <[email protected]>

simonsays1980

LGTM. Thanks for the refinement @pseudo-rnd-thoughts !

simonsays1980 · 2025-11-14T19:10:27Z

rllib/env/single_agent_env_runner.py


            # Sample n timesteps.
            if num_timesteps is not None:
+                assert num_timesteps >= 0


simonsays1980 · 2025-11-14T19:12:09Z

rllib/env/tests/test_single_agent_env_runner.py

                num_timesteps=10, num_episodes=10, random_actions=True
            ),
        )
+        # Verify that an error is raised if a negative number is used


simonsays1980 · 2025-11-14T19:13:34Z

rllib/env/tests/test_single_agent_env_runner.py

            env_runner.stop()

+    def test_env_context(self):
+        """Tests, whether SingleAgentEnvRunner can pass kwargs to the environments correctly."""


Great! Thanks for adding this. This was missing but is very important.

…_env` (#58410)

## Description

Allow users to use environments that are already vectorized for `SingleAgentEnvRunner`.

With `gymnasium.make_vec`, users have the option to either use the `SyncVectorEnv` to vectorize a base environment or to directly create a vector environment using the `vectorize_mode: gymnasium.VectorizeMode`. This PR utilises the `env_runners(gym_env_vectorize_mode=...)` argument to support `VectorizeMode.VECTOR_ENTRY_POINT`.

```
import gymnasium as gym

config = ...
config.env_runners(
    gym_env_vectorize_mode=gym.VectorizeMode.VECTOR_ENTRY_POINT,
)
```

An important change related to this PR is that the values accepted for the vectorize mode are now either the enum (`VectorizeMode.ASYNC`, etc.) or the enum values (`"async"`, etc.). Previously, the accepted string was the enum name (`"ASYNC"`) rather than the enum value itself.

## Related issues

Completion of #57643

## Additional information

#58397 must be merged first.

We should apply a similar change to the `MultiAgentEnvRunner.make_env`.

---------

Signed-off-by: Mark Towers <[email protected]>
Co-authored-by: Mark Towers <[email protected]>
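
The difference between an enum name and an enum value, as described in the #58410 message, can be illustrated with a stand-in enum. `Mode` below is a hypothetical mirror of `gymnasium.VectorizeMode`, assumed to use lowercase string values; it is not imported from gymnasium. The `resolve_mode` helper is illustrative only and is not RLlib's parsing code.

```python
from enum import Enum


class Mode(Enum):
    # Hypothetical stand-in; values are assumed to be lowercase strings.
    ASYNC = "async"
    SYNC = "sync"
    VECTOR_ENTRY_POINT = "vector_entry_point"


def resolve_mode(mode):
    """Accept an enum member or its value; the enum name is rejected."""
    if isinstance(mode, Mode):
        return mode
    # Lookup by value: Mode("async") succeeds, Mode("ASYNC") raises ValueError.
    return Mode(mode)
```

In this sketch, `resolve_mode("async")` and `resolve_mode(Mode.ASYNC)` give the same member, while the name string `"ASYNC"` is rejected because `Enum` lookup by call matches values, not names.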

Improve test_single_agent_env_runner to prevent flaky tests

f9e855d

Signed-off-by: Mark Towers <[email protected]>

pseudo-rnd-thoughts requested a review from a team as a code owner November 4, 2025 14:22

pseudo-rnd-thoughts requested a review from simonsays1980 November 4, 2025 14:23

pseudo-rnd-thoughts added rllib RLlib related issues rllib-envrunners Issues around the sampling backend of RLlib labels Nov 4, 2025

gemini-code-assist bot reviewed Nov 4, 2025

Gemini Code Review

a5036c3

Signed-off-by: Mark Towers <[email protected]>

pseudo-rnd-thoughts changed the title ~~Improve test_single_agent_env_runner to prevent flaky tests~~ [rllib] Improve test_single_agent_env_runner to prevent flaky tests Nov 4, 2025

Add more tests

65e1ad5

Signed-off-by: Mark Towers <[email protected]>

pseudo-rnd-thoughts mentioned this pull request Nov 5, 2025

[rllib] Add support for vectorize modes in SingleAgentEnvRunner.make_env #58410

Merged

simonsays1980 approved these changes Nov 14, 2025

simonsays1980 added the go add ONLY when ready to merge, run all tests label Nov 14, 2025

Merge branch 'master' into improve-test-single-agent-env-runner-testing

312f036

simonsays1980 merged commit cf9f783 into ray-project:master Nov 17, 2025
6 checks passed
