How to use

The environments in the Stable Gym package can be imported like any other gymnasium environments. You can then use the gym.vector.make() function to create an instance of the environment. Here’s a bare minimum example of using one of the Stable Gym environment. This will run an instance of the Oscillator-v1 environment for 1000 timesteps. You should see the observations being printed to the console. More examples can be found in the Stable Gym examples folder.

"""A simple example on how to use the Stable Gym gymnasium environments."""

import gymnasium as gym

# ENV_NAME = "stable_gym:Oscillator-v1"
# ENV_NAME = "stable_gym:CartPoleCost-v1"
# ENV_NAME = "stable_gym:SwimmerCost-v1"
# ENV_NAME = "stable_gym:FetchReachCost-v1"
# ENV_NAME = "stable_gym:MinitaurBulletCost-v1"
ENV_NAME = "stable_gym:QuadXHoverCost-v1"

if __name__ == "__main__":
    env = gym.make(ENV_NAME, render_mode="human")

    # Define a policy function.
    # NOTE: Can be any function that takes an observation and returns an action.
    def policy(*args, **kwargs):
        """A simple policy that samples random actions."""
        return env.action_space.sample()

    # Run training loop.
    observation, info = env.reset(seed=42)
    for _ in range(1000):
        action = policy(observation)  # User-defined policy function
        observation, reward, terminated, truncated, info = env.step(action)

        if terminated or truncated:
            print("Environment terminated or truncated. Resetting.")
            observation, info = env.reset()
    env.close()

Important

Some of the environments in this package do not have a render method.