Environment Base

class jaxatari.environment.JAXAtariAction

Bases: object

“Namespace” for Atari action integer constants. These are directly usable in JAX arrays.

DOWN: int = 5

DOWNFIRE: int = 13

DOWNLEFT: int = 9

DOWNLEFTFIRE: int = 17

DOWNRIGHT: int = 8

DOWNRIGHTFIRE: int = 16

FIRE: int = 1

LEFT: int = 4

LEFTFIRE: int = 12

NOOP: int = 0

RIGHT: int = 3

RIGHTFIRE: int = 11

UP: int = 2

UPFIRE: int = 10

UPLEFT: int = 7

UPLEFTFIRE: int = 15

UPRIGHT: int = 6

UPRIGHTFIRE: int = 14

classmethod get_all_values() → Array
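As a rough illustration of what "directly usable in JAX arrays" can look like in practice, the sketch below builds an action array and derives masks from it. So that the snippet runs without `jaxatari` installed, it uses a hypothetical local stand-in class `Action` that copies a few of the integer values listed above; with the real package you would presumably reference `JAXAtariAction` instead.

```python
import jax.numpy as jnp


class Action:
    """Hypothetical stand-in that copies a few of the documented constants."""
    NOOP = 0
    FIRE = 1
    RIGHT = 3
    LEFT = 4


# The constants are plain Python ints, so they can be placed in a JAX array.
actions = jnp.array([Action.NOOP, Action.FIRE, Action.LEFT, Action.RIGHT, Action.FIRE])

# Comparing the array against a constant yields a boolean mask.
fire_mask = actions == Action.FIRE
num_fire = int(fire_mask.sum())

# Branch-free mapping from action to horizontal movement (an illustrative rule,
# not taken from any particular game): LEFT -> -1, RIGHT -> +1, otherwise 0.
dx = jnp.where(actions == Action.LEFT, -1, jnp.where(actions == Action.RIGHT, 1, 0))
```

Using `jnp.where` rather than Python `if` keeps the mapping traceable under `jax.jit`.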

class jaxatari.environment.JaxEnvironment(consts: EnvConstants | None = None)

Bases: Generic[EnvState, EnvObs, EnvInfo, EnvConstants]

Abstract class for a JAX environment.

Generics:

EnvState: The type of the environment state.

EnvObs: The type of the observation.

EnvInfo: The type of the additional information.

EnvConstants: The type of the environment constants.

action_space() → Space

Returns the action space of the environment as an array containing the actions that can be taken.

Returns: The action space of the environment as an array.

image_space() → Space

Returns the image space of the environment.

Returns: The image space of the environment.

obs_to_flat_array(obs: EnvObs) → Array

Converts the observation to a flat array.

Parameters:

obs: The observation.
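One plausible way to flatten a structured observation is to ravel each field and concatenate the pieces. The sketch below is only an illustration of that idea: `ToyObs` and its fields are hypothetical, and the free function here is not the library's implementation, which each concrete environment defines for its own observation type.

```python
from typing import NamedTuple

import jax.numpy as jnp


class ToyObs(NamedTuple):
    """Hypothetical observation with fields of different shapes."""
    player_xy: jnp.ndarray  # shape (2,)
    ball_xy: jnp.ndarray    # shape (2,)
    score: jnp.ndarray      # scalar


def obs_to_flat_array(obs: ToyObs) -> jnp.ndarray:
    # Ravel every field to 1-D (scalars become length-1) and join them in field order.
    return jnp.concatenate([jnp.ravel(field) for field in obs])


obs = ToyObs(
    player_xy=jnp.array([10, 20]),
    ball_xy=jnp.array([30, 40]),
    score=jnp.array(7),
)
flat = obs_to_flat_array(obs)  # 1-D array with five entries
```

Keeping the field order fixed matters, because downstream code typically indexes the flat vector by position.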

observation_space() → Space

Returns the observation space of the environment.

Returns: The observation space of the environment.

render(state: EnvState) → Tuple[Array]

Renders the environment state to a single image.

Parameters:

state: The environment state.

Returns: A single image of the environment state.

reset(key: PRNGKey | None = None) → Tuple[EnvObs, EnvState]

Resets the environment to the initial state.

Returns: The initial observation and the initial environment state.

step(state: EnvState, action) → Tuple[EnvObs, EnvState, float, bool, EnvInfo]

Takes a step in the environment.

Parameters:

state: The current environment state.

action: The action to take.

Returns: The observation, the new environment state, the reward, whether the state is terminal, and additional info.
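Because `reset` and `step` pass the state explicitly, a rollout can be written as a pure function and driven by `jax.lax.scan`. The sketch below assumes only the documented call shapes: `reset(key)` returns `(obs, state)`, and `step(state, action)` returns `(obs, state, reward, done, info)`. `ToyEnv` and `ToyState` are hypothetical stand-ins, not part of `jaxatari`, and the toy dynamics (move along a line, end after five steps) are invented for illustration.

```python
from typing import NamedTuple

import jax
import jax.numpy as jnp


class ToyState(NamedTuple):
    """Hypothetical state: a position on a line and a step counter."""
    x: jnp.ndarray
    t: jnp.ndarray


class ToyEnv:
    """Hypothetical environment that mimics the documented reset/step shapes."""

    def reset(self, key=None):
        # The key is accepted for interface parity; this toy reset is deterministic.
        state = ToyState(x=jnp.array(0, dtype=jnp.int32), t=jnp.array(0, dtype=jnp.int32))
        return state.x, state

    def step(self, state, action):
        # RIGHT (3) moves +1, LEFT (4) moves -1, any other action stays put.
        dx = jnp.where(action == 3, 1, jnp.where(action == 4, -1, 0))
        new_state = ToyState(x=state.x + dx, t=state.t + 1)
        reward = dx.astype(jnp.float32)
        done = new_state.t >= 5
        info = {"t": new_state.t}
        return new_state.x, new_state, reward, done, info


env = ToyEnv()
obs, state = env.reset(jax.random.PRNGKey(0))


def rollout_step(state, action):
    # scan expects (carry, per-step output); the state is the carry.
    obs, state, reward, done, info = env.step(state, action)
    return state, (obs, reward, done)


actions = jnp.array([3, 3, 4, 0, 3], dtype=jnp.int32)
final_state, (obs_seq, rewards, dones) = jax.lax.scan(rollout_step, state, actions)
```

Threading the state through `scan` as the carry avoids a Python loop, so the whole rollout can be compiled as one unit.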