PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
-
Updated
Nov 15, 2021 - Python
PyTorch implementation of some reinforcement learning algorithms: A2C, PPO, Behavioral Cloning from Observation (BCO), GAIL.
An end-to-end (E2E) reinforcement learning model for autonomous vehicle collision avoidance in the CARLA simulator, using a recurrent PPO algorithm for dynamic control. The model processes RGB camera inputs to make real-time acceleration and steering decisions.
AI agents for the boardgame Splendor
Repository for "Dynamic Multi-Pair Trading Strategy in Cryptocurrency Markets with Deep Reinforcement Learning" (Lebiedź and Ślepaczuk, 2026).
Solving the Atari Breakout environment using Stable Baselines
Reinforcement learning toolkit for training agents to play Pokémon Red. Built on PyBoy, Stable-Baselines3, and Gymnasium. Includes pixel/symbolic/hybrid observation treatments and rliable-based statistical analysis.
Unofficial Stable-Baselines3 extension package with MaskableRecurrentPPO.
Add a description, image, and links to the recurrent-ppo topic page so that developers can more easily learn about it.
To associate your repository with the recurrent-ppo topic, visit your repo's landing page and select "manage topics."