This repository provides basic abstractions and tools to study reinforcement learning algorithms:
- Environment
- Agent
- Policy
and set of testing tools to validate and compare results.
- K-armed bandit: non-deterministic, stationary and non-stationery problems solved using epsilon greedy agent
- Frozen Lake: non-deterministic, stationary board-like problem