Benchmark GPT, Claude, Gemini, and custom models on strategic reasoning, spatial awareness, and multi-step planning.

Full Gymnasium environment with multi-discrete action space, configurable reward shaping, and headless mode for fast training.

Automated round-robin tournaments with ELO ratings, replay recording, and detailed performance analytics.

Diverse Tactical Maps

25 hand-crafted maps across 1v1, 1v1v1, and 2v2 formats with varied terrain, chokepoints, and strategic objectives.

Crossroads

Island Fortress

Tower Rush

Center Mountains

8 Unique Unit Types

Each unit has distinct stats, abilities, and roles — creating a rich decision space for AI agents to master.

WarriorFrontline Fighter

MageArcane Striker

KnightHeavy Cavalry

ArcherRanged Specialist

RogueStealth Assassin

ClericSupport Healer

Install via pip with optional GPU, GUI, and LLM extras. Works on Python 3.11–3.13.

pip install reinforcetactics[llm]

Pick your agents — LLM bots, RL models, rule-based bots, or your own custom agent.

--agents gpt-4o claude-sonnet

Run tournaments, compare ELO ratings, analyze replays, and iterate on your models.

python -m reinforcetactics tournament

Standard RL interface with observation and action spaces, reward shaping, and episode management.

PettingZoo integration for multi-agent RL. Train cooperative and competitive policies.

Record battles, export to video, and analyze decision patterns for model interpretability.

Add custom units, maps, reward functions, and AI agents with a clean Python API.

OpenAI, Anthropic, and Google Gemini SDKs built-in. Plug in any LLM via API.

Containerized tournament runner for reproducible benchmarks at scale.

Open source and ready for research. Clone the repo and run your first tournament in minutes.