## Main scripts
- `algorithm/marl_ppo.py` for training multi-agent PPO on the target MPE environment. Note: run this script as a python module with `python -m algorithm.marl_ppo` so that imports work properly.
- `envs/target_mpe_env.py` contains the main class that defines the target MPE environment. Also look at `envs/wrapper.py` for env wrappers.
- `config/mappo_config.py` is the one and only file to change config values for running experiments. Python classes are used instead of a YAML file to get autocomplete, type checking, and easier refactoring when accessing or changing the structure of the config (see the sketch after this list).
- `visualize_actor.py` for visualizing the trained actor in a local environment.
- `model/actor_critic_rnn.py` has all the flax linen networks used in the PPO.
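As a rough illustration of the config-as-Python-classes idea, here is a minimal sketch. The field names and hyperparameters below are hypothetical, not the exact contents of `config/mappo_config.py`; only `WandbConfig` and its `mode` field are named in this README.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class WandbConfig:
    mode: str = "disabled"        # set to "online" to enable wandb logging
    project: str = "my-project"   # hypothetical field
    entity: Optional[str] = None  # hypothetical field


@dataclass
class MAPPOConfig:
    num_envs: int = 16            # hypothetical hyperparameter
    lr: float = 3e-4              # hypothetical hyperparameter
    wandb: WandbConfig = field(default_factory=WandbConfig)


config = MAPPOConfig()
# Unlike a raw YAML dict, attribute access gets IDE autocomplete and type checking:
config.wandb.mode = "online"
```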
## Typical train and test flow

- Run the `train_with_gpu.ipynb` notebook in a Colab with a GPU.
- Remember to set up the config in `WandbConfig` in `config/mappo_config.py` and change `mode` to `online` to get wandb logging. The artifacts are saved under the name "PPO_RNN_Runner_State".
- Visualize the actor with `visualize_actor.py` after changing the `artifact_version` variable in the `if __name__ == "__main__"` block (see the snippet after this list).
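For reference, this is roughly how a wandb artifact saved under the name "PPO_RNN_Runner_State" is fetched by version. The entity/project placeholders are assumptions, and this is only an illustration of what `artifact_version` selects, not the actual code in `visualize_actor.py`.

```python
import wandb

artifact_version = "v0"  # pick the version of the training run you want to visualize

api = wandb.Api()
artifact = api.artifact(
    f"<your-entity>/<your-project>/PPO_RNN_Runner_State:{artifact_version}"
)
artifact_dir = artifact.download()  # local directory containing the saved runner state
```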
It is recommended to first install either `requirements_jax_cpu.txt` or `requirements_jax_cuda.txt` before `requirements.txt`, since the packages in `requirements.txt` will otherwise install a JAX version for you.
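In other words, assuming a standard pip workflow, the intended order is roughly:

```bash
# Install a JAX build matching your hardware first, then the rest of the dependencies.
pip install -r requirements_jax_cuda.txt   # or: pip install -r requirements_jax_cpu.txt
pip install -r requirements.txt
```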
## Citation

If you use JaxInforMARL in your work, please cite as follows:
```bibtex
@software{JaxInforMARL,
  title = {JaxInforMARL: Multi-Agent Target MPE RL Environments with GNNs in JAX},
  author = {Joseph Selvaraaj},
  year = {2025},
  url = {https://github.com/jselvaraaj/JaxInforMARL},
  version = {1.0.0}
}
```