Blackjack Reinforcement Learning

Overview

This project implements a Blackjack environment using Gymnasium and a PPO agent using Stable Baselines 3. You can customize the number of decks in the game by changing the --deck_size argument. Motivation: if we include seencards history in the state, can we imporve the odds.

Base Agent

State:

Probabilities of each card in the deck.
Entropy of the deck.
Player's hand value.
Dealer's visible card.

Actions:

No Bet
Bet
Hit
Stand
Double

Rewards:

Pealize for invalid actions.
Penalize propotional to entory. To encourage the agent to bet more as we get more information (from seen cards)

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitignore		.gitignore
README.md		README.md
agent.py		agent.py
blackjack.py		blackjack.py
cards.py		cards.py
eval.py		eval.py
state.py		state.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Blackjack Reinforcement Learning

Overview

Base Agent

About

Releases

Packages

Languages

suijth/Blackjack_RL

Folders and files

Latest commit

History

Repository files navigation

Blackjack Reinforcement Learning

Overview

Base Agent

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages