Skip to content

An intelligent game-playing agent for the board game Abalone, using alpha-beta pruning, transposition tables, and heuristics, developed as part of COMP 3981 project.

Notifications You must be signed in to change notification settings

immangat/all-alone-ai

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

COMP 3981 Project

Part 2

Team Members

  • Mangat Toor
  • Nicolas Rodriguez
  • Tomasz Stojek
  • Vitor Guara

Contents

I     Game Board Representation

II    State Space Generation

III   Moving Notation

IV  Team Member Contribution

V    References

V    Additional Documents

V    AI player

I     Game Board Representation

Belgian Daisy

a. State Representation

The state is represented by a dictionary where the key is a tuple of two integers: (int letter, int number). The first int in the tuple is used to represent the letter, with A will be treated as 1, and B as 2 etc. The values in each dictionary entry is a string (String color_of_marble) representing a black or white marble. (“w” or “b”) For the above board :
board = dictionary Visual Representation State Representation (A,1) board(1,1).value = “w” (C,2) board(3,2).value = None (G,5) board(7,5).value = “b”

state = {
        "(1,1)": "white",
        "(3,2)": none,
        "(7,5)": "black",
        ...
}

b. Initial State

The initial state can be one of three states:

  1. Standard
    Standard
  2. German Daisy
    German Daisy
  3. Belgian Daisy
    Belgian Daisy

c. Actions

Types of Movements: • Inline Movements:

  1. Single Marble Movement
  2. Movement of 2 or 3 Marbles Together • Sidestep Movements:
  3. Of 2 or 3 Marbles Together

Sumitos: An action where a group of two or three marbles of one color pushes a group of opposing marbles, only occurring in inline moves.

Limits

An inline move of a single marble cannot execute a sumito and must move into an empty spot on the board.

There are three types of sumitos that are allowed: 3 vs 1 3 vs 2 2 vs 1 Sidestep Movements: Are limited to groups of 2 to 3 marbles that are connected along the same axis.(all touching one another in a straight line on the board) Each marble in a sidestep movement must have an empty spot in the direction it will shift into.

If an inline move goes into an opposing player marble then a sumito must occur, sumito must be legal General Prohibition: Players cannot move any of their own marbles off the board.

d. Transition Model

Description Actions Resulting State
move to direction 1 [marbles],1 For each coordinate(x,y) in marbles: (x + 1,y + 1)
move to direction 3 [marbles],3 For each coordinate(x,y) in marbles: (x +0,y + 1)
move to direction 5 [marbles],5 For each coordinate(x,y) in marbles: (x -1,y + 0)
move to direction 7 [marbles],7 For each coordinate(x,y) in marbles: (x - 1,y - 1)
move to direction 9 [marbles],8 For each coordinate(x,y) in marbles: (x + 0,y - 1)
move to direction 11 [marbles],9 For each coordinate(x,y) in marbles: (x + 1,y + 0)
example Sumito to 11 [marbles],11 For each coordinate(x,y) in marbles: (x + 1,y + 0) AND For each opponent coordinate(x,y) inline: (x + 1,y + 0) If opponent coordinate is off-board: remove it

Move Examples

Move to direction 3 : Marbles list [(1,1),(1,2)] -> For each coordinate(x,y) in marbles: (x +0,y + 1) End state = Marbles list [(1,2),(1,3)]

Sumito to 11: Marbles list [(1,1), (2,1),(3,1)] -> For each coordinate(x,y) in marbles: (x + 1,y + 0) End state = Marbles list [(2,1),(3,1),(4,1)] Opponent's Marbles [(4,1),(5,1)] -> For each coordinate(x,y) in marbles: (x + 1,y + 0) End state = Opponent’s Marbles [(5,1)] -> (6,1) is off-board so it is removed.

The goal test consists of checking if any player has gotten six of the opposite marbles out of the board.

II    State Space Generation

Understanding and determining legal moves in abalone requires analyzing the current board layout and applying the game's rules accurately. To identify all legal moves using a structured approach based on two primary classes: State and State_Space_Generator. To follow the process:

Marble Locations: Begin with identifying the positions of your marbles on the board, stored in an dictionary named our_marbles.

Key Classes

State:

Purpose: Captures a potential state space / board layout resulting from a move. Parameters: Requires details of the move being considered and the current board setup. Function: Holds the resultant board state after the move.

State_Space_Generator:

Role: Central to identifying all legal moves and storing them for evaluation. Important Arrays: states: Stores State objects representing valid moves found. current_marbles: Contains coordinates of the player's marbles as tuples. current_board = current state of the board

Core Methods for Move Evaluation

Single Marble Check:

Iterates through the player's marbles. For each marble, it explores all possible directions for an empty spot. Valid Moves: Inline movements into empty spaces that don't result in sumito are considered legal. Outcome: If a valid move is identified, a new State object (comprising the move and the current board) is added to the states array. Grouping Formation: Encountering a friendly marble forms a new group, passed to multi_marble_check for further analysis.

Multi Marble Check:

Assesses legality of movements for marble groupings. Validates inline movements through a helper function, considering: Empty Space: Direct movement into an empty space is legal, and added to states as State object. Opponent Marbles: Applies sumito check to determine legality. If legal added to states. Friendly Marbles: For groups smaller than three, recursively applies multi_marble_check. Sumito Checks: Evaluates the feasibility of pushing opponent marbles based on the number and arrangement of friendly and opponent marbles.

Sumito Evaluation

Mechanism: For inline movements, it counts the number of friendly marbles from the back and checks if they can push fewer opponent marbles without affecting any friendly ones.

III   Moving Notation

There are two types of moves: inline (i) and side-step (s)

Inline Move Notation:

i – An – D

i: in-line An: coordinate of trailing marble D : direction of move

Side Step Move Notation:

s – An – Bm - D

s: side-step An, Bm: coordinates of extremities D : direction of move To avoid ambiguity,

  • A <= B
  • If A = B then n <= m
  • single marble moves are always represented as inline

• D: Direction of movement (1, 3, 5, 7, 9, 11). o 1: Up Right o 3: Right o 5: Down Right o 7: Down Left o 9: Left o 11: Up Left

Example Notation with Pictures

  • Single Black Marble Move Right: [[F6] ,3] img_6.png
  • Double White Marble (Straight Line) Move Up Left: [[F5,F6], 11] img_7.png
  • Double White Marble (Diagonal) Move Up Right: [[F6,E6], 1] img_8.png
  • Triple Black Marble Move Down Right: [[E6,E7,E8],5] img_9.png

IV  Team Member Contribution

  1. Mangat Toor

    • Worked on timer logic for the game.
    • Worked on move notation and problem formulation.
    • Worked on move logic.
    • Worked on pause functionality.
    • Worked on documentation.
    • Worked on State_Space_Generation
  2. Nicolas Rodriguez

    • Worked on moving single marbles.
    • Worked on axis checking logic when moving marbles.
    • Worked on creating the initial GUI.
    • Worked on move history.
    • Worked on selection logic.
    • Worked on State_Space_Generation
  3. Tomasz Stojek

    • Worked on logic for moving multiple marbles.
    • Worked on logic for selecting multiple marbles.
    • Worked on move history.
    • Worked on State_Space_Generation
  4. Vitor Guara

    • Worked on the skeleton of the classes mapped to the GUI.
    • Worked on initial board positions.
    • Worked on undo and start buttons.
    • Worked on saving the states for the undo button.
    • Worked on displaying the score.
    • Worked on State_Space_Generation

V    References

Huang, C.-E. (n.d.). MoveNotation [PDF file]. BCIT. Retrieved from https://learn.bcit.ca/d2l/le/content/1003821/viewContent/9748751/View

V    Additional Documents

Step 1: Install PyInstaller First, you need to install PyInstaller. You can do this using pip, Python's package installer. Open your command line (Terminal on macOS/Linux, Command Prompt or PowerShell on Windows) and run the following command:

pip install pyinstaller

Navigate to the folder the has game_control.py And run the following command:

pyinstaller --name="name_of_exe" --onefile -w --add-data "gui_json/theme.json:gui_json/" game_control.py

this will create a .exe in your dist folder of part 2

move it to folder where the test.input files are which in this example are in part_2

Find the executable in part_ 2 and run it

This will bring up the game menu where you can select the file you want and then select the Search States button, after a few seconds you will find your files in the part_2 folder

Search Strategy and Performance

Design and Architecture of the Game-Playing Agent

  • Diagram of a Company: A visual representation of the architecture and design of the game-playing agent.

Heuristic Evaluation Function

Description of Heuristic

The heuristic function for a board game evaluates the current state of the game board, assigning a numerical value to represent the position's favorability for the player. This is specifically tailored for a game of Abalone, evaluating for either black or white players, or both independently.

Weights

Weights are assigned to each part of the evaluation function. These can be adjusted for fine-tuning the agent's performance. Black is considered a maximizing player, and white a minimizing player, with weights for the same part presented in modulus for simplicity.

Number of Marbles Off the Grid

Evaluates the number of marbles remaining for each player. Fewer marbles on board benefit the opposing player, with a weight value of 25 assigned for marbles off-grid.

Positional Score

Scores are based on marble positions on the grid, with central positions being more favorable. This includes a system of concentric circles scored from inside out. The weight value for this part is 4.

Coherence Score

This score reflects how clustered a player's marbles are, indicating board control. It uses DFS to identify groups, adding a bonus for each adjacent marble of the same color. The weight for coherence is 1.

Overall Score

The final score is the sum of the above factors for both players, providing the board's evaluation.

Performance Evaluation

Enhancements aim to improve the search algorithm's efficiency and effectiveness, including alpha-beta pruning, transposition tables, and move ordering.

Alpha-Beta Pruning

Optimizes the search tree by eliminating irrelevant branches, reducing the search space.

Transposition Table

Stores previously evaluated board positions to prevent redundant evaluations. It's implemented as a dictionary, with positions hashed as integers.

Inner Transposition Table

Similar to the main table but stores evaluations at the deepest level, saving time during evaluation processes.

Move Ordering

Prioritizes certain moves to improve alpha-beta pruning effectiveness, using the transposition table to order moves by score.

Justification and Evaluation

Weights and strategies are chosen based on their effectiveness in testing.

Metrics

  • Win Rate Against Baselines: Performance against a basic Random AI.
  • Execution Time at Depth 5: Speed of decision-making process.
  • Board Control: Maintaining control over key areas of the board.
  • Number of Pushes and Sumitos Performed: Measures offensive maneuvers.
  • Tendency to Keep Clustered Together: Indicates defensive capabilities.
  • Maximum Depth Reached Within 10 Seconds: Depth of search within a time limit.
  • Ability to React to Knockout Threats from Opponent: Defensive adaptability.
  • Ability to Finish Games Within 60 Moves: Efficiency in concluding games.

Performance Metrics for GuaraAI

  • Win Rate Against Baselines: 100%
  • Execution Time at Depth 5: 3 seconds
  • Board Control: Prioritizes control over aggression.
  • Number of Pushes and Sumitos Performed: Acts on high enough scores.
  • Tendency to Keep Clustered Together: Maintains strong formation.
  • Maximum Depth Reached Within 10 Seconds: Depth of 6.
  • Ability to React and Defend: Yes.
  • Efficiency in Concluding Games: Yes.

Performance Results vs. Other Heuristics

  • GuaraAI vs NicoAI: 80% win rate.
  • GuaraAI vs TomekAI: 90% win rate.
  • GuaraAI vs MangatAI: 90% win rate.

About

An intelligent game-playing agent for the board game Abalone, using alpha-beta pruning, transposition tables, and heuristics, developed as part of COMP 3981 project.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •  

Languages