General Game Playing as a Bandit-Arms Problem: A Multiagent Monte-Carlo Solution Exploiting Nash Equilibria
Location
King Building 239
Document Type
Presentation
Start Date
4-27-2019 4:00 PM
End Date
4-27-2019 5:20 PM
Abstract
One of the main drawbacks of game-playing programs in the field of A.I. is that the success of highly renowned programs such as AlphaGo and AlphaStar has come at the expense of generality. These systems succeed by exploiting hand-crafted heuristics or training on prior professional gameplay data, which, while impressive, requires significant human intuition about the game. This level of human intervention raises the question: is the program the originator of the unique strategies that arise, or is it simply a reflection of what humans are capable of? General game playing aims to address this lack of generality and remove the need for excessive human intervention. This project approaches general game playing by combining popular methods of stochastic tree search with a multiagent system and a new algorithm that I call the “Wise Explorer” algorithm. The goal of the system is to explore the worst branches of the game first to rule them out, and then to search the most promising branches in depth. The system continually consults the data it collects during this extensive search, and it outputs a strategic move for any given state of a game. In essence, if you’re ever in a bind during a game of tic-tac-toe, the system will tell you exactly what your best move is.
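The abstract describes the approach only at a high level, so the Python sketch below is an illustration of the general scheme rather than the project's actual code: each legal move is treated as a bandit arm scored by random Monte-Carlo playouts, a cheap screening pass rules out the worst-scoring arms first (in the spirit of “Wise Explorer”), and the remaining playout budget is spent on the promising survivors. The tic-tac-toe board encoding and the screen, deep, and keep parameters are illustrative assumptions.

import random

# Board cells are "X", "O", or "." (empty); indices 0-8, row-major.
LINES = [(0, 1, 2), (3, 4, 5), (6, 7, 8),   # rows
         (0, 3, 6), (1, 4, 7), (2, 5, 8),   # columns
         (0, 4, 8), (2, 4, 6)]              # diagonals

def winner(board):
    """Return "X" or "O" if someone has three in a row, else None."""
    for a, b, c in LINES:
        if board[a] != "." and board[a] == board[b] == board[c]:
            return board[a]
    return None

def legal_moves(board):
    return [i for i, cell in enumerate(board) if cell == "."]

def other(player):
    return "O" if player == "X" else "X"

def playout(board, to_move, perspective):
    """Play uniformly random moves to the end; +1/-1/0 from `perspective`."""
    board = list(board)
    while True:
        w = winner(board)
        if w is not None:
            return 1 if w == perspective else -1
        moves = legal_moves(board)
        if not moves:
            return 0  # draw
        board[random.choice(moves)] = to_move
        to_move = other(to_move)

def score_move(board, move, player, n):
    """Mean result of n random playouts after `player` takes `move`."""
    child = list(board)
    child[move] = player
    return sum(playout(child, other(player), player) for _ in range(n)) / n

def best_move(board, player, screen=50, deep=500, keep=0.5):
    """Screen every arm cheaply, prune the worst, then search survivors."""
    arms = legal_moves(board)
    screened = {m: score_move(board, m, player, screen) for m in arms}
    survivors = sorted(arms, key=screened.get, reverse=True)
    survivors = survivors[:max(1, int(len(arms) * keep))]
    deep_scores = {m: score_move(board, m, player, deep) for m in survivors}
    return max(deep_scores, key=deep_scores.get)

if __name__ == "__main__":
    # X to move in a mid-game position; the playouts suggest a reply.
    board = ["X", "O", ".",
             ".", "X", ".",
             ".", ".", "O"]
    print("Suggested move for X:", best_move(board, "X"))

Because the playouts are stochastic, the suggested move can vary from run to run; spending the deep budget only on the surviving arms is what makes the screen-then-commit structure cheaper than searching every branch equally.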
Keywords
Artificial Intelligence, Machine Learning, Computer Learning
Recommended Citation
Banda, Matt, "General Game Playing as a Bandit-Arms Problem: A Multiagent Monte-Carlo Solution Exploiting Nash Equilibria" (04/27/19). Senior Symposium. 1.
https://digitalcommons.oberlin.edu/seniorsymp/2019/panel_17/1
Major
Computer Science
Advisor(s)
Roberto Hoyle, Computer Science
Project Mentor(s)
Bob Geitz, Computer Science
April 2019
Notes
Session VI, Panel 17 - Computer | Simulation
Moderator: Jason Stalnaker, Associate Professor of Physics