Artificial Intelligence  
Lecture 08 Machine Learning  
Edirlei Soares de Lima  
<edirlei.lima@universidadeeuropeia.pt>  
Game AI Model  
Pathfinding  
Steering behaviours  
Finite state machines  
Automated planning  
Behaviour trees  
Randomness  
Sensor systems  
Machine learning  
Learning in Games  
Learning is a hot topic in games.  
In principle, learning AI has the potential to adapt to each  
player, learning their tricks and techniques and providing  
consistent challenges.  
Produce more believable characters.  
Reduce the effort needed to create game-specific AI.  
In practice, it hasn’t yet fulfilled its promises.  
Applying learning to a game requires careful planning and an  
understanding of its pitfalls.  
Online vs. Offline Learning  
Online Learning: learning is performed during the game, while  
the player is playing.  
Allows characters to adapt dynamically to the player’s style.  
Predictability and testing problems: if the game is constantly changing,  
it can be difficult to replicate bugs and problems.  
Offline Learning: learning occurs during the development of  
the game.  
Performed by processing data about real games and trying to calculate  
strategies or parameters.  
Unpredictable learning algorithms can be tried out, and their results can be tested exhaustively.
Behavior Learning  
Intra-Behavior Learning: change only a small area of a  
character’s behavior.  
Examples: learn to aim projectiles correctly, learn the best patrol
routes, learn the best cover points, etc.  
Easy to control and test.  
Inter-Behavior Learning: learn new behaviors.  
Examples: learn that the best way to kill an enemy is to lay an ambush,  
learn to tie a rope across a backstreet to stop an escaping motorbike.  
This kind of AI is almost pure fantasy.  
Warning About Learning in Games  
In reality, learning is not as widely used in games as you might  
think.  
Main problems: complexity, reproducibility, and quality control.  
Be careful with hyped-up papers about learning and games.  
Always constrain the kinds of things that can be learned in your game.  
Learning algorithms are attractive because you can do less  
implementation work.  
But on the other hand, you need to do a different kind of work: collecting and presenting data to the algorithm and making sure the results are valid.
What is Machine Learning?  
Machine Learning Tasks  
Supervised Learning: learning a function that maps an input  
to an output based on example input-output pairs.  
Unsupervised Learning: learning a function to describe hidden  
structure from "unlabeled" data.  
Reinforcement Learning: simulates how agents take actions in  
an environment so as to maximize “rewards”.  
Learning Phases  
Train:  
Training examples are presented to the system;  
The system learns from the examples;  
The system gradually adjusts its parameters to produce the desired  
output.  
Test:  
Unseen examples are presented to the system;  
The system tries to recognize the unseen examples using the  
knowledge obtained during the training phase.  
Use:  
After being tested and validated, the system is used for its intended  
purpose.  
Training Examples (Supervised Learning)

Example | Attrib1 | Attrib2 | Attrib3 | Attrib4   | Attrib5  | Attrib6  | Class
X1      | 0.24829 | 0.49713 | 0.00692 | -0.020360 | 0.429731 | -0.2935  | 1
X2      | 0.24816 | 0.49729 | 0.00672 | 0.0065762 | 0.431444 | -0.29384 | 1
X3      | 0.24884 | 0.49924 | 0.01047 | -0.002901 | 0.423145 | -0.28956 | 3
X4      | 0.24802 | 0.50013 | 0.01172 | 0.001992  | 0.422416 | -0.29092 | 2
X5      | 0.24775 | 0.49343 | 0.01729 | -0.014341 | 0.420937 | -0.29244 | 2
Classification of Unseen Examples

Example | Attrib1 | Attrib2 | Attrib3 | Attrib4   | Attrib5  | Attrib6  | Class
X1      | 0.22829 | 0.48713 | 0.00592 | -0.010360 | 0.419731 | -0.2845  | ?
X2      | 0.21816 | 0.48729 | 0.00572 | 0.0045762 | 0.421444 | -0.28484 | ?
X3      | 0.23884 | 0.49824 | 0.01447 | -0.003901 | 0.433145 | -0.24956 | ?
X4      | 0.23002 | 0.49013 | 0.02172 | 0.002992  | 0.412416 | -0.28092 | ?
X5      | 0.24575 | 0.49243 | 0.01029 | -0.015341 | 0.430937 | -0.28244 | ?
Training Examples  
Suppose we are writing a racing game and we want an AI  
character to learn a player’s style of going around corners.  
We want to learn when the best moment to slow down (brake) is.
Output (classes): brake, don't brake;
Important game information (attributes): speed and distance to the corner.
To get training data, we can record some gameplay sessions.  
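For illustration, a recording script along the following lines could log one sample per physics step; the corner reference, the brake key, and the output path are assumptions for the sketch, not details from the lecture.

using System.IO;
using UnityEngine;

// Minimal sketch of a gameplay recorder: logs "distance, speed, brake"
// while the player drives. Names and details are illustrative.
public class BrakeDataRecorder : MonoBehaviour
{
    public Transform nextCorner;   // assumed reference to the upcoming corner
    public Rigidbody carBody;      // assumed rigidbody of the player's car
    private StreamWriter writer;

    void Start()
    {
        writer = new StreamWriter("Assets/brake_data.csv");
        writer.WriteLine("distance,speed,brake");
    }

    void FixedUpdate()
    {
        float distance = Vector3.Distance(transform.position, nextCorner.position);
        float speed = carBody.velocity.magnitude;
        bool braking = Input.GetKey(KeyCode.S);   // assumed brake key
        writer.WriteLine(distance + "," + speed + "," + (braking ? "Yes" : "No"));
    }

    void OnDestroy()
    {
        writer.Close();
    }
}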
Training Examples
Gameplay data:

Distance | Speed | Brake?
2.4      | 11.3  | Yes
3.2      | 70.2  | Yes
75.7     | 72.7  | No
80.6     | 89.4  | Yes
2.8      | 15.2  | No
82.1     | 8.6   | Yes
3.8      | 69.4  | Yes
Training Examples
Sometimes it is important to make the data as obvious as possible. We can categorize distances as "near" or "far" and speeds as "slow" or "fast".
When making decisions, most human players don't consider precise velocity or distance. They usually categorize the information.

Distance | Speed | Brake?
Near     | Slow  | Yes
Near     | Fast  | Yes
Far      | Fast  | No
Far      | Fast  | Yes
Near     | Slow  | No
Far      | Slow  | Yes
Near     | Fast  | Yes
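A small helper can perform this categorization when the samples are logged. This is a minimal sketch; the 40-unit cut-off values are illustrative assumptions, not values from the lecture.

// Quantize raw distance/speed readings into the categories used above.
public static class SampleQuantizer
{
    public static string CategorizeDistance(float distance)
    {
        return distance < 40.0f ? "Near" : "Far";
    }

    public static string CategorizeSpeed(float speed)
    {
        return speed < 40.0f ? "Slow" : "Fast";
    }
}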
Classification Problem (Feature Space)
[Figure: training examples plotted in a two-dimensional feature space, with Weight on the horizontal axis and Height on the vertical axis.]
Supervised Learning
Given a finite amount of training data, we need to find a function h that approximates the real function f(x), which generated the data and is unknown.
There is an infinite number of functions h.
Learning Algorithms  
There are many machine learning algorithms:  
Decision Trees  
Artificial Neural Networks  
Support Vector Machines  
K-Nearest Neighbors  
Naive Bayes  
K-Means  
Linear Regression  
Logistic Regression  
Random Forest  
Q-Learning  
Decision Tree  
A decision tree is a tree structure with a series of decisions  
that generate an action to take based on a set of observations.  
At each branch, some aspect of the game world is considered and a different branch is chosen. Eventually, the series of branches leads to an action (leaf node).
Decision Tree  
Decision trees can be efficiently learned: constructed  
dynamically from sets of observations and actions.  
The constructed trees can then be used to make decisions during  
gameplay.  
There is a range of different decision tree learning algorithms. Those used in game AI are typically based on the ID3 algorithm.
ID3 is a simple-to-implement, relatively efficient decision tree learning algorithm.
ID3 Algorithm
1. The algorithm starts with a single leaf node in a decision tree and assigns the whole set of examples to that node.
2. It then splits its current node (initially the single start node) so that it divides the examples into groups.
   a) The division process looks at each attribute in turn (i.e., each possible way to make a decision) and calculates the information gain for each possible division. The division with the highest information gain is chosen as the decision for this node.
   b) When the division is made, each of the two new nodes is given the subset of examples that applies to it, and the algorithm repeats for each of them.
3. The algorithm is recursive: starting from a single node, it keeps replacing leaf nodes with decisions until the whole decision tree has been created.
Decision Tree Example
Example: learn the best moments to attack and to defend.
Two possible actions: attack and defend.
Three attributes: health, cover, and ammo.
Dataset:

ID | Health  | Cover    | Ammo      | Action
1  | Healthy | In Cover | With Ammo | Attack
2  | Hurt    | In Cover | With Ammo | Attack
3  | Healthy | In Cover | Empty     | Defend
4  | Hurt    | In Cover | Empty     | Defend
5  | Hurt    | Exposed  | With Ammo | Defend
Decision Tree Example
Attack examples: {1, 2}; defend examples: {3, 4, 5}. The three candidate divisions of the training set are:
Health? Healthy: {1, 3}; Hurt: {2, 4, 5}
Ammo? With Ammo: {1, 2, 5}; Empty: {3, 4}
Cover? In Cover: {1, 2, 3, 4}; Exposed: {5}
Entropy and Information Gain  
In order to decide which attribute should be considered at  
each step, ID3 uses the entropy and information gain of the  
actions in the set.  
Entropy is a measure of the information in a set of examples.  
If all the examples have the same action, the entropy will be 0.  
If the actions are distributed evenly, then the entropy will be 1.  
$E = -p_+ \log_2 p_+ - p_- \log_2 p_-$, where $p_+$ and $p_-$ are the proportions of the two actions in the set.
Information gain is simply the reduction in overall entropy.  
Decision Tree Example
For the two possible outcomes (attack and defend), the entropy of the whole training set (2 attack examples, 3 defend examples) is given by:

$E = -p_A \log_2 p_A - p_D \log_2 p_D$

$E([2, 3]) = -\frac{2}{5} \log_2 \frac{2}{5} - \frac{3}{5} \log_2 \frac{3}{5} = 0.9709$
Decision Tree Example
Splitting on each attribute gives the following child entropies:

Health? Healthy: {1, 3}; Hurt: {2, 4, 5}
$E(\text{Healthy}) = -\frac{1}{2} \log_2 \frac{1}{2} - \frac{1}{2} \log_2 \frac{1}{2} = 1.0000$
$E(\text{Hurt}) = -\frac{1}{3} \log_2 \frac{1}{3} - \frac{2}{3} \log_2 \frac{2}{3} = 0.9182$

Cover? In Cover: {1, 2, 3, 4}; Exposed: {5}
$E(\text{In Cover}) = -\frac{2}{4} \log_2 \frac{2}{4} - \frac{2}{4} \log_2 \frac{2}{4} = 1.0000$
$E(\text{Exposed}) = -\frac{0}{1} \log_2 \frac{0}{1} - \frac{1}{1} \log_2 \frac{1}{1} = 0.0000$

Ammo? With Ammo: {1, 2, 5}; Empty: {3, 4}
$E(\text{With Ammo}) = -\frac{2}{3} \log_2 \frac{2}{3} - \frac{1}{3} \log_2 \frac{1}{3} = 0.9182$
$E(\text{Empty}) = -\frac{0}{2} \log_2 \frac{0}{2} - \frac{2}{2} \log_2 \frac{2}{2} = 0.0000$
Decision Tree Example
The information gain for each division is the reduction in entropy from the current example set (0.971) to the entropies of the children sets:

$Gain(S, A) = E(S) - \sum_{v \in values(A)} \frac{|S_v|}{|S|} E(S_v)$

$Gain(\text{health}) = 0.971 - \frac{2}{5}(1.0000) - \frac{3}{5}(0.9182) = 0.0200$
$Gain(\text{cover}) = 0.971 - \frac{4}{5}(1.0000) - \frac{1}{5}(0.0000) = 0.1710$
$Gain(\text{ammo}) = 0.971 - \frac{3}{5}(0.9182) - \frac{2}{5}(0.0000) = 0.4200$
Decision Tree Example
Ammo is the best indicator of what action we need to take (this makes sense, since we cannot possibly attack without ammo).
Following the algorithm, we use ammo as the first branch in the decision tree:
Decision Tree Example
The Empty branch now contains only defend examples (3, 4), so it becomes a leaf. The remaining set, the With Ammo branch {1, 2, 5} with entropy 0.9182, is split again:

Health? Healthy: {1}; Hurt: {2, 5}
$E(\text{Healthy}) = -\frac{1}{1} \log_2 \frac{1}{1} - \frac{0}{1} \log_2 \frac{0}{1} = 0.0000$
$E(\text{Hurt}) = -\frac{1}{2} \log_2 \frac{1}{2} - \frac{1}{2} \log_2 \frac{1}{2} = 1.0000$

Cover? In Cover: {1, 2}; Exposed: {5}
$E(\text{In Cover}) = -\frac{2}{2} \log_2 \frac{2}{2} - \frac{0}{2} \log_2 \frac{0}{2} = 0.0000$
$E(\text{Exposed}) = -\frac{0}{1} \log_2 \frac{0}{1} - \frac{1}{1} \log_2 \frac{1}{1} = 0.0000$

$Gain(\text{health}) = 0.9182 - \frac{1}{3}(0.0000) - \frac{2}{3}(1.0000) = 0.2516$
$Gain(\text{cover}) = 0.9182 - \frac{2}{3}(0.0000) - \frac{1}{3}(0.0000) = 0.9182$

Cover has the highest gain, so it becomes the next decision.
Decision Tree Example  
By adding the cover attribute to the decision tree, all training  
examples can be correctly classified.  
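Putting the two decisions together, the learned tree (reconstructed from the gains computed above) is:

Ammo?
  Empty: Defend
  With Ammo: Cover?
    Exposed: Defend
    In Cover: Attack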
Decision Tree - Generalizations
The same process works with more than two actions. In this case the entropy calculation generalizes to:

$E = -\sum_{i=1}^{n} p_i \log_2 p_i$

where n is the number of actions, and $p_i$ is the proportion of each action in the example set.
When there are more than two categories for an attribute, the formula for information gain generalizes to:

$Gain = E(S) - \sum_{i=1}^{n} \frac{|S_i|}{|S|} E(S_i)$

where $S_i$ is the subset of examples for which the attribute takes its i-th value.
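As a rough sketch, the two formulas map directly to C#. The Example representation here is an illustrative assumption, not the format used by the decision tree library linked on the next slides.

using System;
using System.Collections.Generic;
using System.Linq;

// An example is a set of attribute -> value pairs plus an action label.
public class Example
{
    public Dictionary<string, string> Attributes;
    public string Action;
}

public static class Id3Math
{
    // E = -sum_i p_i log2 p_i, over the proportion of each action in the set.
    public static double Entropy(List<Example> examples)
    {
        double entropy = 0;
        foreach (var group in examples.GroupBy(e => e.Action))
        {
            double p = (double)group.Count() / examples.Count;
            entropy -= p * Math.Log(p, 2);
        }
        return entropy;
    }

    // Gain = E(S) - sum_i (|S_i| / |S|) * E(S_i), splitting S by one attribute.
    public static double Gain(List<Example> examples, string attribute)
    {
        double gain = Entropy(examples);
        foreach (var subset in examples.GroupBy(e => e.Attributes[attribute]))
        {
            double weight = (double)subset.Count() / examples.Count;
            gain -= weight * Entropy(subset.ToList());
        }
        return gain;
    }
}

On the five-example attack/defend dataset above, Gain returns 0.02 for health, 0.171 for cover, and 0.42 for ammo, matching the worked example.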
Decision Tree in Unity  
Decision Tree in C#:  
https://github.com/Zolomon/decision-tree  
Adapted code to Unity:  
http://www.inf.puc-rio.br/~elima/game-ai/decision-tree.zip  
ARFF Dataset Format:

@relation (DatasetName)
@attribute (AttribName1) (AttribType1)
@attribute (AttribName2) (AttribType2)
...
@attribute (AttribNameN) (AttribTypeN)
@attribute (Class) {(Class1, Class2, ..., ClassN)}
@data
(Attrib1), (Attrib2), ..., (AttribN), (Class)
...
Decision Tree in Unity  
ARFF Example:  
@RELATION AttackOrDefend
@ATTRIBUTE health {healthy,hurt}
@ATTRIBUTE cover {incover,exposed}
@ATTRIBUTE ammo {withammo,empty}
@ATTRIBUTE action {attack,defend}
@DATA  
healthy,incover,withammo,attack  
hurt,incover,withammo,attack  
healthy,incover,empty,defend  
hurt,incover,empty,defend  
hurt,exposed,withammo,defend  
Decision Tree in Unity  
Unity DecisionTree Class:  
using System.Collections.Generic;  
using UnityEngine;  
using System.IO;  
using decisiontree;  
public class DecisionTree : MonoBehaviour {  
private DecisionBuilder builder;  
private Node tree;  
void Start(){  
ArffReader reader = new ArffReader();  
Arff arff = reader.Parse(new StreamReader("Assets\\test.arff"));  
builder = new DecisionBuilder(arff);  
tree = builder.BuildTree(arff.Data, arff.Attributes, false);  
...
Decision Tree in Unity  
//example used to test the decision tree
Dictionary<string, string> example = new Dictionary<string,  
string>();  
example.Add("health", "healthy");  
example.Add("cover", "incover");  
example.Add("ammo", "empty");  
string action = ClassifyExample(tree, example);  
if (action == "attack"){  
Debug.Log("Attack!!!");  
}
else if (action == "defend"){  
Debug.Log("Defend!!!");  
}
}
...
Decision Tree in Unity  
private string ClassifyExample(Node node, Dictionary<string, string> example){
    Node currentNode = node;
    while (currentNode != null){
        if (currentNode.IsLeaf()){
            return currentNode.Display(1); //leaf reached: return its action
        }
        Node next = null;
        //follow the branch that matches the example's value for this attribute
        foreach (KeyValuePair<string, Node> child in currentNode.children){
            if (example[currentNode.attribute.Name] == child.Key){
                next = child.Value;
                break;
            }
        }
        if (next == null){
            break; //no branch matches this value: stop instead of looping forever
        }
        currentNode = next;
    }
    return "";
}
}
Artificial Neural Networks  
Artificial neural networks are computing systems inspired by  
the biological neural networks that constitute animal brains.  
Artificial Neural Networks  
Neural networks consist of a large number of relatively simple  
nodes, each running the same algorithm.  
These nodes are the artificial neurons, originally intended to simulate  
the operation of a single brain cell.  
Artificial Neuron
Threshold Unit: inputs $x_1, \ldots, x_n$ are weighted by $w_1, \ldots, w_n$ and summed; the unit fires if the weighted sum is positive:

$o(x_1, \ldots, x_n) = \begin{cases} 1 & \text{if } \sum_{i=0}^{n} w_i x_i > 0 \\ -1 & \text{otherwise} \end{cases}$
Artificial Neuron Learning  
In order to learn a function, the perceptron must adjust its  
weights based on the difference between the desired output  
and its current output.  
Learning Rule:

$w_i \leftarrow w_i + \Delta w_i$, where $\Delta w_i = \eta (t - o) x_i$

Where:
t = Desired output.
o = Current output.
η = Learning rate.
Training a Neuron Example
AND operator. Threshold T = 0.2; learning rate η = 0.1; initial weights wA = 0.3, wB = -0.1.
Learning rule: $w_i \leftarrow w_i + \Delta w_i$, where $\Delta w_i = \eta (t - o) x_i$.

Target outputs (AND):
A | B | Output
0 | 0 | 0
0 | 1 | 0
1 | 0 | 0
1 | 1 | 1

Each of the following passes computes Sum, Output, and Error (t - o) for the four input pairs with the current weights; the weights are then updated and the process repeats.
Training a Neuron Example
Pass 1: weights wA = 0.3, wB = -0.1 (T = 0.2, η = 0.1).

A | B | Sum                      | Output | Error
0 | 0 | (0*0.3)+(0*-0.1) = 0     | 0      | 0
0 | 1 | (0*0.3)+(1*-0.1) = -0.1  | 0      | 0
1 | 0 | (1*0.3)+(0*-0.1) = 0.3   | 1      | -1
1 | 1 | (1*0.3)+(1*-0.1) = 0.2   | 1      | 0
Training a Neuron Example
Pass 2: weights wA = 0.2, wB = -0.1.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.2)+(0*-0.1) = 0     | 0      | 0
0 | 1 | (0*0.2)+(1*-0.1) = -0.1  | 0      | 0
1 | 0 | (1*0.2)+(0*-0.1) = 0.2   | 1      | -1
1 | 1 | (1*0.2)+(1*-0.1) = 0.1   | 0      | 1
Training a Neuron Example
Pass 3: weights wA = 0.1, wB = -0.1.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.1)+(0*-0.1) = 0     | 0      | 0
0 | 1 | (0*0.1)+(1*-0.1) = -0.1  | 0      | 0
1 | 0 | (1*0.1)+(0*-0.1) = 0.1   | 0      | 0
1 | 1 | (1*0.1)+(1*-0.1) = 0     | 0      | 1
Training a Neuron Example
Pass 4: weights wA = 0.2, wB = 0.0.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.2)+(0*0.0) = 0      | 0      | 0
0 | 1 | (0*0.2)+(1*0.0) = 0      | 0      | 0
1 | 0 | (1*0.2)+(0*0.0) = 0.2    | 1      | -1
1 | 1 | (1*0.2)+(1*0.0) = 0.2    | 1      | 0
Training a Neuron Example
Pass 5: weights wA = 0.1, wB = 0.0.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.1)+(0*0.0) = 0      | 0      | 0
0 | 1 | (0*0.1)+(1*0.0) = 0      | 0      | 0
1 | 0 | (1*0.1)+(0*0.0) = 0.1    | 0      | 0
1 | 1 | (1*0.1)+(1*0.0) = 0.1    | 0      | 1
Training a Neuron Example
Pass 6: weights wA = 0.2, wB = 0.1.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.2)+(0*0.1) = 0      | 0      | 0
0 | 1 | (0*0.2)+(1*0.1) = 0.1    | 0      | 0
1 | 0 | (1*0.2)+(0*0.1) = 0.2    | 1      | -1
1 | 1 | (1*0.2)+(1*0.1) = 0.3    | 1      | 0
Training a Neuron Example
Pass 7: weights wA = 0.1, wB = 0.2.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.1)+(0*0.2) = 0      | 0      | 0
0 | 1 | (0*0.1)+(1*0.2) = 0.2    | 1      | -1
1 | 0 | (1*0.1)+(0*0.2) = 0.1    | 0      | 0
1 | 1 | (1*0.1)+(1*0.2) = 0.3    | 1      | 0
Training a Neuron Example
Pass 8: weights wA = 0.1, wB = 0.1.

A | B | Sum                      | Output | Error
0 | 0 | (0*0.1)+(0*0.1) = 0      | 0      | 0
0 | 1 | (0*0.1)+(1*0.1) = 0.1    | 0      | 0
1 | 0 | (1*0.1)+(0*0.1) = 0.1    | 0      | 0
1 | 1 | (1*0.1)+(1*0.1) = 0.2    | 1      | 0

All errors are now zero: the weights wA = 0.1 and wB = 0.1 compute the AND operator, and training stops.
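The whole procedure fits in a few lines of C#. This is a minimal sketch that applies the update immediately after each misclassified example, so its intermediate weights may differ from the pass-by-pass trace above; it still converges to wA = 0.1, wB = 0.1.

using UnityEngine;

// Minimal sketch of the threshold-unit learning rule applied to the AND
// example: threshold T = 0.2, learning rate 0.1, initial weights 0.3 and -0.1.
public class PerceptronAnd : MonoBehaviour
{
    private readonly int[][] inputs = { new[] {0,0}, new[] {0,1}, new[] {1,0}, new[] {1,1} };
    private readonly int[] targets = { 0, 0, 0, 1 };   // AND operator

    void Start()
    {
        float wA = 0.3f, wB = -0.1f;
        const float threshold = 0.2f, learnRate = 0.1f;

        for (int epoch = 0; epoch < 20; epoch++)
        {
            int errors = 0;
            for (int i = 0; i < inputs.Length; i++)
            {
                float sum = inputs[i][0] * wA + inputs[i][1] * wB;
                int output = sum >= threshold ? 1 : 0;   // threshold unit
                int error = targets[i] - output;         // t - o
                if (error != 0)
                {
                    // Learning rule: wi = wi + eta * (t - o) * xi
                    wA += learnRate * error * inputs[i][0];
                    wB += learnRate * error * inputs[i][1];
                    errors++;
                }
            }
            if (errors == 0)
            {
                Debug.Log("Converged: wA=" + wA + " wB=" + wB);
                break;
            }
        }
    }
}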
Limitations
A single perceptron can only solve linearly separable functions.
However, we can combine several neurons to generate more complex functions.
[Diagram: a two-layer network in which inputs X1 and X2 feed threshold units h1 and h2, whose outputs feed a single output unit o.]
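To make the point concrete, here is a minimal sketch of such a combination: a hand-wired two-layer network of threshold units that computes XOR, which a single perceptron cannot. The weights are one classic construction and an assumption, not necessarily those in the original diagram.

using UnityEngine;

// Two threshold units feeding a third compute XOR.
public class XorNetwork : MonoBehaviour
{
    static int Step(float sum) { return sum > 0 ? 1 : 0; }

    public static int Xor(int x1, int x2)
    {
        int h1 = Step(x1 + x2 - 0.5f);   // h1 = x1 OR x2
        int h2 = Step(1.5f - x1 - x2);   // h2 = NOT (x1 AND x2)
        return Step(h1 + h2 - 1.5f);     // o = h1 AND h2
    }

    void Start()
    {
        for (int a = 0; a <= 1; a++)
            for (int b = 0; b <= 1; b++)
                Debug.Log(a + " XOR " + b + " = " + Xor(a, b));
    }
}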
Multi-Layer Neural Network  
Artificial Neuron
Sigmoid Unit: the unit computes the weighted sum of its inputs $x_1, \ldots, x_n$ (with weights $w_1, \ldots, w_n$) and applies the sigmoid function:

$net = \sum_{i=0}^{n} w_i x_i$

$o = \sigma(net) = \frac{1}{1 + e^{-net}}$
Sigmoid Function

$f_i(net_i(t)) = \frac{1}{1 + e^{-(net_i(t) - \theta)/\rho}}$

[Plots: the sigmoid curve for ρ = 0.1 (steep, close to a step function) and for ρ = 1 (smooth), both saturating at 0 and 1.]
Training a Multi-Layer Neural Network  
The most common algorithm used to train a multi-layer neural  
network is backpropagation.  
Backpropagation Algorithm
Initialize each weight $w_{i,j}$ with a small random value.
While stop condition not achieved do
{
    For each training example do
    {
        Send the example data forward through the network to generate the output value(s) $o_k$
        For each output unit k do
        {
            $\delta_k \leftarrow o_k (1 - o_k)(t_k - o_k)$
        }
        For each hidden unit h do
        {
            $\delta_h \leftarrow o_h (1 - o_h) \sum_{k \in outputs} w_{h,k} \delta_k$
        }
        For each weight $w_{i,j}$ do
        {
            $w_{i,j} \leftarrow w_{i,j} + \Delta w_{i,j}$, where $\Delta w_{i,j} = \eta \delta_j x_{i,j}$
        }
    }
}
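As a minimal sketch, one training step of this algorithm for a network with a single hidden layer could look like this in C#. The array shapes, naming, and the omission of bias weights are simplifying assumptions; the NeuralNetwork class used on the following slides is a complete implementation.

using System;

// One backpropagation step for a single hidden layer of sigmoid units.
public class TinyBackprop
{
    static double Sigmoid(double x) { return 1.0 / (1.0 + Math.Exp(-x)); }

    public static void TrainStep(double[] x, double[] t,
                                 double[,] wIH, double[,] wHO, double eta)
    {
        int nIn = x.Length, nHid = wIH.GetLength(1), nOut = t.Length;
        double[] h = new double[nHid];
        double[] o = new double[nOut];

        // Forward pass: input -> hidden -> output.
        for (int j = 0; j < nHid; j++)
        {
            double net = 0;
            for (int i = 0; i < nIn; i++) net += wIH[i, j] * x[i];
            h[j] = Sigmoid(net);
        }
        for (int k = 0; k < nOut; k++)
        {
            double net = 0;
            for (int j = 0; j < nHid; j++) net += wHO[j, k] * h[j];
            o[k] = Sigmoid(net);
        }

        // Output deltas: delta_k = o_k (1 - o_k)(t_k - o_k).
        double[] deltaO = new double[nOut];
        for (int k = 0; k < nOut; k++) deltaO[k] = o[k] * (1 - o[k]) * (t[k] - o[k]);

        // Hidden deltas: delta_h = o_h (1 - o_h) * sum_k w_hk delta_k.
        double[] deltaH = new double[nHid];
        for (int j = 0; j < nHid; j++)
        {
            double sum = 0;
            for (int k = 0; k < nOut; k++) sum += wHO[j, k] * deltaO[k];
            deltaH[j] = h[j] * (1 - h[j]) * sum;
        }

        // Weight updates: w_ij += eta * delta_j * x_ij.
        for (int j = 0; j < nHid; j++)
            for (int k = 0; k < nOut; k++) wHO[j, k] += eta * deltaO[k] * h[j];
        for (int i = 0; i < nIn; i++)
            for (int j = 0; j < nHid; j++) wIH[i, j] += eta * deltaH[j] * x[i];
    }
}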
Neural Network Example  
Example Dataset:  
enemy1_distance, enemy2_distance, enemy3_distance,  
enemy4_distance, enemy5_distance, friend1_distance,  
friend2_distance, friend3_distance, friend4_distance,  
friend1_health, friend2_health, friend3_health,  
friend4_health, friend1_ammo, friend2_ammo, friend3_ammo,  
friend4_ammo, npc_health, npc_ammo, output_1, output_2,  
output_3, output_4, output_5.  
38.66639 24.33377 42.02419 30.08049 6.865509 6.836913 14.03836 22.79428 2.829218 77 44 94 84 84 22 38 69 82 37 0 1 0 0 0
16.61636 8.189857 38.72226 44.58482 27.93692 30.83033 13.37324 15.70575 17.51968 7 73 16 85 7 78 84 26 72 89 0 1 0 0 0
21.39196 7.802973 4.604796 33.43155 21.54505 44.25414 23.44657 32.52487 19.25208 43 86 98 75 39 9 23 11 27 71 0 1 0 0 0
25.10197 25.31712 40.84161 26.15318 34.83861 3.897014 44.4931 42.89812 19.62424 11 98 84 6 87 99 94 63 96 88 0 0 0 1 0
31.29582 43.86897 41.10728 39.88687 11.66713 2.803209 20.70098 2.086438 13.35874 9 63 69 15 46 43 76 71 78 96 0 0 0 1 0
35.0895 34.01117 41.12255 36.36316 43.55266 33.24846 6.954234 9.050363 35.59743 53 34 82 21 3 97 72 94 3 7 0 0 0 0 1
24.66118 38.50056 20.31548 33.68458 31.04514 20.71192 35.30664 19.74172 44.19277 35 1 6 3 26 54 10 24 89 30 0 0 1 0 0
31.33946 38.89338 39.39215 43.40701 35.70232 34.58619 20.71613 6.100486 18.00656 -1 4 63 76 82 46 85 63 16 8 0 0 0 0 1
15.06403 23.0914 39.81666 11.20308 5.064096 36.92915 33.80843 3.121126 44.42622 37 90 76 -1 90 95 87 60 29 2 1 0 0 0 0
9.559992 11.26607 24.1834 20.52185 10.86952 26.05703 13.62671 35.59985 39.43266 7 58 37 41 72 -1 27 30 85 77 0 1 0 0 0
...
Neural Networks in Unity  
Weka (great tool to evaluate machine learning algorithms):  
https://www.cs.waikato.ac.nz/ml/weka/  
Dataset in ARFF format:  
http://www.inf.puc-rio.br/~elima/game-ai/game_example_dataset.arff  
Artificial Neural Network in C#:  
https://visualstudiomagazine.com/articles/2015/04/01/back-  
Adapted code to Unity:  
http://www.inf.puc-rio.br/~elima/game-ai/NeuralNetwork.cs  
Neural Networks in Unity  
NeuralNet Class:  
using System.IO;  
using UnityEngine;  
public class NeuralNet : MonoBehaviour {  
public int numInput = 19;  
public int numHidden = 25;  
public int numOutput = 5;  
public int numExamples = 100;  
public int maxEpochs = 1000;  
public double learnRate = 0.01;  
public double momentum = 0.005;  
public int splitSeed = 1;  
private NeuralNetwork neuralnetwork;  
private double[][] fulldataset;  
...
Neural Networks in Unity  
public double[][] LoadDataset(string filename)  
{
double[][] allData = new double[numExamples][];  
for (int i = 0; i < numExamples; ++i)  
allData[i] = new double[numInput + numOutput];  
using (TextReader reader = File.OpenText(filename))  
{
for (int i = 0; i < numExamples; ++i)  
{
string text = reader.ReadLine();  
string[] data = text.Split(' ');  
for (int x = 0; x < data.Length; x++)  
{
allData[i][x] = double.Parse(data[x]);  
}
}
}
return allData;  
}
Neural Networks in Unity  
public NeuralNetwork TrainNeuralNetwork(double[][] dataset)  
{
NeuralNetwork nn = new NeuralNetwork(numInput, numHidden,  
numOutput);  
double[] weights = nn.Train(dataset, maxEpochs, learnRate,  
momentum);  
return nn;  
}
public int ClassifyExample(NeuralNetwork nn, string exampleData)  
{
double[] example = new double[numInput];  
string[] data = exampleData.Split(' ');  
for (int x = 0; x < data.Length; x++)  
{
example[x] = double.Parse(data[x]);  
}
return nn.Predict(example);  
}
Neural Networks in Unity  
void Start(){  
fulldataset = LoadDataset("Assets\\game_dataset.txt");  
neuralnetwork = TrainNeuralNetwork(fulldataset);  
int actionId = ClassifyExample(neuralnetwork, "25.10197 25.31712 40.84161 26.15318 34.83861 3.897014 44.4931 42.89812 19.62424 11 98 84 6 87 99 94 63 96 88");
if (actionId == 0)  
Debug.Log("Run Away!!!");  
else if (actionId == 1)  
Debug.Log("Fight!!!");  
else if (actionId == 2)  
Debug.Log("Heal Friend!!!");  
else if (actionId == 3)  
Debug.Log("Hunt Enemy!!!");  
else if (actionId == 4)  
Debug.Log("Find Power-Up!!!");  
}
Neural Networks in Unity  
public void EvaluateAccuracy(double[][] dataset)  
{
double[][] trainData;  
double[][] testData;  
SplitTrainTest(dataset, 0.80, splitSeed, out trainData,  
out testData);  
NeuralNetwork nn = new NeuralNetwork(numInput, numHidden,  
numOutput);  
nn.Train(trainData, maxEpochs, learnRate, momentum);  
double trainAcc = nn.Accuracy(trainData);  
Debug.Log("Accuracy on training data: " + trainAcc * 100 + "%");  
double testAcc = nn.Accuracy(testData);  
Debug.Log("Accuracy on test data: " + testAcc * 100 + "%");  
}
private void SplitTrainTest(double[][] allData, double trainPct,  
int seed, out double[][] trainData,  
out double[][] testData){  
System.Random rnd = new System.Random(seed);  
...
Neural Networks in Unity  
int totRows = allData.Length;  
int numTrainRows = (int)(totRows * trainPct);  
int numTestRows = totRows - numTrainRows;  
trainData = new double[numTrainRows][];  
testData = new double[numTestRows][];  
double[][] copy = new double[allData.Length][];  
for (int i = 0; i < copy.Length; ++i)  
copy[i] = allData[i];  
for (int i = 0; i < copy.Length; ++i){  
int r = rnd.Next(i, copy.Length);  
double[] tmp = copy[r];  
copy[r] = copy[i];  
copy[i] = tmp;  
}
for (int i = 0; i < numTrainRows; ++i)  
trainData[i] = copy[i];  
for (int i = 0; i < numTestRows; ++i)  
testData[i] = copy[i + numTrainRows];  
}
}
Machine Learning in Games  
Racing games:  
FPS and Action games:  
Machine Learning in Games
Camera Control:
[Figure: camera control example with candidate camera positions labeled A to E.]
Machine Learning in Games  
Player Modeling  
Player Archetypes  
Personality and Behavior  
Difficulty Adjustment
Game Analytics  
Further Reading
Millington, I., Funge, J. (2009). Artificial Intelligence for Games (2nd ed.). CRC Press. ISBN: 978-0123747310.
Chapter 7: Learning.
Russell, S. and Norvig, P. (2009). Artificial Intelligence: A Modern Approach (3rd ed.). Prentice-Hall. ISBN: 0-13-604259-7.
Chapter 18: Learning from Observations.