Ambrozic, Martin

Resource type

Graduating extended essay / Research project

Thesis type

(Project) M.Sc.

Date created

2021-01-26

Authors/Contributors

Author: Ambrozic, Martin

Abstract

Learning actions values is a key idea in sports analytics with applications such as player ranking, tactical insight and outcome prediction. We compare two fundamentally different approaches for learning action values on a novel play-by-play volleyball dataset. In the first approach, we employ regression models that implicitly assume statistical independence of data samples. In the second approach, we use a deep reinforcement learning model, explicitly enforcing the sequential nature of the data in the learning process. We find that temporally independent regression can in certain settings outperform the reinforcement learning approach in terms of predictive accuracy, but the latter performs much better when temporal consistency is required. We also consider a mimic regression tree as a way to add interpretability to the deep reinforcement learning approach. Finally, we examine the computed action values and perform a number of example analyses to verify their validity.

Keywords

Identifier

etd21266

Copyright statement

Copyright is held by the author(s).

Permissions

This thesis may be printed or downloaded for non-commercial research and scholarly purposes.

Supervisor or Senior Supervisor

Thesis advisor: Schulte, Oliver

Language

English

Member of collection

Computing Science Theses

Download file	Size
input_data\21090\etd21266.pdf	432.76 KB

Temporal consistency in learning action values for volleyball

Keywords

Views & downloads - as of June 2023