Gaussian Based Non-linear Function Approximation for Reinforcement Learning

Abbas Haider; Glenn Hawe; Hui Wang; Bryan Scotney

doi:10.1007/s42979-021-00642-4

Research output: Contribution to journal › Article › peer-review

Abstract

Reinforcement learning (RL) problems with continuous states and discrete actions (CSDA) can be found in classic examples such as Cart Pole and Puck World, as well as real-world applications such as Market Making. Solutions to CSDA problems typically involve a function approximation (FA) of the mapping from states to actions, which can be linear or non-linear. Linear FAs such as tile-coding (Sutton and Barto in Reinforcement learning, 2nd ed, 2009) suffer from state information loss due to state discretization, whilst non-linear FAs such as DQN (Mnih et al. in Playing atari with deep reinforcement learning, https://arxiv.org/abs/1312.5602, 2013) are practically infeasible in infinitely large state spaces due to their cubic time complexity (O(n^3)). In this paper, we propose a novel, general solution to CSDA problems, called Gaussian distribution based non-linear function approximation (GBNLFA). Experimentation on three CSDA RL problems (Cart Pole, Puck World, Market Making) demonstrates the superiority of GBNLFA over state-of-the-art FAs, namely tile-coding and DQN. In particular, GBNLFA resolves the state information loss problem with linear FAs and provides an asymptotically faster algorithm (O(n)) than linear FAs (O(n^2)) and neural network based non-linear FAs (O(n^3)).
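
The state information loss mentioned in the abstract can be illustrated with a minimal sketch. This is not the GBNLFA algorithm, which the abstract does not describe in detail. It only shows that a single uniform tiling maps two distinct continuous states to the same bin, whereas a Gaussian probability density function assigns them different values. The helper names, the angle range, the number of tiles, and the mean and standard deviation are all hypothetical choices made for illustration.

```python
import math


def tile_index(x, low, high, n_tiles):
    """Return the index of the single uniform tile (bin) that contains x."""
    width = (high - low) / n_tiles
    idx = int((x - low) / width)
    # Clamp so values on or beyond the boundaries still map to a valid tile.
    return min(max(idx, 0), n_tiles - 1)


def gaussian_pdf(x, mean, std):
    """Density of the normal distribution N(mean, std^2) evaluated at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


# Two distinct pole angles in radians (assumed values for illustration).
a, b = 0.01, 0.04

# With 8 tiles over [-0.2, 0.2], both angles land in the same tile,
# so a tile-based feature cannot tell them apart.
same_tile = tile_index(a, -0.2, 0.2, 8) == tile_index(b, -0.2, 0.2, 8)

# A Gaussian density (assumed mean 0.0, std 0.1) keeps them distinct.
distinct_density = gaussian_pdf(a, 0.0, 0.1) != gaussian_pdf(b, 0.0, 0.1)

print(same_tile, distinct_density)
```

Practical tile coding usually overlays several offset tilings to reduce this effect, but each tiling still discretizes the state in the same way.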

Original language	English
Article number	223
Pages (from-to)	223
Journal	SN Computer Science
Volume	2
Issue number	3
Early online date	20 Apr 2021
DOIs	https://doi.org/10.1007/s42979-021-00642-4
Publication status	Published (in print/issue) - 20 Apr 2021

Keywords

Function approximation
Reinforcement learning
Gaussian distribution
Probability density function

Access to Document

10.1007/s42979-021-00642-4 (Licence: CC BY)

Haider2021_Article_GaussianBasedNon-linearFunctio: Final published version, 1.21 MB (Licence: CC BY)

Cite this

@article{ede9cde2905741df8ee23a56046a9ae2,

title = "Gaussian Based Non-linear Function Approximation for Reinforcement Learning",

abstract = "Reinforcement learning (RL) problems with continuous states and discrete actions (CSDA) can be found in classic examples such as Cart Pole and Puck World, as well as real-world applications such as Market Making. Solutions to CSDA problems typically involve a function approximation (FA) of the mapping from states to actions, which can be linear or non-linear. Linear FAs such as tile-coding (Sutton and Barto in Reinforcement learning, 2nd ed, 2009) suffer from state information loss due to state discretization, whilst non-linear FAs such as DQN (Mnih et al. in Playing atari with deep reinforcement learning, https://arxiv.org/abs/1312.5602, 2013) are practically infeasible in infinitely large state spaces due to their cubic time complexity ($O(n^3)$). In this paper, we propose a novel, general solution to CSDA problems, called Gaussian distribution based non-linear function approximation (GBNLFA). Experimentation on three CSDA RL problems (Cart Pole, Puck World, Market Making) demonstrates the superiority of GBNLFA over state-of-the-art FAs, namely tile-coding and DQN. In particular, GBNLFA resolves the state information loss problem with linear FAs and provides an asymptotically faster algorithm ($O(n)$) than linear FAs ($O(n^2)$) and neural network based non-linear FAs ($O(n^3)$).",

keywords = "Function approximation, Reinforcement learning, Gaussian distribution, Probability density function",

author = "Abbas Haider and Glenn Hawe and Hui Wang and Bryan Scotney",

year = "2021",

month = apr,

day = "20",

doi = "10.1007/s42979-021-00642-4",

language = "English",

volume = "2",

pages = "223",

journal = "SN Computer Science",

issn = "2661-8907",

publisher = "Springer",

number = "3",

}

TY - JOUR

T1 - Gaussian Based Non-linear Function Approximation for Reinforcement Learning

AU - Haider, Abbas

AU - Hawe, Glenn

AU - Wang, Hui

AU - Scotney, Bryan

PY - 2021/4/20

Y1 - 2021/4/20

N2 - Reinforcement learning (RL) problems with continuous states and discrete actions (CSDA) can be found in classic examples such as Cart Pole and Puck World, as well as real-world applications such as Market Making. Solutions to CSDA problems typically involve a function approximation (FA) of the mapping from states to actions, which can be linear or non-linear. Linear FAs such as tile-coding (Sutton and Barto in Reinforcement learning, 2nd ed, 2009) suffer from state information loss due to state discretization, whilst non-linear FAs such as DQN (Mnih et al. in Playing atari with deep reinforcement learning, https://arxiv.org/abs/1312.5602, 2013) are practically infeasible in infinitely large state spaces due to their cubic time complexity (O(n^3)). In this paper, we propose a novel, general solution to CSDA problems, called Gaussian distribution based non-linear function approximation (GBNLFA). Experimentation on three CSDA RL problems (Cart Pole, Puck World, Market Making) demonstrates the superiority of GBNLFA over state-of-the-art FAs, namely tile-coding and DQN. In particular, GBNLFA resolves the state information loss problem with linear FAs and provides an asymptotically faster algorithm (O(n)) than linear FAs (O(n^2)) and neural network based non-linear FAs (O(n^3)).

AB - Reinforcement learning (RL) problems with continuous states and discrete actions (CSDA) can be found in classic examples such as Cart Pole and Puck World, as well as real-world applications such as Market Making. Solutions to CSDA problems typically involve a function approximation (FA) of the mapping from states to actions, which can be linear or non-linear. Linear FAs such as tile-coding (Sutton and Barto in Reinforcement learning, 2nd ed, 2009) suffer from state information loss due to state discretization, whilst non-linear FAs such as DQN (Mnih et al. in Playing atari with deep reinforcement learning, https://arxiv.org/abs/1312.5602, 2013) are practically infeasible in infinitely large state spaces due to their cubic time complexity (O(n^3)). In this paper, we propose a novel, general solution to CSDA problems, called Gaussian distribution based non-linear function approximation (GBNLFA). Experimentation on three CSDA RL problems (Cart Pole, Puck World, Market Making) demonstrates the superiority of GBNLFA over state-of-the-art FAs, namely tile-coding and DQN. In particular, GBNLFA resolves the state information loss problem with linear FAs and provides an asymptotically faster algorithm (O(n)) than linear FAs (O(n^2)) and neural network based non-linear FAs (O(n^3)).

KW - Function approximation

KW - Reinforcement learning

KW - Gaussian distribution

KW - Probability density function

U2 - 10.1007/s42979-021-00642-4

DO - 10.1007/s42979-021-00642-4

M3 - Article

SN - 2661-8907

VL - 2

SP - 223

JO - SN Computer Science

JF - SN Computer Science

IS - 3

M1 - 223

ER -
