Fast Online Q(lambda)

Wiering, M.A.; Schmidhuber, J.

Fast Online Q(lambda)

Files

Wiering_98_fastonline.pdf (235.89 KB)

Publication date

1998

Authors

Wiering, M.A.

Schmidhuber, J.

Document Type

Article

Metadata

Show full item record

Collections

Utrecht University Repository

Abstract

Q(lambda)-learning uses TD(lambda)-methods to accelerate Q-learning. The update complexity of previous online Q(lambda)implementations based on lookup-tables is bounded by the size of the state-action space. Our faster algorithm's update complexity is bounded by the number of actions. The method is based on the observation that Q-value updates may be postponed until they are needed.

Keywords

Reinforcement learning, Q-learning, TD(lambda), online Q(lambda), lazy learning

URI

https://dspace.library.uu.nl/handle/1874/25451

Fast Online Q(lambda)

Files

Publication date

Authors

Editors

Advisors

Supervisors

DOI

Document Type

Metadata

Collections

License

Abstract

Keywords

Citation

URI