Purdue University Graduate School
Browse

FAST(ER) DATA GENERATION FOR OFFLINE RL AND FPS ENVIRONMENTS FOR DECISION TRANSFORMERS

Download (1.24 MB)
thesis
posted on 2023-12-06, 14:00 authored by Mark R TrovingerMark R Trovinger

Reinforcement learning algorithms have traditionally been implemented with the goal

of maximizing a reward signal. By contrast, Decision Transformer (DT) uses a transformer

model to predict the next action in a sequence. The transformer model is trained on datasets

consisting of state, action, return trajectories. The original DT paper examined a small

number of environments, five from the Atari domain, and three from continuous control,

and one that examined credit assignment. While this gives an idea of what the decision

transformer can do, the variety of environments in the Atari domain are limited. In this

work, we propose an extension of the environments that decision transformer can be trained

on by adding support for the VizDoom environment. We also developed a faster method for

offline RL dataset generation, using Sample Factory, a library focused on high throughput,

to generate a dataset comparable in quality to existing methods using significantly less time.


History

Degree Type

  • Master of Science

Department

  • Computer Science

Campus location

  • Fort Wayne

Advisor/Supervisor/Committee Chair

Dr. Mohammedreza Hajiarbabi

Advisor/Supervisor/Committee co-chair

Dr. Adolfo Coronado

Additional Committee Member 2

Dr. Jonathan Rusert

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC