Decision Transformer Model and Its Practical Use Cases

Michalco, Dominik

dc.contributor.advisor	Platoš, Jan
dc.contributor.author	Michalco, Dominik
dc.date.accessioned	2025-06-23T11:50:00Z
dc.date.available	2025-06-23T11:50:00Z
dc.date.issued	2025
dc.identifier.other	OSD002
dc.identifier.uri	http://hdl.handle.net/10084/156980
dc.description.abstract	This bachelor thesis investigates the Decision Transformer (DT) model, a recent method that reformulates reinforcement learning (RL) as a sequence modeling problem. Unlike traditional RL approaches that rely on value functions or policy gradients, DT predicts future actions based on full trajectories of states, actions, and desired returns (return-to-go). The thesis first presents the theoretical foundations of reinforcement learning and Transformer architectures, followed by a detailed examination of the DT model’s components, architecture, and advantages. Practical application areas are then proposed and evaluated through experiments in a grid-based navigation task and the game 2048. The results highlight the model’s strengths and weaknesses, and offer insights for future research and deployment in real-world settings.	en
dc.description.abstract	Tato bakalářská práce se zabývá modelem Decision Transformer (DT), který představuje moderní přístup k řešení úloh posilovaného učení (reinforcement learning) pomocí sekvenčního modelování. Na rozdíl od tradičních metod RL, které se opírají o odhady hodnot nebo přímou optimalizaci politiky, se DT spoléhá na předchozí trajektorie a predikuje následující akce na základě historie stavů, akcí a požadovaného návratu (return-to-go). Práce nejprve představuje teoretický základ posilovaného učení a architektury Transformer, následuje podrobný popis modelu Decision Transformer, včetně jeho architektury a výhod oproti klasickým metodám. Praktická část se zaměřuje na návrh vhodných oblastí použití a ověření funkčnosti modelu pomocí experimentů v prostředí navigace v mřížce a ve hře 2048. Výsledky ukazují silné i slabé stránky modelu a poskytují doporučení pro jeho další využití v praxi.	cs
dc.format.extent	2040325 bytes
dc.format.mimetype	application/pdf
dc.language.iso	en
dc.publisher	Vysoká škola báňská – Technická univerzita Ostrava	cs
dc.subject	Decision Transformer	en
dc.subject	reinforcement learning	en
dc.subject	sequence modeling	en
dc.subject	Transformer	en
dc.subject	offline RL	en
dc.subject	action prediction	en
dc.subject	deep learning	en
dc.subject	2048	en
dc.subject	navigation	en
dc.subject	Decision Transformer	cs
dc.subject	posilované učení	cs
dc.subject	sekvenční modelování	cs
dc.subject	Transformer	cs
dc.subject	offline RL	cs
dc.subject	predikce akcí	cs
dc.subject	hluboké učení	cs
dc.subject	2048	cs
dc.subject	navigace	cs
dc.title	Decision Transformer Model and Its Practical Use Cases	en
dc.title.alternative	Model Decision Transformer a jeho využití v praxi	cs
dc.title.alternative	Model Decision Transformer a jeho využitie v praxi	sk
dc.type	Bakalářská práce	cs
dc.contributor.referee	Dráždilová, Pavla
dc.date.accepted	2025-06-02
dc.thesis.degree-name	Bc.
dc.thesis.degree-level	Bakalářský studijní program	cs
dc.thesis.degree-grantor	Vysoká škola báňská – Technická univerzita Ostrava. Fakulta elektrotechniky a informatiky	cs
dc.description.department	460 - Katedra informatiky	cs
dc.thesis.degree-program	Informatika	cs
dc.description.result	velmi dobře	cs
dc.identifier.sender	S2724
dc.identifier.thesis	MIC0494_FEI_B0613A140014_2025
dc.rights.access	openAccess