NPV-DQN: Improving Value-based Reinforcement Learning, by Variable Discount Factor, with Control Applications

Paczolay, Gabor; Harmati, Istvan

Author	Paczolay, Gabor
Author	Harmati, Istvan
xmlui.dri2xhtml.METS-1.0.item-date-accessioned	2025-08-18T13:00:44Z
xmlui.dri2xhtml.METS-1.0.item-date-available	2025-08-18T13:00:44Z
xmlui.dri2xhtml.METS-1.0.item-date-issued	2024
xmlui.dri2xhtml.METS-1.0.item-identifier-issn	1785-8860	hu_HU
xmlui.dri2xhtml.METS-1.0.item-identifier-uri	http://hdl.handle.net/20.500.14044/32343
xmlui.dri2xhtml.METS-1.0.item-description-abstract	Discount factor plays an important role in reinforcement learning algorithms. It decides how much future rewards are valued for the present time-step. In this paper, a system with a Q value estimation, based on two distinct discount factors are utilized. These estimations can later be merged into one network, to make the computations more efficient. The decision of which network to use, is based on the relative value of the maximum value of the short-term network, the more unambiguous the maximum is, the more probability is rendered to the selection of that network. The system is then benchmarked, on a cartpole and a gridworld environment.	hu_HU
dc.format	PDF	hu_HU
xmlui.dri2xhtml.METS-1.0.item-language	en	hu_HU
Title	NPV-DQN: Improving Value-based Reinforcement Learning, by Variable Discount Factor, with Control Applications	hu_HU
xmlui.dri2xhtml.METS-1.0.item-rights-access	Open access	hu_HU
xmlui.dri2xhtml.METS-1.0.item-rights	Óbudai Egyetem	hu_HU
xmlui.dri2xhtml.METS-1.0.item-other-containerPublisherPlace	Budapest	hu_HU
xmlui.dri2xhtml.METS-1.0.item-publisher-university	Óbudai Egyetem	hu_HU
xmlui.dri2xhtml.METS-1.0.item-subject-area	Társadalomtudományok - gazdálkodás- és szervezéstudományok	hu_HU
xmlui.dri2xhtml.METS-1.0.item-subject-oszkar	reinforcement learning	hu_HU
xmlui.dri2xhtml.METS-1.0.item-subject-oszkar	DQN	hu_HU
xmlui.dri2xhtml.METS-1.0.item-subject-oszkar	NPV	hu_HU
xmlui.dri2xhtml.METS-1.0.item-subject-oszkar	NPV-DQN	hu_HU
xmlui.dri2xhtml.METS-1.0.item-type-type	Tudományos cikk	hu_HU
xmlui.dri2xhtml.METS-1.0.item-other-containerTitle	Acta Polytechnica Hungarica	hu_HU
local.tempfieldCollections	Folyóiratcikkek	hu_HU
xmlui.dri2xhtml.METS-1.0.item-identifiers [doi]	10.12700/APH.21.11.2024.11.10
xmlui.dri2xhtml.METS-1.0.item-description-version	Kiadói változat	hu_HU
xmlui.dri2xhtml.METS-1.0.item-format-page	16 p.	hu_HU
xmlui.dri2xhtml.METS-1.0.item-other-containerPeriodicalNumber	11. sz.	hu_HU
xmlui.dri2xhtml.METS-1.0.item-other-containerPeriodicalVolume	21. évf.	hu_HU
xmlui.dri2xhtml.METS-1.0.item-other-containerPeriodicalYear	2024	hu_HU
xmlui.dri2xhtml.METS-1.0.item-other-containerPublisher	Óbudai Egyetem	hu_HU

Files in this item

Name:: Paczolay_Harmati_151.pdf
Size:: 524.2Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

2.01. 2024 Volume 21, Issue No. 11. [17]

Show simple item record