Knowledge Base Optimization of the HFRIQ-Learning
Tompa, Tamás
Kovács, Szilveszter
2025-08-19T08:43:20Z
2024
1785-8860
http://hdl.handle.net/20.500.14044/32424
The learning process of conventional reinforcement learning methods, such as Q-learning and SARSA, typically starts with an empty knowledge base. In each iteration step, the initially empty knowledge base is gradually constructed from the reinforcement signals obtained from the environment. Even if only a fragment of knowledge about the system behavior is available and can be injected into the learning process, the learning performance can be improved. In Heuristically Accelerated Fuzzy Rule Interpolation-based Q-learning (HFRIQ-learning), this external knowledge can be represented in the form of state-action fuzzy rules defined by human experts. If the expert knowledge base contains inaccuracies, i.e., incorrect state-action rules, it can negatively impact the learning performance. The main goal of this paper is to introduce a methodology for correcting (optimizing) the inaccurate a priori expert knowledge and, as an additional benefit of the optimization, for reducing the size of the fuzzy rule base representing the Q-function during the learning phase. The paper also presents examples of how the quality of the expert knowledge influences the HFRIQ-learning performance on a well-known reinforcement learning benchmark problem.
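For illustration only, the sketch below shows the conventional tabular Q-learning baseline referred to above, starting from an all-zero (empty) knowledge base that is filled in solely by the reinforcement signals received from the environment; the toy chain environment, hyper-parameters, and function names are illustrative assumptions and do not reproduce the HFRIQ-learning method or its fuzzy rule base.

# Minimal sketch of conventional tabular Q-learning (not HFRIQ-learning).
# The chain environment and all parameter values below are assumptions.
import random
from collections import defaultdict

N_STATES = 5          # states 0..4, state 4 is the goal
ACTIONS = (-1, +1)    # step left or right along the chain

def step(state, action):
    # Deterministic chain walk: reward 1.0 only when the goal is reached.
    next_state = max(0, min(N_STATES - 1, state + action))
    reward = 1.0 if next_state == N_STATES - 1 else 0.0
    done = next_state == N_STATES - 1
    return next_state, reward, done

q = defaultdict(float)            # empty knowledge base: Q(s, a) = 0 everywhere
alpha, gamma, epsilon = 0.1, 0.9, 0.2

for episode in range(200):
    state, done = 0, False
    while not done:
        # epsilon-greedy action selection over the current Q estimates
        if random.random() < epsilon:
            action = random.choice(ACTIONS)
        else:
            action = max(ACTIONS, key=lambda a: q[(state, a)])
        next_state, reward, done = step(state, action)
        # Q-learning update: the knowledge base is built step by step
        # from the reinforcement signal obtained from the environment.
        best_next = max(q[(next_state, a)] for a in ACTIONS)
        q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
        state = next_state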
PDF
en