Bibliographic data

Document WO002022033746A1 (Pages: 26)

Document 1/1

Bibliographic data Document WO002022033746A1 (Pages: 26)
INID	Criterion	Field	Contents
54	Title	TI	[DE] TRAINIEREN EINES REINFORCEMENT LEARNING AGENTEN ZUR STEUERUNG EINES AUTONOMEN SYSTEMS [EN] TRAINING A REINFORCEMENT LEARNING AGENT TO CONTROL AN AUTONOMOUS SYSTEM [FR] ENTRA�NEMENT D'UN AGENT D'APPRENTISSAGE PAR RENFORCEMENT POUR COMMANDER UN SYST�ME AUTONOME
71/73	Applicant/owner	PA	BAYERISCHE MOTOREN WERKE AG, DE
72	Inventor	IN	B�DECKER JOSCHKA, DE ; H�GLE MARIA, DE ; KALWEIT GABRIEL, DE ; WERLING MORITZ, DE
22/96	Application date	AD	Jun 14, 2021
21	Application number	AN	2021065919
	Country of application	AC	EP
	Publication date	PUB	Feb 17, 2022
33 31 32	Priority data	PRC PRN PRD	DE 102020121150 20200811
51	IPC main class	ICM	G06N 3/00 (2006.01)
51	IPC secondary class	ICS	B60W 30/18 (2012.01) G06N 3/08 (2006.01)
	IPC additional class	ICA
	IPC index class	ICI
	Cooperative patent classification	CPC	B60W 60/001 G06N 20/00 G06N 3/008 G06N 3/08
	MCD main class	MCM	G06N 3/00 (2006.01)
	MCD secondary class	MCS	B60W 30/18 (2012.01) G06N 3/08 (2006.01)
	MCD additional class	MCA
57	Abstract	AB	[DE] Ein Aspekt der Erfindung betrifft eine Vorrichtung zum Trainieren eines Reinforcement Learning Agenten zur Steuerung eines autonomen Systems, wobei die Vorrichtung eingerichtet ist, das Umfeld des autonomen Systems zu erfassen, zumindest ein zu dem autonomen System vergleichbares Objekt im Umfeld des autonomen Systems zu erkennen, ein Verhalten des zumindest einen zu dem autonomen System vergleichbaren Objekts zu erfassen, und den Reinforcement Learning Agenten in Abh�ngigkeit von dem erfassten Verhalten des zumindest einen zu dem autonomen System vergleichbaren Objekts zu trainieren. [EN] One aspect of the invention relates to a device for training a reinforcement learning agent to control an autonomous system, wherein the device is designed to detect the environment of the autonomous system, to detect at least one object in the environment of the autonomous system that can be compared to the autonomous system, to detect a behaviour of the at least one object that can be compared to the autonomous system, and to train the reinforcement learning agent in accordance with the detected behaviour of the at least one object that can be compared to the autonomous system. [FR] Selon un aspect, l'invention concerne un dispositif pour entra�ner un agent d'apprentissage par renforcement pour commander un syst�me autonome, le dispositif �tant con�u pour d�tecter l'environnement du syst�me autonome, pour d�tecter au moins un objet comparable au syst�me autonome dans l'environnement du syst�me autonome, pour d�tecter un comportement de l'objet ou des objets comparable(s) au syst�me autonome, et pour entra�ner l'agent d'apprentissage par renforcement en fonction du comportement d�tect� de l'objet ou des objets comparable(s) au syst�me autonome.
56	Cited documents identified in the search	CT	US000010627823B1 US020200189597A1
56	Cited documents indicated by the applicant	CT
56	Cited non-patent literature identified in the search	CTNP	MICHEL DEUDON ET AL: "HighRes-net: Recursive Fusion for Multi-Frame Super-Resolution of Satellite Imagery", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 15 February 2020 (2020-02-15), XP081600852 7; WANG PIN ET AL: "A Reinforcement Learning Based Approach for Automated Lane Change Maneuvers", 2018 IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV), IEEE, 26 June 2018 (2018-06-26), pages 1379 - 1384, XP033423455, DOI: 10.1109/IVS.2018.8500556 7
56	Cited non-patent literature indicated by the applicant	CTNP
	Citing documents		Determine documents
	Sequence listings
	Search file IPC	ICP	B60W 30/18 G06N 3/00