. . . . . . “R\u00F3bert Busa-Fekete等人:基于偏好的强化学习:使用基于偏好的竞赛算法的进化直接策略搜索。(2014)”。 . . . _:ID_6bb87f2a1fd53504a28ce22ca6f0dc43。_:ID_6bb87f2a1fd53504a28ce22ca6f0dc43 ._:ID_6bb87f2a1fd53504a28ce22ca6f0dc43 ._:ID_6bb87f2a1fd53504a28ce22ca6f0dc43“期刊/ml/Busa-FeketeSWCH14”。 _:ID_e70922eaea7a50c75f6ac2a07147514d。_:ID_e70922eaea7a50c75f6ac2a07147514d ._:ID_e70922eaea7a50c75f6ac2a07147514d ._:ID_e70922eaea7a50c75f6ac2a07147514d“10.1007/S10994-014-5458-8”。 _:ID_264745b25e0eb70bd70b0eae44b5b002。_:ID_264745b25e0eb70bd70b0eae44b5b002 ._:ID_264745b25e0eb70bd70b0eae44b5b002 ._:ID_264745b25e0eb70bd70b0eae44b5b002“Q115146321”。 “基于偏好的强化学习:使用基于偏好的竞赛算法的进化直接政策搜索。”。 . . . . . . "5"^^. _:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_1。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_1 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_1“R\u00F3bert Busa-Fekete”。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_1 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_1"1"^^._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_1 . _:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_2。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_2 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_2“余额\u00E1zs Sz\u00F6r\u00E9nyi”。_:Sig_d63b0c1deb6c6ac00d0f6f20728811f_2 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_2"2"^^._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_2 . _:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_3。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_3 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_3“保罗·翁”。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_3 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_3"3"^^._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_3 . _:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_4。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_4 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_4“魏伟成”。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_4 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_4"4"^^._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_4 . _:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_5。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_5 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_5“Eyke H\u00FCllermier”。_:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_5 ._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_5"5"^^._:Sig_d63b0c1deb6c6ac00d0f6f20728811ff_5 . . . . "327-351" . “马赫。学习。”。 “马赫。学习。”。 “97”。 "3" . “2014年”^^. “dblp记录'journals/ml/Busa-FeketeSWCH14'的RDF数据的起源信息”。 . . . “2023-08-28T21:35:33+0200”。