{“状态”：“确定”，“消息类型”：“工作”，“信息版本”：“1.0.0”，“邮件”：{“索引”：{-“日期-部分”：[[2023,4,6]]，“日期-时间”：“2023-04-06T04:33:32Z”，“时间戳”：1680755612679}，“出版商位置”：“美国纽约州纽约市”，“引用-计数”：25，“出版者”：“ACM”，“内容-域”：[“dl.ACM.org”]，“交叉标记-严格离子“：true}，”短容器时间“：[]，“published-print”：{“date-parts”：[[2022,4,5]]}，“DOI”：“10.1145\/3517207.3526968”，“type”：“proceedings-article”，“created”：{“date-parts”：[[202022,3,29]]，“date-time”：“2022-03-29T22:09:26Z”，“timestamp”：1648591766000}，《update-policy》：“http://\/dx.DOI.org\/10.1145\/crossmark-policy”，“source”“：”Crossref“，”is-referenced-by-count“：0，”title“：[“时间移位强化学习”]，“前缀”：“10.1145”，“作者”：[{“given”：“Deepak George”，“family”：“Thomas”，“sequence”：“first”，“affiliation”：[}“name”：“Department of Computer Science”}]}，{“给定”：“Tichakorn”，“家族”：“Wongpiromsarn”，“序列”：“additional”，“filiation“[{”name“：”Department for Computer Science”{]}：“Jannesari”，“sequence”：“additional”，“affiliation”：[{“name”：“Department of Computer Science”}]}]，“member”：“320”，“published-on-line”：{“date-parts”：[2022,4,5]]}，“reference”：[}“key”：“e_1_3_2_1_1_1_1”，“volume-title”：“Conference on Robot Learning。PMLR，156-168”，“author”：“Amiranashvili Artemij”，“year”：“2018”，“unstructured”：“Artemij Amiranashvili，Alexey Dosovitskiy，Vladlen Koltun，and Thomas Brox.2018。动态对象强化学习中的运动感知。机器人学习会议。PMLR，156-168。Artemij-Amiranashvili，Alexey Doshovitskii，Vladelen Koltun.and Thomas-Brox.2018年。动态物体强化学习中的运动感知。在机器人学习会议上。PMLR，156--168.“}，{“key”：“e_1_3_2_1_2_1”，“volume-title”：“Miles Brundage，and Anil Anthony Bharath”，“author”：“Arulkumaran Kai”，“year”：“2017”，“unstructured”：“Kai Arulkumanan，Marc Peter Deisenroth，Miles Brundage，and Anil-Anthony Bhrath。2017。深度强化学习的简要调查。arXiv预印本arXiv:1708.05866（2017）Kai Arulkumaran、Marc Peter Deisenroth、Miles Brundage和Anil Anthony Bharath。2017年，深度强化学习的简要调查。arXiv预印arXiv:1708.05866（2017）。“}，{”key“：”e_1_3_2_1_3_1“，”doi-asserted-by“：”publisher“，“doi”：“10.5555\/2566972.2566979”}，“key”：“e_1_ 3_2_1_4_1”，“volume-title”：“Openai健身房。arXiv预印本arXiv:1606.01540”，“作者”：“Brockman Greg”，“年份”：“2016”，“非结构化”：”Greg Brockman、Vicki Cheung、Ludwig Pettersson、Jonas Schneider、John Schulman、Jie Tang和Wojciech Zaremba。2016年，Openai健身房。arXiv预印arXiv:1606.01540（2016）。Greg Brockman、Vicki Cheung、Ludwig Pettersson、Jonas Schneider、John Schulman、Jie Tang和Wojciech Zaremba。2016年，Openai健身房。arXiv预印arXiv:1606.01540（2016）。“}，{”key“：”e_1_3_2_1_5_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/CVPR.2017.502”}，“key”：“e_1_ 3_2_1 _6_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/ICRA.2017.7989384“}”，{年份”：“2010年”，“非结构化”：“Hado Hasselt.2010。双Q学习。神经信息处理系统的进展23（2010），2613-2621。哈多·哈塞尔特。2010年。双Q学习。神经信息处理系统进展23（2010），2613--2621.“}，{”key“：”e_1_3_2_1_8_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/ICRA.2019.8794033”}，“key”：“e_1_ 3_2_1_ 9_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/CVPR.2014.223“}”，{使用增强数据进行强化学习。arXiv预印本arXiv:2004.14990“，“作者”：“拉斯金·迈克尔”，“年份”：“2020”，“非结构化”：“迈克尔·拉斯金、基敏·李、亚当·斯托克、莱罗·平托、彼得·阿比埃尔和阿拉文德·斯里尼瓦斯。2020年，使用增强数据强化学习。arXiv预印本arXiv:2004.14990（2020）。迈克尔·拉斯金、Kimin Lee、Adam Stooke、Lerre Pinto、Pieter Abbeel和Aravind Srinivas。2020年，使用增强数据强化学习。arXiv预印本arXiv:2004.14990（2020）。“}，{”key“：”e_1_3_2_11_1“，”doi-asserted-by“：”publisher“，”doi“：”10.1109“\/ICCV.2019.00718”}，“key”：“e_1_ 3_2_12_1”，“doi-assert-by”：“crossref”，“unstructured”：“Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemarc G Alex Graves Martin Riedmiller Andreas K Fidjeland Georgo Ostrovski etal 2015。通过深度强化学习进行人性化控制。《自然》518 7540（2015）529--533。Volodymyr Mnih Koray Kavukcuoglu David Silver Andrei A Rusu Joel Veness Marc G Bellemare Alex Graves Martin Riedmiller Andreas K Fidjeland Georg Ostrovski等人，2015年。通过深度强化学习进行人性化控制。nature 518 7540（2015）529--533.“，”DOI“：”10.1038\/nature14236“}，{“key”：“e_1_3_2_13_1”，“unstructured”：“Bradford W Mott and Stella Team。1996。Stella：多平台Atari 2600 VCS模拟器。布拉德福德·W·莫特和斯特拉团队。1996年。Stella：多平台Atari 2600 VCS模拟器。“}，{”key“：”e_1_3_2_14_1“，”volume-title“：”SAI智能系统会议记录。Springer，115-128“，”author“：”Roghair Jeremy“，”year“：”2021“，”unstructured“：”杰里米·罗海尔（Jeremy Roghair）、埃米尔·尼亚拉基（Amir Niaraki）、京泰高（Kyungtae Ko）和阿里·贾内萨里（Ali Jannesari）。2021 . 一种基于视觉的无人机避障深度强化学习算法。SAI智能系统会议记录。施普林格，115-128。Jeremy Roghair、Amir Niaraki、Kyungtae Ko和Ali Jannesari，2021年。一种基于视觉的无人机避障深度强化学习算法。SAI智能系统会议记录。Springer，115-128.“}，{“key”：“e_1_3_2_1_15_1”，“volume-title”：“优先体验重播。arXiv预印本arXiv:1511.05952”，“作者”：“Schaul Tom”，“年份”：“2015”，“非结构化”：“Tom Schaul，John Quan，Ioannis Antonoglou，and David Silver.2015。优先体验重播。arXiv预印本arXiv:11511.05952（2015）。Tom Schaul、John Quan、Ioannis Antonoglou和David Silver。2015年，优先体验重播。arXiv预印本arXiv:11511.05952（2015）。“}，{”key“：”e_1_3_2_16_1“，”volume-title“：”强化学习与潜流。arXiv预印本arXiv:2101.01857“，”author“：”尚温玲“，”year“：”2021“，”unstructured“：”温玲商、王晓飞、Aravind Srinivas、Aravid Rajeswaran、杨高、彼得·阿比埃尔和迈克尔·拉斯金。2021。用潜流强化学习。arXiv预印本arXiv：2101.01857（2021）。尚雯玲、王晓菲、Aravind Srinivas、Aravind Rajeswaran、Yang Gao、Pieter Abbeel和Michael Laskin。2021.用潜流强化学习。arXiv预印本arXiv：2101.01857（2021）。}，{“key”：“e_1_3_2_1_17_1”，“volume-title”：“视频中动作识别的双流卷积网络。arXiv预印本arXiv:1406.2199”，《作者》：“西蒙尼安·凯伦”，《年份》：“2014年”，“非结构化”：“凯伦·西蒙尼安和安德鲁·齐瑟曼。2014年。视频中动作识别的双流卷积网络。arXiv预印本arXiv:1406.2199（2014）。凯伦·西蒙扬和安德鲁·齐瑟曼。2014.视频中动作识别的双流卷积网络。arXiv预印本arXiv:1406.2199（2014）。“｝，｛”键“：”e_1_3_2_1_18_1“，”卷标题“：”卷曲：强化学习的对比无监督表示。arXiv预印本arXiv:2004.04136“，”作者“：”Srinivas Aravind“，”年份“：”2020“，”非结构化“：”Aravind Srinivas、Michael Laskin和Pieter Abbeel。2020 . 卷曲：强化学习的对比无监督表征。arXiv预印本arXiv:2004.04136（2020）。Aravind Srinivas、Michael Laskin和Pieter Abbeel。2020年。卷曲：强化学习的对比无监督表征。arXiv预印本arXiv:2004.04136（2020）。“}，{”key“：”e_1_3_2_1_19_1“，”unstructured“：”Chandan Kumar Subrahmanyam Vaddi和Ali Jannesari.2019。实时无人机应用的高效目标检测模型。(2019). 2019年，Chandan Kumar Subrahmanyam Vaddi和Ali Jannesari。实时无人机应用的高效目标检测模型。(2019).“}，{”key“：”e_1_3_2_1_20_1“，”volume-title“：”David Budden，Abbas Abdolmaleki，Josh Merel，Andrew Lefrancq，et al.“，”author“：”Tassa Yuval“，“year”：“2018”，“unstructured”：“Yuval Tassa、Yotam Doron、Alistair Muldal、Tom Erez、Yazhe Li、Diego de Las Casas、David Budden、Abbas Abdolmaleki、Josh Merel、Andrew Lefrancq等人，2018年。Deepmind控制套件。arXiv预印arXiv:1801.00690（2018）。Yuval Tassa、Yotam Doron、Alistair Muldal、Tom Erez、Yazhe Li、Diego de Las Casas、David Budden、Abbas Abdolmaleki、Josh Merel、Andrew Lefrancq等人，2018年。Deepmind控制套件。arXiv预印arXiv:1801.00690（2018）。“}，{”key“：”e_1_3_2_1_21_1“，”doi-asserted-by“：”publisher“，”doi“：”10.1109\/ICCV.2015.510“}、{”密钥“：”e_1_3_2_1_22_1“、”doi-aserted-by”：“publisher”，“doi”：“10.1609\/aaai.v30i1.10295 46484-8_2“}，{”key“：”e_1_3_2_1_24_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1007\/978-3-030-01246-5_49“}，{”key“：”e_1_3_2_1_25_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1007\/978-3-030-01216-8_43”}]，”event“：{”name“：”EuroSys“22:第十七届欧洲计算机系统会议”，“location”：“Rennes France”，“缩写词”：“EuroSys'22”，“赞助商”：[“SIGOPS ACM操作系统特别利益小组”]}，“container-title”：[”第二届欧洲机器学习与系统研讨会论文集“]，“original-title”：[]，“link”：[{“URL”：“https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3517207.3526968”，“content-type”：“unspecified”，“content-version”：“vor”，“intended-application”：“similarity-checking”}]，“deposed”：{“date-parts”：[2023,4,5]]，“date-time”：”2023-04-05T10:07:54Z“，”timestamp“：1680689274000}，”score“：1，”resource“：{“primary”：{”URL“：”https:\/\/dl.acm.org\/doi\/10.1145\/3517207.3526968“}}，“subtitle”：[]，“shorttitle”：[]，“issued”：{“date-parts”：[2022,4,5]]}，《references-count》：25，“alternative-id”：[“10.1145\/351727.35269968”，“10.1145\/3517207”]，“URL”：“http://\/dx.doi.org\/10.1145\/3517207.3526968”，“关系”：{}，“subject“：[]，”published“：{“date-parts”：[2022,4,5]]}，”assertion“：[{“value”：“2022-04-05”，“order”：2，“name”：“published”，“label”：“published”，“group”：{”name“：”publication_history“，”label“：”publication history“}}}]}