{“状态”：“确定”，“消息类型”：“工作”，“信息版本”：“1.0.0”，“邮件”：{“索引”：{“日期-部件”：[[2024,6,21]]，“日期-时间”：“2024-06-21T23:36:52Z”，“时间戳”：1719013012422}，“发布者位置”：“美国纽约州纽约市”，“引用-计数”：77，“发布商”：“ACM”，“许可证”：[{“开始”：{-“日期-零件”：[2023,3,13]]，时间“：”2023-03-13T00:00:00Z“，”时间戳“：1678665600000}，“content-version”：“vor”，“delay-in-days”：0，“URL”：“http://www.acm.org\/publications\/policys\/corpyright_policy#Background”}]，“funder”：[{“DOI”：“10.13039\/1000000001”，“name”：“NSF（National Science Foundation）”，“DOI-asserted-by”：“publisher”，“award”：[“IIS-1924802，IIS-2106690”]}]，“content-domain”：{“domain”：[“dl.acm.org”]，“crossmark-restriction”：true}，“short-container-title”：[]，“published-print”：{“date-parts”：[[2023,3,13]]}，”DOI“：”10.1145\/3568162.3576986“，”type“：”proceedings-article“，”created“：{”date-part“：[2023,3,9]]，”date-time“：”2023-03-09T18:08:48Z“，”timestamp“：1678385328000}”，“update-policy”：“http://\/d x.DOI.org\/10.1145\/crossmark-policy“，”source“：”Crossref“，“is-referenced-by-count”：4，“title”：[“人与机器人交互中协调隐式和显式人类反馈的自我注释方法”]，“prefix”：“10.1145”，“author”：[{“ORCID”：“http://\/ORCID.org\/00000-0002-8535-2771”，“authenticated-ORCID”：false，“give”：“Qiping”，“family”：“Zhang”，“sequence”：“first”，“affiliation”：[}“name”：“美国康涅狄格州纽黑文市耶鲁大学”}]}，{“ORCID”：“http://\/ORCID.org\/0000-0002-0320-5795”，“authenticated-ORCID”：false，“given”：”奥斯汀“，“family”：“Narcomey”，“sequence”：“additional”，“affiliation”：[{“name”：“美国科涅狄格省纽黑文耶鲁大学（Yale University，New Haven，CT，USA）}]}，{”ORCID“：”：“http:/\/ORCID=org\/00000-0002-0152-053X”，“authenticated-ORCID“：false”，“given”：“Kate”，“family”：“Candon”，“sequence”：“additional”，“affiliation”：[{“name”：“美国康涅狄格州纽黑文耶鲁大学”}]}，{“ORCID”：“http://\/ORCID.org\/00000-0003-0698-5472”，“authenticated-ORCID”：false，“given”：”Marynel“，”family“：”V\u00e1zquez“，”sequence“：”additional““，”在线发布“：{“date-parts”：[[2023,3,13]]}，“reference”：[{“key”：“e_1_3_2_1_1”，“doi-asserted-by”：“publisher”，”doi“：”10.1145\/3319502“}，{“key”：”e_1_ 3_2_2_1“，”doi-assert-by“：”publisher“：“荒川里库（Riku Arakawa Sosuke Kobayashi Yuya Unno Yuta Tsuboi）和前田信义（Shin-ichi Maeda），2018年。DQN-TAMER：具有顽固反馈的人在回路中强化学习。https:\/\/doi.org\/10.48550\/ARXIV.1810.11748“，”doi“：”10.48550\/ARXV.1810.1748“}，{”key“：”e_1_3_2_2_4_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1016\/j.robot.2008.10.024”}，“key”：“e_3_2_5_1”，“doi-assert-by”：“publisher”，”doi:“10.1109\/FG.2018.00019”}{“键”：“e_1_3_2_2_6_1”，“doi-asserted-by”：“出版商”，“doi”：“10.1145\/3453445”}，{“密钥”：“e_1_3_2_2_7_1“，“doi-asserted-by”：“publisher”，“doi”：“10.1017\/9781108676649”}，{“key”：“e_1_ 3_2_8_1”，”doi-assert-by“：”publisher“，”doi“：”10.1145\/3136755.3136814“}，”{“密钥”：“e_1_3_2_9_1”、“volume-title”：“人类计算的人工智能，Thomas S”，“author”：“Broekens Joost”，“unstructured”：“Joost Broost”埃肯斯，2007年。情感和强化：情感面部表情促进机器人学习。托马斯·黄（Thomas S.Huang）、安东·尼霍尔特（Anton Nijholt）、马贾·潘蒂奇（Maja Pantic）和亚历克斯·彭特兰（Alex Pentland）主编《人工智能用于人类计算》。Springer Berlin Heidelberg，Berlin，Heidelbeg，113-132。“}，{“key”：“e_1_3_2_10_1”，“volume-title”：“第22届自治代理和多代理系统国际会议（AAMAS’23）的议事录”，“author”：“Candon Kate”，“year”：“2023”，“unstructured”：“Kate Candon，Zoe Hsu，Yoony Kim，Jesse Chen，Nathan Tsoi，and Marynel V\u00e1zquez.2023。非语言人类信号可以帮助自主代理推断人类对其行为的偏好。程序中。第22届自主代理和多代理系统国际会议（AAMAS’23）。IFAAMAS。“}，{”key“：”e_1_3_2_2_11_1“，”doi-asserted-by“：”publisher“，”doi“：”10.1145 \/3568162.3576980“}”，{“key”：“e_1_ 3_2_12_1”，“doi-assert-by”：“publisher”，“doi”：“10.1093 \/oxfordhb”}，}}，{“键”：“e_1_3_2_2_14_1”，“doi-asserted-by”：“出版商”，“doi”：“10.1016\/B978-0-12-813445-0.00010-1“}，{”key“：”e_1_3_2_2_15_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/TAFFC.2017.2737019”}，“key”：“e_1_ 3_2_16_1”，“volume-title”：“Thomaz”，”author“：”Chernova Sonia“，”year“：”2014“，”unstructured“：”Sonia Chernova and Andrea L.Thomaz.2014。机器人向人类教师学习。Morgan&Claypool出版社。“}，{”key“：”e_1_3_2_2_17_1“，”doi-asserted-by“：”publisher“，“doi”：“10.3115\/v1”}，“key”：“e_1_ 3_2_18_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/ICRA.2019.8794065{“键”：“e_1_3_2_20_1”，“doi-asserted-by”：“出版商”，“doi”：“10.24963\/ijcai.2021\/599”}，{“key“：”e_1_3_2_21_1“，”doi-asserted-by“：”publisher“，”doi“：”10.1609\/aaai.v35i18.17998“}，{“key”：“e_1_ 3_2_2_22_1”，“doi-assert-by”：“publisher”，“doi”：“10.1037\/a0020019”}，“key“:”e_3_2_33_1“，{“键”：“e_1_3_2_24_1”，“doi-asserted-by”：“出版商”，“doi”：“10.5555\/3016387.3016461”}，{“key“：”e_1_3_2_25_1“，”volume-title“：”Weinberger（Eds.）“，”卷“：”26“，”author“：”Griffith Shane“，”年份“：”2013“，”非结构化“：”Shane Griffith.，Kaushik Subramanian，Jonathan Scholz，Charles L Isbell，and Andrea L Thomaz。2013.政策塑造：将人的反馈与强化学习相结合。《神经信息处理系统进展》，C.J.Burges、L.Bottou、M.Welling、Z.Gahramani和K.Q.Weinberger（编辑），第26卷。Curran Associates，Inc.https:\/\/procedures.neurips.cc\/paper\/2013\/file\/e034fb6b66aacc1d48f445ddfb08da98-paper.pdf“}，{“key”：“e_1_3_2_26_1”，“doi-asserted-by”：“publisher”，”doi“：“10.1016\/S0166-4115（08）62386-9”}，“key“：”e_1_ 3_2_2_27_1“，”doi-assert-by“：”publisher“，“doi”：“10.1080 \/01621459.1977.10480998“}，{”键“：”e_1_3_2_2_28_1“，”doi-asserted-by“：”publisher“，”doi“：”10.1177\/1754073912451331“}，{“key”：”e_1_3_2_29_1“，“doi-assert-by”：“publisher”，“doi”：“10.1037\/0278-739.3.10.4.598”}，“key“：”e_ 1_3_2 _2_30_1“、”volume-title“：”语言学注释手册“，”author“：”Ide Nancy“，”unstructured“：”Nancy Ide and James Pustejovsky.2017。语言注释手册（第1版）。斯普林格出版公司。“，”edition“：”1“}，{”key“：”e_1_3_2_2_31_1“，”doi-asserted-by“：”publisher“，”doi“：”10.1145\/375735.376334“}”，{“key”：“e_1_ 3_2_32_1”，“doi-assert-by”：“publisher”，“doi”：“10.1016\/j.cub.2015.05.052”}，“key“:”e_3_2_33_1“Dinesh Babu Jayagopi Samira Sheikhi David Klotz Johannes Wienke Jean-Marc Odobez Sebastian Wrede Vasil Khalidov Laurent Nguyen Britta Wrede和Daniel Gatica-Perez。2012.Vernissage语料库：一个多模式人-机器人交互数据集。(2012) 8. http:\/\/infoscience.epfl.ch\/record\/182715“，“DOI”：“10.1109\/HRI.2013.6483545”}，{“key”：“e_1_3_2_2_34_1”，“volume-title”：“Lin（Eds.）”，“卷”：“33”，“作者”：“Jeon Hong Jun”，“年份”：“2020”，“非结构化”：“Hong Jun-Jeon，Smitha Milli，and Anca Dragan.2020。奖励-国家（隐含）选择：奖励学习的统一形式。《神经信息处理系统进展》，H.Larochelle、M.Ranzato、R.Hadsell、M.F.Balcan和H.Lin（编辑），第33卷。Curran Associates公司，4415--4426。https:\/\/会议记录。neurips.cc\/paper\/2020\/file\/2f10c1578a0706e06b6d7db6f0b4a6af-paper.pdf“}，{“key”：“e_1_3_2_2_35_1”，“doi-asserted-by”：“publisher”，”doi“：”10.1145\/2753767“}”，{”key“：”e_3_2_36_1“，”volume-title“Goffman面对面交互的方法。Erving Goffman:探索交互顺序”，“author”：“Kendon Adam”，“年份”：“1988年”，“非结构化”：“亚当·肯登。1988.戈夫曼面对面交流的方法。欧文·戈夫曼：探索交互顺序（1988）。“}，{”key“：”e_1_3_2_2_37_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1145\/2909824.3020226”}，“key”：“e_1_a_2_38_1”，“doi-assert-by”：“publisher”，”doi“：”10.1007\/s12369-012-0163-x“}“}，{”键“：”e_1_3_2_2_40_1“，”doi-asserted-by“：”出版商“，”doi“：”10.1007\/978-3-319-02675-6_46“}，{“key”：“e_1_3_2_2_41_1”，“doi-asserted-by”：“publisher”，”doi“：”10.1109\/ACII52823.2021.9597447“}”，{”key“：”e_1_ 3_2_42_1“，”volume-title“：”Jinjuan Heidi Feng，and Harry Hochheiser“，”author“：”Lazar Jonathan“，“year”：“2017”，“unstructured”：“Jonational Lazar，Jinjua Heidi Feng，and Harry Hochhelie 2017年2月。人机交互研究方法。摩根·考夫曼。“}，{”key“：”e_1_3_2_2_43_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1145\/2696454.2696466”}，“{”key“：“e_1_ 3_2_44_1”，“doi-assert-by”：“publisher”，”doi“：”10.1007\/s10458-020-09447-w“}“：”李光亮“，”非结构化”：“Li Guangliang、Hayley Hung、Shimon Whiteson和W.Bradley Knox。2013年，《利用信息行为提高Tamer框架中的参与度》，《2013年自主代理和多代理系统国际会议论文集》（美国明尼苏达州圣保罗）（AAMAS’13）。国际自治代理和多代理系统基金会，Richland，SC，909--916.“}，{”key“：”e_1_3_2_46_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/ACCESS.2020.3006254”}，“key”：“e_1_ 3_2_47_1”，“doi-assert-by”：“publisher”，”doi“：”10.1007\/s10458-015-9283-7：“解耦权重衰减正则化。在学习代表国际会议上。https:\/\/openreview.net\/forum？id=Bkg6RiCqY7“，”author“：”Loshchilov Ilya“，”year“：”2019“，”unstructured“：”Ilya Loshchirov and Frank Hutter“。2019.解耦重量衰减规律化。在学习代表国际会议上。https:\/\/openreview.net\/forum？id=Bkg6RiCqY7“}，{“key”：“e_1_3_2_2_49_1”，“volume-title”：“机器学习国际会议。PMLR，2285-2294”，“author”：“麦克拉珊·詹姆斯”，“year”：“2017”，“unstructured”：“詹姆斯·麦克拉珊，Mark K Ho，Robert Loftin，Bei Peng，Guan Wang，David L Roberts，Matthew e Taylor，and Michael L Littman.2017。从政策相关的人的反馈中进行交互式学习。在机器学习国际会议上。PMLR，2285--2294.“}，{”key“：”e_1_3_2_2_50_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/HRI53351.2022.9889395”}，“key”：“e_1_ 3_2_51_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/ACII.2019.8925434“}“：”10.1145 \/3522579“}，{”key“：”e_1_3_2_2_53_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1145\/2522848.2522865“}，{“key”：“e_1_3_2_2_54_1”，“volume-title”：“移情对话：上下文化对话的多级数据集。arXiv预印本arXiv:2205.12698”，“author”：“Omitaomu Damilola”，“year”：“2022”，“unstructured”：“Damilola Omitaomu、Shabnam Tafreshi、Tinting Liu、Sven Buechel、Chris Callison Burch、Johannes Eichstaedt、Lyle Ungar和Jo\u00e3o Sedoc。2022.移情对话：情境化对话的多层次数据集。arXiv预印arXiv:2205.12698（2022）。“}，{”key“：”e_1_3_2_2_55_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/ICCV.2017.528”}，“key”：“e_1_ 3_2_56_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/ICORR“}”，{“key“:”e_2_2_57_1“、”volume-title“：”ICRA开放源代码软件研讨会3.“，”作者：“Quigley Morgan”，年：“2009年”，“非结构化”：“Morgan Quigley、Ken Conley、Brian Gerkey、Josh Faust、Tully Foote、Jeremy Leibs、Rob Wheeler和Andrew Ng，2009年。ROS：一个开源机器人操作系统。ICRA开放源码软件研讨会3.“}，{”key“：”e_1_3_2_2_58_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1037\/1528-3542.2.3.273”}，“key”：“e_1_ 3_2_59_1”，“首页”：“99”，“article-title”：“多模态注释工具比较-研讨会报告”，“卷”：“7”，“作者”：“Rohlfing Katharina”，“年份”：“2006”，“非结构化”：“Katharina Rohlfing、Daniel Loehr、Susan Duncan、Amanda Brown、Amy Franklin、Irene Kimbara、Jan-Torsten Milde、Fey Parrill、Travis Rose、Thomas Schmidt等人，2006年。多模式注释工具的比较-车间报告。Gespr\u00e4chforschung-Online-Zeitschrift zur Verbalen Interaktion 7（2006），99-123.“，“日记标题”：“Gespr\u00e4mchforschong-Online-Zeitschriff zur Verbalen Interaktion”}，{“key”：“e_1_3_2_60_1”，“volume-title”：“Seshia”，“author”：“Sadigh-Dorsa”，“year”：“2017”，“unstructured”：“Dorsa Sadigh，Anca D.Dragan，S.Shankar Sastry，and Sanjit A。塞希亚。2017年，基于积极偏好的奖励功能学习。机器人学：科学与系统。“}，{”key“：”e_1_3_2_2_61_1“，”doi-asserted-by“：”publisher“，“doi”：“10.15607\/RSS.2016.XII.029”}，“key”：“e_1_a_2_62_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/ACCES.2016.2614525“}“}，{”键“：”e_1_3_2_64_1“，”卷时间“：”奖励推断的人类偏见。《第36届国际机器学习会议论文集》（Proceedings of the 36 International Conference on Machine Learning Research），“卷”：“5679”，“作者”：“Shah Rohin”，“年份”：“2019”，“非结构化”：“Rohin Shah，Noah Gundotra，Pieter Abbeel，and Anca Dragan。2019。关于学习而非假设的可行性，人类对奖励推断的偏见。第36届国际机器学习会议论文集（机器学习研究论文集，第97卷），Kamalika Chaudhuri和Ruslan Salakhut-dinov（编辑）。PMLR，5670-5679。https:\/\/procedures.mlr.press\/v97\/shah19a.html“}，{“key”：“e_1_3_2_2_65_1”，“doi-asserted-by”：“publisher”，”doi“：”10.1109\/IROS4761.2022.9981726“}”，{”key“：”e_1_ 3_2_66_1“，”volume-title“：”广义线性混合模型：现代概念、方法和应用程序“，”author“：”Stroup Walter W“，”unstroup“：”Walter W Stroup.2012。广义线性混合模型：现代概念、方法和应用。CRC出版社。“}，{”key“：”e_1_3_2_2_67_1“，”doi-asserted-by“：”publisher“，“unstructured”：“Halit Bener Suay和Sonia Chernova，2011。人的引导和状态空间大小对交互式强化学习的影响。2011年RO-MAN.1-6。https:\/\/doi.org\/10.109\/ROMAN.2011.6005223“，”doi“：”10.1109\/ROMAN.2011.60005223“}，{“key”：“e_1_3_2_2_68_1”，“volume-title”：“强化学习：简介”，“作者”：“Sutton Richard S”，“非结构化”：“Richard S-Sutton和Andrew G Barto，2018。强化学习：简介。麻省理工学院出版社。“}，{”key“：”e_1_3_2_2_69_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1016\/j.artint.2007.09.009”}，“key”：“e_1_ 3_2_70_1”，“doi-assert-by”：“publisher”，”doi“：”10.1109\/IROS51168.2021.9636319机器学习研究“卷：“293”，“作者”：“Quiros Jose Vargas”，“年份”：“2022”，“非结构化”：“Jose Varkas Quiros，Stephanie Tan，Chirag Raman，Laura Cabrera-Quiros和Hayley Hung。2022.Covfee：一个可扩展的网络框架，用于人类行为的连续注释。克里斯蒂娜·帕尔梅罗（Cristina Palmero）、朱利奥·C·S·雅克（Julio C.S.Jacques Junior）、阿尔伯特·克拉普（Albert Clap\u00e9s）、伊莎贝拉·盖恩（Isabelle Guyon）、韦韦·图（Wei-Wei Tu）、托马斯·莫斯伦德（Thomas B.Moeslund）和塞尔吉奥·埃斯卡莱拉（Sergio Escalera）（编辑）。PMLR，第265--293页。https:\/\/procedures.mlr.press\/v173\/vargas-quiros22a.html“}，{“key”：“e_1_3_2_2_72_1”，“volume-title”：“面部评估：使用面部表情和强化学习训练用户界面。arXiv预印本arXiv:1606.02807”，“author”：“Veeriah Vivek”，“year”：“2016”，“unstructured”：“Viveek Veeria，Patrick M Pilarski，and Richard S Sutton.2016。面部评价：通过面部表情和强化学习训练用户界面。arXiv预印arXiv:1606.02807（2016）。“}，{”key“：”e_1_3_2_2_73_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1145\/2559636.2559684”}，“key”：“e_1_ 3_2_74_1”，“doi-assert-by”：“publisher”，”doi“：”10.1007\/BF02504798“}，{“键”：“e_1_3_2_76_1”，“卷标”：“第五届国际语言资源与评价会议记录（LREC’06）”，“作者”：“Wittenburg-Peter”，“年份”：“2006年”，“非结构化”：“Peter Wittenberg，Hennie Brugman，Albert Russel，Alex Klassmann，and Han Sloetjes。2006年，ELAN：多模态研究的专业框架。《第五届国际语言资源与评估会议论文集》（LREC'06）。意大利热那亚欧洲语言资源协会（ELRA）。http://www.lrec-conf.org \/processes\/lrec2006\/pdf\/153_pdf.pdf“}，{”key“：”e_1_3_2_77_1“，”doi-asserted-by“：”publisher“，“doi”：“10.1109\/ICRA48506.2021.9562098”}]，”event“：{”name“：”HRI'23:ACM\/IEEE国际人机交互会议“，”location“：”Stockholm Sweden“，”缩写词：“HRI'23”，“赞助商”：[“SIGAI ACM人工智能特别兴趣小组”，“SIGCHI ACM计算机与人类交互特别兴趣小组“]}，“container-title”：[“2023年ACM\/IEEE人类与机器人交互国际会议论文集”]，“原始标题”：[]，“链接”：[{“URL”：“https:\/\/dl.ACM.org\/doi\/pdf\/10.1145\/3568162.3576986”，“content-type”：“application \/pdf“，”content-version“：”vor“，”intended-application“：”syndication“}，{“URL”：“https:\/\/dl.acm.org\/doi\/pdf\/10.1145\/3568162.3576986”，“content-type”：“unspecified”，“content-version”：“vor”，“intended-plication”：“similarity-checking”}]，“deposed”：{“date-parts”：[2024,3,13]]，“date-time”：”2024-03-13T10:39:48Z“，”时间戳“：1710326388000}，”score“：1，”resource“：{主要”：{“URL”：“https:\/\/dl.acm.org\/doi\/10.1145\/3568162.3576986”}}，”subtitle“：[]，”shorttitle“：[]，”issued“：{date-parts”：[[2023,3,13]]}，“references-count”：77，“alternative-id”：[“10.1145\/35686162.3576996”，“10.1145\/3568162”]，”URL“：”http://dx。doi.org \/10.1145\/3568162.3576986“，”关系“：{}，”主题“：[]，”发布“：{”date-parts“：[[2023,3,13]]}，”assertion“：[{”value“：”2023-03-13“，”order“：2，”name“：”published“，”label“：”published“，”group“：{”name“:”publication_history“，”标签“：”publication history“}}]}}