｛“status”：“ok”，“message type”：“work”，“message version”：“1.0.0”，“message”：｛“indexed”：｛“date parts”：[[2022,4,4]]，“date-time”：“2022-04-04T18:43:59Z”，“timestamp”：1649097839226｝，“reference count”：15，“publisher”：“Walter de Gruyter GmbH”，“issue”：“4”，“license”：[｛“start”：｛“date parts”：[[2012,1,11]，“date-time”：“2012-01-01T00:00:00Z”，“timestamp””：132537600000}，“content-version”：“tdm”，“delay-in-days”：0，“URL”：“http://www.springer.com/tdm”}]，“content-domain”：{“domain”:[]，“crossmark-restriction”：false}，”short-container-title“：[]，”published-print“：{”date-parts“：[2012,1,1]]}，‘abstract’：“摘要<\/jats:title>机器人平台中序列学习的实现带来了几个挑战。决定何时停止一个动作并继续下一个动作需要在感官信息的稳定性和下一个需要什么动作的知识之间取得平衡。这里介绍的工作为成功执行和学习动态序列提供了一个起点。利用NAO仿人平台，我们提出了一个基于动态场理论和强化学习方法的数学模型，用于获取和执行一系列基本运动行为。给出了用于序列生成的两种强化学习方法的仿真和实现结果<\/jats:p>“，”DOI“：”10.2478\/s13230-013-0109-5“，”type“：”journal-article“，”created“：{”date-parts“：[[2013,5,1]]，”date-time“：”2013-05-01T18:35:12Z“，”timestamp“：1367433312000}“，”source“：“Crossref”，“is-referenced-by-count”：1，“title”：[“通过强化学习基于DFT的序列：NAO实现”]，“前缀”：“10.2478”，“volume”：“3”，“作者“：[{”给定“：”Boris“，”family“：”Dur\u00e1n“，”sequence“：”first“，”affiliation“：[]}，{”给出“：”Gauss“，”家庭“：”Lee“，”序列“：”additional“，”从属“：[]}，“给定”：“Robert”，“family”：“Lowe”，“sequence”：“additional”，“affiliance”：[]{，“member”：“374”，“reference”：[{”key“：”109_CR1“，“first page”：“77”，“卷”：“27”，“作者”：“S Amari”，“年份”：“1977”，“非结构化“：”S.Amari，《侧向抑制型神经场模式形成的动力学》，《生物控制论》，第27卷，第77页，第201387页，1977年2008年”，“非结构化”：“G.Sch\u00f6ner，剑桥计算认知建模手册。R.Sun，英国：剑桥大学出版社，2008，ch.《认知的动力系统方法》，第101\u2013126页。Schoner，\u201cTarget在带有低电平传感器的自动车辆上的表示。\u201d《国际机器人研究杂志》，第19卷，第5期，第424\u20134472000年5月。[在线]。可用：http://\/dx.doi.org\/10.1177\/02783640022066950“，”journal-title“：”The International journal of Robotics Research“}，{“key”：“109_CR4”，“doi-asserted-by”：“crossref”，“doi”：“10.1109\/DEVLRN.2007.4354022”，“volume-title”：“On The development of intentity understanding for joint action taskes”，“author”：“W Erlhagen”，年：“2007”，“unstructured”：“W。Erlhagen，A.Mukovskiy，F.Chersi，E.Bicho，《关于联合行动任务意图理解的发展》，2007年第二期。Sandamirskaya和G.Sch\u00f6ner，《序列顺序的具体描述：不稳定性如何驱动序列生成》，《神经网络》，第23卷，第10期，第1164\u201311792010年12月。“，“期刊标题”：“神经网络”}，{“key”：“109_CR6”，“volume-title”：“前面。计算。神经科学：计算神经科学与神经技术伯恩斯坦会议与Neurex年会，BC11，第0期，“作者”：“Y Sandamirskaya”，“年份”：“2011年”，“非结构化”：“Y.Sandamierskaya，M.Richter，和G.Sch\u00f6ner，序列生成和行为组织的神经动力学，前面的u201d。计算。神经科学：计算神经科学与神经技术Bernstein Conference&Neurex Annual Meting，BC11，2011年第0期。”}，{“key”：“109_CR7”，“volume title”：“强化学习：导论（自适应计算和机器学习）”，“author”：“RS Sutton”，“year”：“1998”，“nonstructured”：“R.S.Sutton and A.G。Barto，《强化学习：导论》（自适应计算和机器学习）。麻省理工学院出版社，1998年3月。[在线]。可用：http://www.amazon.com/exec\/obidos\/redirect？tag=citeulike07-20&path=ASIN\/0262193981“}，{”key“：”109_CR8“，”doi-asserted-by“：”crossref“，”first-page“：”350“，”doi“：”10.1007\/s002210050467“，”volume“：“121”，”author“：”RE Suri“，”year“：”1998“，”unstructured“：”R.E.Suri and W。舒尔茨，利用多巴胺样强化信号的神经网络模型学习序列运动，《实验脑研究》，第121卷，第350页，20133541998年10月10日，第002210050467页。[在线]。可用信息：http://\/dx.doi.org\/10.1007\/s002210050467“，”journal-title“：”实验性大脑研究“}，{“key”：“109_CR9”，“volume-title”：“CoRR”，“author”：“J Modayil”，“year”：“2011”，“unstructured”：“J.Modayil，A.White，and R.S.Sutton，\u201cMulti-timescale nexting in A reinforcement learning robot，\u201 d CoRR，vol.abs\/1112.1133，2011.”}，“{”key“：”109_CR10”，“卷-时间”：“发展与学习，2010年。ICDL 2010。第九届IEEE国际会议”，“作者”：“Y Sandamirskaya”，“年份”：“2010年”，“非结构化”：“Y.Sandamierskaya and G.Sch\u00f6ner，《动作系统中的序列：多维动态神经场实现》，《发展与学习》，2010年。ICDL 2010。第九届IEEE国际会议，2010年。“}，{”问题“：“3”，”关键“：“109_CR11”，”doi-asserted-by“：”交叉引用“，”首页“：”139“，”doi“：”10.1016\/j.jmp.2008.12.005“，”卷“：”53“，”作者“：”Y Niv“，”年份“：”2009“，”非结构化“：”Y。Niv，《大脑强化学习》，《数学心理学杂志》，第53卷，第3期，第139页，20131542009年。[在线]。可用：http://\/linkinghub.elsevier.com/retrieve\/pii\/S0022249608001181“，”journal-title“：”journal of Mathematical Psychology“}，{“key”：“109_CR12”，“doi-asserted-by”：“crossref”，“doi”：“10.1007\/978-3642-27645-3”，“volume-title”：“强化学习：State-of-the-Art，ser.适应，学习和优化。Springer”，“author”：“M Wiering”，“年”：“2012年”，“非结构化”：“M.Wiering和M.van Otterlo，强化学习：State-Of-the-Art，ser。适应、学习和优化。施普林格，2012年。[在线]。可用：http://\/books.google.com/books？id=YPjNuvrJR0MC“}，{“key”：“109_CR13”，“volume title”：“动态系统开发方法”，“author”：“E Thelen”，“year”：“1996”，“nonstructured”：“E.Thelen and L.Smith，动态系统开发方法，ser。麻省理工学院出版社/布拉德福德认知心理学丛书。麻省理工学院出版社，1996。[在线]。可用：http://\/books.google.com/books？id=kBslxoe0TekC“}，{”issue“：“5”，”key“：“109_CR14”，”volume“：”24“，”year“：”2001“，”unstructured“：”J.K.O\u2019Regan和A.No\u00eb，\u201cA sensorsemotor account of vision and visual conscious。\u201d《行为与脑科学》，第24卷，第5期，2001年10月。[在线]。可用：http://\/view.ncbi.nlm.nih.gov\/pubmed\/12239892“，”journal-title“：”The Behavioral and brain sciences“}，{“key”：“109_CR15”，“volume-title”：“CoRR”，“author”：“S Kazerounian”，“year”：“2012”，“unstructured”：“S.Kazerouunian，M.D.Luciw，M.Richter，and Y。Sandamirskaya，\u201c神经动力学中行为序列的自主强化，\u201 d CoRR，vol.abs\/1210.3569，2012.“}]，“容器-时间”：[“Paladyn，Journal of Behavior Robotics”]，“原始标题”：[]，“链接”：[{“URL”：“http://\/link.springer.com/content\/pdf\/10.2478\/s13230-013-0109-5.pdf”，“内容类型”：“application\/pdf”content-version“：”vor“，”intended-application“：”text-mining“}，{“URL”：“http://\/link.springer.com/article\/10.2478\/s13230-013-0109-5\/fulltext.html”，“content-type”：“text\/html”，“content-version”：“vor”，“intended-application”：“text-mining”}，}“URL”：“http:\\/link.stringer.com/content\/pdf\/10.248\/s13230-0109-5”，“content-type”：“未指定”，“content-version“：”vor“，”intended-application“：”similarity-checking“}]，”deposed“：{”date-parts“：[[2021,2,28]]，”date-time“：”2021-02-28T16:13:51Z“，”timestamp“：1614528831000}，”score“：1，”resource“：”{“primary”：{“URL”：“https:\/\/\www.degruyter.com\/document\/doi\/10.2478\/s13230-013-0109-5\/html”}，“subtitle”：[]，“短标题”：[]，“已发布”：{“date-parts“：[[2012,1,1]]}，”references-count“：15，”journal-issue“：{”issue“：”4“}，“URL”：“http://\/dx.doi.org\/10.2478\/s13230-013-0109-5”，“relation”：{}，，“ISSN”：[“2081-4836”]，“ISSN-type”：[{“value”：“2081-48036”，“type”:“electronic”}]，“subject”：[/]，“published”：{“date-part”：[2012]1,1]]}}