{“状态”：“确定”，“消息类型”：“工作”，“信息版本”：“1.0.0”，“邮件”：{“索引”：{-“日期-部件”：[[2024,5,6]]，“日期-时间”：“2024-05-06T12:03:25Z”，“时间戳”：1714997005321}，“引用-计数”：31，“出版商”：“Springer Science and Business Media LLC”，“问题”：“1”，“许可证”：[{“开始”：{:“日期-零件”：[2019,9,27]]，时间“：”2019-09-27T00:00:00Z“，“timestamp”：1569542400000}，“content-version”：“tdm”，“delay-in-days”：0，“URL”：“http://www.springer.com/tdm”}，{“start”：{“date-parts”：[[2019,9,27]]，“date-time”：“2019-09-27T00:00:00Z”，“timetamp”：155954240000}，springer.com/tdm“}]，”content-domain“：{”域“：[”link.springer.com“]，“crossmark-restriction”：false}，“short-container-title”：[“Lang Resources&Evaluation”]，“published-print”：{“date-parts”：[[2021,3]]}，”DOI“：”10.1007\/s10579-019-09476-2“，”type“：”journal-article“，”created“：{”date-part“：[[2019,9,27]]，”date-time“：”2019-09-27T15:04:53Z“，”timestamp“：1569596693000}“page”：“127-150”，“更新策略”：“http://\/dx.doi.org\/10.1007\/springer_crossmark_policy”，“source”：“Crossref”、“is-referenced-by-count”：13，“title”：[“带卷积神经网络的多模式页面流分段”]，“prefix”：“10.1007”，”volume“：”55“，”author“：[{”ORCID“http://\/ORCID.org\/00000-002-4239-295X”，“authenticated-ORCID”：false，“give”：“Gregor”，“family”：“Wiedemann”“”、“sequence”：“first”，“affiliation”：[]}，{“given”：“Gerhard”，“family”：“Heyer”，“sequences”：“additional”，“filiance”：[]}]，“member”：“297”，“published-online”：{“date-parts”：[[2019,9,27]]}、“reference”：[{“key”：”9476_CR1“，”doi-asserted-by“：”publisher“，“unstructured”：“Agin，O.，Ulas，C.，Ahat，M.，&Bekar，C.（2015）一种使用二进制分类的多阶段文档流分割方法。第六届图形和图像处理国际会议论文集，https://doi.org\/10.1117\/12.178778.“，”doi“：”10.1117\/12.1178778“}，{“key”：“9476_CR2”，“unstructured”：“Blei，D.M.，Ng，A.Y.，&Jordan，M.I.（2003）。潜在dirichlet分配。机器学习研究杂志，3，993\u20131022。网址：http://www.cs.princeton.edu\/~blei\/papers\/BleiNgJordan2003.pdf。“}，{”key“：”9476_CR3“，”doi-asserted-by“：”publisher“，”first-page“：”135“，”doi“：”10.1162\/tacl_a_00051“，”volume“：“5”，”author“：”P Bojanowski“，”year“：”2017“，”unstructured“：”Bojanoowski，P.，Grave，E.，Joulin，a.，&Mikolov，T.（2017）。使用子单词信息丰富单词向量。计算语言学协会学报，5，135\u2013146.”，“新闻标题”：“计算语言学协会杂志”}，{“key”：“9476_CR4”，“doi-asserted-by”：“publisher”，“unstructured”：“Cho，K.，Van Merrienboer，B.，Gulcehre，C.，Bahdanau，D.，Bougares，F.，Schwenk，H.，&Bengio，Y.（2014）。使用RNN编码器\u2013解码器学习短语表示，以进行统计机器翻译。《2014年自然语言处理实证方法会议论文集》（第1724\u20131734页）。计算语言学协会，https:\/\/doi.org\/10.3115\/v1\/D14-1179。URL http:\/\/acweb.org/collectory\/D14-1179.“，”DOI“：”10.3115\/v1\/D14-1179“｝，｛”key“：”9476_CR5“，”nonstructured“：”Daher，H.，&Bela\u00efd（2014）。用于商业应用的文档流分割。在文档识别和检索学报XXI（pp.9201\u20139215）中法国旧金山，URL https:\/\/hal.archives-ouvertes.fr\/hal-00926615.“}，{“key”：“9476_CR6”，“doi-asserted-by”：“publisher”，“unstructured”：“Daher，H.，Bouguelia，M.R.，Belaid，A.，&DAndecy，V.P.（2014）.多页管理文档流分割。第22届模式识别国际会议论文集（第966\u2013971页）。https:\/\/doi.org\/10.109\/ICPR.2014.176.“，”doi“：”10.1109\/ICPR.2014.176“}，{“问题”：“6”，“密钥”：“9476_CR7”，“doi-asserted-by”：“发布者”，“首页”：“391”，“doi”：“10.1002\/（SICI）1097-4571（1999）41:6<391:：AID-ASI>3.0.CO；2-9”，“卷”：“41”，“作者”：“S Deerww ester”，“年份”：“1990年”，“非结构化”：“Deerwester，S.，Dumais，S.T.，Furnas，G。W.、Landauer，T.K.和Harshman，R.（1990年）。通过潜在语义分析进行索引。《美国信息科学学会杂志》，41（6），391\u2013407。“，”杂志标题：“美国信息科学协会杂志”}，{“key”：“9476_CR8”，“unstructured”：“Fan，R.E.，Chang，K.W.，Hsieh，C.J.，Wang，X.R.，&Lin，C.J.（2008）.LIBLINEAR：用于大型线性分类的库。机器学习研究杂志，91871\u20131874。网址：http://\/jmlr.org\/papers\/volume9\/fan08a/fan08a.pdf。“}，{”key“：”9476_CR9“，”doi-asserted-by“：”publisher“，”unstructured“：”Gallo，I.，Noce，L.，Zamberletti，A.，&Calefati，A.（2016）。用于页面流分割和分类的深度神经网络。《数字图像计算国际会议论文集：技术和应用》（pp 1\u20137），2016年。https:\/\/doi.org\/10.109\/DICTA.2016.7797031.“，”doi“：”10.1109\/DICTA.2016.7799031“}，{“key”：“9476_CR10”，“doi-asserted-by”：“publisher”，“unstructured”：“Gordo，A.，Rusinol，M.，Karatzas，D.，&Bagdanov，A.D.（2013）数字邮件收发室应用程序的文档分类和页面流分段。《第十二届国际文献分析与识别会议论文集》（第621\u2013625页）。https:\/\/doi.org\/10.109\/ICDAR.2013.128.“，”doi“：”10.1109\/ICDAR.2013.128“}，{“key”：“9476_CR11”，“doi-asserted-by”：“publisher”，“unstructured”：“Hamdi，A.，Voerman，J.，Coustaty，M.，Joseph，A.，d\u2019Andency，V.P.，&Ogier，J.M.（2017）机器学习与基于确定性规则的文档流分割系统。在第14届IAPR国际文件分析与识别会议（ICDAR）论文集（第77\u201382页）。https:\/\/doi.org\/10.109\/ICDAR.2017.332.“，”doi“：”10.1109\/ICDAR.2017.332“}，{“key”：“9476_CR12”，“doi-asserted-by”：“publisher”，“unstructured”：“Hamdi，A.，Coustaty，M.，Joseph，A.，d\u2019Andecy，V.P.，Doucet，A.，&Ogier，J.M.（2018）.文档流分段的特征选择。第十三届IAPR文件分析系统国际研讨会论文集（第245\u2013250页）。https:\/\/doi.org\/10.109\/DAS.2018.66.“，”doi“：”10.1109\/DAS/2018.66“}，{“key”：“9476_CR13”，“doi-asserted-by”：“publisher”，“unstructured”：“Harley，A.W.，Ufkes，A.，&Derpanis，K.G.（2015）用于文档图像分类和检索的深度卷积网络的评估。第十三届国际文献分析与识别会议（ICDAR）论文集（第991\u2013995页）。https:\/\/doi.org\/10.109\/ICDAR.2015.7333910.“，”doi“：”10.1109\/ICDAR.2015.7338910“}，{“key”：“9476_CR14”，“unstructured”：“Isemann，D.，Niekler，A.，Pre\u00dfler，B.，Viereck，F.，&Heyer，G.（2014）。作为工业灾害预防构建块的遗留文档的OCR。《工业灾害预防学报》LREC处的凹痕灾害管理和原则化大规模信息提取及应急后后勤研讨会”}，{“key”：“9476_CR15”，“unstructured”：“Joachims，T.（1998）。支持向量机的文本分类：具有许多相关特征的学习。《第十届欧洲机器学习会议论文集》（第137\u2013142页）柏林：施普林格。ISBN 978-3-540-69781-7。“}，{“key”：“9476_CR16”，“doi-asserted-by”：“publisher”，“unstructured”：“Karpinski，R.，&Bela\u00efd，A.（2016）。用于文档流分段的结构描述符和事实描述符的组合。收录于第十二届IAPR文档分析系统研讨会论文集（第221\u2013226页）。https:\//doi.org\/10.109\/DAS.2016.21。”，“DOI“：”10.1109\/DAS.2016.21“}，{“key”：“9476_CR17”，“DOI-asserted-by”：“publisher”，“unstructured”：“Kim，Y.（2014）。句子分类的卷积神经网络。《2014年自然语言处理经验方法会议论文集》（第1746\u20131751页）计算语言学协会，https:\/\/doi.org\/10.3115\/v1\/D14-1181。URL http://\/aclweb.org\/antology\/D14-1181.“，”DOI“：”10.3115\/v1\/D14-1181“}，{”key“：”9476_CR18“，”DOI-asserted-by“：”publisher“，”first page“：“119”，“DOI”：“10.1016\/j.patrec.2013.10.030”，“volume”：“43”，“author”：“j Kumar”，“year”：“2014”，“unstructured”：“Kumar，j.，Ye，P.，&Doermann，D.（2014）.文档图像分类和检索的结构相似性。模式识别字母，43，119\u2013126.”，“日记标题”：“模式识别字母”}，{“key”：“9476_CR19”，“unstructured”：“Landis，J.R.，&Koch，G.G.（1977）。分类数据观察者一致性的测量。生物统计学，33（1），159\u2013174。ISSN 0006341X，15410420。URL http://www.jstor.org\/stable\/2529310.“}，{“key”：“9476_CR20”，“unstructured”：“Le，Q.，&Mikolov，T.（2014）。句子和文档的分布式表示。《第31届国际机器学习会议论文集》（第1188\u20131196页）非结构化”：“Lewis，D.，Agam，G.，Argamon，S.，Frieder，O.，Grossman，D.，&Heard，J.（2006）。构建用于复杂文档信息处理的测试集合。第29届国际ACM SIGIR年会论文集（第665\u2013666页）。“，”DOI“：”10.1145\/114810.1148307“｝，｛”key“：”9476_CR22“，”非结构化“：”Meilender，T.，&Bela\u00efd，A.（2009）。通过修改的后向-前向算法对连续文档流进行分割。在SPIE-电子成像，美国洛杉矶，URL https:\/\/hal.inria.fr\/inria-00347217.“｝，｛”key“：”9476_CR23“，”非结构化“：”Niekler，A.和J\u00e4technhen，P.（2012）。文本的潜在dirichlet分配的匹配结果。《第十一届认知建模国际会议论文集》（第317\u2013322页）。柏林大学。}，{“key”：“9476_CR24”，“doi-asserted-by”：“publisher”，“unstructured”：“Noce，L.，Gallo，I.，Zamberletti，A.，&Calefati，A.（2016）。卷积神经网络用于文档图像分类的嵌入文本内容。2016年ACM文档工程研讨会论文集（第165\u2013173页）。ACM：纽约。ISBN 978-1-4503-4438-8。https:\/\/doi.org\/10.1145\/2960811.2960814.“，”doi“：”10.1145\/2968811.2960814“}，{“问题”：“1”，“密钥”：“9476_CR25”，“doi-asserted-by”：“发布者”，“首页”：“62”，“doi”：“10.1109\/tsmc.1979.4310076”，“卷”：“9”，“作者”：“N Otsu”，“年份”：“1979”，“非结构化”：“Otsu，N.（1979）从灰度直方图中选择阈值的方法。IEEE系统、人与控制论汇刊，9（1），62\u201366。https:\/\/doi.org\/10.109\/tsmc.1979.4310076.“，”journal-title“：”IEEE系统、人和控制论事务“}，{”issue“：”7“，”key“：”9476_CR26“，”doi-asserted-by“：”publisher“，”first-page“：”961“，”doi“：”10.1109\/TKDE.2010.27“，“volume”：“23”，“author”：“XH-Phan”，“year”：“2011”，“unstructured”：“Phan，X.H.，Nguyen，C.T.，Le，D.T.，Nguyen，L。M.和Horiguchi，S.（2011年）。一个隐藏的基于主题的框架，用于使用简短的web文档构建应用程序。IEEE知识与数据工程汇刊，23（7），961\u2013976。https:\/\/doi.org\/10.109\/TKDE.2010.27.“，”journal-title“：”IEEE知识与数据工程学报“}，{”issue“：”4“，”key“：”9476_CR27“，”doi-asserted-by“：”publisher“，”first page“：“331”，”doi“：”10.1007\/s10032-014-0225-8“，“volume”：“17”，“author”：“M Rusi\u00f1ol”，“year”：“2014”，“unstructured”：“”Rusi\u00f1ol，M.、Frinken，V.、Karatzas，D.、Bagdanov，A.D.和Llad\u00f3s，J.（2014）。管理文档图像流中的多模式页面分类。国际文献分析与识别杂志，17（4），331\u2013341。https://doi.org/10.1007\\s10032-014-0225-8.”，“期刊标题”：“国际文献分析与识别期刊”}，｛“密钥”：“9476_CR28”，“非结构化”：“Simonyan，K.，&Zisserman，A.（2014）。用于大规模图像识别的甚深卷积网络。CoRR，arXiv:1409.1556，URL http:\/\/arXiv.org/abs\/1409.1556。”｝，｛“密钥”：“9476_CR29“，“非结构化”：“Vaswani，A.、Shazeer，N.、Parmar，N.，Uszkoreit，J.、Jones，L.、Gomez，A.N.、Aidan N.，K.、\u0141\u00a0ukasz和Polosukhin，I.（2017）。注意力是你所需要的。神经信息处理系统进展30（第5998页\u20136008）。Curran Associates，网址：http://\/papers.nips.cc\/paper\/7181-antelection-is-all-you-need.pdf。“｝，｛”issue“：”2“，”key“：”9476_CR30“，”doi断言“：”publisher“，”first page“：”135“，”doi“：”10.1177\/094439318758389“，”volume“：”37“，”author“：”G Wiedemann“，”year“：”2019“，”nonstructured“：”Wiedemann，G.（2019）重温比例分类：使用主动学习对政治宣言进行自动内容分析。《社会科学计算机评论》，37（2），135\u2013159。https:\/\/doi.org\/10.1177\/0894439318758389.“，”journal-title“：”Social Science Computer Review“}，{“key”：“9476_CR31”，“unstructured”：“Wiedemann，G.，Ruppert，E.，Jindal，R.，&Biemann（2018）将学习从LDA转移到BiLSTM-CNN，以便在推特中检测攻击性语言。《2018年GermEval任务会议记录》，第14届自然语言处理会议（Konvens）（第85\u201394页）。奥地利维也纳：奥地利科学院。“}]，”containertitle“：[”Language Resources and Evaluation“]，”original-title“:[]，”Language“：”en“，”link“：[{”URL“：”http://\/link.springer.com/content\/pdf\/10.1007\/s10579-019-09476-2.pdf“，”content-type“：”application\/pdf“、”content-version“：”vor“，”intended-application“：”text-mining“}，”{“URL”：“http://\/link.springer.com/article\/10.1007\/s10579-019-09476-2\/fulltext.html“，”content-type“：”text\/html“，”content-version“：”vor“，”intended-application“：”text-mining“}，{”URL“：”http://\/llink.springer-com/content\/pdf\/10007\/s10579-019-0947 6-2.pdf“，”内容-type”：“application\/pdf”，“content-version”：“vor”，“intended-epplication”d-应用程序“：”相似性检查“}”，“存放”：{“日期部分”：[[2021,4,2]]，“日期时间”：“2021-04-02T19:06:20Z”，“时间戳”：1617390380000}，“分数”：1，“资源”：{“主要”：{:“URL”：“http://\/link.springer.com/10.1007\/s10579-019-09476-2”}}，”副标题“：[]，”短标题“：[]，”已发布“：{”日期部分“：[2019,9,27]]}，“references-count”：31，“journal-issue”：{“issue”：“1”，“published-print“：{“date-parts”：[[2021,3]]}}，“alternative-id”：[“9476”]，“URL”：“http://\/dx.doi.org\/10.1007\/s10579-019-09476-2”，“relation”：{}，”ISSN“：[“1574-020X”，“1574-0218”]，”ISSN-type“：[{”value“：”1574-020X'，“type”：“print”}，{“value”：“电子”}]，“主题”：[]，“发布”：{“日期部分”：[[2019,9,27]]}，“断言”：[{“值”：“2019年9月27日“，”订单“：1，”名称“：”first_online“，”标签“：”first online“，“group”：{“name”：“Article History”，“label”：“文章历史”}}]}}