In public survey
Document Intelligence
Play video
Document Mind, a multi-modal document recognition and understanding engine based on years of technology accumulation, provides users with structured information extraction and intelligent document processing of various documents. It supports diversified document processing requirements in general scenarios, industry scenarios, and custom scenarios. Please add nails for product inquiry and answer. Communication group: 44854217

Product specification

Convert PDF to Word document, preserving layout and style
Package Type
PDF to Word
Capacity specification of flow packet
500 pages
Purchase duration
1 year
Enquiry in progress
Converts a picture to a Word document, preserving layout and style
Package Type
Picture to Word
Capacity specification of flow packet
500 pages
Purchase duration
1 year
Enquiry in progress
Convert PDF to editable Excel document
Package Type
PDF to Excel
Capacity specification of flow packet
500 pages
Purchase duration
1 year
Enquiry in progress
Convert pictures to editable Excel documents
Package Type
Picture to Excel
Capacity specification of flow packet
500 pages
Purchase duration
1 year
Enquiry in progress
Convert PDF to pictures page by page
Package Type
PDF to Picture
Capacity specification of flow packet
500 pages
Purchase duration
1 year
Enquiry in progress
Convert multiple pictures to PDF
Package Type
Picture to PDF
Capacity specification of flow packet
500 pages
Purchase duration
1 year
Enquiry in progress
Extract hierarchical trees, layouts, fields, etc. from documents
Package Type
Intelligent document parsing
Document understanding of resource package capacity specification
10000 pages
Purchase duration
1 year
Enquiry in progress
Extract styles, text, fields, etc. from tables
Package Type
Table intelligent parsing
Document understanding of resource package capacity specification
500 pages
Purchase duration
1 year
Enquiry in progress

Product architecture

Product Introduction
As the next generation of automation technology, document intelligence deeply integrates multiple technologies such as character recognition, natural language processing, image processing, electronic document parsing, and document pre training model to intelligently automate unstructured and semi-structured documents, thereby simplifying business operation processes and improving document processing efficiency.
Product value
Extract key information from unstructured data to give play to data value
Mining and analyzing document data to make better decisions
Automate the processing of various documents to improve work and learning efficiency
Connect with the internal system of the enterprise to improve the intelligent level of the enterprise
Related products

Product Functions

Universal Document Intelligence
It provides intelligent document processing capability in general scenarios, and can realize document understanding, document format conversion, document error correction and other functions.
  • Document understanding: carry out structural identification and understanding of various documents and forms, and on this basis, complete document extraction and document processing tasks in multiple common scenarios.
  • Document format conversion: convert non editable documents such as PDF and pictures to editable document formats such as Word and Excel, and maximize the retention of document layout style while achieving high-precision content recognition.
  • Document error correction: it can correct errors such as text, words, grammar and punctuation in the document, check all kinds of Chinese and English problems in the document and return modification suggestions, so as to achieve efficient, accurate and standardized document writing.
Industry document intelligence
For document processing requirements in industry scenarios, it provides industry document processing capabilities in bidding, legal documents, contract processing and other scenarios.
  • Intelligent bidding: for bidding scenarios, provide various bidding documents, bid winning announcements and other documents for structural analysis and extraction.
Document self-learning
For user-defined scenarios, a self-learning training tool is provided, which can achieve a high-precision document processing model with only a few sample annotations.
  • Document self-learning: for enterprise and individual developer users without algorithm foundation, complete data processing, model building training and management, deployment and release and other operations through the model independent training platform to achieve rapid, high-precision, personalized AI model production.

Product advantages

Advanced algorithm technology
Relying on Ali's rich document scenes, the advanced multi-modal document recognition and understanding engine is polished, and the algorithm effect and performance indicators are at a high level.
Rich industrial applications
Covering customs logistics, bidding, government affairs, finance and taxation and other multi industry and multi scenario applications, it can meet the document processing needs of all walks of life.
Flexible deployment mode
It supports public cloud API, hybrid cloud Docker, aPaaS, SaaS and other product deployment methods, with flexible product access and low threshold for use.
Reliable service quality
It provides highly available document processing capability, which has been repeatedly honed in massive document processing business. It has high service stability and supports flexible expansion and contraction.

Application scenarios

Government and enterprise office
Finance and taxation
Government and enterprise office
Intelligently process various office documents and forms to extract structured information from documents.
Capable of providing
Intelligent document parsing
Unstructured office documents are parsed into structured data, and key field information is extracted to replace manual processing.
Document format conversion
Convert non editable PDF, pictures and other documents into editable Word, Excel and other formats, and improve the convenience of document processing.
Recommended combination
Finance and taxation
Process financial documents such as company financial reports and research reports, convert them into structured documents, and use them for system analysis, product introduction and other scenarios.
Capable of providing
Intelligent document parsing
Extract the key information in unstructured documents such as financial reports and research reports, and further process after connecting with the analysis system.
Document format conversion
Convert electronic and scanned PDF documents into Word, HTML and other formats for product details.
Recommended combination

Documentation and Tools

More products and services

Document understanding
Structurally identify and understand various documents and forms, and on this basis, complete the document processing tasks in a variety of common scenarios such as document extraction and document comparison.
Document format conversion
Convert non editable documents such as PDF and pictures to editable document formats such as Word and Excel, so as to maximize the retention of document layout style while achieving high-precision content recognition.
Identification of trade documents
Provide intelligent identification and analysis capability for customs declaration, commercial invoice, bill of lading, air waybill, etc.
Document self-learning
Provide data annotation and training capabilities, and support self-learning training of various documents and forms.