data service Data annotation platform Solution Characteristic advantages Service process cooperative partner Customized service

data service

  • Data Annotations
  • data acquisition
  • computer vision
  • speech recognition
  • Natural semantics
  • Image semantic segmentation

  • Picture classification

  • Picture box selection

  • Facial bone management

  • 3D point cloud

  • 2D3D blend annotation

  • Continuous frame annotation

  • Video classification

  • Video content extraction

Image semantic segmentation

Image semantic segmentation is based on polygon annotation of regions, which divides complex and irregular images into regions and marks corresponding attributes to help image recognition model training. It is mostly applied to human body segmentation, scene segmentation and automatic driving road segmentation, and can be applied to intelligent driving, intelligent equipment, and intelligent security scene landing.

10W Area/day

Dimensioning ability

98% +

Accuracy

Picture classification

Based on the Baidu label base, the human resources can realize the cleaning and classification of tens of millions of images. According to your needs, you can classify the image sets you provide, help image recognition model training, and can be applied to smart retail, smart devices, smart entertainment and other scenes.

300W Figure/day

Dimensioning ability

99% +

Accuracy

Picture box selection

Picture framing can help image recognition model training, and is used to frame the recognition subject targets in the pictures. It is commonly used to frame faces, human bodies, obstacles, traffic lights, and can be applied to the landing of intelligent driving, intelligent security, and intelligent devices

10W Box/day

Dimensioning ability

99% +

Accuracy

Facial bone management

Facial skeleton dotting is a point based annotation, which is mostly used to annotate facial features, human skeleton key points and car tire grounding points in pictures, to help image recognition model training, and can be applied to intelligent driving, intelligent equipment, and intelligent security scene landing.

15W Figure/day

Dimensioning ability

98% +

Accuracy

3D point cloud

3D point cloud annotation can help the training of automatic driving models. Baidu, based on its rich experience in automatic driving annotation and advanced annotation tools, can frame 3D obstacles and semantic segmentation of radar maps, help vehicles better perceive the road surface, and can be applied to the training landing of automatic driving scenes

40W Box/day

Box selection ability

eight hundred Frame/day

Segmentation capability

98% +

Accuracy

2D3D blend annotation

2D3D fusion annotation can help the training of automatic driving models. Baidu, based on its rich experience in automatic driving annotation and advanced annotation tools, can annotate the data of 2D3D multi-sensor fusion at the same time to help vehicles achieve visual and radar perception, which can be applied to the training landing of automatic driving scenes

10W Box/day

Dimensioning ability

98% +

Accuracy

Continuous frame annotation

Continuous frame annotation is often used for training automatic driving and video image recognition models. By extracting frames from the video and marking the target objects in each frame, it can be applied to intelligent driving, intelligent security, and landing of intelligent devices

25W Box/day

Dimensioning ability

98% +

Accuracy

Video classification

Video classification is to classify videos by theme by watching video clips to help build a video database. It is commonly used for image recognition model training in the video industry and can be applied to the landing of smart entertainment scenes

1W Segment/day

Dimensioning ability

98% +

Accuracy

Video content extraction

Video content extraction is to extract frames from videos, transcribe subtitles in each frame, summarize and extract video themes, help build video databases, and is commonly used in image recognition model training in the video industry, which can be applied to the landing of smart entertainment scenes

5W Item/day

Dimensioning ability

98% +

Accuracy

  • Voice cleaning

  • phonetic transcription

  • Speech segmentation

  • Phoneme annotation

Voice cleaning

Voice cleaning uses technology to clean empty audio, which is monitored manually to screen out qualified audio. Based on Baidu label base, manpower can realize massive audio cleaning, help voice recognition model training, and can be applied to smart home, smart devices, smart customer service, smart stores and other scenarios

three hundred Hour/day

Dimensioning ability

98% +

Accuracy

phonetic transcription

Speech transcription is based on the content of audio playback and transcribed into the corresponding text. It is commonly used for speech recognition model training. It can support speech transcription in Mandarin, dialect, English and small languages, and is applied to the landing of smart homes, smart devices, smart customer service, smart stores and other scenarios

fifty Hour/day

Dimensioning ability

98% +

Accuracy

Speech segmentation

Voice segmentation is to monitor long audio, mark the starting point of the speaker in the audio, used for voice recognition model training, and applied to the landing of smart homes, smart devices, smart customer service, smart stores and other scenarios

two hundred Hour/day

Dimensioning ability

98% +

Accuracy

Phoneme annotation

Phoneme annotation is used to monitor audio, transcribe text and annotate phonetic symbols of text, which is commonly used in speech synthesis technology

five thousand Sentence/day

Dimensioning ability

98% +

Accuracy

  • Text cleaning

  • Text classification

  • Text enrichment

  • OCR transfer

  • Emotional annotation

  • NLP callout

Text cleaning

Text cleaning is to filter the text according to your rules and select the data that meets the requirements. Based on the manpower of Baidu label base, it can achieve the cleaning of tens of millions of texts, help NLP model training, and can be applied to intelligent customer service, intelligent finance, intelligent driving and other scenarios.

100W Item/day

Dimensioning ability

98% +

Accuracy

Text classification

Text classification is the attribute classification of text according to your rules. Based on Baidu label base, human resources can realize the classification operation of millions of texts, help NLP model training, and can be applied to intelligent customer service, intelligent finance, intelligent driving and other scenarios.

20W Item/day

Dimensioning ability

98% +

Accuracy

Text enrichment

Text enrichment is to write text around the theme, so that for the same theme, text expressions are diverse and practical, which can help NLP model training, and can be applied to intelligent customer service, intelligent finance, intelligent driving and other scenarios.

2W Item/day

Dimensioning ability

98% +

Accuracy

OCR transfer

OCR transcription is to mark and transcribe the text content in the picture. It supports image transcription in Chinese, English and small languages, helps image and text recognition models, and can be applied to smart entertainment, smart devices and other scenes

20W Item/day

Dimensioning ability

98% +

Accuracy

Emotional annotation

Emotional tagging is to judge the emotional tendency of text expression, classify positive and negative texts, help NLP model training, and can be applied to smart home, smart entertainment, smart finance and other scenarios

10W Item/day

Dimensioning ability

98% +

Accuracy

NLP callout

NLP annotation is the annotation of text syntax, including slot extraction, text relations, etc., which can help NLP model training and can be applied to smart home, smart entertainment, smart finance and other scenarios

5W Item/day

Dimensioning ability

96% +

Accuracy

  • computer vision
  • Language recognition
  • Natural semantics
  • Image Capture

  • Image acquisition

  • Portrait collection

  • Video capture

  • Automatic driving road acquisition

Image Capture

The image capture service can quickly capture all kinds of images published on the network, screen data that meet your model requirements through technical and manual cleaning, help image recognition model training, and can be applied to smart devices, smart finance, smart retail and other scenarios.

1000W Figure/day

Acquisition capability

97% +

Accuracy

Image acquisition

Image acquisition service, based on Baidu offline acquisition users, can capture all kinds of images in real life, including goods, cars, documents, landscapes, etc., help train image recognition models, and can be applied to smart retail, intelligent devices and other scenes.

10W Figure/day

Acquisition capability

97% +

Accuracy

Portrait collection

Portrait collection service can help improve the accuracy of face recognition models. Based on Baidu's offline collection capability, it can carry out multi ethnic face image collection in 22 countries nationwide and overseas, and support diversified acquisition requirements of multi angle, multi light, and multi scene. It can be landed in smart devices, smart security, smart finance and other visual scenes.

five hundred Person/day

Acquisition capability

97% +

Accuracy

Video capture

Video capture service, which can capture videos of specified objects, faces, security and other scenes, supports diversified acquisition requirements of multi angle, multi light, and multi scene. It can be landed in visual scenes such as intelligent security, intelligent equipment and smart finance.

five thousand Segment/day

Acquisition capability

97% +

Accuracy

Automatic driving road acquisition

Baidu has its own collection team, equipped with laser radar and industrial cameras, which can provide cross city 2D and 3D road data collection services, support vehicle customization and sensor modification, and is applicable to the training of automatic driving models, and can be applied to the landing of automatic driving scene training based on vision or radar schemes.

five hundred Km/day

Acquisition capability

99% +

Accuracy

  • Wake up word collection

  • ASR voice acquisition

  • TTS voice acquisition

Wake up word collection

Wake up word collection, recording users' wake-up word voice based on Baidu's collection resources. The crowd can cover all parts of the country, support voice recording of specific devices, near and far fields, and multiple speech speeds, help voice recognition model training, and can be applied to the landing of smart homes, smart devices, smart stores and other scenes

one thousand Person/day

Acquisition capability

97% +

Accuracy

ASR voice acquisition

ASR voice acquisition can help the training of voice recognition models. Through Baidu's national and overseas resources, it can collect all kinds of voice audio, including Mandarin, dialect, English and small languages, and can be applied to smart homes, smart devices, smart customer service, smart stores and other scenes

one hundred Hour/day

Acquisition capability

97% +

Accuracy

TTS voice acquisition

TTS voice acquisition is often applied to voice synthesis technology. Baidu can provide professional speakers to record high fidelity voice in professional recording studio environment, which can be applied to landing of smart customer service, smart home, smart devices and other scenes

ten Hour/day

Acquisition capability

98% +

Accuracy

  • web capture

web capture

Web page capture can quickly capture the text content in the web page you provide, screen the digital text that meets your model requirements through technical capture and manual cleaning, help NLP model training, and can be applied to intelligent customer service, intelligent finance, intelligent driving and other scenarios.

5000W Item/day

Acquisition capability

97% +

Accuracy

Data annotation platform

Privatized data annotation platform

Deployed in the customer's local area, the customer organizes employees or outsourcing personnel to annotate data on the enterprise intranet.

  • Provide comprehensive and powerful annotation tools, support function customization, and support docking with various systems
  • Flexible and configurable project management process
  • Hierarchical organization and personnel management
Application service
Solution

Integrated intelligent driving data solution of "standard acquisition, storage, management and training"

Based on years of data experience in the intelligent driving industry, it provides complete supporting products and services for the whole process of data collection, annotation, storage, management, training, cleaning and evaluation, and helps the rapid implementation of intelligent driving technology.

Characteristic advantages

The government jointly builds a labeling base to ensure data security and service quality

The largest AI data annotation base in China, covering an area of more than 10000 square meters, has 2500 full-time professional annotation personnel, and has been listed as a key project in 2019 by Shanxi Province.

More secure data security

Strict internal legal supervision process, secure private data deployment, anti data leakage answer management mechanism, real-time monitoring and encrypted labeling equipment ensure the safety of customer data without risk.

More accurate data quality

Strict personnel training operation mechanism and three rounds of data audit mechanism, supplemented by intelligent audit algorithm and intelligent management platform, ensure that the data quality is far higher than the industry average.

More efficient processing speed

Hundreds of data project scheme experts, 2000 full-time labeling personnel of Baidu Shanxi Base, 20000 full-time labeling personnel of contracted outfield, and 30000 online labeling users of Baidu crowdsourcing have achieved the ability to handle millions of data labeling.

More preferential payment

With the self built labeling base, scientific crowdsourcing task distribution mode, and intelligent data collection and labeling tools, scale effect and efficient operation can be achieved, thus reducing costs and benefiting paying customers.

Service process

A team of 100 senior data experts, a professional tagging platform, 10000 people+professional tagging staff, and full support for data services

The customer proposed
Raw data requirements

Step 1

customized
Exclusive data scheme

Step 2

implement
Data solutions

Step 3

Baidu Automatic Quality Inspection
Algorithm audit

Step 4

artificial
Four rounds of review

Customer acquisition
High quality AI data

Application service
cooperative partner
Customized service

Professional AI data helps the development of enterprise intelligence

Application service