- PDF file format with English text and a maximum of 100 pages. Results improve with smaller files.
- Embedded text or scanned documents
- Describe a single asset or piece of equipment
- Key-value pair data representation
Before you start
- Ingest the documents into CDF.
- Set up access capabilities.
- Create a view in a data model with properties that reflect the key-value data.
Parse documents
1
Navigate to document parsing
Navigate to Data management > Contextualize > Document parsing.
2
Create parsing task
Select Create parsing task and the documents you want to parse.
You can parse several documents simultaneously, but the data from each document is ingested into a separate data model view.
3
Select next to continue
Select Next to continue.
4
Select views and run
Select the views you want to populate parsed data into and select Run.
5
Review the parsed data
Review the parsed data.
- Select a property in the Parsed data sidebar to zoom into a field in the document.
- Hover over a field to update the values.
6
Approve or reject parsing
Reject or approve the parsing. The approved data is stored as a data model instance.