For vectorized files, only the first page is parsed. If your multi-page PDF contains critical information on subsequent pages, consider splitting the file before parsing.
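
One way to split a file before parsing is sketched below. It assumes the third-party `pypdf` package is installed; the function names, the output naming scheme, and the example paths are hypothetical and not part of the diagram parsing product. Each page is written to its own single-page PDF, which can then be parsed as a separate file.

```python
from pathlib import Path


def page_output_path(source: Path, page_number: int, out_dir: Path) -> Path:
    """Build the output path for one page, e.g. 'pid_page_002.pdf'.

    The naming scheme is an illustrative assumption.
    """
    return out_dir / f"{source.stem}_page_{page_number:03d}.pdf"


def split_pdf(source: Path, out_dir: Path) -> list[Path]:
    """Write every page of `source` to its own single-page PDF."""
    # pypdf is a third-party dependency (pip install pypdf). It is imported
    # here so the naming helper above can be used without it.
    from pypdf import PdfReader, PdfWriter

    out_dir.mkdir(parents=True, exist_ok=True)
    reader = PdfReader(str(source))
    written = []
    for number, page in enumerate(reader.pages, start=1):
        writer = PdfWriter()
        writer.add_page(page)  # copy exactly one page into the new file
        target = page_output_path(source, number, out_dir)
        with open(target, "wb") as handle:
            writer.write(handle)
        written.append(target)
    return written
```

For example, `split_pdf(Path("drawings/pid.pdf"), Path("out"))` would write one file per page into `out/`, so that each page can be parsed on its own.
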
Core concepts
Understanding these foundational concepts helps you work effectively with diagram parsing:

| Concept | Definition |
|---|---|
| Connection | A visual relationship between symbols, represented by lines or pipes in the diagram. Connections map how assets interact in your operations. |
| Diagram | A single page of a parsed file containing all detected symbols and their connections. Each diagram is created using a specific library. |
| Geometry | One of the many possible visual compositions of a symbol. Each geometry is defined by SVG paths that help the algorithm recognize different visual styles. |
| Instance | A detected occurrence of a symbol in the diagram. A single symbol can yield multiple instances, depending on how many matching vectors are found during parsing. |
| Library | A CDF project-specific collection of symbols used to detect objects in engineering diagrams. You must select a library before running a parsing job. |
| Merge | The process of linking detected tags and symbols that represent the same physical entity in the knowledge graph. |
| Symbol | A blueprint that defines how to detect a particular type of equipment, such as a valve, instrument, or pipe. Each symbol contains one or more geometries. |
| Tag | A text annotation detected in the diagram that identifies an asset or file reference. Tags can be linked to existing resources in CDF. |
| Template | A read-only set of symbols available by default in every CDF project. Templates provide a starting point for diagram parsing without requiring custom symbol creation. |
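
The relationships between these concepts can be pictured with a small data model. This is only an illustrative sketch: the class names mirror the table, but every field and method is a hypothetical assumption and does not reflect the actual CDF API or data model. Merges and templates are left out to keep the example short.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Geometry:
    svg_paths: list[str]  # SVG paths describing one visual style


@dataclass
class Symbol:
    name: str                   # e.g. "valve"
    geometries: list[Geometry]  # one or more visual compositions


@dataclass
class Library:
    name: str
    symbols: list[Symbol] = field(default_factory=list)


@dataclass
class Instance:
    symbol: Symbol                  # the blueprint that was matched
    position: tuple[float, float]   # hypothetical location on the page


@dataclass
class Connection:
    source: Instance
    target: Instance


@dataclass
class Tag:
    text: str
    linked_resource: Optional[str] = None  # hypothetical resource reference


@dataclass
class Diagram:
    library: Library  # each diagram is created using one library
    instances: list[Instance] = field(default_factory=list)
    connections: list[Connection] = field(default_factory=list)
    tags: list[Tag] = field(default_factory=list)

    def instances_of(self, symbol_name: str) -> list[Instance]:
        """Return every detected instance of the named symbol."""
        return [i for i in self.instances if i.symbol.name == symbol_name]


# One symbol with two geometries, detected twice on the same page.
valve = Symbol("valve", [Geometry(["M0 0 L10 10"]), Geometry(["M0 10 L10 0"])])
library = Library("site-library", [valve])
first = Instance(valve, (12.0, 40.0))
second = Instance(valve, (80.0, 40.0))
diagram = Diagram(
    library,
    instances=[first, second],
    connections=[Connection(first, second)],
    tags=[Tag("V-101")],
)
valve_count = len(diagram.instances_of("valve"))
```

The sketch shows the one-to-many relationships described in the table: a library holds symbols, a symbol holds geometries, and a single symbol can be matched as several instances in one diagram.
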