Skip to main content
The process detects, extracts, and maps asset tags, file references, symbols, and connections from engineering diagrams to create a structured knowledge graph. The parsing flow differs based on file characteristics: Rasterized files support tag detection and mapping through text recognition and are limited to the Tags detection workflow. Vectorized files support tag detection and mapping, symbol detection and library matching, connection tracing between symbols, and full detection workflow across all tabs (Tag detections, Symbols, Merge, Connections).
For vectorized files, only the first page is parsed. If your multi-page PDF contains critical information on subsequent pages, consider splitting the file before parsing.
This diagram illustrates the parsing flow for vectorized files:

Core concepts

Understanding these foundational concepts helps you work effectively with diagram parsing:
ConceptDefinition
ConnectionA visual relationship between symbols, represented by lines or pipes in the diagram. Connections map how assets interact in your operations.
DiagramA single page of a parsed file containing all detected symbols and their connections. Each diagram is created using a specific library.
GeometryOne of the many possible visual compositions of a symbol. Each geometry is defined by SVG paths that help the algorithm recognize different visual styles.
InstanceA detected occurrence of a symbol in the diagram. A single symbol can identify multiple instances based on the number of matching vectors found during the parsing process.
LibraryA CDF project-specific collection of symbols used to detect objects in engineering diagrams. You must select a library before running a parsing job.
MergeThe process of linking detected tags and symbols that represent the same physical entity in the knowledge graph.
SymbolA blueprint that defines how to detect a particular type of equipment, such as a valve, instrument, or pipe. Each symbol contains one or more geometries.
TagA text annotation detected in the diagram that identifies an asset or file reference. Tags can be linked to existing resources in CDF.
TemplateA read-only set of symbols available by default in every CDF project. Templates provide a starting point for diagram parsing without requiring custom symbol creation.