Select one of these options for matching Cognite Data Fusion (CDF) resources to assets:Documentation Index
Fetch the complete documentation index at: https://docs.cognite.com/llms.txt
Use this file to discover all available pages before exploring further.
- Create a pipeline to rerun matching models on data sets to improve the results based on confirmed matches over time. If the data set receives new data, you can rerun the pipeline to find additional matches.
- Run a quick match for one-time matching of individual resources, or groups of resources, to assets. The matching model isn’t stored, and you can’t reuse it or improve it over time.
- Match the nodes in a 3D model to assets.
Prerequisites
You need these capabilities to match entities.Step 1: Select the matching process
Step 2: Select entities and assets to match
Select source entities
Step 3: Set up the matching model and generate suggested matches
Select the fields for matching entities
name fields when it searches for matches. You can select different fields or add more fields using the dropdown menus and selecting Add fields.Train the matching model
Select the similarity scoring model
- Select Simple to calculate a similarity score based on identical letter or digit sequences, hereafter referred to as tokens, for each pair of fields defined above. This is the fastest option.
- Select Insensitive, which is similar to simple, but ignores lowercase/uppercase differences.
- Select Bigram, which is similar to simple, but adds similarity score based on bigrams of the tokens (two adjacent tokens). For instance, would AA-11-BB be considered more similar to AA-11-CC than AA-00-BB, while Simple would see them as equally similar.
- Select Frequency weighted bigram, which is similar to Bigram but gives higher weights to less commonly occurring tokens.
- Select Bigram extra tokenizers, which is similar to bigram, but the model learns that leading zeros, spaces, and lowercase/uppercase should be ignored in matching.
- Select Bigram combo, which calculates all of the above options, relying on the machine learning model to determine the appropriate features to use. Hence, this is a good choice if there already exists some matches the model can train on (see option below). This is the slowest option.
Optional. Configure rule generation
Step 4: Validate suggested matches and update CDF
For pipelines, the entity matching model suggests matches in this order:- Confirmed matches - for pipeline: already confirmed matched entity and asset.
- Confirmed patterns - matches created by one of the already confirmed patterns.
- Predictions from the entity matching model.
Filter entities by type
- All: Show all the entities you have selected for matching.
- Matched: Show entities that have already been matched to an asset. Select this option to change the existing matching for entities.
- Unmatched entities: Show entities that haven’t yet been matched to an asset. Select this option to validate the suggestions and match individual entities or groups of entities to assets.
- Different recommendation: Show entities that have already been matched to an asset but where the model recommends a new match. Select this option to change the existing matching for entities.
Optional. Group by pattern
Confirm matches
Review draft matches
Save the pipeline (for pipelines only)
Step 5: Rerun a matching model in existing pipelines
If a data set receives new data, you can select Rerun pipeline on the overview page to find additional matches. To adjust the matching model, select Open on the More options button.
Step 6: Match the nodes in a 3D model to an asset
Contextualize your 3D models so you can see them related to the asset. You can also see the models in the Cognite apps,such as InField and Maintain.Select 3D model revision
Select assets to match
Configure matching fields
Choose model type
- PDMS - improves response time by filtering out nodes that don’t need to be mapped to assets based on keywords in the name.
- Unfiltered - all nodes in the 3D model are used to match assets.
Review mapping results