Skip to main content
The Cognite OSDU extractor is available as a Windows executable file (.exe file), a Windows installer (.msi file), and as a Docker container.

Before you start

  • Check the server requirements.
  • Assign access capabilities for the extractor to write data to the respective CDF destination resources.
  • Set up a CDF RAW database to serve as a target for the records ingested from OSDU. Make a note of the name.
  • Set up a data set to serve as a target for the data files transferred from OSDU. Make a note of the data set ID.
  • Create an environment file (.env) in the folder where you’ll run the extractor. This file holds the credentials for connecting to CDF and OSDU. You can use the .env_example file or this template as a starting point:
    .env
    COGNITE_BASE_URL=https://api.cognitedata.com
    COGNITE_TENANT_ID=1d0fffff-1fff-4fff-bfff-5fffffffffff
    COGNITE_CLIENT_ID=5a4fffff-3fff-5fff-afff-7fffffffffff
    COGNITE_CLIENT_SECRET=JZZ1Z~YwZzZzZ~NZz9zzZZ9zZzzz9zzZ_9zZZ
    COGNITE_TOKEN_URL=https://login.microsoftonline.com/1d0fffff-1fff-4fff-bfff-5fffffffffff/oauth2/v2.0/token
    COGNITE_CDF_CLUSTER_ID=api
    COGNITE_PROJECT=name_of_cdf_project
    
    OSDU_API_URL=https://api.osduv012.cognite.ai
    OSDU_TENANT_ID=1d2fffff-8fff-4fff-bfff-6fffffffffff
    OSDU_CLIENT_ID=f03fffff-9fff-4fff-9fff-2fffffffffff
    OSDU_CLIENT_SECRET=tPZ9Z~z~9zZZzzZZ9zZzzZ.Z9z_ZzzzZ9Zz9zzZZ
    OSDU_TOKEN_URL=https://login.microsoftonline.com/1d2fffff-8fff-4fff-bfff-6fffffffffff/oauth2/v2.0/token
    OSDU_SCOPE=f03fffff-9fff-4fff-9fff-2fffffffffff/.default openid profile offline_access
    OSDU_PARTITION=example-opendes
    

Install the extractor

  • Windows executable
  • Windows service
1

Download the extractor

Navigate to Data management > Integrate > Extractors and find the OSDU extractor’s package for your operating system.
2

Extract the files

Download the zip file and decompress it to the same folder where you created the .env file.
3

Create a configuration file

4

Run the executable

Open a command line window and run the executable file.
You stop the extractor by pressing Ctrl+C on your keyboard. The log file is stored in the configured path.

Explore the extracted data

You can explore the extracted data in CDF RAW and browse the files in the configured CDF data set.

Data stored in CDF RAW

Records ingested from OSDU are stored in the configured CDF RAW database. Adjust and clean up the data before transforming it into the CDF resource types. A RAW table is automatically created for each configured OSDU kind, and a row is added for each extracted record. The record ID is used as the row key. Each root key in the record becomes a column.
OSDU records displayed in CDF RAW database table

Data stored in CDF Files

The OSDU files linked to the extracted records are saved in CDF Files. For instance, this can be CSV, LAS, or LIS files with the row data for a well-log record. You can transform the data to insert the rows into the corresponding CDF Sequence. The extractor saves the files in CDF as is from the OSDU generic file service API or the DDMS API. The files are not further processed.
Files extracted from OSDU in the Data explorerFiles extracted from OSDU in the configured data set
OSDU files in CDF Data Explorer interface
OSDU files in CDF Data Catalog interface

Monitor extractions

Set up extraction pipelines and set the external ID in the configuration file. The first time you run the extractor, it creates the extraction pipeline entries and logs succeeding runs with a success or failed status. You’ll see the extraction runs as two sections on the Run history tab on the Extraction pipeline page.