Resources | Cegal

Petoro ensures data quality and unambiguity with help from the "Source of Truth" project

Written by Editorial staff | Jan 8, 2025 12:29:17 PM

With Data Management tools from Cegal, data sources and data quality are reviewed and then secured at Petoro. The "Source of Truth" project saves Petoro hundreds of hours of manual work, removes sources of error from valuable information from the Norwegian shelf and saves Norway millions of kroner.

"Managing data is a bit like playing whispers. You start out with one truth. Then something changes, perhaps the data is transferred to another system, some decimal places are deleted and the number is rounded off. Then there are errors and consequential errors, but when you go back to the source, it is not certain that the source is correct either. Perhaps the figures have been updated with a change that has not been returned to the original source", begins Åge Løklingholm, Principal Data & Digitalization Consultant at Cegal.

Being sure that the data is correct, up-to-date and accurate is a challenge for most businesses. The issue has become even more relevant in the AI age, when artificial intelligence is taking over and automating more and more tasks. This makes it even more important to be sure that the data is correct. This is described by Cegal's Anaïs Kermagoret, Principal Solution Architect at the CTO Office.

Petoro manages the state's oil and gas interests

Petoro's task is to manage the state's interests in the oil and gas sector on the Norwegian continental shelf. Petoro shall create the greatest possible value and achieve the highest possible income for the state from the State's Direct Financial Interest (SDFI). Through the SDFI, the Norwegian state has interests in a number of oil and gas fields, pipelines and onshore facilities. These interests are managed by Petoro, which prioritizes creating more investment opportunities, finding solutions across the portfolio and taking care of our surroundings. The cash flow from the SDFI is transferred in its entirety to the Pension Fund of Norway.

"A major challenge is that we have enormous amounts of data collected over a long period of time. In addition, the original data is often processed to adapt to tools that provide insight and as a basis for analysis. Over time, this creates uncertainty about data quality and what is the right information. This is because the interpretations we use to estimate the value of decisions and future revenues are updated and reassessed over time, or unintentional changes occur in the import of data," explains Ragnar Sandvik, Strategic Advisor at Petoro.

The authorities' official data source

For Petoro, which manages highly valuable information from oil and gas production on the Norwegian continental shelf, incorrect data can have major financial consequences. Therefore, Petoro initiated a Data Management project to ensure data quality and be confident that the data sources are correct and up to date. Cegal assists with program developers, analysts and data expertise in the "Source of Truth" (SoT) project.

"We want the data to give the same answer, no matter what data source we use. We want there to be one truth for us, the authorities and the operators on the Norwegian continental shelf."

Ragnar Sandvik, strategic advisor in Petoro

The first part of the project involves the geographical location of about 100 wells. This was a PoC (Proof of Concept) to show the benefit of the SoT project and assess whether it was profitable to expand the project to cover larger parts of Petoro's operations.

"DISKOS is Norway's official data source and the authorities' archive. If the official data is not consistent with Equinor's, for example, we need to know why. Has there been an error at Equinor, has Equinor corrected incorrect data that has not been updated or is the error with us in our systems. There are many sources of error. For example, when data is moved between systems, decimals can be rounded off and removed or geolocation can be described in slightly different ways. Then there are small changes in the data that can have major consequences in the results of analyses," Sandvik continues.

Sourcing data from images, PDFs and databases

Petoro started by examining the data associated with the wells on the shelf, so-called metadata; location geographically and in depth, whether they are in production, what the status is and so on. It is very important that the wells are correctly located in the data models Petoro uses. If the information is wrong, this can lead to incorrect decisions. Petoro's goal is to maximize the value of the fields on the Norwegian shelf, so we must avoid decisions being made on the wrong basis. "We first wanted to investigate whether the official data retrieved from wells and reported to the authorities is consistent with the latest data used by the operator and the data we use ourselves in our simulations," says Sandvik.


Ragnar Sandvik - Petoro

The challenge in the project is that parts of the data reported to the authorities are not structured in a database or as machine-readable files, but are stored in pdf reports and as images.

The information to be structured thus comes from four sources:  
  1. IT systems
  2. PDFs
  3. Machine-readable files
  4. Images

The reason for this fragmentation is partly that information about the first wells on the oil fields was collected far back in time and partly because there is no standardized template for the industry.


Many failures proved the usefulness of Data Management

"Our first test proved the usefulness of the system. The system read and evaluated different sources of well data from approximately 100 wells in a field over the course of a day. Unfortunately, the results were disheartening for the industry. The sources of the data were not consistent and there were a lot of discrepancies, and we didn't know where the errors lay," says Sandvik.

With the Source of Truth project, Cegal has provided Petoro with a tool for data evaluation and data comparisons. By utilizing advanced analytics and automated processes, SoT makes it possible to assess and compare data from different data sources efficiently. Microsoft Fabric and Microsoft Power BI are key components of the solution. Integration with Fabric and Power BI streamlines data ingestion and visualization.

This is the foundation of the solution:    
  • Data ingestion with Microsoft Fabric: SoT uses Microsoft Fabric to efficiently ingest large amounts of data from various sources. This includes structured data as well as unstructured data from scanned documents and PDFs, ensuring that all relevant information is available for analysis.
  • Data visualization with Power BI: Once the data is incorporated, Power BI is used to create intuitive dashboards and reports. These visualizations facilitate deeper insights and streamline the process of establishing models and quality-assuring models on which to base decisions.
The Cegal solution has resulted in significant time savings. Data that is stored unstructured is automatically read into the system regardless of the format in which it is stored.

For us, the job has been reduced to using an intuitive dashboard to identify wells with different interpretations and values. Instead of compiling data, we can do more valuable work, such as checking the sources and quality assuring data. Now we spend our time analyzing and evaluating why there are differences in the interpretations, says Sandvik.

Saves hundreds of hours and millions of dollars

The Source of Truth project has given Petoro a number of benefits:

  • Increased productivity:
    Major time savings in the hunt for sources of error. Finds error sources faster.

  • Fewer data conflicts:
    A significant reduction in conflicts around data values and interpretations, resulting in more effective collaboration between partners.

  • Faster consensus on data sources:
    There is faster consensus on which dataset is right. This also speeds up projects.

  • Higher confidence in data-driven decisions:
    Increased confidence in the selected data sources, resulting in more informed and reliable decision-making.

  • Efficient data management:
    Streamlined data ingestion and visualization processes through Microsoft Fabric and Power BI. This makes it easier to analyze and understand large data sets.

"If we were to sort data manually, it would take weeks. It would also be easy to make mistakes, and for humans this is a demotivating job. By using Source of Truth, Petoro will be able to spend its time creating added value for the Norwegian people rather than checking data." 

Ragnar Sandvik, strategic advisor in Petoro

 

Sandvik says that Petoro chose Cegal as a partner in the project because of Cegal's expertise in exploiting the possibilities of Microsoft Azure

"In addition, Cegal has industry knowledge and knows Petoro. This helps to identify the technical opportunities based on our business needs, our processes and the content of the data", says Sandvik.

"We have barely started. We have a lot of data that needs to go into the application. There are several thousand wells on the Norwegian continental shelf (NCS) that are used in models as a basis for decision-making in the oil companies", he concludes.

Petoro is the licensee of about one third of the oil and gas reserves on the NCS, with interests in 43 producing fields.