top of page

GEO PROVENANCE DATASET

Trace the Origin of Your Code and Protect Your Software Supply Chain

Understanding the geographic and authorial origin of your codebase is critical to regulatory compliance and trust. The SCANOSS Geo Provenance Dataset helps identify where your code comes from—who wrote it, and where it originated—ensuring supply chain transparency.

Product_04_Provenance-Light.png

Detect geographic origin + authorship metadata

Validate your

software supply chain integrity

Identify code of unknown or high-risk provenance

How it works

Using commit history, code fingerprinting, and author attribution models, SCANOSS provides detailed insights into the provenance of every open source component in your software. This includes both declared origins—where authorship or source information is explicitly stated—and inferred origins, identified through analysis of code patterns, metadata, and historical context, even when no explicit information is available.

Step 1:

Scan your code repositories with SCANOSS

​​

Step 2:

Get author and location metadata from code and commit history

​​

Step 3:

Retrieve geographic provenance insights via API

Download Technical Datasheet

USE CASE

Integrated Open Source Software License Compliance

USE CASE

Open Source Software in AI-generated Code

Ready to get started?

bottom of page