Our only telegram handle: @ZKPofficialTelegram Verified

ZPK
BUY COINS
...ChevronTokenized DatasetsChevronMetadata Structuring

Metadata Structuring

Metadata serves two distinct purposes in the tokenized datasets framework:

Metadata Structuring

Public Metadata

Basic descriptive information such as schema definitions, creation timestamps, and general categorization is stored on-chain in Patricia Tries in plaintext to enable discovery and validation. This information is hashed using SHA-512 to ensure integrity, but the hash is stored alongside the plaintext data.

Learn Crypto the Easy Way

Privacy-Sensitive Metadata

Check

Statistical summaries, detailed provenance information, and other potentially sensitive metadata can be encrypted and stored off-chain through off-chain workers, with only hash references maintained on-chain. For this sensitive metadata, zero-knowledge proofs can selectively verify properties without revealing the underlying information.

Check

This dual approach balances the need for discoverability with privacy protection, acknowledging that hashing alone does not provide confidentiality for on-chain information.

The marketplace implements a standardized metadata schema that balances comprehensiveness with efficiency. The schema includes:

Dataset identification

Dataset identification

Basic information like title, description, version, and creation timestamp

Technical specifications

Technical specifications

Format, size, encoding, compression method, and schema definition

Quality indicators

Quality indicators

Completeness, consistency metrics, update frequency, and last verification date

Domain-specific attributes

Domain-specific attributes

Field-relevant indicators like resolution for images, sampling rate for audio, or collection methodology for surveys

Usage terms

Usage terms

License type, attribution requirements, and permitted use categories

Dataset identification

Dataset identification

Basic information like title, description, version, and creation timestamp

Technical specifications

Technical specifications

Format, size, encoding, compression method, and schema definition

Quality indicators

Quality indicators

Completeness, consistency metrics, update frequency, and last verification date

Domain-specific attributes

Domain-specific attributes

Field-relevant indicators like resolution for images, sampling rate for audio, or collection methodology for surveys

Usage terms

Usage terms

License type, attribution requirements, and permitted use categories

Functions of Metadata in the Tokenization Process

This structured approach enables efficient discovery and evaluation of datasets while ensuring that critical information is consistently available across the marketplace. The metadata serves several crucial functions in the tokenization process:

icon

It enables efficient dataset discovery without requiring access to the full data

icon

It provides the basis for quality assessment and validation before purchase

icon

It facilitates provenance tracking and attribution for regulatory compliance

icon

It documents the technical requirements for utilizing the dataset effectively

icon

It enables efficient dataset discovery without requiring access to the full data

icon

It provides the basis for quality assessment and validation before purchase

icon

It facilitates provenance tracking and attribution for regulatory compliance

icon

It documents the technical requirements for utilizing the dataset effectively

Consider a 500 MB dataset of weather records: its metadata might include the source, schema (e.g., columns for temperature, humidity, wind speed), statistical summaries (e.g., average temperature of 15°C), and a timestamp.
The metadata is serialized, hashed with SHA-512 to produce a fixed-length digest, and linked to the CID. This allows consumers to verify the dataset's authenticity and relevance before purchase.

The Proof Behind Champions

Those who compete at the edge of human precision now back the technology that defines digital truth.