Skip to main content

[DSB:THEORY.SEMANTIC_BINDING] [DSI:NAME=DOCUMENT_SEMANTIC_IDENTIFIER;ROLE=LEARNING;AUTHOR=SIMON_MACFARLANE;VERSION=1_0] [DSM:SYSTEM=SEMANTIC_BINDING;AUDIENCE=PUBLIC,PROFESSIONAL,AUTHORING_SYSTEMS]


Page 3.2 — DSI — Document Semantic Identifier


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.PURPOSE.DEFINITION.3-2-1] [SSI:TITLE=WHAT_IS_DSI;AUTHORITY=PRIMARY;REF=3-2-1] [SSM:SECTION=DEFINITION;INTENT=DEFINITION;ABSTRACTION=HIGH]

What DSI Is

Document Semantic Identifier (DSI) declares the stable identity of a document.

It answers one—and only one—question:

“Which document is this?”

DSI allows systems to treat a document as a distinct semantic artefact, independent of:

  • its content
  • its structure
  • its applicability
  • its audience
  • its internal sections

DSI identifies the document.
It does not describe what the document means.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.SCOPE.CONSTRAINT.3-2-2] [SSI:TITLE=WHAT_DSI_DOES_NOT_DO;AUTHORITY=PRIMARY;REF=3-2-2] [SSM:SECTION=DEFINITION;INTENT=CONSTRAINT;ABSTRACTION=MEDIUM]

What DSI Does Not Do

DSI must never:

  • encode semantic meaning
  • declare subject matter
  • imply intent or abstraction
  • constrain usage or audience
  • affect retrieval eligibility directly

Two documents may share the same DSB
but must never share the same DSI.

DSI identifies a document.
Meaning is declared elsewhere.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.FIELDS.DEFINITION.3-2-3] [SSI:TITLE=DSI_FIELDS;AUTHORITY=PRIMARY;REF=3-2-3] [SSM:SECTION=DEFINITION;INTENT=DEFINITION;ABSTRACTION=MEDIUM]

Canonical DSI Fields

DSI is declared using the following fields:

  • NAME — canonical document identifier (stable)
  • ROLE — document role within the system
  • AUTHOR — accountable author or authority
  • VERSION — explicit version identifier

Example: '[XXX: NAME=DOCUMENT_SEMANTIC_IDENTIFIER; ROLE=LEARNING; AUTHOR=SIMON_MACFARLANE; VERSION=1_0 ]'

These fields uniquely identify a document instance.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.ROLE.MODEL.3-2-4] [SSI:TITLE=DOCUMENT_ROLE;AUTHORITY=PRIMARY;REF=3-2-4] [SSM:SECTION=CONCEPT;INTENT=MODEL;ABSTRACTION=HIGH]

Document Role

ROLE declares the functional role of the document within the system.

Typical values include:

  • LEARNING
  • REFERENCE
  • POLICY
  • SPECIFICATION
  • PROCEDURE
  • GOVERNANCE

Role supports:

  • retrieval ranking
  • response composition
  • document lifecycle management
  • cross-document reasoning

ROLE does not change meaning.
It changes how the document is used.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.VERSION.MODEL.3-2-5] [SSI:TITLE=VERSIONING;AUTHORITY=PRIMARY;REF=3-2-5] [SSM:SECTION=CONCEPT;INTENT=MODEL;ABSTRACTION=HIGH]

Versioning

VERSION identifies a specific revision of the document.

Versioning enables:

  • controlled evolution
  • safe updates
  • reproducible answers
  • auditability over time

Changing VERSION creates a new document instance
while preserving lineage through NAME.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.SYSTEM_USE.APPLICATION.3-2-6] [SSI:TITLE=SYSTEM_USE_OF_DSI;AUTHORITY=PRIMARY;REF=3-2-6] [SSM:SECTION=CONCEPT;INTENT=APPLICATION;ABSTRACTION=HIGH]

How Systems Use DSI

Systems rely on DSI to:

  • cluster sections by document
  • enforce document-level constraints
  • attribute answers and citations
  • track provenance and audit trails
  • distinguish versions in retrieval

DSI is the anchor for document identity,
not semantic interpretation.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.RULES.CONSTRAINT.3-2-7] [SSI:TITLE=CANONICAL_DSI_RULES;AUTHORITY=PRIMARY;REF=3-2-7] [SSM:SECTION=DEFINITION;INTENT=CONSTRAINT;ABSTRACTION=MEDIUM]

Canonical Rules for DSI

  • Exactly one DSI per document
  • DSI must appear once, at document level
  • NAME must be globally unique
  • VERSION must be explicit
  • DSI must not encode meaning or scope
  • DSI may change only via versioning

Violating these rules breaks document identity guarantees.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.SUMMARY.SUMMARY.3-2-8] [SSI:TITLE=SUMMARY;AUTHORITY=SECONDARY;REF=3-2-8] [SSM:SECTION=SUMMARY;INTENT=SUMMARY;ABSTRACTION=HIGH]

Summary

Document Semantic Identifier (DSI) exists to declare which document this is.

It is intentionally:

  • meaning-neutral
  • structurally simple
  • stable across systems
  • auditable over time

By separating identity from meaning and applicability:

  • DSB defines what the document is about
  • DSI defines which document this is
  • DSM defines where and how it may be used

This separation is mandatory for deterministic retrieval, safe reuse, and explainable AI behaviour.


[SSB:THEORY.SEMANTIC_BINDING.SECTION_SECTION_IDENTIFIER.STATUS.STATUS.3-2-9] [SSI:TITLE=STATUS;AUTHORITY=SECONDARY;REF=3-2-9] [SSM:SECTION=STATUS;INTENT=STATUS;ABSTRACTION=LOW]

Status

Document Semantic Identifier (DSI) is active and mandatory.

No semantically bound document is valid without a correctly declared DSI.