Glossary of Digital Twins

A glossary of digital twins and digital twin technology from the Digital Twin Consortium.

This glossary is not intended to cover industry-specific domains for which one might build a digital twin system.

The glossary is a Work-In-Progress draft.

We suggest starting with the term Digital Twin and following links from there.

Table of Contents


Aggregation

Aggregation is an integration strategy that involves copying data to gather it into a centralized location.

Miriam Webster defines the verb “aggregate” as “to collect or gather into a mass or whole”. In software architecture, aggregation implies gathering, copying, and possibly transforming information from multiple systems into a single centralized system.

Back to Table of Contents.


Computational Representation

A computational representation is an executable digital representation consisting of computational algorithms and supporting data representing some subject matter from a dynamic perspective.

Examples include simulations and predictive analytics.

The subject matter of computational representation is typically a process or a set of entities with properties that are a function of time over the time-frame of interest.

When executed (aka “run”), computational representations often take stored representations as input and their computational algorithms produce stored representations as output.

Computational representations may also use reference data, which is really just another example of a stored representation.

Examples of computational representations include Finite Element Analysis (FEA) models, Machine Learning models, and various other kinds of simulations based on mathematical equations, including those describing laws of physics and engineering.

Two categories of stored representations

Related terms

Computational representations are distinguished from stored representations, that do not contain algorithms.

Back to Table of Contents.


Cyber-Physical System

A system consisting of physical and digital systems integrated via networking.

A digital twin considered together with its physical twin is an example of a cyber-physical system.

A physical device with a closely integrated digital control system is another example of a cyber-physical system. Such a cyber-physical system may be considered a real-world entity represented by a digital twin.

See Wikipedia's definition of cyber-physical system.

Back to Table of Contents.


Data Interoperability

Data Interoperability is a functional digital twin subsystem that supports its integration representation/function by aggregating or federating data from digital systems that may be considered "inside" or "external to" the digital twin system.

Back to Table of Contents.


Data Model

A data model is a model of data that describes its structure, datatypes, and meaning.

A data model will use some data modeling paradigm that defines its approach to structuring data. Some examples include:

  • The relational paradigm structures data into tables with columns and foreign-keys
  • The entity-relationship paradigm structures data into classes with properties and supports relationship classes to express relations.
  • The triple-graph paradigm structures data as a series of subject-predicate-object "triples"
  • The property-graph paradigm structures data as nodes in a graph with nodes defined by classes with properties and edges defined by separate classes.

The datatypes associated with columns/properties/attributes may be "text" or "date" or "number" or more specialized datatypes such as "integer number" or "floating-point number". The specific datatypes available will vary per data-modeling language.

The meaning of particular data structured according to the data model may be encoded by naming data modeling elements (tables/columns/classes/properties) using the names of real-world concepts. For example, particular data that lies in the part of the structure named "equipment.weight" with datatype "number" can be understood to represent the weight of a piece of physical equipment, expressed numerically. Those names may be cross-referenced with natural-language documentation providing a more-detailed description of the meaning of the modeled data. Some data-modeling languages allow additional metadata to be associated to elements, e.g. the units-of-measure for "equipment.weight".

A given data model may lie somewhere along an implementation spectrum. It may be an implementation-neutral "logical" or "conceptual" data model or a persistence-specific data model like a SQL database schema. Often, a high-level conceptual data model is exposed through an API, but is mapped to a low-level implementation-specific data model by the software that implements the API.

The subject matter of a data model may be the data of a stored representation. Conversely, many stored representations have a data model that describes their structure and meaning.

The medium of a "data model" is a data modeling language.

The modeling perspective of a "data model" encompasses its data-modeling paradigm, where it sits on the "implementation perspective", as well as the choices it makes to simplify the real-world subject matter into named/structured data that describes aspects of that subject matter.

The following figure shows how a Data Model relates to a Stored Representation and how both relate to the physical world.
Data Model

Sometimes people use the unqualified term "model" to mean "data model", especially when they do "data modeling" as part of their job. They may use a term like "building model" to refer to a data model for creating stored representations of buildings (rather than a specific digital representation of a particular building). For this reason, it may be helpful to use a synonym like "schema", since "building schema" will be more-reliably interpreted as a "data model for structuring data that represents buildings" and never as a stored representation (aka "model") of a particular building.

As an example, a stored representation of a particular real-world building will consist of data that represents relevant aspects of that building for the desired use cases. The stored representation will have a "buildings data model" that defines the concepts of real-world buildings that can be expressed in the data of the stored representation, and how they are named, structured, and represented in the stored representation's data.

See data modeling in digital twin systems.

Alternate terms

  • "Ontology" is sometimes used as a synonym for "data model", particularly for data-modeling ontologies
  • "Schema" is sometimes used as a synonym for "data model". DDL defines database schemas. OData uses CSDL (Common Schema Definition Language). RDFS (Resource Description Framework Schema) is sometimes used in conjunction with OWL to define data models using a triple-graph approach.
  • "Information Model" is sometimes used as a synonym for "data model", though some parties make subtle distinctions between "information model" and "data model", such as taking it to imply a data model that lies on the "conceptual" end of the implementation spectrum.

The various synonyms of "data model" and the fact that some ontologies can be used as data models can be a significant source of confusion in the design and development of stored representations for digital twin systems.

Data Models and Ontologies

Back to Table of Contents.


Data Modeling in Digital Twin Systems

Data models and ontologies play various functional roles in digital twin systems.

Digital Twin Systems

Data Modeling in Digital Twin Systems

A persistence data model uses a data modeling language to describe data structures and types in a way that is compatible with the persistence technology. If the persistence technology is a relational database, this will be a SQL DDL "schema". Other persistence technologies will use different data modeling paradigms and data modeling languages.

Service interfaces embody a logical data model that describes the data structures and types used by an API or protocol. This data model may differ from the lower-level persistence data model, with the API implementation handling the mapping between the two. Examples include OData's CSDL (for entity-relationship style REST APIs) and GraphQL Schema (for property-graph style APIs). Other common ways of defining the "logical" data model of an API or protocol include JSON Schema and XML schema. Non-web-based APIs model data in programming-language-specific ways.

Conceptual data models are intended to be relatively free of the implementation-specific functional, performance, and scalability concerns of service interfaces and stored representations so that they can focus on describing the real-world concepts in the domain of the digital twin. These data models may or may not be machine-readable, as they may only be used to guide the development of the GUI and lower-level data models.

Conceptual data models may be used in a digital twin system that is integrating information from multiple stored representations, each with its own persistence and logical data models. The digital twin system may have an integration representation/function that provides access to data using a conceptual data model that is compatible with the overall ontology of the digital twin system. . A variety of data modeling languages can be used for this function, including those for data-modeling ontologies.

Because the integration representation/function may employ both aggregation and federation integration strategies, data modeling languages compatible with integration-friendly service interfaces like OData, SPARQL, and GraphQL may be useful. These support both access to aggregated integration representations and mapping to physically-separate federated stored representations.

Most data modeling languages can be used in many of these functional roles, and roles can be combined. A single data model could be used at the conceptual, service interface, and persistence levels if system requirements can be met. The data-modeling ontology of the integration service interface may be used as the overall ontology of the digital twin system and the data model of the integration representation/function.

Data Models and Ontologies

Back to Table of Contents.


Data Modeling Language

A data modeling language is a lexical or graphical language used define the data structures and data types of a data model.

Examples of graphical data modeling languages include:

  • UML (Unified Modeling Language): a general modeling language that can be used for data modeling.
  • ERD (Entity Relationship Diagrams): a data modeling language well-suited for relational databases.

Lexical data modeling languages include:

  • SQL Data Definition Language (DDL): The standard language for relational database schemas.
  • Digital Twin Definition Language (DTDL): A property-graph data-modeling language with support for integration with IoT via "telemetry" properties. DTDL was created by Microsoft and open-sourced.
  • GraphQL Schema: A property-graph data-modeling language used with GraphQL to support federation of data from multiple sources and purpose-specific views of data. GraphQL was created by Facebook and open-sourced.
  • Web Ontology Language (OWL): A triple-graph data-modeling language that is part of the Semantic Web Stack.
  • EXPRESS: A entity-relationship data-modeling language used by STEP for modeling geometry and IFC for modeling built assets.
  • Many more examples can be found here.

See data modeling in digital twin systems.

Back to Table of Contents.


Data-modeling Ontology

A data-modeling ontology is an ontology that is also a conceptual data model.

Data-modeling ontologies describe real-world universals using data structures and datatypes in a particular data-modeling paradigm. Ideally, they also include precise human-readable definitions of the universals that they are describing.

An incomplete list of examples of data-modeling ontologies relevant to digital twins includes:

Though data-modeling ontologies define data structures and datatypes that can be used to represent instances of the ontologically-defined entities, those data structures and types don't have to be used to store or transmit the data. Data stored in different data structures can refer to the terms defined by the ontology to clarify the semantics of the data. Data-modeling ontologies may prioritize unambiguous description of universals over optimization of data persistence.

Back to Table of Contents.


Data Modeling Paradigm

A data modeling paradigm is an approach to data modeling embodied in a set of fundamental constructs that govern how data is structured and typed.

Some common examples include:

  • The relational paradigm structures data into tables with columns and foreign-keys
  • The entity-relationship paradigm structures data into classes with properties and supports relationship classes to express relations.
  • The triple-graph paradigm structures data as a series of subject-predicate-object "triples"
  • The property-graph paradigm structures data as nodes in a graph with nodes defined by classes with properties and edges defined by separate classes.
  • JSON uses an object-oriented data-modeling paradigm defined by ECMA-404.
  • XML uses a hierarchical data-modeling paradigm defined by the W3C XML Information Set.

Specialized stored representations may use idiosyncratic data modeling paradigms.

See data modeling in digital twin systems.

Back to Table of Contents.


Digital Representation

A digital representation is a representation (aka "model") of some subject matter consisting of structured digital information and/or computational algorithms.

Before computers, we represented/modeled physical things using clay, wood, or plastic models or drawings on paper. Nowadays, we use digital representations, which may include 3D models, drawings, or lexical databases of various sorts.

Alternate terms
"Digital model" can be used as a synonym for "digital representation."

Narrower terms

Two categories of stored representations

Related terms

  • In the context of our definition of digital twin, a digital representation's subject matter may include many entities and processes.

Back to Table of Contents.


Digital System-of-Systems

A digital system-of-systems is a digital system comprised of one or more other digital systems

Back to Table of Contents.


Digital Thread

Digital thread is a mechanism for correlating information across multiple dimensions of the virtual representation, where the dimensions include (but are not limited to) time or lifecycle stage (including design intent), kind-of-model, and configuration history.

Digital thread generally relies on stable, consistent real-world identifiers to "join" across different repositories of information.

Back to Table of Contents.


Digital Twin

A digital twin is a virtual representation of real-world entities and processes, synchronized at a specified frequency and fidelity.

Digital Twin Systems transform business by accelerating holistic understanding, optimal decision-making, and effective action.

Digital Twins use real-time and historical data to represent the past and present and simulate predicted futures.

Digital Twins are motivated by outcomes, tailored to use cases, powered by integration, built on data, guided by domain knowledge, and implemented in IT/OT systems.

Alternate, non-preferred terms

Related terms

Back to Table of Contents.


Digital Twin Applications

Digital twin applications are software applications that leverage digital twin services and/or service interfaces.

Digital twin applications may be considered to be "using" a digital twin system or to be "part of" such a system.

Digital twin applications are driven by use-cases.

Back to Table of Contents.


Digital Twin Platform

A digital twin platform is a set of integrated services, applications, and other digital twin subsystems that are designed to be used to implement [digital twin systems].

Back to Table of Contents.


Digital Twin Services

Digital twin services are functional subsystems of a digital twin system that provide value by leveraging the digital twin.

Examples of digital twin services include:

  • Visualization services for web, mobile, desktop, Virtual Reality (VR), and Augmented Reality (AR) devices.
  • Analysis services of various kinds.
  • Many other kinds of services that will be enumerated at a later date.

Most digital twin services provide a service interface for use by other services and digital twin applications.

Back to Table of Contents.


Digital Twin System

A digital twin system is a system-of-systems that implements a digital twin.

Digital Twin System

Related terms

Back to Table of Contents.


Digital Twin System Feature

A digital twin system feature is a functionality of a digital twin system.

Digital twin system features are generally implemented by subsystems of the digital twin system

Back to Table of Contents.


Digital Twin Use Case

A digital twin use case is a use case in which digital twins may be used to accomplish desired outcomes.

Digital Twin use cases are designed to improve outcomes, provide requirements for digital twin systems.

A digital twin use case typically defines requirements for synchronization including synchronization frequency, etc.

Back to Table of Contents.


Federation

Federation is an integration strategy that involves coordinated access to data repositories without making centralized copies of them.

Federation only implies gathering enough centralized "index" information to use the federated systems in a coordinated manner, without copying the bulk of their data. If transformation of data is required, it is performed on-the-fly.

Back to Table of Contents.


Integration Representation/Function

The integration representation/function is a subsystem of a digital twin system that provides integrated and semantically-aligned access to the stored representations of the digital twin system.

The integration representation/function may comprise a stored representation for data aggregation and functions for data federation.

The integration representation/function is closely associated with digital thread.

See data modeling in digital twin systems.

Back to Table of Contents.


Integration Service Interface

A integration service interface is a service interface for the integration representation/function of a digital twin system.

Back to Table of Contents.


Integration Strategies

Integration Strategies (in digital twin systems) are strategies for integrating data/information to achieve a cohesive virtual representation.

In software architecture, there are two main patterns for data integration: aggregation and federation.

Individual stored representations (including the inputs and outputs of computational representations) can be integrated by the integration representation/function of a digital twin system.

The integration service interface of a digital twin system can function as the service interface that exposes the overall digital twin system as a stored representation which can in turn be aggregated by a "larger" digital twin system--a federation of digital twin systems.

Back to Table of Contents.


Interventional Frequency

An interventional frequency is a synchronization frequency characterizing how often interventions in reality occur in order to synchronize reality with the state of a stored representation.

Back to Table of Contents.


IT/OT Platform

The IT/OT Platform of a digital twin system is the set of Information Technology and Operational Technology infrastructure and services on which the subsystems of a digital twin system are implemented.

Subsystems of the IT/OT Platform include:

  • A software platform and tooling stack.
  • Platform APIs
  • Orchestration of low-level infrastructure
  • Compute, storage, and networking infrastructure

These systems could be cloud-based, on premises, embedded, mobile, distributed, etc.

Back to Table of Contents.


Management and Automation

Management and Automation is a functional digital twin subsystem that supports general management and operation of the digital twin system.

Back to Table of Contents.


Model

A representation of some subject matter modeled in some medium from some modeling perspective.

The subject matter could be a car, a city, a supply chain, or a factory and its surroundings.

The modeling medium could be clay, wood, plastic, mathematical equations in a spreadsheet, or ink-on-paper. Digital twins are mostly concerned with the "digital" medium--information stored as bits and bytes that can be loaded into a computer's working memory.

The modeling perspective is the set simplifications made by the modeler. Every model simplifies its subject matter in some way, otherwise it would be an actual replica of the subject matter. It may be smaller in scale or lower in fidelity. It may only represent geometric aspects or also represent material composition. It may only represent functional or financial or legal aspects. It may represent a complex 3D form as a simple curve in space or as a set of triangles. It is common for representational models to represent "any and all aspects of the subject matter that are relevant to the use cases for the model" without explicitly stating their modeling perspective.

When you see "____ model" sometimes the blank is filled with the subject matter (e.g. a "building model") and sometimes with a modeling media (e.g. a "clay model" or "digital model") and occasionally even with a modeling perspective (e.g. a "functional model", "physical model", or "financial model").

Whether a digital model is a stored representation or computational representation can also be considered part of its modeling perspective.

Back to Table of Contents.


Modeling Fidelity

Back to Table of Contents.


Observational Frequency

An observational frequency is a synchronization frequency characterizing how often observations of reality are made and reflected in a stored representation.

Back to Table of Contents.


Ontology

An ontology is a representational artefact that describes universals and certain relations among them in a domain of interest.

"Universals" are types or classes of entities, in contrast to "particulars" which are instances of entities.

An ontology can be written in any number of natural or formal languages and a given ontological understanding can be represented in more than one language.

A human-readable ontology is used to assist peoples understanding of and communication regarding the ontology's domain.

A machine-readable ontology can be read by software to associate the semantics defined in the ontology with data. This may make the data more meaningful to humans. It may also facilitate data interoperability if data structures from different systems are mapped to terms from a common ontology.

Machine-readable ontologies should also include detailed human-readable definitions of their terms in order to be used effectively for meaningful semantic mapping.

Ontologies generally do not specify data structures or data types used to represent particular entities, but data-modeling ontologies do.

An ontology can be used in a digital twin system to provide consistent semantics and support data integration among different [stored representations].

Data Models and Ontologies

Also see wikipedia's entry for ontology (information science) and ontology (philosophical).

The given definition is a simplified version of the definition from "Building Ontologies with Basic Formal Ontology" by Barry Smith.

Back to Table of Contents.


Physical Twin

A physical twin is a set of real-world entities and processes that corresponds to a digital twin

The physical twin is the subject matter modeled by a digital twin.

The physical twin may constitute physical systems of interest and their environment, interactions, and processes.

Alternate, non-preferred terms

  • Physical asset - May be used in specific domains (such as infrastructure), but is not broad enough for general purposes.

Back to Table of Contents.


Real-to-virtual synchronization

Real-to-virtual synchronization (aka observational synchronization) is synchronization that causes a stored representation to reflect new observations of the real world.

In other words, it is the process of mirroring the real world in the virtual representation, based on observation of the real-world.

Back to Table of Contents.


Real-World

In the context of digital twins, "real-world" refers to the subject matter of a stored representation or digital twin (which can be considered as a kind of stored representation).

The DTC definition of digital twin refers to "real-world entities and processes," where:

  • "Entities" refers to things that exist over a significant time-span. These are typically physical objects and systems but also include immaterial things like organizations, supply-chains, work-orders, etc.
  • "Processes" refers to events or activities that occur in time. These might be transportation journeys or accidents, power outages, pharmaceutical production processes, weather events, heart attacks, or destructive processes such as corrosion or earthquakes.

The interactions among entities and processes are also included in the "real-world" subject matter.

Because digital twin systems are "real", the term "real-world" in our context refers to portion of the real-world that is outside of the digital twin system and is the subject matter of its digital twin. Of course, one could implement a separate digital twin of the IT/OT Systems that implement a given digital twin system.

Back to Table of Contents.


Security, Trust, and Governance

Security, Trust, and Governance are a set of foundational concerns for a the functional subsystems of a digital twin system.

The concerns include privacy, security, safety, resilience, and reliability.

Back to Table of Contents.


Service Interface

A system’s service interface is a digitally addressable endpoint that implements a protocol through which other systems and services may interact with the system.

Narrower terms

  • Each subsystem may have its own service interface, e.g. a virtual representation service interface, or a service interface for a particular kind of stored representation, or a visualization service interface, etc.

Back to Table of Contents.


Simulation Modeling Language

A simulation modeling language is a lexical or graphic language used to define simulation models for computational representations.

Examples include Modelica, Simulink, etc. Simulations can also be coded in programming languages such as FORTRAN, C, Lisp, etc. Not all computational representations are defined via simulation modeling languages, because simulation techniques and technologies vary widely.

See wikipedia for simulation languages and simulation software.

Back to Table of Contents.


Subsystems of a Digital Twin System

A subsystem of a digital twin system is a system that implements a digital twin system feature.

Examples are listed in the entry for digital twin system.

Subsystems of a digital twin system may have service interfaces

Back to Table of Contents.


Stored Representation

A stored representation is a digital representation consisting of stored structured information, representing states of some subject matter.

A stored representation can be queried, in contrast to a computational representation, which must be executed to produce output.

Stored representations can take the form of databases of all kinds including relational databases, graph databases and other NoSQL databases, IoT "data historians", specialized CAD, BIM, and GIS repositories, 3D meshes derived from photogrammetry and/or point clouds, satellite or radar imagery, spreadsheets, intelligent 2D drawings and schematics, etc.

Two categories of digital representations

Related terms

Back to Table of Contents.


Synchronization

Synchronization is the process of causing the virtual representation to more-closely match the real-world or cause the real-world to more-closely match the virtual representation of a desired state.

Synchronization is implemented via some synchronization mechanism.

Narrower terms

Back to Table of Contents.


Synchronization Fidelity

Back to Table of Contents.


Synchronization Frequency

Synchronization Frequency is a frequency characterizing how often synchronization occurs

The frequency will not be uniform for a digital twin. It may vary per stored representation or even within the stored representation.

Narrower terms

Back to Table of Contents.


Synchronization Mechanism

A synchronization mechanism is a mechanism through which synchronization is implemented.

Back to Table of Contents.


Use Case

A use case is a set of circumstances or a scenario for the use of something.

A use case is typically associated with constraints or other requirements.

Back to Table of Contents.


Virtual

Not physically existing as such but made by software to appear to do so.

We have used google's definition of "virtual", because it captures the intent of a digital twin to allow one to virtually interact with a portion of reality without actually directly interacting with it. Virtual observations are indirect (and may be of lower fidelity than actual in-person observations), but sensors may also provide more accurate, quantified observations that can be recorded for future reference. The digital virtual world is also more accessible that the real one, and it is machine-readable, so that computers can query it and perform analytics and simulations with it.

"Virtual" is not technically a synonym for "digital", but in the context of a digital twin the virtual world is implemented digitally, and the terms are used somewhat interchangeably. "Virtual" conveys more of the intent of creating an accessible replica of the real world and "digital" conveys an important implementation detail.

Thus the intent of the term "virtual representation" was to convey a digital artefact that serves as a "virtual replica" of the real-world or a "virtual world" that mirrors the real world. The virtual representation consists of stored representations with different modeling perspectives, but by "joining" them together with digital threads we achieve a complex, multi-facetted model (aka "representation") that is greater than the sum of its parts. The virtual representation aspires to give the "appearance of" a reality that is richer than any individual stored representation.

Back to Table of Contents.


Virtual Representation

A virtual representation is a complex, cohesive digital representation comprised of stored representations, computational representations, and supporting data which collectively provide an information-rich "virtual" experience of their subject matter.

The integration representation/function of a digital twin system "virtually" joins information of various kinds together into the cohesive, multi-faceted, representation of reality that we call a "virtual representation".

Alternate terms

  • Avatar
  • Virtual replica

Back to Table of Contents.


Virtual-to-real synchronization

Virtual-to-real synchronization (aka interventional synchronization) is synchronization that intervenes in the real world to make it more-closely match a stored representation of a desired state.

In other words, it is the process of mirroring the virtual representation into the real world, through some intervention in the real world.

Back to Table of Contents.


Virtual Twin

Use digital twin instead. See virtual.

Back to Table of Contents.