∞DL Traceability of Derived Data Objects to Raw Data
∞DL provides a simple mechanism for linking data objects to the sequence of processing and
analysis steps that produced them. Each DataObject may have a standard attribute SOURCE, which
contains a list of full IDL paths to the addressable subranges of data used as the source for this
(derived) data object. Raw data objects are distinguished by the fact that they have no source
data objects.
Following the SOURCE links, ∞DL can build and visualize networks of data objects reflecting the
sequential process of stepwise data reduction. Using a global IDL addressing mechanism,
such networks can connect data across laboratory and institution boundaries.
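The SOURCE mechanism described above can be illustrated with a minimal sketch. It is not the ∞DL
implementation: the DataObject class, the "idl://" path scheme, the "#start:end" subrange suffix,
and the function names are all hypothetical, chosen only to show how following SOURCE links yields
a network that ends at raw data objects.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class DataObject:
    """Hypothetical stand-in for an ∞DL DataObject (not the real class)."""
    path: str
    # SOURCE attribute: full paths to the subranges this object was derived from.
    source: list[str] = field(default_factory=list)

    @property
    def is_raw(self) -> bool:
        # Raw data objects are the ones that have no source data objects.
        return not self.source


def object_path(source_path: str) -> str:
    """Drop the assumed '#start:end' subrange suffix, keeping the object path."""
    return source_path.split("#", 1)[0]


def trace_edges(start: str, library: dict[str, DataObject]) -> list[tuple[str, str]]:
    """Follow SOURCE links from `start`; return (derived, source) edges."""
    edges: list[tuple[str, str]] = []
    seen: set[str] = set()
    stack = [start]
    while stack:
        current = stack.pop()
        if current in seen:
            continue  # each object is expanded only once
        seen.add(current)
        for src in library[current].source:
            parent = object_path(src)
            edges.append((current, parent))
            stack.append(parent)
    return edges


def raw_ancestors(start: str, library: dict[str, DataObject]) -> list[str]:
    """Return the sorted paths of all raw data objects that `start` depends on."""
    nodes = {start} | {parent for _, parent in trace_edges(start, library)}
    return sorted(n for n in nodes if library[n].is_raw)


# Hypothetical paths spanning three laboratories.
RAW_A = "idl://lab-a/scan1"
RAW_B = "idl://lab-b/scan2"
FILTERED = "idl://lab-a/filtered"
MERGED = "idl://lab-c/merged"

library = {
    RAW_A: DataObject(RAW_A),
    RAW_B: DataObject(RAW_B),
    FILTERED: DataObject(FILTERED, source=[RAW_A + "#0:1000"]),
    MERGED: DataObject(MERGED, source=[FILTERED + "#0:500", RAW_B + "#100:600"]),
}

if __name__ == "__main__":
    for derived, parent in trace_edges(MERGED, library):
        print(f"{derived} <- {parent}")
    print("raw data:", raw_ancestors(MERGED, library))
```

In this sketch the merged object in one laboratory traces back to raw scans held by two others,
which is the cross-institution case mentioned above. A real system would resolve each path through
the global addressing mechanism rather than an in-memory dictionary.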
This opens an opportunity for an entirely new scientific publishing paradigm. Because all data in
∞DL can be visualized with ∞DL Viewers, a toolbox can be built on top of the ∞DL framework that
links final reports to visual representations of every step of data acquisition, processing, and
analysis, obviating the need to generate data representations specifically for a particular
manuscript in a particular journal.
Metaphorically speaking, “the data never leaves the house”. Any electronic publication in the world
simply uses data object links provided by ∞DL, which ∞DL converts to 2D or 3D media objects
on demand.
Synopsis:
If the global research process is viewed as a massive production of digital data objects, a
necessary and objective process driving scientific breakthroughs and the progress of human society,
then the current system of scientific knowledge management is woefully inadequate: it relies
heavily on highly selective publications of data interpretations and offers severely reduced access
to the underlying data, especially the raw data.
This system should not be replaced, but it needs to be supported and augmented by a parallel and
independent data library system that exposes as many digital data objects as possible, linked into
the context of the global research process.
The ∞DL model described here offers a holistic, integrated approach to creating a global data
library from the bottom up. The general contract of ∞DL is that, in exchange for customers’
willingness to format or re-format their data objects in compliance with the ∞DL format, the
Infinite Data Library will provide an unprecedented level of data integrity, interpretability, and
ease of data sharing, data mining, and data audit.
This would be impossible without deliberately reducing the variety of elementary data formats.
The foundational belief is that a limited set of archetypical formats is sufficient for
building structured data objects of any practical complexity, in the same way that 26 letters are
sufficient to describe the complexity of human thought and feeling in the English language.
The author hopes to attract like-minded “concerned citizens” of the research community,
data enthusiasts, and talented people of all kinds to collaborate on the implementation of ∞DL.