Online scientific data curation, publication, and archiving
16 December 2002
Jim Gray, Alexander S. Szalay, Ani R. Thakar, Christopher Stoughton, Jan vandenBerg
Abstract
Science projects are data publishers. The scale and complexity of current and future science data change the nature of the publication process. Publication is becoming a major project component. At a minimum, a project must preserve the ephemeral data it gathers. Derived data can be reconstructed from metadata, but metadata is ephemeral. Longer term, a project should expect some archive to preserve the data. We observe that published scientific data needs to be available forever; this gives rise to the data pyramid of versions and to data inflation, where the derived data volumes explode. As an example, this article describes the Sloan Digital Sky Survey (SDSS) strategies for data publication, data access, curation, and preservation.
© (2002) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Jim Gray, Alexander S. Szalay, Ani R. Thakar, Christopher Stoughton, and Jan vandenBerg "Online scientific data curation, publication, and archiving", Proc. SPIE 4846, Virtual Observatories, (16 December 2002); https://doi.org/10.1117/12.461524
KEYWORDS
Data archive systems, Databases, Data storage, Observatories, Astronomy, Data processing, Internet