Gaia DR4 content - Gaia
Gaia Data Release 4 (Gaia DR4)
The original Gaia DR4 data will become available from the Gaia ESA Archive. The expected content of Gaia DR4, pending certain processing and validation activities, is described here and summarised in this table.
Significant changes compared to previous Gaia data releases
Gaia DR4 is different from previous data releases in several ways:
Number and data volume of tables and other products
Gaia DR4 will contain a large number and diversity of data products. Lower-level data such as the individual observations underlying the source-level catalogue parameters will be released as well. Consequently, the total data volume of Gaia DR4 is approximately 500 TB with the largest fraction associated to the lower-level products. The total data volume of Gaia DR3 is 10 TB.
Content of the main source catalogues
In contrast to previous releases, data will be published for all processed sources, which amount to some 2.7 billion sources in total.
The gaia_source table in Gaia DR4 now contains a subset of about 2 billion sources considered to have both high-quality astrometry and photometry. With respect to previous data releases, the parameters in this table are now consolidated from several different processing modules such that the best-suited model has been applied for a given source (e.g. a binary-star model instead of the default single-star model).
The gaia_source table is complemented by the following four tables that contain the results from the dedicated processing pipelines for all 2.7 billion sources:
- all_source_astrometry: Results for all sources from the core astrometry pipeline
- all_source_photometry: Results for all sources from the core photometry pipeline
- all_source_rvs: Results for all sources from the core spectroscopy pipeline
- all_source_flags: Compiles a selection of flags for each source published in the Gaia catalogue.
Summary of expected tables in the Gaia DR4 archive
The columns of the table below have the following meaning:
- Id: Identifier that will eventually map to the corresponding section in the data model documentation.
- Data model chapter: Indicates the name of the data model chapter. The Gaia DR4 draft data model will be published in advance of the data release.
- Gaia Archive Table Name: Indicates the table name in the ESA Gaia Archive.
- Description: Description of the data set.
- Category: Helps to distinguish between the different data sets.
- New in DR4: Inidcates if the type of data was already published with Gaia DR3 or not. All data published with Gaia DR4 will be new, but some data product types have not been published before. These are indicated with "NEW". "NEW in Archive" marks products that were published elsewhere, e.g. on Cosmos, in previous releases .
- Data level: Indicative data level of the product. Level 1 (DL1) corresponds to image data, i.e. CCD pixel/sample values. Level 2 (DL2) corresponds to epoch data, i.e. time-resolved processing results usually per CCD transit. Level 3 (DL3) corresponds to source-level data with summary results per celestial source. Level 4 (DL4) corresponds to derived quantities that incorporate astrophysical models and priors.
- Label: Helps to distinguish between the different data sets.
Background of the data
Gaia DR4 data is based on data collected between 25 July 2014 (10:30 UTC) and 20 January 2020 (22:00 UTC) spanning a period of 66 months of data collection. As a comparison, Gaia DR3 was based on 34 months of data, Gaia DR2 was based on 22 months of data and Gaia DR1 was based on observations collected in the first 14 months of Gaia's routine operational phase.
The reference epoch for Gaia DR4 is J2017.5. Remember that the reference epoch is different for each Gaia data release, and that the reference epoch for Gaia DR3 was J2016.0, for Gaia DR2 was J2015.5 and for Gaia DR1 was J2015.0.
Positions and proper motions are referred to the ICRS, to which the optical reference frame defined by Gaia DR4 (Gaia-CRF4) is aligned. The time coordinate for Gaia DR4 results is the barycentric coordinate time (TCB).
Gaia Source Identifiers
Sources in the Gaia Catalogue are all identified through the Gaia Source Identifier, i.e., the source_id field in the various tables in the Gaia Archive. The construction of the source identifiers is explained in the archive documentation (for Gaia DR1, see the data model section). In particular, the source_id number contains rough information about the source position on the sky.
As explained in previous announcements, there are various reasons why the identifier of a specific source may change or disappear when going from the Gaia DR1 to the Gaia DR2 source list and on to the Gaia DR3 and Gaia DR4 source list. Users of Gaia data should thus be aware that the source list for Gaia DR4 should be treated as independent from Gaia DR3, Gaia DR2 and from Gaia DR1. With each new Gaia data release, the source list is becoming progressively more stable.
The Gaia source names ('designations') for Gaia DR4 are all constructed as follows:
Gaia DR4 yyy....yy
Accessing the data
The Gaia Archive will be the main point of access to the Gaia DR4 data, but the data (or a subset thereof) will also be served from our partner data centres (CDS, ASDC, ARI, AIP and Flatiron). Data can be extracted from the Gaia Archive by performing ADQL queries and downloading the corresponding results tables.
The total size will be about 500TB, of which a large fraction is the level 2 data.
Passbands
Gaia DR4 passbands will be published to document how the Gaia DR4 magnitudes are computed, and to allow to reproduce the analysis of the data.
Documentation
Data release documentation is provided along with each data release in the form of a downloadable PDF and a webpage. Please visit the Gaia Archive to access Gaia documentation, and make sure to go through all relevant information given from the documentation overview page.
Known issues with Gaia Data Release 4
A list of issues with the data or info given along with the data will be provided. In case you find an issue with the data yourself, please contact the Gaia Helpdesk to let us know.
Data model
The Gaia DR4 data model describes all tables together with the names and contents of the columns inside each table. This information will become available from the Gaia Archive along with the data release documentation.
In advance of Gaia DR4, we will publish a draft version of the data model here.
Data Release papers
Along with the Gaia DR4 data release documentation, several data processing papers will be published describing the specifics of the data processing and validation performed by the different coordination units in the Gaia Data Processing and Analysis Consortium (DPAC). There will also be some papers on the performance verification of Gaia, providing basic demonstrations of the scientific potential of the Gaia DR4 catalogue. Some pre-release processing papers are already available.
- Removed a total of (417) style text-align:center;
- Removed a total of (51) style text-align:justify;
- Removed a total of (6) align=middle;
- Removed a total of (1) align=center.
- Removed a total of (1) border attribute.
- Removed a total of (1) cellpadding attribute.
- Removed a total of (1) cellspacing attribute.