Input data
==========
Input data for the IMI is stored on AWS and automatically accessed when running the IMI on AWS. When
running the IMI on a local compute cluster, you will need to download the required input data.
Input data for GEOS-Chem
------------------------
The forward model in the IMI is the `GEOS-Chem chemical transport model `_.
The input files needed to run GEOS-Chem within the IMI can be accessed from the `GEOS-Chem Nested Input Data
`_ portal (aka `s3://gcgrid `_).
The GEOS-Chem Input Data portal is part of the `AWS Open Data Sponsorship Program `_.
As a result, **the data is completely free to use**. You will NOT incur any data egress fees when downloading data from the
`s3://gcgrid `_ bucket.
Input data for GEOS-Chem include:
- `Meteorological fields `_ (GEOS-FP or MERRA-2)
- `Emissions inventories `_
- Chemistry input data (e.g. archived OH fields)
- `Initial conditions for starting GEOS-Chem simulations `_
To automatically download these data for your inversion domain and time period, we recommend setting the ``DryRun`` options
in the :doc:`IMI configuration file ` to true. This will execute a `GEOS-chem dry-run simulation `_
to identify and download the necessary giles.
You may also download these files manually using `AWS CLI `_. See `this tutorial `_ for instructions,
swapping out data paths as necessary.
Alternatively, you can access the data via `AWS S3 Explorer `_.
Satellite data
--------------
The IMI currently supports the following datasets for use in the inversion:
- ``TROPOMI``: The operational `TROPOMI `_ retrieval product developed by the SRON Netherlands Institute for Space Research.
- ``blendedTROPOMI``: The Blended TROPOMI+GOSAT retrieval product developed by `Balasus et al. (2023) `_ to mitigate retrieval artifacts in the operational product.
Boundary conditions
-------------------
The IMI uses gridded 3D boundary conditions saved out from global 2°x2.5° GEOS-Chem simulation to define the inflow/outflow of methane at the edges of a regional simulation domain, essential for accurate regional inversions.
These files are further smoothed using either TROPOMI or the blended TROPOMI+GOSAT retrieval products to remove any systematic biases. These boundary condition files archive hourly species concentrations
and may also be used as initial conditions for the forward model simulations.
The IMI boundary condition files are stored on AWS in `s3://imi-boundary-conditions `_ and may be downloaded using AWS CLI as described above.