Cloud Data Access
Many of IRSA's holdings are available in AWS S3 cloud storage buckets as part of the NASA Open Source Science Initiative and the AWS Open Data Program. The files can be either downloaded or accessed in place. Credentials are not required. There are many ways to access the buckets, including:
- Python: See our Notebook Tutorials for python demonstrations.
- Command line: See the AWS Command Line Interface documentation.
- Basic example (bucket and prefix options are described below):
aws s3 ls --no-sign-request "${bucket_name}/${prefix}/"
- Basic example (bucket and prefix options are described below):
- Bulk downloads: Download scripts for catalogs can be found through the links at Catalog Bulk Download.
This page contains a listing of the IRSA-curated datasets in S3 and the information needed for access. Under each mission heading below, the bucket information is given followed by a subsection for each dataset. Each image set and catalog is then listed along with its base S3 prefix (path to the base directory, relative to the bucket).
Images: Prior knowledge of specific files is not required for access, but is sometimes convenient. If you have an IRSA URL or filename for an image or ancillary file, you can construct the S3 key (full path to the file, relative to the bucket) by substituting the S3 prefix listed below for the first part of the URL or name. Split the URL or filename after the data product ID, which is the final level of the prefix. For example, the AllWISE intensity image with the URL (data product ID in bold):
https://irsa.ipac.caltech.edu/ibe/data/wise/allwise/p3am_cdd/09/0904/0904m213_ac51/0904m213_ac51-w1-int-3.fits
has the S3 key:
wise/allwise/images/p3am_cdd/09/0904/0904m213_ac51/0904m213_ac51-w1-int-3.fits
URLs and filenames can be obtained in various ways through IRSA's image services. In addition, some of the image sets below include a link to IBE documentation which describes the naming syntax and can be used to manually construct URLs.
Catalogs: Catalog data is in Apache Parquet format, partitioned by HEALPix pixel index at the given order (k). The name of the base directory containing the Parquet dataset is listed below for convenience, relative to the prefix.
Requesting additional datasets: If there are other IRSA holdings that you would like us to make available in the cloud, please contact IRSA's Help Desk.
Spitzer
Bucket name: nasa-irsa-spitzer
Bucket region: us-west-2
Spitzer Enhanced Imaging Products (SEIP)
AWS Open Data Registry page: https://registry.opendata.aws/spitzer-seip
Data products:
Super Mosaics (FITS)
- S3 prefix: spitzer/seip/seip_science/images
WISE
Bucket name: nasa-irsa-wise
Bucket region: us-west-2
AllWISE
AWS Open Data Registry page: https://registry.opendata.aws/wise-allwise/
Data products:
Images Atlas (FITS)
- S3 prefix: wise/allwise/images/p3am_cdd
- IBE docs: https://irsa.ipac.caltech.edu/ibe/docs/wise/allwise/p3am_cdd/
Source Catalog (Parquet)
- S3 prefix: wise/allwise/catalogs/p3as_psd/healpix_k5
- Parquet name: wise-allwise.parquet
All-Sky
AWS Open Data Registry page: https://registry.opendata.aws/wise-allsky/
Data products:
Single-exposure Image Sets (FITS)
- S3 prefix: wise/allsky/images/4band_p1bm_frm
- IBE docs: https://irsa.ipac.caltech.edu/ibe/docs/wise/allsky/4band_p1bm_frm/
3-Band Cryo
AWS Open Data Registry page: https://registry.opendata.aws/wise-cryo-3band/
Data products:
Single-exposure Image Sets (FITS)
- S3 prefix: wise/cryo-3band/images/3band_p1bm_frm
- IBE docs: https://irsa.ipac.caltech.edu/ibe/docs/wise/cryo_3band/3band_p1bm_frm/
Post-Cryo
AWS Open Data Registry page: https://registry.opendata.aws/wise-postcryo/
Data products:
Single-exposure Image Sets (FITS)
- S3 prefix: wise/postcryo/images/2band_p1bm_frm
- IBE docs: https://irsa.ipac.caltech.edu/ibe/docs/wise/postcryo/2band_p1bm_frm/
NEOWISE Reactivation
Bucket name: nasa-irsa-wise
Bucket region: us-west-2
NEOWISE-R
AWS Open Data Registry page: https://registry.opendata.aws/wise-neowiser/
Data products:
Single-exposure Image Sets (FITS)
- S3 prefix: wise/neowiser/images/p1bm_frm
- IBE docs: https://irsa.ipac.caltech.edu/ibe/docs/wise/neowiser/p1bm_frm/
Single-exposure Source Table (Parquet)
Composed of yearly data releases plus an addendum, stored separately.
Replace "{PART}" below with either "yearN" (where N is an integer, 1-11) or "addendum".
- S3 prefix: wise/neowiser/catalogs/p1bs_psd/healpix_k5/{PART}
- Parquet name: neowiser-healpix_k5-{PART}.parquet
unWISE
Bucket name: nasa-irsa-wise
Bucket region: us-west-2
AWS Open Data Registry page: https://registry.opendata.aws/wise-unwise/
Data products:
Time-Domain Catalog (Parquet)
- S3 prefix: unwise/neo7/catalogs/time_domain/healpix_k5
- Parquet name: unwise-neo7-time_domain-healpix_k5.parquet
OpenUniverse 2024 Matched Rubin and Roman Simulations
Bucket name: nasa-irsa-simulations
Bucket region: us-east-1
Preview
AWS Open Data Registry page: https://registry.opendata.aws/openuniverse2024/
Data products:
Roman simulated data products (FITS, Parquet)
- S3 prefix: openuniverse2024/roman/preview
Rubin simulated data products (FITS, Parquet)
- S3 prefix: openuniverse2024/rubin/preview