Star 历史趋势
数据来源: GitHub API · 生成自 Stargazers.cn
README.md

NVIDIA Earth2Studio

python version license coverage mypy format ruff uv

Earth2Studio is a Python-based package designed to get users up and running with AI Earth system models fast. Our mission is to enable everyone to build, research and explore AI driven weather and climate science.

- Earth2Studio Documentation -

Install | User-Guide | Examples | API

Earth2Studio README hero

Quick start

Running AI weather prediction can be done with just a few lines of code.

  • For detailed installation steps, including model-specific installations, see the install guide.
  • See the examples gallery providing different inference workflow samples.
  • Swap out data sources or models depending on your use case!

Tutorial

Earth2Studio Tutorial

Agent-assisted setup

Automate setup with your preferred coding agent using NVIDIA Earth2Studio skills. Install the Earth2Studio skill set, then ask your favorite agent (Claude, Codex, OpenCode, etc) to recommend a model, configure an environment, or run a first deterministic forecast. Find more Earth2Studio skills in the NVIDIA Skills catalog.

Earth2Studio agentic setup

npx skills add NVIDIA/skills --skill earth2studio-install npx skills add NVIDIA/skills --skill earth2studio-discover npx skills add NVIDIA/skills --skill earth2studio-data-fetch npx skills add NVIDIA/skills --skill earth2studio-deterministic-forecast

Example agent prompts:

Use the Earth2Studio discover skill to recommend a starter forecast workflow. Use the Earth2Studio install skill to set up my environment for FourCastNet3 inference. Create a script to fetch ERA5 surface winds data for March 2024. Create a deterministic forecast workflow with GFS, FourCastNet3, and a Zarr output store.

NVIDIA FourCastNet3

from earth2studio.models.px import FCN3 from earth2studio.data import GFS from earth2studio.io import ZarrBackend from earth2studio.run import deterministic as run model = FCN3.load_model(FCN3.load_default_package()) data = GFS() io = ZarrBackend("outputs/fcn3_forecast.zarr") run(["2025-01-01T00:00:00"], 10, model, data, io)

ECMWF AIFS

from earth2studio.models.px import AIFS from earth2studio.data import IFS from earth2studio.io import ZarrBackend from earth2studio.run import deterministic as run model = AIFS.load_model(AIFS.load_default_package()) data = IFS() io = ZarrBackend("outputs/aifs_forecast.zarr") run(["2025-01-01T00:00:00"], 10, model, data, io)

Google Graphcast

from earth2studio.models.px import GraphCastOperational from earth2studio.data import GFS from earth2studio.io import ZarrBackend from earth2studio.run import deterministic as run package = GraphCastOperational.load_default_package() model = GraphCastOperational.load_model(package) data = GFS() io = ZarrBackend("outputs/graphcast_operational_forecast.zarr") run(["2025-01-01T00:00:00"], 4, model, data, io)

[!IMPORTANT] Earth2Studio is an interface to third‑party models, checkpoints, and datasets. Licenses for these assets are owned by their providers. Ensure you have the rights to download, use, and (if applicable) redistribute each model and dataset. Links to the original license and source are often provided in the API docs for each model/data source.

Latest News

[!NOTE] As of version 0.14.0, Earth2Studio TOML default installs now target CUDA 13.

  • Himawari AHI, Himawari-8/9 AHI ISatSS L2 Full Disk satellite data source is now available for geostationary satellite observations over the Western Pacific.
  • GHCN-Daily, Global Historical Climatology Network daily station observation data frame source providing access to global surface observations.
  • Orbit-2, precipitation downscaling diagnostic model added for high-resolution precipitation estimates.
  • Conventional Observation Sources, new data sources for in-situ observations including NNJA conventional obs and real-time GDAS conventional obs.

For a complete list of latest features and improvements see the changelog.

Overview

Earth2Studio is an AI inference pipeline toolkit focused on weather and climate applications that is designed to ride on top of different AI frameworks, model architectures, data sources and SciML tooling while providing a unified API.

Earth2Studio model zoo Earth2Studio data sources

The composability of the different core components in Earth2Studio easily allows the development and deployment of increasingly complex pipelines that may chain multiple data sources, AI models and other modules together.

Earth2Studio composable pipelines

The unified ecosystem of Earth2Studio provides users the opportunity to rapidly swap out components for alternatives. In addition to the largest model zoo of weather/climate AI models, Earth2Studio is packed with useful functionality such as optimized data access to cloud data stores, statistical operations and more to accelerate your pipelines.

Earth-2 Open Models

Access state of the art Nvidia open models for climate and weather: Earth-2 Open Models. For training recipes for these models, see the PhysicsNeMo repository.

Features

Earth2Studio package focuses on supplying you the tools to build your own workflows, pipelines, APIs, or packages using modular components including:

Prognostic Models

Prognostic models in Earth2Studio perform time integration, taking atmospheric fields at a specific time and auto-regressively predicting the same fields into the future (typically 6 hours per step), enabling both single time-step predictions and extended time-series forecasting.

Earth2Studio maintains the largest collection of pre-trained state-of-the-art AI weather/climate models ranging from global forecast models to regional specialized models, covering various resolutions, architectures, and forecasting capabilities to suit different computational and accuracy requirements.

Available models include but are not limited to:

ModelResolutionArchitectureTime StepCoverage
GraphCast Small1.0°Graph Neural Network6hGlobal
GraphCast Operational0.25°Graph Neural Network6hGlobal
Pangu 3hr0.25°Transformer3hGlobal
Pangu 6hr0.25°Transformer6hGlobal
Pangu 24hr0.25°Transformer24hGlobal
Aurora0.25°Transformer6hGlobal
FuXi0.25°Transformer6hGlobal
AIFS0.25°Transformer6hGlobal
AIFS Ensemble0.25°Transformer Ensemble6hGlobal
StormCast3kmDiffusion + Regression1hRegional (US)
SFNO0.25°Neural Operator6hGlobal
DLESyM0.25°Convolutional6hGlobal

For a complete list, see the prognostic model API docs.

Diagnostic Models

Diagnostic models in Earth2Studio perform time-independent transformations, typically taking geospatial fields at a specific time and predicting new derived quantities without performing time integration enabling users to build pipelines to predict specific quantities of interest that may not be provided by forecasting models.

Earth2Studio contains a growing collection of specialized diagnostic models for various phenomena including precipitation prediction, tropical cyclone tracking, solar radiation estimation, wind gust forecasting, and more.

Available diagnostics include but are not limited to:

ModelResolutionArchitectureCoverageOutput
PrecipitationAFNO0.25°Neural OperatorGlobalTotal precipitation
SolarRadiationAFNO1H0.25°Neural OperatorGlobalSurface solar radiation
WindgustAFNO0.25°AFNOGlobalMaximum wind gust
TCTrackerVitart0.25°AlgorithmicGlobalTC tracks & properties
CBottleInfill100kmDiffusionGlobalGlobal climate sample
CBottleSR5kmDiffusionRegional / GlobalHigh-res climate
CorrDiffVariableDiffusionRegionalFine-scale weather
CorrDiffTaiwan2kmDiffusionRegional (Taiwan)Taiwan fine-scale weather

For a complete list, see the diagnostic model API docs.

Datasources

Data sources in Earth2Studio provide a standardized API for accessing weather and climate datasets from various providers (numerical models, data assimilation results, and AI-generated data), enabling seamless integration of initial conditions for model inference and validation data for scoring across different data formats and storage systems.

Earth2Studio includes data sources ranging from operational weather models (GFS, HRRR, IFS) and reanalysis datasets (ERA5 via ARCO, CDS) to AI-generated climate data (cBottle) and local file systems. Fetching data is just plain easy, Earth2Studio handles the complicated parts giving you an easy to use Xarray data array of requested data under a shared package wide vocabulary and coordinate system.

Available data sources include but are not limited to:

Data SourceTypeResolutionCoverageData Format
GFSOperational0.25°GlobalGRIB2
GFS_FXForecast0.25°GlobalGRIB2
HRRROperational3kmRegional (US)GRIB2
HRRR_FXForecast3kmRegional (US)GRIB2
ARCO ERA5Reanalysis0.25°GlobalZarr
CDSReanalysis0.25°GlobalNetCDF
IFSOperational0.25°GlobalGRIB2
NCAR_ERA5Reanalysis0.25°GlobalNetCDF
WeatherBench2Reanalysis0.25°GlobalZarr
GEFS_FXEnsemble Forecast0.25°GlobalGRIB2
ISDObservationalPointRegional (US)CSV
MRMSReanalysis1kmRegional (US)GRIB2

For a complete list, see the data source API docs.

IO Backends

IO backends in Earth2Studio provides a standardized interface for writing and storing pipeline outputs across different file formats and storage systems enabling users to store inference outputs for later processing.

Earth2Studio includes IO backends ranging from traditional scientific formats (NetCDF) and modern cloud-optimized formats (Zarr) to in-memory storage backends.

Available IO backends include:

IO BackendFormatFeaturesLocation
ZarrBackendZarrCompression, ChunkingIn-Memory/Local
AsyncZarrBackendZarrAsync writes, Parallel I/OIn-Memory/Local/Remote
NetCDF4BackendNetCDF4CF-compliant, MetadataIn-Memory/Local
XarrayBackendXarray DatasetRich metadata, Analysis-readyIn-Memory
KVBackendKey-ValueFast Temporary AccessIn-Memory

For a complete list, see the IO API docs.

Perturbation Methods

Perturbation methods in Earth2Studio provide a standardized interface for adding noise to data arrays, typically enabling the creation of ensembling forecast pipelines that capture uncertainty in weather and climate predictions.

Available perturbations include but are not limited to:

Perturbation MethodTypeSpatial CorrelationTemporal Correlation
GaussianNoiseNoneNone
Correlated SphericalGaussianNoiseSphericalAR(1) process
Spherical GaussianNoiseSpherical (Matern)None
BrownNoise2D FourierNone
Bred VectorDynamicalModel-dependentModel-dependent
Hemispheric Centred Bred VectorDynamicalHemisphericModel-dependent

For a complete list, see the perturbations API docs.

Statistics / Metrics

Statistics and metrics in Earth2Studio provide operations typically useful for in-pipeline evaluation of forecast performance across different dimensions (spatial, temporal, ensemble) through various statistical measures including error metrics, correlation coefficients, and ensemble verification statistics.

Available operations include but are not limited to:

StatisticTypeApplication
RMSEError MetricForecast accuracy
ACCCorrelationPattern correlation
CRPSEnsemble MetricProbabilistic skill
Rank HistogramEnsemble MetricEnsemble reliability
Standard DeviationMomentSpread measure
Spread-Skill RatioEnsemble MetricEnsemble calibration

For a complete list, see the statistics API docs.

For a more complete list of features, be sure to view the documentation. Don't see what you need? Great news, extension and customization are at the heart of our design.

Contributors

Check out the contributing document for details about the technical requirements and the user guide for higher level philosophy, structure, and design.

License

Earth2Studio is provided under the Apache License 2.0, refer to the LICENSE file for full license text.

关于 About

Open-source deep-learning framework for exploring, building and deploying AI weather/climate workflows.
aiclimate-sciencedeep-learningweather

语言 Languages

Python98.7%
Shell0.7%
Standard ML0.4%
Dockerfile0.1%
Makefile0.1%

提交活跃度 Commit Activity

代码提交热力图
过去 52 周的开发活跃度
337
Total Commits
峰值: 15次/周
Less
More

核心贡献者 Contributors