3 min read

Super Cloud Library Enhances Cloud Process Studies

PROJECT

Super Cloud Library

KEY POINTS

The Super Cloud Library—a big data analysis and visualization tool for Earth science applications—has been infused into the Data Analytics and Storage System (DASS) at the NASA Center for Climate Simulation and is producing results over twenty times faster than previously deployed manual processes.

A photo of a deep convective cloud (top) and a diagram illustrating its anatomy (bottom). The color contours of the cross section are observed hydrometeor sizes. (Image credit: NASA/GSFC Conceptual Image Lab)

Today’s high-performance computers, consisting of hundreds of thousands of processors, have enabled ultra-high-resolution, long-term Earth science simulations, such as those used to study cloud formation. The output files of these simulations can be huge, exceeding 150 terabytes. Not only are such large datasets hard to distribute, but they are also difficult to analyze on a desktop computer. A NASA team has developed a way for users to gain meaningful insight into these voluminous datasets without downloading them locally.

Researchers at Goddard Space Flight Center (GSFC) have developed the Super Cloud Library (SCL), a big data analysis and visualization tool for use with high-resolution cloud-resolving models (CRMs). NASA recently infused the SCL into the Data Analytics and Storage System (DASS) at the NASA Center for Climate Simulation (NCCS), where it has enhanced CRM database management, distribution, visualization, subsetting, and evaluation.

The SCL architecture is built upon a Hadoop framework, which employs the Hadoop Distributed File System (HDFS), a stable, distributed, scalable, and portable file system. Hadoop enables users to compute and visualize a range of standard and non-standard statistics. Within the Hadoop framework, a CRM’s diagnostic capabilities are further enhanced with Spark, a big data processing engine that accelerates the Hadoop MapReduce process by roughly 100 times.
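As a rough illustration of this kind of Spark-based diagnostic, the PySpark sketch below computes simple domain-wide statistics over CRM output stored in HDFS. The HDFS path and the column names (grid indexes plus a rain mixing ratio field) are illustrative assumptions, not the SCL’s actual schema.

```python
# A minimal sketch of a Spark diagnostic job run inside a Hadoop
# framework; paths and column names are hypothetical.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("scl-statistics-sketch").getOrCreate()

# Read indexed CSV rows (transformed from NetCDF) stored in HDFS.
df = (spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv("hdfs:///scl/gce_run_001/*.csv"))  # hypothetical path

# Compute simple domain-wide statistics. Spark keeps intermediate
# results in memory, which is what makes it much faster than a
# disk-bound MapReduce pipeline for iterative diagnostics.
stats = df.agg(
    F.mean("rain_mixing_ratio").alias("mean_rain"),
    F.max("rain_mixing_ratio").alias("max_rain"),
)
stats.show()

spark.stop()
```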

An example simulation of a rain event, with updraft shown in red and rain amount in blue.

The SCL’s HDFS-based data model allows data to be transformed from Network Common Data Form (NetCDF) to Comma-Separated Values (CSV) format with indexes. Consequently, the data can be stored in HDFS and processed straightforwardly by Hadoop-based tools for subsetting and diagnosis. The SCL’s Interactive Data Language (IDL) and Python tools permit users to visualize CRM-simulated cloud properties in two and three dimensions and to diagnose HDFS-resident data. Moreover, the SCL’s concurrent Hadoop reader can read data from HDFS more than 20 times faster than sequential reading. Finally, the SCL’s dynamic Hadoop reader can fetch data residing on the parallel file system and make them ready for MapReduce applications, including subsetting and diagnosis.
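To make the NetCDF-to-CSV transformation concrete, here is a minimal Python sketch that flattens one 3D model field into indexed CSV rows. The variable name qr (rain mixing ratio), the file names, and the (z, y, x) layout are assumptions for illustration, not the SCL’s actual conversion code.

```python
# A minimal sketch of converting a NetCDF field to indexed CSV rows;
# variable names and file layout are hypothetical.
import csv
import netCDF4

with netCDF4.Dataset("gce_run_001.nc") as src, \
        open("gce_run_001.csv", "w", newline="") as dst:
    writer = csv.writer(dst)
    writer.writerow(["z", "y", "x", "qr"])  # grid indexes + field value
    qr = src.variables["qr"][:]             # assumed shape: (nz, ny, nx)
    nz, ny, nx = qr.shape
    for k in range(nz):
        for j in range(ny):
            for i in range(nx):
                # Each row carries its own grid indexes, so Hadoop-based
                # tools can subset by spatial range without touching NetCDF.
                writer.writerow([k, j, i, float(qr[k, j, i])])
```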

Examples of dynamical subsetting of the convective core and the associated rain, snow, and cloud water mixing ratios over a 3D box of 20 x 20 x 20 km (the full 3D domain contains a total of two billion grid points).
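The box subsetting shown in the figure above could, in principle, be expressed as a simple Spark filter over the indexed rows. In this sketch, the 250 m grid spacing (so 20 km spans 80 points), the index ranges, the column names, and the paths are all assumptions for illustration.

```python
# A rough sketch of spatial box subsetting over indexed CSV rows in
# HDFS; grid spacing, index ranges, and paths are hypothetical.
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("scl-subset-sketch").getOrCreate()

df = (spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv("hdfs:///scl/gce_run_001/*.csv"))  # hypothetical path

# With an assumed 250 m grid spacing, a 20 x 20 x 20 km box spans
# 80 points along each axis.
box = df.filter(
    (df.x.between(400, 479)) &
    (df.y.between(400, 479)) &
    (df.z.between(0, 79))
)

# Write the subset back to HDFS, where a portal could serve it for
# download instead of shipping the full multi-terabyte dataset.
box.write.mode("overwrite").option("header", "true").csv(
    "hdfs:///scl/subsets/box_run_001")

spark.stop()
```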

The SCL was built on the NCCS Discover system, which directly stores various CRM simulations, including those produced by the NASA-Unified Weather Research and Forecasting (NU-WRF) model and the Goddard Cumulus Ensemble (GCE) model. The team also developed a web portal for the SCL that allows users to subset, diagnose, visualize, save, and download the subset data. The SCL lets users conduct large-scale, on-demand tasks automatically, without downloading voluminous CRM datasets to a local computer, making CRM output more usable by the science community. The SCL is at Technology Readiness Level 5 (system prototype in an operational setting). As principal investigator Dr. Wei-Kuo Tao notes, “The Super Cloud Library implemented within the DASS has enabled climate researchers to analyze extremely large volumes of high-resolution model data without direct access to high-end computing.”

SPONSORING ORGANIZATION

Earth Science Division’s Advanced Information Systems Technology (AIST) Program

PROJECT LEAD

Dr. Wei-Kuo Tao, NASA GSFC