Mentorship

Each summer, ASTG mentors talented interns through the NASA internship program. The inters are mentored by ASTG members and engage in a variety of software and reasearch and development activities. Here we show some of the interesting projects from recent interns!

Mentorship

2023

eViz: Enhanced visualization utilities for NU-WRF models

WRF output
WRF visualizations with their corresponding YAML configuration files.

Aarav Khanna
Cornell University
Department of Computer Sciences
B.S. Candidate

The ASTG is developing the EViz data visualization tool (eViz) which is used to aid Earth scientists in easily diagnosing outputs from large-scale Earth models and thus assist in the model validation process. Aarav developed EViz functions to support visualization of NU-WRF models (WRF and LIS) and wrote a utility named metadump which facilitates the creation of YAML files that are used to provide specifications for the eViz tool.

ASSERT - A Software Suite for Earth-Systems Regression Testing

ASSERT class family trees
ASSERT class family trees.

Deon Kouatchou
Carnegie Mellon University
School of Computer Science
B.S. Candidate

ASSERT is a centralized, Python-based regression testing toolkit that aims to allow users to build regression testing workflows for numerous Earth System models. Deon helped in the early-stage development of ASSERT, particularly the development of the software infrastructure. This included, among other things, a large set of utilities, logging support, a comprehensive set of unit tests, and thorough documentation.

Processing ASTG Course Evaluation Surveys

Each year, ASTG provides several computer programming courses. At the end of each class, participants submit an evaluation survey which results are contained in an Excel file. It is difficult for us to quickly analyze the results of each survey and extract important information that we can use for presentations. Cindy Chen (a rising Sophomore in high school) has been writing a Python tool that automatically reads Excel files and generate useful reports (in terms of plots, texts, etc.) based on the responses of an evaluation surveys.

2022

Integrating satellite data sources in EViz visualization toolkit

Deon Kouatchou-Ngongang pres
Expanding data sources available in EViz.

Deon Kouatchou-Ngongan
Carnegie Mellon University
School of Computer Science
B.S. Candidate

NASA Earth System models scientists often have the need to compare model results with satellite observational data to evaluate experiments and simulations. eViz provides a quick, accessible, and flexible visualization toolkit for exploring and comparing data. By integrating satellite data sources into EViz, ASTG aims to enable comparison model-observation visualization.

2021

eViz: Exploring Earth System Data Products and diagnosing Earth System Models using a visualization environment

Deepthi Raguhnandan Poster
Streamling the visualization of Earth System model data products.

Deepthi Raghunandan
University of Maryland
Department of Computer Sciences
PhD Candidate

NASA Earth Systems models, such as numerical weather models, produce large amounts of outputs. Scientist need quick access and flexible visualization tools to analyze and validate the data. ASTG has developed and continues to improve a modern Python framework for easy visualizaton (eViz) of Earth System data products.

Machine Learning for Non-parametric Data Assimilation in Hydrological Models

Logan Qualls Poster
The value of remote sensing data in data-driven streamflow models.

Logan Qualls
University of Alabama
Department of Geological Sciences
Masters Candidate

Recent advances in machine learning have led to powerful streamflow models based on the long-short term memory models that have been created using the CAMELS data, an in-situ 30-year record for streamflow that spans accross 600 water basins in the CONUS. This project investigates the integration and use of remote sensing data for training.

2019

Jonathan Frame Poster
Combining process-based models with machine learning.

Machine Learning for Non-parametric Data Assimilation in Hydrological Models

Johnathan Frame
University of Alabama
Department of Geological Sciences
PhD Candidate

We test a method of analyzing hydrologic modeling predictions of soil moisture with a hybrid machine learning (ML) + physically-based modeling approach. This method is capable of non-parametric data assimilation and corrections to model structural error. Dr. Pelissier has developed a parallelized machine learning code for Gaussian Process Regression (GPR) as the ML component of this project. We are applying this code to improve predictability of soil moisture data at 10 FluxNet towers. The GPR shows significant improvement in prediction of out-of-sample soil moisture predictions over calibrated Noah-MP results. To compare the GPR against ‘traditional’ data assimilation techniques we ran the Noah-MP model with an Ensemble Kalman Filter (EnKF). The results show a similar performance improvement between GPR and EnKF when run in sample, but EnKF provides little to no benefit when making predictions after only a few time steps following an observation. Our results show that this hybrid approach continues improving model predictions even without soil moisture state observations.

Publications

Pelissier, C., J. Frame, and G. Nearing. “Combining parametric land surface 889 models with machine learning.” arXiv preprint arXiv:2002.06141 890 (2020).

2018

Quantum Computing and Linears Systems

Chris Culver Poster
Solving Linear Systems with Quantum Computers.

Chris Culver
The George Washington University
Department of Physics
PhD Candidate

The goal of this intersnship was to assess the current abilities of the IBM quantum computer, and to implement and investigate quantum algorithms to solve linear systems. Linear systems are ubiquitous throughout scientific computing and quantum algorithms that provide increased computing power would have a large impact on NASA’s missions. The first phase of the project is to understand the errors and limitations of the current IBM quantum computer. The second phase is to understand, implement, and test quantum linear-system-solvers on the IBM machine. Lastly, we will report the results of the study and the feasibility of solving linear systems and real-world problems on quantum computers today and in the near future.

Using Machine Learning to Model Global Terrestrial Carbon Flux

Donovan_Murphy Poster
Carbon flux monitoring using machine learning.

Donovan Murphy
University of Alabama
Bachelors in Science Candidate

This project evaluates the use of machine learning to meet the challenges of scalability and continuity in carbon flux monitoring and prediction. Today, the measurement of terrestrial gas fluxes is limited in spatial representation, evidenced by the over eight hundred sites capturing carbon information in a globally distributed yet patchy network, FLUXNET. Using the eddy covariance method, these sites compute net ecosystem exchange (NEE), a measure of the vectoring of carbon dioxide through their ecosystems. Though these sites deliver high quality, in situ data, they are inherently restricted to hyper-local observation, limiting accurate inferences toward a finer spatial resolution. Additionally, many ecoregions lack FLUXNET site representation, including the Pacific coast of South America, Central Asia and the Middle East, and the vegetation-rich archipelagos of the Sunda Shelf (see Fig. 1). These absences are based on sites’ geographic distribution and independent of ecosystem type.