Introduction to Zyraο
Jump to:
Kid Version | High School Version | College Version | White Paper Version
Kid Versionο
Imagine you have a big box of LEGO bricks mixed together β some from space sets, some from castles, some from race cars.
Zyra is like a magical robot helper that:
Finds the bricks you want (getting data from the internet or your computer)
Puts them in order (sorting and cleaning the data)
Builds something amazing (turning the data into pictures, videos, or maps you can show to friends).
It makes science data less messy and more fun to look at.
High School Versionο
Zyra is a Python tool that:
Collects data from many sources like websites, cloud storage, and special science file formats.
Processes it so itβs easier to work with (cutting, reshaping, converting formats).
Visualizes it in charts, maps, and animations.
Think of it like a 3-step factory:
Input: Raw data from the web, satellites, or experiments.
Processing: Filtering, analyzing, or reformatting.
Output: Graphs, weather maps, or animated videos you can share.
Itβs modular β you can swap out any step for your own custom tool.
College Versionο
Zyra is an open-source, modular Python framework for reproducible scientific data workflows.
It organizes work into four layers:
Acquisition Layer β Connects to FTP, HTTP/S, S3, and local sources; supports GRIB, NetCDF, GeoTIFF, and streaming video.
Processing Layer β Extracts subsets, applies transformations, and converts between scientific formats. Includes tools like
VideoProcessor
andGRIBDataProcessor
.Visualization Layer β Uses Matplotlib and Basemap to produce static plots, animations, and composites with consistent color maps and overlays.
Utilities Layer β Handles credentials, date parsing, file management, and small shared helpers.
The system is designed for flexibility, reproducibility, and interoperability, making it suitable for research, teaching, and operational pipelines.
White Paper Versionο
Abstract:
Zyra is a composable Python framework for end-to-end scientific data workflows, enabling acquisition, transformation, and visualization across diverse environmental and geospatial datasets. It is designed to address reproducibility, modularity, and interoperability challenges in modern data science.
Architecture:
Acquisition Managers implement standardized connect/list/fetch/upload APIs for heterogeneous data sources (e.g.,
FTPManager
,HTTPManager
,S3Manager
).Processing Managers support domain-specific operations, including video encoding/decoding (FFMPEG), GRIB parsing, NetCDF extraction, and geospatial transformations.
Visualization Managers integrate with Matplotlib and Basemap to generate consistent, publication-quality graphics, with support for packaged basemaps and overlays.
Utility Managers provide cross-cutting capabilities for credential handling, temporal range calculations, file path operations, and metadata management.
Supported Formats & Protocols: GRIB2, NetCDF, GeoTIFF, MP4, PNG, JPEG; FTP, HTTP/S, AWS S3, local filesystem.
Use Cases: Operational forecasting pipelines, climate research, geospatial analysis, educational demonstrations, and public communication products.