Ophidia
Ophidia is a CMCC Foundation research effort addressing Big Data challenges for eScience. The Ophidia framework represents an open source solution for the analysis of scientific multi-dimensional data, joining HPC paradigms and Big Data approaches. It provides an environment targeting High Performance Data Analytics (HPDA) through parallel and in-memory data processing, data-driven task scheduling and server-side analysis. The framework exploits an array-based storage model, leveraging the datacube abstraction from OLAP systems, and a hierarchical storage organisation to partition and distribute large multi-dimensional scientific datasets over multiple nodes. Ophidia is primarily used in the climate change domain, although it has also been successfully exploited in other scientific domains.
Software license: GPLv3.
Installation
The framework is composed by different software components. The source code for the various components is available on GitHub.
The installation guide is available in the documentation.
Ophidia can also be installed through the Spack package manager.
For the client side, Ophidia also provides the Python bindings, called PyOphidia. To install PyOphidia:
pip install pyophidia
or to install in a Conda environment:
conda install -c conda-forge pyophidia
The PyOphidia documentation (Installation, usage, examples and tutorial) can be found here.
Usage
Ophidia provides features for data management and analysis, such as:
data reduction and subsetting
data intercomparison
array processing
time series analysis
statistical and mathematical operations
data manipulation and transformation
interactive data exploration
The user guide documents all the available Ophidia features.