We want to hear from you! Take our short OPeNDAP User Survey!
Interoperability
To OPeNDAP interoperability means all things Access related. This means software written in different programing languages that can access OPeNDAP data, and also software that can be used across different platforms and infrastructure, from a laptop, to a HPC, and the commercial cloud.
On this page, you will find many Client APIs and data Servers developed by groups outside of OPeNDAP, and we thank them for contributing to the OPeNDAP effort! If you have written a client, server, client-library or toolkit, let us know! We would love to add your software to this list.
Photo by NASA
QUICK LINKS
Client-libraries
PyDAP
PyDAP is a Python library which gives transparent access to various OPeNDAP servers. Pydap’s client can lazily open remote files from servers that implement both DAP2 and DAP4 Protocols, and can define server-side subsetting for efficient access. Pydap dates back to 2006, and it remains under active development.
Xarray
Xarray is an open source project and Python package within the PANGEO ecosystem that introduces labels in the form of dimensions, coordinates, and attributes on top of raw NumPy-like arrays. Xarray allows for an intuitive, concise, and less error-prone user experience. Pydap can be used as a backend engine for xarray, and therefore Xarray offers a familiar syntax to parallelize Pydap, which can accelerate exploration when defining Constraint Expressions on the URL.
MATLAB
See Import NetCDF Files and OPeNDAP Data and this interesting example: Downloading HYCOM data using Matlab and OPeNDAP.
ArcGIS Pro
See Make OPeNDAP Raster Layer (Multidimension). For and overview of how OPeNDAP fits into ArcGIS, see: An overview of the Multidimension toolbox.
R
There are several ways to read data into R from OPeNDAP servers, modisfast (formerly opendapr) provides one interface and RNetCDF: Interface to ‘NetCDF’ Datasets provides a second interface using the netCDF C clibrary.
Jupyter Notebook: Both the Python/netCDF4 package, PyDAP and Xarray can be used with the jupyter notebook system to read from OPeNDAP servers
Panoply Data Viewer
Panoply plots geo-referenced and other arrays from OPeNDAP servers as well as local netCDF, HDF, GRIB, and other datasets.
Panoply is a cross-platform application that runs on macOS, Windows, Linux and other desktop computers. Panoply requires that your computer has a compatible Java 11 (or later version) JRE or JDK installed.
SNCTOOLS
SNCTOOLS is a Matlab client for local netCDF files and remote OPeNDAP data servers. It has an excellent tutorial: Using SNCTOOLS and OPeNDAP
nctoolbox
nctoolbox is a Matlab toolbox for accessing (read-only) common data model datasets. Under the hood nctoolbox uses NetCDF-Java as the data access layer. This allows nctoolbox to access NetCDF, OPeNDAP and HDF5 datasets using the same API. Thanks to Brian Schlining <brian at mbari.org> at MBARI for providing this!
octave client
Maybe the Octave NetCDF module can access OPeNDAP servers. See the Octave Repository.
GrADS
The Grid Analysis and Display System (GrADS) is an interactive desktop tool that is used for easy access, manipulation, and visualization of earth science data. The format of the data may be either binary, GRIB, NetCDF, or HDF-SDS (Scientific Data Sets). GrADS has been implemented worldwide on a variety of commonly used operating systems and is freely distributed over the Internet. Developed at COLA.
Ferret
Ferret is an interactive computer visualization and analysis environment designed to meet the needs of oceanographers and meteorologists analyzing large and complex gridded data sets. Developed at NOAA/PMEL in Seattle. In addition, PyFerret is available, which is Ferret encapsulated in Python.
IDV
The Integrated Data Viewer (IDV) from Unidata is a Java(TM)-based software framework for analyzing and visualizing geoscience data. Developed by Unidata.
NCO
The netCDF Operators, or NCO, are a suite of programs known as operators. Each operator is a standalone, command line program which is executed at the UNIX shell-level like, e.g., ls or mkdir. The operators take netCDF or HDF4 files as input, then perform a set of operations (e.g., deriving new data, averaging, hyperslabbing, or metadata manipulation) and produce a netCDF file as output. The operators are primarily designed to aid manipulation and analysis of gridded scientific data. The single command style of NCO allows users to manipulate and analyze files interactively and with simple scripts, avoiding the overhead (and some of the power) of a higher level programming environment. The NCO User’s Guide illustrates their use with examples from the field of climate modeling and analysis. NCO versions 2.8.5 and greater support automatic building and linking for OPeNDAP versions 3.4+ and GCC versions 3.3+. NCO was developed by Charlie Zender.
C/NetCDF
NetCDF (Network Common Data Form) is a set of software libraries and machine-independent data formats that support the creation, access, and sharing of array-oriented scientific data. It is also a community standard for sharing scientific data. The Unidata Program Center supports and maintains netCDF programming interfaces for C, C++, Java, and Fortran. Programming interfaces are also available for Python, IDL, MATLAB, R, Ruby, and Perl.
Java/NetCDF
The NetCDF-Java Library is a Java interface to NetCDF files. It is built on the MultiArray (version 2) package, which is a stand-alone Java package for multidimensional arrays of primitive types. The library optionally includes a netCDF interface to OPeNDAP datasets. Another optional part uses the NetCDF Markup Language (NcML) to allow the definition of virtual netCDF datasets, and to extend the netCDF data model to include general coordinate systems. Developed by NSF Unidata.
VisAD
VisAD is a Java component library for interactive and collaborative visualization and analysis of numerical data. The name VisAD is an acronym for “Visualization for Algorithm Development”. Ugo Taddei’s VisAD Tutorial is a great way to start learning about VisAD. VisAD was developed at the Space Science and Engineering Center at University of Wisconsin-Madison.
Servers
These servers, in addition to our data server (Hyrax), are used to serve data to the OPeNDAP client so the data can be accessed by a user.
THREDDS Data Server (TDS)
From the TDS Web page: The THREDDS (Thematic Realtime Environmental Distributed Data Services) project is developing middleware to bridge the gap between data providers and data users. The goal is to simplify the discovery and use of scientific data and to allow scientific publications and educational materials to reference scientific data.
PyDAP data server
The PyDAP package contains a server that can read Comma Separated Value (CSV) text files, Matlab and netCDF file, as well as from a SQL database.
ERDDAP
ERDDAP is a data server that gives you a simple, consistent way to download subsets of scientific datasets in common file formats and make graphs and maps. ERDDAP is largely used by the oceanographic community for accessing in situ data.
GDS
The GrADS-Data Server is a stable, secure data server that provides subsetting and analysis services across the internet. These services can be provided for any GrADS-readable dataset. The subsetting capability allows users to retrieve a specified temporal and/or spatial subdomain from a large dataset, eliminating the need to download everything simply to access a small relevant portion of a dataset. The analysis capability allows users to retrieve the results of an operation applied to one or more datasets on the server. Examples of analysis operations include basic math functions, averages, smoothing, differencing, correlation, and regression. The GDS supports operations that can be expressed in a single GrADS expression. Developed at COLA.
LAS
The Live Access Server (LAS) is a highly configurable Web server designed to provide flexible access to geo-referenced scientific data. It can present distributed data sets as a unified virtual database through the use of OPeNDAP networking. Ferret is the default visualization application used by LAS, though other applications (Matlab, IDL, GrADS) can also be used.
LAS enables the Web user to:
- visualize data with on-the-fly graphics
- request custom subsets of variables in a choice of file formats
- access background reference material about the data (metadata)
- compare (difference) variables from distributed locations
LAS enables the data provider to:
- unify access to multiple types of data in a single interface
- create thematic data servers from distributed data sources
- offer derived products on the fly
- remedy metadata inadequacies (poorly self-describing data)
- offer unique products (e.g. visualization styles specialized for the data)
LAS was developed at NOAA/PMEL.
Server-less Access
Many clients can now access files directly fron S3 buckets, but from our many years of experience on this matter that does not imply that data access will be optimal. That is particularly true for the vast majority of HDF4, HDF5/NetCDF4 data files, and many other data file formats that were design for spinning disks prior to commercial cloud infrastructures.
With our work with NASA over the last 10 years, we have implemented capabilities in our Hyrax Data Server to produce DMR++ side car files for HDF5/NetCDF4 files, as well as HDF4. DMR++ files contain Chunk References, Compression, byte offsets, and many extra information (thus the ++) that allow the Hyrax Data Server, and potentially any client API that can parse such Hyrax-produced DMR++ files, to accelerate aggregation of multiple files, and throughput of many data files on disk or the commercial cloud.
Broad adoption to parse DMR++ sidecar by clients APIs files remains an ongoing effort. The following are the ones we are aware of:
- VirtualiZarr – A python native package written by Zarr developers as a successor to Kerchunk, can now parse DMR++.
- EarthAccess – A python package to access data from NASA’s Earthdata on the cloud. In combination with VirtualiZarr parser, can aggregate multiple data files via their DMR++.
If you have written a DMR++ parser, let us know! To learn more about DMR++, go to the DMR++ documentation. To learn more about the ongoing effort by the Python community, you can head to the respective documentation of the packages listed above. If you want to see a proof of concept for this integration, check out the Presentation on VirtualiZarr and DMR++ on Zenodo.