Skip to content
Christopher J. Morrone edited this page Nov 9, 2023 · 48 revisions

The OVIS High Performance Computing Monitoring, Analysis, and Visualization Project

For information on using and installing LDMS: https://ovis-hpc.readthedocs.io/en/latest/

To join the LDMS Users Group Mailing List: https://github.com/ovis-hpc/ovis-wiki/wiki/Mailing-Lists

What is OVIS?

OVIS is a collection of sub-projects that, in the aggregate provide capabilities for HPC data collection, transport, storage, analysis, visualization, and response. The OVIS project seeks to enable more effective use of High Performance Computational Clusters via greater understanding of applications' use of resources, including the effects of competition for shared resources; discovery of abnormal system conditions; and intelligent response to conditions of interest.

Major Components

  • Lightweight Distributed Metric Service (LDMS) provides a low overhead monitoring, transport and storage capability.
  • Baler provides a generic log file tagging, compression, and search capability.
  • Scalable Object Store (SOS) provides a high performance storage and search capability.

Build quick reference

The current distribution includes only the LDMS monitoring, transport, and storage components.

  • LDMS is downloadable here, at github.
    • Information on building, configuration, and deployment can be found in the Documentation the sidebar menu. Publications including design descriptions, overhead assessments, and analysis cases can be found from the sidebar as well.
    • The LDMS distribution comes with a complete set of man pages for each of the commands and plugins, as well as a Quick Start guide.
  • SOS can be downloaded here.
  • Baler can be downloaded here

Main

LDMSCON

Tutorials are available at the conference websites

D/SOS Documentation

LDMS v4 Documentation

Basic

Configurations

Features & Functionalities

Working Examples

Development

Reference Docs

Building

Cray Specific
RPMs
  • Coming soon!

Adding to the code base

Testing

Misc

Man Pages

  • Man pages currently not posted, but they are available in the source and build

LDMS Documentation (v3 branches)

V3 has been deprecated and will be removed soon

Basic

Reference Docs

Building

General
Cray Specific

Configuring

Running

  • Running

Tutorial

Clone this wiki locally