More information regarding the benchmarking process here.
For each toolkit-approach configuration, 10 runs are performed. These independent runs are then averaged and a 95% confidence interval is displayed with a shaded area.
For each toolkit-approach configuration datapoint (except for the Plotly configuration w.r.t. duration), very little variation was observed, resulting in a negligibly small shaded area.
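The aggregation described above can be sketched as follows. This is only an illustration: it assumes a Student-t interval over the 10 runs (the text does not state how the 95% confidence interval was computed), and the function name and sample durations are hypothetical.

```python
import math
import statistics

# Two-sided 95% critical value of Student's t with 9 degrees of freedom,
# i.e. for n = 10 runs. Assumption: a t-interval is used for the shaded area.
T_CRIT_95_DF9 = 2.262


def mean_with_ci(samples):
    """Return (mean, lower, upper) of a 95% t-interval over exactly 10 runs."""
    if len(samples) != 10:
        raise ValueError("the hard-coded critical value only holds for 10 runs")
    mean = statistics.fmean(samples)
    # Standard error of the mean, based on the sample standard deviation.
    sem = statistics.stdev(samples) / math.sqrt(len(samples))
    half_width = T_CRIT_95_DF9 * sem
    return mean, mean - half_width, mean + half_width


# Hypothetical durations (seconds) of 10 independent runs of one configuration.
durations = [1.02, 0.98, 1.01, 0.99, 1.00, 1.03, 0.97, 1.00, 1.01, 0.99]
mean, lower, upper = mean_with_ci(durations)
print(f"mean = {mean:.3f} s, 95% CI = [{lower:.3f}, {upper:.3f}]")
```

When the runs barely differ, as in this made-up sample, the interval is narrow, which is why the shaded area is hardly visible for most configurations.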
Package versions: The exact versions of the benchmarked packages can be found in this poetry.lock commit.
tool | version |
---|---|
Bokeh | 2.4.2 |
Plotly | 5.6.0 |
Matplotlib | 3.5.1 |
HoloViews | 1.14.8 |
Plotly-Resampler | 0.3.0 |
Generic figure description:
- The first row displays the average combined duration of constructing and rendering the graph.
- The second row indicates the peak RAM usage.
- The columns indicate the data size per signal, thus showing a trend when scaling to larger datasets.
- For each toolkit-approach a distinct color and style is applied.
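As a rough illustration of the two quantities plotted, the sketch below times one call and records its peak memory. It is not the benchmark's actual harness: it uses `tracemalloc`, which only tracks allocations made through Python's allocator (not total process RAM), it does not cover rendering, and `measure` and `build_signals` are hypothetical names.

```python
import time
import tracemalloc


def measure(build, *args):
    """Run build(*args) once and return (seconds, peak_bytes).

    peak_bytes covers Python-level allocations only, which is an
    assumption-laden proxy for the peak RAM usage reported in the figure.
    """
    tracemalloc.start()
    start = time.perf_counter()
    try:
        build(*args)
    finally:
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    return elapsed, peak


def build_signals(n_traces, n_samples):
    # Hypothetical stand-in for constructing a figure with n_traces signals.
    return [[float(i) for i in range(n_samples)] for _ in range(n_traces)]


elapsed, peak = measure(build_signals, 5, 10_000)
print(f"duration: {elapsed:.4f} s, peak: {peak / 1e6:.2f} MB")
```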
note:
interpretation:
The first row represents the total time to construct and render the visualization.
- The slope of all approaches seems to rise with the increasing number of data points per trace.
- HoloViews-based approaches scale exponentially in terms of the number of visualized traces, rendering them unsuitable for large multivariate visualizations (this is clearer in the second visualization).
- When dealing with more than 10,000,000 samples per series and more than 10 modalities, Plotly-Resampler clearly emerges as the only viable toolkit.

The second row indicates the peak memory usage.
- It is clear that Plotly-Resampler and HoloViews LTTB scale better. These two toolkit-approaches manage to use <700 MB RAM for even the largest configuration (right column).
- Matplotlib and HoloViews rasterize exceed 10 GB.
Remark
- In contrast with the figure above, up to 200 traces are displayed on the x-axis.
- During benchmarking, we stopped scaling to a larger number of traces once the graph construction time exceeded 120 seconds.
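The stopping rule in the remark above could look roughly like the loop below. This is a sketch under assumptions: the function and parameter names are hypothetical, and recording the run that crosses the limit before stopping is a choice made here, not something the text specifies.

```python
import time

TIME_LIMIT_S = 120.0  # threshold stated in the remark


def scale_until_limit(build, trace_counts, limit_s=TIME_LIMIT_S,
                      timer=time.perf_counter):
    """Time build(n) for increasing n; stop after the first run over limit_s."""
    results = {}
    for n_traces in trace_counts:
        start = timer()
        build(n_traces)
        elapsed = timer() - start
        results[n_traces] = elapsed
        if elapsed > limit_s:
            break  # do not scale to larger trace counts
    return results


# Demo with a fake clock so the example runs instantly: the hypothetical
# build step "takes" as many seconds as there are traces.
clock = [0.0]


def fake_build(n_traces):
    clock[0] += n_traces


timings = scale_until_limit(fake_build, [10, 50, 150, 200],
                            timer=lambda: clock[0])
print(timings)
```

Injecting the timer keeps the loop testable without waiting for real two-minute runs.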
interpretation:
- As also shown in the paper figure, HoloViews-based approaches scale exponentially in duration with respect to the number of traces.
- Plotly-Resampler is the only tool able to visualize the most challenging configuration (200 traces, 50M datapoints per trace) in under 120 seconds. More specifically, it needs 60 seconds to do so.