
[5.0] [Performance] Improve large line performance. Improve line smoothing. #13314

Merged

merged 12 commits from line-optimize into next on Sep 21, 2020

Conversation

pissang commented Sep 20, 2020

This pull request includes two improvements to the line series.

  1. Reduce initial render time (and memory cost) by up to 10x for line series with millions of time-series data points.
  2. Improve the smoothing result in some cases.

Details about the performance improvements.

1. Use TypedArray.

The most important change for performance is to use TypedArrays instead of normal arrays as much as possible. This significantly reduces heap memory usage and GC cost.

In this PR we use a TypedArray to store the points of polylines and polygons. There is also no setItemLayout anymore.
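
For illustration, a minimal sketch of the layout change (the variable names are illustrative, not the actual echarts internals):

// Nested structure: one small GC-tracked array per point.
const nested = [[12, 34], [56, 78], [90, 12]];

// Flat TypedArray: a single contiguous buffer, with no per-point
// objects for the GC to allocate and trace.
const flat = new Float64Array(nested.length * 2);
for (let i = 0; i < nested.length; i++) {
    flat[i * 2] = nested[i][0];     // x of point i
    flat[i * 2 + 1] = nested[i][1]; // y of point i
}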

2. Bake the transform of dataToPoint to matrix in Cartesian2D

This simplifies the transform to a handful of very fast multiply-and-add operations per point, as sketched below. But it only works when the transform is affine, which is the case when the axis type is time or value, so it mainly boosts time-series data.
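
A sketch of what the baked transform looks like per point, assuming the common [a, b, c, d, tx, ty] 2D affine matrix layout (the function name and exact layout here are illustrative):

// m = [a, b, c, d, tx, ty]: the baked data-to-pixel affine matrix.
// Each point costs only four multiplies and four adds.
function transformPoints(data, m, out) {
    const a = m[0], b = m[1], c = m[2], d = m[3], tx = m[4], ty = m[5];
    for (let i = 0; i < data.length; i += 2) {
        const x = data[i];
        const y = data[i + 1];
        out[i] = a * x + c * y + tx;     // pixel x
        out[i + 1] = b * x + d * y + ty; // pixel y
    }
    return out;
}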

3. Remove unnecessary date parsing when it's a timestamp.

This avoids creating millions of temporary Date objects when parsing time-series data.
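
A hedged sketch of the fast path (the function is illustrative, not the actual echarts parser):

// Skip Date entirely when the input is already a numeric timestamp;
// only non-numeric inputs still pay for constructing a Date object.
function parseTimeValue(value) {
    if (typeof value === 'number') {
        return value; // already a millisecond timestamp
    }
    return new Date(value).getTime();
}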

4. Speed up storage access in the List component.

This is a trick for modern JS engines. I found that storage access was slow when initializing millions of data points in https://github.com/apache/incubator-echarts/blob/99ab1d7a2f1413ce38dd55d3808c4e434fe49ba8/src/data/List.ts#L628
When I profiled it, I saw KeyedLoadIC_Megamorphic in the result, which means the property access is not on the engine's fast path.

So we reduced the case to the following code.

// A shared object whose properties are read in a hot loop.
const obj = {};
obj.x = 1;
obj.y = 1;

// Slow: the property key is computed, so the engine performs a keyed
// (megamorphic) load instead of an optimized named property access.
function slowObjectAccess() {
    let sum = 0;
    for (let i = 0; i < 1e6; i++) {
        for (let k = 0; k < 2; k++) {
            const val = obj[k === 0 ? 'x' : 'y'];
            sum += val;
        }
    }
    return sum;
}

// Fast: each branch uses a fixed property name (obj.x / obj.y),
// which the engine can optimize with inline caches.
function fastObjectAccess() {
    let sum = 0;
    for (let i = 0; i < 1e6; i++) {
        for (let k = 0; k < 2; k++) {
            sum += k === 0 ? obj.x : obj.y;
        }
    }
    return sum;
}

The test result shows fastObjectAccess can be 10x faster than slowObjectAccess. I couldn't figure out why, or how to avoid it, because of my limited knowledge of JS engine internals. So I added an extra _storageArr array and read from it in hotspot code like List#_initDataFromProvider. It works well, and the cost of _initDataFromProvider was cut in half.
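
A simplified sketch of the workaround (the real List implementation differs; the dims and getValue names here are hypothetical):

// Mirror the keyed storage in a plain array once, outside the hot loop,
// so the inner loop does a monomorphic indexed load instead of a
// megamorphic keyed property access.
const count = 1e6;
const storage = { x: new Float64Array(count), y: new Float64Array(count) };
const dims = ['x', 'y'];
const storageArr = dims.map(dim => storage[dim]); // built once

function initDataFromProvider(getValue) {
    for (let i = 0; i < count; i++) {
        for (let k = 0; k < dims.length; k++) {
            storageArr[k][i] = getValue(i, k); // fast: numeric array index
        }
    }
}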

Performance results

Render time comparison between 4.x and 5.0

[screenshot: render time comparison between 4.x and 5.0]

Heap memory comparison between 4.x and 5.0

[screenshot: heap memory comparison between 4.x and 5.0]

Details about the smoothing improvement.

The original smoothing algorithm constrains the control points to the global bounding box, which may lead to two issues, as shown in the following picture.

[screenshot: two issues with the original smoothing]

So in this PR I made two changes.

  1. Constrain the control points to the bounding box of the two neighboring points locally (see the sketch below).
  2. Recalculate the previous control point after the current control point is changed.
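
For illustration, a rough sketch of change (1) (a hypothetical helper, not the actual PR code):

// Clamp a bezier control point to the bounding box of its two
// neighboring data points instead of the series' global bounding box.
function clampToLocalBBox(cp, p0, p1) {
    const minX = Math.min(p0[0], p1[0]), maxX = Math.max(p0[0], p1[0]);
    const minY = Math.min(p0[1], p1[1]), maxY = Math.max(p0[1], p1[1]);
    return [
        Math.min(Math.max(cp[0], minX), maxX),
        Math.min(Math.max(cp[1], minY), maxY)
    ];
}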

And the result after the change:

[screenshot: smoothing result after the change]

Fixed issues

#12249
#10200
#4556

echarts-bot commented Sep 20, 2020

Thanks for your contribution!
The community will review it ASAP. In the meantime, please check out the coding standard and the Wiki about How to make a pull request.

The pull request is marked to be PR: author is committer because you are a committer of this project.

leeoniya commented Sep 20, 2020

wow, this is super exciting. can't wait to test this improvement on the uPlot benchmark!

https://github.com/leeoniya/uPlot#performance

a 10x improvement would move echarts to second place.

would you mind reviewing my implementation to make sure i'm using the library correctly, and also letting me know when this will be available to test in a linkable build?

https://github.com/leeoniya/uPlot/blob/master/bench/ECharts.html

cheers!

leeoniya commented Sep 20, 2020

also, your smoothing does not represent the data well. take a look at using the Catmull-Rom Spline (Centripetal) algo (not fully optimized):

https://leeoniya.github.io/uPlot/demos/line-smoothing.html

in general, the smoothing changes are probably best for a separate PR from the performance improvements.

pissang commented Sep 20, 2020

@leeoniya Thanks for the benchmark. I'm really impressed by the performance of uPlot!

I added an optimized version that takes a Float64Array as input data. It's a bit more complex to write, but it performs better and uses less heap memory.

Optimized version:
https://echarts-bench-from-uplot.glitch.me/index-optimized.html

Original version:
https://echarts-bench-from-uplot.glitch.me/

I tested the optimized version on my 15-inch Mac. The initialization time is about 170ms, and the heap memory is about 10~12MB at peak and 5MB final.

About the smoothing algorithm: I have tried the Catmull-Rom spline before, but it has similar issues, which can look quite weird:

[screenshot: Catmull-Rom spline artifact]

I'm not sure if you made any modifications to the original Catmull-Rom spline algorithm.

pissang commented Sep 20, 2020

Made some modifications so this case looks better (it won't affect the performance).

[screenshot: smoothing after the modifications]

pissang commented Sep 20, 2020

PS: I did both the performance improvement and the smoothing algorithm improvement while refactoring the polyline builder, so I didn't try to separate them into two PRs.

leeoniya commented Sep 20, 2020

> I'm not sure if you made any modifications to the original Catmull-Rom spline algorithm.

i simply adapted the implementation from this demo file for uPlot: https://github.com/leeoniya/uPlot/blob/master/demos/line-smoothing.html#L49

i don't think the version i'm using would behave like your posted image, given the very few and sparse datapoints that are visible. can you try your dataset in the uPlot demo and see how that looks? (sorry, i'm on holiday and on my phone, so i cannot test it for a while.)

pissang commented Sep 21, 2020

https://echarts-bench-from-uplot.glitch.me/uplot-smoothing.html

I tried it, and the result seems a bit different from my Catmull-Rom spline implementation. I haven't dug into it yet; I'm not sure if it's because I drew the spline with interpolated points while uPlot uses bezier curves.

But it looks similar to our original bezier curve implementation without limiting the curve to the min/max extremes. The problem is that the curve can exceed the extremes, for example dipping under the 0 baseline or exceeding the max value (which matters if the max value has a meaning, like 100 percent).

[screenshot: curve exceeding the extremes]

If we limit the curve to the global min/max extremes:
https://echarts-bench-from-uplot.glitch.me/uplot-smoothing-limit-extrema.html

it becomes almost the same as the result before the change in this PR.

[screenshot: curve limited to global extremes]

@pissang pissang merged commit a40b939 into next Sep 21, 2020
echarts-bot commented Sep 21, 2020

Congratulations! Your PR has been merged. Thanks for your contribution! 👍

@pissang pissang deleted the line-optimize branch September 21, 2020 02:59
pissang commented Sep 21, 2020

Merged this PR.

@leeoniya We can still discuss the smoothing algorithm here. I will open another PR to improve it if necessary.

leeoniya commented Sep 21, 2020

thanks for the detailed reply. i agree that exceeding the local extremes is a worse outcome than a more "S"-shaped creative interpolation between datapoints.

overall, i hate smoothed charts since they trade data clarity for a stylistic choice. similar to stacked area/trendlines. i wish people would simply stop using these types of misleading charts. hopefully by not supporting these in uPlot i help a little to accelerate their demise :)

i've tried feeding typed arrays to uPlot in the past and surprisingly did not measure a difference in performance. i'll take a closer look in a few days at your improvement to the uPlot bench, but i was surprised that typed arrays made a significant difference for you. maybe i should look into it again.

out of curiosity, does echarts handle data gaps by default or is this an additional setting? in uPlot, the per-datapoint null-testing adds ~20% overhead to the bench in both dataset min/max finding and path drawing. i decided it was an okay trade-off instead of bloating the code and maintaining slightly different dedicated loops for gap-less data.

pissang commented Sep 21, 2020

> i've tried feeding typed arrays to uPlot in the past and surprisingly did not measure a difference in performance. i'll take a closer look in a few days at your improvement to the uPlot bench, but i was surprised that typed arrays made a significant difference for you. maybe i should look into it again.

Perhaps it's because echarts uses a different data structure, which performs worse than uPlot's when using a normal array.

I noticed the data structure used in uPlot is

[
  [........], // x dimension
  [........], // y dimension
  [........]  // z dimension
]

which is much flatter than the structure echarts uses:

[
  [x, y, z],
  [x, y, z],
  [x, y, z]
]

The flatter structure may need less heap memory and cause less GC pressure.

Using Float64Array in echarts is mainly about avoiding long GC pauses, because a TypedArray's backing buffer doesn't add objects to the GC-managed heap. I tested read/write on Float64Array; it's not much different from a normal array.

Also, echarts provides a faster path for processing Float64Array data. When processing a normal array, we need to check the structure of each item, because the structures we support are quite flexible. That's convenient for developers, but not good for performance :(
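
Roughly, the optimized benchmark feeds data like this (a sketch under the assumption that the 5.0 line series accepts a flat typed array, as in the benchmark pages above; the element id is illustrative):

const n = 1e6;
const data = new Float64Array(n * 2); // flat [x0, y0, x1, y1, ...] layout
for (let i = 0; i < n; i++) {
    data[i * 2] = i;                  // x (e.g. a timestamp)
    data[i * 2 + 1] = Math.random();  // y
}
const chart = echarts.init(document.getElementById('chart'));
chart.setOption({
    xAxis: { type: 'value' },
    yAxis: { type: 'value' },
    series: [{ type: 'line', data: data }]
});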

> out of curiosity, does echarts handle data gaps by default or is this an additional setting? in uPlot, the per-datapoint null-testing adds ~20% overhead to the bench in both dataset min/max finding and path drawing. i decided it was an okay trade-off instead of bloating the code and maintaining slightly different dedicated loops for gap-less data.

echarts handles null data by default. I haven't tested how much time could be saved by removing the null testing, but I guess it's not the bottleneck for echarts currently.
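
For example, a minimal sketch (echarts treats null as an empty value by default; the element id is illustrative):

// null (or NaN / '-') is treated as an empty point, so the polyline
// is broken into two segments around it — no extra setting needed.
const chart = echarts.init(document.getElementById('chart'));
chart.setOption({
    xAxis: { type: 'category', data: ['a', 'b', 'c', 'd', 'e'] },
    yAxis: { type: 'value' },
    series: [{ type: 'line', data: [1, 3, null, 5, 4] }]
});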

pissang commented Sep 24, 2020

Hi @leeoniya

I did a bit more optimization over the last two days. The time is now down to about 140ms. With LTTB downsampling enabled, it can be under 100ms on my 15-inch MacBook, and the displayed result is almost the same.

https://echarts-bench-from-uplot.glitch.me/index-lttb.html
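
Enabling it is a one-line series option (a sketch assuming the sampling: 'lttb' value on the 5.0 line series; the element id and data here are illustrative):

const chart = echarts.init(document.getElementById('chart'));
const data = [];
for (let i = 0; i < 1e6; i++) {
    data.push([i, Math.sin(i / 1000)]);
}
chart.setOption({
    xAxis: { type: 'value' },
    yAxis: { type: 'value' },
    series: [{
        type: 'line',
        sampling: 'lttb', // Largest-Triangle-Three-Buckets downsampling
        data: data
    }]
});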

I also added two test cases with 10x larger data.

https://echarts-bench-from-uplot.glitch.me/index-large.html
https://echarts-bench-from-uplot.glitch.me/index-lttb-large.html

Just want to thank you and your uPlot for motivating me to push the performance of echarts to the limit.

leeoniya commented

thanks @pissang

would you like to submit a PR to the uPlot repo with the implementation improvements, as well as bump the echarts lib, when the next alpha or beta of 5.0 is tagged and available via the jsDelivr CDN?

pissang commented Sep 28, 2020

@leeoniya Sure.
