Solve perf issue serializing large collections #239

DavyLandman · 2024-01-27T08:59:52Z

We have a stackless structured visitor that doesn't require large call stacks to deal with deeply nested ASTs.

While this worked great for deep ASTs, it doesn't work great for big flat collections (aka wide).

Big collections would require the same amount of memory, just to prepare the stack with all the entries. Now we have iterating entries on the stack, that can be "returned to" multiple times.

Note, I had to change the tests, as they assumed reverse orders for sets & maps in the visitor, which was not an actual requirement.

github-actions · 2024-01-27T09:05:30Z

Test Results

64 files - 32 64 suites - 32 4m 5s ⏱️ - 2m 28s
242 291 tests ± 0 242 290 ✅ ± 0 1 💤 ±0 0 ❌ ±0
484 640 runs - 242 320 484 638 ✅ - 242 319 2 💤 - 1 0 ❌ ±0

Results for commit 3876e84. ± Comparison against base commit 8f94c94.

♻️ This comment has been updated with latest results.

We have a stackless structured visitor that doesn't require large call stacks to deal with deeply nested ASTs. While this worked great for deep ASTs, it doesn't work great for big flat collections (aka wide). Big collections would require the same amount of memory, just to prepare the stack with all the entries. Now we have iterating entries on the stack, that can be "returned to" multiple times.

jurgenvinju · 2024-01-28T17:45:10Z

Ok nice. I was halfway something very similar; you beat me to it ♥️

jurgenvinju · 2024-01-29T08:58:00Z

looks great

DavyLandman · 2024-01-29T11:28:21Z

thanks, the solution came to me the middle of brushing my teeth, then it took 30min to implement it.

I'll add synthetic benchmark results soon.

DavyLandman requested a review from jurgenvinju January 27, 2024 08:59

DavyLandman force-pushed the faster-serialize-large-collections branch from be73eff to ab905ed Compare January 27, 2024 09:00

DavyLandman force-pushed the faster-serialize-large-collections branch from ab905ed to d1102a2 Compare January 27, 2024 11:17

DavyLandman force-pushed the faster-serialize-large-collections branch from d1102a2 to 3876e84 Compare January 28, 2024 10:01

jurgenvinju merged commit 5b9c1db into main Jan 29, 2024
8 of 9 checks passed

jurgenvinju deleted the faster-serialize-large-collections branch January 29, 2024 08:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Solve perf issue serializing large collections #239

Solve perf issue serializing large collections #239

DavyLandman commented Jan 27, 2024

github-actions bot commented Jan 27, 2024 •

edited

Loading

jurgenvinju commented Jan 28, 2024

jurgenvinju commented Jan 29, 2024

DavyLandman commented Jan 29, 2024

Solve perf issue serializing large collections #239

Solve perf issue serializing large collections #239

Conversation

DavyLandman commented Jan 27, 2024

github-actions bot commented Jan 27, 2024 • edited Loading

Test Results

jurgenvinju commented Jan 28, 2024

jurgenvinju commented Jan 29, 2024

DavyLandman commented Jan 29, 2024

github-actions bot commented Jan 27, 2024 •

edited

Loading