
Diesel Performance Fixes, Batching Improvements, New Allocator #262

Merged · 16 commits into develop · Jan 7, 2025

Conversation

@jr1221 (Contributor) commented Dec 21, 2024

Changes

  • Switch to diesel-async, to better use the tokio runtime to organize threads
    Rationale: Performance of blocking threads decreases with more threads in use, affecting large chunk batching and the [Scylla] - Investigate a new CSV handler #243 code
  • Switch to jemalloc, as extreme memory use was being observed with [Scylla] - Investigate a new CSV handler #243
    Rationale: The system allocator was not freeing the memory used by the batching tasks effectively, resulting in excessive memory usage. It now takes approximately 1-2 GB of RAM to upload an hour of data, and that RAM is released back when the upload is complete. Previously it was 5-10 GB, and the memory was not freed.
  • Modify the values field of the DATA table to be NOT NULL, and change its type from DOUBLE PRECISION to REAL.
    Rationale (found in the docs for DataInsert): This aligns closer to the actual data we receive, which is non-null 4-byte floating-point values. This allows less re-allocation of data when converting our received protobuf to the data the database is looking for, improving performance.
  • Redo chunking logic.
    Rationale: Before, chunks of data to upload were split into a few even chunks plus one chunk of only a few points. The new algorithm better evens out the chunks. It could still use improvement, and more investigation into the libpq instruction limit may be warranted.
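The evened-out chunking described above can be sketched as follows. This is an illustrative stand-alone sketch, not the PR's exact code, and the 8190-row cap is an example value:

```rust
// Illustrative sketch of the evened-out chunking: instead of several full
// chunks plus one tiny remainder, compute a chunk size so the pieces come
// out nearly equal while never exceeding the per-query cap.
fn chunk_size(msg_len: usize, cap: usize) -> usize {
    // How many chunks are needed so that no chunk exceeds `cap`.
    let n_chunks = (msg_len / cap) + 1;
    msg_len / n_chunks
}

fn main() {
    // 20_000 rows with a cap of 8190: three chunks of ~6666 rows each,
    // rather than two chunks of 8190 and one leftover chunk of 3620.
    let size = chunk_size(20_000, 8190);
    assert!(size <= 8190);
    println!("{size}");
}
```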

Notes

Jemalloc may fail to build on aarch64 (hence the CI runs), so we should pay attention to any unsoundness or stability issues.
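For reference, wiring in jemalloc as the global allocator typically looks like the snippet below. The crate name (`tikv-jemallocator`) and version are assumptions for illustration; the PR does not state which jemalloc crate it uses:

```rust
// Assumed Cargo.toml entry (illustrative, not taken from this PR):
// [dependencies]
// tikv-jemallocator = "0.6"

// Opt the whole binary into jemalloc. On targets where the crate does not
// build (e.g. some aarch64 setups, per the note above), this attribute can
// be gated behind #[cfg(...)] to fall back to the system allocator.
#[global_allocator]
static GLOBAL: tikv_jemallocator::Jemalloc = tikv_jemallocator::Jemalloc;
```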

Test Cases

  • All functionality normal, perhaps a little faster.

To Do

Any remaining things that need to get done

  • Ensure CI works

Checklist

It can be helpful to check the Checks and Files changed tabs.
Please review the contributor guide and reach out to your Tech Lead if anything is unclear.
Please request reviewers and ping on slack only after you've gone through this whole checklist.

  • All commits are tagged with the ticket number
  • No linting errors / newline at end of file warnings
  • All code follows repository-configured prettier formatting
  • No merge conflicts
  • All checks passing
  • Screenshots of UI changes (see Screenshots section)
  • Remove any non-applicable sections of this template
  • Assign the PR to yourself
  • No package-lock.json changes (unless dependencies have changed)
  • Request reviewers & ping on Slack
  • PR is linked to the ticket (fill in the closes line below)

Closes #244
Closes #184

@jr1221 jr1221 self-assigned this Dec 21, 2024
@jr1221 jr1221 requested a review from bracyw January 5, 2025 21:37
tokio::task::spawn_blocking(move || {
    DbHandler::batch_upload(owned, pool)
});
let msg_len = msgs.len();
let chunk_size = msg_len / ((msg_len / 8190) + 1);
Contributor:

Why are you now dividing by 8190 instead of 16380? Is there something special about dividing max params by 8 instead of 4?

@jr1221 (Contributor, Author):

Oh ya forgot to investigate that. The switch to diesel async doubled the number of instructions per insert. I'll do some investigating there. It's a very annoying limit.

@@ -81,17 +110,16 @@ impl DbHandler {
// libpq has max 65535 params, therefore batch
Contributor:

I actually never understood this fully. Why is the batching logic needed for diesel? Is it because it lets you try to insert more data than libpq can handle? Does Prisma just manage this itself?

@jr1221 (Contributor, Author):

I guess prisma splits the queries as needed. I doubt they work around libpq as it's kinda the best way to communicate with postgres. It's a very annoying limit but kinda inherent to postgres.
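For context, the limit under discussion is libpq's cap of 65,535 bind parameters per statement, so the row budget for one INSERT is that cap divided by the parameters bound per row. The column counts below are illustrative, not taken from Scylla's schema:

```rust
// libpq allows at most 65_535 bind parameters in a single statement, so the
// number of rows that fit in one batched INSERT shrinks as each row binds
// more parameters.
const LIBPQ_MAX_PARAMS: usize = 65_535;

fn max_rows_per_insert(params_per_row: usize) -> usize {
    LIBPQ_MAX_PARAMS / params_per_row
}

fn main() {
    // At 4 parameters per row, 16_383 rows fit in one statement; if a
    // query builder binds 8 per row instead, the budget halves to 8_191.
    println!("{}", max_rows_per_insert(4)); // 16383
    println!("{}", max_rows_per_insert(8)); // 8191
}
```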

@bracyw (Contributor) left a comment:

Integration tests fail sometimes when you either have the database disabled in Docker or haven't set it up yet. I think just adding a sleep, like in your GitHub tests, would fix it. For now we can probably just restart or remove the database if it is already a Docker container but disabled. I can push changes for this if you want.

Everything compiles, tests, and runs fine on Mac M1.

@jr1221 (Contributor, Author) commented Jan 6, 2025

Ya, can you add the fix to integration tests? Thanks.

@bracyw bracyw self-requested a review January 7, 2025 03:08
@bracyw (Contributor) left a comment:

LGTM

@jr1221 jr1221 merged commit 86cf64c into develop Jan 7, 2025
4 checks passed
@jr1221 jr1221 deleted the 244-batch-diesel-perf branch January 7, 2025 03:13
Successfully merging this pull request may close these issues.

  • [Scylla] - Fix batching logic
  • [Scylla] - Investigate performance improvements