rpc : backend refactoring #9912

rgerganov · 2024-10-16T13:31:57Z

Introduce structs for each request/response and separate the code which deals with serialization/deserialization.

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

slaren · 2024-10-16T17:52:40Z

ggml/src/ggml-rpc.cpp

+    // output serialization format: | ptr (8 bytes) | size (8 bytes) |
+    memcpy(&response.ptr, output.data(), sizeof(response.ptr));
+    memcpy(&response.size, output.data() + sizeof(response.ptr), sizeof(response.size));


I don't think this is much better than the previous version. The idea would be to read directly into the struct from the network.

Thanks for looking into this, I get what you mean. Could you take another look and let me know if you agree?

I also noticed that all commands have a known output size, maybe we can skip sending output size from the server?

Yep, that's exactly what I meant.

I also noticed that all commands have a known output size, maybe we can skip sending output size from the server?

I don't have a strong opinion about this either way. Some commands may have variable output size in the future, for example get_description in the device interface, and having the output size may help simplify the code a bit in that case, but it is probably not very important either way.

ggml/src/ggml-rpc.cpp

Use structs for RPC request/response messages

Green-Sky · 2024-10-18T17:43:46Z

ggml/src/ggml-rpc.cpp

+#pragma pack(1)
+struct rpc_msg_alloc_buffer_req {
+    uint64_t size;
+};


this is not really how pack(1) works. every struct after a single pack(1) will be affected, only after another pack() (empty), it will be the value from before.

Nice catch, I have posted #9959 to fix this

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Oct 16, 2024

slaren reviewed Oct 16, 2024

View reviewed changes

rgerganov force-pushed the rpc-refactor branch from 138c9b2 to 4631edc Compare October 17, 2024 07:38

ggerganov reviewed Oct 17, 2024

View reviewed changes

ggml/src/ggml-rpc.cpp Outdated Show resolved Hide resolved

rgerganov force-pushed the rpc-refactor branch from 4631edc to 38a671a Compare October 17, 2024 14:08

rgerganov added 2 commits October 18, 2024 10:13

rpc : refactor backend

98f4e5d

Use structs for RPC request/response messages

rpc : refactor server

c9e549c

rgerganov force-pushed the rpc-refactor branch from 38a671a to c9e549c Compare October 18, 2024 07:25

rgerganov marked this pull request as ready for review October 18, 2024 07:25

slaren approved these changes Oct 18, 2024

View reviewed changes

ggerganov approved these changes Oct 18, 2024

View reviewed changes

rgerganov merged commit afd9909 into ggerganov:master Oct 18, 2024
53 checks passed

Green-Sky reviewed Oct 18, 2024

View reviewed changes

drollings pushed a commit to drollings/llama.cpp that referenced this pull request Oct 18, 2024

rpc : backend refactoring (ggerganov#9912)

11c29ee

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024

rpc : backend refactoring (ggerganov#9912)

df96a7d

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024

rpc : backend refactoring (ggerganov#9912)

4bd42f7

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

rpc : backend refactoring (ggerganov#9912)

3d82c31

* rpc : refactor backend Use structs for RPC request/response messages * rpc : refactor server

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

rpc : backend refactoring #9912

rpc : backend refactoring #9912

rgerganov commented Oct 16, 2024

slaren Oct 16, 2024

rgerganov Oct 17, 2024

slaren Oct 17, 2024

Green-Sky Oct 18, 2024

rgerganov Oct 20, 2024

rpc : backend refactoring #9912

rpc : backend refactoring #9912

Conversation

rgerganov commented Oct 16, 2024

slaren Oct 16, 2024

Choose a reason for hiding this comment

rgerganov Oct 17, 2024

Choose a reason for hiding this comment

slaren Oct 17, 2024

Choose a reason for hiding this comment

Green-Sky Oct 18, 2024

Choose a reason for hiding this comment

rgerganov Oct 20, 2024

Choose a reason for hiding this comment