Opensource C++ zero-copy API #1896

xfxyjwf · 2016-07-29T23:10:28Z

Protobuf has zero-copy support to avoid copying string/bytes fields when parsing protobuf messages and it's used pretty much everywhere inside Google, but the feature has never made its way into the opensource repo. Now protobuf 3.0.0 is released and we will probably have more time to look into incremental improvements. The zero-copy API is a good candidate to be included in the next 3.x release.

Opensourcing the zero-copy API will involve:

(1) opensource related string/buffer classes (Cord and its dependencies).
(2) un-exclude zero-copy APIs from the message interface (such as ParseFromStringWithAliasing).
(3) un-exclude the support for ctype = STRING_PIECE and ctype = CORD.

(1) is probably the most difficult part as that's a large chunk of code and it may not be portable.

jjyao · 2017-02-15T17:42:27Z

Any updates on this feature?

xfxyjwf · 2017-02-15T19:08:01Z

@jjyao This unfortunately hasn't made it onto our agenda yet. If this feature is useful to you, can you post your use case here and estimate how much it would help? A more concrete use case example can help us prioritize it.

bobobo1618 · 2017-06-24T01:34:53Z

I'm also quite interested in this feature.

> More concrete use case example can help us prioritize it.

@xfxyjwf I'm writing an application-specific database server with gRPC and RocksDB. I want to:

Accept serialized protos from clients through gRPC and store them in RocksDB verbatim, without parsing and constructing a full object in memory.
Retrieve serialized protos from RocksDB and send them back to clients without parsing and reserializing them, ideally as part of another proto, which would serialize only what's necessary.

I want this because parsing and serialization currently take ~30% of my total response time and I don't really need them.

Here's a flame graph profile that shows what I'm seeing.
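The pass-through case in this comment can be illustrated at the wire-format level. On the wire, an embedded message and a bytes field are both length-delimited records (wire type 2), so already-serialized bytes can be wrapped in an outer field by emitting only a tag and a length in front of them. The sketch below is a rough illustration, not protobuf or gRPC API: `AppendVarint`, `MakeLengthDelimitedHeader` and `PassThroughExample` are hypothetical names, and how the header and payload are actually sent (for example as two buffers) is left to the caller.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <string_view>

// Appends `value` to `out` as a base-128 varint (7 bits per byte, low bits first).
inline void AppendVarint(std::string& out, std::uint64_t value) {
  while (value >= 0x80) {
    out.push_back(static_cast<char>((value & 0x7F) | 0x80));
    value >>= 7;
  }
  out.push_back(static_cast<char>(value));
}

// Hypothetical helper: builds the tag + length prefix of a length-delimited
// field (wire type 2). The payload itself is not touched or copied here.
inline std::string MakeLengthDelimitedHeader(std::uint32_t field_number,
                                             std::size_t payload_size) {
  std::string header;
  AppendVarint(header, (static_cast<std::uint64_t>(field_number) << 3) | 2);
  AppendVarint(header, payload_size);
  return header;
}

// Usage sketch: wrap stored bytes as field 2 of an outer message.
inline void PassThroughExample() {
  const std::string_view stored = "testing";  // stands in for bytes read from storage
  const std::string header = MakeLengthDelimitedHeader(2, stored.size());
  assert(header == std::string("\x12\x07", 2));
  // A caller could now write `header` followed by `stored` without ever
  // parsing `stored` into a message object.
}
```

This only avoids the parse and reserialize steps; whether the payload bytes are also copied depends on the transport used to send them.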

stellanhaglund · 2017-07-02T09:59:06Z

Is this the thing that Cap'n Proto does that makes it faster than protobuf?

xfxyjwf · 2017-07-05T17:52:51Z

@stellanhaglund No, it's not the main cause of the performance difference. Cap'n Proto is very similar to FlatBuffers, and what I described in #3296 can be said of Cap'n Proto as well.

johnfb · 2017-09-26T15:17:57Z

I am very interested in this feature. I have been suggesting at my work that we adopt something like protobuf for a long time. One of the major pushbacks has been the inability to zero-copy large binary/string values. This is because we have many applications where an extra copy or two of the data means the processors/memory bus are saturated.

Our usual processing stream for data looks a lot like:

DMA from network interface to shared memory
pass off the shared memory reference to the process(es) to do calculations
calculations done from shared memory to shared memory
pass shared memory reference to further process(es)
DMA out of the machine

Control messages and metadata are small enough that copying is no problem (and in fact encoding as JSON etc. is usually good enough). Typical data is large matrices (think 16+ MB) of complex integers (often 16-bit), complex IEEE binary16 (half) or complex IEEE binary32 (float), while metadata may be 64 bytes in total, encoded as a struct. Note that we often also have the requirement that the data be machine-vector aligned (typically 32-byte alignment). A "slow" data rate is 3-5 Gigabit/s.

It'd be great if we could encode such data as something like protobuf and not have to manually maintain readers and writers and representations in multiple languages. We are already making an effort to use protobuf for control data, which IMO it already excels at.

arthur-tacca · 2017-09-28T09:59:46Z

Perhaps Cord will / could be open sourced as part of the Abseil library. The initial release doesn't include it, although there is a passing mention in malloc_extension.h.

xfxyjwf · 2017-09-28T17:45:35Z

@arthur-tacca Yep. The Cord type will be included as part of Abseil. And after we migrate to use Abseil, supporting zero-copy ctypes should be straightforward.

chris-hite · 2018-04-24T12:10:23Z

Hello, I ran into some performance problems at my previous HFT job and thought it would be nice to have a zero-copy, heap-free protobuf parser.

If I were going to hand write code that parsed a specific protobuf schema, I'd typically do all my processing on the stack and consume all data in one pass.

I could see a low-level, template-heavy, functional-style C++ decoder giving me the same performance. I would best describe it as: X (name proposals welcome) is to SAX as regular protobuf bindings are to DOM.

instead of heaping a std::string, you get a std::string_view.
if you don't care about a field you should be able to say so at compile time
if you don't want to parse a subobject you should be able to skip it or even break out of parsing
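The three points above can be sketched in a few dozen lines of C++17. This is only an illustration under stated assumptions, not an existing protobuf API: `ReadVarint`, `ForEachField` and `ForEachFieldExample` are hypothetical names, only wire types 0 (varint), 1 (64-bit), 2 (length-delimited) and 5 (32-bit) are handled, and every payload is handed to the visitor as a `std::string_view` that aliases the input buffer, so it is valid only while that buffer is alive.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string_view>

// Reads a base-128 varint from the front of `in`. On success stores the value
// in `out`, advances `in` past it, and returns true.
inline bool ReadVarint(std::string_view& in, std::uint64_t& out) {
  std::uint64_t value = 0;
  for (int shift = 0; shift < 64 && !in.empty(); shift += 7) {
    const auto byte = static_cast<std::uint8_t>(in.front());
    in.remove_prefix(1);
    value |= static_cast<std::uint64_t>(byte & 0x7F) << shift;
    if ((byte & 0x80) == 0) {
      out = value;
      return true;
    }
  }
  return false;  // truncated or over-long varint
}

// Hypothetical SAX-style walker: calls visit(field_number, wire_type, payload)
// for each top-level field. `payload` is the raw encoded value (for wire type 2,
// the field's bytes) and points into `in`; nothing is copied or heap-allocated.
// The visitor returns false to stop early. Returns false on malformed input.
template <typename Visitor>
bool ForEachField(std::string_view in, Visitor&& visit) {
  while (!in.empty()) {
    std::uint64_t tag = 0;
    if (!ReadVarint(in, tag)) return false;
    const auto field = static_cast<std::uint32_t>(tag >> 3);
    const int wire_type = static_cast<int>(tag & 7);
    std::string_view payload;
    switch (wire_type) {
      case 0: {  // varint: keep the raw encoded bytes
        const std::string_view before = in;
        std::uint64_t ignored = 0;
        if (!ReadVarint(in, ignored)) return false;
        payload = before.substr(0, before.size() - in.size());
        break;
      }
      case 1:    // fixed 64-bit
      case 5: {  // fixed 32-bit
        const std::size_t n = (wire_type == 1) ? 8 : 4;
        if (in.size() < n) return false;
        payload = in.substr(0, n);
        in.remove_prefix(n);
        break;
      }
      case 2: {  // length-delimited: strings, bytes, sub-messages
        std::uint64_t len = 0;
        if (!ReadVarint(in, len) || len > in.size()) return false;
        payload = in.substr(0, static_cast<std::size_t>(len));
        in.remove_prefix(static_cast<std::size_t>(len));
        break;
      }
      default:
        return false;  // groups and unknown wire types are not handled
    }
    if (!visit(field, wire_type, payload)) return true;  // visitor asked to stop
  }
  return true;
}

// Usage sketch: field 1 is the varint 150, field 2 is the string "testing".
inline void ForEachFieldExample() {
  const std::string_view wire("\x08\x96\x01\x12\x07testing", 12);
  std::string_view name;
  const bool ok = ForEachField(
      wire, [&](std::uint32_t field, int wire_type, std::string_view payload) {
        if (field == 2 && wire_type == 2) {
          name = payload;
          return false;  // found what we wanted; break out of parsing
        }
        return true;  // not interested in this field; skip it
      });
  assert(ok);
  assert(name == "testing");
  assert(name.data() == wire.data() + 5);  // aliases the input, no copy
}
```

Skipping a sub-message falls out for free here: it is just a wire type 2 payload that the visitor chooses not to recurse into.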

On the generation side I could see doing something similar.

it should be possible for a message routing app to pass payloads without decoding them

Is there interest for this kind of thing? My fear is that C++ guys that really care about performance would avoid protobuf anyway. I guess my target audience are skilled C++ devs worried about performance forced to speak protobuf for historical reasons or a contract with outside components.

Does anyone have a spiffy name?

Has anyone seen something like this? I found lots of alternative wire formats with language bindings: SBE, CapNProto, etc.

toddlipcon · 2018-04-24T16:24:31Z

I don't think we would switch to using a SAX-like parser except maybe in some very specific circumstances in our project. For us, the overhead of most PBs is negligible (and I expect to be even lower when we switch to using arenas). The main exception is the std::string allocation of lots of tiny strings -- we're stuck on the pre-C++11 ABI, so every string ends up being a heap allocation/free pair.

FSMaxB · 2018-05-12T16:41:39Z

I can't use this library without that feature at all!

I use an arena, because I store sensitive key material in my protobuf messages and I provided an allocator with safe memory to the arena (sodium_malloc, not swapped out, zeroed out on free, guard pages etc.).

Given that the key material is stored in bytes fields, protobuf allocates them on the heap in std::string and completely bypasses the safe memory that I want the keys to reside in.

I already halfway ported my code from protobuf-c to protobuf, only now finding out that all my key material completely bypasses the arena. So now it seems like I have to throw that away and stick with protobuf-c (which makes me really unhappy).

tianyapiaozi · 2018-06-12T07:13:23Z

Any updates on this feature?

gerben-s · 2018-08-06T20:48:21Z

I think string_view should be a solid contender to be fully released soon.

Cords are a considerably more heavyweight type. Integrating ZCIS with Cords ties our most basic library directly into ABSL, so we tread a little more carefully here.

MyUmmaGumma · 2018-08-07T17:37:54Z

@gerben-s
Could you please elaborate what ZCIS/Cord and StringView are with respect to zero-copy?

gerben-s · 2018-08-20T20:46:02Z

Zero-copy parsing of strings can be achieved by aliasing string_views or Cords with the underlying buffer. Cord is a heavyweight type from the absl lib, which needs to be directly supported by our ZeroCopyInputStream (ZCIS) abstraction.
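To make the string_view versus Cord distinction concrete, here is a rough sketch of why a chunked input stream matters. When the input arrives as several separate chunks, a field that lies inside one chunk can be aliased by a single view, but a field that straddles a chunk boundary needs a sequence of fragments, which is the kind of value a Cord-like type can represent. The function `AliasRange` and the example are hypothetical, and this is not how absl::Cord or ZeroCopyInputStream are implemented.

```cpp
#include <cassert>
#include <cstddef>
#include <string_view>
#include <vector>

// Hypothetical helper: treats `chunks` as one logical byte stream and returns
// views covering [offset, offset + len) without copying. Each returned view
// aliases one of the chunks. If the range runs past the end of the stream,
// the result covers only the bytes that exist.
inline std::vector<std::string_view> AliasRange(
    const std::vector<std::string_view>& chunks, std::size_t offset,
    std::size_t len) {
  std::vector<std::string_view> fragments;
  for (std::string_view chunk : chunks) {
    if (len == 0) break;
    if (offset >= chunk.size()) {  // range starts in a later chunk
      offset -= chunk.size();
      continue;
    }
    // substr clamps the count to the end of the chunk.
    const std::string_view piece = chunk.substr(offset, len);
    fragments.push_back(piece);
    len -= piece.size();
    offset = 0;
  }
  return fragments;
}

// Usage sketch: one contiguous field, and one that spans two chunks.
inline void AliasRangeExample() {
  const std::vector<std::string_view> chunks = {"hello ", "world"};
  const auto inside = AliasRange(chunks, 0, 5);
  assert(inside.size() == 1 && inside[0] == "hello");
  assert(inside[0].data() == chunks[0].data());  // same memory, no copy
  const auto across = AliasRange(chunks, 3, 5);
  assert(across.size() == 2 && across[0] == "lo " && across[1] == "wo");
}
```

A single string_view can only stand in for the first case; the second case either needs a copy into contiguous memory or a multi-fragment type.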

gerben-s · 2018-09-10T20:41:31Z

@FSMaxB On the level of safety I understand your wishes, but it's hard for us to make any such guarantee about not storing memory on the heap.

If you have such stringent security demands, I think C++ protobuf is not the right fit.

We are thinking about how to expose aliasing but we want to be careful and expose the right API.

FSMaxB · 2018-09-11T16:46:04Z

> We are thinking about how to expose aliasing but we want to be careful and expose the right API.

That makes sense, especially without std::string_view/std::span

pcmoritz · 2019-11-18T22:35:56Z

Is there any update on this feature? This would be very useful. Now that absl has a string_view implementation (https://github.com/abseil/abseil-cpp/blob/master/absl/strings/string_view.h) it seems like that could be used :)

prem-nuro · 2020-02-19T02:32:05Z

absl::Cord has just been released: abseil/abseil-cpp@3c81410

These changes allow using the C++ gRPC arena feature to allocate the messages from the same location. The initial motivation was to keep all the message strings in the same memory region, but it seems that the open-source version of gRPC doesn't allow it, and since Google has this feature only internally, it isn't possible to create a PR. Please, Google, if you don't mind, could you spend a little of your engineering team's time to open-source it? protocolbuffers/protobuf#1896

arthur-tacca · 2020-06-03T11:41:10Z

I think there are two requests here:

(a) Allow ctype = STRING_PIECE
(b) Allow ctype = CORD

The original comment says "(1) opensource ... Cord and its dependencies ... is probably the most difficult part" But surely that's only needed for ctype = CORD? For ctype = STRING_PIECE, a vendored copy of StringPiece has been included in open-source protobuf for years. That leaves items (2) and (3) in the original comment (un-excluding the relevant code from the open-source release). This might be a lot less work than releasing the full feature including CORD, assuming (2) and (3) can reasonably be done for STRING_PIECE without also doing them for CORD.

The ctype = STRING_PIECE feature solves the zero-copy problem in the case where the string you want to refer to without copying is contiguous, which is probably enough functionality for many people (e.g. me 😃). So perhaps, rather than waiting for some solution involving Cord, just the string piece functionality could be open sourced?

I thought this was already well understood, but reading through the comments it seems it hasn't been mentioned here before. The comments mostly discuss alternative types such as std::string_view and absl::Cord, but there's been no mention of protobuf::StringPiece.

acozzette · 2020-06-03T12:17:25Z

std::string_view has made our StringPiece type obsolete, so I don't think we want to expose StringPiece publicly in any more places if we can avoid it. Eventually we will likely want to replace it with std::string_view. The main problem is that to get access to std::string_view, we need to require C++17 (currently we only require C++11). The other possibility is to depend on ABSL and use absl::string_view, but that would be a non-trivial change as well.

arthur-tacca · 2020-06-03T12:39:38Z

That makes sense, thanks.

pitrou · 2020-06-03T12:55:55Z

There are also standalone string_view backports. In Arrow we use https://github.com/martinmoene/string-view-lite successfully.

fm123456 · 2020-08-07T02:21:07Z

> I can't use this library without that feature at all!
>
> I use an arena, because I store sensitive key material in my protobuf messages and I provided an allocator with safe memory to the arena (sodium_malloc, not swapped out, zeroed out on free, guard pages etc.).
>
> Given that the key material is stored in bytes fields, protobuf allocates them on the heap in std::string and completely bypasses the safe memory that I want the keys to reside in.
>
> I already halfway ported my code from protobuf-c to protobuf, only now finding out that all my key material completely bypasses the arena. So now it seems like I have to throw that away and stick with protobuf-c (which makes me really unhappy).

I also encountered the same problem. Have you solved it?

FSMaxB · 2020-08-10T20:52:10Z

> I also encountered the same problem, have you solved it?

Yes, by first porting my code back to protobuf-c. Later abandoning the entire project and then never using protobuf ever again in the future.

toddlipcon · 2020-08-11T22:34:00Z

Just in case anyone finds it useful, I did a little hacking on a branch that supports storing string buffers in arenas: toddlipcon@00cc310

The above only supports it on the serialization side -- i.e. if you call proto_on_arena.set_foo(const std::string& bar) it will copy bar's contents on the Arena and make a std::string-compatible-memory-layout object to point to it. Note that it's also specific to libstdcxx c++11 string ABI and won't work with libc++ or other ABIs (though presumably could be modified to support those as well).
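For readers who want to see the general idea of keeping string bytes in arena memory, here is a minimal sketch. It is not the linked branch and not google::protobuf::Arena: `BumpArena`, `CopyString` and the example function are hypothetical, and instead of faking a std::string memory layout it simply hands back a `std::string_view` of the arena-owned copy, which sidesteps the ABI dependence mentioned above at the cost of a different return type.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <memory>
#include <string_view>
#include <vector>

// Hypothetical bump allocator for string bytes. Memory is released only when
// the arena itself is destroyed, so returned views stay valid until then.
class BumpArena {
 public:
  explicit BumpArena(std::size_t block_size = 4096) : block_size_(block_size) {}

  // Copies `s` into arena-owned memory and returns a view of that copy.
  // This is one copy, but no per-string heap allocation.
  std::string_view CopyString(std::string_view s) {
    if (s.empty()) return {};
    if (s.size() > remaining_) Grow(s.size());
    char* dst = cursor_;
    std::memcpy(dst, s.data(), s.size());
    cursor_ += s.size();
    remaining_ -= s.size();
    return std::string_view(dst, s.size());
  }

  std::size_t block_count() const { return blocks_.size(); }

 private:
  // Starts a new block large enough for `at_least` bytes. Any space left in
  // the previous block is abandoned, which keeps the sketch simple.
  void Grow(std::size_t at_least) {
    const std::size_t size = at_least > block_size_ ? at_least : block_size_;
    blocks_.push_back(std::make_unique<char[]>(size));
    cursor_ = blocks_.back().get();
    remaining_ = size;
  }

  std::size_t block_size_;
  std::vector<std::unique_ptr<char[]>> blocks_;
  char* cursor_ = nullptr;
  std::size_t remaining_ = 0;
};

// Usage sketch: two small strings end up back to back in the same block.
inline void BumpArenaExample() {
  BumpArena arena(16);
  const std::string_view a = arena.CopyString("key");
  const std::string_view b = arena.CopyString("material");
  assert(a == "key" && b == "material");
  assert(b.data() == a.data() + a.size());
  assert(arena.block_count() == 1);
}
```

A real implementation would also need to think about alignment, block reuse and, for the key-material use case raised earlier in the thread, which allocator backs the blocks.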

jeaye · 2021-01-11T18:45:31Z

This is a huge deal, especially on mobile. Now that absl::string_view exists, what is next for getting a zero-copy API?

chys87 · 2021-11-04T05:12:32Z

I'm very interested in this feature. I have a project where we embed long strings (several KiB) in protobuf messages. It would save significant CPU time if this feature were available.

danieljennings · 2022-03-25T00:53:49Z

Chiming in to say that we use Protobuf in virtually all of our projects here and would love to see this fixed, even if it required upgrading to C++17 (we're only on C++14 for the most part now.)

mayur-who · 2022-03-29T16:18:59Z

I would also love to see this implemented. We could then use protobuf for our data plane APIs as well.

troberti · 2022-06-16T11:54:38Z

We would also really like to see std::string_view support. It would make arenas actually useful.

fowles · 2022-06-19T22:45:46Z

We have a lot of long-term plans that will drive us towards this space; however, the migrations required make it slow going. Expect to see us start breaking ground over the next year.

GOGOYAO · 2023-07-07T06:49:39Z

Looking forward to this feature.

fowles · 2023-07-07T14:19:04Z

Support for absl::Cord landed in the spring; the next major step will come with editions, which have started to land on main. Once we have a release that fully supports editions (likely October or January), we plan to expose a mechanism for using absl::string_view as the API for strings. After that we can revisit this to see what is missing from fully realizing this request.

HamzaHajeir · 2023-08-21T12:26:58Z

This would be a very vital feature, really.

I just wrote a question in StackOverflow:
"I'm looking into adopting Protobuf in my embedded IoT framework, where a message can be received from network sources such as MQTT/HTTP/etc. and fed to the system.

I want to fully process the incoming data without copying it, so the intended use is to feed protobuf the starting address (std::uint8_t*) and size.

The intended output for array data (strings and raw data) would be std::string_view and std::span respectively, which would point into the received data."

For embedded systems, copying yields more heap fragmentation, and with large messages this becomes worse.

I really can't see a reason why this isn't already built, other than protobuf not requiring C++17 or later (though it could be an optional compile-time option).

neuliyiping · 2023-11-24T08:45:11Z

> Just in case anyone finds it useful, I did a little hacking on a branch that supports storing string buffers in arenas: toddlipcon@00cc310
>
> The above only supports it on the serialization side -- i.e. if you call proto_on_arena.set_foo(const std::string& bar) it will copy bar's contents on the Arena and make a std::string-compatible-memory-layout object to point to it. Note that it's also specific to libstdcxx c++11 string ABI and won't work with libc++ or other ABIs (though presumably could be modified to support those as well).

I tried this, but it does not work: the string buffers still end up on the heap.

github-actions · 2024-06-23T10:02:11Z

We triage inactive PRs and issues in order to make it easier to find active work. If this issue should remain active or becomes active again, please add a comment.

This issue is labeled inactive because the last activity was over 90 days ago.

AlexeySalmin · 2024-06-23T12:44:37Z

> We triage inactive PRs and issues in order to make it easier to find active work. If this issue should remain active or becomes active again, please add a comment.

This bugreport is going to school by now.

follesoe · 2024-09-13T13:00:54Z

Why tease us with the possibility of a zero-copy API, and then let it hang and dangle like this 😅

aagor · 2024-11-22T13:54:06Z

While the overall issue is for allowing to use strings without any copy, is there at least a possibility/plan to support storing string contents on an arena?
This would still require a copy, but it would save at least the dynamic memory allocation introduced by using std::string.

Even when using features.(pb.cpp).string_type = VIEW, a classic std::string is allocated currently.
What's missing to support that?

felipecrv · 2024-11-22T13:58:08Z

That would still require a custom alternative to std::string as the string type, but it would be a great (and safe) improvement over the status quo.

acozzette · 2024-11-22T17:08:19Z

We do plan to support VIEW strings on arenas, but we don't have a specific timeline for that yet.

xfxyjwf added c++ performance investigation labels Jul 29, 2016

haberman added the enhancement label Mar 16, 2017

haberman assigned xfxyjwf Mar 16, 2017

arthur-tacca mentioned this issue Jan 22, 2018

std::string not allocated in Arena memory #4202

Closed

dlg99 mentioned this issue Mar 14, 2018

Provide a zero-copy write protocol apache/bookkeeper#347

Open

FSMaxB mentioned this issue May 12, 2018

Switch to Googles C++ protocol buffers 1984not-GmbH/molch#117

Open

xfxyjwf assigned gerben-s and unassigned xfxyjwf Jul 2, 2018

xfxyjwf mentioned this issue Aug 12, 2018

Why should we use arena? #4327

Closed

google-admin unassigned gerben-s Aug 21, 2018

xfxyjwf assigned gerben-s Aug 22, 2018

osrf-migration mentioned this issue Apr 15, 2020

Consider flatbuffers gazebosim/gz-transport#23

Closed

chenliu0831 mentioned this issue Dec 13, 2023

The fundamental difference between PB and FB is what ? #3296

Closed

github-actions bot added the inactive Denotes the issue/PR has not seen activity in the last 90 days. label Jun 23, 2024

github-actions bot removed the inactive Denotes the issue/PR has not seen activity in the last 90 days. label Jun 24, 2024
