Skip to content

Latest commit

 

History

History
350 lines (244 loc) · 13.5 KB

README.md

File metadata and controls

350 lines (244 loc) · 13.5 KB

Belle Views for C++

Belleviews - A library of C++ views that just works for all basic use cases as expected.

Belleviews are easier to use, more robust, and cause less unexpected compile-time and runtime errors than the views of the C++ standard library.

Motivation

C++20 introduced the ranges library which contain views. Views are lightweight ranges that refer to (a subset of) a range with optional transformation of the values.

Views are simple and great to use and the standard ranges library and its implementation are a tremendous piece of work.

However, for the standard views, some design decisions were made that are confusing and error-prone and cause severe problems because these decisions break a couple of basic idioms that collections (such as standard containers) usually provide.

In fact, C++ standard views have the following problems:

  • You might not be able to iterate over the elements of a standard view when the view is const.

    As a consequence:

    • Generic code for both containers and views has to declare the parameters as universal/forwarding references.
  • Concurrent iterations might cause undefined behavior (due to a data race).

    This kind of fatal runtime error is not obvious to see.

  • Read iterations might affect the functional behavior of later iterations.

  • Standard views do not propagate constness. This means that declaring the view const does not declare the elements to be const.

  • For some standard views declaring the elements as const does have no effect.

    You might be able to modify the members of a const element of a view.

  • Copying a view might create a view that has a different state and behavior than the source view.

  • cbegin() and cend() might not make elements const.

  • Type const_iterator is not available.

  • A standard view is not always a pure subset restricting or dealing with the elements of a range, but not providing options the range would not provide. They might remove constraints and provide options, the range itself does not have.

This makes the use of views error-prone, confusing and programmers might easily create fatal runtime errors without being aware of it.

There are reasons for this behavior:

  • The key problem is the decision that views might cache begin() so that iterating over elements is not stateless. The funny consequence is that due to the several consequences you should use standard views ad hoc, so that the optimization (which only helps for a second iteration) often doesn't pay off.
  • The designers of the views considered views as pointers instead of references to collections. However, views are no pointers (they do not provide operator * or ->). They only internally use pointers. The result is a huge confusion of basic rules like const propagation, which usually is expected to apply to collection types. (Don't get me wrong; you could and can always implement collection types that do not propagate constness; however, not everything you can do as an expert is useful for the mass.)

However, for the mass, these are too many compromises for not enough value. For the mass, we can do better.

Unfortunately, it is too late to fix the C++ standard views anymore. For this reason, as an alternative for the standard views, this library is implemented.

The belleviews library provides views that are simple to use and reduce the number of surprises causing compile-time errors and runtime errors. The key benefit is that with these views, programs are way more predictable in behavior and meet the expectations of ordinary programmers. This especially helps in generic code that may be used for both containers and views.

In some cases, the price may be a potentially worse performance. However, the library will provide workarounds (that help locally instead of compromising basic idioms of C++ and collections).

This library does not replace the ranges library. It only is an alternative for the views in the standard library. Usually, it should even be possible to combine both.

How to use Belle Views

So far, we have only a couple of header files, you have to include and use.

In your code, all you have to do usually is to use the namespace bel::views instead of std::views. For example:

  • Use bel::views::drop(3) instead of std::views::drop(3)

However, in some cases you might have or want to do more:

  • Use bel::views::sub(beg,end) instead of std::ranges::subrange{beg,end}

Key Examples

Belle views can always iterate over elements even if the views are const:

template<typename T>
void print(const T& coll);           // print elements of a const collection

std::vector vec{1, 2, 3, 4, 5, 6, 7, 8};
std::list lst{1, 2, 3, 4, 5, 6, 7, 8};

print(vec | std::views::drop(2));    // OK
print(vec | bel::views::drop(2));    // OK
print(lst | std::views::drop(2));    // ERROR with standard views
print(lst | bel::views::drop(2));    // OK with belle views

Belle views always propagate const (not only for views on temporary objects):

std::vector vec{1, 2, 3, 4, 5, 6, 7, 8};

const auto& vStd = vec | std::views::drop(2);
vStd[0] += 42;        // OOPS: modifies 1st element in vec

const auto& vBel = vec | bel::views::drop(2);
vBel[0] += 42;        // ERROR (good!)

auto vBel2 = vBel;    // NOTE: removes constness
vBel2[0] += 42;       // OK

Belle views do not cache so that they operate as expected when not used ad hoc:

std::vector vec{1, 2, 3, 4, 5};
auto greaterThan2 = [](auto v) { return v > 2; };

auto gt2Std = vec | std::views::filter(greaterThan2);  // standard view
printUniversal(gt2Std);                                // OK:  3 4 5

auto gt2Bel = vec | bel::views::filter(greaterThan2);  // belle view
print(gt2Bel);                                         // OK:  3 4 5

vec.insert(vec.begin(), {9, 0, -1});
print(vec);                                            // vec now: 9 0 -1 1 2 3 4 5

printUniversal(gt2Std);                                // OOPS:  -1 3 4 5  (due to earlier print call!)
print(gt2Bel);                                         // OK:  9 3 4 5

Note that without the first call of printUniversal() the second call would be correct. Thus, for standard views, even a pure reading iteration might change or even invalidate later behavior.

Caching also has the effect that a filter view has undefined behavior when used in an iteration that modifies the elements (which is in principle allowed):

std::vector<int> coll{1, 4, 7, 10};
auto isEven = [] (auto&& i) { return i % 2 == 0; };

// standard behavior when incrementing even elements twice:
auto collEvenStd = coll | std::views::filter(isEven);
print(coll);    // 1 4 7 10
for (int& i : collEvenStd) {
  i += 1;       // ERROR (undefined behavior)
}
print(coll);    // 1 5 7 11
for (int& i : collEvenStd) {
  i += 1;       // ERROR (undefined behavior)
}
print(coll);    // 1 6 7 11   (note the 6!)

The problem is that the modification is required not to break the predicate of the filter. Otherwise, the standard views have undefined behavior (and bad behavior as you can see).

Again, belleviews do not have this problem:

// belleviews behavior when incrementing even elements twice:
auto collEvenBel = coll | bel::views::filter(isEven);
print(coll);    // 1 4 7 10
for (int& i : collEvenBel) {
  i += 1;       // OK
}
print(coll);    // 1 5 7 11
for (int& i : collEvenBel) {
  i += 1;       // OK
}
print(coll);    // 1 5 7 11

Belle views support concurrent read iterations for all views:

std::list coll{1, 2, 3, 4, 5, 6, 7, 8};    // no random-access range

auto vStd = coll | std::views::drop(2);    // standard drop view
std::jthread t1{[&] {
  for (const auto& elem : vStd) {          // vStd.begin() in separate thread
    std::cout << elem << '\n';
  }}};
auto cnt1 = std::ranges::count_if(vStd,    // Runtime ERROR (data race)
                                  [](int i) {return i < 0;});  

auto vBel = coll | bel::views::drop(2);    // belle drop view
std::jthread t2{[&] {
  for (const auto& elem : vBel) {          // vBel.begin() in separate thread
    std::cout << elem << '\n';
  }}};
auto cnt2 = std::ranges::count_if(vBel,    // OK
                                  [](int i) {return i < 0;});  

Belle views are more consistent and complete. For example, you have consistent names of all view types and an adaptor/factory for all of them:

std::ranges::subrange s1{++coll.begin(), --coll.end()};    // strange name, no factory

auto s2 = bel::views::sub(++coll.begin(), --coll.end());   // factory with consistent name

For more examples, see in sources all the test code.

Design decisions

Goals:

  • Usability (simplicity, consistency, and a lot of use cases that standard views do (surprisingly) not support should just work)
  • Safety (a couple of non-intuitive cases for undefined behavior should no longer occur)
  • Performance (the library should still have good performance)
  • Predictability (common use cases should work as expected)

Principles:

  • Iterating over a view is stateless

    • You can always iterate when the view is const
    • You can iterate concurrently
    • Iterating does not have any impact on later behavior (except with modifications, of course)
  • Respect and honor constness

    • Views always propagate constness (is the view const, the elements are const)
    • Avoid using references that break the effect of declaring elements const
  • A copy of a view always behaves the same as its source

  • All view types have a name ending with view

    • Use sub_view instead of subrange
  • Consistent naming

    • For example, we have the view sub_view instead of subrange.
  • For all view types there is an adaptor/factory so that you never need to use view types directly

    • New factory bel::views::sub(beg,end)
  • Fix all specific flaws that create inconsistencies or confusion

    • E.g., allow elements_view to use any type with a tuple-like API, not just std::tuple and std::pair

Open:

  • Shall we still support implicit conversion to a ref_view?

Still:

  • Views might be borrowed or might be not
    • We could make more/all views borrowed if they don't own a range: But if they own a range they have to be borrowed. Therefore, we keep the standard policy here as it is.

Status

Available:

  • ref_view, owning_view, all(), and all_t
  • drop_view and drop()
  • take_view and take()
  • filter_view and filter()
  • transform_view and transform()
  • drop_while_view and drop_while()
  • sub_view (a.k.a. bel::subrange) with factory sub(beg,end)

ToDo

OPEN ISSUES (probably a lot but let's start with some):

  • const sentinel support (why does make_const_iterator() not work for sentinels?)

OPEN TOPICS:

  1. Support of counted, common, take_while
  2. Support of elements, keys, values
    • with a fix so that elements always works on tuple-like APIs
  3. Support of reverse
  4. Support of zip_view (fix the new C++23 const nightmare there)

Tests

See the programs test*.cpp in directory sources.

More

For more details on how to deal with C++20 views see

C++20 - The Complete Guide by Nicolai M. Josuttis

http://cppstd20.com

Acknowledgements

This library could not have been implemented without the tremendous work of the designers and the implementors of the ranges library, which became part of the C++ standard in C++20. Therefore, many many thanks for all who worked on the ranges library and its standardization.

The code of this library partially adopts code from the GNU ISO C++ Library. That code was of tremendous help to make this implementation possible and I want to thank all the developers of this library.

Finally, let me thank all additional contributors who made pull requests, did reviews, and gave additional feedback. In particular, I want to thank Hannes Hauswedell for significant help and contribution.

Feedback

I am happy about any constructive feedback. Beside pull requests, you can use the following email address for feedback (if you click on it to reply, you have to (replace "_AT_" by "@"): belleviews_AT_josuttis.com

License

Because this code uses modified versions of the GNU ISO C++ Library, the code is licensed according to the GNU General Public License, version 3.

This library is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 3, or (at your option) any later version.

This library is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

Under Section 7 of GPL version 3, you are granted additional permissions described in the GCC Runtime Library Exception, version 3.1, as published by the Free Software Foundation.

You should have received a copy of the GNU General Public License and a copy of the GCC Runtime Library Exception along with this program; see the file COPYING3 and COPYING.RUNTIME respectively. If not, see http://www.gnu.org/licenses/.

See the end of the file COPYING3 for additional details about how to apply these terms to your new programs.