Implementing an Efficient Tuple with C++26

The world has seen many amateur implementations of std::tuple, and reinventing the wheel is probably a really effective way of learning: you can hardly claim to understand something if you can't explain how it works.

For decades, many inquisitive minds have wondered: how is std::tuple implemented? How can I implement my own tuple? [1]

Many answers have been given to these questions, and many articles have been written ([2]). However, I dare say that they all share one critical flaw: they consider essentially only one (and, at that, inefficient) way of implementing a tuple, namely multiple inheritance or recursive instantiation. That approach has many disadvantages of its own, the main one being inefficient use of memory.

Meanwhile, modern C++ lets you implement a tuple much more simply (without an abundance of boilerplate) and more efficiently.

Design notes

The cornerstone of this efficient implementation is a simple byte array (hereinafter referred to as storage) large enough to hold all members of the tuple. Of course, this obliges us to think about alignment: the members have different types with different alignment requirements, and we have to take that into account when placing them in the storage.
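To make the alignment requirement concrete, here is a minimal sketch of how member offsets inside such a byte array could be computed at compile time. The names align_up, layout, layout_in_order and raw_storage are hypothetical and introduced only for illustration. The sketch uses C++17 features rather than anything specific to C++26, and it is not necessarily how the implementation in this article is organized.

```cpp
#include <array>
#include <cstddef>

// Hypothetical helper: round `offset` up to the next multiple of `alignment`.
// Relies on the fact that object alignments are powers of two.
constexpr std::size_t align_up(std::size_t offset, std::size_t alignment) {
    return (offset + alignment - 1) & ~(alignment - 1);
}

// Hypothetical descriptor of where each member lives inside the storage.
template <std::size_t N>
struct layout {
    std::array<std::size_t, N> offsets{};  // byte offset of each member
    std::size_t size = 0;                  // total storage size in bytes
    std::size_t alignment = 1;             // strictest member alignment
};

// Place the members one after another, in the order given, inserting
// padding wherever the next member's alignment requires it.
template <typename... Ts>
constexpr layout<sizeof...(Ts)> layout_in_order() {
    layout<sizeof...(Ts)> result{};
    std::array<std::size_t, sizeof...(Ts)> sizes{sizeof(Ts)...};
    std::array<std::size_t, sizeof...(Ts)> aligns{alignof(Ts)...};
    std::size_t offset = 0;
    for (std::size_t i = 0; i < sizeof...(Ts); ++i) {
        offset = align_up(offset, aligns[i]);
        result.offsets[i] = offset;
        offset += sizes[i];
        if (aligns[i] > result.alignment) result.alignment = aligns[i];
    }
    // Round the total size up so that arrays of the storage stay aligned.
    result.size = align_up(offset, result.alignment);
    return result;
}

// The storage itself: a byte array aligned for the strictest member.
// An empty pack falls back to a single byte, since zero-length arrays
// are not valid C++.
template <typename... Ts>
struct raw_storage {
    static constexpr auto info = layout_in_order<Ts...>();
    alignas(info.alignment) std::byte bytes[info.size ? info.size : 1];
};
```

Constructing a member would then amount to a placement new at bytes + info.offsets[i], which is left out here to keep the sketch short.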

But how exactly should we place members to get maximum memory efficiency?

Let's assume we have the type tuple<char, double, int>. The naive placement in memory (see the figure and listing below), in which we simply arrange the members one after another in their original order (taking alignment into account), is clearly not suitable for us.
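A quick way to see the cost of the naive order on your own platform is to compare two plain structs with the same members. The struct names below are hypothetical, and the reordered variant is only one possible alternative picked for comparison, not necessarily the layout this article arrives at. The concrete numbers in the comments assume a typical 64-bit ABI, where double is 8 bytes with 8-byte alignment and int is 4 bytes with 4-byte alignment.

```cpp
#include <cstddef>

// Members in their original order: char, double, int.
// On a typical 64-bit ABI: c at offset 0, 7 bytes of padding, d at 8,
// i at 16, 4 bytes of tail padding, so sizeof is 24.
struct naive_order {
    char c;
    double d;
    int i;
};

// The same members sorted by descending alignment (an assumption made
// for comparison only). On a typical 64-bit ABI: d at 0, i at 8, c at 12,
// 3 bytes of tail padding, so sizeof is 16.
struct reordered {
    double d;
    int i;
    char c;
};

// Holds on mainstream ABIs; the exact sizes are implementation-defined.
static_assert(sizeof(reordered) <= sizeof(naive_order),
              "reordering should not make the struct larger");

// Bytes spent on padding rather than on data in each variant.
constexpr std::size_t payload = sizeof(char) + sizeof(double) + sizeof(int);
constexpr std::size_t naive_padding = sizeof(naive_order) - payload;
constexpr std::size_t reordered_padding = sizeof(reordered) - payload;
```

Under those assumptions the naive order spends 11 of its 24 bytes on padding, while the reordered variant spends 3 of 16.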