Bit Array in C++

Question

When working with Project Euler problems I often need large (> 10**7) bit array's.

My normal approach is one of:

bool* sieve = new bool[N];

bool sieve[N];

When N = 1,000,000 my program uses 1 MegaByte (8 * 1,000,000 bits).

Is there a more efficient way to use store bit arrays than bool in c++?

i implement sieve's algorithm using vectors.. it can hold that many numbers. — Vaibhav
– Vaibhav, Commented Sep 27, 2010 at 18:07

Community · Accepted Answer · 2020-06-20 09:12:55Z

34

Use std::bitset (if N is a constant) otherwise use std::vector<bool> as others have mentioned (but dont forget reading this excellent article by Herb Sutter)

A bitset is a special container class that is designed to store bits (elements with only two possible values: 0 or 1, true or false, ...).

The class is very similar to a regular array, but optimizing for space allocation: each element occupies only one bit (which is eight times less than the smallest elemental type in C++: char).

EDIT:

Herb Sutter (in that article) mentions that

The reason std::vector< bool > is nonconforming is that it pulls tricks under the covers in an attempt to optimize for space: Instead of storing a full char or int for every bool[1] (taking up at least 8 times the space, on platforms with 8-bit chars), it packs the bools and stores them as individual bits(inside, say, chars) in its internal representation.

std::vector < bool > forces a specific optimization on all users by enshrining it in the standard. That's not a good idea; different users have different requirements, and now all users of vector must pay the performance penalty even if they don't want or need the space savings.

EDIT 2:

And if you have used Boost you can use boost::dynamic_bitset(if N is known at runtime)

edited Jun 20, 2020 at 9:12

CommunityBot

11 silver badge

answered Sep 27, 2010 at 17:59

Prasoon Saurav

93.2k51 gold badges245 silver badges348 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Gergely Máté Over a year ago

Sadly the link to the Sutter article is dead by 24-11-2021.

vonbrand Over a year ago

@GergelyMáté, it is available today (2023-01-05).

GManNickG · Accepted Answer · 2010-09-27 18:01:09Z

15

For better or for worse, std::vector<bool> will use bits instead of bool's, to save space. So just use std::vector like you should have been in the first place.

If N is a constant, you can use std::bitset.

answered Sep 27, 2010 at 18:01

GManNickG

506k55 gold badges505 silver badges551 bronze badges

1 Comment

qwr Over a year ago

No, using bits is implementation defined. en.cppreference.com/w/cpp/container/vector_bool

Jerry Coffin · Accepted Answer · 2010-09-27 18:09:30Z

You could look up std::bitset and std::vector<bool>. The latter is often recommended against, because despite the vector in the name, it doesn't really act like a vector of any other kind of object, and in fact doesn't meet the requirements for a container in general. Nonetheless, it can be pretty useful.

OTOH, nothing is going to (at least dependably) store 1 million bool values in less than 1 million bits. It simply can't be done with any certainty. If your bit sets contain a degree of redundancy, there are various compression schemes that might be effective (e.g., LZ*, Huffman, arithmetic) but without some knowledge of the contents, it's impossible to say they would be for certain. Either of these will, however, normally store each bool/bit in only one bit of storage (plus a little overhead for bookkeeping -- but that's usually a constant, and on the order of bytes to tens of bytes at most).

Stephen Rauch · Accepted Answer · 2017-02-13 02:14:58Z

4

A 'bool' type isn't stored using only 1 bit. From your comment about the size, it seems to use 1 entire byte for each bool.

A C like way of doing this would be:

uint8_t sieve[N/8]; //array of N/8 bytes

element of array is:

result = sieve[index / 8] || (1 << (index % 8));

or

result = sieve[index >> 3] || (1 << (index & 7));

set 1 in array:

sieve[index >> 3] |= 1 << (index & 7);

edited Feb 13, 2017 at 2:14

Stephen Rauch♦

50.1k32 gold badges118 silver badges143 bronze badges

answered Feb 13, 2017 at 1:56

Vladimir

411 bronze badge

Comments

whooops · Accepted Answer · 2010-09-27 18:04:07Z

3

A 'bool' type isn't stored using only 1 bit. From your comment about the size, it seems to use 1 entire byte for each bool.

A C like way of doing this would be:

uint8_t sieve[N/8]; //array of N/8 bytes

and then logical OR bytes together to get all your bits:

sieve[0] = 0x01 | 0x02; //this would turn on the first two bits

In that example, 0x01 and 0x02 are hexadecimal numbers that represent bytes.

answered Sep 27, 2010 at 18:04

whooops

9351 gold badge6 silver badges11 bronze badges

1 Comment

Manohar Reddy Poreddy Over a year ago

upvoted, good idea, may be, it's N/8+1, it's bit packed array.

chesslover · Accepted Answer · 2014-07-30 07:19:45Z

2

You might be interested in trying the BITSCAN library as an alternative. Recently an extension has been proposed for sparseness, which I am not sure is your case, but might be.

answered Jul 30, 2014 at 7:19

chesslover

3573 silver badges6 bronze badges

Comments

driis · Accepted Answer · 2010-09-27 17:59:59Z

1

You can use a byte array and index into that. Index n would be in byte index n/8, bit # n%8. (In case std::bitset is not available for some reason).

answered Sep 27, 2010 at 17:59

driis

165k46 gold badges272 silver badges345 bronze badges

Comments

Eugen Constantin Dinca · Accepted Answer · 2010-09-27 18:08:19Z

0

If N is known at compile time, use std::bitset, otherwise use boost::dynamic_bitset.

answered Sep 27, 2010 at 18:08

Eugen Constantin Dinca

9,1702 gold badges36 silver badges51 bronze badges

Collectives™ on Stack Overflow

Bit Array in C++

8 Answers 8

2 Comments

1 Comment

Comments

Comments

1 Comment

Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

8 Answers 8

2 Comments

1 Comment

Comments

Comments

1 Comment

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related