C++: How to add raw binary data into source with Visual Studio?

Question

I have a binary file that I want to embed directly into my source code, so that it is compiled into the .exe itself instead of being read from a file, and the data is already in memory when I launch the program.

How do I do this?

The only idea I had was to encode the binary data as base64, put it in a string variable and decode it back to raw binary at run time, but that is a clumsy method that causes pointless memory allocation. I would also like the data stored in the .exe to be as compact as the original data.

Edit: The reason I thought of using base64 was that I wanted to keep the source code files as small as possible too.

As long as you put this resource in a separate source file I offhand see no reason to have source size be part of the concern. Make it easy to use and obvious what's going on first, and let the compiler worry about reading a few extra characters.
– Mark B, Commented Apr 19, 2011 at 14:56
Well, it's just my preference really. Sure, it doesn't matter, but I like compact.
– Rookie, Commented Apr 19, 2011 at 15:01
Since you like compact: stackoverflow.com/a/52843063/6846474 I wrote a tool that compiles header files with a list of resource paths directly to object files or static libraries.
– user6846474, Commented Oct 16, 2018 at 19:56

James Kanze · Accepted Answer · 2011-04-19 14:39:51Z

11

The easiest and most portable way would be to write a small program which converts the data to a C++ source, then compile that and link it into your program. This generated file might look something like:

unsigned char rawData[] =
{
    0x12, 0x34, // ...
};

answered Apr 19, 2011 at 14:39

James Kanze

11 Comments

dubnde Over a year ago

I had to do this for firmware updates on a system that does not support file operations, and we just copied the raw data into an array as in this answer.

Rookie Over a year ago

What is the most compact way of doing this in my source code? I could save space by dropping the 0x prefix and using decimal values, but are there other ways? I have seen code like: Y\377\322\217^\377\321\227l\377\340\262\220\377 but I don't understand how that works, and it causes some compiler warnings for some reason, yet it works.

Ferruccio Over a year ago

@Rookie: the \nnn notation uses octal to specify the value of each character. \377 is the same as 0xff.

Rookie Over a year ago

Yes, but what do the weird letters do in that octal data? For example, there is Y and ^ and l etc. There are many odd characters in there and I don't understand the logic.

James Kanze Over a year ago

@Rookie Presumably, not all of the characters are octal escapes. Personally, I wouldn't worry too much about the size of the source code file; if you run into size problems, it will be because the total table is too big for the compiler, and that will be after tokenization, and won't depend on the size of the input file.
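
To make the mix of printable characters and octal escapes concrete, here is a small sketch that inspects the first five bytes of the string quoted in the comments. The names `sample` and `byteAt` are made up for the illustration, and the numeric values for `Y` and `^` assume an ASCII-compatible character set.

```cpp
#include <cstddef>

// First five bytes of the quoted data: 'Y', \377, \322, \217, '^'.
constexpr char sample[] = "Y\377\322\217^";

// Returns the byte at index i as an unsigned value, regardless of whether
// plain char is signed on this implementation.
constexpr unsigned byteAt(std::size_t i)
{
    return static_cast<unsigned char>(sample[i]);
}

// Five data bytes plus the terminating NUL added by the compiler.
static_assert(sizeof(sample) == 6, "5 bytes + NUL");

static_assert(byteAt(0) == 0x59, "'Y' is stored as-is (ASCII 0x59)");
static_assert(byteAt(1) == 0xff, "octal 377 == 0xff");
static_assert(byteAt(2) == 0xd2, "octal 322 == 0xd2");
static_assert(byteAt(3) == 0x8f, "octal 217 == 0x8f");
static_assert(byteAt(4) == 0x5e, "'^' is stored as-is (ASCII 0x5e)");
```

Each printable character costs one source character per byte, while an octal escape costs up to four, which is presumably why a generator would prefer the printable form where it can.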

Eugene · Accepted Answer · 2018-05-22 04:57:10Z

6

There are tools for this, a typical name is "bin2c". The first search result is this page.

You need to make a char array, and preferably also make it static const.

In C:

Some care might be needed since you can't have a char-typed literal, and also because generally the signedness of C's char datatype is up to the implementation.

You might want to use a format such as

static const unsigned char my_data[] = { (unsigned char) 0xfeu, (unsigned char) 0xabu, /* ... */ };

Note that each unsigned int literal is cast to unsigned char, and also the 'u' suffix that makes them unsigned.

Since this question was for C++, where you can have a char-typed literal, you might consider using a format such as this, instead:

static const char my_data[] = { '\xfe', '\xab', /* ... */ };

Since this is just an array of char, you could just as well use ordinary string literal syntax. Embedding zero-bytes should be fine, as long as you don't try to treat it as a string:

static const char my_data[] = "\xfe\xab ...";

This is the most compact solution. In fact, you could probably use that for C, too.
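
Two details are worth keeping in mind with the string-literal form, sketched below with made-up data: the compiler appends a terminating NUL that `sizeof` counts, and a `\x` escape keeps consuming hex digits, so a data byte that happens to be a hex-digit character directly after an escape has to go into a separate, adjacent literal. The name `my_data_size` is an assumption added for the example.

```cpp
#include <cstddef>

// Hypothetical data: 0xfe, 0x00, 0xab, followed by the letters 'c' and 'd'.
// Writing "\xabcd" would be parsed as ONE escape with the value 0xabcd,
// so the literal is split; adjacent literals are concatenated by the compiler.
static const char my_data[] = "\xfe\x00\xab" "cd";

// sizeof includes the terminating NUL, so subtract one for the payload size.
// The embedded zero byte does not shorten the array.
constexpr std::size_t my_data_size = sizeof(my_data) - 1;

static_assert(my_data_size == 5, "three escaped bytes plus 'c' and 'd'");
```

Because the length comes from `sizeof` and not from `strlen`, embedded zero bytes are preserved, which matches the remark above about not treating the array as a string.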

edited May 22, 2018 at 4:57

Eugene

answered Apr 19, 2011 at 14:43

unwind

3 Comments

Rookie Over a year ago

Does \xff equal 0xff, which equals 255? With the comma it takes the same space, but decimal values can also be 0,0,0,0 or 11,11,11,11, so they are 1 to 2 bytes shorter in some cases, whereas hex is always 4 bytes. I think I'll go with decimals, if those are all the options here.

Rookie Over a year ago

Could you also explain this data: Y\377\322\217^\377\321\227l\377\340\262\220\377? There are Y and ^ and l in there among the octal values; what is the logic behind those?

unwind Over a year ago

The point in avoiding a literal like 0 was (for me) to be type-clean; the type of 1 is int. I guess the compiler will typically do bounds-checking when initializing, so it should be safe, but still. I'm not sure where the data you quote in the second comment comes from, but probably the generator decided that the byte-value was representable as a printable character and used that for brevity.

Coder · Accepted Answer · 2011-04-19 14:47:02Z

4

You can use resource files (.rc). Sometimes they are bad, but for Windows-based applications that's the usual way.
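
As a rough sketch of the resource route, assume the .rc file contains a line such as `101 RCDATA "data.bin"` (the ID 101 and the file name are made up here). The bytes can then be located at run time with the Win32 resource functions. `ResourceView` and `loadRawResource` are hypothetical names; the non-Windows branch exists only so the sketch compiles elsewhere and always reports "not found".

```cpp
#include <cstddef>

#ifdef _WIN32
#include <windows.h>
#endif

// Non-owning view of the resource bytes; the memory belongs to the loaded
// module and must not be freed or written to.
struct ResourceView {
    const unsigned char* data;
    std::size_t size;
};

// Looks up an RCDATA resource by integer ID in the current executable.
// Returns {nullptr, 0} if the resource is missing (or when not on Windows).
inline ResourceView loadRawResource(int resourceId)
{
#ifdef _WIN32
    HMODULE module = GetModuleHandle(nullptr);
    HRSRC info = FindResource(module, MAKEINTRESOURCE(resourceId), RT_RCDATA);
    if (!info) return {nullptr, 0};

    HGLOBAL handle = LoadResource(module, info);
    if (!handle) return {nullptr, 0};

    const void* bytes = LockResource(handle);
    const DWORD size = SizeofResource(module, info);
    if (!bytes) return {nullptr, 0};

    return {static_cast<const unsigned char*>(bytes),
            static_cast<std::size_t>(size)};
#else
    (void)resourceId;  // no resource section outside Windows in this sketch
    return {nullptr, 0};
#endif
}
```

With this approach the data is stored unencoded in the executable's resource section, and the source tree only carries the one-line .rc entry plus the original binary file.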

answered Apr 19, 2011 at 14:47

Coder

Blindy · Accepted Answer · 2011-04-19 14:38:37Z

0

Why base64? Just store the file as it is in one char*.

answered Apr 19, 2011 at 14:38

Blindy

5 Comments

Rookie Over a year ago

I was thinking of using base64 because I also want to optimize the space used in my source code.

Blindy Over a year ago

@Rookie, how is tripling the amount of source code "optimizing" it?

Rookie Over a year ago

What do you mean, tripling? base64 packs the data better in the source code than using 0xff,0xff,0xff etc. See below:
orig: this is a testing text!!
base64: dGhpcyBpcyBhIHRlc3RpbmcgdGV4dCEh
hexstr: 7468697320697320612074657374696E6720746578742121
decarr: 116,104,105,115,32,105,115,32,97,32,116,101,115,116,105,110,103,32,116,101,120,116,33,33

Blindy Over a year ago

@Rookie, yes, but you don't have to escape printable ASCII characters. You can simply say const char *data="this is a testing text!!";

Rookie Over a year ago

That was just an example of how much space each encoding takes, where the original line shows the original data length in a form you can see. I can't paste binary data in here... read the title again.
