C++ Reading File into Char Array

Question

I am using following code to read a file into chararcter array. Now, for small file (say for 2 MB) it is executing properly but for large file (140 MB), in my 18 GB UBUNTU server it is giving segmentation fault. Can anybody help me how to solve this ? I think 18 GB is enough to hold a 240 MB file into memory. I am using 64 bit UBUNTU and compiling using g++.

ifstream is;

char chararray [fileSize] ;

is.read(chararray, fileSize) ;

@milleniumbug: I'd think fileSize is something like a size_t with value 240*1024*1024... — Mr.C64
– Mr.C64, Commented Nov 20, 2012 at 10:20
Replace your second line of code with char* chararray = new char[fileSize];. — Good Night Nerd Pride
– Good Night Nerd Pride, Commented Nov 20, 2012 at 10:32
@Abbondanza No need to introduce raw array pointers for this, that's a C-minded solution. They are (a) easy to leak and (b) offer the opportunity to forget to delete[] instead of delete. Vectors are better...and as of C++11 you can even return big ones from functions without paying the cost of copying it, thanks to the copy elision that happens during return value optimization. — HostileFork says dont trust SE
– HostileFork says dont trust SE, Commented Nov 20, 2012 at 11:19
@Abbondanza ...and don't forget exception-safety! A vector--being a class with a destructor--can clean up after itself. A raw pointer will just leak! :-/ — HostileFork says dont trust SE
– HostileFork says dont trust SE, Commented Nov 20, 2012 at 11:42

Some programmer dude · Accepted Answer · 2012-11-20 10:18:16Z

5

If the array is a local variable you will get a stack overflow, as it will not fit on the stack. Allocate the "array" on the heap instead, either directly using new or indirectly by using std::vector.

Or use memory mapping. See the mmap function.

answered Nov 20, 2012 at 10:18

Some programmer dude

411k36 gold badges420 silver badges655 bronze badges

Sign up to request clarification or add additional context in comments.

8 Comments

James Kanze Over a year ago

std::vector is probably preferable, but since he said it was a Unix variant, ulimit -s 500000, invoked before he runs his program, should work.

Richard Over a year ago

ulimit run from the command line is an ugly way of non-portably solving a problem which can, in this case, be just as easily managed in the code itself.

James Kanze Over a year ago

@Richard As I said, using std::vector is probably preferable. But there's nothing wrong about knowing about ulimit as well.

HostileFork says dont trust SE Over a year ago

@peterph Yes, memory mapping is a good thing...although if one is programming in C++ then using something wrapped in a class and which is platform-independent is more desirable than POSIX mmap. It's just that if a question is tagged C++ then I think one needs to push back against methods that suggest invoking new[] (and, of course, malloc...)

James Kanze Over a year ago

@HostileFork If you're programming in C++, you'll wrap mmap in a class, with munmap in the destructor. And probably give the class an interface that is as close to that of std::vector or std::array as possible, so that users feel at home with it. Because of the work involved, however, I would just read into an std::vector until I knew I needed the performance (but if I'd needed it once, I'd keep the class in my toolkit, and then use it immediately).

|

Mr.C64 · Accepted Answer · 2012-11-20 10:17:01Z

2

Instead of allocating the char array on the stack, I'd try using std::vector, which will allocate dynamically on the heap:

std::vector<char> buffer(fileSize);
is.read(&buffer[0], fileSize);

answered Nov 20, 2012 at 10:17

Mr.C64

43.3k15 gold badges97 silver badges170 bronze badges

Comments

Praveen Vinny · Accepted Answer · 2012-11-21 19:06:21Z

The GCC compiler has the default command called size for this! Compile the program using the GCC Compiler. Then you can get the file size!

gcc -Wall test.c
size

This is for a normal C program! Since you had specified no parameter, it takes ./a.out as its default parameter!

If you have to apply some optimization, the code will become like as follows..

praveenvinny@ubuntu:~/Project/New$> gcc -Wall -o1 -fauto-inc-dec test.c -o Output
praveenvinny@ubuntu:~/Project/New$> size output
text       data     bss     dec     hex filename
1067       256      8       1331    533 output

Use text section for code size. You can use data and bss, if you want to consider global data size as well.

This will print the code size,

time -f "%e" -o Output.log ./a.out

will print the execution time to the log file called as Output.log

Collectives™ on Stack Overflow

C++ Reading File into Char Array

3 Answers 3

8 Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

8 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related