Raw data (bytes) and signed/unsigned variables

Question

I've been told that whenever you work with bytes, you should declare your variables as unsigned chars. In Windows' data types, BYTE is declared as an unsigned char.

My questions:

Why?

Unsigned is a representation of integers from 0 to 255 and signed 128 to -127.

If that's the case, then how is EOF in binaries (-1) caught?

EOF is declared in stdio.h as a -1 #define macro.

EOF isn't a real character, it is just a return value with the meaning that no more input is available. — Theolodis
@Theolodis - EOF in stdio.h is declared as #ifndef EOF # define EOF (-1) #endif — Andy Carter
It's implementation-dependent if char is signed or unsigned. Also, EOF is an int and not a char. If you assign EOF to a char and compare it to the int value EOF they will not match, because the compiler will extend the char to 0x000000ff which is not the same as the int value 0xffffffff (using the normal two's complement value of -1). — Some programmer dude
Yep, but you would never parse it into a byte, as it is not in the range of a byte. And why unsigned char? because a byte has 8 bit, giving you the range of 0 to 255. — Theolodis
@AndyCarter Read latedev.wordpress.com/2012/12/04/all-about-eof — user395760

lrineau lrineau · Accepted Answer · 2014-05-06T11:05:59

When you read chars from a stream, the return type of functions like std::getc is int, and not char. The constant EOF is of type int, and not char or unsigned char.

Even in the C++ I/O API, the I/O streams like std::ifstream deal with types char_type (that is the type of characters in the stream), and int_type that is a type that can hold all values of char_type, plus EOF.

Raw data (bytes) and signed/unsigned variables

4 Answers