Arguments for and against supporting std::wstring exclusively in cross-platform library

Question

I'm currently developing a cross-platform C++ library which I intend to be Unicode aware. I currently have compile-time support for either std::string or std::wstring via typedefs and macros. The disadvantage with this approach is that it forces you to use macros like L("string") and to make heavy use of templates based on character type.

What are the arguments for and against to support std::wstring only?

Would using std::wstring exclusively hinder the GNU/Linux user base, where UTF-8 encoding is preferred?

I quite like Python 3's approach - the new str class is unicode, and there's a new bytes class to hold sequences of bytes, and provide string-like manipulation (substring search and so on). But they can only be interpreted as text by conversion with an encoding. So, if someone is planning, "data that only contains 7-bit values", they can save memory by using "bytes", but their objects are not compatible with proper strings. The awkward issue I see with this in C++ is the same one you already have with wstring, that you have to convert literals, and for calls to functions like fopen. — Steve Jessop

David Feurle David Feurle · Accepted Answer · 2010-09-06T12:44:20

A lot of people would want to use unicode with UTF-8 (std::string) and not UCS-2 (std::wstring). UTF-8 is the standard encoding on a lot of linux distributions and databases - so not supporting it would be a huge disadvantage. On Linux every call to a function in your library with a string as argument would require the user to convert a (native) UTF-8 string to std::wstring.

On gcc/linux each character of a std::wstring will have 4 bytes while it will have 2 bytes on Windows. This can lead to strange effects when reading or writing files (and copying them from/to different platforms). I would rather recomend UTF-8/std::string for a cross platform project.

Arguments for and against supporting std::wstring exclusively in cross-platform library

5 Answers