I've been wondering about extracted text's encoding using IFilter
.
IFilter::GetText()
retrieves WCHAR*
, but what if the file is encoded with ASCII? What about other Unicode encoding (such as UTF-8 or UTF-16?)?
As I see it, it's either IFilter taking care of converting the extracted text to a single encoding (if it is the case - what is this encoding?), and if not, how do I know which encoding is it?