Yes, you are correct. utf8 seems to conserve accents.
However, if I open the file with Notepad in Win32 and save it, it is growing by 3 bytes. Since I have a checksum on the file this is causing my program to flag a corrupt file.
I created the text files on linux with featherpad, used unix2dos to add the CRLF and then did my checksum.
I need to find a stable format.