How to convert a text file with a specific Codepage to UTF-8 using Visual Studio .NET

November 15, 2006 Comment on this post [6] Posted in Internationalization

About Scott

Scott Hanselman is a former professor, former Chief Architect in finance, now speaker, consultant, father, diabetic, and Microsoft employee. He is a failed stand-up comic, a cornrower, and a book author.

About Newsletter

Hosting By

Hosted on Linux using .NET in an Azure App Service

Comment on this post [6]

Share on BlueSky or use the Permalink and post anywhere!

November 15, 2006 21:15

Interesting timing, I just wrote a VB6 application last week that will convert a file from a codepage to utf8. It's just too bad that there is no real good way to determine what codepage the original file uses.

JohnnyNine

November 15, 2006 23:16

Or you could just swap Notepad2 for EmEditor, a much better editor in my opinion...

Daniel B

November 16, 2006 0:38

Yeah - Visual Studio is the all-embracing tool for solving problems around the world since the early 90s ;-)

I'm working as software engineer in the localization business - you probably can imagine how often I have to mess around with silly codepage problems :-/ I think there is just one tool with similar conversion capabilities like VS, which furthermore is widely available: MS Word. Sounds strange, but seems to be so.

ReneMT

November 16, 2006 5:37

nice to see some hebrew words in your blog
:)
Dror.

Dror Engel

November 16, 2006 10:21

I agree with Rene. In a previous life I was doing localization for Wizards of the Coast (Magic the Gathering) and we had grandiose plans of localizing ALL our content. Word does have the same capabilities, although IIRC it was nearly the same convoluted process as using VS.

Greg

November 16, 2006 10:42

Obscure UI design, indeed! I think you're shortchanging them with only 5 points. The 'drop-down' open and save buttons are an abomination. Even when I know about them and have used them before, that bit of functionality never sticks in my head. I mean, who ever really *looks* at an 'Open' or 'Save' button anymore - they're as automatic as 'OK' (I wonder when I'll get the drop-down 'Not-really,-but-if-you-insist' option on my 'OK' buttons?)

I actually purchased UltraEdit because I had a need to deal with .rc files in several codepages, and it was the only editor that made dealing with multiple encodings and/or codepages possible for me to figure out.

Note that I'm not by any means saying that it's the only editor that can deal with them - it's just I couldn't figure out how to easily do my work with other editors. UE made it pretty easy, though you do have to remember to not only change the codepage, but also the font and the font's script - UE always reminds me of that. This is one of the the only pestering, constant reminder dialogs that I actually appreciate. 'Cause, Lord knows that when I go to do that type of work again 8 months from now, I sure as Hell won't remember.

Oh well, UE's pretty cheap, and it's a pretty good editor to have in my toolbox anyway.

mikeb

Comments are closed.