Posted 11 Mar 2008


Working with UTF-8 characters

Working with UTF-8 characters does not have to be difficult.


Sometimes we have to parse a string that contains both Unicode and ASCII characters. In my experience the Encoding functions in .NET did not give the result I expected in that situation, so I wrote two helper functions for it.


This article requires some patience with UTF-8 characters. Just kidding.

You should know the string format in .NET and how each character is represented as a 16-bit (2-byte) Unicode code unit.
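
To make the 16-bit point concrete, here is a minimal sketch that compares how many UTF-16 code units a string occupies in .NET with how many bytes it needs in UTF-8. The class name CharSizeDemo and its two methods are hypothetical and only serve this illustration; they are not part of the article's code.

```csharp
using System;
using System.Text;

// Hypothetical helper for illustration only.
public static class CharSizeDemo
{
    // Number of 16-bit UTF-16 code units (chars) in the string.
    public static int Utf16Units(string s)
    {
        return s.Length;
    }

    // Number of bytes the same string needs when encoded as UTF-8.
    public static int Utf8Bytes(string s)
    {
        return Encoding.UTF8.GetByteCount(s);
    }
}
```

For example, the single character "\u4E2D" is one char (two bytes) in a .NET string, but Utf8Bytes reports three bytes for it, while an ASCII letter such as "A" needs only one byte in UTF-8.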

Using the code

These two functions solve our problem of converting UTF characters to bytes and bytes back to UTF characters.
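        using System;
        using System.Text;

        // Place both methods inside a class of your choice.

        // Converts a byte array to a string one byte at a time.
        // Each byte above 127 becomes the char with the same numeric value;
        // multi-byte UTF-8 sequences are not combined into one character.
        public static string GetUTF8StringFrombytes(byte[] byteVal)
        {
            if (byteVal == null)
                return string.Empty;

            byte[] btOne = new byte[1];
            StringBuilder sb = new StringBuilder("");
            char uniChar;

            for (int i = 0; i < byteVal.Length; i++)
            {
                btOne[0] = byteVal[i];
                if (btOne[0] > 127)
                {
                    uniChar = Convert.ToChar(btOne[0]);
                    sb.Append(uniChar);
                }
                else
                {
                    // Plain ASCII byte: decode it directly.
                    sb.Append(Encoding.UTF8.GetString(btOne));
                }
            }
            return sb.ToString();
        }

        // Converts a string to bytes one char at a time.
        // Chars 128..255 are written as a single raw byte; every other char
        // is encoded with Encoding.UTF8. Surrogate pairs are not handled.
        public static byte[] GetBytesFromUTF8Chars(string strVal)
        {
            // The original "!= string.Empty || != null" test was always true.
            if (string.IsNullOrEmpty(strVal))
                return new byte[0];

            // One UTF-16 char can need up to 3 bytes in UTF-8.
            byte[] btArr = new byte[strVal.Length * 3];
            byte[] tempArr;
            int arrIndex = 0;

            for (int i = 0; i < strVal.Length; i++)
            {
                // Compare the char itself; casting to byte first would
                // truncate values above 255.
                char ch = strVal[i];
                if (ch > 127 && ch < 256)
                {
                    btArr[arrIndex] = (byte)ch;
                    arrIndex++;
                }
                else
                {
                    tempArr = Encoding.UTF8.GetBytes(ch.ToString());
                    Array.Copy(tempArr, 0, btArr, arrIndex, tempArr.Length);
                    arrIndex += tempArr.Length;
                }
            }

            // Trim the buffer to the number of bytes actually written.
            byte[] retVal = new byte[arrIndex];
            Array.Copy(btArr, 0, retVal, 0, arrIndex);
            return retVal;
        }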
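
For comparison, here is a minimal sketch of the same kind of round trip done with the framework's own Encoding.UTF8, assuming the byte array holds well-formed UTF-8. The class name Utf8RoundTrip and its method names are hypothetical and are not the article's functions.

```csharp
using System;
using System.Text;

// Hypothetical helper for comparison only.
public static class Utf8RoundTrip
{
    // Encodes a string that mixes ASCII and non-ASCII characters as UTF-8.
    public static byte[] ToBytes(string text)
    {
        return Encoding.UTF8.GetBytes(text ?? string.Empty);
    }

    // Decodes UTF-8 bytes; multi-byte sequences become single characters.
    public static string FromBytes(byte[] bytes)
    {
        if (bytes == null || bytes.Length == 0)
            return string.Empty;
        return Encoding.UTF8.GetString(bytes);
    }
}
```

With the input "abc \u4E2D\u6587", ToBytes produces 10 bytes (4 for the ASCII part, 3 for each Chinese character), and FromBytes on those bytes gives back the original string.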


Points of Interest

I am sharing a solution I found while dealing with this problem, so that others do not have to face it.


Suggestions for improvement are welcome.


This article, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Written By
Software Developer
India
Chirag Patel is a Programmer Analyst at a well-known IT company and has worked on .NET technologies for the last 2 years. He is interested in pure business logic and fuzzy logic. His areas of interest are C#.NET, VB.NET and MSSQL 2005.


Comments and Discussions

Incorrect
Jonathan [Darka], 11-Mar-08 8:28

Re: Incorrect
JustChiragPatel, 11-Mar-08 17:26

Don't use that
TheDarkMan, 11-Mar-08 4:34

Re: Don't use that
JustChiragPatel, 11-Mar-08 17:30
Yes, the code looks the way you describe, but have you tried converting a byte array of Unicode characters, such as Chinese or Japanese text, into the actual Unicode string?

Just try the conversion and compare the result with the actual string.

At first I used the System.Text.Encoding class, but I did not get the actual string back. That is why I wrote these two functions, which give me the correct output.

Don't mind, but I have experienced this problem myself; that is why I am telling you.
If you have any idea for solving this, you are welcome to share it.

Bye :-D

Chirag Patel
Programmer Analyst
