
I have the following piece of code:

public void ProcessRequest (HttpContext context) 
{
    context.Response.ContentType = "text/rtf; charset=UTF-8";
    context.Response.Charset = "UTF-8";
    context.Response.ContentEncoding = System.Text.Encoding.UTF8;
    context.Response.AddHeader("Content-disposition", "attachment;filename=lista_obecnosci.csv");
    context.Response.Write("ąęćżźńółĄŚŻŹĆŃŁÓĘ");
}

When I try to open the generated CSV file, I get the following behavior:

  • In Notepad2 - everything is fine.
  • In Word - a conversion wizard opens and asks to convert the text. It suggests UTF-8, which is more or less OK.
  • In Excel - I get a real mess. None of the Polish characters display correctly.

I wanted to write those special encoding-information characters in front of my string, i.e.

context.Response.Write((char)0xef);
context.Response.Write((char)0xbb);
context.Response.Write((char)0xbf);

but that does no good. The response stream treats those as normal characters and converts them to something different.
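(To see why this fails, here is a minimal standalone sketch: `(char)0xEF` is the *character* U+00EF, which the response's UTF-8 encoder re-encodes into two bytes, so the three "BOM characters" come out as six bytes instead of the raw sequence EF BB BF.)

```csharp
using System;
using System.Text;

class BomReencodingDemo
{
    static void Main()
    {
        // These are characters U+00EF, U+00BB, U+00BF -- not raw bytes.
        char[] attempted = { (char)0xEF, (char)0xBB, (char)0xBF };

        // UTF-8 encodes each of those characters as TWO bytes,
        // so 6 bytes are sent instead of the 3-byte BOM.
        byte[] reEncoded = Encoding.UTF8.GetBytes(attempted);
        Console.WriteLine(BitConverter.ToString(reEncoded)); // C3-AF-C2-BB-C2-BF
    }
}
```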

I'd appreciate help on this one.

4 Answers


I ran into the same problem, and this was my solution:

context.Response.BinaryWrite(System.Text.Encoding.UTF8.GetPreamble());
context.Response.Write("ąęćżźńółĄŚŻŹĆŃŁÓĘ");
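This works because `Encoding.UTF8.GetPreamble()` returns the BOM as raw bytes, and `BinaryWrite` sends bytes untouched, bypassing the response's text encoder. A quick standalone check of what the preamble contains:

```csharp
using System;
using System.Text;

class PreambleDemo
{
    static void Main()
    {
        // GetPreamble() returns the encoding's byte-order mark as raw bytes.
        // BinaryWrite sends them as-is, so they are never re-encoded.
        byte[] bom = Encoding.UTF8.GetPreamble();
        Console.WriteLine(BitConverter.ToString(bom)); // EF-BB-BF
    }
}
```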

3 Comments

I wonder, is this roughly the same as what happens in Alan Moore's answer?
This is the most useful answer, since every encoding and byte-order scheme has a different preamble, so it applies to other encodings too.
This was driving me nuts. I was sending Hebrew text in Unicode and could see it fine in Notepad and Notepad++, but Excel/Word/WordPad all showed gibberish. Your answer fixed it.

What you call "encoding-information" is actually a BOM. I suspect each of those "characters" is getting encoded separately. To write the BOM manually, you have to write it as three bytes, not three characters. I'm not familiar with the .NET I/O classes, but there should be a method available to you that takes a byte or byte[] parameter and writes them directly to the file.

By the way, the UTF-8 BOM is optional; in fact, its use is discouraged by the Unicode Consortium. If you don't have a specific reason for using it, save yourself some hassle and leave it out.

EDIT: I just remembered you can also write the actual BOM character, '\uFEFF', and let the encoder handle it:

context.Response.Write('\uFEFF');
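A small sketch confirming why this works: encoding the single character U+FEFF in UTF-8 produces exactly the BOM bytes, so writing `'\uFEFF'` through the encoder is equivalent to `BinaryWrite(GetPreamble())` here.

```csharp
using System;
using System.Text;

class FeffDemo
{
    static void Main()
    {
        // U+FEFF at the start of a stream *is* the BOM;
        // UTF-8 encodes it as the same three bytes GetPreamble() returns.
        byte[] fromChar = Encoding.UTF8.GetBytes("\uFEFF");
        byte[] preamble = Encoding.UTF8.GetPreamble();
        Console.WriteLine(BitConverter.ToString(fromChar)); // EF-BB-BF
        Console.WriteLine(BitConverter.ToString(preamble)); // EF-BB-BF
    }
}
```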

1 Comment

Thanks a lot! That's exactly what I've been looking for. The purpose of the .ashx handler is purely to generate an Excel-friendly list, and this does the trick!

I think the problem is with Excel itself, based on "Microsoft Excel mangles Diacritics in .csv files". To prove this, copy the sample output string ąęćżźńółĄŚŻŹĆŃŁÓĘ, paste it into a test file using your favorite editor, and save it as a UTF-8 encoded .csv file. Open it in Excel and you will see the same issues.



The answer from Alan Moore translated to VB:

Context.Response.Write(""c) ' the U+FEFF BOM character (invisible) sits between the quotes

2 Comments

Can you please explain this syntax? I'm not familiar with it.
It's to make sure that what's between the quotes is treated as a Char and not as a String. Please see stackoverflow.com/a/19522767/251674
