When I try to download the content of websites with special characters like ï, curl returns strange characters. For example, when I execute the following from the command line:

curl.exe --GET "http://en.wiktionary.org/wiki/ge%C3%AFnteresseerd"

the (partial) response is:

...wgPageName":"geïnteresseerd","wgTitle":"geïnteresseerd",....

The problem is that the ï comes out as garbled characters in my console instead of as ï. Is there any way to solve this issue? Do I need to re-encode the response?

  • This is due to handling of Unicode characters in a DOS prompt, see stackoverflow.com/questions/388490/… Commented Jan 14, 2015 at 10:50
  • @HansZ. Thanks, that pointed me in the right direction. Commented Jan 14, 2015 at 12:56
  • alrighty, added it as an answer then Commented Jan 14, 2015 at 13:22

1 Answer

This is due to the handling of Unicode characters in a DOS prompt; see "Unicode characters in Windows command line - how?". The response itself is UTF-8, but the console decodes those bytes with its legacy code page. You should be able to change this behavior with a command like chcp 65001, which sets the terminal up for UTF-8 handling.
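To illustrate why the code page matters, here is a minimal Python sketch that decodes the same UTF-8 bytes in three different ways. The code pages cp850 and cp1252 are only assumptions about what a Windows console might be using; the question does not say which one was active. The byte string comes from the percent-encoded page name in the question's URL.

```python
from urllib.parse import unquote_to_bytes

# Percent-encoded page name from the URL in the question.
# %C3%AF is the two-byte UTF-8 encoding of "ï".
raw = unquote_to_bytes("ge%C3%AFnteresseerd")

# Decoding with the encoding the server actually used gives the right text.
as_utf8 = raw.decode("utf-8")

# Decoding the same bytes with a legacy code page (assumed here: cp850 or
# cp1252) turns each of the two bytes into its own unrelated character.
as_cp850 = raw.decode("cp850")
as_cp1252 = raw.decode("cp1252")

for label, text in [("utf-8", as_utf8), ("cp850", as_cp850), ("cp1252", as_cp1252)]:
    print(f"{label:>6}: {text}")
```

The bytes are identical in all three cases, so the response does not need to be re-encoded; only the interpretation on the displaying side has to change.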
