Questions tagged [character-encoding]

Questions that deal with various representations of characters & character sets, such as: ASCII, UTF-8, EBCDIC, among others. Often encountered when moving files between operating systems that encode new lines with carriage returns and/or newline characters.

2 votes
1 answer
105 views

At work, on Ubuntu 22.04.1, I want to use the apt_auth.conf ability of apt to make it easier to get packages from an Artifactory. I wrote my artifactory.conf file in /etc/apt/apt.conf.d this way: ...
Marc Le Bihan
3 votes
2 answers
182 views

The Issue: I've been parsing a file with sed, trying to tweeze out the desired data. This has worked fine for most lines in the file, but there appear to be some embedded special characters that are ...
Gandalf
  • 33
2 votes
1 answer
112 views

Today I connected to a long-running process in tmux over ssh for work, to find that the pane the process was running in seems to have started using the wrong character encoding for its output, leading ...
Patronics
  • 125
0 votes
1 answer
86 views

It is my understanding that the LANG and LC_CTYPE environment variables define the encoding used by shell commands when writing to stdout. However, after executing LANG=de_DE.iso88591 LC_CTYPE=de_DE....
userAcgJllhSe
0 votes
0 answers
109 views

Looking for advanced CLI tool/code to determine text Codepage/Language (besides enca). Goal: Automate as much as possible conversion of hundreds/thousands of 8-bit text files (including non-ASCII ...
strider
  • 113
-2 votes
1 answer
88 views

Wrong encoding: 1 00:01:27,879 --> 00:01:31,216 No i dupa. Koniec z darmowym wi-fi. 2 00:01:33,009 --> 00:01:34,972 - Ki-jung! - No? 3 00:01:35,219 --> 00:01:39,183 Kobieta z góry ...
jirafey
1 vote
1 answer
165 views

I'd like to use my old VT420 terminal as a system console. Adding RS232 ports and setting up serial-getty are not a problem, but: For years, almost all Linux distros have been using UTF-8 as the ...
Neppomuk
  • 364
0 votes
1 answer
161 views

Sorry if this is a repeat or basic question, but it is hard to search for a ™. I'm writing a script to remove weird characters from file names. How come the trademark symbol ™ matches [^a-z]? $ ...
codywohlers
4 votes
2 answers
1k views

Here is my simple problem: how can I convert half-width to full-width from the command line? I thought this would be built into my iconv command, but I did not find anything here: $ iconv -l | ...
malat
  • 3,469
0 votes
0 answers
587 views

I use Debian SID, and Terminator is my terminal emulator. After updating the system the last time (yesterday, 2023/11/22) and rebooting, some characters in my terminal in certain commands are ...
rhuanpk
  • 413
0 votes
1 answer
203 views

We have the default POSIX locale on our server, but when non-ASCII characters like רקטות לגוש דן וירושלים (Hebrew) are uploaded to the server, they get changed to רק××ת ×××ש ×× ××ר×ש×××. How can we preserve it ...
Amrita
  • 1
6 votes
1 answer
423 views

I am working on Debian and derivative systems. I'd like to convert an original input from ISO-IR-87 to UTF-8. Is there an easy way to do it? For reference: % iconv -l | grep "IR-8" ISO-IR-8-...
malat
  • 3,469
-1 votes
1 answer
386 views

I'm not an expert in Linux, but I am following the development of software that runs on Linux Buildroot. The device can only use the program for the graphical interface, access the shell, or connect ...
porrokynoa
4 votes
0 answers
677 views

According to this hint and similar advice, I am using the --iconv option in rsync (version 3.2.7) to sync files with umlauts (ä ü ö ...) to my Synology NAS. However, the --iconv option does not work as ...
Haegar
  • 41
4 votes
4 answers
549 views

Context (skip, if you don't care; read, if you suspect I'm totally on the wrong track) For an embedded system with small memory, I want to generate fonts which contain only those glyphs actually ...
Philippos
  • 13.8k
1 vote
1 answer
112 views

The ascii command in Linux is fast and great. It allows us to search for a character or for a code point and returns all relevant results for a given search. Is there something similar for extended ASCII (...
demacj
  • 13
3 votes
1 answer
318 views

I was working on a keymap script (mapping keys from one language's keyboard layout to another). After a lot of time spent trying to get everything working, I found out that different characters are ...
Andrew15_5
0 votes
1 answer
117 views

I tried to recover lost files from an exFAT thumb drive with the testdisk package on Linux. It was very good at finding deleted files. However, as I went through the entries, I saw weird entries. The ...
ero47543
0 votes
0 answers
160 views

I got some files containing Finnish text with mixed encoding, something one would get by (echo Mäntysalo ; echo Mäntysalo | recode utf-8..iso-8859-1) > problem.txt. Is there a "right" way ...
Jori Mäntysalo
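
A minimal sketch of one possible heuristic for the mixed-encoding file described in the excerpt above. It assumes that each line is encoded entirely in either UTF-8 or ISO-8859-1, as the quoted command would produce; the function name `decode_mixed` and the `fallback` parameter are hypothetical, and this is not presented as the "right" way or as the accepted answer.

```python
def decode_mixed(data: bytes, fallback: str = "iso-8859-1") -> str:
    """Decode line by line: try UTF-8 first, fall back to a single-byte encoding.

    Assumption: every line is wholly in one encoding. A line that mixes both
    encodings will be decoded with the fallback and may still look mangled.
    """
    decoded = []
    # bytes.splitlines() only splits on ASCII line boundaries (\n, \r, \r\n),
    # so bytes >= 0x80 never cause an accidental split.
    for raw in data.splitlines(keepends=True):
        try:
            decoded.append(raw.decode("utf-8"))
        except UnicodeDecodeError:
            # Not valid UTF-8, so treat the line as the fallback encoding.
            decoded.append(raw.decode(fallback))
    return "".join(decoded)


# Rebuild the two-line sample from the excerpt: one UTF-8 line, one Latin-1 line.
sample = "Mäntysalo\n".encode("utf-8") + "Mäntysalo\n".encode("iso-8859-1")
result = decode_mixed(sample)
print(result, end="")
```

Trying UTF-8 first is a deliberate choice: text containing non-ASCII ISO-8859-1 characters is rarely also valid UTF-8, whereas any byte sequence decodes without error as ISO-8859-1, so the reverse order would never fall through.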
0 votes
1 answer
728 views

I have 2 programs: x - prompts user for input from stdin. binary - prints something to stdout, the stuff it prints is made up of various raw binary bytes which are not fully supported by my terminals ...
sirix
  • 1
2 votes
1 answer
130 views

My mangled Czech text: NOTE ON CZECH BIRTH NUMBER VALIDATION IN CZECH LANGUAGE; in Czechia birth number = personal identification number ======================================================== Do ...
Vlastimil Burián
0 votes
1 answer
43 views

Simple. I have the file "longname.server" on a remote PC and want to copy it to my PC, but I don't remember the name because it is long, so I use tab completion. \rsync -avP remote:^[\\\[0\\\;...
elbarna
  • 14.3k
1 vote
1 answer
71 views

When I type a long command on a command-line interface, something strange may happen in the layout. The characters I typed don't show in lines correctly. Instead, they merge into 1 line or overwrite ...
1 vote
0 answers
98 views

I recently found GNU recode as something that can be used to decode HTML entities. However, when looking at a piece of malware, I noticed that it appears to use mixed HTML character/entity encoding, such ...
4oo4
  • 153
1 vote
1 answer
3k views

When typing German umlauts (ä,ö,ü) into the terminal (I am using st on Arch Linux, $XTERM is st-256color), it displays only <ffffffff>. Locale seems to be set properly. Output of locale is ...
l2poca
  • 13
