What Unicode symbols are accepted in Python 3 variable names?

Question

I want to use a larger variety of Unicode symbols for variable names in my Python 3 scripts. What characters are acceptable to use in Python 3 variable names?

I recently started using Unicode symbols (such as Greek and Asian symbols) for code obfuscation.

Just out of curiosity, why? Is 元亀 better than genki as a variable name?
– Fredrik Pihl, Commented Jun 11, 2013 at 12:22
That sounds like something you should cover with naming conventions, unless you can guarantee you'll never have a maintainer or contributor who doesn't understand one of the languages you use.
– thegrinner, Commented Jun 11, 2013 at 12:32
I know that using odd symbols is not customary, but if we keep programming traditionally, then we keep getting traditional programs. We need to think outside the box.
– Devyn Collier Johnson, Commented Jun 11, 2013 at 12:50
@DevynCollierJohnson There are other ways to break traditions which don't affect readability.
– glglgl, Commented Jun 25, 2013 at 7:39

Tim Pietzcker · Accepted Answer · 2020-05-17 16:27:28Z

27

According to PEP 3131, the first character of an identifier must belong to XID_Start and every following character to XID_Continue. Both sets are derived from ID_Start and ID_Continue, which the PEP defines as follows:

ID_Start is defined as all characters having one of the general categories uppercase letters (Lu), lowercase letters (Ll), titlecase letters (Lt), modifier letters (Lm), other letters (Lo), letter numbers (Nl), the underscore, and characters carrying the Other_ID_Start property. XID_Start then closes this set under normalization, by removing all characters whose NFKC normalization is not of the form ID_Start ID_Continue* anymore.

ID_Continue is defined as all characters in ID_Start, plus nonspacing marks (Mn), spacing combining marks (Mc), decimal number (Nd), connector punctuations (Pc), and characters carrying the Other_ID_Continue property. Again, XID_Continue closes this set under NFKC-normalization; it also adds U+00B7 to support Catalan.

That's a long list (currently around 120,000 characters). Fortunately, there is a helpful project on GitHub that contains the list and a script to generate it.
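For checking individual characters without consulting the full list, the built-in `str.isidentifier()` applies the same rules as the parser. The following is a minimal sketch; the helper names `is_usable_name`, `can_start`, `can_continue`, and `describe` are illustrative and do not come from PEP 3131.

```python
import keyword
import unicodedata


def is_usable_name(name: str) -> bool:
    """True if `name` is a valid identifier and not a reserved keyword."""
    return name.isidentifier() and not keyword.iskeyword(name)


def can_start(ch: str) -> bool:
    """True if the single character `ch` may begin an identifier."""
    return ch.isidentifier()


def can_continue(ch: str) -> bool:
    """True if `ch` may follow the first character of an identifier."""
    # Prefix a character that is always a valid start, then test the pair.
    return ("_" + ch).isidentifier()


def describe(ch: str) -> str:
    """Summarize the general category and identifier status of `ch`."""
    return (
        f"U+{ord(ch):04X} category={unicodedata.category(ch)} "
        f"start={can_start(ch)} continue={can_continue(ch)}"
    )


# Letters (Ll, Lo) and the underscore may start a name; a digit (Nd) may
# only continue one; symbols (So) and punctuation (Po) are rejected.
for ch in ["π", "元", "_", "7", "\U0001F349", "%"]:
    print(describe(ch))

# Identifiers are NFKC-normalized while parsing, so the "fi" ligature
# (U+FB01) in the source ends up as the two plain letters "fi".
namespace = {}
exec("\ufb01le = 1", namespace)
print("file" in namespace)
```

The watermelon emoji (U+1F349) has general category So, which is in neither set, and that is why it is rejected.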

edited May 17, 2020 at 16:27

answered Jun 11, 2013 at 12:22

Tim Pietzcker


10 Comments

Devyn Collier Johnson Over a year ago

Where can I find the list of symbols that match \w?

Fred Foo Over a year ago

PEP 3131 refers to this table: dcl.hpi.uni-potsdam.de/home/loewis/table-3131.html

Cristóbal Ganter Over a year ago

Why are useful characters, like 🍉 (watermelon), not included?

grepe Over a year ago

It is really frustrating that we can use Glagolitic characters or Viking runes to start our variable names, yet we cannot use pretty common symbols that you can type on most mobile devices and input on most computers. I get that we shouldn't start variable names with numbers or math symbols that have a special meaning, but I think emojis would make damn good variable names in plenty of cases.

Tim Pietzcker Over a year ago

@VickiB: That's impossible because % is the modulo operator and thus can't be part of a variable name, just like you can't use + or -.


fmi2012 (edited by khelwood) · Answer · 2025-02-27 09:27:49Z

0

Cyrillic letters are allowed, but I don't know whether they would work on every machine.

I wrote a short script to demonstrate Unicode support for Cyrillic.

If it prints "Всем привет!" to the console, then your computer supports Cyrillic identifiers.

# Cyrillic test

# This program tests whether your computer
# handles Cyrillic text correctly


привет = "Всем привет!"

def скажи_привет(мой_привет):
    print(мой_привет)

скажи_привет(привет)
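Whether Cyrillic identifiers parse is decided by the language, not by the machine; what can differ between machines is whether the console encoding is able to display the string. The sketch below separates the two checks. The variable names `parsed` and `printable` and the `<cyrillic-test>` label are illustrative assumptions, not part of the script above.

```python
import sys

# Compile the assignment from a str, so no file encoding is involved.
source = 'привет = "Всем привет!"\n'
code = compile(source, "<cyrillic-test>", "exec")

namespace = {}
exec(code, namespace)
parsed = "привет" in namespace

# Check separately whether the console encoding can represent the text.
encoding = sys.stdout.encoding or "ascii"
try:
    namespace["привет"].encode(encoding)
    printable = True
except (UnicodeEncodeError, LookupError):
    printable = False

print(f"identifier parsed: {parsed}")
print(f"console encoding {encoding!r} can show the text: {printable}")
```

If the second line reports False, the identifiers still work; only the printed output would fail or be garbled.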

edited Feb 27 at 9:27

khelwood


answered Jan 9 at 10:36

fmi2012


1 Comment

bfontaine Jan 9 at 11:34

Why wouldn’t it work? See the accepted answer from 2013: stackoverflow.com/a/17043983/735926
