ddnet/scripts/unicode.py

import csv

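def confusables():
	"""Parse confusables.txt (Unicode security data) into a list of row dicts."""
	# 'utf-8-sig' strips a leading byte-order mark if the file has one
	with open('confusables.txt', encoding='utf-8-sig') as f:
		# Drop everything after '#'; comment-only lines become empty and csv skips them
		f = map(lambda line: line.split('#')[0], f)
		return list(csv.DictReader(f, fieldnames=['Value', 'Target', 'Category'], delimiter=';'))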

# Names for the 15 semicolon-separated columns of UnicodeData.txt, in file order
UNICODEDATA_FIELDS = (
	"Value",
	"Name",
	"General_Category",
	"Canonical_Combining_Class",
	"Bidi_Class",
	"Decomposition_Type",
	"Decomposition_Mapping",
	"Numeric_Type",
	"Numeric_Mapping",
	"Bidi_Mirrored",
	"Unicode_1_Name",
	"ISO_Comment",
	"Simple_Uppercase_Mapping",
	"Simple_Lowercase_Mapping",
	"Simple_Titlecase_Mapping",
)

def data():
	"""Parse UnicodeData.txt into a list of dicts keyed by UNICODEDATA_FIELDS."""
	with open('UnicodeData.txt', encoding='utf-8') as f:
		return list(csv.DictReader(f, fieldnames=UNICODEDATA_FIELDS, delimiter=';'))

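def unhex(s):
	"""Convert a hexadecimal code point string such as '0041' to an int."""
	return int(s, 16)

def unhex_sequence(s):
	"""Convert space-separated hex code points to a list of ints.

	Returns None if the field contains '<', e.g. a tagged decomposition like '<compat> 0020'.
	"""
	return [unhex(x) for x in s.split()] if '<' not in s else None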
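
# Usage sketch (not part of the original script): the helpers above can be
# combined to build a code point mapping from confusables-style rows. To keep
# it self-contained, the sketch parses an in-memory sample instead of opening
# confusables.txt and repeats the two small helpers. SAMPLE and
# parse_confusables are hypothetical names, and the sample lines are
# illustrative rather than copied from the real data file.

```python
import csv
import io

# Illustrative rows in the "value ; target ; category # comment" layout
SAMPLE = (
	"# comment-only line\n"
	"0430 ;\t0061 ;\tMA\t# illustrative single code point target\n"
	"00BC ;\t0031 2044 0034 ;\tMA\t# illustrative multi code point target\n"
)

def unhex(s):
	# int() ignores surrounding whitespace, so padded fields parse fine
	return int(s, 16)

def unhex_sequence(s):
	return [unhex(x) for x in s.split()] if '<' not in s else None

def parse_confusables(lines):
	# Same filtering as confusables(), but reading from any iterable of lines
	stripped = (line.split('#')[0] for line in lines)
	return list(csv.DictReader(stripped, fieldnames=['Value', 'Target', 'Category'], delimiter=';'))

rows = parse_confusables(io.StringIO(SAMPLE))

# Map each source code point to its list of target code points
mapping = {unhex(row['Value']): unhex_sequence(row['Target']) for row in rows}
```

# The comment-only line yields no row, so only the two data lines end up in
# the mapping.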