Package python3-chardet

Python character encoding detector

https://github.com/chardet/chardet

Chardet: The Universal Character Encoding Detector

Detects over 70 character encodings including:

  • All major Unicode encodings (UTF-8, UTF-16, UTF-32)
  • Windows code pages (Windows-1250 through Windows-1258)
  • ISO-8859 family (ISO-8859-1 through ISO-8859-16)
  • CJK encodings (Big5, GB18030, EUC-JP, EUC-KR, Shift-JIS, and more)
  • Cyrillic encodings (KOI8-R, KOI8-U, IBM866, and more)
  • Mac encodings (MacRoman, MacCyrillic, and more)
  • DOS/OEM code pages (CP437, CP850, CP866, and more)
  • EBCDIC variants (CP037, CP500)

Version: 6.0.0.post1

General Commands

chardetect chardetect – takes one or more file paths and reports their detected encodings