DEV Community

Discussion on: Unicode Normalization for NLP in Python

Collapse
 
arvindpdmn profile image
Arvind Padmanabhan

This topic is briefly covered in this article: devopedia.org/text-normalization
In particular, check out the 4 forms: NFD, NFC, NFKD and NFKC