DEV Community

Nagaraju


Remove punctuation from a Unicode string


I tried the following code to remove punctuation from a string:

import re

s = "string. With. Punctuation?"
# Delete every character that is neither a word character (\w) nor whitespace (\s)
s = re.sub(r'[^\w\s]', '', s)

This works fine for Latin-script text, but it seems to have a problem with text in other Unicode scripts such as Hindi or Telugu.

For example:

import re
s = "అనేది
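One likely cause, offered as a sketch rather than a confirmed diagnosis: in Python 3's `re` module, `\w` matches letters, digits, and the underscore, but not the combining marks (Unicode categories `Mn` and `Mc`) that Indic scripts use for vowel signs. The pattern `[^\w\s]` therefore deletes those marks along with the punctuation. A possible workaround is to filter on the Unicode general category with the standard-library `unicodedata` module. The helper name `strip_punctuation` and the sample strings below are made up for illustration and are not the original input.

```python
import unicodedata


def strip_punctuation(text: str) -> str:
    """Return text without characters whose Unicode category starts with 'P'.

    Hypothetical helper: letters, digits, whitespace, symbols, and combining
    marks (such as Indic vowel signs) are left untouched.
    """
    return "".join(
        ch for ch in text
        if not unicodedata.category(ch).startswith("P")
    )


# Illustrative inputs only (assumed, not taken from the original post).
print(strip_punctuation("string. With. Punctuation?"))  # string With Punctuation
print(strip_punctuation("नमस्ते, दुनिया!"))
```

Note that this is not a drop-in equivalent of the regex: it keeps symbols such as `$` or `+` (categories `Sc`, `Sm`), which `[^\w\s]` would delete, and it removes the underscore (category `Pc`), which `\w` would keep. If a regex is preferred, the third-party `regex` package supports the `\p{P}` property class, which is another option worth trying.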
