How can corenlp (Python wrapper) handle non-ASCII strings?

Question

I use Dustin Smith's Python wrapper for the Stanford CoreNLP tools, v3.4.1.

I want to get the lemma of the word 'Víctor', which contains a non-ASCII character. But when I call corenlp.parse('Víctor'), it raises this error:

UnicodeDecodeError: 'ascii' codec can't decode byte 0xc3 in position 1: ordinal not in range(128).

How can I change the corenlp settings so that it can handle non-ASCII strings?

nana nana · Accepted Answer · 2016-03-03T19:44:32

0 votes

Use unidecode to transliterate the non-ASCII string to ASCII before passing it to corenlp.
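A minimal sketch of that approach, assuming the third-party unidecode package is installed (pip install unidecode). The helper name to_ascii and the standard-library fallback via unicodedata are illustrative additions, not part of the wrapper or of the original answer. Note that transliteration drops the accent, so the parser would see 'Victor' rather than 'Víctor'.

```python
# -*- coding: utf-8 -*-
import unicodedata

try:
    # Third-party package; assumed to be installed.
    from unidecode import unidecode
except ImportError:
    unidecode = None


def to_ascii(text):
    """Return an ASCII-only approximation of text (hypothetical helper)."""
    # Byte strings are assumed to be UTF-8 encoded.
    if isinstance(text, bytes):
        text = text.decode("utf-8")
    if unidecode is not None:
        return unidecode(text)
    # Fallback: split accented letters into base letter + combining mark,
    # then drop everything that is not ASCII.
    decomposed = unicodedata.normalize("NFKD", text)
    return decomposed.encode("ascii", "ignore").decode("ascii")


ascii_name = to_ascii(u"Víctor")
# ascii_name can now be passed to the wrapper, e.g. corenlp.parse(ascii_name)
```

The fallback only strips combining marks, so it handles accented Latin letters but discards characters that have no ASCII base form; unidecode covers a wider range of scripts.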
