how to get parse tree using python nltk?

Question

Given the following sentence:

The old oak tree from India fell down.

How can I get the following parse tree representation of the sentence using python NLTK?

(ROOT (S (NP (NP (DT The) (JJ old) (NN oak) (NN tree)) (PP (IN from) (NP (NNP India)))) (VP (VBD fell) (PRT (RP down)))))

I need a complete example which I couldn't find in web!

Edit

I have gone through this book chapter to learn about parsing using NLTK but the problem is, I need a grammar to parse sentences or phrases which I do not have. I have found this stackoverflow post which also asked about grammar for parsing but there is no convincing answer there.

So, I am looking for a complete answer that can give me the parse tree given a sentence.

NLTK has several parsers (nltk.org/api/nltk.parse.html) IIRC I for preference use the Stanford one. — Lyndon White
can you give me an example code? i am really having tough time in cracking this. — Wasi Ahmad
I've got to run off, if no one post anything in the next 12 hours I'll come back and post something. It has been a while I'ld have to dig up some of my old code (and translate it from julia to python probably). — Lyndon White
@WasiAdmad, if you're actually "having a tough time", show your code so far and ask a question about the problem you encounter. — alexis

titipata titipata · Accepted Answer · 2017-02-19T03:55:01

Here is alternative solution using StanfordCoreNLP instead of nltk. There are few library that build on top of StanfordCoreNLP, I personally use pycorenlp to parse the sentence.

First you have to download stanford-corenlp-full folder where you have *.jar file inside. And run the server inside the folder (default port is 9000).

export CLASSPATH="`find . -name '*.jar'`"
java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer [port?] # run server

Then in Python, you can run the following in order to tag the sentence.

from pycorenlp import StanfordCoreNLP
nlp = StanfordCoreNLP('http://localhost:9000')

text = "The old oak tree from India fell down."

output = nlp.annotate(text, properties={
  'annotators': 'parse',
  'outputFormat': 'json'
})

print(output['sentences'][0]['parse']) # tagged output sentence

how to get parse tree using python nltk?

2 Answers