I would like to extract sentences (not just HTML-stripped text) from web pages. Is this functionality supported by popular HTML parsing libraries such as Jsoup?
Thanks,
Edit:
Sorry if the post was not clear. I need natural-language sentences, which are not necessarily delimited by a period.
Thanks, everyone. I just found this library, http://alias-i.com/lingpipe/demos/tutorial/sentences/read-me.html, and it seems to be exactly what I want.
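For anyone who lands here and would rather not add a dependency, below is a minimal sketch of the general idea. It is not LingPipe's method. It assumes you already have the page's plain text (for example from Jsoup's `Jsoup.parse(html).text()`) and splits it with the JDK's `java.text.BreakIterator`. The class name `SentenceSplitter`, the `split` method, and the sample text are made up for illustration.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, named for illustration only.
final class SentenceSplitter {

    // Splits plain text into sentences with the JDK's rule-based sentence iterator.
    static List<String> split(String text, Locale locale) {
        List<String> sentences = new ArrayList<>();
        BreakIterator it = BreakIterator.getSentenceInstance(locale);
        it.setText(text);
        int start = it.first();
        // Walk consecutive boundaries; each [start, end) span is one candidate sentence.
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = text.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }

    public static void main(String[] args) {
        // Assumption: in real use this string would be the text extracted from the HTML.
        String text = "Hello world. How are you? Fine!";
        for (String sentence : split(text, Locale.ENGLISH)) {
            System.out.println(sentence);
        }
    }
}
```

BreakIterator is rule-based, so it will probably mis-split around abbreviations and in text with no terminal punctuation. A trained sentence model such as the one in the library linked above is likely the better fit for those cases.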