I wanted to index pdf, word documents using solr. The whole content of the word / pdf document is coming in the search response and also the highlighted fragment. The content is quite long and I wanted avoid it in the search response because of the content length.
Is it possible to get only the highlighted fragment of the content field ?
Here is the search query
http://localhost:8080/solr4x/collection1/select?q=Scripting&wt=xml&hl=true&hl.fl=content
Here is the schema
<field name="content" type="text_general" indexed="false" stored="true"multiValued="true"/>
<field name="text" type="text_general" indexed="true" stored="false" multiValued="true"/>
<copyField source="content" dest="text"/>
I am using solr 4.3