For an item called "some page", Sitecore automatically makes the URL "/some-page" but the page can also be reached by "/some page". The URLs are different, but point to the same Sitecore item.
Content authors can use both URLs in links on other pages, and for the current question, let's assume we can't change their behavior.
If both links are used, Lucene will add both to the search index, i.e. the same page is indexed twice. Both have the same "_id" value, so they are recognized as being the same item.
How can we make sure that Lucene does NOT add duplicate entries? How can we configure it to never store duplicate entries for the same "_id" value?