I am trying to build a semantic web application about running races in my area (10k races, half marathons, marathons). More specifically, I want to collect and publish (as RDF) data about races, participants and results, and merge similar data. Over the last few weeks I have been testing Jena (including TDB), scraping static web sites about running races, and reading about vocabularies and ontologies.
I think the most reputable ontology for my application, and also the one with the lowest barrier to entry (it is not over-engineered), is the BBC sports ontology: http://www.bbc.co.uk/ontologies/sport/2011-02-17.shtml
I have a few questions about using the BBC sports ontology and building the application:
Is it okay to use the BBC sports ontology even if I only use a small subset of it?
I wanted to look at the schema for the ontology to understand it better, but I can't find it anywhere. Is the BBC keeping it private, or have I just been looking in the wrong places?
Is there any way to know for sure that I am using the ontology correctly? English is not my native language, so I am afraid I might misunderstand some of the concepts in the ontology.
When I add new triples to my (TDB) triple store, what is the convention for minting a new URI for a resource? More specifically, should the URI end with a human-readable name or a unique ID? Will this choice affect merging similar data from different data sets?
Can you recommend any semantic web tools for making a resource URI dereferenceable? I'm not putting the application on the web anytime soon, but it would still be nice if I could access the URIs locally, for instance
http://localhost/running/12345.
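To make the URI question above concrete, here is a minimal sketch of the two options I am weighing, in plain Java with no Jena calls. The base `http://localhost/running/` comes from the example above; the class name `UriMinting`, the `race` path segment, the race name and the source key `site-a:42` are all made up for illustration.

```java
import java.nio.charset.StandardCharsets;
import java.util.Locale;
import java.util.UUID;

public class UriMinting {
    // Assumed base, taken from the localhost example; not a published namespace.
    static final String BASE = "http://localhost/running/";

    // Option 1: the URI ends with a readable name (slug) derived from a label.
    static String nameBasedUri(String kind, String label) {
        String slug = label.toLowerCase(Locale.ROOT)
                .replaceAll("[^a-z0-9]+", "-")   // collapse anything else into hyphens
                .replaceAll("^-+|-+$", "");      // trim leading/trailing hyphens
        return BASE + kind + "/" + slug;
    }

    // Option 2: the URI ends with an opaque ID. Here the ID is a name-based UUID
    // computed from a (hypothetical) key in the source data, so the same key
    // always yields the same URI.
    static String idBasedUri(String kind, String sourceKey) {
        UUID id = UUID.nameUUIDFromBytes(sourceKey.getBytes(StandardCharsets.UTF_8));
        return BASE + kind + "/" + id;
    }

    public static void main(String[] args) {
        System.out.println(nameBasedUri("race", "Example City 10k 2012"));
        System.out.println(idBasedUri("race", "site-a:42"));
    }
}
```

In this sketch, option 1 gives different URIs whenever two data sets spell the same race name differently, while option 2 gives the same URI only when the same source key is reused, which is why I am unsure how either choice interacts with merging.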