How does an XML Validator know where to find the schema instance declared in an xml document in order to parse and use the xsd?

Question

I do not understand how an XML validator (a "schema-aware processor", as the W3C refers to it) knows where to find the schema instance in a typical external reference to an XSD from within an XML document.

Here's a typical declaration:

<root xmlns="www.example.org"
      xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
      xsi:schemaLocation="www.example.org" "http://example.org/schemas/schema1.xsd">
  <foo>some data</foo>
</root>

We declare the default namespace for the root element and all its children to be "www.example.org".
We bind the prefix "xsi" to the namespace "http://www.w3.org/2001/XMLSchema-instance".
If I am understanding correctly (which is evidently not the case!), it is the information within the actual resource the xsi namespace refers to that lets the validator know that schemaLocation (on the following line) is a legitimate attribute of the xsi ("http://www.w3.org/2001/XMLSchema-instance") namespace itself.

But a namespace is not a location (URI), so how does the parser know where to go to determine whether schemaLocation is in fact an attribute defined in the "http://www.w3.org/2001/XMLSchema-instance" namespace?

For "w3.org/2001/XMLSchema-instance" the answer is simple: namespace == schemaLocation :)) So in the case of xsi, you are not right! (See: stackoverflow.com/q/17094247/592355.) You can also verify this by downloading the document from that URL. — xerx593
...and your syntax is invalid: it should be xsi:schemaLocation="www.example.org http://example.org/schemas/schema1.xsd", not xsi:schemaLocation="www.example.org" "http://example.org/schemas/schema1.xsd". — xerx593
Also possible/valid: xsi:schemaLocation="www.example.org http://example.org/schemas/schema1.xsd http://www.w3.org/2001/XMLSchema-instance http://www.w3.org/2001/XMLSchema-instance" ;) — xerx593
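
To make the corrected syntax concrete: the xsi:schemaLocation value is a single whitespace-separated string of namespace/location pairs, not two quoted strings. Here is a minimal sketch in Python (standard library only) that splits the value into pairs. The helper name schema_location_pairs and the DOC sample are made up for illustration, and a real validator does far more than this.

```python
import xml.etree.ElementTree as ET

XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"

def schema_location_pairs(xml_text):
    """Return (namespace, location hint) pairs from the root's xsi:schemaLocation.

    Hypothetical helper for illustration; it does not fetch or validate anything.
    """
    root = ET.fromstring(xml_text)
    # ElementTree exposes namespaced attributes as "{namespace-uri}local-name".
    value = root.get("{%s}schemaLocation" % XSI_NS, "")
    tokens = value.split()  # the attribute value is a whitespace-separated list
    if len(tokens) % 2 != 0:
        raise ValueError("xsi:schemaLocation must hold namespace/location pairs")
    # Even positions are namespace names, odd positions are location hints.
    return list(zip(tokens[0::2], tokens[1::2]))

# The question's document, with the attribute written in the corrected form.
DOC = """<root xmlns="www.example.org"
      xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
      xsi:schemaLocation="www.example.org http://example.org/schemas/schema1.xsd">
  <foo>some data</foo>
</root>"""

print(schema_location_pairs(DOC))
# [('www.example.org', 'http://example.org/schemas/schema1.xsd')]
```

The first item of each pair is the namespace name and the second is only a hint about where a schema document for that namespace might be found.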

Alohci · Accepted Answer · 2019-01-27T01:30:49

The validator has the schema for that namespace built in, so it never needs to fetch anything to recognize xsi:schemaLocation. Section 2.7 of the XML Schema Definition Language: Structures spec, "Schema-Related Markup in Documents Being Validated", says:

XML Schema Definition Language: Structures defines several attributes for direct use in any XML documents. These attributes are in the schema instance namespace (http://www.w3.org/2001/XMLSchema-instance) described in The Schema Instance Namespace (xsi) (§1.3.1.2) above. All schema processors must have appropriate attribute declarations for these attributes built in, see Attribute Declaration for the 'type' attribute (§3.2.7.1), Attribute Declaration for the 'nil' attribute (§3.2.7.2), Attribute Declaration for the 'schemaLocation' attribute (§3.2.7.3) and Attribute Declaration for the 'noNamespaceSchemaLocation' attribute (§3.2.7.4).
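
As a rough illustration of what "built in" means here, the sketch below hard-codes the four attribute names listed in the quote and checks a document against that table without any network access. This is only an assumption-laden toy in Python's standard library: the helper unknown_xsi_attributes and the SAMPLE document are hypothetical, and an actual schema processor carries full attribute declarations (types and all), not just a set of names.

```python
import xml.etree.ElementTree as ET

XSI_NS = "http://www.w3.org/2001/XMLSchema-instance"

# The four attributes the quoted spec text says processors must have built in.
BUILT_IN_XSI_ATTRIBUTES = {"type", "nil", "schemaLocation", "noNamespaceSchemaLocation"}

def unknown_xsi_attributes(xml_text):
    """List xsi-namespace attribute names that are not in the built-in table.

    Hypothetical helper: the namespace URI is used purely as an identifier
    and is never dereferenced.
    """
    prefix = "{%s}" % XSI_NS
    unknown = []
    for element in ET.fromstring(xml_text).iter():
        for name in element.attrib:
            # Attribute names arrive as "{namespace-uri}local-name".
            if name.startswith(prefix):
                local = name[len(prefix):]
                if local not in BUILT_IN_XSI_ATTRIBUTES:
                    unknown.append(local)
    return unknown

# Made-up sample: xsi:nil is built in, xsi:bogus is not.
SAMPLE = """<root xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <foo xsi:nil="true"/>
  <bar xsi:bogus="1"/>
</root>"""

print(unknown_xsi_attributes(SAMPLE))  # ['bogus']
```

The point of the sketch is that the lookup is against a table the processor already holds, keyed by namespace name plus local name.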
