Jos De Roo

A Canonical-Expansion Theorem for Basic N3 Entailment

Draft status. Candidate text for a publication section.
Scope. Basic N3 entailment only. This does not cover the additional semantics of log: built-ins such as log:implies.

Abstract

We give a syntactic characterisation of basic entailment for finite, closed, normalised abstract N3 graphs. The result is an N3 analogue of the RDF simple-entailment interpolation lemma. Because N3 includes explicit universal variables and quoted graph terms, the appropriate object is not the premise graph itself but its canonical expansion: the union of all ground universal instances of the premise, with fresh witnesses for existential variables. Basic N3 entailment is then equivalent to containment of every universal instance of the conclusion, after a suitable existential instantiation, in this canonical expansion.

Background and notation

The N3 semantics defines an abstract graph as a triple:

G = (U, E, F)

where:

U is the set of universally scoped variables;
E is the set of existentially scoped variables;
F is a finite set of N3 triples.

N3 terms include IRIs, literals, variables, lists, and graph terms. Ground graph terms are compared modulo the N3 graph-term isomorphism relation.
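As an informal illustration, the abstract graph (U, E, F) and its term language can be modelled with a few immutable Python records. This is only a sketch: the names Var, GraphTerm and Graph are hypothetical, IRIs and literals are collapsed into plain strings, N3 lists are Python tuples, and graph-term isomorphism is not modelled (equality here is structural).

```python
from dataclasses import dataclass
from typing import FrozenSet, Tuple, Union


@dataclass(frozen=True)
class Var:
    """A variable; its scope is fixed by the graph that declares it."""
    name: str


@dataclass(frozen=True)
class GraphTerm:
    """A quoted graph used as a term."""
    graph: "Graph"


# Assumption: str stands for an IRI or literal, tuple for an N3 list.
Term = Union[str, Var, tuple, GraphTerm]
Triple = Tuple[Term, Term, Term]


@dataclass(frozen=True)
class Graph:
    """Abstract graph G = (U, E, F)."""
    universals: FrozenSet[Var]
    existentials: FrozenSet[Var]
    triples: FrozenSet[Triple]


x = Var("x")
quoted = GraphTerm(Graph(frozenset(), frozenset(),
                         frozenset({(":s", ":p", ":o")})))
g = Graph(frozenset({x}), frozenset(), frozenset({(x, ":says", quoted)}))
```

Frozen records are used so that graphs and graph terms are hashable and can themselves occur inside triples.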

The semantics of variables uses assignments that provide both:

a denotation in the semantic domain; and
a ground term representation.

This second component is essential because variables may occur freely inside quoted graph terms. The semantic operation of total application grounds such occurrences recursively, while respecting variable scope inside nested graph terms.

Throughout this section, entailment means basic N3 entailment.

Normalisation assumptions

Let

G = (U_G, E_G, F_G) and H = (U_H, E_H, F_H)

be finite, closed, normalised abstract N3 graphs.

We assume:

variables of G and H have been renamed apart;
free variables, if any occur in a concrete syntax presentation, have first been added to the appropriate universal-variable set;
graph terms are considered modulo graph-term isomorphism;
equality of triples containing graph terms is taken modulo that isomorphism.

These assumptions are standard hygiene conditions and do not affect entailment.

Ground terms with witnesses

Let

GT*

be the set of ground N3 terms over the language, extended with a countably infinite supply of fresh witness names.

These witness names play the same role as labelled nulls in canonical-model constructions.

Ground instances

For a graph

K = (U_K, E_K, F_K)

and a map

μ : U_K ∪ E_K → GT*

write

μ[K]

for the ground instance of K obtained by applying μ to F_K.

The application is total:

direct occurrences of variables in U_K ∪ E_K are replaced by their μ-images;
lists are instantiated componentwise;
graph terms are instantiated recursively;
variables scoped inside nested graph terms are not replaced by an outer substitution.

Equivalently, if ⟨L⟩ is a graph term, then its instance is:

⟨ μ^t(L) ⟩

where μ^t denotes total application.
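The four clauses of total application can be sketched as a recursive function. The representation is an assumption made for illustration: Var and GraphTerm are hypothetical names, a graph term carries the set of variables it scopes itself, N3 lists are tuples, and IRIs and literals are strings.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Var:
    name: str


@dataclass(frozen=True)
class GraphTerm:
    bound: frozenset    # variables scoped by this nested graph term
    triples: frozenset  # triples (s, p, o) of the quoted graph


def apply_total(mu, term):
    """Apply the substitution mu (dict from Var to ground term) to a term."""
    if isinstance(term, Var):
        return mu.get(term, term)          # direct occurrence
    if isinstance(term, tuple):            # N3 list: componentwise
        return tuple(apply_total(mu, t) for t in term)
    if isinstance(term, GraphTerm):        # graph term: recurse, respecting scope
        inner = {v: t for v, t in mu.items() if v not in term.bound}
        return GraphTerm(term.bound, frozenset(
            tuple(apply_total(inner, t) for t in triple)
            for triple in term.triples))
    return term                            # IRI or literal


def instance(mu, triples):
    """Instance of a set of triples under mu."""
    return frozenset(tuple(apply_total(mu, t) for t in triple)
                     for triple in triples)


x, y = Var("x"), Var("y")
quoted = GraphTerm(frozenset({y}), frozenset({(x, ":p", y)}))
result = instance({x: ":a", y: ":b"},
                  {(x, ":says", quoted), ((x, y), ":q", ":c")})
```

In this example the outer y inside the list is replaced by ":b", while the y scoped by the nested graph term is left untouched and only the free x inside the quotation is grounded.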

Canonical expansion

For every assignment

σ : U_G → GT*

choose fresh witness names

w_e,σ

for every

e ∈ E_G.

Define

σ̂ = σ ∪ { e ↦ w_e,σ | e ∈ E_G }.

The canonical expansion of G, written C(G), is:

C(G) = ⋃_{σ : U_G → GT*} σ̂[F_G].

Membership in C(G) is taken modulo graph-term isomorphism.

Intuitively, C(G) contains every ground universal instance of G, with fresh witnesses for the existential variables of each such instance.
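Since GT* is infinite, C(G) is in general infinite as well. The sketch below computes only a finite fragment, by restricting σ to a finite universe of ground terms chosen by the caller. The function name canonical_fragment, the encoding of variables as strings beginning with "?", and the encoding of the witness w_e,σ as a tagged tuple are assumptions made for this illustration; lists and graph terms are omitted.

```python
from itertools import product


def canonical_fragment(universals, existentials, triples, universe):
    """Union of the instances of `triples` for every sigma: universals -> universe."""
    universals = sorted(universals)
    expansion = set()
    for values in product(universe, repeat=len(universals)):
        hat = dict(zip(universals, values))          # sigma
        for e in existentials:
            # Fresh witness w_{e,sigma}, named after e and the chosen sigma.
            hat[e] = ("witness", e, values)
        for s, p, o in triples:
            expansion.add((hat.get(s, s), hat.get(p, p), hat.get(o, o)))
    return expansion


fragment = canonical_fragment({"?x"}, {"?y"},
                              {("?x", ":knows", "?y")},
                              [":a", ":b"])
```

Each σ receives its own witness for ?y, so the two triples in the fragment have different objects.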

Theorem

Canonical expansion characterisation of basic N3 entailment. Let

G = (U_G, E_G, F_G) and H = (U_H, E_H, F_H)

be finite, closed, normalised abstract N3 graphs. Then

G ⊨ H

if and only if, for every assignment

α : U_H → GT*

there exists an assignment

β : E_H → GT*

such that

(α ∪ β)[F_H] ⊆ C(G).

In words: G basic-entails H exactly when every universal instance of H has an existential instance contained in the canonical expansion of G.
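The shape of the condition (for every α there is a β) can be made concrete with a brute-force check over a finite universe. This is a bounded illustration only, not a decision procedure for basic N3 entailment: α and σ range over the supplied universe rather than over all of GT*, terms are flat strings, and the helper names ground, canonical_fragment and entails_bounded are hypothetical.

```python
from itertools import product


def ground(triples, mu):
    """Apply the substitution mu to a set of flat triples."""
    return {tuple(mu.get(t, t) for t in triple) for triple in triples}


def canonical_fragment(U, E, F, universe):
    """Finite fragment of C(G), with one witness per (e, sigma)."""
    U = sorted(U)
    out = set()
    for values in product(universe, repeat=len(U)):
        hat = dict(zip(U, values))
        hat.update({e: ("witness", e, values) for e in E})
        out |= ground(F, hat)
    return out


def entails_bounded(G, H, universe):
    """For every alpha over `universe`, look for a beta placing H inside C(G)."""
    (UG, EG, FG), (UH, EH, FH) = G, H
    expansion = canonical_fragment(UG, EG, FG, universe)
    # beta may pick any term occurring in the fragment, including witnesses.
    candidates = sorted({t for tr in expansion for t in tr} | set(universe),
                        key=repr)
    UH, EH = sorted(UH), sorted(EH)
    for a_vals in product(universe, repeat=len(UH)):
        alpha = dict(zip(UH, a_vals))
        if not any(
            ground(FH, {**alpha, **dict(zip(EH, b_vals))}) <= expansion
            for b_vals in product(candidates, repeat=len(EH))
        ):
            return False
    return True


G = (set(), {"?y"}, {(":a", ":knows", "?y")})
H_exists = (set(), {"?z"}, {(":a", ":knows", "?z")})
H_ground = (set(), set(), {(":a", ":knows", ":b")})
universe = [":a", ":b"]
print(entails_bounded(G, H_exists, universe))
print(entails_bounded(G, H_ground, universe))
```

In the first call β can map ?z to the witness for ?y, so the check succeeds; in the second the ground object ":b" is not the witness, so the inclusion fails.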

Proof

Soundness

Assume that for every assignment

α : U_H → GT*

there exists

β : E_H → GT*

such that

(α ∪ β)[F_H] ⊆ C(G).

We prove that

G ⊨ H.

Let I be an arbitrary basic interpretation such that

I ⊨ G.

We must show that

I ⊨ H.

Let A be an arbitrary assignment for the universal variables U_H. For each u ∈ U_H, write

A(u) = (A₁(u), A₂(u)),

where A₁(u) is the denotation and A₂(u) is a ground term representation satisfying

I(A₂(u)) = A₁(u).

Define

α(u) = A₂(u).

By the syntactic assumption, there exists

β : E_H → GT*

such that

(α ∪ β)[F_H] ⊆ C(G).

Because F_H is finite, only finitely many triples of C(G) are used in this inclusion. Each such triple belongs to some canonical instance of G, generated by some universal assignment

σ : U_G → GT*.

For every such σ, define an assignment A_σ for the universal variables of G in I by

A_σ(u) = (I(σ(u)), σ(u)).

Since

I ⊨ G,

there exists an assignment B_σ for E_G such that

I[A_σ • B_σ](F_G) = true.

Now replace each canonical witness name

w_e,σ

occurring in the selected finite fragment by the corresponding ground term representation

B_σ,2(e).

Call this replacement map ρ, and extend ρ recursively to lists and graph terms. The replacement preserves sharing: the same witness name is always replaced by the same ground term.
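The replacement of witness names can be pictured as follows. This is a minimal sketch under assumed encodings: Witness is a hypothetical record standing for a witness name, N3 lists are tuples, a graph term is a frozenset of triples, and the dictionary rho plays the role of the map from each witness name to its chosen ground term.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Witness:
    var: str      # the existential variable e
    sigma: tuple  # the universal assignment that produced the witness


def rho_apply(rho, term):
    """Replace witness names by ground terms, recursing into lists and graph terms."""
    if isinstance(term, Witness):
        return rho[term]
    if isinstance(term, tuple):        # N3 list
        return tuple(rho_apply(rho, t) for t in term)
    if isinstance(term, frozenset):    # graph term: a set of triples
        return frozenset(tuple(rho_apply(rho, t) for t in triple)
                         for triple in term)
    return term                        # IRI or literal


w = Witness("?y", (":a",))
rho = {w: ":bob"}
fragment = [(":a", ":knows", w), (w, ":in", (w, ":c"))]
replaced = [tuple(rho_apply(rho, t) for t in triple) for triple in fragment]
```

Because rho is a function on witness names, every occurrence of w, whether direct or inside a list, is replaced by the same term.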

Because each selected triple came from an instance of G that is true in I, every triple in

ρ((α ∪ β)[F_H])

is true in I.

Define an existential assignment B for E_H by

B(e) = (I(ρ(β(e))), ρ(β(e))).

Then

I[A • B](F_H) = true.

Since A was arbitrary, we have

I ⊨ H.

Since I was arbitrary among models of G, it follows that

G ⊨ H.

This proves soundness.

Completeness

Assume that the syntactic condition fails.

Then there exists an assignment

α : U_H → GT*

such that, for every assignment

β : E_H → GT*,

we have

(α ∪ β)[F_H] ⊈ C(G).

We construct a basic interpretation I_G such that

I_G ⊨ G

but

I_G ⊭ H.

Let the domain of I_G be

Δ_{I_G} = GT* / ≃,

the set of ground N3 terms, including the witness names, modulo graph-term isomorphism.

For every IRI or literal a, define

D_{I_G}(a) = [a].

For every ground graph term ⟨K⟩, define

Q_{I_G}(⟨K⟩) = [⟨K⟩].

Thus each ground graph term is interpreted as its own isomorphism class, and isomorphic graph terms receive the same value.

Define the extension relation by

EXT_{I_G} = { ([s], [p], [o]) | (s, p, o) ∈ C(G) }.

We first show that

I_G ⊨ G.

Let A be any assignment for U_G. Put

σ(u) = A₂(u).

By construction of C(G), the canonical instance of G corresponding to σ is contained in C(G). For every existential variable e ∈ E_G, choose the canonical witness

w_e,σ.

This gives an existential assignment B, with B(e) = ([w_e,σ], w_e,σ) for each e ∈ E_G, such that every triple in

(A • B)[F_G]

belongs to EXT_{I_G}. Hence

I_G[A • B](F_G) = true.

Since A was arbitrary,

I_G ⊨ G.

We now show that

I_G ⊭ H.

Use the assignment for U_H determined by the failing α:

A(u) = ([α(u)], α(u)).

Suppose, for contradiction, that

I_G ⊨ H.

Then there would exist an existential assignment B for E_H such that I_G[A • B](F_H) = true. Define

β(e) = B₂(e).

Since truth of a ground triple in I_G is exactly membership in C(G), modulo graph-term isomorphism, this would imply

(α ∪ β)[F_H] ⊆ C(G),

contradicting the choice of α.

Therefore

I_G ⊭ H.

Thus there exists a model of G that is not a model of H, so

G ⊭ H.

This proves completeness.

Relation with the RDF interpolation lemma

The RDF interpolation lemma states that, for simple RDF entailment, a graph S entails a graph E exactly when some subgraph of S is an instance of E.

The theorem above is the corresponding statement for basic N3 entailment.

In the RDF fragment:

there are no graph terms;
there are no N3 lists as first-class terms;
there are no universal variables;
existential variables correspond to RDF blank nodes;
the canonical expansion C(G) is just G, up to renaming of existential witnesses.

The condition

there exists β : E_H → GT* such that β[F_H] ⊆ C(G)

then says precisely that an instance of H is contained in G. Thus the N3 theorem conservatively generalises the RDF interpolation lemma.
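For the RDF fragment the condition reduces to a search for a blank-node mapping, which can be sketched by exhaustive enumeration. The encoding is an assumption for illustration: triples are tuples of strings, blank nodes are strings beginning with "_:", and the blank nodes of the premise are treated as fixed witness names. The function name simple_entails is hypothetical.

```python
from itertools import product


def is_bnode(term):
    return term.startswith("_:")


def simple_entails(g, h):
    """True if some instance of h (blank nodes mapped to terms of g) is a subset of g."""
    bnodes = sorted({t for triple in h for t in triple if is_bnode(t)})
    terms = sorted({t for triple in g for t in triple})
    for values in product(terms, repeat=len(bnodes)):
        mapping = dict(zip(bnodes, values))
        if all(tuple(mapping.get(t, t) for t in triple) in g for triple in h):
            return True
    return False


g = {(":a", ":knows", "_:b1")}
print(simple_entails(g, {(":a", ":knows", "_:x")}))
print(simple_entails(g, {("_:x", ":knows", ":a")}))
```

The enumeration is exponential in the number of blank nodes of h; it is meant to mirror the statement of the lemma, not to be efficient.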

The essential extra feature in N3 is the presence of explicit universal variables. Because of them, a conclusion may be supported by several different universal instances of the premise.

For example, the graph

({x}, ∅, {P(x), Q(x)})

entails

({u, v}, ∅, {P(u), Q(v)}).

However, the two triples in the conclusion may come from two different instances of the premise. This is why a direct “subgraph of G” formulation is insufficient; the correct object is the canonical expansion C(G).
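The example can be traced concretely. In the sketch below, P(x) and Q(x) are encoded as rdf:type triples, the universe is restricted to two hypothetical constants :a and :b, and the conclusion is instantiated with u ↦ :a and v ↦ :b. These encodings are assumptions made for illustration.

```python
G_triples = [("?x", "rdf:type", ":P"), ("?x", "rdf:type", ":Q")]
universe = [":a", ":b"]


def ground(triple, sigma):
    """Apply sigma to one flat triple."""
    return tuple(sigma.get(t, t) for t in triple)


# One ground instance of G per choice of x.
instances = {x: {ground(t, {"?x": x}) for t in G_triples} for x in universe}

# The instance of H under u -> :a, v -> :b.
target = {(":a", "rdf:type", ":P"), (":b", "rdf:type", ":Q")}

# No single instance of G contains the target ...
single = any(target <= inst for inst in instances.values())
# ... but the union of the instances does.
covered = target <= set().union(*instances.values())
# Which choice of x produced each target triple.
sources = {t: [x for x, inst in instances.items() if t in inst] for t in target}
```

Here single is False and covered is True, and sources records that the P triple comes from x ↦ :a while the Q triple comes from x ↦ :b.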

Scope and limitations

This theorem concerns basic N3 entailment only.

It does not cover the additional semantics of predicates such as log:implies. The N3 draft treats log interpretation separately from basic interpretation, adding special semantic conditions for predicates in the log: namespace.

Thus the theorem should be read as the N3 analogue of simple RDF entailment, not as a completeness theorem for full N3 reasoning with logical built-ins.
