Entity Modelling

www.entitymodelling.org - entity modelling introduced from first principles - relational database design theory and practice - dependent type theory


English Sentence Structure

Overview

In this section we give an extended example of entity modelling by demonstrating the modelling of English sentence structure. Our starting point has been the syntactic rules presented in chapters 7 and 8 of the book of Brinton: The Structure of Modern English1. What we do in this chapter is simply to translate the structural description of English sentence structure given there from the meta notation of phrase structure grammars into the more general entity relationship notation.

This chapter is a single example of the modelling of entitites and their parts. It demonstates that phrase structure grammars can be represented as entity models.

  • if you know already entity modelling then use this model as a means to explore formal grammars and the structure of English
  • on the other hand if you are already familiar with formal grammars then see how an entity model can provide a single view encompassing a set of production rules.

Figure 7
Structure of Simple Sentences - Based on Phrase Structure Rules given in Brinton. Click here to browse the model in a new window.

Some background to the material in the model is contained within the remaining sections.

Background - Grammars as Models

In the study of syntax, linguists seek to understand and express the rules by which some sequences of words can be comprehended as having a structure which is well-formed - that is, we find them grammatical - and some do not. In earlier sections we have spoken about an entity model as representing a theory of what is; we illustrate this now for, as Noam Chomsky wrote in 1957, a grammar of the language L is a theory of L2. Chomsky says that each grammar is simply a description of a certain set of utterances, namely those that it generates and he illustrates with reference to a part of chemical theory concerned with structurally possible compounds. Such a theory, says Chomsky, might be said to generate all physically possible compounds just as a grammar generates all grammatically `possible' utterances3,4.

Different authors have described English by various grammatical systems and with varying and over lapping degrees of success and agreement but each grammar so described is a rule system and what all such rule systems do is to identify and classify relationships between different parts of grammatical sentences; linguists bring to our attention types of entities and accompanying composition relationships which otherwise remain hidden.

Rule systems invariably focus on different word classes i.e. different types of words such as were introduced in the table 1 in the earlier section types of things.

The type of each word of an example sentence can be shown by annotating the word entities as follows:

.

With regard to the types of words - many words can be used say as nouns and as verbs and so it is the use of the word that is the entity to which a type is assigned.

Figure 8 shows the hierarchically structure of a simple sentence. It shows the composition relationships between its constituents parts. Each of the boxes in figure 8 is an example of a sentence constituent - a single word or a phrase; this is a key entity type for the linguist.

Figure 8
Constituent Parts of an Example Sentence 1 - phrases are shown in relationship to their constituent parts - referred to as ‘sentence constituents’.

Sentence Constituents

According to Brinton:

The study of syntax is the analysis of the constituent parts of a sentence: their form, positioning, and function. Constituents are the proper subparts of sentences. There are different types of constituents classified by the categories which constitute them; these have different functions and internal structures with elements arranged in a specific way. And they may themselves be complex, containing other constituents. The structure of a sentence is hence hierarchical.

Brinton notes that not all sequences of words function as constituents. It is the context, he says, which determines whether a particular sequence forms a constituent or not. For example, the sequence of words beautiful flowers is a constituent in I received beautiful flowers for my birthday but not in Though they are beautiful, flowers cause me to sneeze. The sequence the house on the hill is a constituent in one reading of the ambiguous sentence I bought the house on the hill, but not in the other; it is a constituent in the sense `I bought the house which is on the hill', but not in the sense `I bought the house while standing on the hill'.

We can represent sentence constituent as an entity type generalising `phrase' and `word':

and show phrases as sequences of words:

this leads us to a single diagram as shown in figure 9.

  • a sentence constituent is either a word or a phrase
  • a phrase is composed of one or more sentence constituents
Figure 9
From Understanding Language

Types of Phrase

Our example sentence is of a type that is often termed a `simple' sentence and consists of a subject followed by a predicate as shown in figure 10. Other examples of the same type, subject followed by predicate, are shown in table 1; each subject in the table can equally combine with any of the given predicates to form a well-formed sentence. Though ten sentences are presented by combining different subjects with different predicates we can generate 100 different sentences2. What we cannot do is pair any two subjects to produce a well-formed sentence nor any two predicates. This illustrates that subject phrases and predicate phrases are of distinct types. These are the types `noun phrase' and `verb phrase' respectively. These types of phrase can appear in other contexts within sentences but for now they appear as subject and predicate respectively.

This simple rule:

S
NP VP
(4)

using some self evident abbreviations is of a type sometimes referred to as a production, aggregates of which are used to express grammars technically referred to as context free grammars. This is a formal expression of the fact that a sentence(S) may be constructed as a noun phrase(NP) followed by a verb phrase(VP).

Figure 10
Example sentence showing the decomposition into subject phrase and a predicate phrase.
Table 1
Ten simple sentences used by Brinton to illustrate the subject predicate distinction.
Subject Predicate
1. We had arrived at the wrong house.
2. The man could open the door.
3. Truffles are a kind of fungus.
4. The house on the corner is for sale.
5. She went to see who was at the door.
6. The rain and fog dissipated.
7. There are five cats in the hall.
8. The film that I saw last night was depressing.
9. It is raining.
10. It is possible that Alfred will know the answer.

What such a rule (4) does not do is to give a name to the role played by the noun phrase (NP) in the sentence -- that of subject -- and nor does it give a name -- predicate -- to the role played by the verb phrase (VP). In Entity Modelling we have a notation for just such matters and accordingly we can represent the subject-predicate rule inclusive of role names by the entity model fragment shown in figure 11.

  • every simple sentence has exactly one subject which is a noun phrase
  • every simple sentence has exactly one predicate which is a verb phrase
Figure 11
Sentence as a combination of subject and predicate.

The entity type `phrase' of figure 9 is very general - it covers entities such as whole sentences, and the noun phrases and verb phrases shown in the subject and predicate columns of table 1. Linguists recognise these different phrase types and, just as for the different types of words, phrases may be substituted in sentences by phrases of like type without effecting the acceptability of the sentence. Table 2 shows some of the major distinctions within phrase types.

phrase type abbreviation examples
whole sentence S the young athelete set a new record
noun phase NP the young athelete, a new record
verb phrase VP set a new record
adjectival phrase AP
prepositional phase PP
Table 2
Types of phrase and abbreviations used

This leads us to be able to refine the entity type `phrase' in figure 9 by a generalisation as follows:

In the next two sections we continue the process of refinement - we describe the composition relationships that are possible between the different phrase types of a sentence. In this way we use the entity modelling notation as a meta-language in which we describe how different types of sentence constituent can be combined into grammatical sentences and in so doing present a description of a set of rules for sentence construction. There is not, in linguistic theory, an unequivocal set of rules to follow - the rules we choose to model follow close to those presented by Brinton.

Verb Phrases

We have examples of verb phrases in the right hand column of table 1. They are quite varied in their structure. Here is a typical analysis of a verb phrase, in this case for the predicate of figure 10:

Generally, a verb phrase contains a verb which is said to be the head of the phrase. In this example the verb is complemented by a direct object which is of type `noun phrase'. Note that we can be sure that the phrase `a new record' is of type `noun phrase' because we can construct a sentence with it as the subject. In fact it can replace any of the subjects in table 1 and new sentences result and, providing that verbs are conjugated appropriately, the results are well-formed though admittedly they are in some cases very unusual.

According to this analysis the phrase `set a new record' is a verb phrase constructed as a `verb' followed by a `noun phrase' and this is one general rule of a number of such for the construction of verb phrases. Formally expressed the rule is:

VP
V NP
(5)

In an entity model we can represent this like this:

In other cases a verb phase consists just of a verb alone and has no constituent noun phrase. Only certain verbs that can be used in this way and these are said to be intransitive. Thus `he cries' is an example in which `he' is the subject and `cries' is a predicate verb phrase consisting solely of the verb. Formally:

VP
V
(6)

The entity model equivalent is simply this:

These fragments can be combined like this:

Still other verb phrases have two objects a direct object of the action and an indirect object. Formally:

VP
V NP NP
(7)

This leads to a model combining representations of rules (5), (6) and (7) like this:

It is worth saying at this point as an illustration of the choices that are open to an entity modeller in any modelling situation that sometimes the two rules above will be represented together as a single combined rule in which parentheses are used to indicate optionality of the noun phrase:

VP
V (NP)
(8)

Likewise the entity modeller may dispense with the types representing the transitive and intransitive cases and represent the combined situation like this:

In total, Brinton describes seven different constructions for verb phrase. These constructions are both summarised and illustrated in table 3. Alternatively the rules summarised in table 3 can be expressed by combining all the different rules into the entity model of a verb phrase shown in figure 12 in which types ‘noun phrase’ and ‘prepositional phrase’ are yet to be described. while `adjectival phrase' was described earlier in section .

Table 3
Examples of different types of verb phrase.
Type of verb phrase/Rule Example
mono transitive
VP
V NP
intransitive
VP
V
ditransitive
VP
V NP NP
copulative
VP
V {NP, AP, PP}
complex transitive
VP
V NP {NP, AP, PP}
mono prepositional
VP
V PP
diprepositional
VP
V PP PP
Figure 12
The grammar of the Verb Phrase

Noun Phrases

There are three main types of Noun Phrase and paradoxically two of these types consist of phrases which are degenerate in the sense that they consist of single words - Pronouns and Proper Names because words of these types are grammatically acceptable as the subjects of sentences. For simplicity of the modelling we will henceforth consider pronouns and proper names as types of noun phrase. The other type of noun phrase takes a number of forms but does actually contain a noun and for technical reasons is known as type ‘N-bar-bar’.

By way of illustration for the model of N-bar-bar consider the phrase The wildly yapping dog on the sofa. We can annotate the types of word and constituent phrase are as follows:

In our earlier example the noun phrase which was the subject of our sentence is analysed like this:

The model is shown in figure 13.

Figure 13
The grammar of the Noun Phrase

1 Brinton, Laurel J. The structure of modern English: A linguistic introduction. John Benjamins Publishing, 2000.
2Syntactic Structures, N. Chomsky, 1957 (page 49).
3Syntactic Structures (page 48).
4Subsequent to Chomsky's 1957 descriptions of phrase structure rules, transformation rules and morphophonemic rules there many different approaches to expressing sentence structure. The models developed in this section cover the same ground as the phrase structure rules of Chomsky which are expressed as a class of grammar know as a context free. All context free grammars lead naturally and mechanically to entity models of parse trees. This section is an illustration of this.
5 Reproduced from Fowler, "Understanding Language" , page 100.
6But only the proviso that some verbs need be inflected to match their subjects - for example `We is for sale' needs become `We are for sale'.