<!ENTITY ldquo "“" ><!--=double quotation mark, left--> <!ENTITY rdquo "”" ><!--=double quotation mark, right-->
<!ELEMENT first (operator?, co-text?, topic, co-text?)> <!ELEMENT second (operator?, comment)>The comment may be divided into chunks marked as chunk elements.
FIRST PART | SECOND PART | |||||
---|---|---|---|---|---|---|
Operator | Co-text (1) | Topic | Co-text (2) | Operator | Comment | Chunks |
a | house | is | a building in which people live |
1 2 |
||
if | you | defeat | someone | you win a victory over them in a contest such as ... |
1 2 |
|
a | pure | substance | is | not mixed with anything else | ||
if | something happens | often | it happens many times or much of the time |
1 2 |
there | 2 |
are | 2 |
many | 2 |
kinds | 2 |
of | 10 |
activity | 6 |
and | 8 |
communication | 5 |
is | 11 |
only | 3 |
one | 1 |
them | 1 |
although | 2 |
often | 1 |
Field | Datatype | Description |
---|---|---|
HNo | number | Household number — sequential number assigned by the database creator to each household [appears to be part of the gid (Gorbals ID) column in the version we have -MSM] |
PNo | number | Person number — sequential number assigned by the database creator to the people enumerated in each household [appears to be part of the gid (Gorbals ID) column in the version we have -MSM] |
Surname | c20 | |
Forename | c16 | |
Address | c20 | Street address. |
Relation | c12 | Relationship to head of household. |
Sex | c1 | Male or female. |
Age | number | |
Occupation | c20 | |
TBirth | c20 | Town of birth. |
Rooms | Number | Number of rooms in household. |
HSize | number | Number of people in household as calculated by the database creator. |
Word Frequency Profile (1) | |||||
---|---|---|---|---|---|
Word-form count | Number such | Vocabulary total | % of vocabulary | Word-form total | % of text |
1 | 85 | 85 | 75.22 | 85 | 44.97 |
2 | 15 | 100 | 88.50 | 115 | 60.85 |
3 | 4 | 104 | 92.04 | 127 | 67.20 |
4 | 1 | 105 | 92.92 | 131 | 69.31 |
5 | 3 | 108 | 95.58 | 146 | 77.25 |
6 | 1 | 109 | 96.46 | 152 | 80.42 |
8 | 2 | 111 | 98.23 | 168 | 88.89 |
10 | 1 | 112 | 99.12 | 178 | 94.18 |
11 | 1 | 113 | 100.00 | 189 | 100.00 |
The other version of this table works in the other direction:To explain these, let us examine line 3 in [the preceding table]. This line deals with word-forms which occur three times. column 2 tells us that there are four of them, and column 3 keeps a running total of the number of different word-forms that have been reported. Column 4 relates the number in column 3 with the total of 113, expressed as a percentage. So the word-forms that occur no more than three times constitute 92.04 per cent of the 'vocabulary'—the number of different word-forms.Column 5 considers the running text, where the total number of word-forms is 189. If there are four word-forms which occur three times each, then 12 must be added to the previous total in column 5. Column 6 relates the number in column 5 to the total of 189 and reports that word-forms that occur no more than three times occupy 67.2 per cent of the text. This figure will drop as the text size increases. In longer texts, the most frequent word-form, the, itself occupies about 2.5 per cent of the text.
Word Frequency Profile (2) | |||||
---|---|---|---|---|---|
Word-form count | Number such | Vocabulary total | % of vocabulary | Word-form total | % of text |
11 | 1 | 1 | 0.88 | 11 | 5.82 |
10 | 1 | 2 | 1.77 | 21 | 11.11 |
8 | 2 | 4 | 3.54 | 37 | 19.58 |
6 | 1 | 5 | 4.42 | 43 | 22.75 |
5 | 3 | 8 | 7.08 | 58 | 30.69 |
4 | 1 | 9 | 7.96 | 62 | 32.80 |
3 | 4 | 13 | 11.50 | 74 | 39.15 |
2 | 15 | 28 | 24.78 | 104 | 55.03 |
1 | 85 | 113 | 100.00 | 189 | 100.00 |
of activity and communication | is | only one of them |
communication where the activity | is | halted in time if |
whole process the activity | is | obvious enough the nervous |
nervous activity of authors | is | legendary and the silent |
reader in his armchair | is | making continuous fast and |
radio listener his brain | is | highly active if he |
highly active if he | is | taking anything in there |
human communication through language | is | only a small sub-section |
small sub-section although it | is | very important attempts to |
mankind's only remaining boast | is | that we thought of |
of it first it | is | certainly an intricate and |
There are many kinds of activity, and communication | ||
is | only one of them, although often it does not look much like activity. | |
But in a library we wee a stage of communication where the activity | ||
is | halted in time. | |
If we consider the whole process, the activity | ||
is | obvious enough. | |
The nervous activity of authors | ||
is | legendary, and the silent reader in his armchair is making continuous, fast, and precise eye movements. | |
The nervous activity of authors is legendary, and the silent reader in his armchair | ||
is | making continuous, fast, and precise eye movements. | |
And, like the radio listener, his brain | ||
is | highly active if he is taking anything in. | |
And, like the radio listener, his brain is highly active if he | ||
is | taking anything in. | |
There are many kinds of communication, too, and human communication through language | ||
is | only a small sub-section, although it is very important. | |
There are many kinds of communication, too, and human communication through language is only a small sub-section, although it | ||
is | very important. | |
Perhaps mankind's only remaining boast | ||
is | that we thought of it first! | |
It | ||
is | certainly an intricate and distinctive kind of activity. | |
Greenstein, Daniel I. 1994. A Historian's Guide to Computing. Oxford: Oxford University Press.
Halpern-Hamu, Charlie. 1999. “Stupid XSL tricks”. Markup Technologies '99 conference, Philadelphia, December 1999. [Alexandria, Virginia]: Graphic Communications Association.
Hockey, Susan. 1985. Snobol programming for the humanities. Oxford: Clarendon Press.
Sinclair, John. 1991. Corpus Concordance Collocation Oxford: Oxford University Press.
White, Alex. 1987. How to spec type. New York: Watson-Guptill.
Brian Wilson, “CSS Property Index”, on Index DOT css http://www.blooberry.com/indexdot/css/ (with browser notes on IE, Mosaic, Netscape, Opera)
ProjectCool Developer Zone, “CSS Style Properties”. Summary of properties, with notes on implementation in NS 4.0, IE 3, IE 4; last updated 1998 (CSS 1). http://www.devx.com/projectcool/developer/reference/css_style.html
John Pozadzides and Liam Quin, “CSS1 Properties”. http://www.htmlhelp.com/reference/css/all-properties.html
Miloslav Nic and Jiri Jirat, “CSS 2 reference with examples” http://zvon.org/xxl/CSS2Reference/Output/index.html
Matthew Haughey, “The little shop of CSS horrors: A look at the dark side of CSS rendering”. http://haughey.com/csshorrors/ (as addendum, not main cheat sheet!)