Levels of Style in Byzantine Greek and T PDF
Levels of Style in Byzantine Greek and T PDF
Levels of Style in Byzantine Greek and T PDF
VOL. XLVII
HELSINKI 2013
INDEX
Nikolaos Kälviäinen Levels of Style in Byzantine Greek and the Role of Syntactic 173
Complexity: A Quantitative Analysis of the Sentence
Structure of Three Early Byzantine Hagiographic Texts
Antonio Pistellato Gaius Caesar, or the Ideal Non-princeps: A Tiberian Issue 199
Olli Salomies A Note on the Speeches of the Prosecutors in Cicero's pro 257
Milone 35–6
Nikolaos Kälviäinen
A characteristic of Medieval Greek literature that readers have always been in-
stinctively aware of is its linguistic heterogeneity. Traditional linguistic forms
dating as far back as Homer and Classical Attic continued to be cultivated by
learned Byzantines alongside a range of varieties ultimately derived from the
Hellenistic Koine, the internationalized Attic cemented as the lingua franca of
the Eastern Mediterranean in Hellenistic times, but influenced to varying degrees
by later developments in the history of the spoken language. The metaphor of
verticality is often applied to this kind of variation, which in sociolinguistics is
referred to as diglossia, with two main language systems, H(igh) and L(ow), in
use in the same speech community.2
1 I would here like to express my warmest gratitude to a number of people in both Helsinki and
Rethymno, who in one way or another helped me complete my Master's thesis (University of
Helsinki 20.8.2012), on which this paper is based: my instructors, Prof. Mika Kajava and Dr.
Hilla Halla-aho for their helpful and diligent guidance; Dr. Marina Detoraki for her enthusias-
tic support and feedback; Dr. Marja Vierros for her valuable comments; Prof. Fred Karlsson
for pointing me to a series of studies of syntax which proved most useful in both theory and
practice; and my father, MS Timo Kälviäinen for helping with inferential statistics. I also wish
heartily to thank Otto Nieminen for correcting my English expression. The responsibility for
any remaining errors lurking in the pages of the present paper are of course solely my own.
Finally, to Georgia: επιτέλους είμαι σπίτι!
2 N. Toufexis, "Diglossia and register variation in Medieval Greek" BMGS 32.2 (2008) 208–
09. Toufexis is concerned specifically with the Late Byzantine period, but the diglossic situa-
tion is present during the entire Byzantine millennium, ultimately dating back to antiquity (cf.
the dichotomy of Atticism and Koine).
174 Nikolaos Kälviäinen
contrast starkly with the simpler, lower style of the main text.8 Thus, the relation-
ship between education and the use of H in writing is not always as straightfor-
ward as it might seem. It is also worth mentioning that direct speech in Byzantine
narrative texts seems to be rendered quite often by a lower linguistic form than
that used in the actual narrative.9
Now the obvious question facing the student of Byzantine style is how
do we operationalize this continuum of vertical stylistic variation (i.e. replace
it with something countable) in order to study it? This problem was tackled by
I. Ševčenko in an insightful paper published some thirty years ago.10 Ševčenko
adopted the concept of "Three Styles", which have been used in rhetorical and
stylistic theory since ancient times, and posited three main levels of style: high
style, characterized by Atticisms, linguistic classicism and periodic syntax; mid-
dle style, denoting works in a less complex style, influenced by the Scripture and
patristic writings, syntactically less periodic and tending towards parataxis; low
style, that of works with plentiful vernacular vocabulary, influenced by the New
Testament and the Psalter (i.e. the most commonly read parts of the Scripture)
and simple paratactic syntax.11
This tripartite division, simplified as it is in order to be able to capture sig-
nificant generalizations, is criticized by Wahlgren on the grounds that it presents
Byzantine literature as divisible into categories that "are stable across the centu-
ries without being influenced by each other".12 Wahlgren rightly points out that
style should not be oversimplified into "levels" as if they were the only stylistic
divisions to be made in Byzantine literature.13 It is of course to be hoped that a fu-
8 For Palladius, cf. Hult (n. 7) 28 and the discussion in I. Ševčenko, "Additional remarks to the
report on levels of style", JÖB 32.1 (1982) 214.
9 See, for example, M. Hinterberger, "How should we define vernacular literature?",
http://www.mml.cam.ac.uk/greek/grammarofmedievalgreek/unlocking/Hinterberger.pdf
(quoted 14.10.2011) 2006, 8; Horrocks (n. 4) 254; also N. Kälviäinen, "Ἀναζητώντας τὶς
ἀπαρχὲς τοῦ γραπτοῦ δημώδους λόγου: Στατιστικὰ στοιχεῖα γιὰ τὴν ἔκφραση τοῦ
μελλοντικοῦ χρόνου στὴν πρώιμη βυζαντινὴ ἁγιογραφία", in the proceedings of the confer-
ence Neograeca Medii Aevi VII (forthcoming).
10 Ševčenko (n. 7).
11 Ševčenko (n. 7) 291. Toufexis' subdivision of the higher register H into classicizing Greek
and Schriftkoine is easy to equate to Ševčenko's high and middle levels, while his L would
roughly correspond to Ševčenko's low style: see Toufexis (n. 2) 210 and 212.
12 S. Wahlgren, "Byzantine Literature and the Classical Past", in E. Bakker (ed.) A Companion
to the Ancient Greek Language, Chichester 2010, 528.
13 Wahlgren (n. 12) 529. Ševčenko himself was clearly aware of this, as he contrasted the "ver-
176 Nikolaos Kälviäinen
tical levels" with "horizontal kinds" of style such the ἰδέαι of Hermogenes' rhetorical theory
(i.e. level-internal variation conditioned by genre, text/discourse type etc.): Ševčenko (n. 7)
290.
14 A. Kazhdan, A History of Byzantine Literature (650–850), Athens 1999, 162.
15 K. Dover, The Evolution of Greek Prose Style, Oxford 1997, 3–4.
16 L. Jeffries – D. McIntyre, Stylistics, Cambridge 2010, 35.
17 Cf. Ševčenko (n. 8) 223.
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 177
2. Syntactic complexity
So far the discussion has been theoretical in nature, but in order to test in prac-
tice the scientific validity of levels of style coupled with syntactic complexity as
a conceptual framework for a stylistic classification of Byzantine literature, we
need empirical data. We accordingly need to develop a method for quantitatively
analysing the syntactic complexity of Byzantine texts.18
Syntactic complexity essentially refers to "syntactic structures which ne-
cessitate increased parsing and processing effort".19 In other words, increased
syntactic complexity means structural differences that make a text harder to read.
Complexity in itself, however, is something far too complex to be directly mea-
sured: in order to make use of syntactic complexity, we will need to operational-
ize it as one or more variables that can be credibly linked to the abstract concept
of complexity.
First of all, it has been established that sentence length can be utilized as an
accurate index of syntactic complexity.20 Counting the amount of words per sen-
tence is an economical method of establishing differences in syntactic complexity
18 Although this aspect of linguistic style remains largely unexplored in Classical and Byzan-
tine philology, there are a few studies that must be cited as having inspired the present paper.
T. Webster, "A Study of Greek Sentence Construction", AJPh 62 (1941) 385–415 studies the
evolution of Classical Greek sentence structure by using syntactic variables such as sentence
length and embedding depth. Webster's study can be criticized for overlooking stylistic vari-
ation inside a given work (see Dover [n. 15] 50), but as a pioneering attempt it is noteworthy.
H. Hunger, "Stilstufen in der Geschichtsschreibung des 12. Jahrhunderts: Anna Komnene und
Michael Glykas", BSEB 5 (1978) 137–70 demonstrates in a short qualitative analysis that Anna
Comnene's syntactic structure is more complex than that of Michael Glycas in terms of both
sentence length and the use of subordinate structures. Similarly, W. de Melo, "Zur Sprache
der republikanischen carmina Latina epigraphica: Satzumfang, Satzkomplexität und Diath-
esenwahl", in P. Kruschwitz (ed.), Die metrischen Inschriften der römischen Republik, Berlin
2007, 97–120 used, among others, mean T-unit length (though without calling it a T-unit) and
the ratio of main clauses and subordinate clauses to show that the syntax of Lucretius' De
rerum natura is more complex than that of Terence's Eunuchus and the early Latin carmina
epigraphica. Finally, Dover's book (n. 15) on Classical Greek prose style is a treasure trove
of ideas and observations on the use of syntactic variables such as T-unit (which Dover calls
MCF or "main clause-finite verb unit") and the analysis of Greek sentence structure and genre-
conditioned stylistic variation.
19 B. Szmrecsányi, "On Operationalizing Syntactic Complexity", in G. Purnelle – C. Fairon –
A. Dister (eds.), Actes du colloque JADT 2004 (Journées internationales d'Analyse statistique
des Données Textuelles), Louvain-La-Neuve 2004, 1031.
20 Szmrecsányi (n. 19) 1037–38.
178 Nikolaos Kälviäinen
and is therefore suitable for large-scale research in cases where a more detailed
analysis would be impossible, or for fast and easy analysis in general.
Sentence length as a variable, however, possesses one fundamental draw-
back: the lack of an unambiguous definition of the concept "sentence". While a
definition based on punctuation is commonly used, it cannot here be considered
an optimal approach, as in most cases whatever punctuation is found in the manu-
scripts neither reflects a consistent system nor originates from the author's own
pen. Modern editors normally replace it with a more or less intuitive punctuation
(influenced by their own working language) and significant differences can on
occasion be detected in the practice of different editors.21 Thus, we cannot rely
on an author's punctuation as a guide to analysing his sentences, as is the case
with modern languages endowed with reasonably standardized writing systems.
On the other hand, a lexical definition based on identifying coordinating conjunc-
tions does not work either, as most comparable items can be used alternatively
as conjunctions or as discourse particles (cf. the "biblical" sentence-initial καί).
As an alternative Hunt proposes the concept of the minimal terminable unit
or "T-unit", defined as a single main clause plus any subordinate clauses depen-
dent on it.22 In essence, the T-unit corresponds to the sentence in all respects save
that it does away with the coordination of main clauses, which would be difficult
to define. However, despite the usefulness of the T-unit variable, structural length
does not automatically translate to structural complexity.23 That is, measuring
length probably only works as an index of syntactic complexity due to the general
probability that longer syntactic units contain more complex structures. We will
thus take a step further in the hope that more complex variables will enable us to
support the conclusions drawn from measurements of T-unit length.
It is generally accepted that, apart from length, syntactic complexity is
related to the number, type and depth of embedding or subordination in a text.24
Subordination is defined as "the nonsymmetrical relation holding between two
21 Dover (n. 15) 27. Cf. also K. Hunt, Grammatical Structures Written at Three Grade Levels,
Champaign, Illinois 1965, 8 for a similar observation, in his case concerning the revision of
schoolchildren's punctuation by English teachers.
22See Hunt (n. 21) 21. Essentially the same idea is proposed in Dover (n. 15) 28. Cf. also de
Melo (n. 18) 101–02.
23 Szmrecsányi (n. 19) 1032–33.
24 K. Beaman, "Coordination and Subordination Revisited: Syntactic Complexity in Spoken
and Written Narrative Discourse", in D. Tanner – R. Freedle (eds.), Coherence in Spoken and
Written Discourse, Norwood 1984, 45.
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 179
clauses such that one is a constituent or part of (i.e. dependent upon) the other".25
In order to understand what this means, we need to consider the fundamental
structural characteristics of language. When looking at language at the phono-
logical level, we see phonological segments strung one after another in a monodi-
mensional chain. At the syntactic level, however, linguistic theory postulates a
two-dimensional model of language structure, with two different ways of com-
bining elements of the same type (such as clauses) to form larger structural units
(such as sentences or T-units): iteration and recursion.
The main difference between iteration and recursion, as formulated by
Karlsson, is that iteration "yields flat output structures, repetitive sequences on
the same depth level as the first instance" while recursion "builds structure by in-
creasing embedding depth".26 In other words, elements combined through itera-
tion are "concatenate",27 syntactically equal, while elements combined through
recursion form a hierarchic structure28 in which some elements are subordinate
to others of the same type (i.e. embedded in them). Such hierarchic structures are
considered more complex than flat iterative structures: to quote one definition of
complexity, "increased complexity is, at its most general level, increased hierar-
chic organization; that is, an increase in the number of hierarchic levels within a
system".29
In sentence structure, where structural units called sentences are formed
from smaller structural units called clauses, iteration manifests itself as coordina-
tion, where syntactically equal clauses are combined, often by means of conjunc-
tions such as καί. The result of recursive combination of clauses, on the other
hand, is known as subordination, where a clause is syntactically dependent on
another; this relationship is in turn marked by conjunctions such as ὅτι.
Now the contrast between iteration/coordination and recursion/subordina-
tion is relevant to the issue of syntactic complexity precisely because hierarchical
structures are considered more complex than flat iterative structures. A funda-
mental difference can be discerned between the two in actual language use as
well: whereas iteration appears practically unconstrained (i.e. there are no theo-
retical limits to how many syntactically equal clauses can be coordinated into a
single sentence) and successive application of iterative clause combination can
very well result in unusually long sentences,30 the actually occurring use of re-
cursion to create structural complexity has been found much less impressive in
comparison.31
Thus, according to Karlsson, even extremely long and complex sentences
seem to owe their size and complexity mostly to iteration rather than recursion:
it appears the greatest clausal embedding depth32 (or one of the greatest in any
case) achieved in Karlsson's example sentence from James Joyce (12,931 words)
is 6 levels of depth, while the number of coordinated clauses certainly runs in the
hundreds, if not thousands.33 Similarly, empirical data from the study of several
European languages shows that the distribution of subordinate clauses at differ-
ent levels of embedding depth follows a falling curve, with an added level of
embedding corresponding to a drop in the frequency of clauses occurring at that
level.34 Furthermore, it seems that more complex styles (e.g. legal language) tend
to have more clauses at deeper levels of embedding than syntactically simpler
varieties, an observation directly related to the issue at hand.35
Thus far, then, we have seen that iteration/coordination and recursion/sub-
ordination as strategies of clause combination differ greatly in actual usage, with
recursion heavily limited and iteration lacking similar constraints. What is the
reason behind this discrepancy? The answer is assumed to lie in the relatively
greater cognitive processing difficulty of more complex, "deep" hierarchical re-
cursive structures in comparison with less complex, "flat" iterative structures.
In other words, it is considered that increasing recursive combining of elements
30 Karlsson (n. 26) 46.
31 For an in-depth theoretical discussion, see Karlsson (n. 26) 50–65; cf. also R. Laury – T.
Ono, "Recursion in conversation: What speakers of Finnish and Japanese know how to do", in
H. van der Hulst (ed.), Recursion and Human Language, Berlin 2010, 69–70 and 84–85.
32 The term refers the structural depth achieved through successive application of recursive
cycles. The embedding depth of a subordinate clause is the "vertical" relation of the level of the
hierarchic structure it occupies to that of the main clause it is dependent upon (either directly or
indirectly, through other embedded clauses). The level of embedding occupied by a clause is 1
+ the level of embedding of its superordinate clause, with all main clauses assigned level 0. In
other words, subordinate clauses directly dependent on a main clause are assigned level 1, the
subordinate clauses embedded in these level 2 and so on.
33 Karlsson (n. 26) 46–47.
34 See F. Karlsson, "Multiple final embedding of clauses", International Journal of Corpus
Linguistics 15.1 (2010) 95–99.
35 See Karlsson (n. 34) 96–97.
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 181
3. Methodological considerations
The broad research questions emerging from the preceding discussion – ones that
this study alone cannot attempt to answer – are a) whether the complexity of a
36 See Szmrecsányi (n. 19) 1033–34; Karlsson (n. 34) 101–02; Laury – Ono (n. 31) 79. How-
ever, see also Givón (n. 29) 12–13 for possible difficulties involved in this interpretation. For
the sake of the argument we adopt in this study the view that assumes the existence of a real
psychological correlation between syntactic complexity and cognitive complexity, the latter
being understood here to imply increasing mental processing difficulty.
37 In measuring the performance of different indices of syntactic complexity, Hunt found this
simple subordination ratio less efficient than T-unit length but better than sentence length: Hunt
(n. 21) 23.
38 Ševčenko (n. 7) 304.
182 Nikolaos Kälviäinen
text's sentence structure consistently mirrors its level of style throughout Byz-
antine literature and b) whether this relation can be effectively utilized in order
to create an objective system of classifying Byzantine texts stylistically, i.e. by
using syntactic complexity as an index of the levels of style. These questions can
obviously only be answered when data from the analysis of an extensive corpus
of texts is made available. The contribution of the present study is to demonstrate
empirically that there exists a relation between the stylistic level and syntactic
complexity and that this relation can be quantitatively measured. This aim is
achieved by analysing a small test corpus, the data of which cannot yet be gener-
alized across the board but which serves to indicate why this line of research is,
in the author’s opinion, worth pursuing.
In order to validate theoretically our basic hypothesis for the test corpus
- according to which texts perceived as belonging to higher levels of style mani-
fest correspondingly increasing syntactic complexity - we need to demonstrate
that a systematic correlation exists between two variables: "level of style" and
"syntactic complexity". Both variables, however, are far too abstract to study in
themselves, so they must somehow be operationalized as measurable categorical
variables. In this study, the variable "level of style" is represented by three text
samples (see section 4 below), each assigned to a certain level of style on the ba-
sis of a combination of personal observation as well as judgments and comments
expressed in the scholarly literature.
The variable "syntactic complexity", in turn, is operationalized as the fol-
lowing 5 different syntactic variables involving the length of syntactical units or
the extent of recursive clause combining (subordination/embedding), both con-
cepts theoretically linked to the complexity of syntax, as seen in the discussion
above. Each variable is examined as an independent index of syntactic complex-
ity and the overall image presented by the results yielded by the different vari-
ables is then considered the "final verdict". Variables 1–2 and 4–5 are analysed as
categorical (i.e. not continuous but with a limited number of possible outcomes)
and the categories of each are given here. Due to the fact that the statistical χ2-test
(see below) requires that at least 5 tokens be found in the data for each category
of each variable, the categories have been adjusted after carrying out the analysis
to comply with this requirement.39
Once the raw data has been collected, it is subjected to a statistical χ2-test or chi-
square test in order to a) establish the significance of the results and b) evaluate
the extent to which each of the three texts differs from the two others.40 Explain-
ing the basics of inferential statistics is beyond the scope of this article, and the
reader should turn to an exposition of the subject in a basic manual of statistics
for linguists.41
However, in a nutshell, the probability value or p-value obtained by means
of the test is a gauge of the statistical probability that the performance of variable
X (one of our complexity variables) is not conditioned by variable A/B (the "level
40 I have used Preacher's online χ2 calculation tool to compute the values: see K. Preacher, Cal-
culation for the chi-square test: An interactive calculation tool for chi-square tests of goodness
of fit and independence, http://quantpsy.org (quoted 13–18.8.2012), 2001.
41 E.g., E. Levon, "Organizing and Processing Your Data: The Nuts and Bolts of Quantitative
Analyses", in L. Litosseliti (ed.), Research Methods in Linguistics, London 2010, 68–92. See
also Dover (n. 15) 47 for another demonstration of how to use the χ2-test in practice to establish
differences or similarities between Ancient Greek texts composed in different styles.
184 Nikolaos Kälviäinen
of style" variable, i.e. the different texts being compared). For example, a p-value
of p = 0.75 would mean there is a 75% chance that texts A and B are so similar in
their use of the variable X that any observable difference is due purely to chance.
Where to draw the line in these probabilities is a matter of convention; according
to Levon, in the humanities the standard cut-off point is p = 0.05.42 This means
that a p-value of 0.05 or smaller (5% or less chance that A and B do not differ with
respect to X) is conventionally considered proof enough that A and B differ sig-
nificantly from each other with respect to variable X, which is formally expressed
by saying that the difference between A and B with respect to X is statistically
significant at the p < 0.05 level.
Conversely, a p-value greater than 0.05 (over 5% chance) is taken to mean
that the findings are not statistically significant, since pure chance cannot be ruled
out as being responsible for the differences between A and B, and the hypothesis
of A and B belonging to different levels of style has to be rejected with respect
to the use they make of variable X. A and B would thus be considered in our
theoretical model to belong to the exact same style as far as syntactic complexity
is concerned and as far as syntactic complexity is accurately represented by X.
The meaning of statistical significance to the linguist or philologist is that hav-
ing statistically significant findings provides "predictive power to the descriptive
facts"43 and gives a powerful tool for objectively evaluating the differences be-
tween texts.
Apart from the p-value, there are two other values involved in the χ2-test,
the χ2-value and the degrees of freedom (abbreviated df). These are also reported,
although the p-value in itself reveals all that is essential. However, since a pro-
gressively higher χ2-value corresponds to a progressively lower p-value as far as
the df remain the same, the χ2-values can also be compared where the comparison
of extremely small p-values is impractical. (Note that χ2-values, unlike p-values,
cannot be compared if the corresponding df differ!) In short, when comparing
two texts with respect to a single syntactic variable, a high χ2-value and a low
p-value indicate a large difference between the texts, whereas a low χ2-value and
a high p-value indicate a smaller difference (even if that difference may still be
statistically significant if the p-value is low enough).
Having identified the variables to be measured, the question remains how
to analyse a given text in such a manner that the analysis is unconditionally re-
producible and falsifiable. As we have already seen in the case of the concept
of "sentence", even basic linguistic terms such as word, clause and sentence are
ambiguous categories and by no means immediately applicable to any text. In the
Master's thesis on which this paper is based, problems related to the definition in
Ancient and Byzantine Greek of concepts such as word, sentence, clause, subor-
dination and coordination were explored in some depth. However, due to limita-
tions of space, what is presented here is only a summary of the most important
(i.e. most likely to impact on the statistics) coding decisions made in the analysis
of our test corpus, leaving out most of the theoretical discussion.
44 For further discussion cf. Dover (n. 15) 26ff. and de Melo (n. 18) 100–02.
45 Cf. Dover (n. 15) 29.
46Cf. Hakulinen – Karlsson – Vilkuna (n. 39) 10–11 and M. Haspelmath, "Coordination", in T.
Shopen (ed.), Language Typology and Syntactic Description. Vol. II: Complex Constructions,
Cambridge 20072, 4–5 and 37–44.
47 For instructions on how to analyse a clause in terms of its constituent structure, see, e.g., M.
Tallerman, Understanding Syntax, London 1998, 116–25 or R. Van Valin, An Introduction to
186 Nikolaos Kälviäinen
constructions, cf. M. Haspelmath, Understanding Morphology, London 2002, 230–32 and for
Greek substantivally used (articular) infinitives see A. Rijksbaron, The Syntax and Semantics
of the Verb in Classical Greek. An Introduction, Chicago 20023, 112–14.
50 Cf. De Melo (n. 18) 103 and Laury – Ono (n. 31) 71–74.
51 See Dover (n. 15) 29.
52 See Dover (n. 15) 29.
53 For a detailed account of Classical Greek finite subordinate clauses, see e.g. Rijksbaron (n.
49) 49–94.
54 Dover suggests that digressive relative clauses such as ἐντὸς Ἅλυος ποταμοῦ, ὃς ῥέων ...
ἐξίει should in principle be treated as separate T-units whenever possible: Dover (n. 15) 29. I
188 Nikolaos Kälviäinen
The material analysed in this study consists of three samples, each taken from
an Early Byzantine hagiographical work: the Miracles of Ss. Cyrus and John of
Sophronius of Jerusalem (Mir. Cyr. & Jo.), the Pratum Spirituale of John Moschus
(Prat.) and the Miracles of St. Artemius (Mir. Artem.) of unknown authorship.
The texts have been chosen with a view to minimizing the impact of fac-
tors other than register or level of style, i.e. Ševčenko's "kinds of style" such as
genre or text/discourse type (e.g. the distinction of narrative and descriptive com-
positional elements56). Each work is dated to the same period (the late 6th to 7th
centuries) and belongs to one of two hagiographical subgenres with a reasonably
similar internal structure. Mir.Cyr.&Jo. and Mir.Artem. represent the genre of Mi-
racula/Θαύματα, independent collections of short episodes detailing the saintly
hero's miraculous exploits, while Prat. is a collection of short edifying stories,
anecdotes and sayings that illustrate the spiritual prowess of the desert-dwelling
ascetics of the Near East.
In order to use this corpus as an index for the variable "levels of style",
each of the three texts in the following description is tentatively assigned to a
certain level of style on the basis of lexical and content-related criteria as well as
comments and judgments made by other scholars: Mir.Cyr.&Jo. to high style and
Prat. and Mir.Artem. to low style.
Sophronius, patriarch of Jerusalem (ca. 560–638) is known to have been
a teacher of rhetoric before assuming monastic garb, and went on to author both
do not see why they should be treated differently from other relative clauses, however.
55 See Dover (n. 15) 29–30.
56 Cf. Kazhdan (n. 14) 159–60.
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 189
Miracles of St. Artemius belong to the lowest stylistic level in Early Byzantine
hagiography. Kazhdan and Papadopoulos-Kerameus describe the use of vernacu-
lar vocabulary as conspicuous, the expression as largely brief, simple and inartis-
tic (with little use of rhetorical figures) and the syntax as clear.63 Papadopoulos-
Kerameus even goes as far as to suggest that, on occasion, the text is so natural
and inartistic that it may approach the speech of contemporary Greeks.64 The
numerous cases of direct speech are indeed rendered in what might be thought of
as a credible approximation of a "written spoken language" (the question of the
fundamental differences between writing and actual speech is, of course, another
matter).
As far as "vertical" levels of style are concerned we expect Mir.Cyr.&Jo.
to differ greatly from the two others in terms of syntactic complexity. At the
same time we expect Prat. and Mir.Artem. to display a very similar (if probably
not identical) use of syntax, since according to our hypothesis they belong to
the same level of style. In terms of "horizontal" kinds of style, we can probably
assume that the difference in content between the hagiographical subgenres of
Miracula and Apophthegmata Patrum does not in itself correspond to differences
in language use, since the two are structurally very similar.
Nevertheless, even if genre, in the broad sense of "the genre of a literary
work as a whole", has been accounted for, any lengthy piece of writing may be
assumed to contain sections that differ from each other in terms of style.65 For
the purposes of this work it has not been possible to subject the selected samples
to such rigorous control as to eliminate this factor entirely. It is simply hoped
that, even if some compositionally heterogeneous material is inevitably included
in the samples, the overall similarity in structure of the three texts will dampen
their impact. Ultimately, it is difficult to ascertain whether possible differences
between the texts are due to genre, personal stylistic choices or other factors (for
example, it may be that Mir.Cyr.&Jo. contains a disproportionate amount of de-
scriptive – as opposed to narrative – elements in the form of learned digressions
such as rhetorical ἐκφράσεις.66
14) 27.
63 Kazhdan (n. 14) 34; Papadopoulos-Kerameus (n. 62) i–ii.
64 Papadopoulos-Kerameus (n. 62) ii.
65 Cf. Dover (n. 15) 46–56.
66 In many places Sophronius adopts the tone of rhetorical ἐγκώμια, as has also been pointed
out in N. Fernandez Marcos, Los Thaumata de Sofronio: Contribución al estudio de la incuba-
ción cristiana (Manuales y anejos de Emérita 31), Madrid 1975, 154.
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 191
Fig. 5.1: T-unit length in words (% of total T-units / words per T-unit)
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 193
Table 5.1.2.
Our first variable, T-unit length in words, offers tangible support to our hypothe-
sis. Prat. and Mir.Artem. show closely parallel distributions, while the high-level
Mir.Cyr.&Jo. differs from them dramatically on two separate counts. First, while
the low-level texts show a peak at T-units of 6–10 words, followed by a falling
curve, in Mir.Cyr.&Jo. the peak is shared with T-units of 11–15 words and the fall
that comes after is much less steep.
Secondly, unlike the low-level texts, Mir.Cyr.&Jo. shows a second peak at
T-units of 26 or more words, betraying a significant difference in the way T-units
of different lengths are handled by the two groups of writers: Prat. and Mir.Ar-
tem. prefer small T-units of around 1–10 words and much more rarely use longer
ones, whereas Mir.Cyr.&Jo. seems to strive for a balanced variation of long and
short T-units, with its short ones longer than the short ones of Prat. and Mir.Ar-
tem. This kind of balanced variation in Mir.Cyr.&Jo. is exactly what we would
expect from a sophist trained in the conventions of classical literature, while the
preponderance of short T-units in Prat. and Mir. Artem. reflects their paratactic
"καί-style".
Thirdly, there do seem to be some minor differences between Prat. and Mir.
Artem. in the form of slightly greater T-unit length in Mir.Artem. (note that the χ2
value of Mir.Cyr.&Jo. vs. Mir.Artem. is much lower than that of Mir.Cyr.&Jo. vs.
Prat., although the p-value is in both cases too small to conveniently reveal this
difference). The χ2-test reveals these differences are not statistically significant
at the 0.05 level, but it is a close call, with a p-value of only 0.09. In contrast,
the p-values obtained in comparing either low-level text with Mir.Cyr.&Jo. are
vanishingly small.
Table 5.2.1.
Fig. 5.2: T-unit length in clauses (% of total T-units / clauses per T-unit).
Table 5.2.2.
Our second variable yields results that closely parallel those of T-unit length in
words as far as differentiating Mir.Cyr.&Jo. from the other two texts is concerned.
The basic pattern is the same. However, what is interesting is that Prat. and Mir.
Artem. appear slightly less similar this time – although the overall shape of their
curves is again parallel, this time Mir.Artem. has a slight peak at 5+ clauses, while
Prat. clearly relies slightly more on very short T-units of 1–2 clauses than Mir.
Artem. does. Furthermore, this time the χ2-test reveals statistically significant dif-
Levels of Style in Byzantine Greek and the Role of Syntactic Complexity 195
ferences between all three authors, and although the difference between Prat. and
Mir.Artem. is the smallest one, it is far from negligible.
Table 5.3.
Average T-unit length in Mir.Cyr.&Jo. is nearly twice that of Mir.Artem. and over
twice that of Prat., the difference between which is a small one of 1.9 words
only. Mean T-unit length in clauses continues the same story, as does mean clause
length in words, although here there is only a relatively small difference of around
0.5 words between Mir.Cyr.&Jo. on the one hand and Prat. and Mir.Artem. on the
other (these being practically identical). In order to interpret this result, further
study is required to explore to what extent clause length generally varies in Byz-
antine Greek prose.
Table 5.4.1.
196 Nikolaos Kälviäinen
Table 5.4.2.
The simple subordination ratio reveals that Mir.Cyr.&Jo., the high style text, has
approximately three quarters of all his clauses embedded, whereas a full half of
the clauses in Prat. are main clauses. Mir.Artem. is now situated in the middle
even more clearly than was the case with T-unit length in clauses; clear differ-
ences seem to be emerging between the two low-style texts.
Table 5.5.1.
Table 5.5.2.
Embedding depth presents our authors in no different light than the previous vari-
ables. Mir. Cyr. & Jo. is clearly our most complex text, having significantly more
clauses embedded at levels 2 and 3 than either of the others. As with T-unit length
in clauses and the basic subordination ratio, these again differ among themselves
as well, with Mir.Artem. continuing to display greater complexity than Prat. The
difference between Mir.Artem. and Prat. is again statistically significant.
Our data for embedding depth are difficult to compare directly with those
of previous studies involving modern languages since in many cases only finite
clauses are counted. Nevertheless, generally speaking our texts seem to conform
to the picture emerging from previous research, with embedding beyond level 4
being extremely rare and level 4 itself relatively uncommon.71 For our purposes,
then, the critical zone consists primarily of the levels 1–3: it is here that embed-
ding depth bears out the general impression derived from studying our previous
variables, as we can appreciate both the yawning chasm between Mir.Cyr.&Jo.
and Prat. as well as the smaller but nevertheless distinct difference between Prat.
and Mir.Artem. At level 4 the difference between Prat. and Mir.Artem. vanishes
while Mir.Cyr.&Jo. remains distinct, and by the time level 5 is reached all distinc-
tions have by and large been ironed out.
6. Conclusions
These results demonstrate that there indeed exists a relation between the two
basic variables investigated, syntactic complexity and level of style. The sample
identified as belonging to the higher level of style (Mir.Cyr.&Jo.) consistently
manifests greater syntactic complexity than the samples identified as belonging
to a lower level of style (Prat. and Mir.Artem.), with these differences being sta-
70 In the case of embedding depth, due to the low incidence of tokens from level 3 onwards
(especially in Prat.) the χ2-test has been carried out with levels 2–5 collapsed into a single cat-
egory in order to fulfil the requirement of 5+ tokens per table cell.
71 Cf. Karlsson (n. 34) 96–98 and Laury – Ono (n. 31) 77–78.
198 Nikolaos Kälviäinen
tistically significant at the p < 0.05 level. Thus, the basic research hypothesis of
the correlation of level of style and syntactic complexity is corroborated by these
results.
On the other hand, there are two caveats. Firstly, although the vast differ-
ence between high-level text Mir.Cyr.&Jo. and the low-level text Prat. is consis-
tently borne out in the behaviour of all 5 variables, the second text identified as
belonging to a low stylistic level, Mir.Artem. exhibits somewhat greater complex-
ity than Prat.; this difference is statistically significant in 4 out of 5 variables. On
the other hand, it must be stressed that the overall image of syntactic complexity
presented by Mir.Artem. is still relatively close to that of Prat. and that the differ-
ences between the two, while undeniable, should not be exaggerated either.
That statistical significance or lack thereof alone cannot neatly divide our
texts into clearly separate levels of style is of course obvious, since in reality
the "levels of style" are not levels but a continuum with plenty of room for sta-
tistically significant differences between personal styles even within groups of
relatively similar styles. The question remains whether the continuum is an even,
unbroken line across the whole corpus of Byzantine texts or whether such groups
of relatively similar styles, ones that might then be named levels of style, can be
identified. In such a future classification of Byzantine styles on the basis of their
syntax, we might conceivably see Mir.Artem. labelled "lower middle style" or al-
ternatively "low style" with Prat. as "super-low". Before more data are available,
however, all such characterizations remain speculative.
Secondly, the corpus of texts examined in this study is extremely limited
and as such the results cannot be generalized across the board. In order to estab-
lish the applicability of our method to Byzantine literature as a whole, a much
larger corpus is required, with texts from different periods, genres and levels of
style. It goes without saying that the method of analysis needs also to be further
refined. Future research should prepare the ground for a more comprehensive ac-
count of complexity-related syntactic variation in Byzantine literature. Such an
account could then be related to findings made in other areas pertaining to style,
such as the lexicon, morphology, rhetorical devices and so on.
University of Crete