<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Archiving and Interchange DTD v1.2 20190208//EN" "JATS-archivearticle1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:ali="http://www.niso.org/schemas/ali/1.0">
  <front>
    <journal-meta>
<journal-id journal-id-type="nlm-ta">Revista da Abralin</journal-id>
<journal-title-group>
<journal-title>Revista da Abralin</journal-title>
</journal-title-group>
<issn pub-type="epub">2178-7603</issn>
<publisher>
<publisher-name>Associação Brasileira de Linguística</publisher-name>
</publisher>
</journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.25189/RABRALIN.V19I3.1728</article-id>
      <article-categories>
        <subj-group>
          <subject content-type="Theoretical Essay">Type of contribution</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Efficient trade-offs as explanations in functional linguistics</article-title>
        <subtitle>Some problems and an alternative proposal</subtitle>
      </title-group>
      <contrib-group content-type="author">
        <contrib id="person-3f1c7391c3b6777176f00192b7a0ccfd" contrib-type="person" equal-contrib="no" corresp="no" deceased="no">
          <name>
            <surname>Levshina</surname>
            <given-names>Natalia</given-names>
          </name>
          <email>natalevs@gmail.com</email>
          <xref ref-type="aff" rid="affiliation-938a2905677123f89b40c63b2406e094" />
        </contrib>
      </contrib-group>
      <contrib-group content-type="editor">
        <contrib id="person-de0ee2865f70dc1958e86705d9a98102" contrib-type="person" equal-contrib="no" corresp="no" deceased="no">
          <name>
            <surname>Neves</surname>
            <given-names>Maria Helena de Moura </given-names>
          </name>
          <email>mhmneves@uol.com.br</email>
          <xref ref-type="aff" rid="affiliation-4428470737a74b275bd4396811e0f241" />
        </contrib>
        <contrib id="person-f329fc0a1ebe2fbf4d2da354069df9a7" contrib-type="person" equal-contrib="no" corresp="no" deceased="no">
          <name>
            <surname>Mackenzie</surname>
            <given-names>John Lachlan</given-names>
          </name>
          <email>lachlan_mackenzie@hotmail.com</email>
          <xref ref-type="aff" rid="affiliation-62c20bed5e3c22b976c94b6e2ca66109" />
        </contrib>
        <contrib id="person-ceae6fd5fd6dfd87d7938700191a637b" contrib-type="person" equal-contrib="no" corresp="no" deceased="no">
          <name>
            <surname>Coneglian</surname>
            <given-names>André Vinicius Lopes</given-names>
          </name>
          <email>coneglian03@gmail.com</email>
          <xref ref-type="aff" rid="affiliation-3700691c130e98be49bf78017877b0db" />
        </contrib>
      </contrib-group>
      <aff id="affiliation-938a2905677123f89b40c63b2406e094">
        <institution content-type="orgname">Max Planck Institute for Psycholinguistics (MPI)</institution>
      </aff>
      <aff id="affiliation-4428470737a74b275bd4396811e0f241">
        <institution content-type="orgname">Universidade Estadual Júlio de Mesquita Filho (UNESP)</institution>
      </aff>
      <aff id="affiliation-62c20bed5e3c22b976c94b6e2ca66109">
        <institution content-type="orgname">Vrije Universiteit Amsterdam (VUA)</institution>
      </aff>
      <aff id="affiliation-3700691c130e98be49bf78017877b0db">
        <institution content-type="orgname">Universidade Federal de Minas Gerais (UFMG)</institution>
      </aff>
      <pub-date date-type="pub" iso-8601-date="2020-12-17" />
      <volume>19</volume>
      <issue>3</issue>
      <issue-title>Dossiês 2020</issue-title>
      <elocation-id>10.25189/RABRALIN.V19I3.1770</elocation-id>
      <history>
        <date date-type="accepted" iso-8601-date="2020-11-20" />
        <date date-type="received" iso-8601-date="2020-10-23" />
      </history>
      <abstract>
        <p id="_paragraph-1">The notion of efficient trade-offs is frequently used in functional linguistics to explain language use and structure. In this paper I argue that this notion is more confusing than enlightening. Not every negative correlation between parameters represents a real trade-off. Moreover, trade-offs are usually reported between pairs of variables, without taking into account the role of other factors. These and other theoretical issues are illustrated in a case study of linguistic cues used in expressing “who did what to whom”: case marking, rigid word order and medial verb position. The data are taken from the Universal Dependencies corpora in 30 languages and annotated corpora of online news from the Leipzig Corpora Collection. We find that not all cues are correlated negatively, which calls into question the assumption of language as a zero-sum game. Moreover, the correlations between pairs of variables change when we incorporate the third variable. Finally, the relationships between the variables are not always bidirectional. The study also presents a causal model, which can serve as a more appropriate alternative to trade-offs.</p>
      </abstract>
      <abstract abstract-type="executive-summary">
        <title>Resumo</title>
        <p id="paragraph-4b1d4f7e4c85527a91911ac8518ac492">A noção de troca eficiente (efficient trade-offs, em inglês) é frequentemente usada na linguística funcional para explicar o uso e a estrutura da linguagem. Neste artigo, defende-se que essa noção é mais confusa que esclarecedora. Nem toda correlação negativa entre parâmetros representa uma troca real. Ademais, trocas são geralmente vistas em pares de variáveis, sem que se leve em consideração o papel de outros fatores. Esta e outras questões teóricas são ilustradas por meio de um estudo de caso sobre expedientes linguísticos usados para expressar “quem fez o quê a quem”: marcação de caso, ordem rígida de palavras e posição medial do verbo. Os dados são provenientes do córpus Universal Dependencies, uma base de 30 línguas, e do córpus anotado de notícias da coleção Leipzig Corpora. O estudo de caso mostra que: nem todos os expedientes se correlacionam negativamente, o que contesta a assunção da linguagem como um jogo de soma zero; ademais, a correlação entre pares de variáveis muda quando uma terceira variável é acrescida; finalmente, as relações entre as variáveis não são sempre bidirecionais. Este estudo apresenta, também, um modelo causal, que pode servir como uma melhor alternativa a trocas.</p>
      </abstract>
      <kwd-group>
        <kwd content-type="">Efficiency</kwd>
        <kwd content-type="">Trade-offs</kwd>
        <kwd content-type="">Case marking</kwd>
        <kwd content-type="">Word order</kwd>
        <kwd content-type="">Universal Dependencies</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body id="body">
    <sec id="heading-53a64f7bc58d70941d47120317e50620">
      <title>Aims of this paper</title>
      <p id="paragraph-2">Efficiency can be defined as minimization of the ratio of costs to benefits. Put simply, a person behaves efficiently when they spend no more effort than necessary to achieve their goals. In language, the costs can be related to processing, articulation and acquisition, while the main type of benefit is the realization of one’s communicative needs. Although one can also think of aesthetic, social and other benefits, those are less frequently discussed in the literature.</p>
      <p id="paragraph-3">The fundamental question of functional linguistics is why human languages are as they are. There is a widely held view that one of the driving forces of language change is efficient choices made by language users during interaction. These choices can become conventional, according to the “invisible hand” principle (KELLER, 1994<xref id="xref-5f35e9342e669c74a7a894c679135b89" ref-type="bibr" rid="book-ref-5afb9a1f09b0d38bda4fe6111984470f">[1]</xref>). </p>
      <p id="paragraph-4">The idea that language users try to behave efficiently has a long history. Already Georg Curtius (1820–1885), a German philologist, explained phonetic attrition (<italic id="italic-88cbda312b29cce3319fd5ea34f46191">Verwitterung</italic> “weathering”) by the language users’ drive to <italic id="italic-6dc98c7905583018fafaabbd7ef83bdc">Bequemlichkeit</italic> “comfort”. This drive is counterbalanced by the tendency to preserve meaning-bearing sounds and syllables, which resist attrition in order to remain recognizable (DELBRÜCK, 1908, p. 143-144<xref id="xref-dcbee6550fd19a8f1c5fadf5b4c2269c" ref-type="bibr" rid="book-ref-2babe4d8d8a70cfc5490a2551517bacd">[2]</xref>). Therefore, language users tend to minimize their effort, at the same time trying to make sure that the important meanings are conveyed. Throughout the 20th century, the idea that language users try to save effort was a recurrent topic in linguistics, from Zipf’s (1949<xref id="xref-d91d45f64a453d1e0f259d93dfa5ac86" ref-type="bibr" rid="book-ref-65d624b065120baee5f74dd94f50e477">[3]</xref>) principle of least effort to Haiman’s (1983<xref id="xref-1af8ff9ea03f7265180521b5a53cb8aa" ref-type="bibr" rid="journal-article-ref-1f6bd064a2e4f3600621a955ef0d47a4">[4]</xref>) economic motivation in grammar and Keller’s maxim “Talk in such a way that you do not spend more energy than you need to attain your goal” (1994, p. 107<xref id="xref-51f2d714c44662bad0ced84dee3dec96" ref-type="bibr" rid="book-ref-5afb9a1f09b0d38bda4fe6111984470f">[1]</xref>). 
In the 21<sup id="superscript-1">st</sup> century, these ideas have been made more concrete and tested with the help of diverse data sources and cutting-edge methods, including multilingual corpora, artificial language learning experiments, multivariate statistical models and approaches from information theory (GIBSON <italic id="italic-3">et al.</italic>, 2019<xref id="xref-d9b85c43e96d3b3ffc746f1f40b33636" ref-type="bibr" rid="journal-article-ref-ee4671c65f48e998d48c64dc839e5421">[5]</xref>). </p>
      <p id="paragraph-5">Efficiency can explain the form and use of diverse grammatical constructions, words and phonological units. One can mention Zipf’s (1965[1935]<xref id="xref-cf31929cc3e72985b406874e63933262" ref-type="bibr" rid="book-ref-d28c30708ce2ef900ffe240bef9d52da">[6]</xref>) law of abbreviation in lexicon (see Section 1.2 for more detail), minimization of distances between syntactically and semantically related words, which makes processing easier (e.g. GIBSON, 2000<xref id="xref-479eee28aa11661884ae2e525752262d" ref-type="bibr" rid="chapter-ref-13a6756ba25cf9f9825239d2a628a967">[7]</xref>; FERRER-I-CANCHO, 2006<xref id="xref-8917f324dfc9e9ddd6c95c135dd84c8d" ref-type="bibr" rid="journal-article-ref-a40c6e28d58f17cbce5bff6b863b2a7c">[8]</xref>), efficient phonetic reduction in language production (JAEGER; BUZ, 2017<xref id="xref-5395d6091b7d7d2144d6d830da63de78" ref-type="bibr" rid="chapter-ref-ddbd3703f78d87570e3557ff2df7bddf">[9]</xref>), and efficient use of referential expressions in discourse (CLARK; WILKES-GIBBS, 1986<xref id="xref-f7dc7a35267cfabaaa20690541728fb2" ref-type="bibr" rid="journal-article-ref-db730c42323ab588ca9ede7908be203f">[10]</xref>; ARIEL, 1990<xref id="xref-513b9c61db3c7eaa0e3cd4913a49b535" ref-type="bibr" rid="book-ref-a528d84f01c66ebd97a9cb268500059c">[11]</xref>). 
More examples can be found in Hawkins (2004<xref id="xref-07c4d8a7dc1e8009c8e0f6997a6b5327" ref-type="bibr" rid="book-ref-5a5d017164cae07e8a93517e01975441">[12]</xref>), Jaeger and Tily (2011<xref id="xref-e9590d924d816c1f5e9274bb1f76742a" ref-type="bibr" rid="journal-article-ref-5ce971c415542d6fb2ee286bb8f1a808">[13]</xref>), Levshina (2018<xref id="xref-1d3680163ff55dce5e1c8a58b1f0918a" ref-type="bibr" rid="journal-article-ref-a797264c5718372e1d2e6ca89e015c1e">[14]</xref>) and Gibson <italic id="italic-4">et al.</italic> (2019<xref id="xref-1a91b6a0852cd012eddbfaa6a6015890" ref-type="bibr" rid="journal-article-ref-ee4671c65f48e998d48c64dc839e5421">[5]</xref>). </p>
      <p id="paragraph-6">We can speak of a trade-off when spending limited resources on a gain in one respect leads to a loss in another. For example, the media during the coronavirus pandemic may implicitly assume that keeping the economy going is possible only at the cost of public safety. Another trade-off is between protecting the environment and maintaining a high standard of living in the industrialized countries. In linguistics, there is a view that languages which are simple in one respect are likely to be complex in others (cf. SHOSTED, 2006<xref id="xref-4e754a763866abf036ac4e65352d6e23" ref-type="bibr" rid="journal-article-ref-f2937d26a4be28b5fd1e26fdde72db09">[15]</xref>). These real or perceived trade-offs play an important role in the way we understand the world.</p>
      <p id="paragraph-7">A trade-off can be represented visually as shown in Figure 1. The axes represent two potential costs. The dots are observations from some imaginary data. The line corresponds to the so-called Pareto frontier. The observations lying close to the Pareto frontier are optimal (or Pareto-efficient) because it is impossible to minimize one cost without increasing the other. </p>
      <fig id="figure-panel-76253c1b67f7d932318c117bb32abcad">
        <label>Figure 1</label>
        <caption>
          <title>A Pareto frontier based on imaginary data with two different costs<bold id="bold-3bef35b0a3c68e0333592c202513b87e"/></title>
          <p id="paragraph-87e646f6970eac3ae3cd4cb554cfbc1c" />
        </caption>
        <graphic id="graphic-bef22c46e8d30b6a1840dfe2dbede91c" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 1.jpg" />
      </fig>
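      <p id="paragraph-pareto-sketch">The notion of Pareto efficiency described above can be made concrete with a minimal sketch in Python. The observations are invented for illustration, and the function name <monospace>pareto_frontier</monospace> is an assumption introduced here, not part of the original study. A point is treated as Pareto-efficient if no other observation is at least as low on both costs and strictly lower on at least one of them.</p>
      <code id="code-pareto-sketch" language="python"><![CDATA[
```python
from typing import List, Tuple

Point = Tuple[float, float]  # (cost 1, cost 2); both are to be minimized


def pareto_frontier(points: List[Point]) -> List[Point]:
    """Return the observations that no other observation dominates."""
    frontier = []
    for p in points:
        # q dominates p if q is no worse on both costs and better on one.
        dominated = any(
            q[0] <= p[0] and q[1] <= p[1] and (q[0] < p[0] or q[1] < p[1])
            for q in points
        )
        if not dominated:
            frontier.append(p)
    return sorted(frontier)


# Imaginary observations, invented purely for illustration.
observations = [
    (1.0, 9.0), (2.0, 6.0), (3.0, 7.0), (4.0, 3.0),
    (6.0, 4.0), (7.0, 1.0), (8.0, 5.0),
]
frontier = pareto_frontier(observations)
print(frontier)
```
]]></code>
      <p id="paragraph-pareto-sketch-note">In this toy data set, four of the seven observations lie on the frontier. For each of the remaining three, some other observation is lower on both costs, so they are not Pareto-efficient.</p>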
      <p id="paragraph-10">If human languages are efficient, they should be located on a Pareto frontier. In other words, there should be a negative correlation between two linguistic variables, which is represented by the line in Figure 1. The variables can represent different types of costs. An example is Zipf’s (1949<xref id="xref-fb5e32b5543bdb3dd31551b2fe717bce" ref-type="bibr" rid="book-ref-65d624b065120baee5f74dd94f50e477">[3]</xref>) trade-off between Speaker and Addressee’s efforts (see below). Alternatively, they can represent benefits, such as different types of information available to the hearer, as in the case of the trade-off between information conveyed by word-internal structure (morphology) and word order (KOPLENIG <italic id="italic-5">et al.</italic>, 2017<xref id="xref-2e810ba2b295afae54fa4b429dc0f23a" ref-type="bibr" rid="journal-article-ref-791e1991a19bf0c635563d094595166c">[16]</xref>). </p>
      <p id="paragraph-11">Trade-offs are closely related to competing motivations in language (DU BOIS, 1985<xref id="xref-9bfc2a87a0fe83c432bee5d509e31221" ref-type="bibr" rid="chapter-ref-1d202b6ccf3d35c903a305aea2ebe412">[17]</xref>). Language users and learners are driven by different communicative and cognitive pressures. For example, system pressure (analogy), which forces language users to organize linguistic forms into systems in which classes of forms behave similarly, can be in conflict with economic motivation (HASPELMATH, 2014<xref id="xref-506b53be49a380d4a4a63580b401ea75" ref-type="bibr" rid="chapter-ref-44fdb479d44a5ef83e7c9f59b486447f">[18]</xref>). In particular, it would be less costly for articulation if English had a singulative form for “pea” (something like “pea-one”) and an unmarked plural form instead of “peas”, as in Welsh, because we seldom speak about one pea only (Andersen’s fairy tale <italic id="italic-6">The Princess and the Pea</italic> is a famous exception). System pressure, however, favours the regular pattern, which yields a cognitively simpler system that might be easier to acquire and manage in language production. The higher the articulatory costs, the lower the cognitive costs, and the other way round.</p>
      <p id="paragraph-12">Another example is competition between phonological transparency and articulatory efficiency. Consider final devoicing of stems and affixes. For example, the noun <italic id="italic-7">kod</italic> “code” in Russian has the Genitive singular form <italic id="italic-8">kod-a</italic> ['koda], while the Nominative singular form is <italic id="italic-9">kod-Ø</italic> [kot], which sounds like <italic id="italic-10">kot</italic> “cat”. This and other phonological alternations make articulation easier, but reduce the degree of transparency (i.e. one-to-one mapping between form and meaning) and consequently the degree of learnability of a language (HENGEVELD; LEUFKENS, 2018<xref id="xref-1236fc263db08bcc7bf415c431026263" ref-type="bibr" rid="journal-article-ref-d65fff11324a1a54a24c7f07914e2217">[19]</xref>). As put informally by Joseph Greenberg, “[a] speaker is like a lousy auto mechanic: every time [s]he fixes something in the language, [s]he screws up something else” (CROFT, 2002, p. 5<xref id="xref-dae757f5ef27926214cc964531fe1b58" ref-type="bibr" rid="journal-article-ref-0ab16defb12e6d440c278996d4eae518">[20]</xref>).</p>
      <p id="paragraph-13">At the same time, there are numerous problems associated with the concept of trade-off as an explanation in functional linguistics. These problems have seldom been discussed. Notable exceptions are Fenk-Oczlon and Fenk (2008<xref id="xref-42b739bf0f1bf769efb50aa0389c055f" ref-type="bibr" rid="chapter-ref-ec8ee13229e63366ee68d46cb30dd5e1">[21]</xref>) and Sinnemäki (2008<xref id="xref-0c5ea7dfd0f770ead8096eefe1dd3aad" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>; 2014<xref id="xref-694b36cb8a6ee4de2a38ca76022c6d6a" ref-type="bibr" rid="chapter-ref-e8ebaa902477b97d8f8e0a56137d9a5b">[23]</xref>).<xref id="xref-2d194fbb21972e753bc3737d1da26032" ref-type="fn" rid="footnote-77bdff038bd76e64c890b6a54ea6f873">1</xref> It is very tempting to interpret any negative correlation as an efficient trade-off. The present paper argues that such an interpretation is justified if and only if the following conditions are met:</p>
      <p id="paragraph-14">1) the variables participating in the negative correlation can be clearly defined as costs or benefits;</p>
      <p id="paragraph-15">2) there are only two correlated variables, and no other factors involved;</p>
      <p id="paragraph-16">3) the correlated variables are functionally related, representing one type of linguistic task;</p>
      <p id="paragraph-17">4) the relationships between the variables are bidirectional, not one-directional.</p>
      <p id="paragraph-18">As will be shown in Section 1, these conditions are hardly ever met. Therefore, the concept of trade-off in linguistics brings more confusion than insight and should be dropped altogether. Instead, we should replace the analysis of correlations between pairs of linguistic variables with a causal analysis of multiple factors. These issues are illustrated in a case study of the expression of core arguments in 30 languages (Section 2). Section 3 offers the conclusions and an outlook for future research.</p>
      <sec id="heading-f94ba5872a5fc31409de433c7a8ecfa2">
        <title>1. Problems with trade-offs in functional linguistics </title>
        <sec id="heading-f122d02ac8bfe7a3330fe23f1ae9816b">
          <title>1.1. The problems with defining costs and benefits</title>
          <p id="heading-b4efbf17df2abc35653e747d906f44f6">Trade-offs are assumed to exist between two types of costs or benefits. The aim of this section is to demonstrate that this assumption is often difficult to meet. Sometimes one linguistic variable involved in a presumed trade-off can represent different costs or benefits. Also, these costs and benefits are often difficult to define. The interpretation then becomes problematic. </p>
          <p id="paragraph-5bfa7962ccdff55c4b261ec16a38453a">One of the most popular trade-offs in the literature is the negative correlation between rigid word order and case morphology. Languages tend to use either explicit case marking (e.g. Latin or Lithuanian) or rigid word order (e.g. English or Mandarin Chinese). This correlation has been interpreted as a trade-off between different complexity types (SINNEMÄKI, 2014<xref id="xref-9dca323148de1f3ed10c981b7f3647ab" ref-type="bibr" rid="chapter-ref-e8ebaa902477b97d8f8e0a56137d9a5b">[23]</xref>). The correlation itself is uncontroversial. What many studies of it leave unclear, however, is which costs rigid or flexible word order entails for a language user, and whether either order can also offer any benefits (FENK-OCZLON; FENK, 2008<xref id="xref-975bb4aa03b285e6b3bda9517b309dd1" ref-type="bibr" rid="chapter-ref-ec8ee13229e63366ee68d46cb30dd5e1">[21]</xref>).</p>
          <p id="paragraph-d4b51d842815c2ab189f24f7ba5b5c36">In research on linguistic complexity, it is believed that fixed word order in the domain of argument discrimination makes language more complex because it adds an extra constraint (e.g. SINNEMÄKI, 2008<xref id="xref-9045a88cc167f3d4529c05d84c4168b0" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>).<xref id="xref-ada4e1b2c3e422f39d17388d872cf84b" ref-type="fn" rid="footnote-b4c6d8bde6134416ea4b648fee4d2052">2</xref> At the same time, it can be argued that a language with some regularity and some freedom can be more difficult to acquire and process than either a language with random word order or a language with completely fixed word order. A similar operationalization of complexity is given in Gell-Mann (1995<xref id="xref-673a04e74ec90554675af0b95dd2c965" ref-type="bibr" rid="journal-article-ref-6af766fd270df706a873582cf32574d1">[24]</xref>), according to whom effective complexity can be high only in the region between total order and complete disorder. So, it is not clear whether languages with rigid word order are necessarily more complex than flexible languages, since the latter usually have a bias towards a certain order, e.g. Subject followed by Object (LEVSHINA, 2019<xref id="xref-59c1295a26f3c8ad3b70d845482fd46f" ref-type="bibr" rid="journal-article-ref-001cd535d8a43913429703db6591e79e">[25]</xref>). They may also have additional rules, which require the non-dominant order (e.g. Object followed by Subject) in specific contexts. These rules will increase the complexity. On the other hand, completely rigid word order is rare, as well. Word order flexibility is a gradient phenomenon, and we need a better understanding of how this gradience should be reflected by the metrics of linguistic complexity.</p>
          <p id="paragraph-2533ed416b3890ac6d43e140c1544160">If we speak about the costs and benefits of word order variability for language users, rather than the abstract complexity of a linguistic system, the picture does not become much clearer. First of all, rigid word order has benefits for the addressee in the sense that it facilitates the assignment of syntactic roles to sentence elements (FENK-OCZLON; FENK, 2008<xref id="xref-e0bc57224301bcce7ff94ba40ac95718" ref-type="bibr" rid="chapter-ref-ec8ee13229e63366ee68d46cb30dd5e1">[21]</xref>). Similarly, according to Hale’s (2006<xref id="xref-a327d89beccd4b28b2108b2b3d32976f" ref-type="bibr" rid="journal-article-ref-50b08f0536ba5bd8e6b247d1bd407be7">[26]</xref>) entropy reduction hypothesis, the difficulty of processing a sentence depends on the number of bits conveyed by each successive word. If word order is free, it may be more difficult to predict the next word, and the processing effort will be higher. Therefore, fixed order can be less costly, after all, if we take into account the addressee’s interests.</p>
          <p id="paragraph-610e588924719056dd8e36e8a5d4ec6f">At the same time, fixed word order has some side effects. In particular, it is less suited to managing information flow, which flexible word order can achieve, e.g. by fronting the topic or placing backgrounded information at the very end of a sentence. If this variation is not allowed by grammar, language users will need to use additional markers in order to convey this pragmatic information, such as <italic id="italic-5eee179cb760a859ac827ffd8ab507c8">it</italic>-clefts, e.g. <italic id="italic-03f42e8915c7f30c2d5fae9b71338b9c">It is John who Mary loves</italic>. This creates additional articulation costs. Rigid word order also allows for fewer options in minimizing the distances between dependent and head words, which can make sentences more costly, both for the speaker and the addressee, by increasing memory and integration costs.</p>
          <p id="paragraph-88ffb68b07639abaa85ee0f0e1d418db">To summarize, upon closer inspection, the famous trade-off between word order and morphology falls apart into a web of diverse interests of the speaker and the addressee. The interests of the language learner are yet another important aspect, which requires further research. </p>
          <p id="paragraph-d9f98a450c80d7cf409b162e3cd6486f">In the lexicon, one can mention a trade-off between cognitive and communicative costs discussed by Kemp, Xu and Regier (2018<xref id="xref-03ba51f64722f53c0159d356eb60c80c" ref-type="bibr" rid="conference-paper-ref-d5e89324cb1943762bb58b5830baf79c">[27]</xref>). If a language has a large vocabulary with fine-grained distinctions in a particular domain, the cognitive costs of maintaining such a vocabulary are high. For example, detailed systems of kinship terms or colour terms are more costly than simple ones in that regard. The communicative costs occur when the speaker does not deliver her message with enough precision. For example, a hearer of the word “aunt” cannot tell whether the father’s or the mother’s sister is meant. Basically, these costs represent the risk of potential miscommunication.<xref id="xref-e434bd9cbd1f31d110a623135108ba87" ref-type="fn" rid="footnote-c4ab2c87e92ee1a88494702779155d3b">3</xref> Using computational modelling, Kemp <italic id="italic-bd75c12b11d82761a40d7dda12893fb7">et al. </italic>show that these two types of costs correlate negatively in real languages. There are systems with high cognitive costs but low communicative costs (e.g. detailed kinship term systems, as in Northern Paiute, an indigenous language of northern California) and systems with low cognitive costs and high communicative costs (kinship term systems with fewer distinctions, as in English). There are no systems in which both costs are high or both are low, so all languages are located close to a Pareto frontier.</p>
          <p id="paragraph-8be962c622387731411d8fbad8679f5a">This account leaves many questions open. Is a “simple” language less cognitively costly because it is easy to learn for L1 and L2 speakers? It may also be that users of a “simple” language spend less effort on retrieving words from long-term memory, because the few words in the vocabulary are more easily accessible due to their high frequency, or because there is simply less competition between the words. Do communicative costs include the articulation costs of using longer periphrastic expressions, such as “my father’s sister”, in a cognitively simple system? Which of these potential costs weigh more and which weigh less? A full-fledged efficiency account would require all these details.</p>
        </sec>
        <sec id="heading-7f03c9d1fa0e236f0bddb75641577462">
          <title>1.2. The problems of similar functions and rational choice</title>
          <p id="paragraph-c1703ffd146de7420ab46819e5aa05e0">From a mathematical perspective, a trade-off represents a negative correlation. In principle, every negative correlation can be regarded as a trade-off in a very abstract sense: if one quantity decreases, then the other increases, and the other way round. But if we want to appeal to the principle of efficiency, we should assume that a presumed trade-off is a result of rational choices made by language users. If the condition of free choice is not met, it is better to speak of a negative correlation, in order to avoid confusion.</p>
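          <p id="paragraph-correlation-sketch">The abstract, mathematical sense of a negative correlation mentioned above can be illustrated with a short sketch in Python. It computes a Pearson correlation coefficient for two invented variables; the data, the variable names and the function name <monospace>pearson_r</monospace> are hypothetical and serve only as an illustration. Note that a negative coefficient by itself says nothing about whether language users choose freely between two options.</p>
          <code id="code-correlation-sketch" language="python"><![CDATA[
```python
from math import sqrt
from typing import Sequence


def pearson_r(xs: Sequence[float], ys: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations from the means (unnormalized covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Invented scores for six imaginary languages on two variables.
variable_a = [0.9, 0.8, 0.6, 0.5, 0.3, 0.1]
variable_b = [0.1, 0.3, 0.4, 0.6, 0.7, 0.9]

r = pearson_r(variable_a, variable_b)
print(round(r, 2))  # a strongly negative value for these toy data
```
]]></code>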
          <p id="paragraph-b00f1f15d089642b3ac08ffb13bbd331">It follows that a trade-off can only hold between functionally related linguistic variables which help to solve one and the same task, or hinder its accomplishment (SINNEMÄKI, 2008<xref id="xref-e13755f68a9943477f39a0c0176051ef" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>). An example is provided in Section 2, which discusses the cues that help us identify the subject and object of a sentence. Negative correlations between randomly selected linguistic variables, e.g. the number of possible syllables in a language and the level of inflectional synthesis (SHOSTED, 2006<xref id="xref-9df8021b39da2aefe95780ca3f462b7a" ref-type="bibr" rid="journal-article-ref-f2937d26a4be28b5fd1e26fdde72db09">[15]</xref>), are difficult to interpret as trade-offs.</p>
          <p id="paragraph-0137ef59520d7d92ecb1e0938bc9947f">Since trade-offs should involve rational choices, these choices should be available for both types of costs involved in a potential trade-off. To give a simple example, one can indulge in instant gratification, spending all one’s money now on pleasant things and having nothing for tomorrow, or one can save money for a rainy day but have a less enjoyable life now. The choice is free in both directions. Many correlations in the literature, however, do not fulfil this criterion. This means that they are not true trade-offs in the sense defined here.</p>
          <p id="paragraph-b1e6d2795b0459f9caeb5fe84ed4d7e1">Probably the most important negative correlation in communicative efficiency research is the one between context and amount of information encoded by the speaker in a message (ARIEL, 2014). Context can be defined as everything that belongs to the common ground shared by the speaker and the addressee (CLARK, 1996<xref id="xref-1e443ab8f80351b627859293c8f39c5e" ref-type="bibr" rid="book-ref-9520cf6dbd5f589b523de44303c4a977">[28]</xref>). Common ground includes preceding linguistic context, beliefs about the communities the interlocutors belong to, and information about the physical context and common past experience. There is ample evidence that common ground leads to shorter referential expressions used by interlocutors and in general shorter exchanges (e.g. CLARK; WILKES-GIBBS, 1986<xref id="xref-2145ce3f5ad3f5725bebf5bea7c0d2d9" ref-type="bibr" rid="journal-article-ref-db730c42323ab588ca9ede7908be203f">[10]</xref>). Ariel’s (1990<xref id="xref-40e953320a137548a09f13aa8d8f4cab" ref-type="bibr" rid="book-ref-a528d84f01c66ebd97a9cb268500059c">[11]</xref>) Accessibility Theory can be regarded as a correlation between context and coding length: there is a tendency for more accessible referents to be expressed by shorter forms (e.g. pronouns or zero expression) than less accessible ones, which are expressed by longer forms (e.g. noun phrases). </p>
          <p id="paragraph-5f25c3a9cf7405e1949665c10c1453ae">Zipf’s law of abbreviation, which says that frequent words tend to be shorter than infrequent words (ZIPF, 1965[1935]<xref id="xref-cbc952ae372677621314eef548b8bfb9" ref-type="bibr" rid="book-ref-d28c30708ce2ef900ffe240bef9d52da">[6]</xref>), can also be interpreted as a negative correlation between coding length and ease of access due to the high resting activation of frequent words. More recently, Piantadosi, Tily and Gibson (2011<xref id="xref-a0bd5fca1cdc83395c4760f21758caa5" ref-type="bibr" rid="journal-article-ref-9fb8eea3193397b4abdb5c11e8b6960e">[29]</xref>) have shown that the correlations between n-gram-based predictability and word length are stronger than those between frequency and length. In phonology, there is ample evidence that words and segments that are more predictable undergo phonetic reduction more frequently than less predictable units (JAEGER; BUZ, 2017<xref id="xref-4819aac7601c3f4b1295c5a5d1c51915" ref-type="bibr" rid="chapter-ref-ddbd3703f78d87570e3557ff2df7bddf">[9]</xref>). In grammar, this correlation can be found in markedness phenomena. Greenberg (1966a<xref id="xref-af084c0b4165bc26a1c98fa9f148a4d9" ref-type="bibr" rid="book-ref-a7376fbbbc60088d4868763944debdf4">[30]</xref>) was the first to show systematically that more frequent categories (e.g. singular and present tense) are expressed by unmarked forms, while the less frequent ones (e.g. plural and future tense) are expressed by marked forms. This has been explained by the tendency to provide less formal marking to more predictable categories (e.g. singular), and more marking to less predictable ones (e.g. 
plural) (HASPELMATH, 2008<xref id="xref-5f934cdee36a7844da9fbd96825c7201" ref-type="bibr" rid="chapter-ref-62d59b59ccc10110fe18274c31fbfc20">[31]</xref>; 2014<xref id="xref-408a6bd98088c227dafeac7d5dd0a228" ref-type="bibr" rid="chapter-ref-44fdb479d44a5ef83e7c9f59b486447f">[18]</xref>).<xref id="xref-0c31e08a70b1958649be620e6023c0ef" ref-type="fn" rid="footnote-891bb52a916ab99c30abf76e323f621e">4</xref> Here one can also mention the efficient use of optional markers, e.g. complementizer “that” (JAEGER, 2010<xref id="xref-8fb8e565e02671e22c0efb82da60a466" ref-type="bibr" rid="journal-article-ref-df43bca01b9e8bda570ccb73a515463c">[32]</xref>) and the Japanese object marker <italic id="italic-04b34fb5d30cc8fff473b2f4c3cd2827">-o</italic> (KURUMADA; JAEGER, 2015<xref id="xref-0404ea11c2e9de8f9fbb25af8880fdcd" ref-type="bibr" rid="journal-article-ref-6fd1d958476e69f443ef027d0571c37e">[33]</xref>). The markers are used more frequently in the situations where the grammatical role of the marked element is less predictable based on world knowledge or linguistic experience. </p>
          <p id="paragraph-b563f8fabc4dc9a8529231e3a9fbf610">Thus, there is convincing evidence of a negative correlation between the amount of linguistic encoding and the accessibility of information from context in a very broad sense. Can one call it an efficient trade-off? Not really. The reason is that the two variables cannot be freely traded against each other. The ease of access is determined by common ground or other factors. It is something given. A language user adjusts the amount of coding to the ease of access given in the situation, but cannot adjust the ease of access to the amount of coding they want to use.<xref id="xref-4c670489f8596cf858f28ef36cba298d" ref-type="fn" rid="footnote-858a826cd69b07c3619c60be47678b3e">5</xref></p>
          <p id="paragraph-924b8cc071ab9b547bf145ee7b290d45">In Section 1.1 we discussed the negative correlation between rigid word order and case morphology. In their large-scale study, Koplenig <italic id="italic-ab4dc98907fecdc7ab268480e2005198">et al.</italic> (2017<xref id="xref-8ec4b6fd39c2cdab4c244dcb34193893" ref-type="bibr" rid="journal-article-ref-791e1991a19bf0c635563d094595166c">[16]</xref>) speak about a general trade-off between information carried by word order and information carried by word-internal structure, measured with the help of information-theoretic concepts. The almost 1000 languages in their sample reveal a clear negative correlation. Isolating languages with high scores on information conveyed by word order, such as Mandarin Chinese, have low scores on information carried by word structure, while polysynthetic languages like Greenlandic Inuktitut or Ojibwa have low word order scores and high word structure scores. Koplenig <italic id="italic-583b040150a40d88fe5db37f9706cc16">et al.</italic> argue that this trade-off is efficient:</p>
          <p id="paragraph-f1d80a323333cab2ad594bc7789cdf80" />
          <disp-quote id="block-quote-74d1e71b0297a9635f1be5b72087d79a">
            <p id="paragraph-3f3b0cd74033295126ecca5c2938fde7">If, for example, grammatical relationships in a sentence are fully determined by the ordering of words, it would constitute unnecessary cognitive effort to additionally encode this information with intra-lexical regularities. If, however, word ordering gives rise to some extent of grammatical ambiguity, we should expect this ambiguity to be cleared up with the help of word structure regularities in order to avoid unsuccessful transmission. (KOPLENIG <italic id="italic-20b6492f6d982d1174d8e51e0189d561">et al.</italic>, 2017, p. 4<xref id="xref-5380fc27dcc8fed6de4d361669634560" ref-type="bibr" rid="journal-article-ref-791e1991a19bf0c635563d094595166c">[16]</xref>)</p>
          </disp-quote>
          <p id="paragraph-efe438ebfc77af427ae05cfd0f1a5d22" />
          <p id="paragraph-b81d44ae832aaba00b57e4c3dd7e9b03">It follows from this that fixed word order triggers the loss of morphological complexity. What explains the emergence of fixed word order, however, is not clear. Therefore, this relationship seems to be unidirectional and cannot be regarded as a trade-off in the proper sense.</p>
          <p id="paragraph-a062598be49bc8d93be3706efcd1b4b8" />
        </sec>
        <sec id="heading-d03cf06731bef6dfb23cbe4008af001b">
          <title>1.3. The problem of multiple factors </title>
          <p id="paragraph-b10c92f202d2c270b58bfa5f98b2d403" />
          <p id="paragraph-265ca6de611eab4e3043737ee9b25c9d">The trade-offs discussed in the literature are usually binary (but see FENK-OCZLON; FENK, 2008<xref id="xref-f453cda05ab1943a4d5e1e1f7419f6ca" ref-type="bibr" rid="chapter-ref-ec8ee13229e63366ee68d46cb30dd5e1">[21]</xref>; SINNEMÄKI, 2008<xref id="xref-428eb559145b397e884955a88cf21236" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>). However, there is always a chance that the relationship can change dramatically if other relevant factors are taken into account. </p>
          <p id="paragraph-755d12b04d2c2c54ba033fb8ca15fd46">To illustrate this point, let us discuss Zipf’s (1949<xref id="xref-bba43eb6080e1d5d0c512d32aa9a4fce" ref-type="bibr" rid="book-ref-65d624b065120baee5f74dd94f50e477">[3]</xref>) famous idea of two opposing forces: the Force of Unification and the Force of Diversification. The Force of Unification represents the speaker’s economy: in the ideal case, the speaker only has one word that covers all meanings. There is no need to spend effort in order to choose between words (this is known as paradigmatic economy). The Force of Diversification represents the addressee’s economy: there should be a specific word for each meaning that can be verbalized. A balance between these two forces leads to a compromise: human languages have a small convenient vocabulary of more general reference, and a large vocabulary of more precise reference. The famous Zipf’s law (1949<xref id="xref-ac6c97f5a5c0b5920a6304be3a3449ed" ref-type="bibr" rid="book-ref-65d624b065120baee5f74dd94f50e477">[3]</xref>), which posits a negative correlation between the frequency of a word and its rank, is evidence for such a vocabulary balance. </p>
          <p id="paragraph-2a8ae367bdb5dfdbc76412b69c0984db">Although Zipf’s law is a well-established empirical fact, the trade-off between the speaker’s and the addressee’s interests is not unproblematic. In particular, Ariel (2014<xref id="xref-cab45e076bc1a22c7ccadcace167ad65" ref-type="bibr" rid="chapter-ref-2e3db0b9ce45bb38894b4eefe35ee907">[34]</xref>) argues that highly polysemous constructions, in which the meaning has to be inferred, have greater support from context (preceding discourse, non-linguistic information present in the common ground, etc.) than monosemous constructions. In fact, Piantadosi, Tily and Gibson (2012<xref id="xref-44228edad663c05601162ab8c0c5d2c7" ref-type="bibr" rid="journal-article-ref-4813ee74ddc83948a2f342fb2c4cedfa">[35]</xref>) argue that all efficient communication systems should be ambiguous, provided that there is sufficient context to help infer the meaning. This means that another trade-off comes into play, namely the one between encoded information and common ground/accessibility, which was discussed in Section 1.2. In normal communication, less encoding therefore means that the speaker considers the contextual cues to be sufficient for the addressee to understand the message. For example, a referent that has been recently introduced can be encoded by a shorter pronominal form or omitted altogether. The contextual cues help the addressee to infer the information even if the verbal expression is ambiguous or vague, e.g. asking “Is there a bank near here?” after hearing that the store does not accept cards. Therefore, Zipf’s proposal can only hold if we control for the amount of available context. Obviously, this is impossible to do in realistic settings. So, one may ask whether Zipf’s law is indeed explained by this trade-off between the Forces of Unification and Diversification. A more likely cause is the high accessibility of frequent forms, which can be easily extended to new contexts (HARMON; KAPATSINSKY, 2017<xref id="xref-4b574b17e9639c363a3da95c149e2626" ref-type="bibr" rid="journal-article-ref-d1fdb0aa9e5db35eacd9437570410c3b">[36]</xref>). </p>
          <p id="paragraph-34a015d947f25cefba095117dfdf1de9">Another problematic case is the negative correlation between memory costs and articulatory costs formulated by Martinet (1963, p. 165<xref id="xref-8cb80b9b9e6a57e018033a00645ba9b7" ref-type="bibr" rid="book-ref-d17216898ce38ded03d9cc94000cc7c2">[37]</xref>). For example, the verb “enlarge” is less accessible but more compact than a periphrastic expression “make bigger”, which consists of more accessible elements but is longer. The claim that easily accessible periphrastic expressions have higher articulatory costs is not immediately convincing, however, because words that are easier to access are more frequent, and, as we know from Zipf’s (1965[1935]<xref id="xref-41fdbd39454742cdbe93a411d6d736e6" ref-type="bibr" rid="book-ref-d28c30708ce2ef900ffe240bef9d52da">[6]</xref>) law of abbreviation, frequent words tend to be shorter and therefore easier to articulate. Unfortunately, the total length of the same message in formal and informal language is difficult to evaluate because we do not have parallel register-to-register corpora yet, so Martinet’s claim remains a hypothesis. </p>
          <p id="paragraph-d1bfdfc1decab4d1ed22e2bd86d8293a" />
        </sec>
        <sec id="heading-ba6da5e9eb0dee4c6dc20b2021a2b866">
          <title>1.4. Positive correlations and synergy instead of competition</title>
          <p id="paragraph-0812fc739649e8afe8030eba1d97b771" />
          <p id="paragraph-93fda959f70d2e578315a2f95a9b3c5f">Pareto efficiency means that different types of costs should be negatively correlated. However, in reality linguistic variables representing costs or benefits can be positively correlated, as well. For example, creole languages have low complexity across multiple domains (phonology, morphology and syntax), while ‘old’ languages have high complexity across the same domains (MCWHORTER, 2001<xref id="xref-364d94a7f8ab094fb773f93b63c79f1d" ref-type="bibr" rid="journal-article-ref-7b7f5199239fb21748aac0efa093790a">[38]</xref>). This means that domain-specific costs for language learners can be positively correlated, as well as articulatory costs for speakers, if we focus on obligatory grammatical marking, for example. </p>
          <p id="paragraph-26965742a698f16b91a0ee0bf298e9aa">Moreover, different cues can even have a synergetic effect. One might expect that, when expressing and interpreting a message, a single modality of communication should be easier to process than several. In spoken languages, a message is transmitted via two major modalities: the auditory signal and visual signals, which are produced by the head, face, hands, arms and torso. Some of these signals are relevant and others are not, which means that extra effort is needed to distinguish between them, especially under the time constraints of spontaneous interaction with quick turn-taking. One would therefore expect that processing one modality comes at the cost of the other. However, this is not what we see. There is evidence that interlocutors respond faster to questions that are accompanied by a manual and/or head gesture than to questions without such visual components (HOLLER; KENDRICK; LEVINSON, 2018<xref id="xref-0ec46fcb4bfb17d631bbb705b90a45b7" ref-type="bibr" rid="journal-article-ref-6c1585352d63049deef51772dd6eff76">[39]</xref>). In fact, Holler and Levinson (2019<xref id="xref-013ab09f452cb727dc0fe95b2b23009f" ref-type="bibr" rid="journal-article-ref-7c4744ea966dda7e2bc6c89395ed8b57">[40]</xref>) argue that multimodal information is easier to process than unimodal – that is, only visual or only auditory – information because visual bodily signals may reduce uncertainty at the message level. Humans are good at creating multimodal Gestalts as a result of message unification. Consequently, different costs have a synergetic effect. Communication is therefore not Pareto-efficient. </p>
          <p id="paragraph-fc571e7fdb2a7f25b35acf2b99375ad6" />
        </sec>
      </sec>
      <sec id="heading-e3731095b082738944b12d362043ee0e">
        <title>2. A case study: different cues in expressing subject and object</title>
        <p id="paragraph-7aee7c230c1700f55a7255bae576914f" />
        <sec id="heading-d39cbee1ea63339613a3ee660e845287">
          <title>2.1. Theoretical background and previous research</title>
          <p id="paragraph-7a6018a0c5cfc8794787cbded69b41a1" />
          <p id="paragraph-194b4f62deb5da4a0a714419097a665a">This section investigates the relationships between different cues which can help to communicate “who did what to whom”. One type of cue consists of formal markers, including case marking and agreement. Another type is fixed word order, which can help to identify the thematic roles of the constituents (e.g. SAPIR, 1921<xref id="xref-cc382c97b0680315609a384519e8ca50" ref-type="bibr" rid="book-ref-b5e1a87275c9a15db8cff9ec7745ec4e">[41]</xref>). The position of the verb can be another cue. It is believed that it is easier to process the sentence and infer the roles when the verb is in the medial position between the subject and the object:</p>
          <disp-quote id="block-quote-0a31fde9166a56103e6bd1c6716c136b">
            <p id="paragraph-9685ec918180f4983fe2362dc4db74b7">[V]erb position is the particular vehicle which most conveniently enables these basic grammatical relations to be expressed by means of word order: the subject occurs to the immediate left, and the object to the immediate right of the verb. I.e. the verb acts as an anchor (HAWKINS, 1986, pp. 48-49<xref id="xref-fcd802f4516a4c021b3a9f6e06eec047" ref-type="bibr" rid="book-ref-9bb5eb7f2364e1dff59d04ea918ea363">[42]</xref>)</p>
          </disp-quote>
          <p id="paragraph-f1556a815df05e98ee481d8b7d314aed">There is experimental evidence that participants tend to avoid SOV in favour of SVO when describing reversible transitive events in pantomime, that is, events in which either participant can be subject or object, such as “The mother hugs the boy” and “The boy hugs the mother” (HALL; MAYBERRY; FERREIRA, 2013<xref id="xref-7c5cd527d8a84205487a46f90d177824" ref-type="bibr" rid="journal-article-ref-2eb39c5f8b88979d4016ea9c87907f7b">[43]</xref>). This can be interpreted as evidence that verb-medial order indeed helps to identify the roles. </p>
          <p id="paragraph-dd885cfe0aa32b8019dbad30e68cb595">There is another reason why the position of the verb in the middle is beneficial for language processing. The sum of the distances from the head verb to the subject and the object is smallest when the verb is between subject and object (FERRER-I-CANCHO, 2017<xref id="xref-232e5a926b646ef67a7c7f1750a45e0f" ref-type="bibr" rid="journal-article-ref-011c3c4e5e78320938e51f4472ad4f5b">[44]</xref>), which reduces the processing load. </p>
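          <p>The arithmetic behind this claim can be sketched as follows. This is only an illustration under simplifying assumptions, not the cited author’s own computation: the clause consists of exactly three words, subject and object both depend directly on the verb, and distance is the absolute difference between word positions. The function name <monospace>summed_dependency_distance</monospace> is hypothetical.</p>
          <code language="python">
```python
from itertools import permutations


def summed_dependency_distance(order):
    """Sum of |position(V) - position(dependent)| for S and O in a three-word clause."""
    verb_position = order.index("V")
    return sum(abs(verb_position - order.index(arg)) for arg in ("S", "O"))


# All six logically possible orders of subject (S), verb (V) and object (O).
distances = {
    "".join(order): summed_dependency_distance(order)
    for order in permutations("SVO")
}

# Print the orders from the smallest to the largest summed distance.
for order, total in sorted(distances.items(), key=lambda item: item[1]):
    print(order, total)
```
          </code>
          <p>Under these assumptions, the two verb-medial orders (SVO and OVS) have a summed distance of 2, whereas the verb-initial and verb-final orders have a summed distance of 3.</p>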
          <p id="paragraph-8987b7f279af5f4160bafb2dfb0d59bd">Finally, we should not underestimate the role of semantics and encyclopaedic knowledge. In most situations, it is a dog that bites a man or a police officer who captures a thief, and not the other way round. This information can be important for the use of the cues. For example, there is a correlation between the predictability of events and the use of overt object marking in Japanese (KURUMADA; JAEGER, 2015<xref id="xref-623b76a9a105ecb08bfc3af73c1cdf5d" ref-type="bibr" rid="journal-article-ref-6fd1d958476e69f443ef027d0571c37e">[33]</xref>). Abstract referential features, such as animacy and identifiability, play an important role in differential marking, as in Spanish or Hebrew, and in probabilistic case marker use, as in Korean (LEE, 2009). There is a negative correlation between predictability and marking, which can be explained by efficiency considerations (JÄGER, 2007<xref id="xref-2120e906ae125cf38828a0b5288ebd91" ref-type="bibr" rid="journal-article-ref-a67580b9399df1a0cd680e69b1c08c3c">[45]</xref>; LEVSHINA, 2018<xref id="xref-cf499fd66870ecaee1ee198d673fa2e1" ref-type="bibr" rid="journal-article-ref-a797264c5718372e1d2e6ca89e015c1e">[14]</xref>).</p>
          <p id="paragraph-95b48a3a015cae2e40983f1708a07f0d">If the idea of efficient trade-offs is correct, we can expect negative correlations between all these cues (cf. SINNEMÄKI, 2008<xref id="xref-7de16ac1c6354e6700e01fc078e9ab3d" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>). Previous quantitative studies have shown a negative correlation between argument marking and rigid word order (SINNEMÄKI, 2014<xref id="xref-dc6936d469a66226aaa046c656280607" ref-type="bibr" rid="chapter-ref-e8ebaa902477b97d8f8e0a56137d9a5b">[23]</xref>), as well as an association between zero argument marking and verb-medial order (SINNEMÄKI, 2010<xref id="xref-d5bfa2951288a9dbe364683cd121f5bc" ref-type="bibr" rid="journal-article-ref-6d1fd7211fd35c932d578968e5f1a128">[46]</xref>). The correlation between the final position of the verb and case marking is well known as Greenberg’s (1966b<xref id="xref-b5147026a02aa2e9770b6af8157b1d62" ref-type="bibr" rid="chapter-ref-4884f33a225a87701faf35a510077d9f">[47]</xref>) Universal 41: “If in a language the verb follows both the nominal subject and nominal object as the dominant order, the language almost always has a case system”. However, the three parameters have never been investigated simultaneously. Moreover, for the first time, these parameters will be estimated from corpora rather than from grammars, as was done in the previous studies. As will become clear, the parameters are gradient and should be treated as continuous variables. I will first present a series of pairwise correlations between these parameters. It will be shown that taking the third variable into account can change the picture significantly, which means that the idea of studying trade-offs between only two variables is very questionable. The correlational analyses will allow us to formulate a hypothesis about the relationships between all three cues, which will be tested in a causal analysis.</p>
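          <p>To see how a third variable can change a pairwise picture, consider the first-order partial correlation, a standard textbook statistic. The sketch below is not necessarily the analysis used in this study; the function name <monospace>partial_correlation</monospace> is hypothetical, and the three input correlations are invented purely for illustration.</p>
          <code language="python">
```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, controlling for z."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return numerator / denominator


# Hypothetical pairwise correlations between three cues x, y and z.
# These numbers are invented; they are not results from any corpus.
raw = -0.50
controlled = partial_correlation(r_xy=raw, r_xz=0.70, r_yz=-0.70)
print(round(raw, 2), round(controlled, 2))
```
          </code>
          <p>With these invented values, a moderate negative correlation between x and y almost disappears once z is held constant, because both variables are strongly related to z.</p>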
          <p id="paragraph-3fcc0c91d678de0d3fae6fe29d20a14e" />
        </sec>
        <sec id="heading-cb252edb07d3ad84d06f9a9d8eeb81f2">
          <title>2.2. Data</title>
          <p id="paragraph-016718741cc26f8c9d8c2159664479bf" />
          <p id="paragraph-859cb6af8262de1d9d6cb0477af39637">The language sample used for the present study includes thirty languages, which are listed in Table 1. The choice of languages was determined by the availability of sufficient data. Two sources were used: the Universal Dependencies (UD) corpora, version 2.6 (ZEMAN <italic id="italic-34c1518db755a1e4229d3318e0a4e791">et al.</italic>, 2020<xref id="xref-928f96412b4f41ce4f74cd3031a15258" ref-type="bibr" rid="conference-paper-ref-2a8bb47251d804239ff040e81a1ded45">[48]</xref>)<xref id="xref-5573a7334264194bf1c3185b034b94f6" ref-type="fn" rid="footnote-0e078adc388750867cf5138d6eacc3fe">6</xref> and online news corpora of 1 million sentences from the Leipzig Corpora Collection (GOLDHAHN; ECKART; QUASTHOFF, 2012<xref id="xref-338fe75b6e69d19388d83089a66973d5" ref-type="bibr" rid="chapter-ref-550cb17016e61c56e900a38d47141862">[49]</xref>)<xref id="xref-0be057fb39001c4d78bc8e2818f54245" ref-type="fn" rid="footnote-5ed438c01750757a0ce0e17df030eac0">7</xref>. These two different collections were used in order to ensure that our results are not due to register bias, since the UD corpora represent very diverse types of texts. Also, some UD corpora are very small. As will be demonstrated, the correlations between the parameters based on each type of data are very high, which gives us confidence in the results. </p>
          <p id="paragraph-56ef5f6fcd0785efdde5ba098925080c">In the online news corpora, each language is represented by one million sentences (categories “news” and “newscrawl”), which are provided in random order. The sentences were tokenized, lemmatized and morphologically and syntactically annotated with the help of the UD corpus tools in the R package udpipe (WIJFFELS, 2020<xref id="xref-0e1ff79c19a4dcde066c55d0a635ac6b" ref-type="bibr" rid="software-ref-225fd936372f47383befab0b9324a500">[50]</xref>). The language models, which were trained on the UD corpora, provide, among other things, universal part-of-speech tags and dependency relations, which can be compared across different languages. This is crucial for the purposes of the present study.</p>
          <table-wrap id="table-figure-8aedc0ad5eae1947572a1ffaa53407f4">
            <label>Table 1</label>
            <caption>
              <title>Languages, UD corpora and language models used in the case study</title>
              <p id="paragraph-3a144955d05cf86ea56a604aaf182dbd" />
            </caption>
            <table id="table-7f31cb8b2709c3cbd8a822fb1add9bb6">
              <tbody>
                <tr id="table-row-cc5e416ab630613b2602a8c215459550">
                  <th id="table-cell-fc1c5b53dbdd480cecaaaab600643684">Language</th>
                  <th id="table-cell-95afc1b09442d6e52478768645a98c16">ISO 639-3</th>
                  <th id="table-cell-81f480ea2c5c3d035788f792b5cfdcee">Genus</th>
                  <th id="table-cell-c3f303b5dd35e4f8d90dc6929f3149bb">Family</th>
                  <th id="table-cell-59394acabc9649d1cf93049989ed7ead">UD corpus</th>
                  <th id="table-cell-9cdcfb8f9922213d2f16b3f9f769f804">UD model</th>
                </tr>
                <tr id="table-row-b22d17796f52860184e52ad7bd322a88">
                  <td id="table-cell-644ac7ba6578d7efb1bee46ffb2cbc47">Arabic</td>
                  <td id="table-cell-f7f5ec40dc3ec2bdab989eafda9820d7">ara</td>
                  <td id="table-cell-8231ed395060907cd93be24a5edda9c5">Semitic</td>
                  <td id="table-cell-be249b8736b48eac2f1e79f788e72994">Afro-Asiatic</td>
                  <td id="table-cell-51d62860b3a117afdd5724ee975d3a7a">ar_padt</td>
                  <td id="table-cell-edb87fc38e2dad9929341b223ca7a2c3">arabic-padt-ud-2.4</td>
                </tr>
                <tr><td>Bulgarian</td><td>bul</td><td>Slavic</td><td>Indo-European</td><td>bg_btb</td><td>bulgarian-btb-ud-2.4</td></tr>
                <tr><td>Croatian</td><td>hrv</td><td>Slavic</td><td>Indo-European</td><td>hr_set</td><td>croatian-set-ud-2.4</td></tr>
                <tr><td>Czech</td><td>ces</td><td>Slavic</td><td>Indo-European</td><td>cs_pdt</td><td>czech-pdt-ud-2.4</td></tr>
                <tr><td>Danish</td><td>dan</td><td>Germanic</td><td>Indo-European</td><td>da_ddt</td><td>danish-ddt-ud-2.4</td></tr>
                <tr><td>Dutch</td><td>nld</td><td>Germanic</td><td>Indo-European</td><td>nl_alpino</td><td>dutch-alpino-ud-2.4</td></tr>
                <tr><td>English</td><td>eng</td><td>Germanic</td><td>Indo-European</td><td>en_ewt</td><td>english-ewt-ud-2.4</td></tr>
                <tr><td>Estonian</td><td>est</td><td>Finnic</td><td>Uralic</td><td>et_edt</td><td>estonian-edt-ud-2.4</td></tr>
                <tr><td>Finnish</td><td>fin</td><td>Finnic</td><td>Uralic</td><td>fi_tdt</td><td>finnish-tdt-ud-2.4</td></tr>
                <tr><td>French</td><td>fra</td><td>Romance</td><td>Indo-European</td><td>fr_gsd</td><td>french-gsd-ud-2.4</td></tr>
                <tr><td>German</td><td>deu</td><td>Germanic</td><td>Indo-European</td><td>de_gsd</td><td>german-gsd-ud-2.4</td></tr>
                <tr><td>Greek (modern)</td><td>ell</td><td>Greek</td><td>Indo-European</td><td>el_gdt</td><td>greek-gdt-ud-2.4</td></tr>
                <tr><td>Hindi</td><td>hin</td><td>Indic</td><td>Indo-European</td><td>hi_hdtb</td><td>hindi-hdtb-ud-2.4</td></tr>
                <tr><td>Hungarian</td><td>hun</td><td>Ugric</td><td>Uralic</td><td>hu_szeged</td><td>hungarian-szeged-ud-2.4</td></tr>
                <tr><td>Indonesian</td><td>ind</td><td>Malayo-Sumbawan</td><td>Austronesian</td><td>id_gsd</td><td>indonesian-gsd-ud-2.4</td></tr>
                <tr><td>Italian</td><td>ita</td><td>Romance</td><td>Indo-European</td><td>it_isdt</td><td>italian-isdt-ud-2.4</td></tr>
                <tr><td>Japanese</td><td>jpn</td><td>Japanese</td><td>Japanese</td><td>ja_gsd</td><td>japanese-gsd-ud-2.4</td></tr>
                <tr><td>Korean</td><td>kor</td><td>Korean</td><td>Korean</td><td>ko_kaist</td><td>korean-gsd-ud-2.4</td></tr>
                <tr><td>Latvian</td><td>lav</td><td>Baltic</td><td>Indo-European</td><td>lv_lvtb</td><td>latvian-lvtb-ud-2.4</td></tr>
                <tr><td>Lithuanian</td><td>lit</td><td>Baltic</td><td>Indo-European</td><td>lt_alksnis</td><td>lithuanian-hse-ud-2.4</td></tr>
                <tr><td>Persian</td><td>pes</td><td>Iranian</td><td>Indo-European</td><td>fa_seraji</td><td>persian-seraji-ud-2.4</td></tr>
                <tr><td>Portuguese</td><td>por</td><td>Romance</td><td>Indo-European</td><td>pt_bosque</td><td>portuguese-bosque-ud-2.4</td></tr>
                <tr><td>Romanian</td><td>ron</td><td>Romance</td><td>Indo-European</td><td>ro_rrt</td><td>romanian-rrt-ud-2.4</td></tr>
                <tr><td>Russian</td><td>rus</td><td>Slavic</td><td>Indo-European</td><td>ru_syntagrus</td><td>russian-syntagrus-ud-2.4</td></tr>
                <tr><td>Slovenian</td><td>slv</td><td>Slavic</td><td>Indo-European</td><td>sl_ssj</td><td>slovenian-ssj-ud-2.4</td></tr>
                <tr><td>Spanish</td><td>spa</td><td>Romance</td><td>Indo-European</td><td>es_ancora</td><td>spanish-gsd-ud-2.4</td></tr>
                <tr><td>Swedish</td><td>swe</td><td>Germanic</td><td>Indo-European</td><td>sv_talbanken</td><td>swedish-talbanken-ud-2.4</td></tr>
                <tr><td>Tamil</td><td>tam</td><td>Southern Dravidian</td><td>Dravidian</td><td>ta_ttb</td><td>tamil-ttb-ud-2.4</td></tr>
                <tr><td>Turkish</td><td>tur</td><td>Turkic</td><td>Altaic</td><td>tr_imst</td><td>turkish-imst-ud-2.4</td></tr>
                <tr><td>Vietnamese</td><td>vie</td><td>Viet-Muong</td><td>Austro-Asiatic</td><td>vi_vtb</td><td>vietnamese-vtb-ud-2.4</td></tr>
              </tbody>
            </table>
          </table-wrap>
          <p id="paragraph-934a03045a3bb72ef7687e50104643e9" />
        </sec>
        <sec id="heading-e936055fb10700fb45ad5e9e06b7f2ed">
          <title>2.3. Variables</title>
        </sec>
      </sec>
    </sec>
    <sec id="heading-91c0ff14ca2f62d6129c2456f9a2d340">
      <title />
      <sec id="heading-ff5057391223b34d291ce64ef3eb185b">
        <title>2.3.1. Formal distinctness of Subject and Object (case marking) </title>
        <p id="paragraph-a066fcd0526625523e0fe23fb7275769" />
        <p id="paragraph-5cb427e97b384040e2c41aa6fb88c98f">Case marking was operationalized as distinctness of the forms representing transitive subject and object, following the token-based approach in Levshina (2019<xref id="xref-dc406556802db3d0c33f9bec5d088219" ref-type="bibr" rid="journal-article-ref-001cd535d8a43913429703db6591e79e">[25]</xref>). The new method can give us more precise information about how frequently case markers can help language users to distinguish between the main participants. This matters for languages with differential and optional case marking. For example, in Russian some nouns have different forms in the Nominative and Accusative (e.g. <italic id="italic-b40a014d457ea6a49811119fb6a2e7ae">devočk-a</italic> “girl-Nom” and <italic id="italic-dd677edc3dcbcabeea6cf1fd33f86806">devočk-u</italic> “girl-Acc”), while some nouns have identical forms (e.g. <italic id="italic-8dacb16c6735c4ac671a98808cd22c1c">stol</italic> “table” or <italic id="italic-aff06f482fdcdab9fe58568128a409bb">myš</italic> “mouse”). The question is, how frequently the forms are identical, and how frequently they are distinct. Similarly, some languages like Japanese and Korean have variable marking of subject and object with complex probabilistic rules. All this variability should be taken into account. </p>
        <p id="paragraph-2a0f9a7d881b7162b77ce63ced905b7b">At the moment, there is no reliable morphological annotation that could be used to compare the forms across many different languages. Formal distinctness was therefore approximated from the existing corpora in the following way. First, I extracted all nouns (wordforms in lower case and lemmas) with the universal syntactic dependency tags “nsubj” (nominal subject) and “obj” (object). In order to take into account languages like Spanish, where the object case marker <italic id="italic-ffc94c92640bd8c2d691a655d16246d9">a</italic> is a preposition, I also checked if the head noun had a syntactic dependency “case”, and merged the case marker with the noun, e.g. <italic id="italic-b4d563523dc5e966ab52295ddd161cda">a_mujer</italic> “woman.ACC”. Only non-plural forms were considered in order to exclude formal variation based on number. I do not expect this restriction to influence the results strongly because plural forms are less frequent than singular ones. For languages with articles written as one word with the nouns (Arabic, Bulgarian, Danish, Romanian and Swedish), subject and object forms were compared separately for definite and indefinite forms because it was too difficult to split them automatically. Indonesian possessive suffixes were not counted as part of wordforms.</p>
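        <p>The extraction step described above can be approximated by the following sketch, which reads one sentence in the ten-column CoNLL-U format used by UD. It is an illustration under stated assumptions rather than the code used in the study: the function name <monospace>extract_arguments</monospace> and the hand-annotated toy sentence are hypothetical, plural nouns are identified via the feature <monospace>Number=Plur</monospace>, and the sketch neither checks that a subject belongs to a transitive clause nor handles languages with fused articles.</p>
        <code language="python">
```python
# Toy CoNLL-U rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC)
# for Spanish "El niño ve a la mujer"; the annotation is hand-written for illustration.
ROWS = [
    ("1", "El", "el", "DET", "_", "Number=Sing", "2", "det", "_", "_"),
    ("2", "niño", "niño", "NOUN", "_", "Number=Sing", "3", "nsubj", "_", "_"),
    ("3", "ve", "ver", "VERB", "_", "Number=Sing", "0", "root", "_", "_"),
    ("4", "a", "a", "ADP", "_", "_", "6", "case", "_", "_"),
    ("5", "la", "el", "DET", "_", "Number=Sing", "6", "det", "_", "_"),
    ("6", "mujer", "mujer", "NOUN", "_", "Number=Sing", "3", "obj", "_", "_"),
]
SENTENCE = "\n".join("\t".join(row) for row in ROWS)


def extract_arguments(conllu_sentence):
    """Return (lemma, role, form) for non-plural common nouns tagged nsubj or obj."""
    tokens = [
        line.split("\t")
        for line in conllu_sentence.splitlines()
        if line and not line.startswith("#")
    ]
    # Collect adpositional case markers by the ID of the noun they attach to.
    case_markers = {}
    for tok in tokens:
        if tok[7] == "case":
            case_markers.setdefault(tok[6], []).append(tok[1].lower())
    arguments = []
    for tok in tokens:
        is_plural = "Number=Plur" in tok[5].split("|")
        if tok[3] == "NOUN" and tok[7] in ("nsubj", "obj") and not is_plural:
            # Merge any case marker with the lower-cased wordform, e.g. a_mujer.
            form = "_".join(case_markers.get(tok[0], []) + [tok[1].lower()])
            arguments.append((tok[2], tok[7], form))
    return arguments


print(extract_arguments(SENTENCE))
```
        </code>
        <p>For the toy sentence, the subject noun is returned with its bare form, while the object noun is returned with the preposition merged into the form, mirroring the <monospace>a_mujer</monospace> example.</p>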
        <p id="paragraph-62077cd276b785da38907d44c529d4e5">Next, for every lemma used as both transitive subject and object in the corpus, the subject and object forms were listed. One form was selected randomly to represent a subject form, and one form to represent an object form, and these forms were compared. The total number of lemmas with distinct forms was computed for each language. This number was weighted by the lemma frequency, so that frequent lemmas had more weight than rare ones. Finally, the distinctiveness scores were divided by the total token frequency of all lemmas that were analyzed. </p>
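        <p>A minimal sketch of this scoring procedure is given below. It reflects one possible reading of the description and is not the original implementation: the function name <monospace>distinctness_score</monospace> is hypothetical, lemma frequency is assumed to be the number of extracted subject and object tokens of that lemma, and the toy data reuse the Russian examples mentioned above with invented counts.</p>
        <code language="python">
```python
import random


def distinctness_score(observations, seed=0):
    """Frequency-weighted share of lemmas whose sampled subject and object forms differ.

    observations: iterable of (lemma, role, form) with role in {"nsubj", "obj"}.
    """
    rng = random.Random(seed)
    forms = {}
    for lemma, role, form in observations:
        forms.setdefault(lemma, {"nsubj": [], "obj": []})[role].append(form)

    distinct_weight = 0
    total_weight = 0
    for lemma, by_role in forms.items():
        if not by_role["nsubj"] or not by_role["obj"]:
            continue  # only lemmas attested in both roles are analysed
        # Assumption: lemma frequency = number of extracted tokens of the lemma.
        frequency = len(by_role["nsubj"]) + len(by_role["obj"])
        subject_form = rng.choice(by_role["nsubj"])  # one random subject form
        object_form = rng.choice(by_role["obj"])  # one random object form
        total_weight += frequency
        if subject_form != object_form:
            distinct_weight += frequency
    return distinct_weight / total_weight if total_weight else 0.0


# Invented token counts for the Russian nouns used as examples in the text.
toy = (
    [("devočka", "nsubj", "devočka")] * 2
    + [("devočka", "obj", "devočku")]
    + [("stol", "nsubj", "stol"), ("stol", "obj", "stol")]
    + [("myš", "nsubj", "myš")]
)
score = distinctness_score(toy)
print(score)
```
        </code>
        <p>In the toy data, the lemma with distinct forms accounts for three of the five analysed tokens, so the score is 0.6; the lemma attested only as a subject is ignored.</p>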
        <p id="paragraph-353da1a2eb45dbd03bbe7f0925277406">Following previous research (e.g. SINNEMÄKI, 2008<xref id="xref-826f15414c93533d063688af2a415f9b" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>) and the tradition in typology, the analyses presented below were performed on subjects and objects expressed by common nouns (Universal Part of Speech tag “NOUN”). However, I also computed scores for all possible subjects and objects (including pronouns, different nominalizations, symbols, proper nouns, etc.) and compared them with the ones based on nouns only. The correlations between the scores based only on nouns and those based on all possible lexemes are very strong and positive: r = 0.92, p &lt; 0.001 in the UD corpora; r = 0.98, p &lt; 0.001 in the online news corpora.</p>
        <p id="paragraph-8c2f894479484012ef112ff042aee5e2">The formal distinctness scores based on the UD corpora and the online news corpora are displayed in Figure 2. The languages at the bottom have no or very limited case marking, whereas the languages at the top have systematic case morphology. Languages in the middle have diverse types of differential case marking, where the presence or absence of markers is determined by the semantic or pragmatic properties of the referent, lexical class, tense, aspect and other factors. Examples are Russian, where only animate masculine and feminine objects are different from the subject forms; Turkish, where definite and specific indefinite objects are marked; and Hindi, which has a complex case system, in which the ergative marker is added to subjects in perfective clauses, whereas human specific objects are usually marked with the accusative case. </p>
        <p id="paragraph-9169f03ed37fe53771f660484431cd25">There is a very strong correlation between the two types of data: r = 0.952, p &lt; 0.001. It is not clear what explains the large discrepancies for Tamil, Lithuanian and Korean. Possible reasons are the small size of the available UD corpora and the noise in the automatically parsed online news corpora.</p>
        <p id="paragraph-e4ae17a261bdc514fb3d8cd19d2ae7db">Indexing of subject and object (agreement) is not investigated in this paper. Previous research has shown that subject agreement is not significantly correlated with word order or case marking, whereas object agreement correlates negatively with the presence of both factors simultaneously (SINNEMÄKI, 2008<xref id="xref-95c0982da3e750bccbb17b21e678ee50" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>). Unfortunately, my sample of languages does not allow me to test object agreement statistically. I leave that to future research.</p>
        <fig id="figure-panel-05b25579438efa72e0d113b808a7ffa1">
          <label>Figure 2</label>
          <caption>
            <title>Proportions of distinct subject and object forms in the UD corpora and online news</title>
            <p id="paragraph-04944cbb5a0538326d7f1667b6a6e081" />
          </caption>
          <graphic id="graphic-fb1ea29d2616e71bd5fc41a0fc15dd0a" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 2.jpg" />
        </fig>
        <p id="paragraph-15e0720ff46d464846062ea5eb94767e" />
        <sec id="heading-74117868bdb1b1c178c658427cea6e78">
          <title>2.3.2. Word order rigidity</title>
          <p id="paragraph-b78337202dfab7e73ff9036ab04fe3eb" />
          <p id="paragraph-189846a20658f597dcc855ce7c2ac5d1">If the order of subject and object is fixed, it can be a reliable cue of the syntactic roles. In order to measure word order rigidity, I used anti-entropy, which is 1 minus Shannon entropy of the order of subject and object. Shannon’s entropy has been used to represent flexibility in word order (LEVSHINA, 2019<xref id="xref-1feeb95de4dc54a3617cb6a91b51855c" ref-type="bibr" rid="journal-article-ref-001cd535d8a43913429703db6591e79e">[25]</xref>). The formula for computing entropy of orders SO and OS is as follows:</p>
          <p id="paragraph-b295d5f900b05685f1c94abc47c91c60" />
          <p id="paragraph-794b1247a154a91a8eceb74463d52f70">(2)    H = -1 * (Pr (SO) * Log Pr (SO) + Pr (OS) * Log Pr (OS))</p>
          <p id="paragraph-ccb38259e429adbb448d2e132953eea8" />
          <p id="paragraph-629ace42e04f5a465380ee1124b125fb">where the probabilities of SO and OS were computed as simple proportions of each word order taken from the corpora.</p>
          <p id="paragraph-80db2262fc4c50cd21b687ee216b27ba">The entropy score is minimal when either subject is always before object or the other way round, i.e. Pr (SO) = 1 and Pr (OS) = 0, or Pr (SO) = 0 and Pr (OS) = 1. Entropy is maximal when both have equal probabilities Pr (SO) = Pr (OS) = 0.5. The anti-entropy scores based on the UD corpora and the online news corpora are displayed in Figure 3. As in the previous section, these scores are based only on common nouns. The correlation between rigidity scores in the UD corpora and in the news is positive and high: r = 0.895, p &lt; 0.001. The scores based on only nouns and those based on all possible slot fillers also correlate strongly and positively: r = 0.74, p &lt; 0.001 in the UD corpora, r = 0.85, p &lt; 0.001 in the online news corpora.</p>
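          <p>As a minimal sketch, the entropy of the two orders SO and OS and the corresponding anti-entropy score could be computed from corpus counts as shown below. The function names are illustrative. The logarithm is taken to base 2, which is an assumption: it is the base under which entropy reaches its maximum of 1 at equal probabilities, so that anti-entropy ranges from 0 to 1.</p>
          <code language="python"><![CDATA[
```python
import math


def order_entropy(n_so, n_os):
    """Shannon entropy (base 2, assumed) of the SO/OS order distribution."""
    total = n_so + n_os
    if total == 0:
        raise ValueError("at least one clause is required")
    entropy = 0.0
    for count in (n_so, n_os):
        p = count / total  # simple proportion taken from the corpus counts
        if p > 0:          # the term p * log(p) is treated as 0 when p = 0
            entropy -= p * math.log2(p)
    return entropy


def rigidity(n_so, n_os):
    """Anti-entropy: 1 minus the entropy of subject-object order."""
    return 1.0 - order_entropy(n_so, n_os)
```
]]></code>
          <p>With these definitions, a corpus in which subject always precedes object receives the rigidity score 1, and a corpus with both orders equally frequent receives the score 0.</p>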
          <fig id="figure-panel-d0aa160ddeb571a6db15c04dfbe0e817">
            <label>Figure 3</label>
            <caption>
              <title>Word order rigidity (anti-entropy) scores of subject and object</title>
              <p id="paragraph-e0c184728a4fbace0da7903ea9c32490" />
            </caption>
            <graphic id="graphic-9d4dcd92b92d88927e0dfe9bb5b3ecad" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 3.jpg" />
          </fig>
        </sec>
      </sec>
    </sec>
    <sec id="heading-19d1743208f149e44a76bfc99799e306">
      <title />
      <sec id="heading-3d46c39ae332faceb90b719090e1ac62">
        <title>2.3.3. Position of the verb</title>
        <p id="paragraph-9c1dad3decd131a1a951045d259c0bca" />
        <p id="paragraph-a455a4995ee350e309e488a6a204687b">The third variable was ‘verb-medialness’, which shows how frequently the head verb occurs between subject and object. The procedure was as follows. I computed the number of all clauses (main and finite subordinate clauses) with overt subject and object (“nsubj” and “obj” relationships). Next, I computed the proportion of all clauses where the lexical verb is in the middle. The scores based on the UD corpora and the online news corpora are displayed in Figure 4. The correlation between the scores in the UD corpora and in the online news is nearly perfect: r = 0.992, p &lt; 0.001. One can see a gap between strictly SOV languages (Japanese, Tamil, Korean, Hindi and Turkish) with the lowest scores and all the rest, which are SVO. French, English and Indonesian have the highest scores. The languages in the middle have variable SVO/SOV order (Dutch, German and Hungarian), with the exception of Arabic (SVO/VSO). The scores for the common nouns presented in Figure 4 correlate nearly perfectly with the scores based on all lexemes: r = 0.96, p &lt; 0.001 for the UD corpora, and r = 0.98, p &lt; 0.001 for the news corpora. </p>
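        <p>The verb-medialness score can be illustrated with the following sketch. It assumes that every clause with an overt subject and object has already been reduced to the linear positions of the subject head, the verb and the object head; this input format and the function names are hypothetical and serve only as an example.</p>
        <code language="python"><![CDATA[
```python
def is_verb_medial(subj_idx, verb_idx, obj_idx):
    """True if the verb stands between subject and object, in either order."""
    return min(subj_idx, obj_idx) < verb_idx < max(subj_idx, obj_idx)


def verb_medialness(clauses):
    """Proportion of clauses in which the verb is between subject and object.

    clauses: iterable of (subject position, verb position, object position).
    """
    clauses = list(clauses)
    if not clauses:
        raise ValueError("at least one clause is required")
    medial = sum(is_verb_medial(s, v, o) for s, v, o in clauses)
    return medial / len(clauses)


# Toy clauses (invented): SVO, SOV, OVS and VSO, one of each.
toy_clauses = [(1, 2, 3), (1, 3, 2), (3, 2, 1), (2, 1, 3)]
toy_medialness = verb_medialness(toy_clauses)
```
]]></code>
        <p>In the toy sample, the SVO and OVS clauses count as verb-medial, whereas the SOV and VSO clauses do not.</p>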
        <fig id="figure-panel-081b96fdc169fef24168e63880cd9fa7">
          <label>Figure 4</label>
          <caption>
            <title>Proportion of clauses with head verb between subject and object</title>
            <p id="paragraph-004db9a55e9f9c4bdcf3e065bdeeb2ec" />
          </caption>
          <graphic id="graphic-a968506d7ed9f662be2ba108b0606caa" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 4.jpg" />
        </fig>
      </sec>
    </sec>
    <sec id="heading-0d4edc2fece1fb99c03effaa5449abc0">
      <title />
      <sec id="heading-fd777ad237deec80f6074d86d98273cf">
        <title>2.4. Correlations</title>
        <p id="paragraph-420f2f8d89381c1f1d8674eb827010a8" />
        <p id="paragraph-13ba303726ca4dae8a628af9cab2df5d">This section tests the relationships between the three types of cues. Recall that a trade-off requires a negative correlation between two parameters. Let us test if this requirement is met. Figure 5 displays Spearman’s rank-based correlations between the pairs of variables. The results for both data sources are very similar. </p>
        <fig id="figure-panel-24755c9872d924c7fa53602806039dc2">
          <label>Figure 5</label>
          <caption>
            <title>Correlations between word order rigidity, formal distinctness of subject and object and verb-medialness in the UD corpora (left) and in the online news (right)</title>
            <p id="paragraph-5b83e9a2317e7654fcc8977cc9a9b5f7" />
          </caption>
          <graphic id="graphic-8e7e2c353391630f49e31ab2584412a5" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 5.jpg" />
        </fig>
      </sec>
    </sec>
    <sec id="heading-3">
      <title />
      <p id="paragraph-d94c858e64d2475b32f06525f2f29409"> The correlation between rigid word order and formal distinctness is negative: more rigid word order means less distinct subject and object forms (p &lt; 0.001). It is also instructive to look at a scatter plot with language names in Figure 6, which shows this relationship in more detail. It tells us that languages with similar forms (the left-hand side of the corresponding small plot) indeed have rigid word order, but that languages with less similar forms are somewhat more variable with regard to word order rigidity. For example, Finnish, Japanese, Korean and Persian have highly distinct forms, but quite rigid word order, while Hungarian and Tamil also have distinct forms, but variable word order. This means that the trade-off is not perfectly symmetric, and the relationship is to some extent implicational, rather than fully correlational: Lack of formal distinctions strongly implies rigid word order, but rigid word order less strongly implies low formal distinctness, as shown by Finnish, Korean, Japanese and Persian. </p>
      <fig id="figure-panel-c0041f2a56fbf7413922dc7795da3c6f">
        <label>Figure 6</label>
        <caption>
          <title>Scatterplot of distinct forms and rigid word order of subject and object in the UD corpora</title>
          <p id="paragraph-d89276b24a418ea5d5ec9a5f6d1b1c8f" />
        </caption>
        <graphic id="graphic-f4194eb050ae965ca75953bcf7bb3e85" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 6.jpg" />
      </fig>
      <p id="paragraph-498519b3af476a6400f2e3624f2fd71e" />
      <p id="heading-4af0b490b9dadcf5b8919cb092a3c602">The next correlation is between distinct forms and verb medialness. The correlation is again negative, as predicted (p &lt; 0.001). Therefore, high formal distinctness should mean that the verb is less frequently in the middle, and low formal distinctness should mean that the verb is more frequently in the middle. However, the scatter plot shown in Figure 7 suggests again that this is a simplification. When the forms are not distinct, the verb is typically between subject and object, as the large cluster of languages in the bottom right corner shows. Yet, when the forms are distinct, the verb can be anywhere. For example, it is rarely medial in Turkish, Hindi, Japanese, Korean and Tamil (see top left corner), but usually medial in the Baltic and Finnic languages (see top right corner). This relationship is even more obviously implicational than in the previous plot. </p>
      <fig id="figure-panel-5c0f8a833cb71a9cda22cc15d726bb31">
        <label>Figure 7</label>
        <caption>
          <title>Scatterplot of verb medialness and distinct forms of subject and object in the UD corpora</title>
          <p id="paragraph-809e9aa11652f5bc15cf290e238ecf54" />
        </caption>
        <graphic id="graphic-c4052075cb7f1b770359a0860fd9f87b" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 7.jpg" />
      </fig>
      <p id="paragraph-d96479ea4a4fdead04f9d4df05242bea" />
      <p id="paragraph-2629dc6d99246c5374f42d758a3b3846">Finally, we observe a positive correlation between rigid word order and verb-medialness. This finding is similar to the results reported by Sinnemäki (2010<xref id="xref-5d23783e3d1d249431fba4cb070992ee" ref-type="bibr" rid="journal-article-ref-6d1fd7211fd35c932d578968e5f1a128">[46]</xref>), who used categorical data from a large sample of typologically diverse languages. The positive correlation is a case of cue redundancy. The distribution of the scores is shown in Figure 8. We can see that very rigid word order in French, Indonesian or English is strongly associated with verb-medial position, but the verb-final languages on the left behave in very diverse ways. </p>
      <fig id="figure-panel-f90ba46614d78ef00f669ba4d039c938">
        <label>Figure 8</label>
        <caption>
          <title>A scatterplot of verb medialness and rigid word order</title>
          <p id="paragraph-cc0dd48e34815614045cfb7f88f71f31" />
        </caption>
        <graphic id="graphic-1a13ef3634aa0cb342150f0cc266a829" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 8.jpg" />
      </fig>
      <p id="paragraph-5efe7f4adec5c565bd796f968be7652a" />
      <p id="paragraph-d6aadf8baea8e645e28285638eb9fc34">So far, we have discussed pairwise correlations that did not take into account the presence of the third variable. However, this analysis is incomplete because when testing the correlation between two types of cues, we need to control for the third one. In order to do so, one can use partial correlation coefficients. They are shown in Table 2. </p>
      <table-wrap id="table-figure-b92ca8047a6364fa800d72be6bab5294">
        <label>Table 2</label>
        <caption>
          <title>Partial correlations between the cues in the UD corpora and in the online news</title>
          <p id="paragraph-000696562d1742ca22f0a022bc249569" />
        </caption>
        <table id="table-c098c0e94a4ba8e66ab1e4099689cc57">
          <tbody>
            <tr id="table-row-df877076cf9105bcb4b11fb17729ad23">
              <th id="table-cell-1d2090c32691ca50eb700092873c98f6" />
              <th id="table-cell-b60dd2ade39e927e2853623774cfc930">Rigid Word Order</th>
              <th id="table-cell-afb3d87a6f0bb2afbd106850aecffa62">Distinct Forms</th>
              <th id="table-cell-e8e7f06bf7541997fb142ecb42275d71">Medial Verb</th>
            </tr>
            <tr id="table-row-17e00554d9dd13f312c6b2c7acc365fd">
              <td id="table-cell-5fae76a4f9843c2b9a34a69b7c2caa81">Rigid Word Order</td>
              <td id="table-cell-333ee92a655d232655af267bd91c20ca" />
              <td id="table-cell-c50d5f9517d545c4b65d48243b3b81e4">UD: -0.62 (p &lt; 0.001) news: -0.57 (p = 0.001)</td>
              <td id="table-cell-ce7544540387e2a3d54ab129e05a713e">UD: 0.04 (p = 0.805) news: 0.10 (p = 0.588)</td>
            </tr>
            <tr id="table-row-90273e5f19a450a27cdf5f3568fa3cb4">
              <td id="table-cell-5142c102c985f10175b5ece9123db4b2">Distinct Forms</td>
              <td id="table-cell-64a98f142f3d91745b738c541212f195">UD: -0.62 (p &lt; 0.001) news: -0.57 (p = 0.001)</td>
              <td id="table-cell-a4233d39fe41e231b875722742d0f36e" />
              <td id="table-cell-0e6f4f39331d38369eed33de580a4980">UD: -0.44 (p = 0.016) news: -0.49 (p = 0.007)</td>
            </tr>
            <tr id="table-row-452e73f02c3cefd32ace844ff90b3140">
              <td id="table-cell-fae4e5d0ca44650e765f9ce6c5e7c0db">Medial Verb</td>
              <td id="table-cell-cbdd7142ebc7447a6a67bf729edf79ff">UD: 0.04 (p = 0.805) news: 0.10 (p = 0.588)</td>
              <td id="table-cell-2444861029cbd0bd6967bcb56cbb0919">UD: -0.44 (p = 0.016) news: -0.49 (p = 0.007)</td>
              <td id="table-cell-5b7b3555e4d1ec197f485daeffdd69a9" />
            </tr>
          </tbody>
        </table>
      </table-wrap>
      <p id="paragraph-a28a127635101af9d26fe7ec123555ee">The coefficients for the UD corpora and the online news corpora are similar, which means that our results are robust. The numbers demonstrate that the correlation between formal distinctness and rigid word order is the strongest one, followed by the negative correlation between formal distinctness and verb-medialness. This is similar to the previous results. The correlations are now weaker, however. The most striking difference is that the correlation between rigid word order and verb-medialness disappears when we take into account formal distinctness. </p>
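      <p>For readers who wish to see how a partial correlation between two cues, controlling for the third one, can be obtained, a minimal sketch is given below. It uses the standard first-order partial correlation formula applied to rank-transformed scores. Whether the coefficients in Table 2 were computed in exactly this way is an assumption, and all function names are illustrative.</p>
      <code language="python"><![CDATA[
```python
import math


def ranks(values):
    """Ranks starting at 1; tied values receive their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def partial_spearman(x, y, z):
    """Rank-based correlation of x and y, controlling for z."""
    rx, ry, rz = ranks(x), ranks(y), ranks(z)
    r_xy, r_xz, r_yz = pearson(rx, ry), pearson(rx, rz), pearson(ry, rz)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
```
]]></code>
      <p>The formula subtracts from the correlation between the first two variables the part that can be attributed to their shared association with the third variable, and rescales the result.</p>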
      <p id="paragraph-2bb6aa6bfa82b74d18f46d75b3fbd01e">One may object that the data are dependent because many of the languages come from the same families and genera (that is, Baltic, Germanic, Romance, Slavic and Finnic). If we take into account these dependencies, traditional correlational analysis is not appropriate any more. Additional tests (LEVSHINA, In preparation<xref id="xref-4325db77dc6a78d7324de42275cf9e70" ref-type="bibr" rid="journal-article-ref-0313580c20d701c501821a8a42289e2d">[51]</xref>) based on permutation and resampling support the quantitative results presented here. </p>
      <p id="paragraph-dc64c8dc6cdc2de807718d8a0319a576" />
      <sec id="heading-a660d4f49da056739fff897675614d99">
        <title>2.5. From correlation to causation</title>
        <p id="paragraph-41c91824320c3c1fffa9458682ba7a76" />
        <p id="paragraph-72a07c0da90910c32dd3cf6f844a9628">The quantitative analyses have revealed a negative correlation between rigid word order and distinct forms of subject and object. We also found a negative correlation between distinct forms and medial position of the verb. Rigid word order and verb-medialness are correlated positively, but this correlation disappears when the formal distinctness is taken into account. This supports the idea of Fenk-Oczlon and Fenk (2008<xref id="xref-77c350812db2951ea03cea2902052067" ref-type="bibr" rid="chapter-ref-ec8ee13229e63366ee68d46cb30dd5e1">[21]</xref>) that trade-offs are more likely to be observed between different linguistic domains (e.g. syntax and morphology, or semantics and phonology) than within the same domain (see also SINNEMÄKI, 2008<xref id="xref-a835e4d07202d385d85714fc55ffa97b" ref-type="bibr" rid="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">[22]</xref>). </p>
        <p id="paragraph-ed2faf797c7a98da82111d7cab80f04a">We also saw in the scatter plots that languages lacking formal distinctness have rigid word order, and tend to have the verb in the middle. So, one might think that lack of formal distinctness causes language users to provide cues with the help of word order. If one changes the perspective, it is also possible to say that the languages with rigid word order have low formal distinctness, whereas SOV languages tend to have high distinctness, so one could claim that it is word order that can explain case marking. So, what is the direction of causality – from word order to case marking, or the other way round?</p>
        <p id="paragraph-03efdcacbd2a1e633352502489698312">There are some arguments in the literature that word order can determine case marking. According to Kiparsky (1996<xref id="xref-827909c2fa8f9e3c3bf4d37e06ece98e" ref-type="bibr" rid="chapter-ref-1fc116100922b193f42dac2569d4c0cf">[52]</xref>), the shift to VO began in Old English before the collapse of the case system (and also before the loss of subject-verb agreement). Similarly, Bauer (2009<xref id="xref-0a1d5ee75e728c3c8f3afa9ff8aecdde" ref-type="bibr" rid="chapter-ref-78040988ce965afd01e60905acf25e6f">[53]</xref>) shows that the change to VO and rigid word order in Late and Vulgar Latin preceded the loss of inflection in Romance. There is a hypothesis that Indo-European languages drift from SOV to SVO and rigid word order, which leads to the loss of inflections (KOCH, 1974<xref id="xref-a974d90c0211a0c47f87c386bd0777c9" ref-type="bibr" rid="journal-article-ref-7ac77ca02e3420b857e54f5af62203b0">[54]</xref>). Since most of the languages in our sample are Indo-European, this may be an explanation of the correlations we observe. </p>
        <p id="paragraph-b6ef6a3f3ecd489aff1d7f95240de5e8">There is also experimental evidence of a causal link from word order to case marking. In a study by Fedzechkina, Newport and Jaeger (2016<xref id="xref-5c08ac682a415476646da233ac185bca" ref-type="bibr" rid="chapter-ref-c47f03546aca055706d1ddb3a21309d9">[55]</xref>), learners were presented with miniature artificial languages containing optional case marking and either fixed or flexible constituent order. It was found that the learners of the fixed order language used case marking significantly less often than the other learners, and less often than in the input language, which means that rigid word order indeed triggers the loss of distinct forms. At the same time, the word order properties of the input languages remained stable.</p>
        <p id="paragraph-e5ccaf81ec2732e6ceb9ea7f48012e52">In order to test this hypothesis, we should move from binary correlations to multivariate causal analysis (BLASI; ROBERTS, 2017<xref id="xref-d236091813ae47da04dcd5563ad62c83" ref-type="bibr" rid="chapter-ref-123889edf1411f75bd4d9c4f7302778b">[56]</xref>). A causal analysis using the PC algorithm (SPIRTES; GLYMOUR; SCHEINES, 2000<xref id="xref-fb8854559555403f40320797ae43ce50" ref-type="bibr" rid="book-ref-16d1ae8b46d5a462919986ce6fa3dd1d">[57]</xref>; KALISH <italic id="italic-9e8022a5786fa7a3f161da98f385d194">et al.</italic>, 2012<xref id="xref-131a0e7213ac439cd4947ce1982fad4b" ref-type="bibr" rid="journal-article-ref-f2e9c384911977baba1b83c911c10d21">[58]</xref>) produces the directed acyclic graph shown in Figure 9. The arrows represent the direction of effect of one variable on another, with the significance level of 0.05. The results for the UD corpora and the online news data are identical. Similar results are obtained with the help of a resampling method, where one draws one language per genus 1,000 times, logging the probability of every link, and computes the average probability (LEVSHINA, In preparation<xref id="xref-60c42aa9bf472979231b2b34cf886040" ref-type="bibr" rid="journal-article-ref-0313580c20d701c501821a8a42289e2d">[51]</xref>).</p>
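        <p>The resampling step mentioned above, in which one language per genus is drawn repeatedly, can be sketched as follows. The sketch is generic: the statistic computed on every draw (for instance, whether a given causal link is present) is passed in as a function, and the genus assignments in the example are toy data. None of the names below come from the original analysis.</p>
        <code language="python"><![CDATA[
```python
import random
from collections import defaultdict


def draw_one_per_genus(genus_of, rng):
    """Randomly pick one language from every genus."""
    by_genus = defaultdict(list)
    for language, genus in sorted(genus_of.items()):
        by_genus[genus].append(language)
    return [rng.choice(languages) for _, languages in sorted(by_genus.items())]


def average_over_draws(genus_of, statistic, n_draws=1000, seed=0):
    """Average a statistic over repeated one-language-per-genus samples."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_draws):
        total += statistic(draw_one_per_genus(genus_of, rng))
    return total / n_draws


# Toy genus assignments, for illustration only.
toy_genera = {
    "Dutch": "Germanic",
    "German": "Germanic",
    "Finnish": "Finnic",
    "Estonian": "Finnic",
    "Hindi": "Indic",
}
toy_sample = draw_one_per_genus(toy_genera, random.Random(1))
```
]]></code>
        <p>Each draw contains exactly one language per genus, so closely related languages never enter the same sample together.</p>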
        <fig id="figure-panel-82c87f232dd185ce7864df9941bc7219">
          <label>Figure 9</label>
          <caption>
            <title>Causal analysis of three types of cues</title>
            <p id="paragraph-27653a7d5ccf6ba0bd61c6da735bdecb" />
          </caption>
          <graphic id="graphic-da1ee6010aef227e54fdc1211bba80c1" mimetype="image" mime-subtype="jpeg" xlink:href="Figura 9.jpg" />
        </fig>
        <p id="paragraph-c794a90b90824b2501bf1fb46683e770">The graph tells us that both word order variables contribute jointly to the distinctness of subject and object forms, whereas the word order variables themselves are not causally related to each other. This is in line with the results of the partial correlational analysis. The effect of word order on formal distinctness supports the theoretical claims from the literature discussed above. A new finding is that the verb position also affects formal distinctness. In particular, we can hypothesize that verb-finalness increases the distinctness of subject and object. </p>
        <p id="paragraph-efea50fcc7937bb5c00fa64752ee8b8f" />
      </sec>
      <sec id="heading-70e12814ba59a0758a72f76187639f30">
        <title>3. Discussion</title>
        <p id="paragraph-bb7306aeeea7902e483015dd115c8839" />
        <p id="paragraph-ce0f91630704edc58d15890c26d79a3b">This paper has discussed a popular idea in functional linguistics, namely, that different costs or benefits are in relationships of efficient trade-offs, which can be thought of as Pareto frontiers. I argued that there are many conceptual and methodological problems with that idea. First, it is difficult to identify the exact nature of costs and benefits. Second, a negative correlation between costs or benefits does not always mean that the language user can make a rational choice. Third, binary trade-offs ignore other relevant costs and benefits. Therefore, it would be safer to drop the term “trade-off” altogether.</p>
        <p id="paragraph-5daeb220d5d740236732c8e04de6ee5a">In game theory and economics, the situation of Pareto efficiency is also known as a zero-sum game, where the interacting parties’ aggregate gains and losses add up to zero. It has been argued, however, that there is an increasing chance of finding non-zero-sum solutions as the complexity of a system increases (WRIGHT, 2000<xref id="xref-f5a6325b06377ac330ba9cd84aafa666" ref-type="bibr" rid="book-ref-8b035cfa56b6298d00e56e9c982edc5d">[59]</xref>). Language as a highly complex system is not a zero-sum game.</p>
        <p id="paragraph-a19633bf2339ddaa3ddd4da353cd9539">As an illustration, I presented a case study of three types of cues that help to differentiate between subject and object: rigid word order, medial position of the verb and formal distinctness of the arguments provided by case morphemes and adpositions. The results of correlational analyses demonstrate that not all cues are efficiently related. There can be redundancy in the amount of information available to the addressee. Also, we have seen that some relationships are more implicational than correlational, which also leads to cue redundancy. The only thing disfavoured by the languages is the absence of any cues. It seems that a breakdown of communication (with additional costs of reanalysis and conversational repair) is more dangerous than wasting the resources. This conclusion is in line with typological evidence, which suggests that all languages have some amount of redundancy (HENGEVELD; LEUFKENS, 2018<xref id="xref-b419c8fa8a887119b41121a9dbaaf66f" ref-type="bibr" rid="journal-article-ref-d65fff11324a1a54a24c7f07914e2217">[19]</xref>).</p>
        <p id="paragraph-81e5084cbba6c05684762221b32f01e8">Taking the speaker’s perspective, we can say that the speaker saves effort by providing less overt coding when the word order provides sufficient information. This is efficient behaviour, but it is difficult to treat it as a real trade-off because, unlike the articulatory efforts required for production of case marking, it is not clear what kind of costs word order has for the speaker (see also the discussion in Section 1.1). Also, the existence of languages with case marking but fairly rigid and verb-medial word order suggests that the speaker’s behaviour is not always efficient. </p>
        <p id="paragraph-3b02ada0c6f5346e6ba94c52640996e8">At the moment, we do not know what the costs of acquiring more or less flexible word order are for learners. I leave the question of trade-offs in language acquisition open.</p>
        <p id="paragraph-28dc31c78e1e2dc5a867370d99c4cc53">Finally, I argued that bivariate correlations should be replaced with multivariate causal analysis and showed how this can be done for the three types of cues. This study has demonstrated that word order determines case marking, but not the other way round. It seems that fixed word order allows case marking to disappear. Also, it may be that verb-final languages tend to develop and maintain case forms. These causal hypotheses are preliminary and need to be further investigated on a larger sample without the Indo-European bias. Other linguistic and extralinguistic factors, such as agreement, semantics, population size and the presence of intensive language contact, should also be taken into account.</p>
        <p id="paragraph-fd2f68314bb893fb29e881be7b378bc4">It is easy to understand why the idea of a trade-off is appealing: it is very simple and intuitive. If you take a larger slice of a cake, the others will get less. In fact, people have a bias towards zero-sum thinking, which persists on a personal level and as a cultural worldview ideology (RÓŻYCKA-TRAN; BOSKI; WOJCISZKE, 2015<xref id="xref-4ee9c2708ffb220020b13ce139f90491" ref-type="bibr" rid="journal-article-ref-16b11eb1e097acdcd9d5bb7cb196816c">[60]</xref>). Zero-sum thinking makes people choose win-lose strategies instead of trying to find win-win solutions – a tendency that has probably become too obvious in world politics nowadays. Our task as scientists is to prevent people from falling into this cognitive trap, and, of course, not to commit this mistake ourselves.</p>
        <p id="paragraph-d7a518a597384258037a10dd98b76163" />
        <p id="paragraph-4ebd96b593cb34a4fb3a79045c8aa1ec" />
      </sec>
      <sec id="heading-d04199dff9ea8ded1e9165e85b890a28">
        <title>Acknowledgements</title>
        <p id="paragraph-cedd765bacc30e387fb566f5cc9da090" />
        <p id="paragraph-5b33b750d97881cc81d8e49b11d20594">The research in this paper was funded by the Netherlands Organisation for Scientific Research (NWO) under Gravitation grant Language in Interaction, grant number 024.001.006. I also sincerely thank Mira Ariel, Sterre Leufkens and Kaius Sinnemäki for their insightful comments and constructive feedback, which have helped me to improve the paper substantially. All remaining errors are solely mine.</p>
        <p id="paragraph-4aa2b38f17ba10c18f49712ca96495dd" />
      </sec>
    </sec>
  </body>
  <back>
    <fn-group>
      <fn id="footnote-77bdff038bd76e64c890b6a54ea6f873">
        <label>1</label>
        <p id="paragraph-d5babb3bde24ec9be3bb52451b803d82">See also Shosted (2006<xref id="xref-5d4ac79302989ff2a8df5bfd8b84e5be" ref-type="bibr" rid="journal-article-ref-f2937d26a4be28b5fd1e26fdde72db09">[15]</xref>), who does not use the term “trade-off” directly, but provides a critical discussion of the assumption of equal complexity of languages, which involves negative correlations between the complexity levels of different language components (phonology, morphology, syntax, etc.). </p>
      </fn>
      <fn id="footnote-b4c6d8bde6134416ea4b648fee4d2052">
        <label>2</label>
        <p id="paragraph-6144637b072c5c2b307d8e41d4c5c5a1">See an overview of different definitions of complexity in Sinnemäki (2011<xref id="xref-a3ba5cf917a3cadfa13219f118d02bef" ref-type="bibr" rid="thesis-ref-bcbd42b2786a4a7bd6e11a4468ea2b27">[61]</xref>). </p>
      </fn>
      <fn id="footnote-c4ab2c87e92ee1a88494702779155d3b">
        <label>3</label>
        <p id="paragraph-4dfd97308242c4e62336f4387fada543">More exactly, when the imaginary speaker names a target referent (e.g. her father’s sister) using a certain kinship term (e.g. <italic id="italic-2d8acb7e20a36e6bb1836e5bf5f891ad">aunt</italic>), the communicative cost is the divergence between the speaker and listener beliefs represented as probability distributions. This divergence, which is called information loss, is weighted by the need probability of the referent (i.e. the probability that the speaker will need to communicate about her father’s sister). </p>
      </fn>
      <fn id="footnote-891bb52a916ab99c30abf76e323f621e">
        <label>4</label>
        <p id="paragraph-b4548897102e2efa2abf09c7f2355cba">Although some nouns may be more frequently used in the plural than in the singular, e.g. <italic id="italic-d0b60ad1bfccb25dde1a5ae89ae647b7">pea</italic> and <italic id="italic-2794c20d9a42af3ab3533b6f8f1d67cd">peas</italic> (see Introduction), singular nouns are more frequent in general than plural. The split number marking of the Welsh type is unusual. Moreover, all languages with singulative coding also have ordinary plural marking for other nouns (HASPELMATH; KARJUS, 2017<xref id="xref-0c5ce7b3a6b949d27741a56e74e180d5" ref-type="bibr" rid="journal-article-ref-14a74dafac6602b7214ad7d52ad45826">[62]</xref>). </p>
      </fn>
      <fn id="footnote-858a826cd69b07c3619c60be47678b3e">
        <label>5</label>
        <p id="paragraph-d7d906462b7cd4c1a6fb4abe45862b91">To be more precise, Zipf’s law of abbreviation seems to have a more complex explanation. A quantitative causal model by Baayen, Milin and Ramscar (2016<xref id="xref-aa05573c72b07ade009f42e1f403c3b5" ref-type="bibr" rid="journal-article-ref-7eddb19e152814d02c68cec1e82f4ca7">[63]</xref>) suggests that there is a causal relationship from the co-textual predictability of a word to its length, and from its length to frequency. In other words, we choose shorter forms for predictable meanings, and these forms are then used more often because they are short. </p>
      </fn>
      <fn id="footnote-0e078adc388750867cf5138d6eacc3fe">
        <label>6</label>
        <p id="paragraph-75c977b61d377b80c2af68883233839d">
          <ext-link id="external-link-4" xlink:href="https://universaldependencies.org/">https://universaldependencies.org/</ext-link>
        </p>
      </fn>
      <fn id="footnote-5ed438c01750757a0ce0e17df030eac0">
        <label>7</label>
        <p id="paragraph-46fbbb7d096cb6ff4d4035bb847f7189">
          <ext-link id="external-link-6" xlink:href="https://wortschatz.uni-leipzig.de/en/download">https://wortschatz.uni-leipzig.de/en/download</ext-link>
        </p>
      </fn>
    </fn-group>
    <ref-list>
      <ref id="book-ref-a528d84f01c66ebd97a9cb268500059c">
        <element-citation publication-type="book">
          <publisher-loc>London</publisher-loc>
          <publisher-name>Routledge</publisher-name>
          <year>1990</year>
          <person-group person-group-type="author">
            <name>
              <surname>ARIEL</surname>
              <given-names>Mira</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-1">Accessing Noun-Phrase Antecedents</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="chapter-ref-2e3db0b9ce45bb38894b4eefe35ee907">
        <element-citation publication-type="chapter">
          <publisher-loc>Oxford</publisher-loc>
          <publisher-name>Oxford University Press</publisher-name>
          <year>2014</year>
          <pub-id pub-id-type="doi">10.1093/acprof:oso/9780198709848.001.0001</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>ARIEL</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <person-group person-group-type="editor">
            <name>
              <surname>MACWHINNEY</surname>
              <given-names>B</given-names>
            </name>
            <name>
              <surname>MALCHUKOV</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>MORAVCSIK</surname>
              <given-names>E. A</given-names>
            </name>
          </person-group>
          <source>Competing Motivations</source>
          <chapter-title><italic id="italic-9e889251e01a91c895d6650f6d733505">Or</italic> Constructions: Monosemy versus polysemy</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-7eddb19e152814d02c68cec1e82f4ca7">
        <element-citation publication-type="journal">
          <issue>11</issue>
          <volume>30</volume>
          <year>2016</year>
          <pub-id pub-id-type="doi">10.1080/02687038.2016.1147767</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>BAAYEN</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>MILIN</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>RAMSCAR</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <source>Aphasiology</source>
          <article-title>Frequency in lexical processing</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-78040988ce965afd01e60905acf25e6f">
        <element-citation publication-type="chapter">
          <publisher-loc>Berlin</publisher-loc>
          <publisher-name>Mouton de Gruyter</publisher-name>
          <volume>1: Syntax of the Sentence</volume>
          <year>2009</year>
          <person-group person-group-type="author">
            <name>
              <surname>BAUER</surname>
              <given-names>B. M</given-names>
            </name>
          </person-group>
          <person-group person-group-type="editor">
            <name>
              <surname>BALDI</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>CUZZOLIN</surname>
              <given-names>P</given-names>
            </name>
          </person-group>
          <source>New Perspectives on Historical Latin Syntax</source>
          <chapter-title>Word order</chapter-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-123889edf1411f75bd4d9c4f7302778b">
        <element-citation publication-type="chapter">
          <day>10</day>
          <month>05</month>
          <publisher-name>Zenodo</publisher-name>
          <year>2017</year>
          <pub-id pub-id-type="doi">10.5281/ZENODO.573774</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>Blasi</surname>
              <given-names>Damián E.</given-names>
            </name>
            <name>
              <surname>Roberts</surname>
              <given-names>Seán G.</given-names>
            </name>
          </person-group>
          <source>Dependencies in Language</source>
          <chapter-title>Beyond Binary Dependencies In Language Structure</chapter-title>
        </element-citation>
      </ref>
      <ref id="book-ref-9520cf6dbd5f589b523de44303c4a977">
        <element-citation publication-type="book">
          <publisher-loc>Cambridge</publisher-loc>
          <publisher-name>Cambridge University Press</publisher-name>
          <year>1996</year>
          <person-group person-group-type="author">
            <name>
              <surname>CLARK</surname>
              <given-names>H. H</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-809dec53508455e9585c8a5985887fb9">Using Language</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-db730c42323ab588ca9ede7908be203f">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>22</volume>
          <year>1986</year>
          <pub-id pub-id-type="doi">10.1016/0010-0277(86)90010-7</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>CLARK</surname>
              <given-names>H. H</given-names>
            </name>
            <name>
              <surname>WILKES-GIBBS</surname>
              <given-names>D</given-names>
            </name>
          </person-group>
          <source>Cognition</source>
          <article-title>Referring as a collaborative process</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-0ab16defb12e6d440c278996d4eae518">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>6</volume>
          <year>2002</year>
          <pub-id pub-id-type="doi">10.1515/lity.2002.001</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>CROFT</surname>
              <given-names>W</given-names>
            </name>
          </person-group>
          <source>Linguistic Typology</source>
          <article-title>On being a student of Joe Greenberg</article-title>
        </element-citation>
      </ref>
      <ref id="book-ref-2babe4d8d8a70cfc5490a2551517bacd">
        <element-citation publication-type="book">
          <edition>5</edition>
          <publisher-loc>Leipzig</publisher-loc>
          <publisher-name>Breitkopf &amp; Härtel</publisher-name>
          <year>1908</year>
          <person-group person-group-type="author">
            <name>
              <surname>DELBRÜCK</surname>
              <given-names>B</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-617e90be4caf637e3978b9667c87b6da">Einleitung in das Studium der indogermanischen Sprachen</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="chapter-ref-1d202b6ccf3d35c903a305aea2ebe412">
        <element-citation publication-type="chapter">
          <publisher-loc>Amsterdam</publisher-loc>
          <publisher-name>John Benjamins</publisher-name>
          <year>1985</year>
          <person-group person-group-type="author">
            <name>
              <surname>DU BOIS</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <person-group person-group-type="editor">
            <name>
              <surname>HAIMAN</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <source>Iconicity in Syntax</source>
          <chapter-title>Competing motivations</chapter-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-c47f03546aca055706d1ddb3a21309d9">
        <element-citation publication-type="chapter">
          <year>2016</year>
          <pub-id pub-id-type="doi">10.1111/cogs.12346</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>FEDZECHKINA</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>NEWPORT</surname>
              <given-names>E. L</given-names>
            </name>
            <name>
              <surname>JAEGER</surname>
              <given-names>T. F</given-names>
            </name>
          </person-group>
          <source>Cognitive Science, 41(2)</source>
          <chapter-title>Balancing Effort and Information Transmission During Language Acquisition: Evidence From Word Order and Case Marking</chapter-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-ec8ee13229e63366ee68d46cb30dd5e1">
        <element-citation publication-type="chapter">
          <publisher-loc>Amsterdam</publisher-loc>
          <publisher-name>John Benjamins</publisher-name>
          <year>2008</year>
          <person-group person-group-type="author">
            <name>
              <surname>FENK-OCZLON</surname>
              <given-names>G</given-names>
            </name>
            <name>
              <surname>FENK</surname>
              <given-names>A</given-names>
            </name>
          </person-group>
          <source>Language Complexity: Typology, Contact, Change</source>
          <chapter-title>Complexity trade-offs between the subsystems of language</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-a40c6e28d58f17cbce5bff6b863b2a7c">
        <element-citation publication-type="journal">
          <issue>6</issue>
          <volume>76</volume>
          <year>2006</year>
          <pub-id pub-id-type="doi">10.1209/epl/i2006-10406-0</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>FERRER-I-CANCHO</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <source>Europhysics Letters</source>
          <article-title>Why do syntactic links not cross?</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-011c3c4e5e78320938e51f4472ad4f5b">
        <element-citation publication-type="journal">
          <volume>39</volume>
          <year>2017</year>
          <person-group person-group-type="author">
            <name>
              <surname>FERRER-I-CANCHO</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <source>Glottometrics</source>
          <article-title>The placement of the head that maximizes predictability. An information theoretic approach</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-6af766fd270df706a873582cf32574d1">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <month>09</month>
          <page-range>16-19</page-range>
          <volume>1</volume>
          <year>1995</year>
          <pub-id pub-id-type="doi">10.1002/cplx.6130010105</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>Gell-Mann</surname>
              <given-names>Murray</given-names>
            </name>
          </person-group>
          <source>Complexity</source>
          <article-title>What is complexity? Remarks on simplicity and complexity by the Nobel Prize-winning author of The Quark and the Jaguar</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-13a6756ba25cf9f9825239d2a628a967">
        <element-citation publication-type="chapter">
          <publisher-loc>Cambridge, MA</publisher-loc>
          <publisher-name>MIT Press</publisher-name>
          <year>2000</year>
          <person-group person-group-type="author">
            <name>
              <surname>GIBSON</surname>
              <given-names>E</given-names>
            </name>
          </person-group>
          <source>Image, Language, Brain: Papers from the First Mind Articulation Project Symposium</source>
          <chapter-title>The dependency locality theory: A distance-based theory of linguistic complexity</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-ee4671c65f48e998d48c64dc839e5421">
        <element-citation publication-type="journal">
          <issue>5</issue>
          <volume>23</volume>
          <year>2019</year>
          <pub-id pub-id-type="doi">10.1016/j.tics.2019.02.003</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>GIBSON</surname>
              <given-names>E</given-names>
            </name>
            <name>
              <surname>FUTRELL</surname>
              <given-names>R</given-names>
            </name>
            <name>
              <surname>PIANTADOSI</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>DAUTRICHE</surname>
              <given-names>I</given-names>
            </name>
            <name>
              <surname>MAHOWALD</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>BERGEN</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>LEVY</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <source>Trends in Cognitive Sciences</source>
          <article-title>How Efficiency Shapes Human Language</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-550cb17016e61c56e900a38d47141862">
        <element-citation publication-type="chapter">
          <publisher-loc>Istanbul</publisher-loc>
          <publisher-name>ELRA</publisher-name>
          <year>2012</year>
          <person-group person-group-type="author">
            <name>
              <surname>GOLDHAHN</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>ECKART</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>QUASTHOFF</surname>
              <given-names>U</given-names>
            </name>
          </person-group>
          <source>Proceedings of the Eighth International Conference on Language Resources and Evaluation</source>
          <chapter-title>Building Large Monolingual Dictionaries at the Leipzig Corpora Collection: From 100 to 200 Languages</chapter-title>
        </element-citation>
      </ref>
      <ref id="book-ref-a7376fbbbc60088d4868763944debdf4">
        <element-citation publication-type="book">
          <publisher-loc>The Hague</publisher-loc>
          <publisher-name>Mouton</publisher-name>
          <year>1966a</year>
          <person-group person-group-type="author">
            <name>
              <surname>GREENBERG</surname>
              <given-names>J. H</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-08a70639f2a6d520f6c88ffb985a3419">Language Universals, With Special Reference to Feature</italic>
            <italic id="italic-2">Hierarchies</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="chapter-ref-4884f33a225a87701faf35a510077d9f">
        <element-citation publication-type="chapter">
          <publisher-loc>Cambridge, MA</publisher-loc>
          <publisher-name>MIT Press</publisher-name>
          <year>1966b</year>
          <person-group person-group-type="author">
            <name>
              <surname>GREENBERG</surname>
              <given-names>J. H</given-names>
            </name>
          </person-group>
          <source>Universals of Language</source>
          <chapter-title>Some universals of grammar with particular reference to the order of meaningful elements</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-1f6bd064a2e4f3600621a955ef0d47a4">
        <element-citation publication-type="journal">
          <issue>4</issue>
          <volume>59</volume>
          <year>1983</year>
          <pub-id pub-id-type="doi">10.2307/413373</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HAIMAN</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <source>Language</source>
          <article-title>Iconic and economic motivation</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-50b08f0536ba5bd8e6b247d1bd407be7">
        <element-citation publication-type="journal">
          <issue>4</issue>
          <volume>30</volume>
          <year>2006</year>
          <pub-id pub-id-type="doi">10.1207/s15516709cog0000_64</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HALE</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <source>Cognitive Science</source>
          <article-title>Uncertainty about the rest of the sentence</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-2eb39c5f8b88979d4016ea9c87907f7b">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>129</volume>
          <year>2013</year>
          <pub-id pub-id-type="doi">10.1016/j.cognition.2013.05.004</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HALL</surname>
              <given-names>M. L</given-names>
            </name>
            <name>
              <surname>MAYBERRY</surname>
              <given-names>R. I</given-names>
            </name>
            <name>
              <surname>FERREIRA</surname>
              <given-names>V. S</given-names>
            </name>
          </person-group>
          <source>Cognition</source>
          <article-title>Cognitive constraints on constituent order: evidence from elicited pantomime</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-d1fdb0aa9e5db35eacd9437570410c3b">
        <element-citation publication-type="journal">
          <volume>98</volume>
          <year>2017</year>
          <pub-id pub-id-type="doi">10.1016/j.cogpsych.2017.08.002</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HARMON</surname>
              <given-names>Z</given-names>
            </name>
            <name>
              <surname>KAPATSINSKI</surname>
              <given-names>V</given-names>
            </name>
          </person-group>
          <source>Cognitive Psychology</source>
          <article-title>Putting old tools to novel uses: The role of form accessibility in semantic extension</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-62d59b59ccc10110fe18274c31fbfc20">
        <element-citation publication-type="chapter">
          <publisher-loc>Oxford</publisher-loc>
          <publisher-name>Oxford University Press</publisher-name>
          <year>2008</year>
          <person-group person-group-type="author">
            <name>
              <surname>HASPELMATH</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <source>Language Universals and Language Change</source>
          <chapter-title>Creating economical morphosyntactic patterns in language change</chapter-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-44fdb479d44a5ef83e7c9f59b486447f">
        <element-citation publication-type="chapter">
          <publisher-loc>Oxford</publisher-loc>
          <publisher-name>Oxford University Press</publisher-name>
          <year>2014</year>
          <person-group person-group-type="author">
            <name>
              <surname>HASPELMATH</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <source>Competing Motivations</source>
          <chapter-title>On system pressure competing with economic motivation</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-14a74dafac6602b7214ad7d52ad45826">
        <element-citation publication-type="journal">
          <issue>6</issue>
          <volume>55</volume>
          <year>2017</year>
          <pub-id pub-id-type="doi">10.1515/ling-2017-0026</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HASPELMATH</surname>
              <given-names>M</given-names>
            </name>
            <name>
              <surname>KARJUS</surname>
              <given-names>A</given-names>
            </name>
          </person-group>
          <source>Linguistics</source>
          <article-title>Explaining asymmetries in number marking: Singulatives, pluratives and usage frequency</article-title>
        </element-citation>
      </ref>
      <ref id="book-ref-9bb5eb7f2364e1dff59d04ea918ea363">
        <element-citation publication-type="book">
          <publisher-loc>London</publisher-loc>
          <publisher-name>Croom Helm</publisher-name>
          <year>1986</year>
          <person-group person-group-type="author">
            <name>
              <surname>HAWKINS</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-bf836588c2b711f903b5614a97b1ed6c">A Comparative Typology of English and German. Unifying the contrasts</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="book-ref-5a5d017164cae07e8a93517e01975441">
        <element-citation publication-type="book">
          <publisher-loc>Oxford</publisher-loc>
          <publisher-name>Oxford University Press</publisher-name>
          <year>2004</year>
          <person-group person-group-type="author">
            <name>
              <surname>HAWKINS</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-7088b05f7bc2fc4be3c7a99c8920f66f">Efficiency and Complexity in Grammars</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-d65fff11324a1a54a24c7f07914e2217">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>52</volume>
          <year>2018</year>
          <pub-id pub-id-type="doi">10.1515/flin-2018-0003</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HENGEVELD</surname>
              <given-names>K</given-names>
            </name>
            <name>
              <surname>LEUFKENS</surname>
              <given-names>S</given-names>
            </name>
          </person-group>
          <source>Folia Linguistica</source>
          <article-title>Transparent and non-transparent languages</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-6c1585352d63049deef51772dd6eff76">
        <element-citation publication-type="journal">
          <issue>5</issue>
          <volume>25</volume>
          <year>2018</year>
          <pub-id pub-id-type="doi">10.3758/s13423-017-1363-z</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HOLLER</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>KENDRICK</surname>
              <given-names>K. H</given-names>
            </name>
            <name>
              <surname>LEVINSON</surname>
              <given-names>S. C</given-names>
            </name>
          </person-group>
          <source>Psychonomic Bulletin &amp; Review</source>
          <article-title>Processing language in face-to-face conversation: Questions with gestures get faster responses</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-7c4744ea966dda7e2bc6c89395ed8b57">
        <element-citation publication-type="journal">
          <issue>8</issue>
          <volume>23</volume>
          <year>2019</year>
          <pub-id pub-id-type="doi">10.1016/j.tics.2019.05.006</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>HOLLER</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>LEVINSON</surname>
              <given-names>S. C</given-names>
            </name>
          </person-group>
          <source>Trends in Cognitive Sciences</source>
          <article-title>Multimodal language processing in human communication</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-a67580b9399df1a0cd680e69b1c08c3c">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>83</volume>
          <year>2007</year>
          <person-group person-group-type="author">
            <name>
              <surname>JÄGER</surname>
              <given-names>G</given-names>
            </name>
          </person-group>
          <source>Language</source>
          <article-title>Evolutionary Game Theory and Typology. A Case Study</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-df43bca01b9e8bda570ccb73a515463c">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>61</volume>
          <year>2010</year>
          <pub-id pub-id-type="doi">10.1016/j.cogpsych.2010.02.002</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>JAEGER</surname>
              <given-names>T. F</given-names>
            </name>
          </person-group>
          <source>Cognitive Psychology</source>
          <article-title>Redundancy and reduction: Speakers manage syntactic information density</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-ddbd3703f78d87570e3557ff2df7bddf">
        <element-citation publication-type="chapter">
          <publisher-loc>Hoboken, NJ</publisher-loc>
          <publisher-name>John Wiley &amp; Sons</publisher-name>
          <year>2017</year>
          <pub-id pub-id-type="doi">10.1002/9781118829516.ch3</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>JAEGER</surname>
              <given-names>T. F</given-names>
            </name>
            <name>
              <surname>BUZ</surname>
              <given-names>E</given-names>
            </name>
          </person-group>
          <source>The Handbook of Psycholinguistics</source>
          <chapter-title>Signal reduction and linguistic encoding</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-5ce971c415542d6fb2ee286bb8f1a808">
        <element-citation publication-type="journal">
          <day>18</day>
          <issue>3</issue>
          <month>11</month>
          <page-range>323-335</page-range>
          <volume>2</volume>
          <year>2010</year>
          <pub-id pub-id-type="doi">10.1002/wcs.126</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>Jaeger</surname>
              <given-names>T. Florian</given-names>
            </name>
            <name>
              <surname>Tily</surname>
              <given-names>Harry</given-names>
            </name>
          </person-group>
          <source>Wiley Interdisciplinary Reviews: Cognitive Science</source>
          <article-title>On language ‘utility’: processing complexity and communicative efficiency</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-f2e9c384911977baba1b83c911c10d21">
        <element-citation publication-type="journal">
          <issue>11</issue>
          <volume>47</volume>
          <year>2012</year>
          <pub-id pub-id-type="doi">10.18637/jss.v047.i11</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>Kalisch</surname>
              <given-names>Markus</given-names>
            </name>
            <name>
              <surname>Mächler</surname>
              <given-names>Martin</given-names>
            </name>
            <name>
              <surname>Colombo</surname>
              <given-names>Diego</given-names>
            </name>
            <name>
              <surname>Maathuis</surname>
              <given-names>Marloes H.</given-names>
            </name>
            <name>
              <surname>Bühlmann</surname>
              <given-names>Peter</given-names>
            </name>
          </person-group>
          <source>Journal of Statistical Software</source>
          <article-title>Causal Inference Using Graphical Models with the R Package pcalg</article-title>
        </element-citation>
      </ref>
      <ref id="book-ref-5afb9a1f09b0d38bda4fe6111984470f">
        <element-citation publication-type="book">
          <publisher-loc>London</publisher-loc>
          <publisher-name>Routledge</publisher-name>
          <year>1994</year>
          <person-group person-group-type="author">
            <name>
              <surname>KELLER</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-318833b9d5bac93752e544abaf1a54dc">On Language Change: The Invisible Hand in Language</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="conference-paper-ref-d5e89324cb1943762bb58b5830baf79c">
        <element-citation publication-type="confproc">
          <conf-name>Annual Review of Linguistics 4</conf-name>
          <year>2018</year>
          <pub-id pub-id-type="doi">10.1146/annurev-linguistics-011817-045406</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>KEMP</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>XU</surname>
              <given-names>Y</given-names>
            </name>
            <name>
              <surname>REGIER</surname>
              <given-names>T</given-names>
            </name>
          </person-group>
          <article-title>Semantic Typology and Efficient Communication</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-1fc116100922b193f42dac2569d4c0cf">
        <element-citation publication-type="chapter">
          <publisher-loc>Dordrecht</publisher-loc>
          <publisher-name>Kluwer</publisher-name>
          <year>1996</year>
          <person-group person-group-type="author">
            <name>
              <surname>KIPARSKY</surname>
              <given-names>P</given-names>
            </name>
          </person-group>
          <source>Studies in Comparative Germanic Syntax II</source>
          <chapter-title>The Shift to Head-initial VP in Germanic</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-7ac77ca02e3420b857e54f5af62203b0">
        <element-citation publication-type="journal">
          <volume>3</volume>
          <year>1974</year>
          <person-group person-group-type="author">
            <name>
              <surname>KOCH</surname>
              <given-names>M</given-names>
            </name>
          </person-group>
          <source>Montreal Working Papers in Linguistics</source>
          <article-title>A Demystification of Syntactic Drift</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-791e1991a19bf0c635563d094595166c">
        <element-citation publication-type="journal">
          <issue>3</issue>
          <volume>12</volume>
          <year>2017</year>
          <pub-id pub-id-type="doi">10.1371/journal.pone.0173614</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>KOPLENIG</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>MEYER</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>WOLFER</surname>
              <given-names>S</given-names>
            </name>
            <name>
              <surname>MÜLLER-SPITZER</surname>
              <given-names>C</given-names>
            </name>
          </person-group>
          <source>PLoS ONE</source>
          <article-title>The statistical trade-off between word order and word structure – Large-scale evidence for the principle of least effort</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-6fd1d958476e69f443ef027d0571c37e">
        <element-citation publication-type="journal">
          <volume>83</volume>
          <year>2015</year>
          <pub-id pub-id-type="doi">10.1016/j.jml.2015.03.003</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>KURUMADA</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>JAEGER</surname>
              <given-names>T. F</given-names>
            </name>
          </person-group>
          <source>Journal of Memory and Language</source>
          <article-title>Communicative efficiency in language production: Optional case-marking in Japanese</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-a797264c5718372e1d2e6ca89e015c1e">
        <element-citation publication-type="journal">
          <day>26</day>
          <month>11</month>
          <year>2018</year>
          <pub-id pub-id-type="doi">10.5281/ZENODO.1542857</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>LEVSHINA</surname>
              <given-names>N</given-names>
            </name>
          </person-group>
          <source>Zenodo</source>
          <article-title>Towards a Theory of Communicative Efficiency in Human Languages</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-001cd535d8a43913429703db6591e79e">
        <element-citation publication-type="journal">
          <issue>3</issue>
          <volume>23</volume>
          <year>2019</year>
          <pub-id pub-id-type="doi">10.1515/lingty-2019-0025</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>LEVSHINA</surname>
              <given-names>N</given-names>
            </name>
          </person-group>
          <source>Linguistic Typology</source>
          <article-title>Token-based typology and word order entropy</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-0313580c20d701c501821a8a42289e2d">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>LEVSHINA</surname>
              <given-names>N</given-names>
            </name>
          </person-group>
          <source>In preparation</source>
          <article-title>Bounded rationality and limited efficiency: A correlational and causal analysis of subject and object cues in thirty languages</article-title>
        </element-citation>
      </ref>
      <ref id="book-ref-d17216898ce38ded03d9cc94000cc7c2">
        <element-citation publication-type="book">
          <publisher-loc>Stuttgart</publisher-loc>
          <publisher-name>Kohlhammer</publisher-name>
          <year>1963</year>
          <person-group person-group-type="author">
            <name>
              <surname>MARTINET</surname>
              <given-names>A</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-a473b2ea3087b4c8b7bfa5e416645c99">Grundzüge der Allgemeinen Sprachwissenschaft</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-7b7f5199239fb21748aac0efa093790a">
        <element-citation publication-type="journal">
          <issue>2</issue>
          <volume>5</volume>
          <year>2001</year>
          <pub-id pub-id-type="doi">10.1515/lity.2001.001</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>MCWHORTER</surname>
              <given-names>J. H</given-names>
            </name>
          </person-group>
          <source>Linguistic Typology</source>
          <article-title>The world’s simplest grammars are creole grammars</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-9fb8eea3193397b4abdb5c11e8b6960e">
        <element-citation publication-type="journal">
          <issue>9</issue>
          <volume>108</volume>
          <year>2011</year>
          <pub-id pub-id-type="doi">10.1073/pnas.1012551108</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>PIANTADOSI</surname>
              <given-names>S. T</given-names>
            </name>
            <name>
              <surname>TILY</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>GIBSON</surname>
              <given-names>E</given-names>
            </name>
          </person-group>
          <source>PNAS</source>
          <article-title>Word lengths are optimized for efficient communication</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-4813ee74ddc83948a2f342fb2c4cedfa">
        <element-citation publication-type="journal">
          <volume>122</volume>
          <year>2012</year>
          <pub-id pub-id-type="doi">10.1016/j.cognition.2011.10.004</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>PIANTADOSI</surname>
              <given-names>S. T</given-names>
            </name>
            <name>
              <surname>TILY</surname>
              <given-names>H</given-names>
            </name>
            <name>
              <surname>GIBSON</surname>
              <given-names>E</given-names>
            </name>
          </person-group>
          <source>Cognition</source>
          <article-title>The communicative function of ambiguity in language</article-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-16b11eb1e097acdcd9d5bb7cb196816c">
        <element-citation publication-type="journal">
          <issue>4</issue>
          <volume>46</volume>
          <year>2015</year>
          <pub-id pub-id-type="doi">10.1177/0022022115572226</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>RÓŻYCKA-TRAN</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>BOSKI</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>WOJCISZKE</surname>
              <given-names>B</given-names>
            </name>
          </person-group>
          <source>Journal of Cross-Cultural Psychology</source>
          <article-title>Belief in a zero-sum game as a social axiom: A 37-Nation Study</article-title>
        </element-citation>
      </ref>
      <ref id="book-ref-b5e1a87275c9a15db8cff9ec7745ec4e">
        <element-citation publication-type="book">
          <publisher-loc>New York</publisher-loc>
          <publisher-name>Harcourt</publisher-name>
          <year>1921</year>
          <person-group person-group-type="author">
            <name>
              <surname>SAPIR</surname>
              <given-names>E</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-a8b348b28a8498ea73de404246454de5">Language: An Introduction to the Study of Speech</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-f2937d26a4be28b5fd1e26fdde72db09">
        <element-citation publication-type="journal">
          <issue>1</issue>
          <volume>10</volume>
          <year>2006</year>
          <pub-id pub-id-type="doi">10.1515/LINGTY.2006.001</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>SHOSTED</surname>
              <given-names>R. K</given-names>
            </name>
          </person-group>
          <source>Linguistic Typology</source>
          <article-title>Correlating complexity: A typological approach</article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-4eca7df8d3e8676e40a520f62a7bebb5">
        <element-citation publication-type="chapter">
          <publisher-loc>Amsterdam</publisher-loc>
          <publisher-name>John Benjamins</publisher-name>
          <year>2008</year>
          <person-group person-group-type="author">
            <name>
              <surname>SINNEMÄKI</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <source>Language Complexity: Typology, Contact, Change</source>
          <chapter-title>Complexity trade-offs in core argument marking</chapter-title>
        </element-citation>
      </ref>
      <ref id="journal-article-ref-6d1fd7211fd35c932d578968e5f1a128">
        <element-citation publication-type="journal">
          <issue>4</issue>
          <volume>34</volume>
          <year>2010</year>
          <pub-id pub-id-type="doi">10.1075/sl.34.4.04sin</pub-id>
          <person-group person-group-type="author">
            <name>
              <surname>SINNEMÄKI</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <source>Studies in Language</source>
          <article-title>Word order in zero-marking languages</article-title>
        </element-citation>
      </ref>
      <ref id="thesis-ref-bcbd42b2786a4a7bd6e11a4468ea2b27">
        <element-citation publication-type="thesis">
          <year>2011</year>
          <comment>PhD dissertation, University of Helsinki</comment>
          <person-group person-group-type="author">
            <name>
              <surname>SINNEMÄKI</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <article-title>
            <italic id="italic-953553a9754bf81579fad64002ecdd84">Language universals and linguistic complexity. Three case studies in core argument marking</italic>
          </article-title>
        </element-citation>
      </ref>
      <ref id="chapter-ref-e8ebaa902477b97d8f8e0a56137d9a5b">
        <element-citation publication-type="chapter">
          <publisher-loc>Oxford</publisher-loc>
          <publisher-name>Oxford University Press</publisher-name>
          <year>2014</year>
          <person-group person-group-type="author">
            <name>
              <surname>SINNEMÄKI</surname>
              <given-names>K</given-names>
            </name>
          </person-group>
          <source>Measuring Grammatical Complexity</source>
          <chapter-title>Complexity trade-offs: A case study</chapter-title>
        </element-citation>
      </ref>
      <ref id="book-ref-16d1ae8b46d5a462919986ce6fa3dd1d">
        <element-citation publication-type="book">
          <edition>2</edition>
          <publisher-loc>Cambridge, MA</publisher-loc>
          <publisher-name>MIT Press</publisher-name>
          <year>2000</year>
          <person-group person-group-type="author">
            <name>
              <surname>SPIRTES</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>GLYMOUR</surname>
              <given-names>C</given-names>
            </name>
            <name>
              <surname>SCHEINES</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-d068503b7f6f900c84768ed7237fa8e9">Causation, Prediction, and Search</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="software-ref-225fd936372f47383befab0b9324a500">
        <element-citation publication-type="software">
          <ext-link ext-link-type="uri" xlink:href="https://CRAN.R-project.org/package=udpipe">https://CRAN.R-project.org/package=udpipe</ext-link>
          <version>R package version 0.8.4-1</version>
          <person-group person-group-type="author">
            <name>
              <surname>WIJFFELS</surname>
              <given-names>J</given-names>
            </name>
          </person-group>
          <source>udpipe: Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing with the UDPipe NLP Toolkit</source>
        </element-citation>
      </ref>
      <ref id="book-ref-8b035cfa56b6298d00e56e9c982edc5d">
        <element-citation publication-type="book">
          <publisher-loc>New York</publisher-loc>
          <publisher-name>Pantheon</publisher-name>
          <year>2000</year>
          <person-group person-group-type="author">
            <name>
              <surname>WRIGHT</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-e4a6b0a013e39e9fa696facb336bab74">Nonzero: The Logic of Human Destiny</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="conference-paper-ref-2a8bb47251d804239ff040e81a1ded45">
        <element-citation publication-type="confproc">
          <conf-name>LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), Faculty of Mathematics and Physics, Charles University</conf-name>
          <year>2020</year>
          <person-group person-group-type="author">
            <name>
              <surname>ZEMAN</surname>
              <given-names>D</given-names>
            </name>
            <name>
              <surname>NIVRE</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>ABRAMS</surname>
              <given-names>M</given-names>
            </name>
            <collab>
              <named-content content-type="name">et al.</named-content>
            </collab>
          </person-group>
          <article-title>Universal Dependencies 2.6</article-title>
        </element-citation>
      </ref>
      <ref id="book-ref-d28c30708ce2ef900ffe240bef9d52da">
        <element-citation publication-type="book">
          <publisher-loc>Cambridge, MA</publisher-loc>
          <publisher-name>MIT Press</publisher-name>
          <year>1965</year>
          <person-group person-group-type="author">
            <name>
              <surname>ZIPF</surname>
              <given-names>G</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-e963f026ff272f15f1f9d5fdbbad2d31">The Psychobiology of Language: An Introduction to Dynamic Philology</italic>
          </source>
        </element-citation>
      </ref>
      <ref id="book-ref-65d624b065120baee5f74dd94f50e477">
        <element-citation publication-type="book">
          <publisher-loc>Cambridge, MA</publisher-loc>
          <publisher-name>Addison-Wesley</publisher-name>
          <year>1949</year>
          <person-group person-group-type="author">
            <name>
              <surname>ZIPF</surname>
              <given-names>G</given-names>
            </name>
          </person-group>
          <source>
            <italic id="italic-402ebd6ecf5ef7a09212d9ef0d7ba7f2">Human Behavior and the Principle of Least Effort</italic>
          </source>
        </element-citation>
      </ref>
    </ref-list>
  </back>
</article>