Artikel
Quantifying the efficiency of written language
Information theory can be used to assess how efficiently a message is transmitted on the basis of different symbolic systems. In this paper, I estimate the information-theoretic efficiency of written language for parallel text data in more than 1000 different languages, both on the level of characters and on the level of words as information encoding units. The main results show that (i) the median efficiency is ∼29% on the character level and ∼45% on the word level, (ii) efficiency on both levels is strongly correlated with each other and (iii) efficiency tends to be higher for languages with more speakers.
- Sprache
-
Englisch
- Thema
-
Sprachstatistik
Effizienz
Schriftsprache
Informationstheorie
Sprachzeichen
Wort
Sprache
- Ereignis
-
Geistige Schöpfung
- (wer)
-
Koplenig, Alexander
- Ereignis
-
Veröffentlichung
- (wer)
-
Berlin, Boston : De Gruyter
- (wann)
-
2021-05-25
- URN
-
urn:nbn:de:bsz:mh39-104401
- Letzte Aktualisierung
-
06.03.2025, 09:00 MEZ
Datenpartner
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.
Objekttyp
- Artikel
Beteiligte
- Koplenig, Alexander
- Berlin, Boston : De Gruyter
Entstanden
- 2021-05-25