Network-based approaches play an increasingly important role in the analysis of data. Especially in the Digital Humanities (DH), network models have gained importance in recent years because more and more data-based and data-driven research is carried out and the amount of data is increasing (e.g. Big Data).

The project, funded by the go!digital NEXT GENERATION, bridges the fields of linguistics, digital humanities and computer science in order to explore the diachronic dynamics of lexical networks on the basis of large-scale authentic language data. The project will reuse language data that is already available at the ACDH-CH, namely the Austrian Media Corpus (AMC) and the Corpus of Austrian Parliamentary Records (ParlAT). The AMC covers the entire Austrian media landscape of the past 20 years and contains 40 million texts (more than 10 billion tokens).

The ParlAT corpus covers the Austrian parliamentary records of the last 20 years with more than 75 million tokens. From the linguistic perspective, the project will explore the diachronic dynamics of lexical networks and discuss networks-based methods for diachronic linguistics. From the point of view of computer science, the project will apply network theory to a big amount of diachronic linguistic data and discuss new methods for the automatic analysis and comparison of these networks. Furthermore, the project will enrich the already available digital toolbox with a freely available tool for network analysis and visualisation and will enhance already existing data with additional annotations. The project, coordinated by the ACDH-CH with Tanja Wissik as PI, is carried out by an interdisciplinary team from the ACDH-CH, the University of Vienna and the Vienna University of Technology.


Publications

  • Wissik, Tanja. 2022. Encoding interruptions in parliamentary data: from applause to interjections and laughter. In: Journal of the Text Encoding InitiativeIssue 14, p. k.A.
  • Yim, Seun-bin, Katharina Wünsche, Asil Cetin, Julia Neidhardt, Andreas Baumann, and Tanja Wissik. 2022. Visualizing Parliamentary Speeches as Networks: The DYLEN Tool. In: Fišer, Darja, Maria Eskevich, Jakob Lenardic, and Franciska de Jong (Eds.),Proceedings of the Proceedings of the LREC 2022 ParlaCLARIN III Workshop on Creating, Enriching and Using Parliamentary Corpora.
  • Marakasova, Anna, Klaus Hofmann, Andreas Baumann, Julia Neidhardt, and Tanja Wissik. 2021. Lexical convergence and divergence in Austrian parliamentary debates: a network-based approach. In: Proceedings of the 1st Workshop on Computational Linguistics for Political Text Analysis (CPSS-2021). Düsseldorf.
  • Hofmann, Klaus and Tanja Wissik. 2021. The role of interjections in Austrian parliamentary debates. In: Proceedings of the 1st Workshop on Computational Linguistics for Political Text Analysis (CPSS-2021). Düsseldorf.
  • Baumann, Andreas, Klaus Hofmann, Bettina Kern, Anna Marakasova, Julia Neidhardt, and Tanja Wissik. 2021. Exploring Causal Relationships Among Emotional and Topical Trajectories in Political Text Data. In: Gromann, Dagmar, Gilles Sérasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo, and Barbara Heinisch (Eds.),3rd Conference on Language, Data and Knowledge. LDK 2021, September 1-3, 2021, Zaragoza, SpainOpenAccess Series in Informatics (OASIcs) 93. Dagstuhl: Schloss Dagstuhl -- Leibniz-Zentrum für Informatik, p. 38:1-38:8.
  • Kern, Bettina M. J., Klaus Hofmann, Andreas Baumann, and Tanja Wissik. 2021. Komparative Zeitreihenanalyse der lexikalischen Stabilität und Emotion in österreichischen Korpusdaten. In: Katsikadeli, Christina, Manfred Sellner, and Michael Gassner (Eds.),Digital Lexis and Beyond. Selected Papers from the Workshop „Digital Lexis, and Beyond”. 45th Austrian Linguistics Conference Dec. 2019, p. 104-118.
  • Olsen, Sussi, Bolette S. Pedersen, Tanja Wissik, Anna Woldrich, and Simon Krek. 2020. Stimulating Knowledge Exchange via Trans-National Access – the ELEXIS Travel Grants as a Lexico-graphical Use Case. In: Navaretta, Costanza and Maria Eskevich (Eds.),Proceedings CLARIN Annual Conference 2020, p. 77-81.
  • Hofmann, Klaus, Anna Marakasova, Andreas Baumann, Julia Neidhardt, and Tanja Wissik. 2020. Comparing Lexical Usage in Political Discourse across Diachronic Corpora. Proceedings of the Workshop Creating, Using and Linking of Parliamentary Corpora with Other Types of Political Discourse ( ParlaCLARIN II) at LREC 2020, p. 58-65.
  • Baumann, Andreas, Julia Neidhardt, and Tanja Wissik. 2019. DYLEN: Diachronic Dynamics of Lexical Networks. In: Declerck, Thierry and John P. McCrae (Eds.),Proceedings of the Poster Session of the 2nd Conference on Language, Data and Knowledge (LDK-PS 2019). Leipzig, Germany, May 21, 2019CEUR Workshop Proceedings 2402, p. 24-28.

Contact

Tanja Wissik

Seung-bin Yim

Project Duration

1  May 2019 - 31 December 2021

Links

DYLEN WEBSITE

ÖAW Go!Digital Next Generation

This project reuses language data from:

AMC

ParlAT

Twitter Hashtag

#dylennetworks