DEBIASED

The main aims of the project DeBiASED (Detecting Biases in the Austrian Everyday Discourse) are to detect, map and analyse biases in the Austrian media landscape, using as an example the 20-year collection of the Austrian Media Corpus (AMC), to date the largest German corpus of its kind. The objective is to explore this large, diachronic corpus of Austrian German media in order to detect cognitive and systemic biases, while also shedding light on linguistic, cultural, political, sociological and geographical aspects. The methodological approach comprises the development and application of Machine Learning (ML) and Natural Language Processing (NLP) tools to build language models based on word embeddings and on the neural network architectures known as “Transformers”, alongside more traditional linguistic, lexical and lexicographical analyses. It also involves crafting topical sets of word analogies to detect biases with these language models.
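
To illustrate how a word analogy can be evaluated against an embedding model, here is a minimal sketch. It is not the project's implementation: the vectors are invented toy values, and the names `TOY_VECTORS`, `cosine` and `solve_analogy` are hypothetical. It assumes the common vector-offset approach, in which "a is to b as c is to d" is solved by finding the word closest to b - a + c.

```python
import math

# Toy 3-dimensional vectors, invented purely for illustration. Real word
# embeddings are trained on a corpus and have hundreds of dimensions.
TOY_VECTORS = {
    "mann": [1.0, 0.0, 0.2],
    "frau": [0.0, 1.0, 0.2],
    "lehrer": [1.0, 0.0, 0.9],
    "lehrerin": [0.0, 1.0, 0.9],
    "pflegerin": [0.1, 1.0, 0.7],
}


def cosine(u, v):
    """Cosine similarity between two vectors of equal length."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def solve_analogy(a, b, c, vectors):
    """Return the word d that best completes 'a is to b as c is to d'."""
    # Vector-offset method: the target point is b - a + c.
    target = [vb - va + vc
              for va, vb, vc in zip(vectors[a], vectors[b], vectors[c])]
    # The three query words are excluded from the candidates.
    candidates = [w for w in vectors if w not in (a, b, c)]
    return max(candidates, key=lambda w: cosine(target, vectors[w]))


print(solve_analogy("mann", "frau", "lehrer", TOY_VECTORS))
```

With these toy values the analogy resolves to the grammatical counterpart "lehrerin". In a bias study, one would look instead at which completion a model trained on real text prefers. A stereotyped occupation ranked above the direct counterpart could be one indicator of bias in the underlying corpus.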

Modern linguistics, as well as the most complex topics in machine learning today, is based on language models created for natural language processing (NLP) tasks. The models and sets of analogies created in the scope of the project can be used in a wide variety of tasks, ranging from measuring the quality of corpora and evaluating language models to syntactic, semantic and pragmatic research on contemporary German. Not only academic researchers but also citizens can benefit from the availability of German language models, dictionaries, mappings of regional language variation, diachronic studies, word statistics, collocations and so on. The power of embedding models lies in summarizing and making available a large amount of information that would not otherwise be accessible.

Contact

Amelie Dorn

Renato Rocha Souza

Project duration

01 May 2021 - ongoing

Twitter Hashtag

#DeBiASED

