KIT | KIT-Bibliothek | Impressum | Datenschutz

Automated identification of quoted actors in journalistic articles

Hohenwalde, Clarissa Elisabeth ORCID iD icon; Lüders, Tabea ORCID iD icon 1; Leidecker-Sandmann, Melanie Marita ORCID iD icon 1
1 Institut für Technikzukünfte (ITZ), Karlsruher Institut für Technologie (KIT)

Abstract:

Automated Identification of Quoted Actors in Journalistic Articles
"Who gets to speak in the news is crucial for the shaping of the news."(Beckers & Van Aelst, 2019) Identifying actors who are actively contributing to mediated public discourses on societally relevant issues is crucial for examining the degree of deliberation in democratic decision-making processes (Habermas, 1992). Hence, this study evaluates a fully automated procedure for identifying such active actors that are directly or indirectly quoted in journalistic articles.
The technical developed pipeline comprises two components. First, all entities in the text are identified using a Named Entity Recognition (NER) procedure, which was validated in a prior study (Buz et al., 2022). Second, through a combination of rule-based filters and targeted prompts sent to ChatGPT-4o with a temperature of 0.2 for each extracted entity, it is automatically checked whether they are 1) the author of the article, 2) a real person, and 3) directly or indirectly quoted. Finally, 4) duplicates are excluded from the analysis.
The ChatGPT classification of actors was compared against manual coding of a diverse sample of n = 524 news articles covering science and technology issues. ... mehr


Zugehörige Institution(en) am KIT Institut für Technikzukünfte (ITZ)
Publikationstyp Vortrag
Publikationsdatum 26.09.2025
Sprache Englisch
Identifikator KITopen-ID: 1000185206
Veranstaltung Computational Methods in Science Communication Research (2025), Landau in der Pfalz, Deutschland, 25.09.2025 – 26.09.2025
Bemerkung zur Veröffentlichung The data and script underlying this study are publicly available on GitLab and can be accessed via the following link: https://gitlab.com/wisskomm-in-digitalen-medien/ automated-identification-of-quoted-actors-in-journalistic-articles
Schlagwörter automated content analysis; automated annotation; ChatGPT; news coverage; content analysis; actor identification; actor classification
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page