she / her
I'm a 3rd year PhD student in Natural Language Processing
working on making social media safer and more diverse. Particularly, I’m interested
in offensive / toxic text detection in online comment sections.
Who defines what is toxic? Often, what is considered as toxic is decided by online platforms themselves; and diverse perspectives on what is toxic to whom are overlooked. Developing toxicity detection systems with disaggregated labels is an interesting problem to tackle in my opinion. Therefore, I’m working on user-centric approaches to toxicity detection.
Natural language processing is still predominantly English-centred. I have frequently been working with Austrian German data that can be considered low-resource. In the future, I would like to extend my work to low-resource variants of Spanish.
I am lucky to have collaborated with political communication scientists throughout my PhD. I learned about rigorous methods to define toxicity and works measuring the effects of online toxicity which have been overlooked by computer scientists so far. I want to find ways to bring computer and social science closer.
I love all areas of Natural Language Processing, from foundations to application. Please reach out to me if you want to discuss!
Contact me by reordering @ tuwien.ac.at pia.pachinger
2024 ACL
Findings
AustroTox: A Dataset for Target-Based Austrian German Offensive Language Detection
Pia Pachinger, Janis Goldzycher, Anna Maria Planitzer, Wojciech Kusa, Allan Hanbury, Julia Neidhardt
Here is the data.
Here is the poster.
2023 EACL C3NLP
Toward Disambiguating the Definitions of Abusive, Offensive, Toxic, and Uncivil Comments
Pia Pachinger, Allan Hanbury, Julia Neidhardt, and Anna Maria Planitzer
2022
TU Vienna
A Recommender System for Scientific Referees Based on Bibliographic Databases and Knowledge Graphs
Pia Pachinger, Emanuel Sallinger, Georg Gottlob, Joël Ouaknine, Glenn Starkman, Matt Rainey
2024 NAACL, Student Research Workshop
Best PhD proposal
User-Centric Offensive Text Detection in
Culture-Specific Contexts: A PhD Proposal
Pia Pachinger
2024 German Society for Computational Linguistics
Conference stipend for students (KONVENS
2024)
2023 Library Andreas Züst, Switzerland
Artist residency
An AI Reading of the
Library of Babel
Simón López Trujillo*, Pia
Pachinger*, Baltazar Pérez* (* equal contribution)
2022 Inria, Paris
Design + interaction + AI hackathon
Second Prize
Anaïs Cambou*,
Anthonin Gourichon*, Fengyu Li*, Xiaoning Meng*, Pia Pachinger* (* equal
contribution)
2024 NAACL
Workshop on Online Abuse and Harms
AustroTox: A Dataset for Span-Based Austrian German and English Offensive Language Detection
Pia Pachinger, Janis Goldzycher, Anna Maria Planitzer, Wojciech Kusa, Allan Hanbury
, Julia Neidhardt
2023
ACL
Workshop on Online Abuse and Harms
Toward Disambiguating the Definitions of Abusive, Offensive, Toxic, and Uncivil Comments
Pia Pachinger, Allan Hanbury, Julia Neidhardt, and Anna Maria Planitzer
2023
University of Chile
Open Beauchef
Toxic Comment Detection in Social Media
Pia Pachinger
2024 LREC-Coling
2025 ACL , Workshop on Online Abuse and Harms
2023 / 2024 Faculty of Informatics, TU Vienna
Natural Language Processing and Information Extraction
2023 Faculty of Informatics, TU Vienna
Advanced Information Retrieval
2023 Faculty of Linguistics, Paris Lodron University Salzburg
Language Technology and Language Data
2018 - 2020 Faculty of Mathematics, University of Vienna
Introduction to Wolfram Mathematica
2019 Faculty of Mathematics, University of Vienna
Python for Mathematicians
2021-2022 Databases and Artificial
Intelligence Group, TU Vienna
Project staff
Implementation of recommender system for
scientific referees
07/2019 - 09/2020 Centre for Cyber Security,
Austrian Institute of Technology
Freelance research engineer
Implementation of
Deep Learning methods for anomaly detection in log data
2022 – 2025 PhD in Informatics, TU Vienna
Natural Language
Processing, User-Centric Offensive Text Detection in Culture-Specific Contexts
2018 - 2022 Master in Data Science, TU Vienna
Machine Learning and
Statistics, Natural Language Processing and Visual Analytics
Passed with
distinction
2014 - 2018 Bachelor in Mathematics, University of Vienna
09/2019 - 01/2020 University of Vienna
Member of the working group for creating the curriculum of the new data science master studies
08/2013 - 01/2014 Guarderia Don Bosco , San José, Costa Rica
Volunteering in a kindergarden for socially deprived children
02/2018 - 07/2018 University of Bergen
Collaboration with Morten Brun on Topological Data Analysis
07/2018 - 09/2018 National University of Colombia
Collaboration with Francisco Gómez on Topological Data Analysis
08/2016 - 02/2017 Universidad Autónoma de Madrid
Erasmus
German (Mother tongue)
English (C1, IELTS 2017)
Spanish (C1)