
About me
Hi, I’m Yifan Wang, a PhD student in the Research Training Group (RTG) 2853 “Neuroexplicit Models of Language, Vision, and Action”. I’m lucky to be co-supervised by Prof. Dr. Vera Demberg and Prof. Dr. Isabel Valera.
My research focuses on tackling undesired behaviors in NLP systems, such as toxicity, social biases, and stereotypes. I’m also deeply interested in making large language models more interpretable and transparent. If you’re curious to learn more, feel free to check out my website: ewanwong.github.io.
Publications
2026
Wang, Yifan; Jobanputra, Mayank; Lee, Ji-Ung; Oh, Soyoung; Valera, Isabel; Demberg, Vera
Bridging Fairness and Explainability: Can Input-Based Explanations Promote Fairness in Hate Speech Detection? Journal Article
In: arXiv preprint arXiv:2509.22291, 2026.
@article{wang2026bridging,
title = {Bridging Fairness and Explainability: Can Input-Based Explanations Promote Fairness in Hate Speech Detection?},
author = {Yifan Wang and Mayank Jobanputra and Ji-Ung Lee and Soyoung Oh and Isabel Valera and Vera Demberg},
doi = {10.48550/arXiv.2509.22291},
year = {2026},
date = {2026-02-11},
urldate = {2026-02-11},
abstract = {Natural language processing (NLP) models often replicate or amplify social bias from training data, raising concerns about fairness. At the same time, their black-box nature makes it difficult for users to recognize biased predictions and for developers to effectively mitigate them. While some studies suggest that input-based explanations can help detect and mitigate bias, others question their reliability in ensuring fairness. Existing research on explainability in fair NLP has been predominantly qualitative, with limited large-scale quantitative analysis. In this work, we conduct the first systematic study of the relationship between explainability and fairness in hate speech detection, focusing on both encoder- and decoder-only models. We examine three key dimensions: (1) identifying biased predictions, (2) selecting fair models, and (3) mitigating bias during model training. Our findings show that input-based explanations can effectively detect biased predictions and serve as useful supervision for reducing bias during training, but they are unreliable for selecting fair models among candidates. Our code is available at https://github.com/Ewanwong/fairness_x_explainability.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2025
Jobanputra, Mayank; Kovtunova, Alisa; Balthes, Brisca; Pogulskiy, Fedor Grigoryevich; Wang, Yifan; Borgwardt, Stefan; Demberg, Vera
ProofTeller: Exposing recency bias in LLM reasoning and its side effects on communication Proceedings Article
In: Inui, Kentaro; Sakti, Sakriani; Wang, Haofen; Wong, Derek F.; Bhattacharyya, Pushpak; Banerjee, Biplab; Ekbal, Asif; Chakraborty, Tanmoy; Singh, Dhirendra Pratap (Ed.): Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, pp. 1439–1462, The Asian Federation of Natural Language Processing and The Association for Computational Linguistics, Mumbai, India, 2025, ISBN: 979-8-89176-298-5.
@inproceedings{jobanputra-etal-2025-proofteller,
title = {ProofTeller: Exposing recency bias in LLM reasoning and its side effects on communication},
author = {Mayank Jobanputra and Alisa Kovtunova and Brisca Balthes and Fedor Grigoryevich Pogulskiy and Yifan Wang and Stefan Borgwardt and Vera Demberg},
editor = {Kentaro Inui and Sakriani Sakti and Haofen Wang and Derek F. Wong and Pushpak Bhattacharyya and Biplab Banerjee and Asif Ekbal and Tanmoy Chakraborty and Dhirendra Pratap Singh},
url = {https://aclanthology.org/2025.ijcnlp-long.80/},
doi = {10.18653/v1/2025.ijcnlp-long.80},
isbn = {979-8-89176-298-5},
year = {2025},
date = {2025-12-01},
urldate = {2025-12-01},
booktitle = {Proceedings of the 14th International Joint Conference on Natural Language Processing and the 4th Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics},
pages = {1439–1462},
publisher = {The Asian Federation of Natural Language Processing and The Association for Computational Linguistics},
address = {Mumbai, India},
abstract = {Large language models (LLMs) are increasingly applied in domains that demand reliable and interpretable reasoning. While formal methods can generate provably correct proofs, these proofs are often inaccessible to non-expert users. This raises a natural question: can LLMs, when given a verified proof, faithfully interpret its reasoning and communicate it clearly? We introduce ProofTeller, a benchmark that evaluates this ability across three tasks: (1) identifying key proof steps, (2) summarizing the reasoning, and (3) explaining the result in concise natural language. The benchmark covers three domains: Biology, Drones, and Recipes, representing scientific, safety-critical, and everyday reasoning scenarios. We find a consistent near-conclusion bias: LLMs tend to focus on steps closest to the final proof conclusion rather than on the most informative ones. A targeted human study confirms that explanations based on such steps are rated less appropriate for end users. These findings indicate that even when reasoning is provided, current LLMs face challenges in communicating key information in a useful manner, highlighting the need for LLMs that can communicate important details reliably.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Wang, Yifan; Rao, Sukrut; Lee, Ji-Ung; Jobanputra, Mayank; Demberg, Vera
B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability Journal Article
In: Transactions on Machine Learning Research, 2025, ISSN: 2835-8856.
@article{wang2025bcos,
title = {B-cos LM: Efficiently Transforming Pre-trained Language Models for Improved Explainability},
author = {Yifan Wang and Sukrut Rao and Ji-Ung Lee and Mayank Jobanputra and Vera Demberg},
url = {https://openreview.net/forum?id=c180UH8Dg8},
issn = {2835-8856},
year = {2025},
date = {2025-01-01},
urldate = {2025-01-01},
journal = {Transactions on Machine Learning Research},
abstract = {Post-hoc explanation methods for black-box models often struggle with faithfulness and human interpretability due to the lack of explainability in current neural architectures. Meanwhile, B-cos networks have been introduced to improve model explainability by proposing an architecture that removes bias terms and promotes input-weight alignment. Although B-cos networks have shown success in building explainable systems, their application has so far been limited to computer vision models and their associated training pipelines. In this work, we introduce B-cos LMs, i.e., B-cos Language Models (LMs) empowered for natural language processing (NLP) tasks. Our approach directly transforms pre-trained language models into B-cos LMs by combining B-cos conversion and task fine-tuning, improving efficiency compared to previous methods. Automatic and human evaluation results demonstrate that B-cos LMs produce more faithful and human interpretable explanations than post-hoc methods, while maintaining task performance comparable to conventional fine-tuning. Our in-depth analysis explores how B-cos LMs differ from conventionally fine-tuned models in their learning processes and explanation patterns. Finally, we present a first exploration of transforming decoder-only models to B-cos LMs for generation tasks. Our code is available at https://github.com/Ewanwong/bcos_lm.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Jobanputra, Mayank; Walter, Nils Philipp; Mehta, Maitrey; Veseli, Blerta; Chapple, Evan Parker Kelly; Wang, Yifan; Chetani, Sneha; Pavlick, Ellie; Vergari, Antonio; Demberg, Vera
Can LLMs subtract numbers? Miscellaneous
arXiv preprint arXiv:2511.02795, 2025.
@misc{jobanputra2025llmssubtractnumbers,
title = {Can LLMs subtract numbers?},
author = {Mayank Jobanputra and Nils Philipp Walter and Maitrey Mehta and Blerta Veseli and Evan Parker Kelly Chapple and Yifan Wang and Sneha Chetani and Ellie Pavlick and Antonio Vergari and Vera Demberg},
url = {https://arxiv.org/abs/2511.02795},
year = {2025},
date = {2025-01-01},
urldate = {2025-01-01},
abstract = {We present a systematic study of subtraction in large language models (LLMs). While prior benchmarks emphasize addition and multiplication, subtraction has received comparatively little attention despite being structurally distinct as a non-commutative operation. We evaluate eight pretrained LLMs spanning four families on addition and subtraction problems. Our experiments reveal that subtraction accuracy lags behind addition by a wide margin. We find that the errors for (a-b) are concentrated in cases where (a<b). In such cases, LLMs frequently produce the correct magnitude but omit the negative sign. Probing analyses show that LLMs internally encode whether results should be negative, yet this information is often not reflected in generated outputs. We further test well-known techniques such as few-shot learning and instruction-tuning to see if they can improve the LLMs' performance. Our results suggest that while few-shot prompting yields modest gains, the instruction-tuned models achieve near-perfect accuracies in generating the negative sign. Together, these findings provide a clearer characterization of the limitations and recoverability of LLMs' arithmetic capabilities in subtraction.},
keywords = {},
pubstate = {published},
tppubtype = {misc}
}
2024
Wang, Yifan; Demberg, Vera
RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework Proceedings Article
In: Al-Onaizan, Yaser; Bansal, Mohit; Chen, Yun-Nung (Ed.): Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp. 5561–5582, Association for Computational Linguistics, Miami, Florida, USA, 2024.
@inproceedings{wang-demberg-2024-rsa,
title = {RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework},
author = {Yifan Wang and Vera Demberg},
editor = {Yaser Al-Onaizan and Mohit Bansal and Yun-Nung Chen},
url = {https://aclanthology.org/2024.emnlp-main.318/},
doi = {10.18653/v1/2024.emnlp-main.318},
year = {2024},
date = {2024-11-01},
booktitle = {Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing},
pages = {5561–5582},
publisher = {Association for Computational Linguistics},
address = {Miami, Florida, USA},
abstract = {Despite significant advancements in natural language generation, controlling language models to produce texts with desired attributes remains a formidable challenge. In this work, we introduce RSA-Control, a training-free controllable text generation framework grounded in pragmatics. RSA-Control directs the generation process by recursively reasoning between imaginary speakers and listeners, enhancing the likelihood that target attributes are correctly interpreted by listeners amidst distractors. Additionally, we introduce a self-adjustable rationality parameter, which allows for automatic adjustment of control strength based on context. Our experiments, conducted with two task types and two types of language models, demonstrate that RSA-Control achieves strong attribute control while maintaining language fluency and content consistency. Our code is available at https://github.com/Ewanwong/RSA-Control.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Wang, Yifan; Demberg, Vera
A Parameter-Efficient Multi-Objective Approach to Mitigate Stereotypical Bias in Language Models Proceedings Article
In: Faleńska, Agnieszka; Basta, Christine; Costa-jussà, Marta; Goldfarb-Tarrant, Seraphina; Nozza, Debora (Ed.): Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP), pp. 1–19, Association for Computational Linguistics, Bangkok, Thailand, 2024.
@inproceedings{wang-demberg-2024-parameter,
title = {A Parameter-Efficient Multi-Objective Approach to Mitigate Stereotypical Bias in Language Models},
author = {Yifan Wang and Vera Demberg},
editor = {Agnieszka Faleńska and Christine Basta and Marta Costa-jussà and Seraphina Goldfarb-Tarrant and Debora Nozza},
url = {https://aclanthology.org/2024.gebnlp-1.1/},
doi = {10.18653/v1/2024.gebnlp-1.1},
year = {2024},
date = {2024-08-01},
booktitle = {Proceedings of the 5th Workshop on Gender Bias in Natural Language Processing (GeBNLP)},
pages = {1–19},
publisher = {Association for Computational Linguistics},
address = {Bangkok, Thailand},
abstract = {Pre-trained language models have shown impressive abilities of understanding and generating natural languages. However, they typically inherit undesired human-like bias and stereotypes from training data, which raises concerns about putting these models into use in real-world scenarios. Although prior research has proposed to reduce bias using different fairness objectives, they usually fail to capture different representations of bias and, therefore, struggle with fully debiasing models. In this work, we introduce a multi-objective probability alignment approach to overcome current challenges by incorporating multiple debiasing losses to locate and penalize bias in different forms. Compared to existing methods, our proposed method can more effectively and comprehensively reduce stereotypical bias, and maintains the language ability of pre-trained models at the same time. Besides, we adopt prefix-tuning to optimize fairness objectives, and results show that it can achieve better bias removal than full fine-tuning while requiring much fewer computational resources. Our code and data are available at https://github.com/Ewanwong/debias_NLG.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Liu, Dongqi; Wang, Yifan; Loy, Jia; Demberg, Vera
SciNews: From Scholarly Complexities to Public Narratives – a Dataset for Scientific News Report Generation Proceedings Article
In: Calzolari, Nicoletta; Kan, Min-Yen; Hoste, Veronique; Lenci, Alessandro; Sakti, Sakriani; Xue, Nianwen (Ed.): Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 14429–14444, ELRA and ICCL, Torino, Italia, 2024.
@inproceedings{pu-etal-2024-scinews,
title = {SciNews: From Scholarly Complexities to Public Narratives – a Dataset for Scientific News Report Generation},
author = {Dongqi Liu and Yifan Wang and Jia Loy and Vera Demberg},
editor = {Nicoletta Calzolari and Min-Yen Kan and Veronique Hoste and Alessandro Lenci and Sakriani Sakti and Nianwen Xue},
url = {https://aclanthology.org/2024.lrec-main.1258/},
year = {2024},
date = {2024-05-01},
urldate = {2024-05-01},
booktitle = {Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)},
pages = {14429–14444},
publisher = {ELRA and ICCL},
address = {Torino, Italia},
abstract = {Scientific news reports serve as a bridge, adeptly translating complex research articles into reports that resonate with the broader public. The automated generation of such narratives enhances the accessibility of scholarly insights. In this paper, we present a new corpus to facilitate this paradigm development. Our corpus comprises a parallel compilation of academic publications and their corresponding scientific news reports across nine disciplines. To demonstrate the utility and reliability of our dataset, we conduct an extensive analysis, highlighting the divergences in readability and brevity between scientific news narratives and academic manuscripts. We benchmark our dataset employing state-of-the-art text generation models. The evaluation process involves both automatic and human evaluation, which lays the groundwork for future explorations into the automated generation of scientific news reports. The dataset and code related to this work are available at https://dongqi.me/projects/SciNews.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2023
Liu, Dongqi; Wang, Yifan; Demberg, Vera
Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization Proceedings Article
In: Rogers, Anna; Boyd-Graber, Jordan; Okazaki, Naoaki (Ed.): Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 5574–5590, Association for Computational Linguistics, Toronto, Canada, 2023.
@inproceedings{pu-etal-2023-incorporating,
title = {Incorporating Distributions of Discourse Structure for Long Document Abstractive Summarization},
author = {Dongqi Liu and Yifan Wang and Vera Demberg},
editor = {Anna Rogers and Jordan Boyd-Graber and Naoaki Okazaki},
url = {https://aclanthology.org/2023.acl-long.306/},
doi = {10.18653/v1/2023.acl-long.306},
year = {2023},
date = {2023-07-01},
urldate = {2023-07-01},
booktitle = {Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
pages = {5574–5590},
publisher = {Association for Computational Linguistics},
address = {Toronto, Canada},
abstract = {For text summarization, the role of discourse structure is pivotal in discerning the core content of a text. Regrettably, prior studies on incorporating Rhetorical Structure Theory (RST) into transformer-based summarization models only consider the nuclearity annotation, thereby overlooking the variety of discourse relation types. This paper introduces the ‘RSTformer’, a novel summarization model that comprehensively incorporates both the types and uncertainty of rhetorical relations. Our RST-attention mechanism, rooted in document-level rhetorical structure, is an extension of the recently devised Longformer framework. Through rigorous evaluation, the model proposed herein exhibits significant superiority over state-of-the-art models, as evidenced by its notable performance on several automatic metrics and human evaluation.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
