Trustworthy Human Language Technologies (TrustHLT) is an Independent Research Group hosted at the Department of Computer Science at the Technical University of Darmstadt, Germany, appointed in January 2021.
My current research areas include legal argument mining, privacy-preserving NLP, and explainable and trustworthy models. My research track spans argument mining and computational argumentation, crowdsourcing, large-scale corpora, serious games, sentiment and sarcasm on social media, and semantic web.
Lena is currently exploring the research area of computational argumentation in the legal domain.
Timour's current research areas include privacy-preserving NLP, differential privacy in graph neural networks, and privacy-preserving semantic representations of language.
Sebastian' research areas include privacy-preserving NLP with a focus on text rewriting with provable guarantees.
Chris's thesis focuses on finding best practices on how to optimally adapt the concept of differential privacy in NLP environments while putting the needs of the end-users first and considering perceptional biases to make differential privacy more accessible.
Lijie is a second-year PhD student in Computer Science at King Abdullah University of Science and Technology. Her research interests cover machine learning algorithm on Explainable AI (XAI), Differential Privacy, and Differential Private Natural Language Models. She is also interested in Machine Unlearning, and other security issues in data field.
Nina wrote her thesis on privacy-preserving techniques for crowdsourcing sensitive text data.
Johanna studied computer science at TU Darmstadt. In her bachelor thesis, she compiled an easily accessible legal benchmark dataset to enable evaluating models on a variety of legal NLP tasks.
Lars, student of information systems technologies, cooperated with political scientists to identify indoctrination in German history textbooks through entity emotion analysis.
Ying explored privacy-preserving transformer models in the legal domain. Her thesis combined large-scale pre-training with differential privacy and evaluates the trade-off between privacy-preserving capability and downstream performance.
Sarah explored ethical argumentation in scientific literature. Her thesis focused on controversial technologies and automatic mining of absent, shifting, and evolving ethical arguments.
Manuel was a bachelor's student at the TU Darmstadt focusing on machine learning. He wrote his thesis on the effectiveness and impact on accuracy using differential privacy in NLP.
Lena studied computer science at TU Darmstadt. In her thesis she dealt with differentially private language representation learning.
Daniel explored legal argument mining in court decisions with focus on ECHR decisions and their art of argumentation in regard to their importance level.
Fabian's research area included legal argument mining, expert annotations, and low-resource and few-shot transfer learning for annotation recommendations.
TrustHLT has currently the following open positions
We're looking for several student research assistants (HiWi) for a research project related to privacy-preserving natural language processing and neural machine translation. Read the full job posting (PDF).
We're looking for several student research assistants (HiWi) for a research project related to Artificial Intelligence in Legal Proceedings. Read the full job posting (PDF).
We're looking for several student tutors (HiWi and "Praktikum in der Lehre") for the upcoming semester of "Deep Learning for Natural Langauge Processing". Flexible hours, knowledge of DL and NLP is essential. Contact Dr. Martin Tutek ( martin.tutek@tu-darmstadt.de ) or Dr. Ivan Habernal ( ivan.habernal@tu-darmstadt.de ).
It was my pleasure to give an invited talk about Privacy-Preserving Natural Language Processing at the Aalto University in Helsinki. The video recording should soon become available.
The 17th Conference of the European Chapter of the Association for Computational Linguistics will host our tutorial on Privacy-Preserving Natural Langauge Processing in Dubrovnik, in May 2023.
Our new paper "One size does not fit all: Investigating strategies for differentially-private learning across NLP tasks" by Manuel Senge, Timour Igamberdiev, and myself will be presented at the 2022 Conference on Empirical Methods in Natural Language Processing in Abu Dhabi in December this year.
In this winter term, I'm holding a W2 interim professorship at the The Center for Information and Language Processing at the Ludwig-Maximilians-Universität München.
Our new paper "DP-Rewrite: Towards Reproducibility and Transparency in Differentially Private Text Rewriting" by Timour Igamberdiev (TrustHLT), Thomas Arnold (UKP), and myself will be presented at the 29th International Conference on Computational Linguistics in Korea in October this year.
I'm now a member of hessian.AI — The Hessian Center for Artificial Intelligence. Its mission is to drive research excellence, education, practice and leadership in AI to foster economic growth and improve the human condition.
Our paper on protecting privacy of models trained on graph data using differential privacy has been accepted at the International Conference on Language Resources and Evaluation (LREC) to be held in Marseille, France in June.
Our paper analyzing trickiness of differentially-private text representation learning will be presented at the 60th Annual Meeting of the Association for Computational Linguistics, the world's top conference for natural language processing.
I'm giving an invited lecture at the School of Computing and Information Science, University of Maine with a bit provoking title "If all you have is a hammer, everything looks like a nail: SGD-DP in privacy-preserving NLP" (download slides).
Our paper on the pitfalls of differential privacy in NLP will be presented at the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), one of the world's leading conferences for natural language processing.
I'll be giving a guest lecture at the International Summer School on "AI and Criminal Justice" in Rome on July 12th. This summer school is a great opportunity to acquire an interdisciplinary and in-depth knowledge in the cutting-edge area of AI and criminal justice.
I'm happy to volunteer as a mentor for early career researchers at this year's Conference of the European Chapter of the Association for Computational Linguistics (EACL). One of the topics on the agenda is "How to survive grad school", I'm very much looking forward to some fresh perspectives!
Thanks to Yang Gao for invited me over to Royal Holloway, University of London to give an invited talk on privacy-preserving NLP, a joint work with Timour Igamberdiev. Slides available here.
Happy to join the Area Chairs for sentiment analysis and argument mining at this year's Conference on Empirical Methods in Natural Language Processing (EMNLP).
I happily accepted an invitation to join the standing reviewer board of Computational Linguistics, the "longest-running publication devoted exclusively to the computational and mathematical properties of language".
Together with Isabelle Augenstein and tutorial chairs for NAACL, EMNLP, and ACL-IJCNLP, we are preparing the next year's selection of tutorials to be presented either virtually or in-person.
In this interdisciplinary collaboration, we look into argumentation in the verdicts of the European Court of Human Rights. What makes a verdict of a high importance? Is it the facts? Is it the argumentation pattern? Is it the judges? Or is it something left between the lines?
We combine legal expertise with transformer-based recommendation engines to scale up annotated data acquisition.
We collaborate with expert legal researchers Prof. Dr. Indra Spiecker and Prof. Dr. Christoph Burchard from Geothe University Frankfurt as well as UKP Lab's Prof. Dr. Iryna Gurevych.
Chair for German, European and International Criminal Law and Procedure, Comparative Law and Legal Theory
Director of the Ubiquitous Knowledge Processing (UKP) Lab
Chair in Public Law, Information Law, Environmental Law and Legal Theory
Our slides and videos are freely available at GitHub under open licences.
Send me an e-mail