cv
Scroll for a long version of my cv, or download the pdf for a shorter one.
Basics
| Name | Tommaso Mencattini |
| Label | MSc Data Science Student & Researcher |
| tommaso.mencattini@epfl.ch | |
| Summary | MSc student in Data Science at EPFL with research experience at ISTA, ETH Zürich, and GLADIA. My work focuses on large language models, model merging, causal inference, and interpretability. First-author papers at ICML, ACL, and publications at NeurIPS workshops and ECAI. |
Work
-
2025.07 - Present Vienna, Austria
Research Intern
Institute of Science and Technology Austria (ISTA)
Research in causal learning and multimodal foundation models within the Locatello Group.
- Co-first author for a paper on unsupervised causal effect discovery in RCTs (theory + library).
- Developing frameworks combining causality and deep learning for large-scale multimodal systems.
-
2024.04 - Present Rome, Italy
Research Student
GLADIA · Sapienza University of Rome
Research on scalable model merging, interpretability, and ability estimation for large language models.
- Scaled evolutionary model merging using consumer GPUs (50× speedup, 2% data usage, 85% performance retained).
- Two first-author papers accepted to ICML 2025 and ACL 2025.
- Co-first author for a paper proving injectivity of LLM representations and presenting the first exact inversion algorithm (viral tweet with 4.7M views).
-
2023.06 - 2023.08 Zurich, Switzerland
Research Student
ETH Zürich (LRE Lab)
Applied causal inference to explain and assess LLM performance on math word problems.
- Worked on applying causal inference techniques to large language models to detect spurious heuristics in reasoning tasks.
- Ran large-scale experiments on high-performance computing clusters.
-
2023.01 - 2023.06 Amsterdam, Netherlands
Research Assistant
Vrije Universiteit Amsterdam · KAI Lab
Research at the intersection of knowledge graphs and hallucination mitigation in LLMs.
- Designed Text-to-Graph and Graph-to-Text methods for content planning.
- Trained and evaluated multiple knowledge-graph-integrated LLMs.
-
2022.07 - 2023.04 Rome, Italy
Research Student
Sapienza University of Rome
NLP research in e-justice applications using specialized large language models.
- Developed GPT-2-based writing assistant for legal documents.
- Publication at ECAI 2023 (CICERO project).
- Conducted evaluation with legal professionals.
-
2022.05 - 2022.12 Amsterdam, Netherlands
Technology Assistant
Network Institute VU Amsterdam
Technical support for interdisciplinary projects in XR, VR, and motion capture.
- Worked with Unity, C#, iClone, Motive, and Optitrack.
- Organized and presented at Surf XR-On Tour and Reshaping Work 2022 Conference.
Education
-
2024.09 - Present Lausanne, Switzerland
Master of Science
École Polytechnique Fédérale de Lausanne (EPFL)
Data Science
- Coursework focused on the Mathematics of data science
-
2021.09 - 2024.07 Amsterdam, Netherlands
Bachelor of Science
Vrije Universiteit Amsterdam
Artificial Intelligence (Major) & Mathematics (Minor)
- Honours Programme (30 credits of advanced coursework, e.g. PDEs)
-
2018.09 - 2021.09 Rome, Italy
Bachelor’s Degree
Sapienza University of Rome
Philosophy
- Focused on formal logic and philosophy of language
Awards
- 2022.01.01
KHMW Young Talent Incentive Award
Royal Holland Society of Sciences and Humanities
National award presented to the student with the highest GPA in their degree program at a Dutch research university.
- 2021.01.01
Intelligent Systems Competition — 1st Place
Vrije Universiteit Amsterdam
Ranked 1st out of 270 students by developing an intelligent agent for the card game Schnapsen.
Publications
-
Do Sparse Autoencoders Transfer Across Base and Finetuned LLMs?
NeurIPS 2024 Workshop
-
Mergenetic: A Simple Evolutionary Model Merging Library
ACL 2025 · System Demonstrations
-
Exploratory Causal Inference in SAEnce
Under Review
-
Language Models are Injective and Hence Invertible
Under Review
Interests
| AI Safety | ||||
| Interpretability | ||||
| Sparse Autoencoders | ||||
| Causal Models | ||||
| Theoretical ML | ||||
| Representations | ||||
| Optimization | ||||
| Information Theory | ||||