cv | Tommaso Mencattini

Basics

Name	Tommaso Mencattini
Label	MSc Data Science Student & Researcher
Email	tommaso.mencattini@epfl.ch
Summary	MSc student in Data Science at EPFL with research experience at ISTA, ETH Zürich, and GLADIA. My work focuses on large language models, model merging, causal inference, and interpretability. First-author papers at ICML, ACL, and publications at NeurIPS workshops and ECAI.

Work

2025.07 - Present

Vienna, Austria
Research Intern

Institute of Science and Technology Austria (ISTA)

Research in causal learning and multimodal foundation models within the Locatello Group.
- Co-first author for a paper on unsupervised causal effect discovery in RCTs (theory + library).
- Developing frameworks combining causality and deep learning for large-scale multimodal systems.
2024.04 - Present

Rome, Italy
Research Student

GLADIA · Sapienza University of Rome

Research on scalable model merging, interpretability, and ability estimation for large language models.
- Scaled evolutionary model merging using consumer GPUs (50× speedup, 2% data usage, 85% performance retained).
- Two first-author papers accepted to ICML 2025 and ACL 2025.
- Co-first author for a paper proving injectivity of LLM representations and presenting the first exact inversion algorithm (viral tweet with 4.7M views).
2023.06 - 2023.08

Zurich, Switzerland
Research Student

ETH Zürich (LRE Lab)

Applied causal inference to explain and assess LLM performance on math word problems.
- Worked on applying causal inference techniques to large language models to detect spurious heuristics in reasoning tasks.
- Ran large-scale experiments on high-performance computing clusters.
2023.01 - 2023.06

Amsterdam, Netherlands
Research Assistant

Vrije Universiteit Amsterdam · KAI Lab

Research at the intersection of knowledge graphs and hallucination mitigation in LLMs.
- Designed Text-to-Graph and Graph-to-Text methods for content planning.
- Trained and evaluated multiple knowledge-graph-integrated LLMs.
2022.07 - 2023.04

Rome, Italy
Research Student

Sapienza University of Rome

NLP research in e-justice applications using specialized large language models.
- Developed GPT-2-based writing assistant for legal documents.
- Publication at ECAI 2023 (CICERO project).
- Conducted evaluation with legal professionals.
2022.05 - 2022.12

Amsterdam, Netherlands
Technology Assistant

Network Institute VU Amsterdam

Technical support for interdisciplinary projects in XR, VR, and motion capture.
- Worked with Unity, C#, iClone, Motive, and Optitrack.
- Organized and presented at Surf XR-On Tour and Reshaping Work 2022 Conference.

Education

2024.09 - Present

Lausanne, Switzerland
Master of Science

École Polytechnique Fédérale de Lausanne (EPFL)

Data Science
- Coursework focused on the Mathematics of data science
2021.09 - 2024.07

Amsterdam, Netherlands
Bachelor of Science

Vrije Universiteit Amsterdam

Artificial Intelligence (Major) & Mathematics (Minor)
- Honours Programme (30 credits of advanced coursework, e.g. PDEs)
2018.09 - 2021.09

Rome, Italy
Bachelor’s Degree

Sapienza University of Rome

Philosophy
- Focused on formal logic and philosophy of language

Awards

2022.01.01

KHMW Young Talent Incentive Award

Royal Holland Society of Sciences and Humanities

National award presented to the student with the highest GPA in their degree program at a Dutch research university.
2021.01.01

Intelligent Systems Competition — 1st Place

Vrije Universiteit Amsterdam

Ranked 1st out of 270 students by developing an intelligent agent for the card game Schnapsen.

Publications

Do Sparse Autoencoders Transfer Across Base and Finetuned LLMs?

NeurIPS 2024 Workshop
CICERO: A GPT-2-based Writing Assistant in e-justice

ECAI 2023
Mergenetic: A Simple Evolutionary Model Merging Library

ACL 2025 · System Demonstrations
Activation Patching for Interpretable Steering in Music Generation

Under Review
MERGE³: Efficient Evolutionary Merging on Consumer-grade GPUs

ICML 2025
Exploratory Causal Inference in SAEnce

Under Review
Language Models are Injective and Hence Invertible

Under Review

Interests

	AI Safety
	Interpretability
	Sparse Autoencoders
	Causal Models

	Theoretical ML
	Representations
	Optimization
	Information Theory

Basics

Work

Institute of Science and Technology Austria (ISTA)

Research in causal learning and multimodal foundation models within the Locatello Group.

GLADIA · Sapienza University of Rome

Research on scalable model merging, interpretability, and ability estimation for large language models.

ETH Zürich (LRE Lab)

Applied causal inference to explain and assess LLM performance on math word problems.

Vrije Universiteit Amsterdam · KAI Lab

Research at the intersection of knowledge graphs and hallucination mitigation in LLMs.

Sapienza University of Rome

NLP research in e-justice applications using specialized large language models.

Network Institute VU Amsterdam

Technical support for interdisciplinary projects in XR, VR, and motion capture.

Education

École Polytechnique Fédérale de Lausanne (EPFL)

Data Science

Vrije Universiteit Amsterdam

Artificial Intelligence (Major) & Mathematics (Minor)

Sapienza University of Rome

Philosophy

Awards

Royal Holland Society of Sciences and Humanities

National award presented to the student with the highest GPA in their degree program at a Dutch research university.

Vrije Universiteit Amsterdam

Ranked 1st out of 270 students by developing an intelligent agent for the card game Schnapsen.

Publications

NeurIPS 2024 Workshop

ECAI 2023

ACL 2025 · System Demonstrations

Under Review

ICML 2025

Under Review

Under Review

Interests