cv

Scroll for a long version of my cv, or download the pdf for a shorter one.

Basics

Name Tommaso Mencattini
Label MSc Data Science Student & Researcher
Email tommaso.mencattini@epfl.ch
Summary MSc student in Data Science at EPFL with research experience at ISTA, ETH Zürich, and GLADIA. My work focuses on large language models, model merging, causal inference, and interpretability. First-author papers at ICML, ACL, and publications at NeurIPS workshops and ECAI.

Work

  • 2025.07 - Present

    Vienna, Austria

    Research Intern
    Institute of Science and Technology Austria (ISTA)
    Research in causal learning and multimodal foundation models within the Locatello Group.
    • Co-first author for a paper on unsupervised causal effect discovery in RCTs (theory + library).
    • Developing frameworks combining causality and deep learning for large-scale multimodal systems.
  • 2024.04 - Present

    Rome, Italy

    Research Student
    GLADIA · Sapienza University of Rome
    Research on scalable model merging, interpretability, and ability estimation for large language models.
    • Scaled evolutionary model merging using consumer GPUs (50× speedup, 2% data usage, 85% performance retained).
    • Two first-author papers accepted to ICML 2025 and ACL 2025.
    • Co-first author for a paper proving injectivity of LLM representations and presenting the first exact inversion algorithm (viral tweet with 4.7M views).
  • 2023.06 - 2023.08

    Zurich, Switzerland

    Research Student
    ETH Zürich (LRE Lab)
    Applied causal inference to explain and assess LLM performance on math word problems.
    • Worked on applying causal inference techniques to large language models to detect spurious heuristics in reasoning tasks.
    • Ran large-scale experiments on high-performance computing clusters.
  • 2023.01 - 2023.06

    Amsterdam, Netherlands

    Research Assistant
    Vrije Universiteit Amsterdam · KAI Lab
    Research at the intersection of knowledge graphs and hallucination mitigation in LLMs.
    • Designed Text-to-Graph and Graph-to-Text methods for content planning.
    • Trained and evaluated multiple knowledge-graph-integrated LLMs.
  • 2022.07 - 2023.04

    Rome, Italy

    Research Student
    Sapienza University of Rome
    NLP research in e-justice applications using specialized large language models.
    • Developed GPT-2-based writing assistant for legal documents.
    • Publication at ECAI 2023 (CICERO project).
    • Conducted evaluation with legal professionals.
  • 2022.05 - 2022.12

    Amsterdam, Netherlands

    Technology Assistant
    Network Institute VU Amsterdam
    Technical support for interdisciplinary projects in XR, VR, and motion capture.
    • Worked with Unity, C#, iClone, Motive, and Optitrack.
    • Organized and presented at Surf XR-On Tour and Reshaping Work 2022 Conference.

Education

  • 2024.09 - Present

    Lausanne, Switzerland

    Master of Science
    École Polytechnique Fédérale de Lausanne (EPFL)
    Data Science
    • Coursework focused on the Mathematics of data science
  • 2021.09 - 2024.07

    Amsterdam, Netherlands

    Bachelor of Science
    Vrije Universiteit Amsterdam
    Artificial Intelligence (Major) & Mathematics (Minor)
    • Honours Programme (30 credits of advanced coursework, e.g. PDEs)
  • 2018.09 - 2021.09

    Rome, Italy

    Bachelor’s Degree
    Sapienza University of Rome
    Philosophy
    • Focused on formal logic and philosophy of language

Awards

  • 2022.01.01
    KHMW Young Talent Incentive Award
    Royal Holland Society of Sciences and Humanities
    National award presented to the student with the highest GPA in their degree program at a Dutch research university.
  • 2021.01.01
    Intelligent Systems Competition — 1st Place
    Vrije Universiteit Amsterdam
    Ranked 1st out of 270 students by developing an intelligent agent for the card game Schnapsen.

Interests

AI Safety
Interpretability
Sparse Autoencoders
Causal Models
Theoretical ML
Representations
Optimization
Information Theory