Kale-ab Tessera

PhD Researcher in Multi-Agent Systems, Reinforcement Learning, and LLM Agents, University of Edinburgh.

prof_pic.jpg

Open to research internships from autumn 2026 onwards in LLM agents, multi-agent systems, open-endedness, cooperative AI, and reinforcement learning. kaleabtessera@gmail.com · resumé

I am a third-year PhD candidate at the University of Edinburgh working on multi-agent systems, reinforcement learning, and LLM agents. My research develops algorithms, environments, evaluation harnesses, and diagnostics for understanding how agents reason, coordinate, adapt, and fail in open-ended, long-horizon settings.

My latest work, Benchmarking Open-Ended Multi-Agent Coordination in Language Agents, studies how modern LLM agents coordinate in procedurally generated, open-ended long-horizon environments, comparing them against trained MARL agents.

I’m advised by Amos Storkey, Tim Rocktäschel (UCL), and Aris Filos-Ratsikas, and affiliated with MARBLE, where I co-organise the 🤖 RL & Agents Reading Group. Before my PhD, I spent 2.5 years as a Research Engineer on the MARL research team at InstaDeep, alongside earlier experience in ML and software engineering.

Research Interests:

  • LLM agents: evaluating coordination, reasoning, and robustness in open-ended multi-agent settings.
  • RL for agents: training and evaluating adaptive policies for long-horizon interaction, tool use, and open-ended environments.
  • Multi-agent learning and cooperation: understanding when tasks require genuine decentralised reasoning and when coordination breaks down.

For more information, see my resumé and Google Scholar.

news

May, 2026 🌟 Presented Probing Dec-POMDP Reasoning in Cooperative MARL at AAMAS 2026 in Paphos, Cyprus.
Dec, 2025 🌟 Presented HyperMARL: Adaptive Hypernetworks for Multi-Agent RL at NeurIPS 2025 in San Diego, US.
Sep, 2025 🗣️ Talk on “Algorithms and Benchmarks for Robust Multi-Agent Coordination” at the RAIL Lab, University of the Witwatersrand.
Aug, 2025 🏅 Remembering the Markov Property in Cooperative MARL won best poster (1st place) out of 278 submissions at the Deep Learning Indaba in Kigali, Rwanda.
Aug, 2025 📅 Co-Programme Chair for the Deep Learning Indaba and Head of Practicals and Tutorials in Kigali, Rwanda.
Aug, 2025 Our reading group is back – 🤖 RL & Agents Reading Group.
Aug, 2025 🌟 Presented Remembering the Markov Property in Cooperative MARL and HyperMARL: Adaptive Hypernetworks for Multi-Agent RL at RLC workshops in Edmonton, Canada.
Mar, 2025 🌟 Attended UK Multi-Agent Systems Symposium 2025 at King’s College London.
Aug, 2024 🗣️ Taught “Introduction to ML” at DLI.
Jul, 2024 🏅 Awarded a scholarship to attend the CIFAR Deep Learning and Reinforcement Learning (DLRL) Summer School in Toronto, Canada.
Jan, 2024 🗣️ Begin co-hosting the UOE RL reading group, YouTube.
Sep, 2023 🎓 Started my PhD at the University of Edinburgh (UOE), through the Informatics Global PhD Scholarship.
Aug, 2023 🛠️ PC member and Practicals Chair of DLI - notebooks 2023, RL Prac.
May, 2023 🗣️ Talk on “Introduction to Deep Reinforcement Learning” at the University of Pretoria and Indaba X Ghana.
Apr, 2023 🌟 Attended ICLR in Kigali, Rwanda.
Aug, 2022 🛠️ Co-Organiser of the ML Efficiency Workshop at the DLI.
Aug, 2022 🛠️ Programme committee member and Practicals Chair of Deep Learning Indaba (DLI)notebooks 2022, ML Prac, RL Prac.
Jun, 2022 🗣️ Taught an “Introduction to Machine Learning” course at Africa to Silicon Valley.
Mar, 2021 🤖 Joined the Multi-Agent RL research team at InstaDeep.
Dec, 2019 🌟 Attended NeurIPS in Vancouver, Canada.
Aug, 2019 🏆 Won Best Poster (1 out of 194) at the Deep Learning Indaba, sponsored by Microsoft.

selected publications

A selection of recent work. View full publication list →

  1. Kale-ab Abebe Tessera, Andras Szecsenyi, Cameron Barker, and 7 more authors
    arXiv preprint arXiv:2606.08340, under review, Jun 2026
  2. Kale-ab Abebe Tessera, Leonard Hinckeldey, Riccardo Zamboni, and 2 more authors
    In The 25th International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS), Oral, Jun 2026
  3. Kale-ab Abebe Tessera, Arrasy Rahman, Amos Storkey, and 1 more author
    In The Thirty-ninth Annual Conference on Neural Information Processing Systems (NeurIPS), Jun 2025
  4. NeurIPS Workshop
    Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A
    Andries Smit, Paul Duckworth, Nathan Grinsztajn, and 3 more authors
    In Deep Generative Models for Health Workshop NeurIPS 2023, Nov 2023
  5. Arnu Pretorius *, Kale-ab Abebe Tessera *, Andries P Smit *, and 8 more authors
    arXiv preprint arXiv:2107.01460v1, Jul 2021