Kale-ab Tessera
PhD Researcher in Multi-Agent Systems, Reinforcement Learning, and LLM Agents, University of Edinburgh.
Open to research internships from autumn 2026 onwards in LLM agents, multi-agent systems, open-endedness, cooperative AI, and reinforcement learning. kaleabtessera@gmail.com · resumé
I am a third-year PhD candidate at the University of Edinburgh working on multi-agent systems, reinforcement learning, and LLM agents. My research develops algorithms, environments, evaluation harnesses, and diagnostics for understanding how agents reason, coordinate, adapt, and fail in open-ended, long-horizon settings.
My latest work, Benchmarking Open-Ended Multi-Agent Coordination in Language Agents, studies how modern LLM agents coordinate in procedurally generated, open-ended long-horizon environments, comparing them against trained MARL agents.
I’m advised by Amos Storkey, Tim Rocktäschel (UCL), and Aris Filos-Ratsikas, and affiliated with MARBLE, where I co-organise the 🤖 RL & Agents Reading Group. Before my PhD, I spent 2.5 years as a Research Engineer on the MARL research team at InstaDeep, alongside earlier experience in ML and software engineering.
Research Interests:
- LLM agents: evaluating coordination, reasoning, and robustness in open-ended multi-agent settings.
- RL for agents: training and evaluating adaptive policies for long-horizon interaction, tool use, and open-ended environments.
- Multi-agent learning and cooperation: understanding when tasks require genuine decentralised reasoning and when coordination breaks down.
For more information, see my resumé and Google Scholar.
news
| Date | News |
|---|---|
| May, 2026 | 🌟 Presented Probing Dec-POMDP Reasoning in Cooperative MARL at AAMAS 2026 in Paphos, Cyprus. |
| Dec, 2025 | 🌟 Presented HyperMARL: Adaptive Hypernetworks for Multi-Agent RL at NeurIPS 2025 in San Diego, US. |
| Sep, 2025 | 🗣️ Talk on “Algorithms and Benchmarks for Robust Multi-Agent Coordination” at the RAIL Lab, University of the Witwatersrand. |
| Aug, 2025 | 🏅 Remembering the Markov Property in Cooperative MARL won best poster (1st place) out of 278 submissions at the Deep Learning Indaba in Kigali, Rwanda. |
| Aug, 2025 | 📅 Served as Co-Programme Chair and Head of Practicals and Tutorials for the Deep Learning Indaba in Kigali, Rwanda. |
| Aug, 2025 | Our reading group is back – 🤖 RL & Agents Reading Group. |
| Aug, 2025 | 🌟 Presented Remembering the Markov Property in Cooperative MARL and HyperMARL: Adaptive Hypernetworks for Multi-Agent RL at RLC workshops in Edmonton, Canada. |
| Mar, 2025 | 🌟 Attended the UK Multi-Agent Systems Symposium 2025 at King’s College London. |
| Aug, 2024 | 🗣️ Taught “Introduction to ML” at the Deep Learning Indaba (DLI). |
| Jul, 2024 | 🏅 Awarded a scholarship to attend the CIFAR Deep Learning and Reinforcement Learning (DLRL) Summer School in Toronto, Canada. |
| Jan, 2024 | 🗣️ Began co-hosting the UOE RL reading group (YouTube). |
| Sep, 2023 | 🎓 Started my PhD at the University of Edinburgh (UOE), through the Informatics Global PhD Scholarship. |
| Aug, 2023 | 🛠️ Programme committee member and Practicals Chair of the DLI - notebooks 2023, RL Prac. |
| May, 2023 | 🗣️ Talk on “Introduction to Deep Reinforcement Learning” at the University of Pretoria and Indaba X Ghana. |
| Apr, 2023 | 🌟 Attended ICLR in Kigali, Rwanda. |
| Aug, 2022 | 🛠️ Co-Organiser of the ML Efficiency Workshop at the DLI. |
| Aug, 2022 | 🛠️ Programme committee member and Practicals Chair of the Deep Learning Indaba (DLI) – notebooks 2022, ML Prac, RL Prac. |
| Jun, 2022 | 🗣️ Taught an “Introduction to Machine Learning” course at Africa to Silicon Valley. |
| Mar, 2021 | 🤖 Joined the Multi-Agent RL research team at InstaDeep. |
| Dec, 2019 | 🌟 Attended NeurIPS in Vancouver, Canada. |
| Aug, 2019 | 🏆 Won Best Poster (1st out of 194) at the Deep Learning Indaba, sponsored by Microsoft. |
selected publications
A selection of recent work.
- NeurIPS Workshop
In Deep Generative Models for Health Workshop NeurIPS 2023, Nov 2023