Biography

Kale-ab is a PhD student working on Multi-Agent Reinforcement Learning (MARL) at the Autonomous Agents research group at the University of Edinburgh, under the supervision of Stefano Albrecht. He has 7.5 years of experience (three years in software development and four and a half years in machine learning). Prior to his PhD, he was a research engineer at InstaDeep, working on Multi-Agent Reinforcement Learning (MARL) research.

He is also passionate about using technology to help the African continent. To this end, he has worked on projects that aim to have a high impact in Africa and has also worked on increasing diversity and representation in machine learning.

For more information, you can view my resumé .

Interests
  • Multi-Agent Reinforcement Learning (MARL)
  • Reinforcement Learning
  • Deep Learning
Education
  • MSc in Computer Science, Focusing on Deep Learning (Distinction), 2018 - March, 2021

    University of Witwatersrand

  • Honours in Computer Science (Distinction), 2016

    University of Pretoria

  • BSc. in Computer Science, 2013 - 2015

    University of Pretoria

Recent Publications

(2024). Efficiently Quantifying Individual Agent Importance in Cooperative MARL. (Oral) eXplainable AI approaches for deep reinforcement learning (XAI4DRL) Workshop @ AAAI, 2024.

PDF

(2024). How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning. eXplainable AI approaches for deep reinforcement learning (XAI4DRL) Workshop @ AAAI, 2024.

PDF

(2023). Generalisable Agents for Neural Network Optimisation. WANT@ NeurIPS 2023 and OPT NeurIPS 2023 workshops.

PDF Poster

(2023). Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A. Deep Generative Models for Health Workshop NeurIPS 2023.

PDF

(2023). Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning. (Oral) Workshop on Reincarnating Reinforcement Learning at ICLR 2023.

PDF

(2022). Just-in-Time Sparsity: Learning Dynamic Sparsity Schedules. Dynamic Neural Networks ICML Workshop 2022.

PDF Poster Slides

(2021). On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa. AI + HADR 2021 and ML4D 2021 NeurIPS Workshops.

PDF Cite Code Blog

(2021). Mava: a research framework for distributed multi-agent reinforcement learning.

PDF Cite Code Blog

(2021). Keep the Gradients Flowing: Using Gradient Flow to Study Sparse Network Optimization. Sparsity in Neural Networks Workshop 2021.

PDF Cite Poster Slides

Open Source

You can see a full list of my open source projects on github - https://github.com/KaleabTessera .

Deep Learning Indaba Practicals 2022
A collection of high-quality practicals covering a variety of modern machine learning techniques.
Deep Learning Indaba Practicals 2022
Pseudo Absence Generation and Locust Prediction
Locust breeding ground prediction using pseudo-absence generation and machine learning.
Pseudo Absence Generation and Locust Prediction
Baobab
Baobab is an open source multi-tenant web application designed to facilitate the application and selection process for large scale meetings within the machine learning and artificial intelligence communities globally.
Baobab
DQN Agent playing Pong
A DQN agent playing pong.
DQN Agent playing Pong
Robotics Navigation
A project that compared the navigation ability of two robots (a Turtlebot and a Kuri robot) in challenging dynamic and static environments.
Robotics Navigation
PRM Path Planning
Implementation of Probabilistic Roadmaps – a path planning algorithm used in Robotics.
PRM Path Planning