Research

At InstaDeep, ideas become reality. Tackling some of humanity's toughest challenges, our cutting-edge AI transforms future possibilities into today's breakthroughs.

Join our Research Team

Digital Biology

Our Digital Biology team leverages advanced machine learning and simulation techniques to revolutionise drug discovery at its core. Collaborating with domain experts to transform intricate biological data into actionable insights, the team drives breakthroughs in genomics, proteomics, quantum chemistry, and beyond.

Decision-Making

Our Decision-Making team pioneers reinforcement learning methods. By building AI systems that drive real-world impact—from chip design to resource management and scientific discovery—the team ensures the next generation of AI agents excels at dynamic decision-making.

ML Systems

Our Machine Learning (ML) Systems team transforms AI ambitions into practical, adaptable advancements. Driving breakthroughs from foundation models in biology to cutting-edge scientific computing, the team tackles novel system challenges in AI infrastructure, enabling efficient large-scale ML algorithms.

Machine Learning

Our Fundamental Machine Learning (ML) team pushes the boundaries of theory to unlock new pathways for transformative real-world applications. By exploring the theoretical foundations of modern AI, the team designs robust models and algorithms that power applied innovation.

Our research in the news

Enhancing Peptide Sequencing with AI

Exploring the Proteome with ProtBFN

InstaDeep showcases 8 papers at NeurIPS 2024

Decoding our Genome with Nucleotide Transformers

InstaDeep at ICML 2024

InstaDeep presents six papers at ICLR 2024

Building the next generation of AI models to decipher human biology

Tunis InstaDeep researchers at NeurIPS 2023’s NAML: North African Machine Learning workshop

Meet Shikha Surana, NeurIPS 2023 Women in Machine Learning (WIML) Workshop presenter

InstaDeep presents record 13 papers at NeurIPS 2023

Scalable Reinforcement Learning on Cloud TPU

InstaDeep open-sources the Nucleotide Transformers, its collection of genomics Language Models, to HuggingFace

InstaDeep Research team continues success at ICLR with record four publications, hosts exclusive preview screening of new AI documentary

New research from InstaDeep, NVIDIA and the Technical University of Munich beats expectations, provides new insights into genomics research

InstaDeep and Google Cloud are developing the next generation of Genomics Language Models for sustainable agriculture

InstaDeep and Imperial College present three joint papers on Quality-Diversity at GECCO

InstaDeep, Imperial College London and Sorbonne joint research accepted for ICLR 2022 workshop

InstaDeep launches dedicated Quantum Machine Learning team, as first QML research paper is published in Nature Machine Intelligence

InstaDeep and Oxford University Research Collaboration Accepted to ICLR

BioNTech and InstaDeep Developed and Successfully Tested Early Warning System to Detect Potential High-Risk SARS-CoV-2 Variants

An early detection system for desert locust outbreaks in Africa, in collaboration with Google AI

Our Publications

Metalic: Meta-Learning In-Context with Protein Language Models

Jacob Beck | Shikha Surana | Manus McAuliffe | Oliver Bent | Thomas D. Barrett | Juan Jose Garau Luis | Paul Duckworth 04 Apr 2025

Simple Guidance Mechanisms for Discrete Diffusion Models

Hugo Dalla-Torre | Sam Boshar | Bernardo P. de Almeida | Thomas Pierrot | Yair Schiff | Subham Sekhar Sahoo | Hao Phung | Guanghan Wang | Alexander Rush | Volodymyr Kuleshov 03 Apr 2025

De novo peptide sequencing with InstaNovo: Accurate, database-free peptide identification for large scale proteomics experiments

Kevin Eloff | Konstantinos Kalogeropoulos | Oliver Morell | Amandla Mabona | Jakob Berg Jespersen | Wesley Williams | Sam P. B. van Beljouw | Marcin Skwark | Andreas Hougaard Laustsen | Stan J. J. Brouns | Erwin M. Schoof | Jeroen Van Goey | Ulrich auf dem Keller | Karim Beguir | Nicolas Lopez Carranza | Timothy P. Jenkins 31 Mar 2025

Bayesian Optimisation for Protein Sequence Design: Gaussian Processes with Zero-Shot Protein Language Model Prior Mean

Carolin Benjamins | Shikha Surana | Oliver Bent | Marius Lindauer | Paul Duckworth 19 Dec 2024

BulkRNABert: Cancer prognosis from bulk RNA-seq based language models

Maxence Gélard | Guillaume Richard | Thomas Pierrot | Paul-Henry Cournède 15 Dec 2024

BoostMD – Accelerating MD with MLIP

Lars L. Schaaf | Ilyes Batatia | Christoph Brunken | Thomas D. Barrett | Jules Tilly 15 Dec 2024

Learning the Language of Protein Structures

Benoit Gaujac | Jérémie Donà | Liviu Copoiu | Timothy Atkinson | Thomas Pierrot | Thomas D. Barrett 15 Dec 2024

Bayesian Optimisation for Protein Sequence Design: Back to Basics with Gaussian Process Surrogates

Carolin Benjamins | Shikha Surana | Oliver Bent | Marius Lindauer | Paul Duckworth 14 Dec 2024

Multi-modal Transfer Learning between Biological Foundation Models

Juan Jose Garau-Luis | Patrick Bordes | Liam Gonzalez | Masa Roller | Bernardo P. de Almeida | Lorenz Hexemer | Christopher Blum | Stefan Laurent | Jan Grzegorzewski | Maren Lang | Thomas Pierrot | Guillaume Richard 12 Dec 2024

Dispelling the Mirage of Progress in Offline MARL

Claude Formanek | Callum Rhys Tilbury | Louise Beyers | Jonathan Shock | Arnu Pretorius 12 Dec 2024

SPO: Sequential Policy Optimisation

Matthew V Macfarlane | Edan Toledo | Donal Byrne | Paul Duckworth | Alexandre Laterre 11 Dec 2024

Nucleotide Transformer: building and evaluating robust foundation models for human genomics

Hugo Dalla-Torre | Liam Gonzalez | Javier Mendoza-Revilla | Nicolas Lopez Carranza | Adam Henryk Grzywaczewski | Francesco Oteri | Christian Dallago | Evan Trop | Bernardo P. de Almeida | Hassan Sirelkhatim | Guillaume Richard | Marcin Skwark | Karim Beguir | Marie Lopez | Thomas Pierrot 28 Nov 2024

Protein Sequence Modelling with Bayesian Flow Networks

Timothy Atkinson | Thomas D. Barrett | Scott Cameron | Bora Guloglu | Matthew Greenig | Louis Robinson | Alex Graves | Liviu Copoiu | Alexandre Laterre 27 Sep 2024

SMX: Sequential Monte Carlo Planning for Expert Iteration

Edan Toledo | Matthew Macfarlane | Donal John Byrne | Siddarth Singh | Paul Duckworth | Alexandre Laterre 17 Jul 2024

Multi-Objective Quality-Diversity for Crystal Structure Prediction

Hannah Janmohamed | Marta Wolinska | Shikha Surana | Aaron Walsh | Thomas Pierrot | Antoine Cully 17 Jul 2024

Overconfident Oracles: Limitations of In Silico Sequence Design Benchmarking

Shikha Surana | Nathan Grinsztajn | Timothy Atkinson | Paul Duckworth | Thomas D. Barrett 17 Jul 2024

Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets

Ulrich A. Mbou Sob | Qiulin Li | Dries Smit | Arnu Pretorius | Oliver Bent | Miguel Arbesú 17 Jul 2024

Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs

Andries Petrus Smit | Nathan Grinsztajn | Paul Duckworth | Thomas D Barrett | Arnu Pretorius 17 Jul 2024

Quality-Diversity for One-Shot Biological Sequence Design

Jérémie Dona | Arthur Flajolet | Andrei Marginean | Antoine Cully | Thomas Pierrot 17 Jul 2024

Likelihood-based fine-tuning of protein language models for few-shot fitness prediction and design

Alex Hawkins-Hooker | Jakub Kmec | Oliver Bent | Paul Duckworth 17 Jul 2024

A large language foundational model for edible plant genomes

Javier Mendoza-Revilla | Evan Trop | Liam Gonzalez | Maša Roller | Hugo Dalla-Torre | Bernardo de Almeida | Nicolas Lopez Carranza | Guillaume Richard | Marcin Skwark | Karim Beguir | Thomas Pierrot | Marie Lopez 17 Jul 2024

Coordination Failure in Cooperative Offline MARL

Callum Rhys Tilbury | Claude Formanek | Louise Beyers | Jonathan Shock | Arnu Pretorius 17 Jul 2024

Machine Learning of Force Fields for Molecular Dynamics Simulations of Proteins at DFT Accuracy

Mustafa Omar | Sebastien Boyer | Christoph Brunken | Bakary Diallo | Nicolas Lopez Carranza | Oliver Bent 03 May 2024

Model-Based Reinforcement Learning for Protein Backbone Design

Frédéric Renard | Cyprien Courtot | Oliver Bent 03 May 2024

Protein binding affinity prediction under multiple substitutions based on eGNNs with residue and atomic graphs and language model information: eGRAL

Arturo Fiorellini-Bernardis | Sebastien Boyer | Christoph Brunken | Bakary Diallo | Karim Beguir | Nicolas Lopez Carranza | Oliver Bent 03 May 2024

Exploring Genomic Language Models on Protein Downstream Tasks

Sam Boshar | Evan Trop | Bernardo P. de Almeida | Thomas Pierrot 02 May 2024

Advancing DNA Language Models: The Genomics Long-Range Benchmark

Chia-Hsiang Kao | Evan Trop | McKinley Polen | Yair Schiff | Bernardo P. de Almeida | Aaron Gokaslan | Thomas Pierrot | Volodymyr Kuleshov 02 May 2024

ChatNT: A Multimodal Conversational Agent for DNA, RNA and Protein Tasks

Guillaume Richard* | Bernardo P. de Almeida* | Hugo Dalla-Torre | Christopher Blum | Lorenz Hexemer | Priyanka Pandey | Stefan Laurent | Marie Lopez | Alexandre Laterre | Maren Lang | Ugur Sahin | Karim Beguir | Thomas Pierrot 30 Apr 2024

SegmentNT: annotating the genome at single-nucleotide resolution with DNA foundation models

Bernardo P. de Almeida | Hugo Dalla-Torre | Guillaume Richard | Christopher Blum | Lorenz Hexemer | Maxence Gelard | Priyanka Pandey | Stefan Laurent | Alexandre Laterre | Maren Lang | Ugur Sahin | Karim Beguir | Thomas Pierrot 14 Mar 2024

Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Clément Bonnet | Daniel Luo | Donal Byrne | Shikha Surana | Vincent Coyette | Paul Duckworth | Laurence I. Midgley | Tristan Kalloniatis | Sasha Abramowitz | Cemlyn N. Waters | Andries P. Smit | Nathan Grinsztajn | Ulrich A. Mbou Sob | Omayma Mahjoub | Elshadai Tegegn | Mohamed A. Mimouni | Raphael Boige | Ruan de Kock | Daniel Furelos-Blanco | Victor Le | Arnu Pretorius | Alexandre Laterre 14 Mar 2024

How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning

Omayma Mahjoub | Ruan de Kock | Siddarth Singh | Wiem Khlifi | Abidine Vall | Kale-ab Tessera | Arnu Pretorius 13 Mar 2024

On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL

Omayma Mahjoub | Ruan de Kock | Siddarth Singh | Wiem Khlifi | Abidine Vall | Rihab Gorsane | Arnu Pretorius 13 Mar 2024

Efficiently Quantifying Individual Agent Importance in Cooperative MARL

Omayma Mahjoub | Ruan de Kock | Siddarth Singh | Wiem Khlifi | Abidine Vall | Rihab Gorsane | Arnu Pretorius 13 Mar 2024

TunBERT: Pretrained Contextualized Text Representation for Tunisian Dialect

Hatem Haddad | Abir Messaoudi | Chayma Fourati | Moez Ben HajHmida | Ahmed Cheikh Rouhou | Abir Korched | Amel Sellami | Faten Ghriss 12 Mar 2024

Graph Neural Networks for End-to-End Information Extraction from Handwritten Documents

Yessine Khanfir | Marwa Dhiaf | Emna Ghodhbani | Ahmed Cheikh Rouhou | Yousri Kessentini 12 Mar 2024

BioCLIP: Contrasting Sequence with Structure: Pre-training Graph Representations with Protein Language Models

Louis Robinson | Timothy Atkinson | Liviu Copoiu | Patrick Bordes | Thomas Pierrot | Thomas D. Barrett 15 Dec 2023

Generalisable Agents for Neural Network Optimisation

Kale-ab Tessera | Callum Rhys Tilbury | Sasha Abramowitz | Ruan de Kock | Omayma Mahjoub | Benjamin Rosman | Sara Hooker | Arnu Pretorius 15 Dec 2023

Offline RL for generative design of protein binders

Denis Tarasov | Ulrich A. Mbou Sob | Miguel Arbesú | Nima Siboni | Sebastien Boyer | Andries Smit | Oliver Bent | Arnu Pretorius 15 Dec 2023

FrameDiPT: SE(3) Diffusion Model for Protein Structure Inpainting

Cheng Zhang | Adam Leach | Tom Makkink | Miguel Arbesu | Ibtissem Kadri | Daniel Luo | Liron Mizrahi | Sabrine Krichen | Maren Lang | Andrey Tovchigrechko | Nicolas Lopez Carranza | Ugur Sahin | Karim Beguir | Michael Rooney | Yunguan Fu 15 Dec 2023

Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A

Dries Smit | Paul Duckworth | Nathan Grinsztajn | Kale-ab Tessera | Tom Barrett | Arnu Pretorius 15 Dec 2023

LightMHC: A Light Model for pMHC Structure Prediction with Graph Neural Networks

Antoine Delaunay | Yunguan Fu | Nikolai Gorbushin | Robert McHardy | Bachir Djermani | Liviu Copoiu | Michael Rooney | Maren Lang | Andrey Tovchigrechko | Ugur Sahin | Karim Beguir | Nicolas Lopez Carranza 15 Dec 2023

PASTA: Pretrained Action-State Transformer Agents

Raphael Boige | Yannis Flet-Berliac | Arthur Flajolet | Guillaume Richard | Thomas Pierrot 15 Dec 2023

Combinatorial Optimization with Policy Adaptation using Latent Space Search

Felix Chalumeau | Shikha Surana | Clement Bonnet | Nathan Grinsztajn | Arnu Pretorius | Alexandre Laterre | Thomas D. Barrett 11 Dec 2023

Nonparametric Boundary Geometry in Physics Informed Deep Learning

Scott Cameron | Arnu Pretorius | Stephen Roberts 11 Dec 2023

Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization

Nathan Grinsztajn | Daniel Furelos-Blanco | Shikha Surana | Clément Bonnet | Thomas D. Barrett 11 Dec 2023

Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs

Riccardo Poiani | Ciprian Stirbu | Alberto Maria Metelli | Marcello Restelli 01 Dec 2023

Progressive loss of conserved spike protein neutralizing antibody sites in Omicron sublineages is balanced by preserved T cell immunity

Alexander Muik | Bonny Gaby Lui | Jasmin Quandt | Huitian Diao | Yunguan Fu | Maren Bacher | Jessica Gordon | Aras Toker | Jessica Grosser | Orkun Ozhelvaci | Katharina Grikscheit | Sebastian Hoehl | Niko Kohmer | Yaniv Lustig | Gili Regev-Yochay | Sandra Ciesek | Karim Beguir | Asaf Poran | Isabel Vogler | Ozlem Tureci | Ugur Sahin 29 Aug 2023

QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration

Felix Chalumeau | Bryan Lim | Raphael Boige | Maxime Allard | Luca Grillotti | Manon Flageat | Valentin Mace | Guillaume Richard | Arthur Flajolet | Thomas Pierrot | Antoine Cully 07 Aug 2023

The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

Valentin Macé | Raphaël Boige | Felix Chalumeau | Thomas Pierrot | Guillaume Richard | Nicolas Perrin-Gilbert 17 Jul 2023

MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy

Maxence Faldor | Felix Chalumeau | Manon Flageat | Antoine Cully 17 Jul 2023

Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces

Raphael Boige | Guillaume Richard | Jérémie Dona | Thomas Pierrot | Antoine Cully 15 Jul 2023

The challenge of redundancy on multi-agent value factorisation

Siddarth Singh | Benjamin Rosman 02 Jun 2023

Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning

Claude Formanek | Callum Rhys Tilbury | Jonathan Shock | Kale-ab Tessera | Arnu Pretorius 05 May 2023

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

Felix Chalumeau | Raphael Boige | Bryan Lim | Valentin Macé | Maxime Allard | Arthur Flajolet | Antoine Cully | Thomas Pierrot 01 May 2023

Evolving Populations of RL Algorithms with MAP-Elites

Thomas Pierrot | Arthur Flajolet 01 May 2023

Empirical Analysis of PGA-MAP-Elites for Neuroevolution in Uncertain Domains

Manon Flageat | Felix Chalumeau | Antoine Cully 29 Mar 2023

Scaling multi-agent reinforcement learning to full 11 vs 11 simulated robotic football

Andries Smit | Herman A. Engelbrecht | Willie Brink | Arnu Pretorius 24 Feb 2023

Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories

Christopher W.F. Parsonson | Alexandre Laterre | Thomas D Barrett 13 Feb 2023

Exact Combinatorial Optimisation with Deep Reinforcement Learning

Christopher W. F. Parsonson | Alexandre Laterre | Thomas D. Barrett 06 Feb 2023

Off-the-Grid MARL: Datasets and Baselines for Cooperative Offline Multi-Agent Reinforcement Learning

Claude Formanek | Asad Jeewa | Jonathan Shock | Arnu Pretorius 01 Feb 2023

Early Computational Detection of Potential High Risk SARS-CoV-2 Variants

Karim Beguir | Marcin J Skwark | Yunguan Fu | Thomas Pierrot | Santiago Nicolas Lopez Carranza | Alexandre Laterre | Ibtissem Kadri | Bonny Gaby Lui | Bianca Sanger | Yunpeng Liu | Asaf Poran | Alexander Muik | Ugur Sahin 23 Jan 2023

Peptide-MHC Structure Prediction With Mixed Residue and Atom Graph Neural Network

Antoine P. Delaunay | Yunguan Fu | Alberto Bégué | Robert McHardy | Bachir A. Djermani | Michael Rooney | Andrey Tovchigrechko | Liviu Copoiu | Marcin J. Skwark | Nicolas Lopez Carranza | Maren Lang | Karim Beguir | Uğur Şahin 04 Dec 2022

Flow Annealed Importance Sampling Bootstrap

Laurence Illing Midgley | Vincent Stimper | Gregor Simm | Bernhard Scholkopf | José Miguel Hernández-Lobato 02 Dec 2022

So ManyFolds, So Little Time: Efficient Protein Structure Prediction with pLMs and MSAs

Thomas D. Barrett | Amelia Villegas-Morcillo | Louis Robinson | Benoit Gaujac | David Adméte | Elia Saquand | Karim Beguir | Arthur Flajolet 01 Dec 2022

Debiasing Meta-Gradients for Self-Tuning Reinforcement Learning

Clément Bonnet | Laurence Illing Midgley | Alexandre Laterre 01 Dec 2022

Universally Expressive Communication in Multi-Agent Reinforcement Learning

Matthew Morris | Thomas D. Barrett | Arnu Pretorius 01 Dec 2022

Towards a Standardised Performance Evaluation Protocol for Cooperative MARL

Rihab Gorsane | Omayma Mahjoub | Ruan de Kock | Roland Dubb | Siddarth Singh | Arnu Pretorius 01 Oct 2022

A Sequence Modelling Approach to Question Answering in Text-Based Games

Greg Furman | Edan Toledo | Jonathan Shock | Jan Buys 14 Jul 2022

Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems

Félix Chalumeau | Thomas Pierrot | Arthur Flajolet | Karim Beguir | Antoine Cully | Nicolas Perrin-Gilbert 09 Jul 2022

Multi-Objective Quality Diversity Optimization

Thomas Pierrot | Guillaume Richard | Karim Beguir | Antoine Cully 09 Jul 2022

Fast Population-Based Reinforcement Learning on a Single Machine

Arthur Flajolet | Claire Bizon Monroc | Karim Beguir | Thomas Pierrot 17 Jun 2022

The structural basis of Cdc7-Dbf4 kinase dependent targeting and phosphorylation of the MCM2-7 double hexamer

Almutasem Saleh | Yasunori Noguchi | Ricardo Aramayo | Marina E. Ivanova | Kathryn M. Stevens | Alex Montoya | S. Sunidhi | Nicolas Lopez Carranza | Marcin J. Skwark | Christian Speck 25 May 2022

Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization

T. Pierrot | V. Macé | F. Chalumeau | A. Flajolet | G. Cideron | K. Beguir | A. Cully | O. Sigaud | N. Perrin-Gilbert 28 Apr 2022

Autoregressive neural-network wavefunctions for ab initio quantum chemistry

Dr Thomas Barrett | Prof A. I. Lvovsky | Aleksei Malyshev 06 Apr 2022

Robust and Scalable SDE Learning: a Functional Perspective

Scott Cameron | Tyron Cameron | Arnu Pretorius | Stephen Roberts 24 Jan 2022

One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning

C. Bonnet | P. Caron | T. Barrett | I. Davies | A. Laterre 06 Dec 2021

On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

I.S. Yusuf | K. Tessera | T. Tumiel | Z. Slim | A. Kerkeni | S. Nevo | A. Pretorius 06 Dec 2021

Causal Multi-Agent Reinforcement Learning: Review and Open Problems

S.J. Grimbly | J. Shock | A. Pretorius 06 Dec 2021

Mava: A new Framework for Distributed Multi-Agent Reinforcement Learning

A. Pretorius | K. Tessera | A.P. Smit | C. Formanek | S.J. Grimbly | K. Eloff | S. Danisa | L. Francis | J. Shock | H. Kamper | W. Brink | H. Engelbrecht | A. Laterre | K. Beguir 06 Jul 2021

Scaling Properties of Deep Residual Networks

A-S. Cohen | R. Cont | A. Rossier | R. Xu 27 May 2021

Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning

M. J. Skwark | N. L. Carranza | T. Pierrot | J. Phillips | S. Said | A. Laterre | A. Kerkeni | U. Sahin | K. Beguir 09 Dec 2020

Offline Reinforcement Learning Hands-On

Louis Monier | Jakub Kmec | Alexandre Laterre | Thomas Pierrot | Valentin Courgeau | Olivier Sigaud | Karim Beguir 01 Dec 2020

A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning

A. Pretorius | S. Cameron | E. Van Biljon | T. Makkink | S. Mawjee | J. Du Plessis | J. Shock | A. Laterre | K. Beguir 29 Sep 2020

AlphaNPI-X: Learning Compositional Neural Programs for Continuous Control

T. Pierrot | N. Perrin | F. Behbahani | A. Laterre | O. Sigaud | K. Beguir | N. De Freitas 27 Jul 2020

Masakhane — Machine Translation For Africa

I. Orife | J. Kreutzer | B. Sibanda | D. Whitenack | K. Siminyu | L. Martinus | J. T. Ali | J. Abbott | V. Marivate | S. Kabongo | M. Meressa | E. Murhabazi | O. Ahia | E. van Biljon | A. Ramkilowan | A. Akinfaderin | A. Öktem | W. Akin | G. Kioko | K. Degila | H. Kamper | B. Dossou | C. Emezue | K. Ogueji | A. Bashir 27 Mar 2020

On Optimal Transformer Depth for Low-Resource Language Translation

Elan van Biljon | Arnu Pretorius | Julia Kreutzer 01 Feb 2020

Towards Compositionality in Deep Reinforcement Learning

T. Pierrot | G. Ligner | S. Reed | O. Sigaud | N. Perrin | A. Laterre | D. Kas | K. Beguir | N. de Freitas 18 Jun 2019

Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization

A. Laterre | Y. Fu | M. K. Jabri | A-S. Cohen | D. Kas | K. Hajjar | T. S. Dahl | A. Kerkeni | K. Beguir 01 Dec 2018

Explicit Sequence Proximity Models for Hidden State Identification

Anil Kota | Sharath Chandra | Parag Khanna | Torbjørn S. Dahl 01 Dec 2018

Stay up to date!

Keep up with our latest research through our published works, and explore our open-source contributions now available on and
