Research

At InstaDeep, ideas become reality. Tackling some of humanity's toughest challenges, our cutting-edge AI transforms future possibilities into today's breakthroughs

Join our Research Team

Digital Biology

Our Digital Biology team leverages advanced machine learning and simulation techniques to revolutionise drug discovery at its core. Collaborating with domain experts, to transform intricate biological data into actionable insights, the team drives breakthroughs in genomics, proteomics, quantum chemistry, and beyond

Decision-Making

Our Decision-Making team pioneers reinforcement learning methods. By building AI systems that drive real-world impact—from chip design to resource management and scientific discovery—the team ensures the next generation of AI agents excels at dynamic decision-making.

ML Systems

Our Machine Learning (ML) systems team transform AI ambitions into practical, adaptable advancements. Driving breakthroughs from foundation models in biology to cutting-edge scientific computing, they tackle novel system challenges in AI infrastructure, enabling efficient large-scale ML algorithms.

Machine Learning

Our Fundamental Machine Learning (ML) team push the boundaries of theory to unlock new pathways for transformative real-world applications. Through exploring the theoretical foundations of modern AI, the team designs robust models and algorithms that power applied innovation.

Our research in the news

Talking biology with ChatNT

Talking biology with ChatNT

Read More
Flexible antibody design with AbBFN2

Flexible antibody design with AbBFN2

Read More
Enhancing Peptide Sequencing with AI

Enhancing Peptide Sequencing with AI

Read More
Exploring the Proteome with ProtBFN

Exploring the Proteome with ProtBFN

Read More
Decoding our Genome with Nucleotide Transformers

Decoding our Genome with Nucleotide Transformers

Read More
See all news

Our Publications

Leveraging State Space Models in Long Range Genomics

Matvei Popov , Aymen Kallala , Anirudha Ramesh , Narimane Hennouni , Shivesh Khaitan , Rick Gentry , Alain-Sam Cohen
ICLR LMRL (2025) 12 May 2025

Open-Source and FAIR Research Software for Proteomics

Lukas Käll , Yasset Perez-Riverol , Wout Bittremieux , William S. Noble , Lennart Martens , Aivett Bilbao , Michael R. Lazear , Bjorn Grüning , Daniel S. Katz , Michael J. MacCoss , Chengxin Dai , Jimmy K. Eng , Robbin Bouwmeester , Michael R. Shortreed , Enrique Audain , Timo Sachsenberg , Jeroen Van Goey , Georg Wallmann , Bo Wen , William E. Fondrie
- 12 May 2025

AbBFN2: A flexible antibody foundation model based on Bayesian Flow Networks

Bora Guloglu , Miguel Bragança , Alex Graves , Scott Cameron , Timothy Atkinson , Liviu Copoiu , Alexandre Laterre , Thomas D. Barrett
- 06 May 2025

Metalic: Meta-Learning In-Context with Protein Language Models

Jacob Beck , Shikha Surana , Manus McAuliffe , Oliver Bent , Thomas D. Barrett , Juan Jose Garau Luis , Paul Duckworth
ICLR 2025 04 Apr 2025

Simple Guidance Mechanisms for Discrete Diffusion Models

Hugo Dalla-Torre , Sam Boshar , Bernardo P. de Almeida , Thomas Pierrot , Yair Schiff , Subham Sekhar Sahoo , Hao Phung , Guanghan Wang , Alexander Rush , Volodymyr Kuleshov
ICLR 2025 03 Apr 2025

De novo peptide sequencing with InstaNovo: Accurate, database-free peptide identification for large scale proteomics experiments

Kevin Eloff , Konstantinos Kalogeropoulos , Oliver Morell , Amandla Mabona , Jakob Berg Jespersen , Wesley WIlliams , Sam P. B. van Beljouw , Marcin Skwark , Andreas Hougaard Laustsen , Stan J. J. Brouns , Erwin M. Schoof , Jeroen Van Goey , Ulrich auf dem Keller , Karim Beguir , Nicolas Lopez Carranza , Timothy P. Jenkins
Nature Machine Intelligence 31 Mar 2025

Bayesian Optimisation for Protein Sequence Design: Gaussian Processes with Zero-Shot Protein Language Model Prior Mean

Carolin Benjamins , Shikha Surana , Oliver Bent , Marius Lindauer , Paul Duckworth
NeurIPS 2024 workshop 19 Dec 2024

BulkRNABert: Cancer prognosis from bulk RNA-seq based language models

Maxence Gélard , Guillaume Richard , Thomas Pierrot , Paul-Henry Cournède
ML4H 2024 15 Dec 2024

BoostMD – Accelerating MD with MLIP

Lars L. Schaaf , Ilyes Batatia , Christoph Brunken , Thomas D. Barrett , Jules Tilly
NeurIPS 2024 workshop 15 Dec 2024

Learning the Language of Protein Structures

Benoit Gaujac , Jérémie Donà , Liviu Copoiu , Timothy Atkinson , Thomas Pierrot , Thomas D. Barrett
NeurIPS 2024 workshop 15 Dec 2024

Bayesian Optimisation for Protein Sequence Design: Back to Basics with Gaussian Process Surrogates

Carolin Benjamins , Shikha Surana , Oliver Bent , Marius Lindauer , Paul Duckworth
NeurIPS 2024 workshop 14 Dec 2024

Multi-modal Transfer Learning between Biological Foundation Models

Juan Jose Garau-Luis , Patrick Bordes , Liam Gonzalez , Masa Roller , Bernardo P. de Almeida , Lorenz Hexemer , Christopher Blum , Stefan Laurent , Jan Grzegorzewski , Maren Lang , Thomas Pierrot , Guillaume Richard
NeurIPS 2024 12 Dec 2024

Dispelling the Mirage of Progress in Offline MARL

Claude Formanek , Callum Rhys Tilbury , Louise Beyers , Jonathan Shock , Arnu Pretorius
NeurIPS 2024 12 Dec 2024

SPO: Sequential Policy Optimisation

Matthew V Macfarlane , Edan Toledo , Donal Byrne , Paul Duckworth , Alexandre Laterre
NeurIPS 2024 11 Dec 2024

Nucleotide Transformer: building and evaluating robust foundation models for human genomics

Hugo Dalla-Torre , Liam Gonzalez , Javier Mendoza-Revilla , Nicolas Lopez Carranza , Adam Henryk Grzywaczewski , Francesco Oteri , Christian Dallago , Evan Trop , Bernardo P. de Almeida , Hassan Sirelkhatim , Guillaume Richard , Marcin Skwark , Karim Beguir , Marie Lopez  , Thomas Pierrot
Nature Methods 2024 28 Nov 2024

Protein Sequence Modelling with Bayesian Flow Networks

Timothy Atkinson , Thomas D. Barrett , Scott Cameron , Bora Guloglu , Matthew Greenig , Louis Robinson , Alex Graves , Liviu Copoiu , Alexandre Laterre
- 27 Sep 2024

SMX: Sequential Monte Carlo Planning for Expert Iteration

Edan Toledo , Matthew Macfarlane , Donal John Byrne , Siddarth Singh , Paul Duckworth , Alexandre Laterre
ICML 2024 17 Jul 2024

Multi-Objective Quality-Diversity for Crystal Structure Prediction

Hannah Janmohamed , Marta Wolinska , Shikha Surana , Aaron Walsh , Thomas Pierrot , Antoine Cully
Gecco 2024 17 Jul 2024

Overconfident Oracles: Limitations of In Silico Sequence Design Benchmarking

Shikha Surana , Nathan Grinsztajn , Timothy Atkinson , Paul Duckworth , Thomas D. Barrett
ICML 2024 workshop 17 Jul 2024

Generative Model for Small Molecules with Latent Space RL Fine-Tuning to Protein Targets

Ulrich A. Mbou So , Qiulin Li , Dries Smit , Arnu Pretorius , Oliver Bent , Miguel Arbesú
ICML 2024 workshop 17 Jul 2024

Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs

Andries Petrus Smit , Nathan Grinsztajn , Paul Duckworth , Thomas D Barrett , Arnu Pretorius
ICML 2024 17 Jul 2024

Quality-Diversity for One-Shot Biological Sequence Design

Jérémie DONA , Arthur Flajolet , Andrei Marginean , Antoine Cully , Thomas PIERROT
ICML 2024 17 Jul 2024

Likelihood-based fine-tuning of protein language models for few-shot fitness prediction and design

Alex Hawkins-Hooker , Jakub Kmec , Oliver Bent , Paul Duckworth
ICML 2024 workshop 17 Jul 2024

A large language foundational model for edible plant genomes

Javier Mendoza-Revilla , Evan Trop , Liam Gonzalez , Maša Roller , Hugo Dalla-Torre , Bernado de Almedia , Nicolas Lopez Carranza , Guillaume Richard , Marcin Skwark , Karim Beguir , Thomas Pierrot , Marie Lopez
Nature Communications Biology 2024 17 Jul 2024

Coordination Failure in Cooperative Offline MARL

Callum Rhys Tilbury , Claude Formanek , Louise Beyers , Jonathan Shock , Arnu Pretorius
ICML 2024 ARLET Workshop 17 Jul 2024

Machine Learning of Force Fields for Molecular Dynamics Simulations of Proteins at DFT Accuracy

Mustafa Omar , Sebastien Boyer , Christoph Brunken , Bakary Diallo , Nicolas Lopez Carranza , Oliver Bent
ICLR 2024 GEM Workshop 03 May 2024

Model-Based Reinforcement Learning for Protein Backbone Design

Frédéric Renard , Cyprien Courtot , Oliver Bent
ICLR 2024 GEM Workshop 03 May 2024

Protein binding affinity prediction under multiple substitutions based on eGNNs with residue and atomic graphs and language model information: eGRAL

Arturo Fiorellini-Bernardis , Sebastien Boyer , Christoph Brunken , Bakary Diallo , Karim Beguir , Nicolas Lopez Carranza , Oliver Bent
ICLR 2024 GEM Workshop 03 May 2024

Exploring Genomic Language Models on Protein Downstream Tasks

Sam Boshar , Evan Trop , Bernardo P. de Almeida , Thomas Pierrot
LLMs4Bio AAAI 2024 | MLGenX ICLR 2024 02 May 2024

Advancing DNA Language Models: The Genomics Long-Range Benchmark

Chia-Hsiang Kao , Evan Trop , McKinley Polen , Yair Schiff , Bernardo P. de Almeida , Aaron Gokaslan , Thomas Pierrot , Volodymyr Kuleshov
LLMs4Bio AAAI 2024 | MLGenX ICLR 2024 02 May 2024

ChatNT: A Multimodal Conversational Agent for DNA, RNA and Protein Tasks

Guillaume Richard* , Bernardo P. de Almeida* , Hugo Dalla-Torre , Christopher Blum , Lorenz Hexemer , Priyanka Pandey , Stefan Laurent , Marie Lopez , Alexandre Laterre , Maren Lang , Ugur Sahin , Karim Beguir , Thomas Pierrot
- 30 Apr 2024

SegmentNT: annotating the genome at single-nucleotide resolution with DNA foundation models

Bernardo P. de Almeida , Hugo Dalla-Torre , Guillaume Richard , Christopher Blum , Lorenz Hexemer , Maxence Gelard , Priyanka Pandey , Stefan Laurent , Alexandre Laterre , Maren Lang , Ugur Sahin , Karim Beguir , Thomas Pierrot
- 14 Mar 2024

Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Clément Bonnet , Daniel Luo , Donal Byrne , Shikha Surana , Vincent Coyette , Paul Duckworth , Laurence I. Midgley , Tristan Kalloniatis , Sasha Abramowitz , Cemlyn N. Waters , Andries P. Smit , Nathan Grinsztajn , Ulrich A. Mbou Sob , Omayma Mahjoub , Elshadai Tegegn , Mohamed A. Mimouni , Raphael Boige , Ruan de Kock , Daniel Furelos-Blanco , Victor Le , Arnu Pretorius , Alexandre Laterre
ICLR 2024 14 Mar 2024

How much can change in a year? Revisiting Evaluation in Multi-Agent Reinforcement Learning

Omayma Mahjoub , Ruan de Kock , Siddarth Singh , Wiem Khlifi , Abidine Vall , Kale-ab Tessera , Arnu Pretorius
AAAI workshop 13 Mar 2024

On Diagnostics for Understanding Agent Training Behaviour in Cooperative MARL

Omayma Mahjoub , Ruan de Kock , Siddarth Singh , Wiem Khlifi , Abidine Vall , Rihab Gorsane , Arnu Pretorius
AAAI workshop 13 Mar 2024

Efficiently Quantifying Individual Agent Importance in Cooperative MARL

Omayma Mahjoub , Ruan de Kock , Siddarth Singh , Wiem Khlifi , Abidine Vall , Rihab Gorsane , Arnu Pretorius
AAAI workshop 13 Mar 2024

TunBERT: Pretrained Contextualized Text Representation for Tunisian Dialect

Hatem Haddad , Abir Messaoudi , Chayma Fourati , Moez Ben HajHmida , Ahmed Cheikh Rouhou , Abir Korched , Amel Sellami , Faten Ghriss
- 12 Mar 2024

Graph Neural Networks for End-to-End Information Extraction from Handwritten Documents

Yessine Khanfir , Marwa Dhiaf , Emna Ghodhbani , Ahmed Cheikh Rouhou , Yousri Kessentini
WACV 2024 12 Mar 2024

BioCLIP: Contrasting Sequence with Structure: Pre-training Graph Representations with Protein Language Models

Louis Robinson , Timothy Atkinson , Liviu Copoiu , Patrick Bordes , Thomas Pierrot , Thomas D. Barrett
NeurIPS workshop 15 Dec 2023

Generalisable Agents for Neural Network Optimisation

Kale-ab Tessera , Callum Rhys Tilbury , Sasha Abramowitz , Ruan de Kock , Omayma Mahjoub , Benjamin Rosman , Sara Hooker , Arnu Pretorius
NeurIPS workshop 15 Dec 2023

Offline RL for generative design of protein binders

Denis Tarasov , Ulrich A. Mbou Sob , Miguel Arbesú , Nima Siboni , Sebastien Boyer , Andries Smit , Oliver Bent , Arnu Pretorius
NeurIPS workshop 15 Dec 2023
NeurIPS workshop 15 Dec 2023

FrameDiPT: SE(3) Diffusion Model for Protein Structure Inpainting

Cheng Zhang , Adam Leach , Tom Makkink , Miguel Arbesu , Ibtissem Kadri , Daniel Luo , Liron Mizrahi , Sabrine Krichen , Maren Lang , Andrey Tovchigrechko , Nicolas Lopez Carranza , Ugur Sahin , Karim Beguir , Michael Rooney , Yunguan Fu
NeurIPS workshop 15 Dec 2023

Are we going MAD? Benchmarking Multi-Agent Debate between Language Models for Medical Q&A

Dries Smit , Paul Duckworth , Nathan Grinsztajn , Kale-ab Tessera , Tom Barrett , Arnu Pretorius
NeurIPS workshop 15 Dec 2023

LightMHC: A Light Model for pMHC Structure Prediction with Graph Neural Networks

Antoine Delaunay , Yunguan Fu , Nikolai Gorbushin , Robert McHardy , Bachir Djermani , Liviu Copoiu , Michael Rooney , Maren Lang , Andrey Tovchigrechko , Ugur Sahin , Karim Beguir , Nicolas Lopez Carranza
NeurIPS workshop 15 Dec 2023

PASTA: Pretrained Action-State Transformer Agents

Raphael Boige , Yannis Flet-Berliac , Arthur Flajolet , Guillaume Richard , Thomas Pierrot
NeurIPS workshop 15 Dec 2023

Combinatorial Optimization with Policy Adaptation using Latent Space Search

Felix Chalumeau , Shikha Surana , Clement Bonnet , Nathan Grinsztajn , Arnu Pretorius , Alexandre Laterre , Thomas D. Barrett
NeurIPS workshop 11 Dec 2023

Nonparametric Boundary Geometry in Physics Informed Deep Learning

Scott Cameron , Arnu Pretorius , Stephen Roberts
NeurIPS 11 Dec 2023

Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization

Nathan Grinsztajn , Daniel Furelos Blanco (internship) , Tom Barrett , Shikha Surana , Clément Bonnet , Thomas Barrett
NeurIPS 11 Dec 2023

Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs

Riccardo Poiani , Ciprian Stirbu , Alberto Maria Metelli , Marcello Restelli
IEEE Transactions on Intelligent Transportation Systems 01 Dec 2023

Progressive loss of conserved spike protein neutralizing antibody sites in Omicron sublineages is balanced by preserved T cell immunity

Alexander Muik , Bonny Gaby Lui , Jasmin Quandt , Huitian Diao , Yunguan Fu , Maren Bacher , Jessica Gordon , Aras Toker , Jessica Grosser , Orkun Ozhelvaci , Katharina Grikscheit , Sebastian Hoehl , Niko Kohmer , Yaniv Lustig , Gili Regev-Yochay , Sandra Ciesek , Karim Beguir , Asaf Poran , Isabel Vogler , Ozlem Tureci , Ugur Sahin
Cell Report 29 Aug 2023

QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration

Felix Chalumeau , Bryan Lim , Raphael Boige , Maxime Allard , Luca Grillotti , Manon Flageat , Valentin Mace , Guillaume Richard , Arthur Flajolet , Thomas Pierrot , Antoine Cully
JMLR - Machine Learning Open Source Software (2023) 07 Aug 2023

The Quality-Diversity Transformer: Generating Behavior-Conditioned Trajectories with Decision Transformers

Valentin Macé , Raphaël Boige , Felix Chalumeau , Thomas Pierrot , Guillaume Richard , Nicolas Perrin-Gilbert
GECCO 17 Jul 2023

MAP-Elites with Descriptor-Conditioned Gradients and Archive Distillation into a Single Policy

Maxence Faldor , Felix Chalumeau, , Manon Flageat , Antoine Cully
GECCO 17 Jul 2023

Gradient-Informed Quality Diversity for the Illumination of Discrete Spaces

Raphael Boige , Guillaume Richard , Jérémie Dona , Thomas Pierrot , Antoine Cully
GECCO 15 Jul 2023
AAMAS Journal 02 Jun 2023

Reduce, Reuse, Recycle: Selective Reincarnation in Multi-Agent Reinforcement Learning

Claude Formanek , Callum Rhys Tilbury , Jonathan Shock , Kale-ab Tessera , Arnu Pretorius
ICLR Workshop 05 May 2023

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

Felix Chalumeau , Raphael Boige , Bryan Lim , Valentin Macé , Maxime Allard , Arthur Flajolet , Antoine Cully , Thomas Pierrot
ICLR 01 May 2023
ICLR 01 May 2023
ACM Transactions on Evolutionary Learning and Optimization 29 Mar 2023

Scaling multi-agent reinforcement learning to full 11 vs 11 simulated robotic football

Andries Smit , Herman A. Engelbrecht , Willie Brink , Arnu Pretorius
AAMAS Journal 24 Feb 2023

Reinforcement Learning for Branch-and-Bound Optimisation using Retrospective Trajectories

Christopher W.F. Parsonson , Alexandre Laterre , Thomas D Barrett
AAAI 13 Feb 2023

Exact Combinatorial Optimisation with Deep Reinforcement Learning

Christopher W. F. Parsonson , Alexandre Laterre , Thomas D. Barrett
AAAI 06 Feb 2023
- 01 Feb 2023

Early Computational Detection of Potential High Risk SARS-CoV-2 Variants

Karim Beguir , Marcin J Skwark , Yunguan Fu , Thomas Pierrot , Santiago Nicolas Lopez Carranza , Alexandre Laterre , Ibtissem Kadri , Bonny Gaby Lui , Bianca Sanger , Yunpeng Liu , Asaf Poran , Alexander Muik , Ugur Sahin
Computers in Biology and Medicine 23 Jan 2023

Peptide-MHC Structure Prediction With Mixed Residue and Atom Graph Neural Network

Antoine P. Delaunay , Yunguan Fu , Alberto Bégué , Robert McHardy , Bachir A. Djermani , Michael Rooney , Andrey Tovchigrechko , Liviu Copoiu , Marcin J. Skwark , Nicolas Lopez Carranza , Maren Lang , Karim Beguir , Uğur Şahin
NeurIPS Workshop 04 Dec 2022

Flow Annealed Importance Sampling Bootstrap

Laurence Illing Midgley , Vincent Stimper , Gregor Simm , Bernhard Scholkopf, , José Miguel Hernández-Lobato
NeurIPS Workshop 02 Dec 2022

So ManyFolds, So Little Time: Efficient Protein Structure Prediction with pLMs and MSAs

Thomas D. Barrett , Amelia Villegas-Morcillo , Louis Robinson , Benoit Gaujac , David Adméte , Elia Saquand , Karim Beguir , Arthur Flajolet
NeurIPS 01 Dec 2022

Debiasing Meta-Gradients for Self-Tuning Reinforcement Learning

Clément Bonnet , Laurence Illing Midgley , Alexandre Laterre
NeurIPS Workshop 01 Dec 2022

Universally Expressive Communication in Multi-Agent Reinforcement Learning

Matthew Morris , Thomas D. Barrett , Arnu Pretorius
NeurIPS 01 Dec 2022

Towards a Standardised Performance Evaluation Protocol for Cooperative MARL

Rihab Gorsane , Omayma Mahjoub , Ruan de Kock , Roland Dubb , Siddarth Singh , Arnu Pretorius
NeurIPS 01 Oct 2022

A Sequence Modelling Approach to Question Answering in Text-Based Games

Greg Furman , Edan Toledo , Jonathan Shock , Jan Buys
NA ACL Workshop 14 Jul 2022

Assessing Quality-Diversity Neuro-Evolution Algorithms Performance in Hard Exploration Problems

Félix Chalumeau , Thomas Pierrot , Arthur Flajolet , Karim Beguir , Antoine Cully , Nicolas Perrin-Gilbert
GECCO Workshop 09 Jul 2022

Multi-Objective Quality Diversity Optimization

Thomas Pierrot , Guillaume Richard , Karim Beguir , Antoine Cully
GECCO 09 Jul 2022

Fast Population-Based Reinforcement Learning on a Single Machine

Arthur Flajolet , Claire Bizon Monroc , Karim Beguir , Thomas Pierrot
ICML 17 Jun 2022

The structural basis of Cdc7-Dbf4 kinase dependent targeting and phosphorylation of the MCM2-7 double hexamer

Almutasem Saleh , Yasunori Noguchi , Ricardo Aramayo , Marina E. Ivanova , Kathryn M. Stevens , Alex Montoya , S. Sunidhi , Nicolas Lopez Carranza , Marcin J. Skwark , Christian Speck
Nature Communications 25 May 2022

Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization

T. Pierrot , V. Macé , F. Chalumeau , A. Flajolet , G. Cideron , K. Beguir , A. Cully , O. Sigaud , N. Perrin-Gilbert
ICLR, GECCO 28 Apr 2022

Autoregressive neural-network wavefunctions for ab initio quantum chemistry

Dr Thomas Barrett , Prof A. I. Lvovsky , Aleksei Malyshev
Nature Machine Intelligence 06 Apr 2022

Robust and Scalable SDE Learning: a Functional Perspective

Scott Cameron , Tyron Cameron , Arnu Pretorius , Stephen Roberts
ICLR 24 Jan 2022

One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning

C. Bonnet , P. Caron , T. Barrett , I. Davies , A. Laterre
NeurIPS Workshop 2021 06 Dec 2021

On pseudo-absence generation and machine learning for locust breeding ground prediction in Africa

I.S. Yusuf , K. Tessera , T. Tumiel , Z. Slim , A. Kerkeni , S. Nevo , A. Pretorius
NeurIPS Workshop 2021 06 Dec 2021
NeurIPS Workshop 2021 06 Dec 2021

Mava: A new Framework for Distributed Multi-Agent Reinforcement Learning

A. Pretorius , K. Tessera , A.P. Smit , C. Formanek , S.J. Grimbly , K. Eloff , S. Danisa , L. Francis , J. Shock , H. Kamper , W. Brink , H. Engelbrecht , A. Laterre , K. Beguir
- 06 Jul 2021

Scaling Properties of Deep Residual Networks

A-S. Cohen , R. Cont , A. Rossier , R. Xu
ICML 27 May 2021

Designing a Prospective COVID-19 Therapeutic with Reinforcement Learning

M. J. Skwark , N. L. Carranza , T. Pierrot , J. Phillips , S. Said , A. Laterre , A. Kerkeni , U. Sahin , K. Beguir
NeurIPS 09 Dec 2020

Offline Reinforcement Learning Hands-On

Louis Monier , Jakub Kmec , Alexandre Laterre , Thomas Pierrot , Valentin Courgeau , Olivier Sigaud , Karim Beguir
NeurIPS Workshop 01 Dec 2020

A game-theoretic analysis of networked system control for common-pool resource management using multi-agent reinforcement learning

A. Pretorius , S. Cameron , E. Van Biljon , T. Makkink , S. Mawjee , J. Du Plessis , J. Shock , A. Laterre , K. Beguir
NeurIPS 2020 29 Sep 2020

AlphaNPI-X: Learning Compositional Neural Programs for Continuous Control

T. Pierrot , N. Perrin , F. Behbahani , A. Laterre , O. Sigaud , K. Beguir , N. De Freitas
- 27 Jul 2020

Masakhane — Machine Translation For Africa

I. Orife , J. Kreutzer , B. Sibanda , D. Whitenack , K. Siminyu , L. Martinus , J. T. Ali , J. Abbott , V. Marivate , S. Kabongo , M. Meressa , E. Murhabazi , O. Ahia , E. van Biljon , A. Ramkilowan , A. Akinfaderin , A. Öktem , W. Akin , G. Kioko , K. Degila , H. Kamper , B. Dossou , C. Emezue , K. Ogueji , A. Bashir
ICLR 27 Mar 2020

On Optimal Transformer Depth for Low-Resource Language Translation

Elan van Biljon , Arnu Pretorius , Julia Kreutzer
ICLR Workshop 01 Feb 2020

Towards Compositionality in Deep Reinforcement Learning

T. Pierrot , G. Ligner , S. Reed , O. Sigaud , N. Perrin , A. Laterre , D. Kas , K. Beguir , N. de Freitas
- 18 Jun 2019

Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization

A. Laterre , Y. Fu , M. K. Jabri , A-S. Cohen , D. Kas , K. Hajjar , T. S. Dahl , A. Kerkeni , K. Beguir
NeurIPS 01 Dec 2018

Explicit Sequence Proximity Models for Hidden State Identification

Anil Kota , Sharath Chandra , Parag Khanna , Torbjørn S. Dahl
NeurIPS 01 Dec 2018

There are no results that match your search.
Please try different search criteria.

Load More

Stay up to date!

Keep up with our latest research through our published works, and explore our open-source contributions now available on and

InstaDeep
Privacy Overview

Please read our extensive Privacy policy here. You can also read our Privacy Notice and our Cookie Notice