Nov 2025

Speaker and panelist at the IVADO/MILA Workshop on Deploying Autonomous Agents in Montreal, presenting “Reality is Adversarial: Towards Robust Real-World Agents”. Honoured to be personally invited by Professors Siva Reddy and Yoshua Bengio.

Oct 2025

Gave an invited talk at the Imperial College London ICARL Seminar Series titled “Context: From Tokens to Capabilities”.

Sep 2025

Visited sunny Croatia to give an invited lecture at the Mediterranean Machine Learning (M2L) Summer School.

Aug 2025

Excited to join Google DeepMind working on reliable and robust function calling and tool use for Gemini!

Apr 2025

Invited talk at the London Machine Learning Meetup: “From Pretraining to Post-Training: Building Robust Enterprise-Ready Large Language Models”.

Mar 2025

Had a great conversation with Dr Tim Scarfe on Machine Learning Street Talk about the gap between humans and machines.

Feb 2025

Invited talk on “Build your own Generative Pretrained Transformer” at the Malta College of Arts, Science and Technology (MCAST).

Dec 2024

Our paper “The PRISM Alignment Project” won the Best Paper Award in the Datasets & Benchmarks track at NeurIPS 2024! 🎉

Nov 2024

“Fishing for Magikarp” received an Outstanding Paper Award at EMNLP 2024! Great collaboration with Sander Land.

Nov 2024

Invited talk at the University of Cambridge NLIP Seminar Series: “10 Slides on Human Feedback”.

May 2024

Joined Tom Hosking at the ICLR ‘24 poster session for our Human Feedback is Not Gold Standard work. Thanks for all the interest!

Nov 2023

Gave an invited talk on the application of LLMs for Enterprise at the Oracle AI@Molitor event.

May 2023

Gave a talk on NLP Applications and Large Language Models to the Capital Enterprise startup network.

Apr 2023

Honoured to have been nominated by my students for the UCL Inspiring Teaching Delivery award 🙏

Mar 2023

Gave an invited talk on Dynamic Advsersarial Data Collection for Large Language Models at the UCL AI Centre seminar on The Present and Future of Large Language Models in Theory and Practice.

Mar 2023

That’s a wrap! Another year of the MSIN0221 Natural Language Processing lectures comes to an end. Exciting to see the growing interest in NLP and its application!

Nov 2022

Presented recent work on DADC and GAAs at the King’s College London Distributed Artificial Intelligence group. Thanks for the insightful discussions!

Oct 2022

Super excited to announce that I have joined Cohere and will be working on making large language models more useful and robust.

Jul 2022

I’m in Seattle for NAACL 2022! I’ll be presenting Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants on Wednesday, 13th July at 10:45 PST. And don’t forget to join us at the DADC workshop on Thursday, 14th July for same amazing keynote talks, a diverse panel, presentations from our Shared Task participants and best paper winners, posters, prizes & much more!

May 2022

Our work Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants has been accepted as an oral presentation at NAACL 2022!

May 2022

Excited to announce that I have joined DeepMind as a Research Scientist Intern.

Apr 2022

Gave an invited talk on Dynamic Adversarial Data Collection for Question Answering at the Oracle Labs ML Seminar Series. This was a particularly fun and interactive one, thanks for the invite!

Mar 2022

The call for participation for the Shared Task at the DADC Workshop co-located with NAACL ‘22 in Seattle is now live! We have three fantastic tracks for you to participate in. Sign up here!

Mar 2022

Presented our work on Dynamic Adversarial Data Collection for QA at the University of Oxford.

Mar 2022

Just gave the last lecture of the MSIN0221 Natural Language Processing module for this year. Fantastic cohort as always and it was great to be back to in-person teaching!

Jan 2022

AdversarialQA is currently the 3rd most downloaded QA dataset on Huggingface 🤗 Datasets right after the benchmark SQuADv1.1 and SQuADv2!

Jan 2022

Our proposal for the First Workshop on Dynamic Adversarial Data Collection has been accepted! See you at NAACL ‘22 in Seattle!

Sep 2021

Dynabench is 1 year old! To celebrate, we’ve released Dynatask to help researchers host their own tasks.

Sep 2021

Presented a live demonstration of Dynamic Benchmarking at the UCL AI Centre 2nd Anniversary Showcase.

Aug 2021

Our work Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation has been accepted to the EMNLP 2021 Main Conference!

Aug 2021

Our work Contrasting Human-and Machine-Generated Word-Level Adversarial Examples for Text Classification has been accepted to the EMNLP 2021 Main Conference!

Aug 2021

ldbd.ly helps you make sense of ever-changing dynamic leaderboards.

Apr 2021

The Dynabench paper introducing our unified research platform for dynamic benchmarking has been accepted to NAACL 2021!

Apr 2021

Excited to announce that I have joined Facebook AI Research as an external research collaborator working on generation-assisted human adversarial annotation.

Jan 2021

The AdversarialQA dataset is now available in Huggingface 🤗 Datasets! Usage is as simple as from datasets import load_dataset; adversarial_qa = load_dataset('adversarial_qa', 'adversarialQA')

Dec 2020

The HAMLETS NeurIPS 2020 workshop kicks off today. Join us to learn more about Human And Model in the Loop Evaluation and Training Strategies.

Nov 2020

Presented Humans-and-Machines in the Loop for Dynamic Benchmarking and Evaluation at the Annual MURI Review Meeting.

Nov 2020

Presented Adversarial Human Annotation for Dynamic Benchmarking and Evaluation at the UCL AI Centre session on AI in science, industry and society at TheAlgo2020.

Sep 2020

Dynabench, in collaboration with Stanford University, the University of North Carolina at Chapel Hill, and Facebook AI, is now live! Can you fool the QA model?

Sep 2020
Sep 2020

Excited to announce that I have joined Facebook AI Research as a research intern working on adversarial benchmarking and robustness.

Apr 2020

Delivered the MSIN0221: Natural Language Processing module to this year’s UCL MSc Business Analytics cohort.

Feb 2020

Presented Adversarial Human Annotation for Reading Comprehension at the University of Cambridge NLIP Seminar Series.

Feb 2020

Accepted onto Cohort III of the Conception X programme!

Jun 2019

Presented Asking Harder Questions at the UCL NLP Inaugural Event followed by a poster session on ShARC.

May 2019

Delivered a two-part workshop titled Overview of NLP to this year’s UCL MSc Business Analytics cohort.

Apr 2019

Led a workshop titled Introduction to Python and Machine Learning at the Peking University HSBC Business School (PHBS) in Oxford.

Jan 2019

I have started a PhD at UCL under the guidance of Pontus Stenetorp and Sebastian Riedel.

Nov 2018

Presented Interpretation of Natural Language Rules in Conversational Machine Reading at EMNLP together with Patrick Lewis and other co-authors.

Oct 2018

The ShARC dataset from our EMNLP ‘18 paper is now live!

Aug 2018
Aug 2018

Cape (open source) is the new state-of-the-art for open-domain question answering on TriviaQA.

Aug 2018

Our large-scale question answering system, Cape, is now available open source!

Apr 2018

We’ve been accepted into the Allen & Overy Fuse accelerator programme.

Nov 2017

Invited presentation of the work we’re doing at Bloomsbury AI at the A Common Language for Intelligence meet-up hosted by Grakn AI.

May 2017

I have joined NLP-focused startup Bloomsbury AI, working on open-domain question answering.