Research
Publications
-
- Date
- Title
- Memory Consolidation Enables Long-Context Video Understanding
- Authors
- Ivana Balazevic, Jimmy Shi, Nelly Papalampidi, Rahma Chaabouni, Skanda Koppula, Olivier Henaff
- Venue
- arXiv
-
- Date
- Title
- Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
- Authors
- Nicolas Nguyen, Imad Aouali, András György, Claire Vernade
- Venue
- arXiv
-
- Date
- Title
- Large Language Models Self-Discover Reasoning Structures
- Authors
- Pei Zhou*, Jay Pujara*, Xiang Ren*, Swaroop Mishra, Steven Zheng, Denny Zhou, Heng-Tze Cheng, Quoc Le, Ed Chi, Xinyun Chen
- Venue
- arXiv
-
- Date
- Title
- States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
- Authors
- Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls
- Venue
- arXiv
-
- Date
- Title
- Transfer Learning for Bayesian Optimization on Heterogeneous Search Spaces
- Authors
- Zhou Fan, Xinran Han, Zi Wang
- Venue
- Transactions on Machine Learning Research (TMLR)
-
- Date
- Title
- Fractal Patterns May Unravel the Intelligence in Next-Token Prediction
- Authors
- Ibrahim Alabdulmohsin, Vinh Q. Tran, Mostafa Dehghani
- Venue
- arXiv
-
- Date
- Title
- Exploration at Scale using Epistemic Neural Networks
- Authors
- Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy
- Venue
- arXiv
-
- Date
- Title
- Robust agents learn causal world models
- Authors
- Jonathan Richens, Tom Everitt
- Venue
- ICLR 2024
-
- Date
- Title
- Learning Universal Predictors
- Authors
- Jordi Grau-Moya *, Tim Genewein *, Marcus Hutter *, Laurent Orseau *, Grégoire Déletang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison and Joel Veness
- Venue
- arXiv
-
- Date
- Title
- Neural Population Learning beyond Symmetric Zero-Sum Games
- Authors
- Siqi Liu, Luke Marris, Marc Lanctot, Georgios Piliouras, Joel Leibo, Nicolas Heess
- Venue
- AAMAS 2024
-
- Date
- Title
- E3x: E(3)-Equivariant Deep Learning Made Easy
- Authors
- Oliver Unke, Hartmut Maennel
- Venue
- arXiv
-
- Date
- Title
- Asynchronous Local-SGD Training forLanguage Modeling
- Authors
- Bo Liu*, Arthur Douillard, Rachita Chhaparia, Jiajun Shen, Andrei Rusu, Arthur Szlam, Marc'aurelio Ranzato, Satyen Kale
- Venue
- arXiv
-
- Date
- Title
- Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization
- Authors
- Ian Gemp, Luke Marris, Georgios Piliouras
- Venue
- ICLR 2024
-
- Date
- Title
- GATS: Gather-Attend-Scatter
- Authors
- Konrad Zolna, Serkan Cabi, Yutian Chen, Eric Lau, Claudio Fantacci, Jurgis Pasukonis, Jost Tobias Springenberg, Sergio Gomez
- Venue
- arXiv
-
- Date
- Title
- NfgTransformer: Equivariant Representation Learning for Normal-form Games
- Authors
- Siqi Liu, Luke Marris, Ian Gemp, Georgios Piliouras, Nicolas Heess
- Venue
- ICLR 2024
-
- Date
- Title
- Generative Adversarial Equilibrium Solvers
- Authors
- Denizalp Goktas, David C. Parkes, Ian Gemp, Luke Marris, Georgios Piliouras, Romuald Elie, Guy Lever, Andrea Tacchetti
- Venue
- ICLR 2024
-
- Date
- Title
- Directly Fine-Tuning Diffusion Models on Differentiable Rewards
- Authors
- Kevin Clark*, Paul Vicol*, Kevin Swersky, David J. Fleet
- Venue
- ICLR 2024
-
- Date
- Title
- On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
- Authors
- Rishabh Agarwal, Nino Vieillard, Yongchao Zhou, Piotr Stanczyk, Sabela Ramos, mfgeist , Olivier Bachem
- Venue
- ICLR 2024
-
- Date
- Title
- Learning Planning-compatible Cognitive Maps with Transformers in PartiallyObserved Environments
- Authors
- Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou, Dileep George, Miguel Lázaro-Gredilla
- Venue
- arXiv
-
- Date
- Title
- Distributional reinforcement learning in prefrontal cortex
- Authors
- Timothy Muller*, James Butler*, Sebastijan Veselic*, Bruno Miranda*, Timothy Behrens*, Zeb Kurth-Nelson, Steve Kennerley*
- Venue
- Nature Neuroscience