Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo. João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O’Donnell. ICLR 2025 (selected for oral presentation).
From Language Models over Tokens to Language Models over Characters. Tim Vieira, Benjamin LeBrun, Mario Giulianelli, Juan Luis Gastaldi, Brian DuSell, John Terilla, Timothy J. O’Donnell, Ryan Cotterell. ICML 2025 (spotlight).
Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling. Benjamin Lipkin, Benjamin LeBrun, Jacob Hoover Vigly, João Loula, David R. MacIver, Li Du, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Timothy J. O’Donnell, Alexander K. Lew, Tim Vieira. Preprint.
Language Models over Canonical Byte-Pair Encodings. Tim Vieira, Tianyu Liu, Clemente Pasti, Yahya Emara, Brian DuSell, Benjamin Lebrun, Mario Giulianelli, Juan Luis Gastaldi, John Terilla, Timothy J. O’Donnell, Ryan Cotterell. ICML 2025.