news | Victor Agostinelli

Apr 18, 2025	Thrilled to be taking an internship with Meta this summer as part of their AI Efficiency Insights team in Bellevue, Washington!
Mar 26, 2025	UltraFormer, a still-developing hyper-efficient transformer architecture featuring hybrid linear-sparse attention and ternary linear projections, has been accepted for presentation as a poster at FCCM ‘25!
Jan 10, 2025	Proud to be a co-organizer of IWSLT ‘25, specifically the simultaneous track! To be co-hosted at ACL ‘25 in Vienna, Austria.
Sep 20, 2024	SimulMask (official implementation here), which is built on Simul-LLM, has been accepted at EMNLP ‘24! To be presented as a poster at the conference.
Jun 26, 2024	New preprint on simultaneous translation with LLMs! SimulMask represents a significant departure from typical causal masking to unify fine-tuning and inference context management for simultaneous tasks.
Mar 16, 2024	Two papers accepted in short order! LeaPformers and Simul-LLM at ICML ‘24 and ACL ‘24 respectively. To be presented as posters at both conferences.
Jan 20, 2024	Started working with researchers at Pacific Northwest National Laboratory in a more formal fashion! Playing around with HLS and Torch-to-Verilog flow optimization with MLIR for AI acceleration.