Fast Large Language Model Collaborative Decoding via Speculation
was accepted to ICML 2025.