NVIDIA Learning and Perception Research12/1/2025Fast-SLM: Towards Latency-Optimal Hybrid Small Language ModelsKarsten Kreis; Yonggan FuPublication Advances in Neural Information Processing Systems (NeurIPS)Read at NVIDIA Learning and Perception ResearchTagsaimachine-learningnlp