Predicting LLM Inference Latency: A Roofline-Driven ML Method
- Saki Imai
- Rina Nakazawa
- et al.
- 2024
- NeurIPS 2024
I joined IBM in 2014 after receiving an M.S. degree in Computer Science from Ochanomizu University in Japan.
My current research interests include performance analysis, container technologies, microservices, and visualization.