MulBERRY: Enabling Bit-Error Robustness for Energy-Efficient Multi-Agent Autonomous SystemsZishen WanNandhini Chandramoorthyet al.2024ASPLOS 2024
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length PredictionHaoran QiuWeichao Maoet al.2024ASPLOS 2024