KV-cache Management for Improving Run-time efficiency of Large Reasoning Models

RPI Principal Investigators
Mohammad Mohammadi Amiri
IBM Principal Investigators
Pin-Yu Chen,Tejaswini Pedapati, Subhajit Chaudhury, Keerthiram Murugesan, Kaoutar El Maghraoui, Naigang Wang, and Charlie Liu
Project Year
Back to top