IEEE Best Paper Award
🏆 Our paper “Exploiting Intel Advanced Matrix Extensions (AMX) for LLM Inference”, published in IEEE Computer Architecture Letters (CAL), received the IEEE Best Paper Award!
This work explores CPU-GPU heterogeneous computing techniques that accelerate Large Language Model inference using an Intel Sapphire Rapids CPU with AMX. It was developed in collaboration with Prof. Nam Sung Kim’s research group.