publications

publications by categories in reversed chronological order.

2024

  1. CAL
    Exploiting Intel Advanced Matrix Extensions (AMX) for Large Language Model Inference
    Hyungyo Kim, Gaohan Ye, Nachuang Wang, and 2 more authors
    IEEE Computer Architecture Letters, May 2024