Publications

Publications by category in reverse chronological order. * indicates equal contribution.

2024

  1. OSDI ’24
    InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management
    Wonbeom Lee*, Jungi Lee*, Junghwan Seo, and Jaewoong Sim
    In Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI), Santa Clara, CA, USA, 2024
  2. ISCA-51
    Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization
    Jungi Lee*, Wonbeom Lee*, and Jaewoong Sim
    In Proceedings of the 51st Annual International Symposium on Computer Architecture (ISCA), Buenos Aires, Argentina, 2024
  3. ASPLOS ’24
    GSCore: Efficient Radiance Field Rendering via Architectural Support for 3D Gaussian Splatting
    Junseo Lee, Seokwon Lee, Jungi Lee, Junyong Park, and Jaewoong Sim
    In Proceedings of the 2024 International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), San Diego, CA, USA, 2024

2023

  1. ISCA-50
    NeuRex: A Case for Neural Rendering Acceleration
    Junseo Lee, Kwanseok Choi, Jungi Lee, Seokwon Lee, Joonho Whangbo, and Jaewoong Sim
    In Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA), Orlando, FL, USA, 2023