Publications
publications by categories in reversed chronological order. * indicates equal contribution
2024
- ASPLOS’24GSCore: Efficient Radiance Field Rendering via Architectural Support for 3D Gaussian SplattingIn Proceedings of the 2024 International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) , San Diego, CA, USA, 2024
- ISCA-51Tender: Accelerating Large Language Models via Tensor Decompostion and Runtime RequantizationIn Proceedings of the 51th Annual International Symposium on Computer Architecture (ISCA) , Buenos Aires, Argentina, 2024
- OSDI’24InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache ManagementIn Proceedings of the 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI) , Santa Clara, CA, USA, 2024
2023
- ISCA-50NeuRex: A Case for Neural Rendering AccelerationIn Proceedings of the 50th Annual International Symposium on Computer Architecture (ISCA) , Orlando, FL, USA, 2023