Xiaoyi LuAssociate Professor Department of Computer Science and Engineering |
Short Bio
Dr. Xiaoyi Lu is an Associate Professor in the Department of Computer Science and Engineering at the University of California, Merced (UC Merced), where he leads the Parallel and Distributed Systems Laboratory (PADSYS Lab). He has been an affiliated faculty member with the AgAID Institute since 2023. His research interests include parallel and distributed computing, high-performance communication and I/O technologies, big data analytics, cloud computing, deep learning, digital twin technology, and interdisciplinary research (e.g., precision agriculture and biostatistics). Dr. Lu has published more than 150 papers in prestigious international conferences, workshops, and journals, and has received ten Best (Student) Paper Awards or Nominations (e.g., SC 2019, IPDPS 2024). He has delivered over 100 invited talks, tutorials, and presentations around the world. Dr. Lu is actively involved in various professional activities in academic journals and conferences. He has made many of his research outcomes, such as PMIdioBench, HiBD, MVAPICH2-Virt, and DataMPI, publicly available to the community, and they are currently being used by hundreds of organizations worldwide. Dr. Lu has received the NSF CAREER Award, an Amazon Research Award, a Google Research Award, and a Meta/Facebook Faculty Research Award, and his research has been funded by the NSF and DOE. For more information about Dr. Lu, please visit http://faculty.ucmerced.edu/luxi.
UC Merced is ranked #7 in high performance computing (HPC) and #63 in computer science in general by CSRankings.
I am always looking for strong postdoc researchers and Ph.D. students who are interested in Parallel and Distributed Computing systems and applications.
Research
My research interests include:
- Parallel and Distributed Computing
- Scalable and Efficient Techniques and Systems for HPC, Big Data, AI, Cloud, Edge, and others
- High-Performance Communication and I/O Technologies (e.g., RDMA/PMEM/NVMe)
- Interdisciplinary Research
I am/was the founder/co-founder and R&D leader on the following projects:
- PMIdioBench: Understanding the Idiosyncrasies of Real Persistent Memory
- High-Performance Big Data (HiBD)
- High-Performance Deep Learning (HiDL) with RDMA-TensorFlow
- MVAPICH2-Virt: High-Performance MPI Library with Virtualization Support
- DataMPI: Extending MPI for Big Data with Key-Value based Communication
- LingCloud: An IaaS Management System for Heterogeneous Applications
- High-Performance Neuroscience (NeuroHPC)
Selected Publications
Complete publications list: [by year] [Google scholar] [DBLP]
- [PPoPP'25] SBMGT: Scaling Bayesian Multinomial Group Testing
Weicong Chen, Hao Qi, Curtis Tatsuoka, and Xiaoyi Lu.
Accepted in ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2025. (Acceptance Rate: 38/189, 20.1%) - [SIGMOD'25] On the Feasibility and Benefits of Extensive Evaluation
Yujie Hui*, Miao Yu*, Hao Qi, Yifan Gan, Tianxi Li, Yuke Li, Xueyuan Ren, Sixiang Ma, Xiaoyi Lu, and Yang Wang.
In Proceedings of ACM International Conference on Management of Data (SIGMOD), 2025. (* made equal contributions) - [SC'24] Versatile Datapath Soft Error Detection on the Cheap for HPC Applications
Yafan Huang, Sheng Di, Zhaorui Zhang, Xiaoyi Lu, Guanpeng Li.
In Proceedings of the 37th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024. - [SC'24] hZCCL: Accelerating Collective Communication with Co-Designed Homomorphic Compression
Jiajun Huang, Sheng Di, Xiaodong Yu, Zhaiyu Jia, Jinyang Liu, Zizhe Jian, Xin Liang, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo and Rajeev Thakur.
In Proceedings of the 37th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2024. - [IPDPS'24] Accelerating Lossy and Lossless Compression on Emerging BlueField DPU Architectures (Best Paper Award Nomination)
Yuke Li, Arjun Kashyap, Weicong Chen, Yanfei Guo, and Xiaoyi Lu.
In Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024. - [IPDPS'24] NVMe-oPF: Designing Efficient Priority Schemes for NVMe-over-Fabrics with Multi-Tenancy Support
Darren Ng, Andrew Lin, Arjun Kashyap, Guanpeng Li, and Xiaoyi Lu.
In Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024. - [IPDPS'24] DRUTO: Upper-Bounding Silent Data Corruption Vulnerability in GPU Applications
Md Hasanur Rahman, Sheng Di, Shengjian Guo, Xiaoyi Lu, Guanpeng Li, and Franck Cappello.
In Proceedings of the 38th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024. - [ICS'24] gZCCL: Compression-Accelerated Collective Communication Framework for GPU Clusters
Jiajun Huang, Sheng Di, Xiaodong Yu, Yujia Zhai, Jinyang Liu, Yafan Huang, Ken Raffenetti, Hui Zhou, Kai Zhao, Xiaoyi Lu, Zizhong Chen, Franck Cappello, Yanfei Guo, and Rajeev Thakur
In Proceedings of the 38th International Conference on Supercomputing (ICS), 2024. - [IPDPS'23] SBGT: Scaling Bayesian-based Group Testing for Disease Surveillance
Weicong Chen, Hao Qi, Xiaoyi Lu, and Curtis Tatsuoka.
In Proceedings of the 37th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2023. - [HPDC'22] NVMe-oAF: Towards Adaptive NVMe-oF for IO-Intensive Workloads on HPC Cloud
Arjun Kashyap and Xiaoyi Lu.
In Proceedings of International ACM Symposium on High Performance and Distributed Computing (HPDC), 2022. (Acceptance Rate: 19%) - [VLDB'22] A Study of Database Performance Sensitivity to Experiment Settings
Yang Wang, Miao Yu, Yujie Hui, Fang Zhou, Yuyang Huang, Rui Zhu, Xueyuan Ren, Tianxi Li, and Xiaoyi Lu
In Proceedings of the VLDB Endowment, the 48th International Conference on Very Large Data Bases (VLDB), 2022. - [SEC'21] Characterizing and Accelerating End-to-End EdgeAI Inference Systems for Object Detection Applications
Yujie Hui, Jeffrey Lien, and Xiaoyi Lu.
In Proceedings of the 6th ACM/IEEE Symposium on Edge Computing (SEC), 2021. - [SC'21] HatRPC: Hint-Accelerated Thrift RPC over RDMA
Tianxi Li*, Haiyang Shi*, and Xiaoyi Lu.
In Proceedings of the 34th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021. (*Co-First Authors) - [HPDC'21] DStore: A Fast, Tailless, and Quiescent-Free Object Store for PMEM
Shashank Gugnani and Xiaoyi Lu.
In Proceedings of International ACM Symposium on High Performance and Distributed Computing (HPDC), 2021. (Acceptance Rate: 19%) - [VLDB'21] Understanding the Idiosyncrasies of Real Persistent Memory
Shashank Gugnani, Arjun Kashyap, and Xiaoyi Lu.
In Proceedings of the VLDB Endowment, the 47th International Conference on Very Large Data Bases (VLDB), 2021. - [IPDPS'21] NVMe-CR: A Scalable Ephemeral Storage Runtime for Checkpoint/Restart with NVMe-over-Fabrics
Shashank Gugnani, Tianxi Li, and Xiaoyi Lu.
In Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021. - [SC'20] INEC: Fast and Coherent In-Network Erasure Coding
Haiyang Shi and Xiaoyi Lu.
In Proceedings of the 33rd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020. (Acceptance Rate: 22.3%) - [SC'20] RDMP-KV: Designing Remote Direct Memory Persistence-based Key-Value Stores with PMEM
Tianxi Li*, Dipti Shankar*, Shashank Gugnani, and Xiaoyi Lu.
In Proceedings of the 33rd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020. (Acceptance Rate: 22.3%, *Co-First Authors) - [SC'19] TriEC: Tripartite Graph Based Erasure Coding NIC Offload (Best Student Paper Finalist)
Haiyang Shi and Xiaoyi Lu.
In Proceedings of the 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019. (Acceptance Rate: 22.7%, 78/344) - [HPDC'19] UMR-EC: A Unified and Multi-Rail Erasure Coding Library for High-Performance Distributed Storage Systems
Haiyang Shi, Xiaoyi Lu, Dipti Shankar, and Dhabaleswar K. Panda.
In Proceedings of the 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2019. (Acceptance Rate: 20.7%, 22/106) - [IPDPS'19] C-GDR: High-Performance Container-aware GPUDirect MPI Communication Schemes on RDMA Networks
Jie Zhang, Xiaoyi Lu, Ching-Hsiang Chu, and Dhabaleswar K. Panda.
In Proceedings of the 33rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2019. - [IISWC'19] SimdHT-Bench: Characterizing SIMD-Aware Hash Table Designs on Emerging CPU Architectures (Best Paper Award Nomination)
Dipti Shankar, Xiaoyi Lu, and Dhabaleswar K. Panda.
In Proceedings of 2019 IEEE International Symposium on Workload Characterization (IISWC), 2019. - [TPDS'18] Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast
Ching-Hsiang Chu, Xiaoyi Lu, Ammar A. Awan, Hari Subramoni, Bracy Elton, and Dhabaleswar K. (DK) Panda.
In IEEE Transactions on Parallel and Distributed Systems (TPDS), accepted in July 2018. - [SC'17] Scalable Reduction Collectives with Data Partitioning-based Multi-Leader Design
Mohammadreza Bayatpour, Sourav Chakraborty, Hari Subramoni, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
Proceedings of the 30th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2017. (Acceptance Rate: 18.7%, 61/327) - [ICDCS'17] High-Performance and Resilient Key-Value Store with Online Erasure Coding for Big Data Workloads
Dipti Shankar, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
Proceedings of the 37th IEEE International Conference on Distributed Computing Systems (ICDCS), 2017. (Acceptance Rate: 16.9%, 90/531) - [VEE'17] Designing Locality and NUMA Aware MPI Runtime for Nested Virtualization based HPC Cloud with SR-IOV Enabled InfiniBand
Jie Zhang, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
Proceedings of the 13th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE), 2017. - [IPDPS'17] High-Performance Virtual Machine Migration Framework for MPI Applications on SR-IOV enabled InfiniBand Clusters
Jie Zhang, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
Proceedings of the 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2017. (Acceptance Rate: 23%, 116/504) - [TPDS'16] A Comprehensive Study of MapReduce over Lustre for Intermediate Data Placement and Shuffle Strategies on HPC Clusters
Md. Wasi-ur-Rahman, Nusrat Islam, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
IEEE Transactions on Parallel and Distributed Systems, accepted in 2016. - [IPDPS'16] High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits
Dipti Shankar, Xiaoyi Lu, Nusrat Islam, Md. Wasi-ur-Rahman, and Dhabaleswar K. (DK) Panda.
Proceedings of the 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2016. (Acceptance Rate: 23%, 114/496) - [SC'16] Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits
Mingzhe Li, Khaled Hamidouche, Xiaoyi Lu, Hari Subramoni, Jie Zhang, and Dhabaleswar K. (DK) Panda.
Proceedings of the 29th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2016. (Acceptance Rate: 18.4%, 82/446) - [ICS'16] High Performance Design for HDFS with Byte-Addressability of NVM and RDMA
Nusrat Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
Proceedings of the 30th International Conference on Supercompuing (ICS), 2016. (Acceptance Rate: 24%, 43/178) - [IPDPS'15] High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA
Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Islam, Raghunath Rajachandrasekar, and Dhabaleswar K. (DK) Panda.
Proceedings of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015. (Acceptance Rate: 21.8%, 108/496) - [ICDCS'15] Accelerating Apache Hive with MPI for Data Warehouse Systems
Lu Chao, Chundian Li, Fan Liang, Xiaoyi Lu, and Zhiwei Xu.
Proceedings of the 35th IEEE International Conference on Distributed Computing Systems (ICDCS), 2015. (Acceptance Rate: 12.9%, 70/543) - [ICS'14] HOMR: A Hybrid Approach to Exploit Maximum Overlapping in MapReduce over High Performance Interconnects
Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Islam, and Dhabaleswar K. (DK) Panda.
Proceedings of the 28th International Conference on Supercompuing (ICS), 2014. (Acceptance Rate: 21%, 34/162) - [IPDPS'14] DataMPI: Extending MPI to Hadoop-like Big Data Computing
Xiaoyi Lu, Fan Liang, Bing Wang, Li Zha, and Zhiwei Xu.
Proceedings of the 28th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2014. (Acceptance Rate: 21.1%, 114/541)
Teaching
- UCM CSE 100 - Algorithm Design and Analysis (Undergraduate Course, Autumn 2024)
- UCM EECS 266 - Advanced Distributed Systems (Graduate Course, Spring 2024)
- UCM CSE 168 - Distributed Software Systems (Undergraduate Course, Autumn, 2021-2023)
- UCM EECS 268 - Datacenter-Scale Computing (Graduate Course, Spring, 2022-2023)
- UCM EECS 290 - EECS Seminars (Graduate Course, Autumn 2023)
- OSU CSE 3244 - Data Mgmt in Cloud (Undergraduate Course, Autumn 2020)
People
Postdocs:
- Weicong Chen
- Duo Zhang
Ph.D. Students:
- Liuyao Dai
- Yujie Hui (OSU, co-advised with Dr. Yang Wang)
- Arjun Kashyap
- Yuke Li
- Tianxi Li (OSU, co-advised with Dr. Yang Wang)
- Darren Ng
- Hao Qi
- Adam Weingram
Master Students:
- Noel Pereira
Undergraduate Students:
- Savio Jabbo (recruited via UCM UROC, Spring 2024)
- Stephanie Lin (recruited via UCM UROC, Spring 2024)
- Anika Potu (recruited via UCM UROC, Fall 2023)
- Rohit Rao (recruited via UCM Course, Fall 2023)
- Henry Rodas Hernandez (recruited via UCM UROC, Fall 2023)
- Alex Villa (Fall 2023)
- Alex Wan (recruited via UCM FACTS, Summer 2023)
- Deyi (Daniel) Xing (recruited via UCM UROC, Fall 2023)
High-School Summer Interns:
- Benny Liu
- Lisa Yu
Past Students:
- Tai Pham (UCM Undergraduate Student, 2024; Joined Georgia Tech as a Master's student)
- Andrew Lin (UCM Undergraduate Student, 2024; Joined UC Irvine as a Master's student)
- Colin Schmierer (UCM Undergraduate Student, 2023; First job placement: CACI International Inc)
- Charles Parkinson (UCM Undergraduate Student, 2022; First job placement: Amazon)
- Shashank Gugnani (OSU Ph.D., 2020; First job placement: Oracle)
- Haiyang Shi (OSU Ph.D., 2020; First job placement: ByteDance)
- Haseeb Javed (OSU M.S., 2019; First job placement: Amazon)
- Xi Xiong (OSU Undergraduate Student, 2019; Joined Northwestern Univ. as a Master's student)
- Heming Sun (OSU Undergraduate Student, 2019; Joined USC as a Master's student)
- Jun Huang (OSU Visiting Undergraduate Student, 2019; Joined OSU as a Ph.D. student)
- Dipti Shankar (OSU Ph.D., 2019; First job placement: Fraunhofer ITWM, Germany; Co-advised with Prof. D. K. Panda)
I am also fortunate to have worked with these talented past students as their mentor and thesis committee member:
- Weicong Chen (CWRU Ph.D., 2022; First job placement: Join PADSYS Lab as a postdoc)
- Jie Zhang (OSU Ph.D., 2018; First job placement: Amazon)
- Rajarshi Biswas (OSU M.S., 2018; First job placement: Amazon)
- Mingzhe Li (OSU Ph.D., 2017; First job placement: Facebook)
- Nusrat Islam (OSU Ph.D., 2016; First job placement: Intel)
- Md. Wasi-ur-Rahman (OSU Ph.D., 2016; First job placement: Intel)
- Kunal Kulkarni (OSU M.S., 2016; First job placement: Microsoft)
- Adithya Bhat (OSU M.S., 2015; First job placement: Amazon)
Professional Services
TPC Chairs/Vice-Chairs/Area Co-Chairs:
- IEEE International Parallel & Distributed Processing Symposium (IPDPS), Architecture Area, 2024
- IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), Scalable Algorithms and Analytics Track, 2024
- IEEE International Workshop on High-Performance Big Data and Cloud Computing (HPBDC), 2015-2020
- IEEE Cloud Summit, 2020, 2021
General Co-Chairs or Co-Organizers
- BenchCouncil International Symposium on Benchmarking, Measuring and Optimizing (Bench), 2019, 2021
- BoFs on AI, Big Data, and Cloud Computing topics, co-located with the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2017, 2019
- A BoF on HPC and Cloud Computing topics, co-located with the International Supercomputing Conference (ISC), 2017
- Big Data Neuroscience Workshop, Organized by the Advanced Computational Neuroscience Network (ACNN), 2016-2019
TPC Members:
- International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019-2022, 2024
- ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2022, 2023
- ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP), 2025
- IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2022
- ACM SIGMOD, 2021 (Demo Track)
- IEEE International Conference on Distributed Computing Systems (ICDCS), 2021, 2023, 2024
- IEEE Cluster Conference (CLUSTER), 2021, 2023, 2024
- IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (IEEE/ACM CCGrid), 2020-2022, 2024
- International Conference on Parallel Processing (ICPP), 2015, 2022
- IEEE International Conference on Big Data (IEEE BigData), 2024
- IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2019, 2020, 2022, 2023
- IEEE/ACM International Conference on Utility and Cloud Computing (UCC), 2017-2024
- International Conference on High-Performance Computing in Asia-Pacific Region (HPCAsia), 2020, 2023
- IEEE Hot Interconnects Symposium (HotI), 2024
Journal Reviewers:
- ACM Transactions on Storage (TOS), 2019, 2020
- ACM Transactions on Design Automation of Electronic Systems (TODAES), 2016
- ACM Transactions on Architecture and Code Optimization (TACO), 2019
- ACM Transactions on Embedded Computing Systems (TECS), 2021
- ACM Transactions on Modeling and Performance Evaluation of Computing Systems (ToMPECS), 2022
- IEEE Transactions on Computers (TC), 2015, 2020
- IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016, 2017, 2018, 2019, 2020, 2022
- IEEE Transactions on Cloud Computing (TCC), 2016, 2017
- IEEE Transactions on Big Data (TBD), 2016, 2017
- IEEE Transactions on Emerging Topics in Computing (TETC), 2015
- IEEE Transactions on Services Computing (TSC), 2014
- IEEE Transactions on Multi-Scale Computing Systems (TMSCS), 2018
- IEEE Transactions on Communications (TCOM), 2019
- Journal of Parallel and Distributed Computing (JPDC), 2014, 2015, 2016, 2017, 2018, 2019, 2020
- IEEE Access, 2018
Selected Awards, Honors, and Recognitions
- Best Paper Award Nomination - The 38th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2024. (My role: Advisor)
- NSF CAREER Award - National Science Foundation, 2024.
- Affiliated Faculty - AgAID Institute, Agricultural AI for Transforming Workforce and Decision Support, funded by NSF and USDA-NIFA by the AI Research Institutes Program, July 2023 -- now.
- Amazon Research Award - Amazon Inc., 2023. (My role: Sole PI)
- Google Research Award - Google LLC, 2022. (My role: Sole PI)
- Meta/Facebook Faculty Research Award - Meta Platforms Inc. (formerly named Facebook Inc.), 2022. (My role: Sole PI)
- Scientific Teaching Fellow - Summer Institute on Scientific Teaching, UC Merced, 2021. (My role: Awardee)
- Best Paper Award Nomination - International Symposium on Benchmarking, Measuring, and Optimizing (Bench), 2019. (My role: Advisor)
- Best Paper Award Nomination - IEEE International Symposium on Workload Characterization (IISWC), 2019. (My role: Co-Advisor)
- Best Student Paper Award Finalist - The 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019. (My role: Advisor)
- Best Paper Award - International Symposium on Benchmarking, Measuring, and Optimizing (Bench), 2018. (My role: Advisor)
- Best Poster Paper Award - International Supercomputing Conference (ISC), Architecture & Networks Track, 2018. (My role: Co-Author)
- Best Student Paper Award - The 10th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), 2017. (My role: Co-Author)
- Best Paper Award Nomination - The IEEE International Conference on Cluster Computing (Cluster), 2014. (My role: Co-Author)
- Best Student Paper Award - The IEEE International Conference on Cluster Computing (Cluster), 2013. (My role: Co-Author)
- Best Paper Award - The 9th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA), 2011. (My role: First Author)