Xiaoyi's Photo

Xiaoyi Lu

Assistant Professor
Founder and Director of Parallel and Distributed Systems Laboratory (PADSYS Lab)

Department of Computer Science and Engineering
School of Engineering
The University of California, Merced
Address: SE2 277, 5200 N. Lake Road, Merced, CA 95343
Email: xiaoyi DOT lu AT ucmerced DOT edu

Bio |  Research |  Publications |  Teaching |  People |  Services |  Awards |  PADSYS Lab |  HPCSA Group

Short Bio

Dr. Xiaoyi Lu is an Assistant Professor in the Department of Computer Science and Engineering at the University of California, Merced, USA. He is the founder and director of Parallel and Distributed Systems Laboratory (PADSYS Lab). Previously (2018-2020), he was a Research Assistant Professor at the Ohio State University (OSU). His current research interests include parallel and distributed computing, high-performance interconnects, advanced I/O technologies, Big Data Analytics, Virtualization, Cloud Computing, and Deep Learning system software. He has published more than 100 papers in major international conferences, workshops, and journals with multiple Best (Student) Paper Awards or Nominations. He has delivered more than 100 times of invited talks, tutorials, and presentations worldwide. He has been actively involved in various professional activities in academic journals and conferences. Many of Dr. Lu’s research outcomes (e.g, PMIdioBench, RDMA for Hadoop/Spark/TensorFlow/Memcached/Kafka, MVAPICH2-Virt, DataMPI, LingCloud, NeuroHPC) are made publicly available to the community and currently being used by hundreds of organizations all over the world. More details about Dr. Lu can be found at http://faculty.ucmerced.edu/luxi.

Research

My research interests include:

  • Parallel and Distributed Computing
  • Systems for HPC, Big Data, AI, Cloud Computing, Edge Computing, and others
  • High-Performance Communication and I/O Technologies (e.g., RDMA/PMEM/NVMe)
  • Container- and Hypervisor-based Virtualization
  • Performance, Scalability, Fault tolerance, QoS, and others

I am/was the founder/co-founder and R&D leader on the following projects:

Selected Publications

Complete publications list: [by year] [Google scholar] [DBLP]

  1. [SEC'21] Characterizing and Accelerating End-to-End EdgeAI Inference Systems for Object Detection Applications
    Yujie Hui, Jeffrey Lien, and Xiaoyi Lu.
    In Proceedings of the 6th ACM/IEEE Symposium on Edge Computing (SEC), 2021.
  2. [SC'21] HatRPC: Hint-Accelerated Thrift RPC over RDMA
    Tianxi Li*, Haiyang Shi*, and Xiaoyi Lu.
    In Proceedings of the 34th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2021. (*Co-First Authors)
  3. [HPDC'21] DStore: A Fast, Tailless, and Quiescent-Free Object Store for PMEM
    Shashank Gugnani and Xiaoyi Lu.
    In Proceedings of International ACM Symposium on High Performance and Distributed Computing (HPDC), 2021. (Acceptance Rate: 19%)
  4. [VLDB'21] Understanding the Idiosyncrasies of Real Persistent Memory
    Shashank Gugnani, Arjun Kashyap, and Xiaoyi Lu.
    In Proceedings of the VLDB Endowment, the 47th International Conference on Very Large Data Bases (VLDB), 2021.
  5. [IPDPS'21] NVMe-CR: A Scalable Ephemeral Storage Runtime for Checkpoint/Restart with NVMe-over-Fabrics
    Shashank Gugnani, Tianxi Li, and Xiaoyi Lu.
    In Proceedings of the 35th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2021.
  6. [SC'20] INEC: Fast and Coherent In-Network Erasure Coding
    Haiyang Shi and Xiaoyi Lu.
    In Proceedings of the 33rd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020. (Acceptance Rate: 22.3%)
  7. [SC'20] RDMP-KV: Designing Remote Direct Memory Persistence-based Key-Value Stores with PMEM
    Tianxi Li*, Dipti Shankar*, Shashank Gugnani, and Xiaoyi Lu.
    In Proceedings of the 33rd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2020. (Acceptance Rate: 22.3%, *Co-First Authors)
  8. [SC'19] TriEC: Tripartite Graph Based Erasure Coding NIC Offload  (Best Student Paper Finalist)
    Haiyang Shi and Xiaoyi Lu.
    In Proceedings of the 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019. (Acceptance Rate: 22.7%, 78/344)
  9. [HPDC'19] UMR-EC: A Unified and Multi-Rail Erasure Coding Library for High-Performance Distributed Storage Systems
    Haiyang Shi, Xiaoyi Lu, Dipti Shankar, and Dhabaleswar K. Panda.
    In Proceedings of the 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2019. (Acceptance Rate: 20.7%, 22/106)
  10. [IPDPS'19] C-GDR: High-Performance Container-aware GPUDirect MPI Communication Schemes on RDMA Networks
    Jie Zhang, Xiaoyi Lu, Ching-Hsiang Chu, and Dhabaleswar K. Panda.
    In Proceedings of the 33rd IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2019.
  11. [IISWC'19] SimdHT-Bench: Characterizing SIMD-Aware Hash Table Designs on Emerging CPU Architectures  (Best Paper Award Nomination)
    Dipti Shankar, Xiaoyi Lu, and Dhabaleswar K. Panda.
    In Proceedings of 2019 IEEE International Symposium on Workload Characterization (IISWC), 2019.
  12. [TPDS'18] Exploiting Hardware Multicast and GPUDirect RDMA for Efficient Broadcast
    Ching-Hsiang Chu, Xiaoyi Lu, Ammar A. Awan, Hari Subramoni, Bracy Elton, and Dhabaleswar K. (DK) Panda.
    In IEEE Transactions on Parallel and Distributed Systems (TPDS), accepted in July 2018.
  13. [TMSCS'18] DLoBD: A Comprehensive Study of Deep Learning over Big Data Stacks on HPC Clusters
    Xiaoyi Lu, Haiyang Shi, Rajarshi Biswas, M. Haseeb Javed, and Dhabaleswar K. (DK) Panda.
    In IEEE Transactions on Multi-Scale Computing Systems (TMSCS), 2018. 
  14. [TCDE'17] Scalable and Distributed Key-Value Store-based Data Management Using RDMA-Memcached
    Xiaoyi Lu, Dipti Shankar, and Dhabaleswar K. (DK) Panda.
    "IEEE Data Engineering Bulletin (DEBull), Volume 40", Bulletin of the Technical Committee on Data Engineering (TCDE), 2017. (Invited Paper)
  15. [SC'17] Scalable Reduction Collectives with Data Partitioning-based Multi-Leader Design
    Mohammadreza Bayatpour, Sourav Chakraborty, Hari Subramoni, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 30th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2017. (Acceptance Rate: 18.7%, 61/327)
  16. [ICDCS'17] High-Performance and Resilient Key-Value Store with Online Erasure Coding for Big Data Workloads
    Dipti Shankar, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 37th IEEE International Conference on Distributed Computing Systems (ICDCS), 2017. (Acceptance Rate: 16.9%, 90/531)
  17. [VEE'17] Designing Locality and NUMA Aware MPI Runtime for Nested Virtualization based HPC Cloud with SR-IOV Enabled InfiniBand
    Jie Zhang, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 13th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments (VEE), 2017.
  18. [IPDPS'17] High-Performance Virtual Machine Migration Framework for MPI Applications on SR-IOV enabled InfiniBand Clusters
    Jie Zhang, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 31st IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2017. (Acceptance Rate: 23%, 116/504)
  19. [TPDS'16] A Comprehensive Study of MapReduce over Lustre for Intermediate Data Placement and Shuffle Strategies on HPC Clusters
    Md. Wasi-ur-Rahman, Nusrat Islam, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    IEEE Transactions on Parallel and Distributed Systems, accepted in 2016.
  20. [IEEE BigData'16] High-Performance Design of Apache Spark with RDMA and Its Benefits on Various Workloads
    Xiaoyi Lu, Dipti Shankar, Shashank Gugnani, and Dhabaleswar K. (DK) Panda.
    Proceedings of IEEE International Conference on Big Data (IEEE BigData), 2016. (Acceptance Rate for Regular Paper: 18.68%, 79/423)
  21. [IPDPS'16] High-Performance Hybrid Key-Value Store on Modern Clusters with RDMA Interconnects and SSDs: Non-blocking Extensions, Designs, and Benefits
    Dipti Shankar, Xiaoyi Lu, Nusrat Islam, Md. Wasi-ur-Rahman, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 30th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2016. (Acceptance Rate: 23%, 114/496)
  22. [SC'16] Designing MPI Library with On-Demand Paging (ODP) of InfiniBand: Challenges and Benefits
    Mingzhe Li, Khaled Hamidouche, Xiaoyi Lu, Hari Subramoni, Jie Zhang, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 29th International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2016. (Acceptance Rate: 18.4%, 82/446)
  23. [ICS'16] High Performance Design for HDFS with Byte-Addressability of NVM and RDMA
    Nusrat Islam, Md. Wasi-ur-Rahman, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 30th International Conference on Supercompuing (ICS), 2016. (Acceptance Rate: 24%, 43/178)
  24. [IPDPS'15] High-Performance Design of YARN MapReduce on Modern HPC Clusters with Lustre and RDMA
    Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Islam, Raghunath Rajachandrasekar, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 29th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2015. (Acceptance Rate: 21.8%, 108/496)
  25. [ICDCS'15] Accelerating Apache Hive with MPI for Data Warehouse Systems
    Lu Chao, Chundian Li, Fan Liang, Xiaoyi Lu, and Zhiwei Xu.
    Proceedings of the 35th IEEE International Conference on Distributed Computing Systems (ICDCS), 2015. (Acceptance Rate: 12.9%, 70/543)
  26. [Cluster'14] High Performance OpenSHMEM for MIC Clusters: Extensions, Runtime Designs and Application Co-design (Best Paper Award Nomination)
    Jithin Jose, Khaled Hamidouche, Xiaoyi Lu, Sreeram Potluri, Jie Zhang, Karen Tomko, and Dhabaleswar K. (DK) Panda.
    Proceedings of IEEE International Conference on Cluster Computing (IEEE Cluster), 2014.
  27. [IPDPS'14] DataMPI: Extending MPI to Hadoop-like Big Data Computing
    Xiaoyi Lu, Fan Liang, Bing Wang, Li Zha, and Zhiwei Xu.
    Proceedings of the 28th IEEE International Parallel and Distributed Processing Symposium (IPDPS), 2014. (Acceptance Rate: 21.1%, 114/541)
  28. [ICS'14] HOMR: A Hybrid Approach to Exploit Maximum Overlapping in MapReduce over High Performance Interconnects
    Md. Wasi-ur-Rahman, Xiaoyi Lu, Nusrat Islam, and Dhabaleswar K. (DK) Panda.
    Proceedings of the 28th International Conference on Supercompuing (ICS), 2014. (Acceptance Rate: 21%, 34/162)
  29. [Cluster'13] A Scalable and Portable Approach to Accelerate Hybrid HPL on Heterogeneous CPU-GPU Clusters (Best Student Paper Award)
    Rong Shi, Sreeram Potluri, Khaled Hamidouche, Xiaoyi Lu, and Dhabaleswar K. (DK) Panda.
    Proceedings of IEEE International Conference on Cluster Computing (IEEE Cluster), 2013.

Teaching

People

Ph.D. Students:

Junior Specialist:

  • Tianxi Li

Undergraduate Students:

  • Xi Xiong

Past Students:

  • Shashank Gugnani (OSU Ph.D., 2020; First job placement: Oracle)
  • Haiyang Shi (OSU Ph.D., 2020; First job placement: ByteDance)
  • Haseeb Javed (OSU M.S., 2019; First job placement: Amazon)
  • Jun Huang (OSU Visiting Undergraduate Student, 2019; Join OSU as a Ph.D. student)
  • Dipti Shankar (OSU Ph.D., 2019; First job placement: Fraunhofer ITWM, Germany; Co-advised with Prof. D. K. Panda)

I am also fortunate to have worked with these talented past students as their mentor and thesis committee member:

  • Jie Zhang (OSU Ph.D., 2018; First job placement: Amazon)
  • Rajarshi Biswas (OSU M.S., 2018; First job placement: Amazon)
  • Mingzhe Li (OSU Ph.D., 2017; First job placement: Facebook)
  • Nusrat Islam (OSU Ph.D., 2016; First job placement: Intel)
  • Wasiur Rahman (OSU Ph.D., 2016; First job placement: Intel)
  • Kunal Kulkarni (OSU M.S., 2016; First job placement: Microsoft)
  • Adithya Bhat (OSU M.S., 2015; First job placement: Amazon)

Professional Services

TPC Chairs/Co-Chairs/Vice-Chairs:

  • IEEE International Workshop on High-Performance Big Data and Cloud Computing (HPBDC), 2015, 2016, 2017, 2018, 2019, 2020
  • IEEE Cloud Summit, 2020, 2021
  • International Forum on High-Performance Interconnects, co-located with National Annual Conference on High Performance Computing (HPC China), 2019, 2020, 2021

General Co-Chairs or Co-Organizers

  • BenchCouncil International Symposium on Benchmarking, Measuring and Optimizing (Bench), 2019, 2021
  • BoFs on AI, Big Data, and Cloud Computing topics, co-located with International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2017, 2019
  • A BoF on HPC and Cloud Computing topics, co-located with International Supercomputing Conference (ISC), 2017
  • Big Data Neuroscience Workshop, Organized by the Advanced Computational Neuroscience Network (ACNN), 2016, 2017, 2018, 2019

TPC Members:

  • International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019, 2020 (Poster Track), 2021
  • IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2022
  • ACM SIGMOD, 2021 (Demo Track)
  • IEEE International Conference on Distributed Computing Systems (ICDCS), 2021
  • IEEE Cluster Conference (CLUSTER), 2021
  • IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (IEEE/ACM CCGrid), 2020, 2021
  • International Conference on Parallel Processing (ICPP), 2015
  • IEEE International Conference on High Performance Computing, Data, and Analytics (HiPC), 2019, 2020
  • IEEE/ACM International Conference on Utility and Cloud Computing (UCC), 2017, 2018, 2019, 2020, 2021
  • International Conference on High-Performance Computing in Asia-Pacific Region (HPCAsia), 2020
  • International Conference on Computer Communications and Networks (ICCCN), 2015, 2016, 2018, 2019
  • International OpenFabrics Alliance Workshop (OFAW), 2021

Journal Reviewers:

  • ACM Transactions on Storage (TOS), 2019, 2020
  • ACM Transactions on Design Automation of Electronic Systems (TODAES), 2016
  • ACM Transactions on Architecture and Code Optimization (TACO), 2019
  • IEEE Transactions on Computers (TC), 2015, 2020
  • IEEE Transactions on Parallel and Distributed Systems (TPDS), 2016, 2017, 2018, 2019, 2020
  • IEEE Transactions on Cloud Computing (TCC), 2016, 2017
  • IEEE Transactions on Big Data (TBD), 2016, 2017
  • IEEE Transactions on Emerging Topics in Computing (TETC), 2015
  • IEEE Transactions on Services Computing (TSC), 2014
  • IEEE Transactions on Multi-Scale Computing Systems (TMSCS), 2018
  • IEEE Transactions on Communications (TCOM), 2019
  • Journal of Parallel and Distributed Computing (JPDC), 2014, 2015, 2016, 2017, 2018, 2019, 2020
  • IEEE Access, 2018

Selected Awards, Honors, and Recognitions

  • Best Paper Award Nomination - International Symposium on Benchmarking, Measuring, and Optimizing (Bench), 2019. (My role: Advisor)
  • Best Paper Award Nomination - IEEE International Symposium on Workload Characterization (IISWC), 2019. (My role: Co-Advisor)
  • Best Student Paper Award Finalist - The 32nd International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2019. (My role: Advisor)
  • Best Paper Award - International Symposium on Benchmarking, Measuring, and Optimizing (Bench), 2018. (My role: Advisor)
  • Best Poster Paper Award - International Supercomputing Conference (ISC), Architecture & Networks Track, 2018. (My role: Co-Author)
  • Best Student Paper Award - The 10th IEEE/ACM International Conference on Utility and Cloud Computing (UCC), 2017. (My role: Co-Author)
  • Best Paper Award Nomination - The IEEE International Conference on Cluster Computing (Cluster), 2014. (My role: Co-Author)
  • Best Student Paper Award - The IEEE International Conference on Cluster Computing (Cluster), 2013. (My role: Co-Author)
  • Outstanding Graduates (Ph.D.) of Beijing - Beijing Municipal Commission of Education, 2012.
  • Best Paper Award - The 9th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA), 2011. (My role: First Author)
  • Outstanding Student Award - Graduate University of Chinese Academy of Sciences, 2010.
  • Director Scholarship - Institute of Computing and Technology, Chinese Academy of Sciences,2010.