Chao Chen

Ph.D. Georgia Institute of Technology

About Me

I am a software development engineer from Amazon Science. I am working on deep learning compiler and framework to speedup the training and inference of deep learning workloads. I got my Ph.D. from Georgia Institute of Technology, and was advised by Santosh Pande and Greg Eisenhauer. Before that, I received my M.S. and B.S. degrees from Hunan University (Changsha, China) under the supervision of Dr. Cheng Xu (徐成).

In general, my research interest lies in the intersection of compiler and system. I am particularly interested in applying compiler techniques to problems such as application performance and resilience.

News

  • May 2021: Our paper IterPro is accepted by IEEE TPDS.
  • April 2021: I joined amazon science as software development engineer.
  • Dec 2020: I defended my dissertation.
  • Nov 2020: I joined amazon SCOT as software development engineer.
  • May 2019: Our paper CARE is accepted by SC’19, and nominated as a Best Student Paper Finalist
  • Jan 2018: Our paper LADR is accespted by HPDC’18.
  • Sep 2016: I did internship at VMWare CTO Office, Boston, MA.
  • Aug 2016: Two papers related to active storage were awarded as Best Papers.

Selected Publications

Near-zero Downtime Recovery from Transient-Error-Induced Crashes.

Chao Chen, Greg Eisenhauer and Santosh Pande.

Accepted by IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021.

CARE: Compiler-Assisted Recovery for Soft Failures.

(Best Student Paper Finalist)

Chao Chen, Greg Eisenhauer, Santosh Pande and Qiang Guan.

International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), 2019.

LADR: Low-cost Application-level Detector for Reducing Silent Output Corruptions.

Chao Chen, Greg Eisenhauer, Matthew Wolf and Santosh Pande

ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC), 2018.

Active Burst-Buffer: In-Transit Processing Integrated into Hierarchical Storage.

(Best Paper)

Chao Chen, Michael Lang, Latchesar Ionkov and Yong Chen

11th IEEE International Conference on Networking, Architecture, and Storage (NAS), 2016.

Rethinking High Performance Computing System Architecture for Scientific Big Data Applications.

(Best Paper)

Yong Chen, Chao Chen, Yanlong Yin, Xianhe Sun, Rajeev Thakur and William Gropp.

14th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA)

A Decoupled Execution Paradigm for Data-Intensive High-End Computing.

Yong Chen, Chao Chen, Xian-He Sun, William D. Gropp, and Rajeev Thakur.

International Conference on Cluster Computing (Cluster), 2012.

DOSAS:Mitigating the Resource Contention in Active Storage Systems.

Chao Chen, Yong Chen and Philip C. Roth.

International Conference on Cluster Computing (Cluster), 2012.

Dynamic Active Storage for High Performance I/O.

Chao Chen and Yong Chen.

41st International Conference on Parallel Processing (ICPP), 2012.