Keywords (tags) and Publication List

2021

Hua, Fei ; Chen, Yanhao ; Jin, Yuwei ; Zhang, Chi ; Hayes, Ari ; Zhang, Youtao ; Zhang, Eddy Z

AutoBraid: A Framework for Enabling Efficient Surface Code Communication in Quantum Computing Conference

54th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO 2021), Association for Computing Machinery, 2021, ISBN: 9781450385572.

Abstract | Links | BibTeX | Tags: Compiler optimization, Quantum Computing, Quantum Error Correction

Chen, Yanhao ; Hua, Fei ; Jin, Yuwei ; Zhang, Eddy Z

BGPQ: A Heap-Based Priority Queue Design for GPUs Conference

50th International Conference on Parallel Processing (ICPP 2021), Association for Computing Machinery, New York, NY, USA, 2021, ISBN: 9781450390682.

Abstract | Links | BibTeX | Tags: Batched Heap, GPUs, Priority Queue

Zhang, Chi ; Hayes, Ari B; Qiu, Longfei ; Jin, Yuwei ; Chen, Yanhao ; Zhang, Eddy Z

Time-Optimal Qubit Mapping Conference

Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS 2021), Association for Computing Machinery, New York, NY, USA, 2021.

Links | BibTeX | Tags: A* search, Compilation technique, QFT, Quantum, Qubit mapping

2019

Hua, Fei; Zhang, Eddy Z

Optimizing Surface Code Braiding Workshop

The 3rd International Workshop on Quantum Compilation (IWQC 2019), 2019.

BibTeX | Tags: Compilation technique, Quantum, Surface code

Fu, Hao; Zhu, Mingzheng; Wu, Wenli; Zhang, Eddy Z; Tan, Haisheng

Towards Optimal Qubit Mapping in NISQ Era Workshop

The 3rd International Workshop on Quantum Compilation (IWQC 2019), 2019.

BibTeX | Tags: Compilation technique, Quantum, Qubit mapping

Hayes, Ari B; Hua, Fei; Huang, Jin; Chen, Yanhao; Zhang, Eddy Z

Decoding CUDA Binary Conference

Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2019), IEEE Press, Washington, DC, USA, 2019, ISBN: 9781728114361.

Abstract | BibTeX | Tags: Code generation, Code translation and transformation, CUDA, GPU, Instruction set architecture (ISA)

2018

Chen, Yanhao; Hayes, Ari B; Zhang, Chi; Salmon, Timothy; Zhang, Eddy Z

Locality-Aware Software Throttling for Sparse Matrix Operation on GPUs Conference

2018 USENIX Annual Technical Conference (USENIX ATC 18), USENIX Association, Boston, MA, 2018, ISBN: 978-1-939133-01-4.

Links | BibTeX | Tags: GPU, Program locality, Sparse matrix, Spmv

2017

Hayes, Ari B; Li, Lingda; Hedayati, Mohammad; He, Jiahuan; Zhang, Eddy Z; Shen, Kai

GPU Taint Tracking Conference

2017 USENIX Annual Technical Conference (USENIX ATC 17), USENIX Association, Santa Clara, CA, 2017, ISBN: 978-1-931971-38-6.

Links | BibTeX | Tags:

Li, Lingda; Geda, Robel; Hayes, Ari B; Chen, Yanhao; Chaudhari, Pranav; Zhang, Eddy Z; Szegedy, Mario

A Simple Yet Effective Balanced Edge Partition Model for Parallel Computing Conference

Proceedings of the 2017 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems (Sigmetrics 2017), SIGMETRICS ’17 Abstracts Association for Computing Machinery, Urbana-Champaign, Illinois, USA, 2017, ISBN: 9781450350327.

Abstract | Links | BibTeX | Tags: Data sharing, Edge-partition, GPU, Graph model, Program locality

Li, Lingda; Geda, Robel; Hayes, Ari B; Chen, Yanhao; Chaudhari, Pranav; Zhang, Eddy Z; Szegedy, Mario

A Simple Yet Effective Balanced Edge Partition Model for Parallel Computing Journal Article

SIGMETRICS Perform. Eval. Rev., 45 (1), pp. 6, 2017, ISSN: 0163-5999.

Abstract | Links | BibTeX | Tags: Data sharing, Edge-partition, GPU, Graph model, Program locality

Li, Pengcheng; Hu, Xiaoyu; Chen, Dong; Brock, Jacob; Luo, Hao; Zhang, Eddy Z; Ding, Chen

LD: Low-Overhead GPU Race Detection Without Access Monitoring Journal Article

ACM Transaction on Architecture and Code Optimization (TACO 2017), 14 (1), 2017, ISSN: 1544-3566.

Abstract | Links | BibTeX | Tags: GPU race detection, Instrumentation-free, Value-based checking

Catarata, Jan; Corbett, Scott; Stern, Harry; Szegedy, Mario; Vyskocil, Tomas; Zhang, Zheng

The Moser-Tardos Resample algorithm: Where is the limit? (an experimental inquiry) Workshop

2017.

Links | BibTeX | Tags:

2016

Hayes, Ari B; Li, Lingda; Chavarría-Miranda, Daniel; Song, Shuaiwen Leon; Zhang, Eddy Z

Orion: A Framework for GPU Occupancy Tuning Conference

Proceedings of the 17th International Middleware Conference (Middleware 2017), Association for Computing Machinery, Trento, Italy, 2016, ISBN: 9781450343008.

Abstract | Links | BibTeX | Tags: Concurrent-program compilation, GPU occupancy tuning, Register allocation, Shared memory allocation

Li, Lingda; Hayes, Ari B; Song, Shuaiwen Leon; Zhang, Eddy Z

Tag-Split Cache for Efficient GPGPU Cache Utilization Conference

Proceedings of the 2016 International Conference on Supercomputing (ICS 2016), Association for Computing Machinery, Istanbul, Turkey, 2016, ISBN: 9781450343619.

Abstract | Links | BibTeX | Tags: Cache organization, GPGPU, Spatial Locality

Tao, Dingwen; Song, Shuaiwen Leon; Krishnamoorthy, Sriram; Wu, Panruo; Liang, Xin; Zhang, Eddy Z; Kerbyson, Darren; Chen, Zizhong

New-Sum: A Novel Online ABFT Scheme For General Iterative Methods Conference

Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC 2016), Association for Computing Machinery, Kyoto, Japan, 2016, ISBN: 9781450343145.

Abstract | Links | BibTeX | Tags: Algorithm-based fault tolerance (abft), Checkpoint, Checksum, Iterative methods, Online error detection, Resilience, Rollback recovery, Silent data corruption (sdc)

Li, Ang; Song, Shuaiwen Leon; Kumar, Akash; Zhang, Eddy Z; -, Daniel Chavarría G; Corporaal, Henk

Critical points based register-concurrency autotuning for GPUs Conference

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE 2016), Dresden, Germany, March 14-18, 2016, IEEE, 2016.

Links | BibTeX | Tags:

2015

Renart, E G; Zhang, E Z; Nath, B

Towards a GPU SDN controller Workshop

2015 International Conference and Workshops on Networked Systems (NetSys), 2015.

BibTeX | Tags:

2014

Egielski, Ian J; Huang, Jesse; Zhang, Eddy Z

Massive Atomics for Massive Parallelism on GPUs Conference

Proceedings of the 2014 International Symposium on Memory Management (ISMM 2014), Association for Computing Machinery, Edinburgh, United Kingdom, 2014, ISBN: 9781450329217.

Abstract | Links | BibTeX | Tags: Atomics, Concurrency, GPU, Parallelism

Hayes, Ari B; Zhang, Eddy Z

Unified On-Chip Memory Allocation for SIMT Architecture Conference

Proceedings of the 28th ACM International Conference on Supercomputing (ICS 2014), Association for Computing Machinery, Munich, Germany, 2014, ISBN: 9781450326421.

Abstract | Links | BibTeX | Tags: Compiler optimization, Concurrency, GPU, Register allocation, Shared memory allocation

2013

Wu, Bo; Zhao, Zhijia; Zhang, Eddy Zheng; Jiang, Yunlian; Shen, Xipeng

Complexity Analysis and Algorithm Design for Reorganizing Data to Minimize Non-Coalesced Memory Accesses on GPU Conference

Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2013), Association for Computing Machinery, Shenzhen, China, 2013, ISBN: 9781450319225.

Abstract | Links | BibTeX | Tags: Computational complexity, Data transformation, GPGPU, Memory coalescing, Runtime optimizations, Thread-data remapping

Shen, Xipeng; Liu, Yixun; Zhang, Eddy Z; Bhamidipati, Poornima

An Infrastructure for Tackling Input-Sensitivity of GPU Program Optimizations Journal Article

Int. J. Parallel Program., 41 (6), pp. 855–869, 2013, ISSN: 0885-7458.

Abstract | Links | BibTeX | Tags: Cross-input adaptation, CUDA, Empirical search, G-ADAPT, GPU, Program optimizations

2012

Zhang, E Z; Jiang, Y; Shen, X

The Significance of CMP Cache Sharing on Contemporary Multithreaded Applications Journal Article

IEEE Transactions on Parallel and Distributed Systems, 23 (2), pp. 367-374, 2012.

BibTeX | Tags:

2011

Wu, B; Zhang, E Z; Shen, X

Enhancing Data Locality for Dynamic Simulations through Asynchronous Data Transformations and Adaptive Control Conference

2011 International Conference on Parallel Architectures and Compilation Techniques, 2011.

BibTeX | Tags:

Guo, Z; Zhang, E Z; Shen, X

Correctly Treating Synchronizations in Compiling Fine-Grained SPMD-Threaded Programs for CPU Conference

2011 International Conference on Parallel Architectures and Compilation Techniques, 2011.

BibTeX | Tags:

Tian, Kai; Zhang, Eddy; Shen, Xipeng

A Step towards Transparent Integration of Input-Consciousness into Dynamic Program Optimizations Conference

Proceedings of the 2011 ACM International Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA ’11 Association for Computing Machinery, Portland, Oregon, USA, 2011, ISBN: 9781450309400.

Abstract | Links | BibTeX | Tags: Dynamic optimizations, Dynamic version selection, Java virtual machine, Just-in-time compilation, Proactivity, Seminal behaviors

Zhang, Eddy Z; Jiang, Yunlian; Guo, Ziyu; Tian, Kai; Shen, Xipeng

On-the-Fly Elimination of Dynamic Irregularities for GPU Computing Conference

Proceedings of the Sixteenth International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS XVI Association for Computing Machinery, Newport Beach, California, USA, 2011, ISBN: 9781450302661.

Abstract | Links | BibTeX | Tags: Cpu-gpu pipelining, Data transformation, GPGPU, Memory coalescing, Thread data remapping, Thread divergence

2010

Tian, Kai; Jiang, Yunlian; Zhang, Eddy Z; Shen, Xipeng

An Input-Centric Paradigm for Program Dynamic Optimizations Conference

Proceedings of the ACM International Conference on Object Oriented Programming Systems Languages and Applications, OOPSLA ’10 Association for Computing Machinery, Reno/Tahoe, Nevada, USA, 2010, ISBN: 9781450302036.

Abstract | Links | BibTeX | Tags: Dynamic optimizations, Dynamic version selection, Java virtual machine, Just-in-time compilation, Proactivity, Seminal behaviors

Zhang, Eddy Z; Jiang, Yunlian; Guo, Ziyu; Shen, Xipeng

Streamlining GPU Applications on the Fly: Thread Divergence Elimination through Runtime Thread-Data Remapping Conference

Proceedings of the 24th ACM International Conference on Supercomputing, ICS ’10 Association for Computing Machinery, Tsukuba, Ibaraki, Japan, 2010, ISBN: 9781450300186.

Abstract | Links | BibTeX | Tags: Cpu-gpu pipelining, Data transformation, GPGPU, Thread divergence, Thread-data remapping

Jiang, Yunlian; Zhang, Eddy Z; Tian, Kai; Mao, Feng; Gethers, Malcom; Shen, Xipeng; Gao, Yaoqing

Exploiting Statistical Correlations for Proactive Prediction of Program Behaviors Conference

Proceedings of the 8th Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO ’10 Association for Computing Machinery, Toronto, Ontario, Canada, 2010, ISBN: 9781605586359.

Abstract | Links | BibTeX | Tags: Correlation, Program behavior analysis

Zhang, Eddy Z; Jiang, Yunlian; Shen, Xipeng

Does Cache Sharing on Modern CMP Matter to the Performance of Contemporary Multithreaded Programs? Conference

Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP ’10 Association for Computing Machinery, Bangalore, India, 2010, ISBN: 9781605588773.

Abstract | Links | BibTeX | Tags: Chip multiprocessors, Parallel program optimizations, Shared cache, Thread scheduling

Casale, Giuliano; Zhang, Eddy Z; Smirni, Evgenia

KPC-Toolbox: Best Recipes for Automatic Trace Fitting Using Markovian Arrival Processes Journal Article

Perform. Eval., 67 (9), pp. 873–896, 2010, ISSN: 0166-5316.

Abstract | Links | BibTeX | Tags: Automatic fitting, MAP characterization, Markovian arrival process (MAP), Phase-type distribution, Temporal dependence, Time series modeling

2009

Liu, Yixun; Zhang, E Z; Shen, X

A cross-input adaptive framework for GPU program optimizations Conference

2009 IEEE International Symposium on Parallel Distributed Processing, 2009.

BibTeX | Tags:

Mao, Feng; Zhang, Eddy Z; Shen, Xipeng

Influence of Program Inputs on the Selection of Garbage Collectors Conference

Proceedings of the 2009 ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, VEE ’09 Association for Computing Machinery, Washington, DC, USA, 2009, ISBN: 9781605583754.

Abstract | Links | BibTeX | Tags: Cross-input program analysis, Input-specific selection, Minimum possible heap size, Profiling, Selection of garbage collectors

2008

Casale, G; Zhang, E Z; Smirni, E

KPC-Toolbox: Simple Yet Effective Trace Fitting Using Markovian Arrival Processes Conference

2008 Fifth International Conference on Quantitative Evaluation of Systems, 2008.

BibTeX | Tags: