Keywords (tags) and Publication List

Show all

2019

Hayes, Ari B; Hua, Fei; Huang, Jin; Chen, Yanhao; Zhang, Eddy Z

Decoding CUDA Binary Conference

Proceedings of the 2019 IEEE/ACM International Symposium on Code Generation and Optimization (CGO 2019), IEEE Press, Washington, DC, USA, 2019, ISBN: 9781728114361.

Abstract | BibTeX | Tags: Code generation, Code translation and transformation, CUDA, GPU, Instruction set architecture (ISA)

2018

Chen, Yanhao; Hayes, Ari B; Zhang, Chi; Salmon, Timothy; Zhang, Eddy Z

Locality-Aware Software Throttling for Sparse Matrix Operation on GPUs Conference

2018 USENIX Annual Technical Conference (USENIX ATC 18), USENIX Association, Boston, MA, 2018, ISBN: 978-1-939133-01-4.

Links | BibTeX | Tags: GPU, Program locality, Sparse matrix, Spmv

2017

Li, Lingda; Geda, Robel; Hayes, Ari B; Chen, Yanhao; Chaudhari, Pranav; Zhang, Eddy Z; Szegedy, Mario

A Simple Yet Effective Balanced Edge Partition Model for Parallel Computing Conference

Proceedings of the 2017 ACM SIGMETRICS / International Conference on Measurement and Modeling of Computer Systems (Sigmetrics 2017), SIGMETRICS ’17 Abstracts Association for Computing Machinery, Urbana-Champaign, Illinois, USA, 2017, ISBN: 9781450350327.

Abstract | Links | BibTeX | Tags: Data sharing, Edge-partition, GPU, Graph model, Program locality

Li, Lingda; Geda, Robel; Hayes, Ari B; Chen, Yanhao; Chaudhari, Pranav; Zhang, Eddy Z; Szegedy, Mario

A Simple Yet Effective Balanced Edge Partition Model for Parallel Computing Journal Article

SIGMETRICS Perform. Eval. Rev., 45 (1), pp. 6, 2017, ISSN: 0163-5999.

Abstract | Links | BibTeX | Tags: Data sharing, Edge-partition, GPU, Graph model, Program locality

2014

Egielski, Ian J; Huang, Jesse; Zhang, Eddy Z

Massive Atomics for Massive Parallelism on GPUs Conference

Proceedings of the 2014 International Symposium on Memory Management (ISMM 2014), Association for Computing Machinery, Edinburgh, United Kingdom, 2014, ISBN: 9781450329217.

Abstract | Links | BibTeX | Tags: Atomics, Concurrency, GPU, Parallelism

Hayes, Ari B; Zhang, Eddy Z

Unified On-Chip Memory Allocation for SIMT Architecture Conference

Proceedings of the 28th ACM International Conference on Supercomputing (ICS 2014), Association for Computing Machinery, Munich, Germany, 2014, ISBN: 9781450326421.

Abstract | Links | BibTeX | Tags: Compiler optimization, Concurrency, GPU, Register allocation, Shared memory allocation

2013

Shen, Xipeng; Liu, Yixun; Zhang, Eddy Z; Bhamidipati, Poornima

An Infrastructure for Tackling Input-Sensitivity of GPU Program Optimizations Journal Article

Int. J. Parallel Program., 41 (6), pp. 855–869, 2013, ISSN: 0885-7458.

Abstract | Links | BibTeX | Tags: Cross-input adaptation, CUDA, Empirical search, G-ADAPT, GPU, Program optimizations