ASPLOS |
Glign: Taming Misaligned Graph Traversals in Concurrent Graph Processing (2023)
|
ASPLOS |
CommonGraph: Graph Analytics on Evolving Data (2023)
|
ASPLOS |
PnP: Pruning and Prediction for Point-To-Point Iterative Graph Analytics (2019)
|
ASPLOS |
KickStarter: Fast and Accurate Computations on Streaming Graphs via Trimmed Approximations (2017)
|
ASPLOS |
CoRAL: Confined Recovery in Distributed Asynchronous Graph Processing (2017)
|
ASPLOS |
Efficient Sequential Consistency via Conflict Ordering (2012)
|
ASPLOS |
Frequent Value Locality and Value-Centric Data Cache Design (2000)
|
ASPLOS |
The Fuzzy Barrier: A Mechanism for High-Speed Synchronization of Processors (1989)
|
ISCA |
ECMon: Exposing Cache Events for Monitoring (2009)
|
ISCA |
Value Prediction in VLIW Machines (1999)
|
MICRO |
MEGA Evolving Graph Accelerator (2023)
|
MICRO |
JetStream: Graph Analytics on Streaming Data with Event-Driven Hardware Accelerator (2021)
|
MICRO |
GraphPulse: An Event-Driven Hardware Accelerator for Asynchronous Graph Processing (2020)
|
MICRO |
Efficient Warp Execution in Presence of Divergence with Collaborative Context Collection (2015)
|
MICRO |
Copy Or Discard Execution Model For Speculative Parallelization On Multicores (2008)
|
MICRO |
Efficient Use of Invisible Registers in Thumb Code (2005)
|
MICRO |
Whole Execution Traces (2004)
|
MICRO |
Energy Efficient Frequent Value Data Cache Design (2002)
|
MICRO |
Frequent Value Compression in Data Caches (2000)
|
MICRO |
Dynamic Memory Disambiguation in the Presence of Out-of-order Store Issuing (1999)
|
MICRO |
Resource-Sensitive Profile-Directed Data Flow Analysis for Code Optimization (1997)
|
MICRO |
A Shape Matching Approach for Scheduling Fine-Grained Parallelism (192)
|
MICRO |
Executing Loops on a Fine-Grained MIMD Architecture (1991)
|
MICRO |
A Fine-grained MIMD Architecture based upon Register Channels (1990)
|
HPCA |
SENSS: Security Enhancement to Symmeteric Shared Memory Multiprocessors (2005)
|
HPCA |
Global Context-based Value Prediction (1999)
|
HPCA |
Distributed Path Reservation Algorithms for Multiplexed All-Optical
Interconnection Networks (1997)
|
PACT
|
Scalable SIMD-Efficient Graph Processing on GPUs (2015)
|
PACT
|
Stadium Hashing: Scalable and Flexible Hashing on GPUs (2015)
|
PACT
|
Shuffling: A Framework for Lock Contention Aware Thread Scheduling for Multicore Multiprocessor Systems (2014)
|
PACT |
No More Backstabbing... A Faithful Scheduling Policy for Multithreaded Programs (2011)
|
PACT |
Efficient Sequential Consistency Using Conditional Fences (2010), Recipient of a PACT 2010 Best Paper Award
|
PACT |
Extended Whole Program Paths (2005)
|
PACT |
Caching and Predicting Branch Sequences for Improved Fetch Effectiveness (1999)
|
PACT |
Superscalar Execution with Direct Data Forwarding (1998)
|
PACT |
Capturing the Effects of Code Improving Transformations (1998)
|
PACT |
Path Profile Guided Partial Dead Code Elimination Using Predication (1997)
|
PACT |
Resource Spackling: A Framework for
Integrating Register Allocation in Local and Global Schedulers (1994)
|
PACT |
URSA: A Unified ReSource Allocator
for Registers and Functional Units in VLIW Architectures (1993)
|