Hi my new friend!

书山有路勤为径，
学海无涯苦作舟。

Home
page

Scroll down

Welcome to Zongwu's Science Hub ✨

Residence:

Shanghai
Age:

18

Architecture 20

Algorithm 15

Newest Publications

DynamoLLM_ Designing LLM Inference Clusters for Performance and Energy Efficiency

26/02/13
07:11

Eigen Attention Attention in Low-Rank Space for KV Cache Compression

26/02/13
07:11

KEYDIFF_ Key Similarity-Based KV Cache Eviction for Long-Context LLM Inference in Resource-Constrained Environments

26/02/13
07:11

Get More with LESS_ Synthesizing Recurrence with KV Cache Compression for Efficient LLM Inference

26/02/13
07:11

Bounding Speculative Execution of Atomic Regions to a Single Retry

26/02/13
07:11

Concord_ Rethinking Distributed Coherence for Software Caches in Serverless Environments

26/02/13
07:11

Brain Transformers _ SNN-LLM

26/02/13
07:11

Challenges and Research Directions for Large Language Model Inference Hardware

26/02/13
07:11

Criticality-Aware Instruction-Centric Bandwidth Partitioning for Data Center Applications

26/02/13
07:11

A Hardware-Software Design Framework for SpMV Acceleration with Flexible Access Pattern Portfolio

26/02/13
07:11

1 2 345