code] [BibTeX ]
Paella: Low-latency Model Serving with Virtualized GPU Scheduling
Cowbird: Freeing CPUs to Compute by Offloading the Disaggregation of Memory
DONS: Fast and Affordable Discrete Event Network Simulation with Automatic Parallelization
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
Zhuohan Li *,
Lianmin Zheng *, Yinmin Zhong*,
Vincent Liu ,
Ying Sheng ,
Xin Jin , Yanping Huang,
Zhifeng Chen ,
Hao Zhang ,
Joseph E. Gonzalez , and
Ion Stoica OSDI, July 2023 [
PDF ] [
code ] [
BibTeX ]
Executing Microservice Applications on Serverless, Correctly
Templating Shuffles
Cebinae: Scalable In-network Fairness Augmentation
PrintQueue: Performance Diagnosis via Queue Measurement in the Data Plane
Optimizing Data-intensive Systems in Disaggregated Data Centers with TELEPORT
OrbWeaver: Using IDLE Cycles in Programmable Networks for Opportunistic Coordination
CompuCache: Remote Computable Caching using Spot VMs
Towards a Cost vs. Quality Sweet Spot for Monitoring Networks
MimicNet: Fast Performance Estimates for Data Center Networks with Machine Learning
Fault-tolerant and Transactional Stateful Serverless Workflows
Aragog: Scalable Runtime Verification of Shardable Networked Systems
Mantis: Reactive Programmable Switches
Scouts: Improving the Diagnosis Process Through Domain-customized Incident Routing
Jiaqi Gao ,
Nofel Yaseen , Robert MacDavid, Felipe Vieira Frujeri,
Vincent Liu , Ricardo Bianchini, Ramaswamy Aditya, Xiahoang Wang, Henry Lee, David Maltz,
Minlan Yu , and
Behnaz Arzani SIGCOMM, Aug 2020 [
PDF ] [
BibTeX ]
Understanding the Effect of Data Center Resource Disaggregation on Production DBMSs
tpprof: A Network Traffic Pattern Profiler
Rethinking Data Management Systems for Disaggregated Data Centers
TMC: Pay-as-you-Go Distributed Communication
Detecting Asymmetric Application-layer Denial-of-Service Attacks In-Flight with FineLame
Optimizing Declarative Graph Queries at Large Scale
Fast Network Simulation Through Approximation or: How Blind Men Should Describe Elephants
Synchronized Network Snapshots
High-Resolution Measurement of Data Center Microbursts
Predicting Startup Crowdfunding Success through Longitudinal Social Engagement Analysis
Canaries in the Network
RackCC: Rack-level Congestion Control
Subways: A Case for Redundant, Inexpensive Data Center Edge Links
Designing Distributed Systems Using Approximate Synchrony in Data Center Networks
Enabling Instantaneous Feedback with Full-duplex Backscatter
Ambient Backscatter: Wireless Communication Out of Thin Air
Expressive Privacy Control with Pseudonyms
F10: A Fault-Tolerant Engineered Network