Efficient Memory Management for Large Language Model Serving with PagedAttention
www.micahlerner.com
Blueprint: A Toolchain for Highly-Reconfigurable Microservice Applications
2023 and looking forward to 2024
Defcon: Preventing Overload with Graceful Feature Degradation
Towards an Adaptable Systems Architecture for Memory Tiering at Warehouse-Scale
TelaMalloc: Efficient On-Chip Memory Allocation for Production Machine Learning Accelerators
Perseus: A Fail-Slow Detection Framework for Cloud Storage Systems
Ambry: LinkedIn’s Scalable Geo-Distributed Object Store
Meta’s Next-generation Realtime Monitoring and Analytics Platform
Elastic Cloud Services: Scaling Snowflake’s Control Plane
CS Conferences in 2023
Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network
Design and Evaluation of IPFS: A Storage Layer for the Decentralized Web
SDN in the Stratosphere: Loon’s Aerospace Mesh Network
Seven years in the life of Hypergiants’ off-nets