Micah Lerner
Hello! I'm Micah. This is my blog, where I publish my writing (mostly focused on Computer Science research).
I am an engineer interested in maps, space, and futuristic ideas. I currently work on the SRE team at Google. Previously, I helped build the Geospatial datasets powering Mapbox. Before that, I was an early employee at Strava, where I worked on the infrastructure team.
Recent writing
-
ServiceRouter: Hyperscale and Minimal Cost Service Mesh at Meta - March 28, 2024
-
A Cloud-Scale Characterization of Remote Procedure Calls - March 03, 2024
-
Gemini: Fast Failure Recovery in Distributed Training with In-Memory Checkpoints - January 30, 2024
-
XFaaS: Hyperscale and Low Cost Serverless Functions at Meta - January 23, 2024
-
Efficient Memory Management for Large Language Model Serving with PagedAttention - January 11, 2024
-
Blueprint: A Toolchain for Highly-Reconfigurable Microservice Applications - January 02, 2024
-
2023 and looking forward to 2024 - December 27, 2023
-
Defcon: Preventing Overload with Graceful Feature Degradation - July 23, 2023
-
Towards an Adaptable Systems Architecture for Memory Tiering at Warehouse-Scale - June 29, 2023
-
TelaMalloc: Efficient On-Chip Memory Allocation for Production Machine Learning Accelerators - June 06, 2023
-
Perseus: A Fail-Slow Detection Framework for Cloud Storage Systems - April 16, 2023
-
Ambry: LinkedIn’s Scalable Geo-Distributed Object Store - March 28, 2023
-
Meta’s Next-generation Realtime Monitoring and Analytics Platform - February 27, 2023
-
Elastic Cloud Services: Scaling Snowflake’s Control Plane - January 19, 2023
-
CS Conferences in 2023 - January 16, 2023
-
Jupiter Rising: A Decade of Clos Topologies and Centralized Control in Google’s Datacenter Network - December 13, 2022
-
Design and Evaluation of IPFS: A Storage Layer for the Decentralized Web - October 31, 2022
-
SDN in the Stratosphere: Loon’s Aerospace Mesh Network - October 08, 2022
-
Seven years in the life of Hypergiants' off-nets - September 03, 2022
-
Automatic Reliability Testing For Cluster Management Controllers - July 24, 2022
-
Metastable Failures in the Wild - July 11, 2022
-
Sundial: Fault-tolerant Clock Synchronization for Datacenters - July 03, 2022
-
Data-Parallel Actors: A Programming Model for Scalable Query Serving Systems - June 04, 2022
-
Druid: A Real-time Analytical Data Store - May 15, 2022
-
Monarch: Google’s Planet-Scale In-Memory Time Series Database - April 24, 2022
-
The Ties that un-Bind: Decoupling IP from web services and sockets for robust addressing agility at CDN-scale - January 13, 2022
-
Shard Manager: A Generic Shard Management Framework for Geo-distributed Applications - January 08, 2022
-
CS Conferences in 2022 - December 30, 2021
-
ghOSt: Fast & Flexible User-Space Delegation of Linux Scheduling - December 28, 2021
-
Kangaroo: Caching Billions of Tiny Objects on Flash - December 11, 2021
-
Faster and Cheaper Serverless Computing on Harvested Resources - November 30, 2021
-
Choosing papers to read and write about - November 28, 2021
-
Log-structured Protocols in Delos - November 23, 2021
-
The Demikernel Datapath OS Architecture for Microsecond-scale Datacenter Systems - November 09, 2021
-
Rudra: Finding Memory Safety Bugs in Rust at the Ecosystem Scale - October 31, 2021
-
RAMP-TAO: Layering Atomic Transactions on Facebook’s Online TAO Data Store - October 23, 2021
-
TAO: Facebook’s Distributed Data Store for the Social Graph - October 13, 2021
-
Scaling Large Production Clusters with Partitioned Synchronization - October 10, 2021
-
A Linux Kernel Implementation of the Homa Transport Protocol, Part II - August 29, 2021
-
Homa: A Receiver-Driven Low-Latency Transport Protocol Using Network Priorities, Part I - August 15, 2021
-
Systems Conferences 2021 - August 14, 2021
-
POSH: A Data-Aware Shell - August 07, 2021
-
PaSh: Light-touch Data-Parallel Shell Processing - July 31, 2021
-
From Laptop to Lambda: Outsourcing Everyday Jobs to Thousands of Transient Functional Containers - July 24, 2021
-
Unix Shell Programming: The Next 50 Years (The Future of the Shell, Part I) - July 14, 2021
-
Breakfast of Champions: Towards Zero-Copy Serialization with NIC Scatter-Gather - July 07, 2021
-
Ray: A Distributed Framework for Emerging AI Applications - June 27, 2021
-
Firecracker: Lightweight Virtualization for Serverless Applications - June 17, 2021
-
FoundationDB: A Distributed Unbundled Transactional Key Value Store - June 12, 2021
-
Scaling Memcache at Facebook - May 31, 2021
-
Reflecting on 2020 - May 23, 2021
-
Noria: dynamic, partially-stateful data-flow for high-performance web applications - March 28, 2021
-
Understanding Raft - Part 2 (Raft leaders, logs, and safety) - May 09, 2020
-
Understanding Raft Consensus - Part 1 - May 08, 2020
-
Understanding Google’s File System - March 22, 2020
-
2019 year in review & looking ahead in 2020 - March 01, 2020
-
A new year of learning and writing - January 12, 2019
Recent fragments
-
Drovorub and Fancy Bear - August 21, 2020
-
Thoughts on reviewing books - June 13, 2020
-
Getting started in Bug Bounty Hunting - December 17, 2019
-
Diving into Reverse Engineering - July 22, 2019
-
The Cybersecurity Rabbithole - June 02, 2019