Let’s connect to drive impact, scale platforms, tackle engineering challenges.
2024- 2025
2020-2023
S - success | V - vision | I - innovation | T- Transformation | A- Administration
I work on large platform and infrastructure programs at scale, including production system launches and fleet-wide infrastructure modernization across organizations.
I started my career as an engineer working on distributed systems at Informix, building an agent-based messaging layer between Java services and C++ database servers. At the time, making these systems communicate reliably felt almost impossible. There were no established patterns, and I was working directly with formats, protocols, cross-language boundaries I had never encountered before. Getting even a simple request–response path working felt like a real breakthrough.
That experience reshaped how I think about technical problems. It taught me to approach systems from first principles, to stay comfortable with large unknowns, and to assume that constraints can often be pushed much further than they initially appear. In the process, I developed a deep understanding of how distributed systems actually behave beneath their abstractions—knowledge that has continued to guide my work as systems scale.
Since then, my work has focused on scaling production infrastructure and the organizations that build it. At Google, I helped launch Google Cloud’s first production region and led a multi-year, Google diskless ~$500M program to remove spinning disks from all production data centers, upgrading millions of machines across the fleet. I also scaled Census from a small API into Google’s default production observability system, enabling safe, low-overhead metrics for millions of jobs—work that later became OpenTelemetry. Along the way, I drove several enterprise launches, including Vault, Postini log search, and Audio Ads.
At VMware, I built and scaled execution capacity for the Tanzu platform by hiring and onboarding 25 TPMs across four countries while continuing to operate as an individual contributor. I partnered closely with the Tanzu Architecture Group and engineering teams to ship Kubernetes control-plane and platform capabilities across Tanzu Kubernetes Grid, Tanzu Mission Control, Tanzu Advanced, including regulated & federal environments. In parallel, I led a cross-company effort across five business units that replaced siloed ownership with shared problem-solving, enabling teams to cooperate on systemic execution issues rather than optimize locally.
More recently at Airbnb, I ran company-scale infrastructure programs, including the largest service-mesh migration in the company’s history, & led cross-company architecture efforts to reduce tribal knowledge & improve incident response
Knowledge
Strong understanding of program execution, middleware, & distributed systems I have built.
Complexity
Ran massively large horizontal programs Google wide scale: Launch of Google's first Cloud Region, and Diskless -- prod with no disk initiative spanning all product areas at Google.
Leadership
Geared towards creating customer-centric products. Led 20+ teams of size 7 to 40+ engineers across distributed offices to facilitation of organisation wide OKRs. Skilled in recruitment, empowerment, and creating a culture of appreciation.
Org Impact
Award-winning program leader with a track record of delivering cross-functional initiatives, including recognized contributions to VMware Tanzu and achieving substantial resource savings and productivity enhancements through Google Tech Infrastructure projects.
Products I built at my startup
Thought Leadership
May 15: The Art of Shifting Gears: What to do when team’s velocity slackens?
Apr 20: Epic Software Migrations & The Great Migration of Masai Mara
Mar 23: I am 99% done: On defining done and instituting code milestones
Mar 18: Keeping remote teams organised across 13 hr time zone differences
Cultural Dimension and AI
Healthcare: The last frontier that Artificial Intelligence (AI) has not conquered. Cultural factors significantly impact the way healthcare is accessed and delivered. Here is an article (pdf) from a study I was involved in that digs deeper into this topic.
DL Model For Covid Detection
This is an article I contributed to as we evaluate the diagnostic performance of a doctor-trained DL model (Svita_DL8) to screen for COVID-19 on CXR, and to compare the performance of the DL model with that of expert radiologists. Article on pubmed.