Skip to content
Exploring Machine Learning, AI, and Data Science

Jacob Leverich on Efficiency, Elegance, and the Joy of Not Grepping log files at 2AM

This week, Frank sat down with Dr. Jacob Leverich—Stanford PhD, cofounder of Observe, and a veteran of the Google MapReduce team and Splunk. Jacob’s journey, from tinkering with video game code as a kid, to innovating at the cutting edge of distributed systems and energy efficiency, is as inspiring as it is informative.

Key Takeaways

  • Early Tech Roots: Hear how curiosity with QBasic and classic PCs (think IBM PCXT and Commodore) put Jacob on a path to high-impact data engineering.
  • MapReduce, Dremel, & the Rise of Big Data: Jacob pulls back the curtain on working with some of the most influential data processing tools at Google and how these systems shifted the entire data landscape (hello, BigQuery!).
  • Building Efficient Systems: It’s not just about scale—energy efficiency and performance optimization are the unsung heroes of today’s data infrastructure. Jacob explains why making things “just work” isn’t enough anymore.
  • The Realities of Ops & Observability: Remember the days of grepping logs at 2AM? There’s a better way. Jacob shares how platforms like Observe help teams consolidate, visualize, and act on operational data—turning chaos into actionable insight.
  • Bridging Data & Ops: The lines between data observability and traditional ops are blurring, and Jacob’s unique experience shows how best practices from data warehousing are finally making ops smoother (and less sleepless).
  • Power Concerns & the Future: As data grows, so does energy consumption in data centers. Find out why optimization isn’t just good for performance—it’s key to sustainability.

Timestamps

00:00 Interview with Jacob Levrich

05:59 Journey into Game Programming

06:43 “Pursuing Fast Video Game Code”

10:23 Data Processing and Power Efficiency

16:11 Snowflake’s Transformative Database Approach

19:18 Journey to Data Management Industry

21:37 Data Products: Solving Core Challenges

27:07 Early Web Log Analysis Techniques

28:57 Consolidating Data for Efficiency

33:23 Specialized Tools and Context Switching

35:43 Unique Dual-Expertise in Tech

38:58 User-Centric Business Strategies

42:13 IP Data Analysis in Cloud

47:23 Electricity Transport Upsets Local Farms

48:25 Shift to Parallel Computing

52:10 Hardware Specialization & Software Optimization

57:32 “Stay Data Driven”

About the author, Frank

Frank La Vigne is a software engineer and UX geek who saw the light about Data Science at an internal Microsoft Data Science Summit in 2016. Now, he wants to share his passion for the Data Arts with the world.

He blogs regularly at FranksWorld.com and has a YouTube channel called Frank's World TV. (www.FranksWorld.TV). Frank has extensive experience in web and application development. He is also an expert in mobile and tablet engineering. You can find him on Twitter at @tableteer.