Zachary Warren

Data Engineer

San Francisco, CA

About

Data engineer and full-stack developer with a background in building and managing data pipelines, cloud infrastructure, and web applications. Track record of driving technical innovation and operational efficiency across startups and established financial institutions. Works with a wide range of technologies, including Python, SQL, AWS, Databricks, and modern web development frameworks. Experienced in leading teams and delivering scalable, high-performance solutions that improve business outcomes.

Experience

Shadeform

Co-founder and Full-Stack Developer

Jun 2023 - Present

Founding member of a Y Combinator-backed startup focused on commoditizing the GPU market.
  • Led front-end development, web analytics, and cloud infrastructure management, driving the company's growth to its first $80k in monthly recurring revenue
  • Designed and implemented data pipelines using Cloud Functions, Cloud SQL, and Metabase to collect, process, and analyze data from PostHog and Google Analytics
  • Standardized Linux virtual machines across multiple cloud providers, creating a consistent CUDA and machine learning environment
  • Developed a user-friendly front-end platform for hundreds of users, implementing a cloud console-inspired interface using React, JavaScript, and RTK Query

RetailStat

Lead Data Engineer

Apr 2020 - Jun 2023

Built and led a team to derive insights for retailers from geofenced mobile data, estimating foot traffic and sales for over 200,000 locations.
  • Led migration of petabyte-scale data pipelines, reducing monthly operating costs by 75% through optimizations
  • Reduced daily pipeline processing times from 15 hours to 2 hours, enabling same-day fulfillment of bulk client requests
  • Enabled near-instant turnaround for ad-hoc geospatial joins by partitioning data on census block groups
  • Built and managed a high-performance geospatial data lake, processing 1-2 TB of new data daily
  • Managed the organization's data environment, AWS account, Databricks workspace, and Vertica cluster

Capital One Financial Corp.

Data Analyst & Senior Data Analyst

Aug 2018 - Apr 2020

  • Led the migration of the data environment from an on-prem Teradata deployment to AWS/Snowflake, coordinating with a large team
  • Implemented ETL processes using Apache Airflow and designed Python packages simplifying PySpark usage for analysts
  • Identified critical data quality issues and proposed an innovative data-mart solution with a customized data model
  • Secured leadership buy-in and led MVP development, significantly reducing complexity for business analysts

Education

Vanderbilt University

B.S. in Physics, Mathematics, and Computer Science

Class of 2018

Skills

Languages

Databases

Tools

Platforms