Zachary Warren
About
Seasoned data engineer and full-stack developer with a robust background in developing and managing data pipelines, cloud infrastructure, and web applications. Demonstrates a track record of driving technical innovation and operational efficiency across startups and established financial institutions. Expertise spans a wide array of technologies and tools, including Python, SQL, AWS, Databricks, and modern web development frameworks. Adept at leading teams and delivering scalable, high-performance solutions that significantly enhance business outcomes.
Experience
Shadeform
Co-founder and Full-Stack Developer
Jun 2023 - Present
- Led front-end development, web analytics, and cloud infrastructure management, driving the company's growth to its first $80k in monthly recurring revenue
- Designed and implemented data pipelines using Cloud Functions, Cloud SQL, and Metabase to collect, process, and analyze data from PostHog and Google Analytics
- Standardized Linux virtual machines across multiple cloud providers, creating a consistent CUDA and machine learning environment
- Developed a user-friendly front-end platform for hundreds of users, implementing a cloud console-inspired interface using React, JavaScript, and RTK Query
RetailStat
Lead Data Engineer
Apr 2020 - Jun 2023
- Led migration of petabyte-scale data pipelines, reducing monthly operating costs by 75% through optimizations
- Reduced daily pipeline processing times from 15 hours to 2 hours, enabling same-day fulfillment of bulk client requests
- Enabled near-instant turnaround for ad-hoc geospatial joins by creatively using census blockgroups to partition data
- Built and managed a high-performance geospatial data lake, processing 1-2TB of new data daily
- Managed the organization's data environment, AWS account, Databricks workspace, and Vertica cluster
Capital One Financial Corp.
Data Analyst & Senior Data Analyst
Aug 2018 - Apr 2020
- Led the migration of data environment from an on-prem Teradata deployment to AWS/Snowflake, coordinating with a large team
- Implemented ETL processes using Apache Airflow and designed Python packages simplifying PySpark usage for analysts
- Identified critical data quality issues and proposed an innovative data-mart solution with a customized data model
- Orchestrated leadership buy-in and led MVP development, resulting in significant reduction of complexity for business analysts