Data Engineer & Solutions Architect

Turning Raw Noise into
Scalable Intelligence.

Data Engineer with 5+ years of experience building and optimizing data pipelines across enterprise environments.

5+Years Experience
PB+Data Architected
40%Efficiency Gains

About

Architecting the future of data platforms.

I specialize in translating complex business problems into scalable architectures leveraging Python, cloud services, and modern data platforms. My background spans Solutions Architecture, Data Science, and Analytics Engineering, with a proven track record in marketing analytics, AI implementation, and architectural governance.

Cloud-native data architectures (Snowflake, AWS, Azure)
AI-enabled FinOps & automation solutions
Scalable ETL/ELT pipeline design

Technical Arsenal

Built for scale, speed, and reliability.

Languages

PythonSQLScalaRust

Data Stack

SnowflakeDatabricksdbtAirflowSparkKafka

Cloud & Systems

AWS (S3, IAM, Athena, SageMaker)AzureDelta LakeData Modeling

Professional Journey

Delivering measurable impact at enterprise scale.

Snowflake

Solutions Architect

2024-10Present

  • Built multi-threaded Snowflake functions for PGP decryption and GZIP decompression leveraging Snowpark Python [Optimized Ingestion]
  • Guided large-scale Java/Oracle Exadata migrations to Snowflake for major enterprise clients [Enterprise Migration]
  • Developed internal Cursor IDE training and dbt platform migration frameworks for Professional Services leadership [Architectural Governance]
SnowflakeSnowparkPythonJavaOracledbt

IQVIA

Staff Architect

2023-082024-10

  • Reengineered legacy Scala/Hadoop backend into efficient Snowpark Python processes [40% Runtime Reduction]
  • Architected a new data model from source systems with strict typing and 80+ pages of specification documentation [Data Quality]
  • Served as technical lead across multiple systems, ensuring alignment with enterprise architectural guardrails [Technical Leadership]
ScalaHadoopHiveSnowflakeSnowparkPython

Ameriprise Financial

Data Science Manager

2023-022023-08

  • Optimized SQL and Python queries in Dataiku pipelines, significantly reducing scanned data volume in Athena [75% Reduction in Data Scanned]
  • Automated ML scoring and reporting pipelines leveraging AWS SageMaker, S3, and Athena for continuous model feedback [Continuous ML]
PythonSQLDataikuAWS SageMakerAWS AthenaS3

PPL Corporation

Data Scientist

2022-062023-01

  • Led a team of three to design a Python-based statistical clustering solution for critical infrastructure optimization [Statistical Testing]
  • Standardized engineering practices including Cookiecutter, pre-commit hooks, and unit testing across the DS team [Engineering Excellence]
PythonDatabricksAzureGitUnit Testing

Capgemini

Data Science Consultant

2022-012022-06

  • Optimized multi-table joins across billions of rows by analyzing Spark execution plans and Delta Lake commands [Big Data Optimization]
  • Deployed an NLP-based Streamlit dashboard via Docker, automating analysis of large-scale employee feedback datasets [Process Automation]
SparkDatabricksDelta LakePythonDockerStreamlit

Terminix

Data Scientist

2020-072022-01

  • Discovered $30M in overstated revenue through deep SQL audit, saving $2.3M in annual ad spend [$2.3M Annual Savings]
  • Built time-series forecasting models using Facebook's Prophet to guide digital strategy and indexed demand data [Strategic Analytics]
SQLSSMSPythonProphetPowerBI

Selected Engineering

Solving complex data problems with elegant architecture.

AI-Enabled FinOps System

Data Platform

Designed and led the architecture for an enterprise-wide cloud cost visibility platform at Snowflake. Integrated human-in-the-loop workflows and governance controls to drive better decision-making.

Improved cost visibility and decision-making for leadership
Enterprise-wide
SnowflakeSnowparkPythonAI/ML

Saturdata Podcast

Education

Creator and host of Saturdata, a podcast and platform focused on data engineering and analytics lessons for newcomers to the field.

Content CreationCommunity Building

Full Stack Data Workshop

Education

Designed and taught RTP’s only Learn & Build-style data workshops, covering end-to-end Data Engineering and Analytics curriculum.

Taught 20+ newcomers foundational data skills
Local Community
PythonSQLData ModelingETL

Let's Build Something Scalable

I'm always interested in hearing about new opportunities and challenging data problems.

© 2026 Samuel LaFell. Data Engineer & Solutions Architect.