Anirban Banerjee
Data/AI Engineer · Data Architect
click anywhere to continue →

Data Architect · AI Engineer

AnirbanBanerjee

Technical ArchitectData EngineerAI Engineer

Building Intelligent, Scalable, and Governed Data Platforms.

Technical Architect & Data Engineer building large-scale data platforms across Retail and Healthcare. Specializing in cloud migrations, lakehouse architecture, cost optimization, and Generative AI on Azure, AWS, and GCP.

Redmond, WA15+ Years ExperienceAzure · AWS · GCP17 Certifications

Core Platforms & AI

Azure
Databricks
AWS
GCP
OpenAI
Anthropic

Professional Journey

Deloitte

Data Architect (Office of CIO)·LTM

July 2025 – Present

  • Designing future state architecture for data ingestion, consumption, visualization, AI/ML integration, and data marketplace across Deloitte's US and global entities.
  • Architecting AI Agent integration patterns with existing datahub products (MDM, Databricks, SAP).
  • Designing ABAC and centralized access policy frameworks governing access across on-prem and cloud applications.

Premera Blue Cross

Technical Architect / Data Engineer·LTM

Sept 2024 – June 2025

  • Architected and deployed Unity Catalog for DataHub, defining governance structures and access policies.
  • Led migration from Boomi to Databricks with a reusable migration framework, cutting transition timelines significantly.
  • Designed lakehouse architectures that reduced compute and storage costs while improving performance.
  • Integrated ServiceNow and Dynatrace with DataHub for real-time error monitoring.

UHG / Optum

Technical Architect / Data Engineer·LTM

July 2022 – Aug 2024

  • Architected a GenAI chatbot using Azure OpenAI, Bot Framework, and vector databases for contextual NLP.
  • Redesigned legacy ETL pipelines with a rules engine framework, improving processing efficiency by 40%.
  • Migrated Ab Initio/Netezza ETL systems to Azure — achieved 76% reduction in operational costs.
  • Reduced Databricks costs by 37% and Azure storage by 42% through best-practice optimizations.

CVS Health

Data Engineer, Tech Lead·Tata Consultancy Services

Mar 2022 – June 2022

  • Engineered CMS regulatory compliance automation within GCP.
  • Designed and implemented medallion architecture on GCP using DataProc and BigQuery.
  • Integrated and transformed diverse public datasets into enriched, actionable data assets.

Tata Digital

Data Engineer, Tech Lead·Tata Consultancy Services

May 2020 – Mar 2022

  • Designed automated Azure sandbox architecture that reduced environment setup from days to minutes.
  • Built web solutions with Django and Azure SDK; standardized CI/CD pipelines.
  • Productionized ML models via automated MLOps frameworks for monitoring, retraining, and deployment.

BJ's Wholesale Club

Data Engineer·Tata Consultancy Services

Jul 2019 – May 2020

  • Upgraded Python/Spark environments on AWS EMR, improving cluster performance and stability.
  • Led codebase modernization and performance tuning, achieving significant improvements in job execution speed.

NBrown / JD Williams

Support Lead·Tata Consultancy Services

Jul 2017 – Jul 2019

  • Served as support lead for internal credit maintenance applications, overseeing incident resolution.
  • Implemented proactive monitoring and streamlined support workflows.

Kingfisher PLC

Application Lead·Tata Consultancy Services

Apr 2012 – Jul 2017

  • Led development and support for enterprise applications built with Oracle PL/SQL and Oracle Forms.
  • Managed full application lifecycle from requirements through deployment and incident management.

Tech Stack

Programming

PythonSparkPysparkSQL

Generative AI

Azure OpenAIOpenAILangChainLlamaIndexHuggingFaceChromaFAISSLangflowIBM WatsonXAzure AI FoundryDocument Intelligence

Azure

Databricks (ADB)Data Factory (ADF)FabricAI FoundryDevOpsWeb AppKey VaultStorage (ADLS)Event HubContainerMonitorLog AnalyticsLogic App

AWS

EMREC2GlueS3

GCP

BigQueryDataProcCloud Storage

Database & DataLakes

Delta LakeCosmos DBMongoDBSQL ServerOracleTeradataDremio

DevOps & Source Control

GitHubGitLabBitbucketAzure ReposAzure DevOpsBicepGit ActionsDockerKubernetes

ITSM & Collaboration

ServiceNowJiraRallyConfluenceSharePointAzure Boards (ADO)

Visualization

Power BITableau

Competitions

WatsonX Hackathon

IBM

May 2024

Built a Generative AI solution on IBM WatsonX; considered for finalist spot at IBM TechXchange Conference 2024.

AI Devs Competition

Langflow

Sept 2024

Advanced AI development competition focused on building innovative solutions with Langflow.

Certifications

Microsoft10
AB-731

Microsoft Certified: AI Transformation Leader

AI-102

Azure AI Engineer Associate

DP-420

Azure Cosmos DB Developer Specialty

AZ-305

Designing Microsoft Azure Infrastructure Solutions

AZ-104

Microsoft Azure Administrator

DP-203

Data Engineering on Microsoft Azure

AZ-204

Developing Solutions for Microsoft Azure

DP-900

Microsoft Azure Data Fundamentals

AI-900

Microsoft Azure AI Fundamentals

AZ-900

Microsoft Azure Fundamentals

Databricks5

Databricks Certified Generative AI Engineer Associate

Databricks Academy Accreditation: Generative AI Fundamentals

Databricks Certified Data Engineer Associate

Databricks Certified Associate Developer for Apache Spark 3.0

Databricks Lakehouse Fundamentals

Dell1

Dell GenAI Foundations

HackerRank1

Hacker Rank REST API Certification

Get in touch

Let’s build something
together.

Open to senior architect and data engineering roles, consulting engagements, and technical collaborations. My inbox is always open.