Summary
The Lead Data Architect will design, build, and operate enterprise data platforms that power GenAI and AI/ML use cases. This highly technical, hands‑on role is responsible for data platform architecture, end‑to‑end data engineering, ML/LLM pipeline design, production model onboarding, and delivery of scalable Databricks‑centric solutions across cloud environments.
What You’ll Be Doing
Architect and implement enterprise data platforms (batch and streaming) optimized for ML, LLM, and GenAI workloads.
Lead the design and hands‑on implementation of Databricks workspaces, Unity Catalog, Delta Lake design patterns, and cluster policies, and own performance tuning.
Build and own end‑to‑end data pipelines (ingestion, transformation, feature engineering, serving) using PySpark, Databricks Jobs, Spark SQL, Delta Lake, and orchestration tools.
Design and operationalize pipelines for model training, LLM fine‑tuning, evaluation, deployment, and monitoring (MLOps/RAG...