A career in IBM Consulting is rooted in long-term relationships and close collaboration with clients across the globe. You'll work with visionaries across multiple industries to improve the hybrid cloud and AI journey for the most innovative and valuable companies in the world. Your ability to accelerate impact and make meaningful change for your clients is enabled by our strategic partner ecosystem and our robust technology platforms across the IBM portfolio, including Software and Red Hat. Curiosity and a constant quest for knowledge serve as the foundation for success in IBM Consulting. In your role, you'll be encouraged to challenge the norm, investigate ideas outside of your role, and come up with creative solutions resulting in groundbreaking impact for a wide network of clients. Our culture of evolution and empathy centers on long-term career growth and development opportunities in an environment that embraces your unique skills and experience.
Role Overview:
We are seeking a Lead Data Architect to define and govern the end-to-end enterprise data architecture for a modern, scalable Cloudera-based Lakehouse platform. This role demands technical leadership across multiple domains, including CDC pipelines, Kafka streaming, real-time marts, MLOps/LLMOps frameworks, and data virtualization. You will act as the Design Authority, ensuring architectural consistency, reusability, and compliance across all initiatives.
Key Responsibilities:
- Define and own the target state data architecture aligned with business and regulatory needs.
- Serve as design authority across CDC ingestion, data lake zones (raw, curated, real-time), consumption layers, and AI enablement.
- Lead architecture governance reviews and ensure solution alignment with the enterprise reference architecture.
- Apply deep expertise in Cloudera CDP, Apache Iceberg, Hive, Impala, Kafka, NiFi, and Airflow.
- Drive adoption of open table formats (e.g., Iceberg) and cataloging strategies.
- Evaluate and integrate tools for data quality, observability, lineage, and security (e.g., Ranger).
- Collaborate with CDO, CIO, and business units to translate enterprise goals into data capabilities.
- Support platform scalability, performance optimization, cost modeling, and cloud readiness.
- Coach teams and create reusable patterns and blueprints.
- Guide ingestion architecture using Oracle GoldenGate, real-time log capture, schema evolution, and reconciliation.
- Design Kafka-based ingestion, stream processing, and event-driven microservices.
- Architect real-time serving layers for digital channel use cases (e.g., customer 360, fraud, personalization).
- Define architecture for operationalizing ML and GenAI models, including feature stores, CI/CD for models, and prompt/data governance.
- Drive adoption of Denodo or similar tools for agile data access and abstraction.
Qualifications:
- 20+ years in enterprise data architecture, including 5+ years in big data platforms and modern architectures.
- Hands-on experience with Cloudera CDP (Private or Public Cloud), Apache Iceberg, Kafka, GoldenGate, and Airflow.
- Proven ability to design architectures for real-time analytics, machine learning pipelines, and data virtualization layers.
- Strong understanding of data governance, security, metadata management, and regulatory compliance in BFSI.
- Experience in designing LLMOps pipelines using vector databases, embedding stores, and retrieval-augmented generation (RAG) frameworks is a strong plus.
- Excellent communication and stakeholder engagement skills; ability to present to both technical and executive audiences.