As a Cloud Data Engineer, you will play a key role in transforming and optimizing our software systems on AWS. You will work closely with software engineering, DevOps, and data science teams to ensure scalable, maintainable, and high-performing infrastructure is in place to support advanced analytics and machine learning.
In this role, you'll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world. Our delivery centers offer our clients locally based skills and technical expertise to drive innovation and adoption of new technology.
We are seeking a highly skilled Cloud Data Engineer with strong expertise in Python programming and AWS cloud architecture. The successful candidate will lead performance optimizations, technical debt cleanup, and scalable system design across distributed cloud environments.
Key responsibilities include:
- Refactoring and maintaining clean, efficient codebases by addressing technical debt and eliminating redundant code.
- Designing, deploying, and managing scalable and high-performance solutions on AWS, using services like EC2, S3, Lambda, and optionally OpenSearch.
- Optimizing machine learning model execution through parallel computing strategies and runtime improvements.
- Troubleshooting and resolving complex issues in distributed and cloud-based environments.
- Collaborating with cross-functional teams to integrate CI/CD practices and support agile development workflows.
- Ensuring all systems meet security, compliance, and performance best practices.
Proficiency in Python and software engineering best practices
Extensive hands-on experience with AWS, including EC2, S3, Lambda, and IAM
Proven expertise in cloud-based architecture and deployment strategies
Experience with codebase refactoring, technical debt management, and system performance optimization
Understanding of parallel computing concepts for ML scaling, and strong problem-solving skills, particularly in distributed systems
Experience with CI/CD tools and pipelines in agile environments
Experience with AWS OpenSearch for indexing, querying, and analyzing large datasets
Familiarity with containerization technologies (Docker, Kubernetes)
Exposure to machine learning model deployment workflows
Understanding of cloud security and compliance principles
Bachelor’s or advanced degree in Computer Science, Engineering, or a related field