In this role, you'll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world. Our delivery centers offer our clients locally based skills and technical expertise to drive innovation and adoption of new technology.
A data engineer with expertise in the AWS toolset advises on, develops, and maintains data engineering solutions in the AWS Cloud ecosystem. They design, build, and operate batch and real-time data pipelines using AWS services such as AWS EMR, AWS Glue, Glue Data Catalog, and Kinesis. Additionally, they create data layers on AWS Redshift, Aurora, and DynamoDB. The data engineer also migrates data using AWS DMS and is proficient with various AWS data platform components, including S3, Redshift, Redshift Spectrum, AWS Glue with Spark, AWS Glue with Python, Lambda functions with Python, AWS Glue Data Catalog, and AWS Glue DataBrew. They are experienced in developing batch and real-time data pipelines for data warehouses and data lakes using AWS Kinesis and Amazon Managed Streaming for Apache Kafka (MSK). They are also proficient in open-source technologies such as Apache Airflow and dbt, and in Spark with Python or Scala on the AWS platform. The data engineer schedules and manages data services on the AWS platform, ensuring seamless integration and operation of data engineering solutions.
AWS Proficiency:
- Familiarity with core AWS services, including S3, Lambda, Glue, EMR, EC2, IAM, Redshift, CloudFormation, and CodeBuild.
- Basic understanding of infrastructure as code with AWS CloudFormation.
- Knowledge of continuous integration practices using AWS CodeBuild.
Administrative Experience:
- Experience with the Linux platform.
- Understanding of data management and integration concepts, with experience in tools similar to Denodo (e.g., Talend, Informatica, or Apache NiFi).
- Basic experience in configuring and managing data sources and data models, with a willingness to learn Denodo.
- Experience installing and updating software on Linux servers.