At IBM, work is more than a job — it's a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you've never thought possible. Are you ready to lead in this new era of technology and solve some of the world's most challenging problems? If so, let's talk.
As a Site Reliability Engineer, you will:
· Collaborate closely with the broader SRE team to design and implement cloud deployments of the Maximo family of products on AWS.
· Support production customers with operational tasks such as upgrades, maintenance, and infrastructure troubleshooting, including planned weekend maintenance windows.
· Work daily with Product Owners, Architects, Release Managers, and the SRE Manager.
· Continuously seek improvement opportunities through learning and applying new technologies and innovative methods.
Skills and experience:
· Familiarity with SRE/DevOps principles.
· Experience managing applications on AWS.
· Strong understanding of cloud security practices.
· Solid foundation in networking concepts on AWS (e.g., VPCs, NLB/ALB, Content Delivery).
· Hands-on experience building CI/CD pipelines for large-scale applications.
· Proficiency in GitOps workflows.
· Knowledge of Auto Scaling on AWS.
· Experience migrating and supporting applications on Kubernetes.
· Proficiency with deployment automation tools such as Ansible.
· Experience with monitoring solutions on AWS.
· Strong analytical and problem-solving skills.
· Excellent communication skills, including direct engagement with customers.
· Fluency in English.
Preferred skills and experience:
· Experience with NoSQL databases (e.g., MongoDB).
· Experience with SQL databases (e.g., DB2, AWS RDS).
· Familiarity with AWS SES (Simple Email Service).
· Knowledge of AWS storage services (e.g., S3, EBS, EFS).
· AWS certifications (e.g., Solutions Architect Professional, DevOps Engineer Professional, SysOps Administrator).
· Experience with Maximo implementations.