A career in IBM Software means you’ll be part of a team that transforms our customer’s challenges into solutions.
Seeking new possibilities and always staying curious, we are a team dedicated to creating the world’s leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.
IBM’s product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.
In this role, you will build and maintain an observability stack for IBM’s Cloud Object Storage service using managed services as well as custom built services. This stack is used by Cloud Object Storage SREs and devs to understand the health of the service. Work duties and responsibilities include:
· Design, setup, configure and implement the COS Monitoring System using technologies such as Elasticsearch, Logstash, Kibana, Kafka, Kafka Mirrors, Filebeat, Grafana and Sysdig.
· Automate CICD tasks and infrastructure using Ansible, Terraform, Jenkins, and Travis.
· Experience with microservices and distributed application architecture, such as containers and Kubernetes.
· Experience with Linux administration and programming languages such as java, python and sql.
· Performance and configuration tuning to support the increasing load of data flowing into the COS Monitoring System.
· Provide design recommendations and thought leadership to provide best-in-class observability as part the COS Monitoring System.
· Provide 24x7 on-call customer support on a rotational basis.
· Design and develop dashboards for metrics analysis
· Design, Develop and Configure an alerting solution for an end-to-end incident management and recovery process by integrating Sysdig with Pagerduty, Email and Slack.
- Ability and tenacity to solve increasingly complex technical issues through analysis and a variety of problem-solving techniques.
· Working knowledge of Object-Oriented Python with demonstrable experience in applying these skills.
· Working knowledge of Linux environments.
· Experience working in an Agile-Scrum development environment.
· Experience using tools such as Jira, GitHub and Logging and monitoring tools
BS in CS, CE or similar field, plus 2 to 5 years relevant work experience.