System Engineers at IBM are the backbone of our strategic initiatives to design, code, test, and provide industry-leading solutions that make the world run today: planes and trains depart on time, bank transactions complete in the blink of an eye, and the world remains safe because of the work our System Engineers do.
A system administrator (sysadmin) is responsible for managing, maintaining, and ensuring the smooth operation of an organization's IT infrastructure. Their job primarily revolves around servers, networks, security, and system performance.
Your Role and Responsibilities
Who You Are...
You possess a solid technical foundation paired with a willingness to wear several hats. Your core focus is delivering highly available systems by installing, configuring, and maintaining Linux systems and the software services running on them.
Who You'll Work With...
You will work with a dynamic and independent team of engineers, alongside other functions such as Architecture, QA, Product Management, and Delivery, to design, develop, and manage advanced capabilities. We come to work thrilled knowing it will never be the same day twice.
In this role, you'll employ your engineering expertise and best practices to...
• Manage the critical IaaS platform, a fleet of systems that run backend workloads.
• Manage and allocate tools, accelerators, frameworks, templates and other assets related to system productivity and quality that enable service delivery.
• Foster an environment that enables teams to do things correctly, quickly, consistently and with less effort and learning.
• Administer and maintain business-critical infrastructure and foundational tools – OS, virtual machines, load balancers, DNS, Chef, Splunk, Grafana, and more.
• Identify and remediate security vulnerabilities and issues on time.
• Support development teams inside and outside the organization in making use of core infrastructure.
• Continually improve systems and processes with regard to automation and monitoring.
• Monitor server performance and troubleshoot issues.
5+ years of in-depth experience with Unix-based operating systems and system administration
5+ years of hands-on experience installing and decommissioning servers, operating virtualized environments, and performing backup and recovery
5+ years of hands-on experience with, and a solid understanding of, IP networking concepts
Intermediate to advanced scripting abilities – Ansible, Chef and Python skills preferred
Excellent oral and written communication
Understanding of root and intermediate CA policies, key lifetimes, PKI standards, and certificate renewal and installation
Vendor certifications: Red Hat, CompTIA
Experience in RabbitMQ
Experience in Redis and ZooKeeper
Experience in Memcached
Experience in InfluxDB
Experience in MongoDB
Experience in Jenkins