Senior Network Site Reliability Engineer (SRE), IBM Corporation, Armonk, NY and various unanticipated client sites throughout the US (Up to 40% telecommuting permitted)
- Develop and maintain automation scripts and tools to streamline network operations tasks, such as configuration management, provisioning, and deployment.
- Implement monitoring and alerting systems to detect and respond to network anomalies automatically.
- Continuously improve operational efficiency by automating repetitive tasks and workflows.
- Participate in the design, deployment, and maintenance of network infrastructure components, such as edge nodes, compute nodes, edge routers, and top-of-rack (ToR) switches.
- Ensure compliance with industry standards and best practices for network security, reliability, and performance.
- Monitor the network infrastructure, including servers, switches, routers, and other networking equipment, for performance, availability, and security.
- Respond to alerts and incidents in a timely manner, troubleshoot issues, and implement solutions to restore service and minimize downtime.
- Collaborate with cross-functional teams, such as Software Engineers, Network Architects, and Security Specialists, to resolve complex issues.
- Conduct post-incident analysis and root cause investigations to identify the underlying causes of network incidents and outages.
- Document incident findings, lessons learned, and recommendations for process improvements to prevent recurrence.
- Collaborate with cross-functional teams to implement corrective actions and preventive measures based on post-incident recommendations.
- Maintain documentation of network configurations, architectures, procedures, and troubleshooting guides.
- Share knowledge and best practices with team members through training sessions, documentation updates, and peer reviews.
- Contribute to the development of internal knowledge bases, wikis, and documentation.
- Utilize: Routing and switching, Network Monitoring and Observability, Cloud Infrastructure, Network Troubleshooting, Network Security, Cloud Networking Services.
Required: Master’s degree or equivalent in Computer Science, Information Systems, Engineering or related (employer will accept a Bachelor’s degree plus five (5) years of progressive experience in lieu of a Master’s degree) and one (1) year of experience as a Senior Network Engineer or related. One (1) year of experience must include utilizing Routing and switching, Network Monitoring and Observability, Cloud Infrastructure, Network Troubleshooting, Network Security, Cloud Networking Services. $185,964 to $235,000 per year. Full time. SN133.