As a DevOps Engineer, you will be a core member of the product engineering team, responsible for building and maintaining a reliable, automated, and scalable delivery infrastructure for our software products. You will design, implement, and optimize CI/CD pipelines, ensure observability and high availability of environments, and enable engineering teams to deliver software rapidly and safely.
You will work closely with developers, QA engineers, and product managers in an agile, collaborative environment to drive operational excellence, continuous improvement, and a culture of automation and reliability.
Key Responsibilities
- Design, implement, and maintain scalable and secure CI/CD pipelines that support development, testing, staging, and production environments.
- Work closely with software engineering teams to enable seamless integration, continuous delivery, and faster release cycles.
- Automate build, deployment, and configuration management using industry-standard tools and frameworks (e.g., Jenkins, GitHub Actions, ArgoCD, Helm, Terraform).
- Ensure environment consistency and reliability through infrastructure-as-code (IaC) practices and containerized deployments.
- Monitor and optimize system performance, availability, and scalability across distributed environments.
- Develop and implement tools and frameworks for automated deployment, testing, observability, and incident response.
- Collaborate with developers to embed defect prevention, security, and compliance into the delivery pipeline.
- Enable A/B testing, canary releases, and blue-green deployments for safe and controlled rollouts.
- Partner with QA and development teams to integrate automated tests into the CI/CD pipeline.
- Contribute to incident management and post-mortem analysis, driving continuous improvement in resilience and operational efficiency.
- Maintain monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues.
Required Skills and Experience
- Strong hands-on experience with CI/CD tools such as Jenkins, GitHub Actions, GitLab CI, or ArgoCD.
- Expertise in containerization and orchestration using Docker and Kubernetes.
- Experience with infrastructure as code and configuration management using Terraform, Ansible, or Helm.
- Proficiency in Linux system administration, scripting (Bash, Python, or Go), and environment management.
- Familiarity with build automation tools such as Maven, Gradle, or Ant.
- Experience with cloud platforms (AWS, Azure, or IBM Cloud) and understanding of cloud-native architectures.
- Hands-on experience with monitoring and observability stacks (Prometheus, Grafana, ELK/EFK, OpenTelemetry).
- Strong understanding of networking fundamentals, load balancing, and security best practices.
- Experience with Git and source control hosting platforms (GitHub, Bitbucket).
- Proven ability to troubleshoot performance and deployment issues in complex distributed systems.
Preferred Skills
- Experience with service mesh frameworks (Istio, Linkerd) and modern deployment strategies (e.g., canary, blue-green).
- Exposure to SRE principles such as SLIs, SLOs, error budgets, and reliability metrics.
- Knowledge of chaos engineering or resilience testing tools.
- Familiarity with cost optimization and resource scaling in cloud environments.
- Prior experience in product-oriented DevOps environments, supporting large-scale SaaS or platform engineering teams.