Mission:
The Infrastructure AI Center of Excellence (COE) is a dedicated team of technical professionals focused on leveraging AI to empower IBM Infrastructure employees to be more effective in their jobs or to improve customers' experience when they interact with IBM. Together with the AI Guild, we serve as a vehicle for knowledge sharing, collaboration, and skill development related to AI.
Our core objectives are VELOCITY and SAFETY. We want to help roll out AI projects as quickly as possible while ensuring that they are delivered safely and within IBM's guardrails for AI usage.
Your Role & Responsibilities:
Looking to make a significant impact? This is your chance to become a key part of a dynamic team of talented professionals, leading the development and deployment of innovative, industry-leading, cloud-based AI services.
We are seeking an experienced AI & Cloud Software Engineer to join us. This role involves designing, developing, and deploying AI-based services. You will be instrumental in problem-solving, automating a wide range of tasks, and working with other teams to solve complex problems.
Responsibilities:
- Develop AI capabilities in IBM Cloud-based applications.
- Design solutions and be an avid, hands-on coder who is involved in the code at the deepest level.
- Work in an agile environment of continuous delivery.
- You’ll have access to all the technical training courses you need to become the expert you want to be.
- Define all aspects of development, from appropriate technology and workflow to coding standards.
- Collaborate with other professionals to determine functional and non-functional requirements.
- Participate in technical reviews of requirements, specifications, designs, code, and other artifacts.
- Learn new skills and adopt new practices readily in order to develop innovative, cutting-edge software products that maintain the company's technical leadership position.
Required Expertise:
Full Stack & AI/ML: 7–12 years' experience with AI/ML tools (scikit-learn, TensorFlow, PyTorch, LLMs), model deployment, and full-stack development.
Backend & APIs: Strong in Java, Python, Node.js, REST APIs, Kafka, and databases such as Cassandra and PostgreSQL.
Cloud & DevOps: Expertise in IBM Cloud/AWS/Azure, Kubernetes, Docker, microservices, CI/CD, and SRE practices.
Web & Architecture: Proficient in web technologies (HTTP, JSON, HTML, JS) and modern cloud/microservices architecture with API design skills.
Preferred Expertise:
Messaging & OS: Experience with Kafka, RabbitMQ, and Linux environments (Red Hat, Ubuntu).
Networking & Tools: Knowledge of TCP/IP and HTTP protocols, GitHub, and Maven/Gradle.
SaaS & CI/CD: Background in SaaS apps, CI/CD pipelines, and agile development cycles.
Testing & Automation: Familiarity with UI test tools like Selenium or Puppeteer.
Mindset: Ownership, adaptability, global collaboration, and eagerness to solve complex problems with new tech.