IBM Research takes responsibility for technology and its role in society. Working in IBM Research means you'll join a team that invents what's next in computing, always choosing the big, urgent, and mind-bending work that endures and shapes generations. Our passion for discovery and our excitement for defining the future of tech build our strong culture around solving problems for clients and seeing the real-world impact that you can make.
IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.
The MIT-IBM Watson AI Lab is seeking outstanding candidates for a research engineer position to work on cutting-edge research in the field of Large Language Models (LLMs). Areas of research include, but are not limited to, developing new capabilities for LLMs, such as reliable uncertainty quantification, novel and efficient fine-tuning techniques, and multi-modality alignment. Job responsibilities span the entire cycle from research to product, including identifying and defining research problems, designing experimental setups to test hypotheses and improve solutions, and supporting deployment in products, in particular IBM’s Granite LLM series.
- Strong programming skills in one or more widely used languages such as Python or Java.
- Demonstrated experience in solving analytical problems using rigorous and quantitative approaches.
- Demonstrated experience in developing, training, and testing deep neural networks.
- A master’s or PhD degree in computer science, mathematics, statistics, or a related discipline.
- Experience with machine learning tools and frameworks such as PyTorch or TensorFlow.
- Demonstrated experience in developing, training, and testing generative models, such as LLMs or multi-modal foundation models.
- Ability to communicate research ideas clearly and effectively, as demonstrated by publications and presentations at top-tier AI conferences such as NeurIPS, ICLR, ICML, ACL, EMNLP, and CVPR.