Job Description
We are looking for an experienced Machine Learning Engineer specializing in model inference and optimization to join our team. This role focuses on improving the efficiency and scalability of LLMs in production, including model deployment, quantization, and inference acceleration. The ideal candidate will have 2-3 years of experience working with ML frameworks such as PyTorch or TensorFlow, a deep understanding of neural network architectures, and a strong interest in LLM inference optimization.
Sarvam AI is on a mission to lead transformative AI research that will significantly improve the robustness, performance, and cost-effectiveness of GenAI app development, deployment, and distribution in India.