Job Description
We are looking for a GPU Infrastructure Engineer to build and manage high-performance AI infrastructure, optimize model deployment, and streamline model inferencing at scale. You will work across cloud and on-prem GPU clusters, manage model CI/CD pipelines, and optimize AI workloads for efficiency and performance. This role requires expertise in GPU-accelerated computing, cloud infrastructure, offline deployments, and monitoring AI workloads.
Sarvam AI is on a mission to lead transformative AI research that will significantly improve the robustness, performance, and cost-effectiveness of GenAI app development, deployment, and distribution in India.