This is a remote position.
Summary:We are looking for a highly skilled AI Engineer with experience in Conversational AI, Generative AI, Speech Processing, NLP, and Deep Learning. This role involves building AI-driven solutions for chatbots, voice-based interactions, deepfake generation, and enterprise automation.
If you have 60-70% of the skills mentioned below, we encourage you to apply! We value candidates with hands-on experience in some of these areas and a willingness to learn and grow into the rest.
Key Responsibilities:
- Design and develop Conversational AI systems, including chatbots, voice assistants, and virtual role-playing agents.
- Work with LLMs (GPT, Claude, Mistral, LLaMA, etc.), optimizing prompt engineering and Retrieval-Augmented Generation (RAG) for knowledge retrieval.
- Implement Speech-to-Text (STT) and Text-to-Speech (TTS) models for real-time voice interactions.
- Develop deepfake and synthetic media solutions, including face-swapping, voice cloning, motion transfer, and AI-powered reenactments.
- Optimize sentiment and tonality analysis to adjust AI responses based on user emotions and tone.
- Create natural language-to-SQL translation models for enterprise AI solutions that generate database queries.
- Work with vector databases (FAISS, Pinecone, Weaviate, ChromaDB) for AI-driven search and retrieval systems.
- Deploy AI models using cloud platforms (AWS, GCP, Azure) and optimize for real-time processing and scalability.
- Collaborate with designers, researchers, and business stakeholders to refine AI applications and enhance user engagement.
Requirements:
- Experience: 5+ years in AI/ML, Conversational AI, NLP, Speech Processing, or Deep Learning.
- Programming Skills: Strong expertise in Python and frameworks like PyTorch, TensorFlow, OpenAI APIs, Hugging Face Transformers.
- LLMs & Prompt Engineering: Hands-on experience with GPT, LLaMA, Claude, Mistral, and fine-tuning for domain-specific tasks.
- Speech & Audio Processing: Experience with Whisper, Tacotron, FastSpeech, WaveNet, or real-time speech synthesis models.
- Conversational AI: Familiarity with LangChain, Rasa, Dialogflow, and multi-modal AI (text, voice, video).
- Knowledge Retrieval: Experience in RAG-based AI workflows, vector databases, and retrieval-based AI architectures.
- SQL & Data Processing: Ability to build natural language-to-SQL models and work with PostgreSQL, MySQL, Snowflake, or BigQuery.
- Deepfake & Synthetic Media (Bonus): Experience with GANs, Autoencoders, Neural Radiance Fields (NeRFs), DeepFaceLab, StyleGAN.
- Deployment & Cloud: Hands-on experience with Flask, FastAPI, Docker, Kubernetes, and serverless AI architectures.
- Problem-Solving & Research Mindset: Ability to experiment, optimize models, and work in a fast-paced AI-driven environment.
Why Join Us?
- Work on cutting-edge AI projects across Conversational AI, Speech Processing, and Generative AI.
- Opportunity to explore deepfake generation, AI-driven role-playing, and multi-modal AI applications.
- Flexible work environment with opportunities to innovate and experiment with the latest AI/ML technologies.
- Collaborate with AI researchers, developers, and data scientists on breakthrough AI solutions.
Don't meet all the requirements? No worries! If you have at least 50-70% of the skills listed, we still encourage you to apply. We value potential, adaptability, and a learning mindset over a perfect skills match.