About the Role
We are looking for an AI Engineer to build next-generation Conversational AI systems that combine Speech AI, LLMs, and Agentic AI in real-world applications.
This role sits at the intersection of research, system design, and production. You will work on voice-first AI systems that can listen, understand, reason, act, and respond naturally with low latency at scale.
What You'll Do
Speech AI Development
- Build and optimize ASR, TTS, and speech-to-speech systems for Vietnamese
- Work with modern foundation models (Whisper, Qwen, XTTS, etc.)
- Improve key metrics: accuracy, naturalness, latency, speaker quality
- Develop real-time speech applications (voice assistants, conversational systems)
LLM & Agentic AI
- Design and deploy AI Agents (single & multi-agent)
- Build LLM pipelines with RAG, tool calling, memory, and orchestration frameworks
- Combine speech + LLM + tools into end-to-end intelligent systems
- Enable capabilities like reasoning, planning, and autonomous actions
Model Training & Optimization
- Develop pipelines for:
  - Pretraining, fine-tuning (SFT), instruction tuning
  - Domain adaptation for Vietnamese
- Apply advanced techniques:
  - RLHF, DPO, PPO
  - Distillation, quantization, pruning
- Build and improve large-scale speech datasets
System Design & Production
- Architect and deploy end-to-end AI systems from research to production
- Work closely with product & engineering teams to deliver real features
- Optimize for latency, reliability, and scalability
- Build and maintain ML pipelines (training → inference → monitoring)
What We're Looking For
Core Requirements
- 5+ years in Software Engineering
- 2+ years in AI (ML / DL / GenAI)
- Strong Python and experience building production systems
Technical Strength
- Solid foundation in:
  - Deep Learning, NLP, Speech Processing
  - Transformers & Generative AI
- Comfortable with system design: APIs, microservices, distributed systems
Speech AI or Agentic AI (at least one)
- Speech AI: ASR, TTS, or speech systems; understanding of acoustic models and vocoders
- Agentic AI / LLM: AI Agents, RAG, conversational AI; experience with LangChain, LlamaIndex, etc.
Infrastructure
- Experience with:
  - Docker, Kubernetes, cloud platforms
  - ML pipelines / MLOps tools
Nice to Have
- Experience with Vietnamese ASR/TTS production systems
- Familiarity with real-time / streaming AI
- Experience in multi-agent systems or large-scale deployments
- Publications in top AI conferences
Offices:
Hà Nội: Technopark Building, Gia Lâm, Hà Nội
Ho Chi Minh City: Vincom Đồng Khởi, District 1, Ho Chi Minh City
Contact: Zalo 0769288088 | Email: [Confidential Information]