About Cake By VPBank
We're building the future of banking by combining cutting-edge AI technology with seamless user experiences. Our mission is to make financial services accessible, simple, and delightful for millions of Vietnamese customers.
Why This Role Matters
As a Senior AI Engineer, you will shape how millions of customers interact with our banking services. Your work on AI-powered callbots, voice assistants, and intelligent agentic automation will directly impact customer satisfaction and operational efficiency. You'll be at the forefront of bringing state-of-the-art GenAI innovation to Vietnam's financial sector.
What You'll Do
Build State-of-the-Art GenAI & LLM Solutions
- Design and deploy LLM applications using latest open-weight models (DeepSee, Qwen3,..) and commercial APIs (Gemini, Claude,..)
- Implement fine-tuning techniques: LoRA/QLoRA, RLHF/RLAIF, GRPO for domain-specific optimization
- Build production RAG pipelines with vector databases (Qdrant, Chroma,..) and hybrid retrieval strategies
- Optimize inference using high-performance frameworks: vLLM, SGLang, LMDeploy,...for throughput and latency
- Implement structured output generation, reasoning models, and test-time compute scaling
Create Production-Grade Agentic AI Systems
- Design multi-agent architectures using LangGraph, CrewAI or OpenAI Agents SDK
- Implement MCP for standardized tool integration and inter-agent communication
- Build autonomous agents with planning, reasoning, memory management, and tool orchestration
- Develop evaluation pipelines for agent reliability, safety guardrails, and failure-mode detection
- Create workflow automation combining agents with RAG, function calling, and external API integration
What You'll Bring
Must Have
- With 3+ years working on LLMs, NLP, or GenAI applications
- Strong Python skills and deep experience with PyTorch and Hugging Face Transformers
- Hands-on experience with LLM fine-tuning (LoRA/QLoRA, PEFT) and inference optimization
- Production experience with RAG systems: embeddings, vector databases, retrieval strategies
- Familiarity with LLM serving frameworks (vLLM, SGLang) and cloud platforms (GCP)
- Experience with agentic frameworks
Nice to Have
- Experience with voice AI: ASR, TTS, speech to speech
- Knowledge of MCP protocol and multi-agent orchestration frameworks
- Experience with reasoning models and test-time compute scaling
- Hands-on with model quantization (GPTQ, AWQ, GGUF) and efficient inference
- Experience with Vietnamese NLP, speech processing, or multilingual models
- Background in banking/fintech AI applications or customer service automation
- Contributions to open-source AI projects or published research in GenAI
Compensation & Benefits
- 13th-month salary and performance-based bonus (up to 3 months per year).
- Company-provided laptop and all necessary working equipment.
- Becorp budget for using Be services such as ride-hailing, food delivery, and coach rentals via the Be app.
- Annual health check-ups and premium health insurance package (PTI).
- 15 days of annual leave for all employees.
- Company trips, team-building activities, and happy hour events organized quarterly or annually.