Search by job, company or skills

  • Posted 19 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Trusting Social is an AI Fintech pioneer that's revolutionizing credit access in emerging markets. Our mission is Advancing AI to Meet the Financial Needs of Everyday Consumers with Empathy. We've assessed over 1 billion consumers across four countries, and we're on a mission to provide 100 million credit lines using the power of AI and Big Data.

How You'll Make An Impact

As a Voice AI Engineer, you'll go beyond building models - you'll architect our Digital Humans. You'll design the orchestration layer that enables our AI agents to listen, reason, and respond in real-time, seamlessly connecting rigid banking systems with the natural flow of human conversation in financial advisory scenarios.

What You'll Do

You'll spearhead the development of our real-time omnichannel communication platform, focusing on creating voice agents that feel truly human: low-latency, intuitive, and capable of handling nuanced financial discussions with empathy and precision.

  • End-to-end pipeline orchestration: Design and optimize the complete conversational flow: voice activity detection (VAD) automatic speech recognition (ASR) large language model (LLM) reasoning text-to-speech (TTS).
  • Latency optimization: Target sub-1200ms end-to-end perceived latency using techniques like stream-to-stream processing.
  • Advanced speech understanding: Fine-tune a speech foundation model with multiple speech understanding capabilities, including automatic speech recognition (ASR), spoken language identification (LID), speech emotion recognition (SER), and audio event detection (AED)
  • Advanced speech generation: TTS models for controllably expressive and emotional speech generation: natural prosody, emotional tone, and adaptation to local Southeast Asian dialects and accents. Develop voice cloning for consistent, brand-aligned agents that convey warmth during sensitive financial topics.
  • Turn-taking & interruptibility: Build robust logic for handling interruptions (barge-in), background noise, filler words, and natural pauses, ensuring smooth, human-like dialogue.
  • Agent memory: Embed memory into the voice pipeline for real-time, accurate delivery of personalized communication across channels, without disrupting conversation flow.
  • System reliability & evaluation: Develop agent versioning, conduct A/B testing, and track key internal voice agent / biz-related metrics for user engagement to continuously improve interactions.

What We're Looking For


  • The Voice stack: 3+ years of experience in Machine Learning/Software Engineering with deep expertise in Speech-to-Text (STT) and Text-to-Speech (TTS) architectures.
  • Real-time protocols: Proficiency in managing low-latency audio streaming and bidirectional communication using WebRTC, WebSockets.
  • Programming mastery: Advanced proficiency in Python, specifically leveraging FastAPI and AsyncIO for high-concurrency systems, alongside hands-on experience with Deep Learning frameworks like PyTorch.
  • Data & audio engineering: Hands-on experience with Librosa and FFmpeg to minimize audio processing overhead and optimize feature extraction.
  • The VoiceAI mindset: You understand that a great voice agent isn't just a fast model; it's a perfectly timed, natural interaction.
  • Bonus points:
  • MLOps & DevOps: Strong command of DevOps and MLOps methodologies, featuring hands-on proficiency in containerization (Docker) or orchestration (Kubernetes).
  • Inference & serving: Demonstrated expertise in model optimization and deployment using engines like ONNX and TensorRT, with experience managing serving frameworks such as Triton Inference Server and Ray (especially if you can demonstrate performance gains through benchmarking).
  • Full-Stack awareness: Capable of building responsive user interfaces and web applications using React or Next.js.

What We Offer


Join our vibrant team and enjoy:

  • Opportunity to work and learn from one of the best and brightest technology teams in Vietnam
  • Be part of a winning team with exponential growth regionally, experience recruiting world-class talents
  • Competitive compensation package, including 13th-month salary and performance bonuses
  • Comprehensive health care coverage for you and your dependents
  • Generous leave policies, including annual leave, sick leave, and flexible work hours
  • Convenient central district 1 office location, next to a future metro station
  • Onsite lunch with multiple options, including vegetarian
  • Grab for work allowance and fully equipped workstations
  • Fun and engaging team building activities, sponsored sports clubs, and happy hour every Thursday
  • Unlimited free coffee, tea, snacks, and fruit to keep you energized
  • An opportunity to make a social impact by helping to democratize credit access in emerging markets

At Trusting Social, we live by ownership, integrity, and agility in execution. We believe in doing what's right, what's best, and what's innovative. If you're smart, driven, and want to make a difference in the world with the most advanced and fascinating technology, come join our team. We offer the runway to truly make an impact.

Learn more about us:

https://trustingsocial.com

https://www.youtube.com/watchv=inAEDGvOcL8&t=29s

More Info

Job Type:
Industry:
Function:
Employment Type:

Job ID: 137012591