Qualgo Technologies Vietnam

AI Engineer (LLM) - Senior/ Middle

Job Description

Qualgo is more than just a tech company; we're a movement to build a safer, more trustworthy digital Vietnam. We believe everyone deserves to connect, communicate, and transact online without fear. Our innovative platform, powered by AI and advanced security technologies, provides a secure cyberspace for Vietnamese individuals and businesses, empowering them to embrace the full potential of the digital age. We are deeply committed to supporting Vietnam's national digital transformation.

Job Summary

Join our growing AI team to design, build, and scale production-grade AI solutions across cloud and edge. You'll turn prototypes into reliable, low-latency services - owning data/feature pipelines, training/evaluation, deployment, monitoring, and iteration. You'll optimize performance and cost; implement safety guardrails, observability, and CI/CD/MLOps automations. You'll also enable edge/on-device inference (mobile, desktop, browser, IoT), including model packaging and compression, hardware acceleration (CPU/GPU/NPU), offline/real-time constraints, telemetry, and OTA updates. Partner with data scientists, engineers, and product to ship user-facing features quickly and safely in a fast-paced, collaborative environment.

Key Responsibilities:

  • Design, deploy, and optimize LLM-based services, with models running in a self-hosted setup on cloud infrastructure.
  • Build and maintain a centralized LLM Gateway to manage multi-model access and routing.
  • Implement and evolve Agent-to-Agent (A2A) communication and the Model Context Protocol (MCP) for agent collaboration.
  • Design and integrate a powerful Agent Memory System, including a dynamic knowledge base and contextual memory to empower intelligent behavior.
  • Apply model optimization techniques to improve inference efficiency and cost-effectiveness.
  • Develop and operate MLOps pipelines for model lifecycle management.
  • Ensure system scalability, reliability, and performance across diverse workloads.
  • Collaborate across teams to bring intelligent agents into real-world applications.

Qualifications:

  • Education: Bachelor's degree, Master's degree, or Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, Electrical Engineering, or a related field.
  • At least 3 years of experience for the Middle level (5 years for the Senior level) in AI engineering, data science, machine learning, or deep learning.
  • Strong programming skills in Python and experience with containerized and cloud-native environments.
  • Solid understanding of AI/ML/DL model deployment, including serving, optimization, and context-aware design.
  • Experience in building systems with agent orchestration, memory management, and structured communication protocols.
  • Familiarity with retrieval-augmented generation, semantic memory, and message-driven workflows.
  • Hands-on experience working with GPU (NVIDIA) or TPU environments, including model quantization and other performance optimization techniques.
  • Practical experience with MLOps pipelines for training, versioning, and deploying machine learning models.
  • Bonus: experience with agent-based architecture, LLM streaming patterns, Golang, and a strong foundation in mathematics, statistics, or linear algebra.

Skills:

  • Strong problem-solving and analytical abilities.
  • Excellent communication and collaboration skills.
  • Ability to work independently and as part of a team.
  • Fluency in English is a plus.

What we offer:

  • Competitive salary and benefits package.
  • 100% salary during probation period.
  • Full social insurance contribution based on 100% of salary.
  • Opportunity to work on a product that impacts millions of users.
  • A dynamic and supportive work environment.
  • Premium health insurance for you and your family.
  • Professional growth and development opportunities.
  • Annual leave: 12 days per year, plus 1 day of birthday leave and 1 day of Christmas leave.
  • Performance review: once per year.
  • Internal training and knowledge sharing, plus professional training courses.
  • Team building, company trips, year-end party, and monthly activities.
  • Devices: MacBook and screen (if needed).
  • Free tea and coffee.
  • Comfortable working area.
  • Working hours: 9am to 6pm, Monday to Friday.

Location: The Hallmark Building - 15 Tran Bach Dang, An Khanh Ward, Thu Duc City, HCMC.

More Info

Date Posted: 27/08/2025

Job ID: 124923807

Last Updated: 02-10-2025 10:45:45 PM