Search by job, company or skills

  • Posted 17 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Company Overview:

Qualgo is an R&D center specializing in cybersecurity products and solutions. We are on a mission to build a trusted cyberspace where individuals and businesses can thrive with confidence.

Job Summary:

  • As a Senior AI Engineer on our product development team, you will bridge the gap between data science and production software. You will be responsible for taking state-of-the-art NLP/LLM models and deploying them into high-performance, real-time environments.
  • Because our product intercepts and analyzes communications, you will face unique engineering challenges around latency, scale, and privacy. You will not just be calling third-party LLM APIs. You will architect how our AI operates safely, securely, and instantly, with a strong emphasis on edge computing and on-device ML to help preserve user privacy.
  • This role is ideal for someone who is strong in AI deployment and systems engineering, and who can work closely with Data Scientists to bring advanced models into real-world products.

Key Responsibilities:

  • Deploy, optimize, and integrate NLP/LLM models into high-performance, real-time production environments.
  • Build and maintain AI inference pipelines for low-latency, privacy-sensitive communication analysis use cases.
  • Design and implement AI systems that can run efficiently across cloud, edge, and on-device environments.
  • Optimize model performance for production through quantization, pruning, distillation, graph optimization, and runtime tuning.
  • Convert, benchmark, and deploy models using production-oriented inference frameworks and toolchains.
  • Work closely with Data Scientists to take models from experimentation to robust production deployment.
  • Evaluate architecture tradeoffs across latency, privacy, model quality, cost, and hardware constraints.
  • Design reliable testing, profiling, monitoring, and observability workflows for AI inference systems.
  • Contribute to system design decisions for privacy-preserving AI, hybrid on-device/cloud inference, and model lifecycle management.
  • Collaborate with backend, product, and app engineering teams to integrate AI capabilities into end-user experiences.

Qualifications:

  • Education: Bachelor's degree/ Master's degree or Ph.D. in Computer Science, Artificial Intelligence, Machine Learning, Electrical Engineering, or a related field.
  • 5+ years of experience in AI Engineering, Applied Machine Learning, ML Systems, or a similar role.
  • Strong programming skills in Python and C++.
  • Strong experience deploying machine learning models into production environments.
  • Experience with edge computing, embedded AI, or on-device ML.
  • Hands-on experience with model optimization techniques such as quantization, pruning, distillation, or hardware-aware optimization.
  • Experience with production inference frameworks and runtimes such as ONNX Runtime, TensorRT, TFLite, Core ML, or similar technologies.
  • Strong understanding of performance optimization for inference systems, including latency, throughput, memory footprint, and hardware utilization.
  • Experience building reliable, scalable AI services or runtime components for production systems.
  • Ability to work closely with Data Scientists and software engineers to productionize advanced NLP/LLM models.

Nice to have:

  • Experience with platforms such as NVIDIA Jetson, Google Coral, or similar edge AI environments.
  • Experience deploying AI models on iOS or Android devices.
  • Familiarity with mobile development languages such as Swift, Objective-C, Kotlin, or Java.
  • Experience in privacy-sensitive domains such as trust & safety, fraud detection, cybersecurity, or communication intelligence.
  • Exposure to multilingual NLP systems or small-model deployment for constrained environments.

What we offer:

  • Competitive salary and benefits package.
  • Opportunity to work on a product that impacts millions of users.
  • A dynamic and supportive work environment.
  • Premium health insurance for you and your family.
  • Professional growth and development opportunities.
  • Annual leave 12-14 days per year + 1 Birthday Leave + 1 X'Mas
  • Performance review: once per year
  • Internal training/sharing and professional Training courses
  • Team building, company trip, year end party, monthly activities,....
  • Devices: Macbook and screen (If needed)
  • Free tea and coffee
  • Comfortable working Area
  • Working hour: 9am - 6pm from Monday to Friday

More Info

Job Type:
Industry:
Function:
Employment Type:

Job ID: 145271079

Similar Jobs