Role Overview
We are seeking a skilled professional to design, develop, and deploy scalable LLM and agent-based AI solutions for thousands of users. You will architect multi-agent workflows, build robust APIs and RAG pipelines, and deliver advanced multimodal features. The role requires expertise in frameworks such as LangChain, LangGraph, and OpenAI Agents SDK, as well as strong skills in machine learning and database integration. You will drive innovation by optimizing system performance and collaborating to deliver production-ready AI systems.
Key Responsibilities
- Architect and implement agent workflows utilizing frameworks such as LangChain, LangGraph, or the OpenAI Agents SDK.
- Design multi-agent system, utilizing tools/model-context-protocol (MCP).
- Develop high-performance RESTful and streaming APIs using FastAPI.
- Design, build, and evaluate RAG pipelines leveraging enterprise-grade vector databases.
- Deliver multimodal LLM features integrating text, vision, speech, and structured data.
- Instrument and monitor key metrics, including latency, cost, and hallucination rates.
- Apply core machine learning, deep learning, and Transformer-based algorithms to optimize and troubleshoot models.
- Design relational database schemas, write optimized SQL, and integrate transactional data sources.
Minimum Qualifications
- Bachelors degree in Computer Science, Data Engineering, or a related field.
- Minimum of 2 years experience building and operating ML/AI systems in production environments.
- Hands-on experience with at least one agent framework (e.g., LangChain, LangGraph, OpenAI Agents).
- Proven track record in delivering RAG solutions, including indexing, retrieval optimization, and evaluation.
- Proficiency in Python and experience with FastAPI.
- Familiarity with multi-agent system with tools/model-context-protocol (MCP).
- Demonstrated experience with multimodal LLM projects.
- Strong foundation in machine learning fundamentals, deep learning architectures, and Transformer internals.
- Expertise in relational database design and advanced SQL.
- Effective communication skills in English (B2 level or higher).
Nice to have
- Proficiency with AI coding assistants and IDE extensions (e.g., Claude, Cursor, Windsurf) to accelerate development workflows.
- Experience with containerization and orchestration tools such as Docker and Kubernetes.
- Practical experience running models locally using vLLM or Ollama.
- Proficiency in Japanese communication.
Why Join us
- Be part of a groundbreaking AI-powered coding assistant team incubated within FPT Softwares AI Center, backed by one of the largest funding allocations in the organization to drive growth and global expansion.
- Contribute to a product designed to empower developers and revolutionize software development workflows, with a vision to achieve world-class recognition.
- Collaborate with a highly talented team, including Ph.D. holders and industry experts, dedicated to pushing the boundaries of AI innovation in software development.
- Gain opportunities to author research papers and patents, showcasing your expertise and protecting our innovative solutions.
- Work on cutting-edge technologies and have a real impact on the future of AI in software development.