Search by job, company or skills

  • Posted a month ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Job Description

RAG System Development

  • Design, implement, and optimize end-to-end RAG pipelines, covering data ingestion, indexing, retrieval, reranking, and response generation.
  • Integrate, configure, and tune vector databases such as Pinecone, Weaviate, Milvus, or Chroma.
  • Improve chunking strategies, embedding quality, and similarity search effectiveness.
  • Build evaluation frameworks to measure RAG performance, including precision, recall, context relevance, and answer quality.

Agentic AI & Intelligent Systems

  • Develop AI Agents using frameworks such as LangGraph, AutoGen, or LlamaIndex Agents.
  • Design and orchestrate multi-agent workflows (e.g., planner, executor, evaluator).
  • Implement tool and function calling, serverless integrations, and adaptive reasoning workflows.

Model Context Protocol (MCP)

  • Design and implement MCP servers that expose tools, APIs, and datasets to LLMs.
  • Integrate MCP across application layers, from frontend and backend services to LLM components.
  • Build custom MCP tools, including database connectors, internal API integrations, and data collection utilities.

System Architecture & MLOps

  • Build and maintain model inference infrastructure using Docker, Kubernetes, and GPU-based environments.
  • Implement logging, monitoring, and observability for RAG and agent-based pipelines.
  • Optimize inference performance and cost through techniques such as batching, quantization, vLLM, and TensorRT.

Requirements

  • 3 5+ years of experience in advanced analytics, machine learning, deep learning, or applying AI/GenAI solutions to real-world business problems.
  • Strong proficiency in Python (FastAPI or Flask); experience with Node.js is an advantage.
  • Proven experience building production-grade RAG systems.
  • Deep understanding of LLMs, embeddings, vector databases, and reranking techniques.
  • Hands-on experience with RAG and agent frameworks such as LangChain, LlamaIndex, LangGraph, or equivalent.
  • Practical knowledge of Docker, Linux, and foundational DevOps practices.
  • Strong debugging, problem-solving, and system-level thinking skills.
  • Excellent communication and presentation skills, with the ability to clearly explain AI, LLM, and RAG concepts to both technical and non-technical audiences.
  • Ability to deliver AI solution demos and presentations, clearly articulating system architecture, workflows, and business value.
  • Fluent spoken and written English.

Preferred

  • Experience in client training, presales support, solution presentations, or developing AI educational materials (slides, documentation, demos).
  • A teaching and mentoring mindset, with the ability to guide junior engineers and support cross-functional teams such as Product, Business, Sales, and Clients in adopting AI solutions.

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 141927071

Similar Jobs