Senior AI Engineer

Industry Other
Category IT – Programming & Development
Location Kathmandu, Nepal
Expiry date Jun 28, 2026 (7 days left)

Job Description

Location: Manbhawan, Lalitpur (On-site / Hybrid)

Department: AI

Reports To: Chief Technology Officer (CTO)

To apply, please send your resume to [email protected].

Role Overview

We are seeking a highly skilled Senior AI Engineer to design, develop, and deploy production-grade AI/ML solutions that drive tangible business value. In 2026, building AI systems goes beyond API integration; you will lead the architecture of advanced multi-agent workflows, state-of-the-art retrieval systems (RAG), and custom-tuned open-source models. This role requires deep technical expertise, robust engineering discipline, and a proven track record of shipping scalable, secure, and cost-optimized AI applications to production.

Key ResponsibilitiesAdvanced GenAI & Agentic Systems

Design and implement complex multi-agent orchestration workflows, autonomous AI agents, and advanced graph-based RAG architectures.
Lead fine-tuning strategies for open-source Large and Small Language Models (LLMs/SLMs) using techniques like LoRA/QLoRA to meet domain-specific needs.

LLMOps, Evaluation & Security

Establish rigorous validation pipelines and evaluation frameworks to systematically measure and mitigate hallucinations, latency, and token spend.
Implement production-grade guardrails to monitor and secure AI systems against prompt injection, data leakage, and adversarial vulnerabilities.

System Architecture & Pipeline Development

Convert trained models into high-throughput microservices, integrating them seamlessly into live production software.
Build and maintain scalable backend workflows and APIs (FastAPI/Flask) optimized for low-latency model inference.

MLOps & Lifecycle Management

Architect and manage the full ML lifecycle, including model versioning, continuous deployment, automated retraining, and data drift detection.
Build robust data handling pipelines for cleaning, synthetic data generation, and context-enrichment of large-scale datasets.

Performance & Cost Optimization

Optimize inference costs and token consumption through efficient context-window management, model quantization, and caching strategies.

Technical Mentorship & Collaboration

Guide and mentor junior AI engineers, establish coding standards, and conduct architecture and code reviews.
Partner with product and engineering teams to translate high-level business goals into concrete AI-driven technical specifications.

Required Qualifications

Education: Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Data Science, or a closely related field.
Experience: 5+ years of experience in production-level Machine Learning or AI Engineering roles.
Core ML Stack: Strong proficiency in Python and deep learning frameworks, specifically PyTorch and the Hugging Face ecosystem.
GenAI Stack: Hands-on experience building with orchestration frameworks (e.g., LangGraph, AutoGen, LlamaIndex, LangChain).
Vector Infrastructure: Experience setting up, indexing, and optimizing vector databases (e.g., Pinecone, Qdrant, Milvus, or pgvector).
Cloud & Infrastructure: Strong experience deploying and managing AI workloads on cloud platforms (AWS, GCP, or Azure) utilizing Docker and Kubernetes.
LLMOps Tools: Familiarity with LLM tracking, evaluation, and observability tools (e.g., LangSmith, Phoenix, Ragas, or MLflow).

Preferred Skills

Experience deploying local, open-source models using high-performance inference engines like vLLM, Ollama, or TensorRT-LLM.
Familiarity with distributed data processing and big data frameworks (e.g., Spark, Databricks).
Active contributor to open-source AI projects or author of peer-reviewed AI research publications.
Exceptional communication skills with the ability to articulate complex AI paradigms to non-technical stakeholders.

What We Offer

Cutting-Edge Tech Stack: Opportunity to build pioneering GenAI, Agentic, and LLM applications utilizing industry-leading infrastructure.
Premium Compute Access: Dedicated and flexible cloud compute budgets and access to high-end cluster GPUs for model training and inference experimentation.
Collaborative Environment: A culture of innovation, continuous learning, and intellectual curiosity at our tech hub in Manbhawan.
Growth & Leadership: Clear pathways to AI leadership and architectural ownership as the AI department expands.
Compensation: Highly competitive salary package, performance bonuses, and a comprehensive benefits structure.

Apply Now Save Job

Senior AI Engineer

Job Description

Explore Job

for jobseekers

For Employers

Trainings

Resources

Quick Links

Senior AI Engineer

Job Description

Related Jobs

More Jobs

Download Our Mobile App