We are seeking a highly skilled Senior AI/ML Core Engineer to lead the development and scaling of advanced AI and machine learning solutions across critical business platforms. The role focuses on building production-grade AI systems, optimizing ML infrastructure, and driving innovation in large language models (LLMs), intelligent automation, and real-time AI applications. The ideal candidate will combine strong research capability with hands-on engineering expertise to deliver scalable, reliable, and high-performing AI products.
Qualification and Experience
- Bachelor’s or Master’s degree in Computer Science, Artificial Intelligence, Machine Learning, Data Science, Software Engineering, or a related field
- 5+ years of experience in AI/ML Engineering, Machine Learning Infrastructure, or Applied AI development
- Proven experience in building and deploying production-scale AI/ML systems
- Hands-on experience working with LLMs, deep learning frameworks, and distributed computing environments
- Experience leading technical initiatives and mentoring engineering teams is preferred
Job Description
- Design, develop, and optimize core AI products such as Trust Engine, KYC AI, eVA, and intelligent automation platforms
- Build and deploy scalable AI/ML systems integrating state-of-the-art LLMs and classical machine learning models into production environments
- Develop and maintain robust ML infrastructure, MLOps pipelines, and model-serving platforms with a focus on scalability, governance, and reliability
- Implement fine-tuning, retrieval-augmented generation (RAG), embeddings, and vector database solutions for AI applications
- Optimize model performance, inference latency, GPU utilization, and cloud infrastructure costs
- Collaborate with research, engineering, product, and infrastructure teams to define technical roadmaps and AI architecture standards
- Ensure compliance, auditability, monitoring, and observability of production AI systems
- Drive engineering excellence through best coding practices, architecture reviews, and technical mentorship
- Support real-time AI processing, distributed data systems, and large-scale model deployment pipelines
- Continuously evaluate emerging AI technologies, frameworks, and industry trends to improve product capabilities and engineering efficiency
Required Skills
- Expert-level proficiency in Python with strong understanding of data structures, algorithms, and system design
- Hands-on experience with PyTorch and/or TensorFlow
- Strong familiarity with Hugging Face Transformers, scikit-learn, and XGBoost
- Experience building applications using Large Language Models (LLMs), including fine-tuning, RAG pipelines, embeddings, and vector databases (Pinecone, Weaviate, FAISS, pgvector, etc.)
- Working knowledge of MLOps platforms and tools such as MLflow, Kubeflow, SageMaker, or Vertex AI
- Experience with Docker, Kubernetes, and containerized deployment environments
- Production experience with cloud platforms including AWS, GCP, or Azure, especially GPU/accelerator workloads
- Strong SQL skills and experience with distributed data processing tools such as Spark, Ray, Dask, Airflow, or dbt
- Solid understanding of statistics, linear algebra, optimization techniques, and core machine learning theory
- Knowledge of monitoring, observability, and performance optimization in production AI systems
Benefits of Working as eSewa
- Stellar opportunity to work with the rising company
- The amazing and passionate young team, beautiful office space
- Trust of biggest FinTech company.
- One-of-a-kind company culture and growth opportunities to accelerate your career progression.