NEW YORK, NY

Mahadev

Maitri

|
LLM ApplicationsRAG SystemsFastAPICloud InfrastructureAWSKubernetesProduction AI Systems

Python AI Engineer with 5+ years of experience building production LLM pipelines, RAG systems, and cloud-native AI infrastructure on AWS. MS Computer Science, University of Delaware.

5+
Years Experience
3+
Production AI Apps
10+
AWS Services Used
Overview

Quick Summary

A snapshot of Mahadev's background, skills, and recent wins.

๐Ÿ—“๏ธ
Experience
5+ Years
Professional software & AI engineering
๐ŸŽ“
Education
MS Computer Science
University of Delaware โ€” GPA 4.0/4.0
๐Ÿ’ผ
Current Role
Python AI Engineer
Fulcrum Digital (Consulting, NYC)
๐Ÿ“
Location
New York, NY
Open to remote & hybrid
๐Ÿ›‚
Authorization
H-1B
Authorized to work in the US
๐Ÿ†

Top Achievements

  • โ€บBuilt Doxtract-IDP: 92% extraction confidence, 40% less manual data entry
  • โ€บRAG pipeline with 87% precision across 2,000+ insurance test scenarios
  • โ€บFine-tuned LLM ensemble achieving Kaggle MAP@3 of 0.948
  • โ€บReduced EC2 costs by 60% migrating to AWS Bedrock
  • โ€บDesigned EKS services with 99.8% availability at 300+ concurrent requests
  • โ€บPublished research on ship classification โ€” 89.6% accuracy
Core SkillsPythonFastAPILangChainLangGraphAWSEKSDockerKubernetesRAGLLMsFine-TuningPyTorch
Career

Experience

Python AI Engineer

Current

Fulcrum Digital

October 2024 โ€” PresentยทNew York City, NY
PythonLangGraphLangChainAWS BedrockEKSFastAPIPGVectorKafkaRedisRayKubernetes
โ€บ

Architected an enterprise document intelligence solution achieving 92% average extraction confidence and 40% reduction in manual data entry, by designing Doxtract-IDP using AWS Bedrock multi-modal models (Claude Sonnet 4) with a serverless Lambda-SQS-DynamoDB event-driven architecture.

โ€บ

Implemented production-grade document processing with 90%+ field-level extraction accuracy via human-in-the-loop validation and post-processing rules, deployed on serverless AWS infrastructure handling 60600 second processing windows per document.

โ€บ

Designed and deployed high-throughput microservices on AWS EKS with horizontal pod autoscaling, load balancing, and CI/CD pipelines, delivering 99.8% system availability while handling 300+ concurrent requests during peak usage.

โ€บ

Architected an event-driven GenAI pipeline using LangGraph agent systems with persistent memory and chain-of-thought reasoning, improving response relevance scores by 20% and reducing manual policy review time by 15%.

โ€บ

Engineered a custom vector database retriever with LLM-based reranking, increasing search relevance scores by 25% and achieving 87% precision across 2,000+ synthetic insurance risk scenarios.

โ€บ

Optimized GPU infrastructure costs by migrating from local Ollama deployments to AWS Bedrock API integration, eliminating on-premise GPU dependencies and reducing monthly EC2 spend by 60%.

โ€บ

Deployed a self-hosted OCR service using Vision Transformer (Phi3.5 Vision) on EKS with Ray clusters for distributed computing, reducing monthly infrastructure costs by 20%.

โ€บ

Built a RAG pipeline with Kafka-based document ingestion and Redis query caching, implementing custom retrieval and reranking components with structured claim analysis to achieve 87% precision on insurance risk assessment across 2,000+ test scenarios.

Research Assistant

Department of Mechanical Engineering - University of Delaware

January 2024 โ€” February 2024ยทNewark, DE
Google Maps APIMQTTReal-Time SystemsAlgorithm Design

Software Engineer

Optum - UnitedHealth Group

July 2020 โ€” July 2022ยทBengaluru, India
PythonFastAPIAWS GlueRedshiftSageMaker+5

Trainee - Internship

Electronics and Radar Development Establishment (LRDE), DRDO

January 2020 โ€” May 2020ยทBengaluru, India
PythonPyTorchGANMobileNetV2Transfer Learning

Project Intern - Internship

Wipro IISc Research Innovation Network (WIRIN), IISc

June 2019 โ€” December 2019ยทBengaluru, India
Video SegmentationDeep LearningPython
Portfolio

Featured Projects

Production systems and personal work demonstrating AI engineering depth.

Identifying Student Misconceptions in Math with Fine-Tuned LLMs

July 2025 โ€“ October 2025

Developed a multi-model classification system by fine-tuning a diverse range of LLMs, including DeepSeek, Gemma-2, and Qwen3.

PythonTransformersPEFTBitsAndBytes

Formula 1 Race Strategy Optimization with Deep Reinforcement Learning

Feburary 2024 โ€“ May 2024

Designed and trained a Deep Q-Learning agent to optimize race strategies in Formula 1 by creating a data-driven environment using real-world racing data from FastF1.

PythonPyTorchGymnasium

Poultry Farms to Manage Depopulation Using Nash Equilibrium through MR Simulation

September 2023 โ€“ December 2023

Formulated an innovative Nash Equilibrium strategy to optimize revenue generation for poultry farms. Utilized game-theoretical principles to improve management practices including vaccination timing, depopulation, and risk management.

C#Unity Engine

Sudoku using Backtrack Search

February 2023 โ€“ May 2023

Developed a user-friendly Flask application for solving Sudoku puzzles, integrating advanced algorithms such as AC3 and Backtracking search, enabling users to tackle both predefined puzzles and custom inputs with ease.

PythonFlaskHTMLCSS

Base Language Compiler

February 2023 โ€“ May 2023

Developed an Object System by defining class structures, modifying the lexer and parser to add reserved keywords and object identifiers, implementing method calls, and ensuring semantic analysis, resulting in enhanced functionality and code maintainability.

KotlinAntlrIntellij IDEA

Cryptocurrency Price Forecasting

September 2022 โ€“ December 2022

Collaborated with a team of three to develop neural networks for predicting cryptocurrency prices, employing an ablation study methodology to enhance model architectures and hyperparameters.

PythonTensorFlowNumpyPandas+1

Class Attendance Application based on Facial Recognition and Cognitive Services

August 2018 โ€“ November 2018

Engineered an Android application to optimize the attendance-taking process for classes, implementing image recognition technology to automate attendance tracking through facial recognition.

AndroidJavaMicrosoft Azure Face API

Detection of Common Plant Diseases using Convolutional Neural Network

August 2018 โ€“ November 2018

Led the development of an innovative solution to address plant diseases by leveraging Convolutional Neural Network (CNN) technology. This solution detects diseases early and offers timely recommendations for weedicide application.

PythonPyTorchNumpyPandas+1
Expertise

Skills

Production-proven technologies from 4+ years of engineering.

Tech Stack

Python
Python
PyTorch
PyTorch
TensorFlow
TensorFlow
AWS
AWS
Docker
Docker
Kubernetes
Kubernetes
TypeScript
TypeScript
React
React
Next.js
Next.js
MongoDB
MongoDB
GraphQL
GraphQL
Flask
Flask
Node.js
Node.js
SQL
SQL
Robot Framework
Robot Framework
Selenium
Selenium
๐Ÿค–

AI & Machine Learning

LLMsFine-TuningLangChainLangGraphRAGPGVectorPEFTQLoRAHugging FacePyTorchTensorFlowComputer VisionNLPEmbeddings
โš™๏ธ

Backend & APIs

PythonFastAPIREST APIsKafkaRedisPostgreSQLMongoDBGraphQLSpaCyNLTK
โ˜๏ธ

Cloud & DevOps

AWSEKSLambdaAWS BedrockSageMakerDockerKubernetesRayCI/CDSQSDynamoDB
๐Ÿ’ป

Frontend & Tools

TypeScriptReactNext.jsThree.jsGitLinuxRobot FrameworkJMeter
Background

About

Mahadev Maitri's profile picture

Hi I'm Mahadev Maitri and I am a Software Developer, Instructor, and huge F1 fan. I am passionate about developing innovative solutions to real-world problems and enjoy exploring new technologies. I hold Master of Science in Computer Science from the University of Delaware.

Education

University of Delaware

Master of Science โ€” Computer Science

2022 โ€“ 2024ยทNewark, DEยทGPA 4.0/4.0

R. V. College of Engineering

Bachelor of Engineering โ€” Electronics and Communication Engineering

2016 โ€“ 2020ยทBengaluru, IndiaยทGPA 7.97/10

Achievements

๐Ÿ†

Won the Best Artificial Intelligence/Machine Learning Hack category at HenHacks 2024 for the innovation in Yoga Pose Detection and Pose Correction with an Android app.

View โ†—
๐Ÿ†

Won the Creativity Award at the DS+AI Hackathon 2023 for the work towards the "Generating High-quality, Fine-scale Precipitation Dataset for the Great Lakes Region Building upon Existing Dataset".

View โ†—
๐Ÿ†

Secured the second place out of 30 participating teams in Hackathon (Blockchain Hackathon 2019) organized by NextGrids and powered by JUiNCUBATOR, a TBI supported by DST, Government of India and JAIN(Deemed-to-be-University).

Let's Talk

Contact

Based in New York, NY. Open to AI / ML Engineering opportunities. Reach out โ€” I respond quickly.