Banner

Himanshu Singh

21engineer • developer • builder

I build models from scratch. spanning LLMs, RAG, Computer vision, Deep learning, and various AI powered systems. I work across the entire ML lifecycle, specializing in generative AI, model optimization, and deploying advanced architectures like TTS. I focus less on debates and more on delivering impactful, practical solutions.

Professional Experience

mytender

mytender

AI Engineering Intern

July 2025 – September 2025

Technical Contributions

At mytender, I spent 2 months building some pretty cool stuff. Think RAG pipelines optimized with LangChain to push retrieval precision, scalable frontend systems in Next.js stabilizing AI modules, and agentic orchestration refinements raising efficiency by 35%, the kind of work that keeps you up at night because it's actually interesting.

Here's where it gets interesting I engineered RAG pipelines, blending vector and keyword search for smarter retrieval, orchestrated multi-agent chains for seamless routing, and built a modular Next.js frontend that kept everything stable and responsive as new features shipped.

TEXMiN

TEXMiN

AI/ML Intern

May 2025 – July 2025

Technical Contributions

At TEXMiN, I developed a traffic detection model quantized to FP16, achieving 0.92 mAP@0.5 and 30–35 FPS on constrained hardware. I trained a custom YOLOv8l model, reduced its size by 42% with hybrid optimization, and fine-tuned parameters for quantization, pruning, and inference engines to maximize performance across Jetson Nano and Raspberry Pi 4 deployments.

Here's where it gets interesting I built and optimized quantized YOLO models, for real time edge deployment, blending advanced quantization and pruning to squeeze high performance from limited hardware. This work pushed the boundaries of what's possible with resource-constrained devices, making robust traffic analysis and mining equipment detection feasible in real world, low power environments.

IIT Bhilai

IIT Bhilai

Research Intern

May 2024 – August 2024

Technical Contributions

At IIT Bhilai, I implemented advanced image forgery detection using TensorFlow, boosting classification accuracy by 15% through rigorous training and structured experiments. I optimized algorithms and streamlined workflows, cutting processing time by nearly 30%, and managed over 1,000 images organizing, cleaning, augmenting, and labeling datasets for robust training.

Here's where it gets interesting I implemented systematic preprocessing and annotation pipelines that boosted throughput by 30%, making large scale image forgery detection faster and more reliable. This approach not only improved accuracy but also dramatically reduced computation time, enabling scalable and efficient deployment for real world image analysis tasks.

Stack I use

Technologies I work with to build ML Models that solve problems

C++
C++
C
C
Python
Python
SQL
SQL
Next.js
Next.js
FastAPI
FastAPI
MongoDB
MongoDB
PostgreSQL
PostgreSQL
ChromaDB
ChromaDB
Git
Git
Docker
Docker
Azure
Azure
AWS
AWS
Google Cloud
Google Cloud
Jira
Jira
Ubuntu
Ubuntu
C++
C++
C
C
Python
Python
SQL
SQL
Next.js
Next.js
FastAPI
FastAPI
MongoDB
MongoDB
PostgreSQL
PostgreSQL
ChromaDB
ChromaDB
Git
Git
Docker
Docker
Azure
Azure
AWS
AWS
Google Cloud
Google Cloud
Jira
Jira
Ubuntu
Ubuntu
C++
C++
C
C
Python
Python
SQL
SQL
Next.js
Next.js
FastAPI
FastAPI
MongoDB
MongoDB
PostgreSQL
PostgreSQL
ChromaDB
ChromaDB
Git
Git
Docker
Docker
Azure
Azure
AWS
AWS
Google Cloud
Google Cloud
Jira
Jira
Ubuntu
Ubuntu
C++
C++
C
C
Python
Python
SQL
SQL
Next.js
Next.js
FastAPI
FastAPI
MongoDB
MongoDB
PostgreSQL
PostgreSQL
ChromaDB
ChromaDB
Git
Git
Docker
Docker
Azure
Azure
AWS
AWS
Google Cloud
Google Cloud
Jira
Jira
Ubuntu
Ubuntu

Open Source Contributions

KOKORO in GPT-2 Architecture

2025

Implemented KOKORO text-to-speech model using GPT-2 architecture in GGML format for efficient inference

If you've read this far, you might be interested in what I do.

Let's connect & get in touch!

Find me on these platforms

© 2025 Himanshu.