How LLMs actually learn and generate: A complete overview
Mathematical verification and checking parity by loading raw OpenAI weights directly into scratch-built architectures.
AI/ML Engineer in Progress // Specializing in GenAI architecture,MLOps, Agentic Workflows, robust data pipelines, and turning academic papers into production-grade iron // Open to AI/ML & GenAI Roles
"I am deeply obsessed with the gap between a machine learning model that scores 94% on an offline benchmark and one that actually survives production deployment. I don't build generic tutorial wrappers. I design clean pipelines, implement architectures from scratch to truly understand their boundaries, and build systems optimized for reliable execution over academic metric inflation."
CNNs, ANN models, Encoder-Decoder, Attention Mechanism, ViT, BERT-style, GPT-style, model tuning, regularization.
NumPy, Pandas, Scikit-learn, TensorFlow, Matplotlib, Seaborn, OpenCV, HuggingFace, PyTorch, NLP.
Python, C++, JavaScript, SQL. Git, GitHub, AWS Certified Cloud Practitioner.
REST APIs, Flask, FastAPI, PostgreSQL (pgvector), MongoDB, MySQL.
Built GPT-2 entirely from scratch to understand the fundamental mechanics of self-attention. Implemented custom training loops and optimized inference paths without relying on high-level abstraction libraries.
A raw implementation using pgvector. Bypassed LangChain to reduce latency and dependencies, proving that raw SQL and bare-metal embeddings often outperform bloated frameworks in production.
Architected a highly optimized CNN for CIFAR-10. Focused on memory efficiency and parameter reduction without sacrificing accuracy. Demonstrated pruning techniques for edge deployment.
Designed and deployed a real-time facial detection pipeline using modern computer vision techniques. Optimized for low latency inference suitable for live video streams.
Developed a neural autoencoder from the ground up to explore representation learning, latent space compression, and data reconstruction without reliance on high-level wrappers.
Built an end-to-end classification pipeline for predicting heart disease presence. Deployed as a robust predictive API endpoint for reliable integration.
Mathematical verification and checking parity by loading raw OpenAI weights directly into scratch-built architectures.
Unpacking the probabilistic mechanics and step-by-step noise schedule scheduling algorithms behind generative models.