About Me
Hi I am Ankur Singh,
Currently, I’m working as an AI Solutions Engineer at Intel. My part of my job, I had to migrate some CUDA samples to Intel’s SYCL. This experience has ignited my passion for GPU programming. It has led me to explore tools like Triton and torch.compile, which make CUDA programming more accessible for Python developers, and has also sparked my interest in C++. Checkout my notes in Today I Learned (TIL) tab.
I earned my Master’s in Software Engineering from San Jose State University in May 2024. During my studies, I served as a research assistant under Dr. Wu and Dr. Liu at the SJSU Research Foundation. My research focused on Multi-Modal modeling, Federated Learning, Traffic Flow Prediction, and real-time edge deployment, providing me with hands-on experience in these cutting-edge areas.
With 4 years of prior professional experience in AI/ML, Software Engineering, DevOps, Cloud, and Databases, I embarked on my journey by founding AI Adventures, a startup providing AI/ML solutions. Later, as an ML Team Lead at Zoop.One, I orchestrated the successful launch of four ML services, housing 20+ deep learning models, handling over 2M+ requests/month. All this in just ten months. My approach, grounded in first principles, emphasized improving developer experience through process building, establishing best practices, and setting up efficient ML infrastructure, leading to rapid prototyping and significantly reduced Time To Market (TTM). I’m really proud of my work at Zoop.
Beyond my professional endeavors, I am an avid reader and actively participate in competitions, winning multiple Kaggle competitions and hackathons. I’ve contributed to several open-source projects and created two Python packages: colab-everything and torchserve-python, aiming to give back to the community. I aspire to give a talk at a Python Conference someday.
Acheivements
- Speaker at Intel oneAPI DevSummit for AI and HPC 2022 - Link
- Got featured in Spartan Daily (SJSU newspaper) - Link
- “AI for Social Good” Grand Prize winner, at Intel Innovation 2022 - Link
- Kaggle Competitions Expert - Link
- Author “Colab-everything” python package, more than 40K downloads - Link
AI/ML Skills
Pandas, SQL, BigQuery, PySpark, Sklearn
Pytorch, Triton, Transformers, PeFT, Accelerate, CUDA
DeepSpeed, PyTorch Distributed, vLLM, LlamaCPP
MLflow, ONNX, OpenVINO, TorchServe, TensorRT
S/W Development Skills
OOPs, Design Patterns, Packaging, Unit Testing
Git, GitHub Actions, Docker, K8s, AWS, Terraform
Postgres, SQLite3, MongoDB, Redis, ElasticSearch
API Development & Web-apps
FastAPI, Flask, Firebase, Appwrite, Steamlit, Gradio
AI Solutions Engineer, Intel
May, 2023 — Present
Develop, benchmark, profile, and optimize various AI workloads on Intel hardware using Intel's optimization stack. This includes workloads such as distributed training, QLoRA, quantization, custom PyTorch kernels, and LLM deployment. Additionally, responsible for migrating CUDA samples to SYCL and helping maintain over 30 AI code samples.
Research Assistant, SJSU Research Foundation
September, 2022 — May, 2024
Optimized the YOLOv8 model for object detection and segmentation to enable real-time inference on NVIDIA Jetson devices connected through ROS. Contributed to developing innovative strategies for Knowledge Distillation and Federated Learning, with a focus on regression problems. Additionally, worked on multi-modal modeling, including image and audio tokenization to prepare these modalities for input to LLMs, and using LLMs to generate image and audio during inference.
Machine Learning (ML) Lead, Zoop.One, Pune
September, 2021 — July, 2022
Led the strategic development of machine learning initiatives by launching four core ML services, which include over 20 deep learning models and handle more than 2 million requests per month—all within just ten months. Established best practices for MLOps, developed micro-frameworks, and implemented automation processes, with emphasis on agility and an exceptional developer experience.
Co-founder / CEO, AI Adventures, Pune
August, 2018 — September, 2021
Led client project development, overseeing the entire lifecycle and collaborating closely with clients. Created five comprehensive courses covering Python, Data Science, Machine Learning, Computer Vision, and Deep Learning. Established AI clubs in several colleges across the city, fostering a vibrant community through workshops and interactive sessions.