Senior ML Engineer @ J-Squared Technologies · Toronto

Jaskaran Singh

I make AI run anywhere.

Machine Learning Engineer with a Software Engineer's backbone. I take models from research papers to production: agentic pipelines, LLM inference, and computer vision that ship on real hardware for real clients.

Edge AI · LLM Inference · Agentic AI · Computer Vision · Full-Stack · AWS
Speaking at AI conferences
4+ years
Production software & ML engineering
JP Morgan Alum
Fintech-grade engineering discipline
UofT MScAC
Master's in Applied Computing
CANSEC 2026
Speaker · FalconVeo video RAG
About

Engineer first, researcher close second.

I own ML systems end-to-end. At J-Squared Technologies I've shipped agentic pipelines that cut labeling effort by 90%, a video RAG system selected for CANSEC 2026, and vision models optimized to sub-5 ms on edge hardware for defence, manufacturing and retail-mining clients.

Before that, I built production microservices at JP Morgan Chase and earned a Master's at the University of Toronto with a 4.0 GPA. It's a rare combination: equally strong in CUDA / Rust / C++ systems work and React / AWS product work. I'm at my best where models meet production.

University of Toronto

MSc in Applied Computing (MScAC) — Computer Science

2022 – 2023 · GPA 4.0 / 4.0 · A+ in ML, Deep Learning, NLP & Computational Imaging

Thapar Institute of Engineering & Technology

B.E. in Computer Engineering

2018 – 2022 · GPA 9.55 / 10

Edge AI & Model Optimization

Quantization (PTQ + QAT), pruning, distillation and custom CUDA / TensorRT kernels — detection, segmentation, re-ID and pose models meeting sub-5 ms budgets on Jetson and Hailo-8.

LLM Systems & Agentic AI

RAG over knowledge graphs, MCP servers for real tools, multi-agent annotation workflows, and local quantized inference — Ollama, Candle (Rust), vLLM, TensorRT-LLM.

Full-Stack ML Engineering

Lock-free C++ IPC backbones, Rust inference services, REST APIs, React frontends and AWS architecture — the production plumbing that makes models actually usable.

Projects

Selected builds.

fragivo ● LIVE

Fragivo — AI Fragrance Platform

LLM- and vision-powered fragrance discovery on AWS: OAuth, Google-Search-grounded analysis, prompt-engineered recommendations.

LLMs · AWS · RecSys

LLMs for Medical Text Summarization

Fine-tuned and benchmarked GPT-3/4, T5, BART and Pegasus on medical summarization, comparing ROUGE, BERTScore and inference cost head-to-head. Published as a preprint.

LLMs · NLP · Benchmarking

MedGANs — Medical Image Enhancement

End-to-end GAN pipeline for medical image denoising and enhancement, with single-shot HDR and edge-enhancement post-processing. Published in IJESE (2025).

GANs · Imaging · Published
Photos

On stage & on the demo floor.

Contact

Let's build something that ships.

Whether it's edge AI, agentic systems, or an idea you want a second brain on — my inbox is open.

Toronto, Ontario, Canada · +1 (437) 986-0064