// building the future, one agent at a time
VISHAL
PANJETA
SMTS | ML Engineer | AI Architect
About Me
I build things that think. Over the past 8+ years, I have gone from writing NLU engines at a small startup to architecting multi-agent systems at ThoughtSpot that answer the kind of messy, real-world business questions most pipelines struggle with.
I have shipped patient insight platforms in healthtech, NLU engines in fintech, and AutoML systems in enterprise SaaS. The common thread: taking a hard ML problem, building something that actually works in production, and making it fast enough.
8+
Years in the Trenches
4
Companies Built For
3
Sectors
HealthTech, FinTech, Enterprise SaaS
Experience
ThoughtSpot
- Designed and built a multi-agent orchestration framework with dynamic expert-agent registration, so new agents plug into the system at runtime.
- Built an AutoML pipeline from scratch: model selection, training, evaluation on business data. Fine-tuned transformers handle forecasting and anomaly detection.
- Cut average API latency by 30% with custom caching layers optimized for time series and language workloads.
- Built a "why" engine: grounded data analysis that explains KPIs, powered by a multi-agent pipeline under the hood.
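The dynamic expert-agent registration pattern above can be sketched in a few lines. This is an illustrative stand-in, not ThoughtSpot's code; names like `AgentRegistry` and `can_handle` are hypothetical.

```python
# Minimal sketch of dynamic expert-agent registration (illustrative only).
from typing import Callable, Dict

class AgentRegistry:
    """Orchestrator that routes a question to whichever expert agent claims it."""

    def __init__(self):
        self._agents: Dict[str, Callable[[str], str]] = {}
        self._matchers: Dict[str, Callable[[str], bool]] = {}

    def register(self, name: str, can_handle: Callable[[str], bool]):
        """Decorator so expert agents can plug in at runtime."""
        def wrap(fn: Callable[[str], str]):
            self._agents[name] = fn
            self._matchers[name] = can_handle
            return fn
        return wrap

    def dispatch(self, question: str) -> str:
        # First registered expert whose matcher claims the question wins.
        for name, matches in self._matchers.items():
            if matches(question):
                return self._agents[name](question)
        return "no expert registered for this question"

registry = AgentRegistry()

@registry.register("forecasting", lambda q: "forecast" in q.lower())
def forecasting_agent(q: str) -> str:
    return f"forecasting agent handling: {q}"

@registry.register("anomaly", lambda q: "anomaly" in q.lower())
def anomaly_agent(q: str) -> str:
    return f"anomaly agent handling: {q}"
```

The decorator keeps registration declarative: an expert ships with its own routing predicate, and the orchestrator never needs a hard-coded agent list.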
Wellthy Therapeutics
- Owned the patient insight system end-to-end: meal, activity, and health-goal analysis for thousands of patients daily under low-bandwidth constraints.
- Architected the backend from scratch in TypeScript/Node.js/Express. It handles 60%+ of all API traffic.
- Rebuilt the chat platform on MQTT and XMPP. Throughput jumped 80%.
- Pushed platform SLA past 99.5% using AmazonMQ and AWS SQS.
- Built the platform SDK for secure third-party integration of Wellthy's system into external platforms.
BankBuddy.ai
- First engineer on the team. Built the product from the ground up.
- Wrote NLU engines using Rasa and custom lightweight models for fintech conversations.
- Built a Flask API engine with multithreaded worker pooling that sustained high throughput for investment-banking clients.
- Built fast inference pipelines for mission-critical banking algorithms, where latency = money.
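The pooling idea behind those pipelines can be sketched with the standard library alone. This is a toy stand-in, not BankBuddy's code; `score_transaction` is a hypothetical placeholder for a real model call.

```python
# Illustrative sketch of multithreaded request pooling (stdlib only).
from concurrent.futures import ThreadPoolExecutor

def score_transaction(txn_id: int) -> dict:
    # Stand-in for a latency-sensitive model call. Real calls are mostly
    # I/O-bound, so threads overlap the waiting instead of serializing it.
    return {"txn_id": txn_id, "risk": txn_id % 7 / 10}

def score_batch(txn_ids, max_workers: int = 8):
    # A shared pool caps concurrency so traffic bursts can't exhaust resources.
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(score_transaction, txn_ids))
```

`pool.map` preserves input order, so responses line up with requests without extra bookkeeping.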
Intel Corporation
- Built Jenkins CI/CD pipelines across multiple EC2 instances. Build times dropped 20%.
- Built a code-flow analysis tool that auto-generates documentation and Postman collections, saving developers significant documentation time.
Projects
LLM-MRI
- Prompt explorer with tokenization, attention heatmaps, and next-token probabilities.
- Interactive modules for 3D embeddings, layer-by-layer activation views, and hallucination scenarios.
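The next-token probability view boils down to a softmax over raw logits. A minimal sketch with toy numbers (a real model's logits span the whole vocabulary):

```python
# Softmax over next-token logits, as visualized in the prompt explorer.
import math

def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

# Toy logits for three candidate next tokens.
logits = {"Paris": 6.0, "London": 3.5, "the": 2.0}
probs = dict(zip(logits, softmax(list(logits.values()))))
```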
LLM Memory Compression Lab
- Six interactive modules covering memory decay, compression tradeoffs, retrieval accuracy, and architecture comparisons.
- Context window strategies and graph memory growth visualized with D3 and drag-and-drop controls.
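The memory-decay module visualizes curves like the exponential one below. This is a toy illustration; `half_life` is an assumed parameter, not the lab's actual model.

```python
# Toy exponential memory-decay curve (illustrative, not the lab's real model).
def retention(age_steps: float, half_life: float = 10.0) -> float:
    """Fraction of a memory's weight surviving after age_steps."""
    return 0.5 ** (age_steps / half_life)
```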
PyLLaMa-CPU
- Python bindings on top of llama.cpp for CPU inference.
- 4-bit quantization support, with 7B models running at high throughput on CPUs.
- Works with ggml/gguf model files.
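The 4-bit idea is easy to see in miniature: map floats to 4-bit integers plus a scale, then multiply back. This is a symmetric per-tensor sketch of the concept, not llama.cpp's actual block-wise quantization format.

```python
# Minimal sketch of symmetric 4-bit quantization (concept only; llama.cpp's
# real ggml/gguf formats quantize in blocks with per-block scales).
def quantize_q4(weights):
    scale = max(abs(w) for w in weights) / 7  # int4 range is -8..7; use symmetric +/-7
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize_q4(q, scale):
    return [v * scale for v in q]
```

The payoff: each weight shrinks from 32 bits to 4 (plus one shared scale), which is what lets a 7B model fit comfortably in CPU RAM.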
What If?
- Generates alternate history timelines from a single counterfactual prompt.
- Streamlit UI with Docker deployment.
- Grounded plausible alternate history generation using LLMs.
ModiScript
- Esoteric, Turing-complete language built from PM Modi's quotes.
- Programs start with "mitrooon"; the language omits floats, imports, and comments in service of its "Make in India" vision.
- Includes a web UI and CLI for running .chai files. Built the lexer, parser, and interpreter with Python ASTs.
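Interpreting via Python ASTs means translating parsed statements into Python AST nodes and compiling them, rather than walking a custom tree. A toy example of the technique (not ModiScript's actual grammar or node mapping):

```python
# Translating a parsed statement into a Python AST and compiling it to bytecode
# (toy example of the technique; not ModiScript's real grammar).
import ast

def compile_print(message: str):
    """Build `print(<message>)` as a Python AST and compile it."""
    call = ast.Expr(value=ast.Call(
        func=ast.Name(id="print", ctx=ast.Load()),
        args=[ast.Constant(value=message)],
        keywords=[],
    ))
    module = ast.Module(body=[call], type_ignores=[])
    ast.fix_missing_locations(module)  # fill in line/col info that compile() requires
    return compile(module, "<modiscript>", "exec")
```

Compiling to Python bytecode gets the CPython VM's execution speed for free, so the interpreter only has to worry about lexing and parsing.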