Available for AI Architecture

Architecting the Neural Era.

9+ years of experience defining AI strategy for Fortune 500s. Ex-Google Architect specializing in Large Language Model (LLM) scaling and ethical AI frameworks.

LinkedIn
Architect Portrait

Professional Trajectory

2021 — Present

Principal Solutions Architect

Google AI | Mountain View

Leading the deployment of multimodal Generative AI for Google Cloud customers. Reduced training costs by 30% through sparse attention optimization.

2018 — 2021

Senior ML Engineer

NVIDIA | Santa Clara

Core developer for CUDA-accelerated deep learning libraries. Optimized transformer kernels for the H100 GPU architecture.

2017 (Internship)

AI Research Intern

Microsoft Research | Redmond

Conducted research on zero-shot learning in healthcare datasets. Published two papers at NeurIPS 2017.

Technical Stack

LLM Orchestration Distributed Systems MLOpsVector Databases CUDA C++ Cloud Infrastructure
PythonPython
PyTorchPyTorch
DockerDocker
ReactReact

Open Source Systems

Quantum-LLM-Core

A library for simulating quantum-inspired attention mechanisms on classical hardware.

Python C++

Auto-MLOps-Pipeline

End-to-end automated pipeline for deploying fine-tuned Llama models to Kubernetes.

Bash YAML

Academic Pedigree

PhD in Artificial Intelligence

MIT | 2017

B.S. in Computer Science

Stanford University | 2013

Secondary Education

The International School | 96.5%