Senior AI Engineer with 10 years of experience shipping production backend services. Currently at Amazon (AWS — Amazon Connect) building systems that execute automated call flows at high scale, while building a multi-tenant AI Infrastructure Platform on the side — load-tested at 12K+ RPS with p95 <150ms on Kubernetes.
Specialized in AI agents, LLM orchestration, RAG, multi-provider LLM gateways, and event-driven microservices. Background spanning logistics (Nuvocargo), e-commerce (Lovevery), and cloud infra at Amazon. Working remotely from 🇲🇽 Mexico.
- 🔭 Currently shipping at Amazon Connect Flow — TypeScript · AWS CDK · Lambda
- 🤖 Building a modular AI Infrastructure Platform — 5 independent services, production-grade
- 🌱 Going deep on Go, Temporal, Kafka, pgvector, and distributed systems
- 💬 Ask me about AI agents, RAG pipelines, LLM gateways, or scaling backend services
- 📫 Reach me: [email protected]
A modular platform that lets companies integrate production-grade AI without building infra from scratch. Each module is an independent service sharing auth, billing, and observability.
| Module | What it solves | Stack |
|---|---|---|
| M1 — AI Gateway | Multi-provider LLM routing, cost tracking, no vendor lock-in | Go · Envoy · Redis |
| M2 — RAG Platform | Hybrid semantic + BM25 search, 200K+ docs indexed, p95 <280ms | Python · FastAPI · pgvector · Kafka |
| M3 — Agent Orchestrator | Durable agent workflows with vector memory, 3M+ executions/day | Python · Temporal · pgvector · MCP |
| M4 — LLM Eval Platform | Continuous quality monitoring + drift detection, 2M+ evals/day | Python · LLM-as-judge · S3 |
| M5 — Event Mesh | Real-time AI inference on event streams, sub-second latency | Python · Kafka · Redis Streams |
Platform pitch: processing 12K+ RPS with p95 <150ms, validated with k6 load tests on Kubernetes simulating 20K+ concurrent users — using Go, Python, Kafka, Temporal, pgvector, and Envoy.
Languages
AI / ML
Backend & APIs
Cloud & Infrastructure
Observability & Testing
⭐️ From vnponce — open to freelance & collaboration on AI infrastructure projects



