Enterprise & startups
Comfortable in regulated, large-scale codebases and in zero-to-one startup builds where I wear every hat.
Available for remote contract work
Senior Full-Stack & AI Engineer — building AI-native products, end to end.
|
7+ years shipping production web applications — from architecture and API design through deployment, monitoring, and scale. Recently focused on turning LLM features, RAG pipelines, and vector search into monitored, production-grade systems.
from langchain import hub
from langchain_postgres import PGVector
# retrieve, then ground the model
store = PGVector(embeddings, collection="docs")
docs = store.similarity_search(query, k=6)
answer = llm.invoke(
prompt.format(context=docs, question=query),
).content # streamed + traced in prod
I'm a Senior Full-Stack Engineer with 7+ years building and shipping production web applications — including 6+ years with U.S. companies across both enterprise platforms and fast-moving startups.
I work primarily in TypeScript/JavaScript, Node.js, and Python, with React and Next.js on the front end. I take products end to end — from architecture and API design through deployment, monitoring, and scale — and I can own a feature from requirements to live deployment independently.
Lately my focus has been integrating AI into real products: LLM-powered features, RAG pipelines, and vector search — taking them from prototype to monitored, production-grade systems rather than one-off API calls. I care about measurable outcomes: faster load times, lower infrastructure cost, and fewer defects.
Comfortable in regulated, large-scale codebases and in zero-to-one startup builds where I wear every hat.
LLM features, RAG, and vector search taken from prototype to production — traced, evaluated, and monitored.
From whiteboard to live deploy: data modeling, APIs, UI, infra, observability, and the rollout in between.
Open to remote contract work with global teams. Comfortable working across U.S. time zones.
I take a feature from requirements to live deployment independently — data model, API, UI, infrastructure, and rollout. No hand-offs required.
RAG pipelines, vector search, and LLM features built with retrieval quality, evaluation, guardrails, and tracing — not brittle prompt hacks.
API and system design that holds up under load — sensible boundaries, caching, queues, and databases chosen for the actual workload.
A bias toward outcomes you can point to: faster load times, lower infrastructure cost, and fewer defects in production.
Collaboration across product, design, and stakeholders — async-friendly, well-documented, and decisive so the right thing ships.
Tested, observable systems — automated testing, monitoring, and CI/CD so changes ship fast and stay stable.
A TypeScript-first, type-safe SaaS boilerplate with AI built in — Fastify + tRPC backend, React front end, PostgreSQL, and OpenAI-powered streaming chat over server-sent events.
Auto-detects chapters and topics in YouTube videos, generates summaries in your chosen language, and adds clickable timestamps that jump straight to the moment.
A collection of practical LangChain apps — document summarization, retrieval Q&A, and web-search agents — built on OpenAI and Gemini with Pinecone and Chroma vector stores.
A biologically-inspired memory layer for AI agents: memories decay by default and strengthen on retrieval, so agents stop repeating mistakes. Zero-dependency SQLite store exposed over MCP.
Open to remote contract work with global teams. If you're integrating AI into a real product and want it monitored and scalable from day one — let's talk.
jeremiasaguero0323@gmail.com