Blog

Technical deep-dives into AI engineering, full-stack architecture, and lessons learned.

Series

1 series

13-Part Series

How to Architect an Enterprise AI System (And Why the Engineer Still Matters)

Every post in this series is a decision I made that no model would have made on its own. Not because the model is bad — because the model doesn't know what it doesn't know.

13 of 13 parts published

Read the series

0The Day My AI Forgot Everything (So I Built a Context-Continuity Inference Stack)

1I Stopped Letting Emails Poison My Extractor: The Pre-LLM Gate That Made the Rest of the Pipeline Reliable

2I Turned Temperature Up to Save My Extractions: The 3‑Node LangGraph That Trades Variance for Truth

…10 more parts planned

Posts

37 posts

machine-learningtime-seriesfinancial-mllightgbmvalidationcryptoresearch-engineering

Validation Geometry Is Part of the Model

A LightGBM capacity-control baseline for the ICAIF 2026 minute-ceiling paper, and why label source, stride, purge, and temporal split geometry matter as much as the classifier in time-series ML.

Daniel Anthony Romitelli Jr. · May 31, 2026

machine-learningtime-seriesmodel-reliabilitytrading-systemscalibration

Model Failure Is a Time Series

CTF treats reliability as its own supervised time-series problem: a second model watches recent uncertainty telemetry, labels correctness without lookahead, enforces train/serve feature parity, and emits a trust probability for the execution gate.

Daniel Anthony Romitelli Jr. · May 29, 2026

embeddingssearchredispythonazure-ai-searchproduction-systems

Four Vectors, One Record: How I Split Embeddings Before They Hit Search

I rewrote the embedding pipeline around a failure case I hit in production: a single blended embedding kept surfacing candidates whose experience looked vaguely relevant while their actual skills were wrong, or vice versa. The fix was to split each record into four semantic views, generate four embeddings in parallel, cache them with a stable Redis key, and let the job layer validate size and dimension before retrying transient failures or sending terminal ones to the DLQ.

Daniel Anthony Romitelli Jr. · May 29, 2026

embeddingsragvector-searchsupabasetypescript

Vector Split by Chunk: Why My Retrieval Stops at the Boundary I Drew

I split embeddings by chunk because the retrieval layer needed finer granularity than whole-document vectors could give me. That choice shows up everywhere in the pipeline: chunking, embedding generation, and the file-path retrieval path that can pull exact spans back when I need them.

Daniel Anthony Romitelli Jr. · May 28, 2026

ocrtypescripthugging-faceneuroloqai-tutoringtext-recovery

Small in Code. Large in Behavior.

How Neuroloq decides whether an attachment is document-like, which OCR mode to use, and why the router itself is the product.

Daniel Anthony Romitelli Jr. · May 27, 2026

SupabasePostgrespgvectorRAGTypeScript

Why I Kept Search Scope Inside a Single Supabase RPC

I rewrote search so the vector, the scope filter, and the candidate count travel together through one `search_embeddings` RPC. The database applies the metadata predicate inside SQL, and the same contract drives the pgvector HNSW index on the embedding column.

Daniel Anthony Romitelli Jr. · April 16, 2026

semantic-kernelazure-foundrypromptflowagent-orchestrationstate-managementworkflow-automation

The AgentGroupChat Pattern That Keeps the Mapper from Drifting

I rebuilt the orchestration around a narrow AgentGroupChat loop in the workflow analyzer SaaS: analyzer, mapper, generator, validator, with state rehydration before the run and persistence after it. The result is a system that can reject bad structure locally, retry from the right step, and resume from prior history instead of starting blind.

Daniel Anthony Romitelli Jr. · April 16, 2026

RAGSupabaseNext.jsTypeScriptblog pipelineretrievalcodebase indexing

Coverage Before Creativity: The RAG Gate That Keeps My Blog Pipeline Honest

Before the generator writes a paragraph, a three-lane query fan-out, file-path-aware dedupe, a sufficiency threshold, and pinned excerpts decide whether the retrieval actually covered enough of the repository to deserve a draft. The most useful thing this pipeline does is reject topics whose evidence set is too thin.

Daniel Anthony Romitelli Jr. · April 14, 2026

typescriptml-systemsscoringoptimizationvideo

The Reward Calibrator That Learns the Shape of Its Own Judgment

I built a reward calibrator that sits above the scoring signals and searches for better weights instead of hand-tuning them by feel. It normalizes each signal, combines them into a composite score, and evaluates candidate weight sets against benchmark data so the system can compare itself to its own baseline.

Daniel Anthony Romitelli Jr. · April 14, 2026

pythonstartupconfigurationdesktop-appopenai

The Startup Gate That Makes a Python App Feel Native

I rewrote the post around the real startup path in `app/yapper.py`: the dependency check, the import order, and the shared settings path in `app/core/config.py`. The result stays on one concrete system behavior instead of speculative packaging details.