AI & Machine Learning — Harbor Software Blog

Designing APIs for AI Agents: New Patterns for a New Consumer

AI agents are a new category of API consumer. They are not humans clicking through a UI, and they are not scripts following a predetermined sequence of API calls. They are semi-autonomous programs that decide at runtime which endpoints to call, in what order, and

From Monolith to Microservices: A Pragmatic Migration Path

Everyone has an opinion about microservices. The internet is full of conference talks about how Netflix split their monolith and blog posts about how you should not split yours. Both sides are right, depending on context. At Harbor Software, we completed a 14-month migration of

Glass container about to overflow with pressure gauges reading red

Load Testing AI Applications: Unique Challenges and Solutions

Load testing a traditional web application is well-understood: generate a realistic traffic pattern, measure response times and error rates at increasing load, find the breaking point, optimize. Load testing an AI application is a different beast entirely. The response times are orders of magnitude longer

Circuit board engine bay with scanning beam revealing red vulnerabilities

AI-Powered Vulnerability Scanning: How VibeGuard Works Under the Hood

When we started building VibeGuard eighteen months ago, the vulnerability scanning market was dominated by tools that relied almost exclusively on signature-based detection. Snyk, Semgrep, and SonarQube all do excellent work matching known patterns, but they share a fundamental limitation: they can only find vulnerability

Thick book compressed between glass plates with single glowing page emerging

AI-Powered Content Summarization: Architecture and Trade-offs

Content summarization sounds simple: take a long document, produce a short version. In practice, building a summarization system that works reliably across document types, handles edge cases without hallucinating, respects length constraints consistently, and scales to thousands of documents per day is a genuine engineering

Friendly robot hand offering gift-wrapped box to human hand

Building Recommendation Systems That Don’t Feel Creepy

Recommendation systems are the most commercially impactful application of machine learning. Netflix estimates their recommendation engine saves them $1 billion per year in reduced churn. Amazon attributes 35% of revenue to recommendations. Spotify’s Discover Weekly has become the primary way 40 million users find new

The Engineering Behind RSS Feed Intelligence

RSS is the cockroach of web technologies. It was declared dead a decade ago, and yet it quietly powers an enormous amount of the internet’s information infrastructure. Podcast directories, news aggregators, financial data feeds, government publication systems, academic paper repositories — they all run on

Milvus vs Pinecone vs pgvector: Choosing a Vector Database

The Vector Database Decision If you’re building anything with embeddings — semantic search, RAG pipelines, recommendation engines, image similarity — you need somewhere to store and query vectors. The market has exploded with options, and the decision isn’t obvious. We’ve deployed all three of the

RAG Architecture Patterns: Beyond Basic Document Q&A

Every team building with LLMs eventually arrives at the same place: the model needs access to private data, and fine-tuning is either too expensive, too slow, or too rigid. Retrieval-Augmented Generation (RAG) is the answer most reach for. The problem is that most RAG implementations