Skip links

Backend & Infrastructure

Building for Scale: Architecture Decisions That Compound

In the past 5 years, Harbor Software has built systems that serve hundreds of requests per minute and systems that serve hundreds of thousands. The difference in architecture between these two scales is not what most people expect. It is not about using Kubernetes instead
Physical Kanban board with colorful cards and arm moving card to Done column

The Architecture of FlowBoard: Building an Agency OS

FlowBoard is the internal operating system we built to run Harbor Software’s agency operations. It manages client projects, team capacity, time tracking, invoicing, resource allocation, and reporting across 15-20 concurrent client engagements. This post is a technical deep dive into how we designed and built
Futuristic control room with model routing paths on curved display

Multi-Model AI Architectures: Routing, Fallbacks, and Cost Control

Using a single AI model for every task is the equivalent of using a single database for every data pattern. It works until it does not. GPT-4o is excellent at complex reasoning but costs 15x more than GPT-4o-mini for simple classification tasks. Claude 3.5 Sonnet

Edge Computing for AI Inference: When and Why

The Latency Problem That Started Everything A client came to us with a straightforward request: add real-time defect detection to their manufacturing line. The camera captures a part every 200 milliseconds. A classification model determines pass or fail. The constraint: the entire pipeline — image

Designing APIs for AI Agents: New Patterns for a New Consumer

AI agents are a new category of API consumer. They are not humans clicking through a UI, and they are not scripts following a predetermined sequence of API calls. They are semi-autonomous programs that decide at runtime which endpoints to call, in what order, and
Two boxing gloves facing each other under dramatic spotlight

GraphQL vs REST in 2025: The Debate Is Over

The GraphQL vs REST debate has been running since Facebook open-sourced GraphQL in 2015. Ten years later, the answer is clear — and it is not the answer either camp expected. Neither technology won. Both survived, but they settled into distinct niches where each is

From Monolith to Microservices: A Pragmatic Migration Path

Everyone has an opinion about microservices. The internet is full of conference talks about how Netflix split their monolith and blog posts about how you should not split yours. Both sides are right, depending on context. At Harbor Software, we completed a 14-month migration of
Glass container about to overflow with pressure gauges reading red

Load Testing AI Applications: Unique Challenges and Solutions

Load testing a traditional web application is well-understood: generate a realistic traffic pattern, measure response times and error rates at increasing load, find the breaking point, optimize. Load testing an AI application is a different beast entirely. The response times are orders of magnitude longer
Explore
Drag