AI developer glossary

AI Developer Tool Glossary

Plain-English definitions for the terms developers and technical buyers encounter while comparing AI coding assistants, agent platforms, RAG infrastructure, LLM API pricing, and developer SaaS alternatives.

AI Coding AI Agents RAG API Costs LLMOps SaaS Alternatives

24 definitions

Practical terms for AI software evaluation

Browse all guides

AI Coding

AI Coding Assistant

A developer tool that helps write, edit, explain, test, or review code with language models and repository context.

Why it matters: Teams should evaluate coding assistants by accepted diffs, review quality, private code controls, and fit with existing workflows.

AI coding tools pricing Cursor vs Copilot vs Windsurf

AI Agents

AI Agent Platform

A platform for orchestrating model calls, tools, memory, approvals, traces, and retries across multi-step AI workflows.

Why it matters: Agent platforms become valuable when automation needs observability, permissions, evaluation, and safe rollback.

Best AI agent platforms AI agent topic cluster

AI Agents

Agentic Workflow

A workflow where a model plans or executes several steps, often using tools, retrieved context, validation, and human approval.

Why it matters: Agentic workflows can multiply cost and risk because one user request may trigger many model and tool calls.

AI agent cost calculator Agent security checklist

AI Agents

Tool Call

A structured request from a model or agent to an external function, API, database, file system, or application action.

Why it matters: Tool calls need permission boundaries, validation, logging, budget controls, and approval gates for risky actions.

Agent build vs buy Agent security checklist

AI Agents

Human-in-the-Loop Approval

A control pattern where a person reviews or approves an AI action before the system performs a risky or irreversible step.

Why it matters: Approval can improve safety, but it also adds cost and latency that should be included in automation ROI.

Agent cost calculator Agent platform guide

RAG

Retrieval augmented generation, a pattern where a system retrieves relevant context before asking a model to generate an answer.

Why it matters: RAG quality depends on retrieval, permissions, chunking, citations, and refusal behavior, not only the language model.

RAG and vector database guides RAG evaluation checklist

RAG

Vector Database

A search backend optimized for storing embeddings and finding semantically similar documents, chunks, products, or records.

Why it matters: The right vector database affects retrieval quality, latency, cost, metadata filtering, and operational complexity.

Vector database comparison Pinecone vs Weaviate vs Qdrant

RAG

Embedding

A numeric representation of text, code, image, or structured data that lets search systems compare semantic similarity.

Why it matters: Embedding choice affects retrieval quality, storage cost, re-indexing effort, and how well queries match real documents.

Vector database comparison Open source LLM vs API cost

RAG

Hybrid Search

A retrieval method that combines keyword search with vector search to handle exact terms and semantic intent together.

Why it matters: Hybrid search is important for product names, error codes, API methods, legal clauses, and other exact-match queries.

Vector database vendor comparison pgvector vs dedicated vector database

RAG

Metadata Filtering

Filtering retrieval results by fields such as tenant, user role, document type, language, timestamp, or product area.

Why it matters: Weak metadata filtering can return context that is irrelevant, outdated, or not authorized for the current user.

RAG evaluation checklist Supabase alternatives for AI apps

RAG

Reranking

A second retrieval step that reorders candidate documents or chunks so the most useful context reaches the final model prompt.

Why it matters: Reranking can improve answer quality, but it adds latency and cost that should be tested against real queries.

RAG evaluation checklist Vector database comparison

RAG

Retrieval Grounding

The practice of requiring generated answers to rely on retrieved evidence instead of unsupported model memory.

Why it matters: Grounding improves trust only when citations, retrieved chunks, and refusal behavior are tested together.

RAG evaluation checklist RAG topic cluster

API Costs

LLM API Cost

The recurring cost of model API usage, usually driven by input tokens, output tokens, model choice, retries, and workload volume.

Why it matters: API costs should be modeled per feature because a summarizer, support assistant, RAG workflow, and agent have different token shapes.

LLM API Cost Calculator API cost calculator playbook

API Costs

Input Token

A unit of text sent to a model, including system instructions, user messages, chat history, retrieved context, and tool schemas.

Why it matters: Input tokens can dominate cost when systems repeat long prompts or send too many retrieved chunks.

LLM API pricing comparison LLM API Cost Calculator

API Costs

Output Token

A unit of text generated by a model, including explanations, code, JSON, summaries, answers, or tool arguments.

Why it matters: Long responses and repeated repair attempts can make output tokens a major cost driver.

API cost calculator playbook LLM API pricing comparison

API Costs

Context Window

The maximum amount of input and output a model can process in one request, usually measured in tokens.

Why it matters: Large context windows can simplify workflows, but sending repeated long context can raise cost and latency.

Open source LLM vs API cost LLM API pricing comparison

API Costs

Model Routing

A strategy that sends different tasks to different models based on complexity, cost, latency, quality, or fallback needs.

Why it matters: Routing can reduce cost when simple tasks use smaller models and high-risk tasks use stronger models.

LLM API pricing comparison API cost calculator playbook

API Costs

Cache Hit Rate

The share of model requests that can reuse cached prompts, context, or intermediate results instead of paying full repeated processing cost.

Why it matters: A higher cache hit rate can reduce LLM API spend for repeated instructions, stable retrieved context, and common workflows.

LLM API Cost Calculator API cost calculator playbook

LLMOps

Prompt Versioning

Tracking prompt changes over time so teams can compare behavior, debug regressions, and roll back unsafe releases.

Why it matters: Prompt versioning turns AI changes into reviewable releases instead of invisible production behavior shifts.

LLMOps platforms comparison LLM observability tools

LLMOps

Evaluation Dataset

A curated set of prompts, inputs, expected behavior, and edge cases used to test model, prompt, or retrieval changes.

Why it matters: Evaluation datasets catch regressions before a change affects users or customers.

LLMOps platforms comparison RAG evaluation checklist

LLMOps

LLM Observability

Monitoring and tracing model behavior, prompts, retrieval inputs, tool calls, latency, cost, and user feedback.

Why it matters: LLM observability helps teams diagnose bad answers, cost spikes, schema failures, and prompt regressions.

LLM observability checklist LLMOps platform comparison

SaaS Alternatives

SaaS Alternative

A competing software product or architecture that can replace an existing SaaS tool while changing cost, workflow, or control.

Why it matters: Alternatives should be compared by migration cost, reliability, integrations, security, and long-term operations, not only price.

Developer tool alternatives SaaS alternatives topic cluster

SaaS Alternatives

Vendor Lock-In

Dependence on a vendor through proprietary APIs, data formats, saved workflows, integrations, pricing, or team habits.

Why it matters: Lock-in is not always bad, but buyers should know the cost of leaving before a tool becomes critical infrastructure.

Developer tool alternatives Vercel alternatives for Next.js

SaaS Alternatives

Data Portability

The ability to export data, configuration, logs, users, permissions, traces, documents, or workflows from a platform.

Why it matters: Data portability lowers migration risk and gives teams more negotiating power when pricing or requirements change.

Supabase alternatives for AI apps Developer tool alternatives