🚧 Early Development

LLM-orchestrated debugging for distributed apps

Turn fragmented infrastructure into one intelligent system. Natural language queries, AI-powered analysis, live debugging across your entire mesh.

$ coral ask "What's wrong with the API?"
🤖 Analyzing...
API latency spiked 3 minutes ago. P95 went from 150ms to 2.3s.
95% of time spent in db.QueryOrders()
Query doing sequential scan of 234k rows.
Missing index on orders.user_id (85% confidence)

Recommendation:
CREATE INDEX idx_orders_user_id ON orders(user_id);
⏱️ <1 second analysis using your own LLM
• <1s root cause analysis
• Zero mandatory code changes
• 100% your infrastructure, your AI
• Any AI assistant via MCP
• ∞ environments supported

The Problem

Your app runs across fragmented infrastructure: laptop, VMs, Kubernetes clusters, multiple clouds, VPCs, on-prem.

πŸ”Debug an issue

Check logs, metrics, traces across multiple dashboards

πŸ›Find the root cause

Add logging, redeploy, wait for it to happen again

🌐 Debug across environments

Can't correlate laptop dev with prod K8s cluster

πŸ”Run diagnostics

SSH to different networks, navigate firewalls, VPN chaos

The Solution

Coral unifies this with an Application Intelligence Mesh: one CLI to observe, debug, and control your distributed app.

One Interface for Everything

πŸ‘οΈObserve

Passive, always-on data collection:

  • Zero-config eBPF metrics: Rate, Errors, Duration (RED)
  • OTLP ingestion: For apps using OpenTelemetry
  • Auto-discovered dependencies: Service connection mapping
  • Automatic baselining: Detect anomalies against historical trends
  • Efficient storage: Recent data local, summaries centralized
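The "automatic baselining" idea above can be sketched in a few lines: compute a window's P95 latency and flag it when it deviates sharply from the historical trend. This is an illustrative model only, not Coral's implementation; the nearest-rank percentile and the 3-sigma threshold are assumptions:

```python
import statistics

def p95(samples):
    """Return the 95th-percentile latency from a list of samples (ms)."""
    ordered = sorted(samples)
    # Nearest-rank percentile: the sample covering 95% of the data.
    idx = max(0, round(0.95 * len(ordered)) - 1)
    return ordered[idx]

def is_anomalous(current_p95, baseline_p95s, threshold=3.0):
    """Flag the current window if it deviates from the historical
    baseline by more than `threshold` standard deviations."""
    mean = statistics.mean(baseline_p95s)
    stdev = statistics.stdev(baseline_p95s)
    if stdev == 0:
        return current_p95 != mean
    return abs(current_p95 - mean) / stdev > threshold

# Historical P95s hovering around 150 ms, current window at 2300 ms.
history = [148, 152, 150, 155, 149, 151, 153, 150]
assert is_anomalous(2300, history)     # the spike from the demo is flagged
assert not is_anomalous(151, history)  # normal traffic is not
```

The same comparison works for rate and error counts, which is why a RED baseline can trigger deeper investigation without any per-service thresholds.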

πŸ”Explore

Human-driven investigation and control:

  • Query data: Metrics and traces across all services
  • Remote execution: Run diagnostics (netstat, tcpdump, lsof)
  • Manual probes: Attach/detach eBPF hooks on-demand
  • Traffic capture: Sample and inspect live requests
  • On-demand profiling: CPU/memory analysis in production

🤖 Diagnose

AI-powered insights & investigations:

  • Universal AI integration: Works with Claude Desktop, IDEs, any MCP client
  • AI Orchestration: Autonomous tool use for deep investigations
  • Natural Language Interface: Plain English queries via CLI or MCP
  • Real-time data access: AI queries live observability data
  • Automated Root Cause: Rapidly identifies source of incidents

MCP Integration: Use Any AI Assistant

Bring your own LLM - Claude Desktop, VS Code, Cursor, or custom apps

🤖 External AI (Claude • VS Code • Cursor)
⌨️ Coral Ask (built-in terminal AI)
    ↓ MCP Protocol
🧠 Colony Server (MCP server & analytics)
    ↓ Encrypted Mesh
👁️ Agents (eBPF & OTLP collection)
    ↓ Instrumentation & OTEL
📦 Application (SDK & runtime)

🔌 Any MCP Client

Claude Desktop, IDEs, or custom apps via standard MCP protocol

🔑 Your LLM, Your Keys

Use Anthropic, OpenAI, Ollama - you control the AI and costs

⚡ Real-time Queries

AI queries live data from Colony's DuckDB, not stale snapshots
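As a concrete illustration of what wiring up an MCP client can look like: Claude Desktop reads an `mcpServers` map from its `claude_desktop_config.json`. The `coral` command and its arguments below are hypothetical placeholders for illustration, not documented flags:

```json
{
  "mcpServers": {
    "coral": {
      "command": "coral",
      "args": ["mcp", "serve"]
    }
  }
}
```

Any MCP client that accepts a launch command can be pointed at the same server the same way.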

What Makes Coral Different?

The first LLM-orchestrated debugging mesh for distributed apps

01

Unified Mesh Across Infrastructure

Debug apps running on laptop ↔ AWS VPC ↔ GKE cluster ↔ on-prem VM with the same commands. No VPN config, no firewall rules, no per-environment tooling.

02

On-Demand Live Debugging

Attach eBPF uprobes to running code without redeploying. LLM decides where to probe based on analysis. Zero overhead when not debugging.

03

Universal AI via MCP

Works with any AI assistant through standard MCP protocol. Claude Desktop, VS Code, Cursor, or custom apps. Bring your own LLM (Anthropic/OpenAI/Ollama). Your data stays in your infrastructure.

04

Decentralized Architecture

No Coral servers to depend on. Colony runs wherever you want: laptop, VM, Kubernetes. Your observability data stays local.

05

Control Plane Only

Coral can't break your apps and adds zero baseline overhead. Probes run only when debugging; the mesh is for orchestration and never touches the data plane.

06

Application-Scoped

One mesh per app (not infrastructure-wide monitoring). Scales from single laptop to multi-cloud production.

How It Works

From observability to insights - a complete journey through Coral's architecture

1

Observe Everywhere

Progressive integration levels - start with zero-config, add capabilities as needed

Level 0

📡 eBPF Probes

Zero-config RED metrics · No code changes required

Level 1

🔭 OTLP Ingestion

Rich traces if using OpenTelemetry · Optional

Level 2

⚡ Shell/Exec

LLM-orchestrated diagnostic commands · Auto-enabled

Level 3

🎯 Live Probes

On-demand instrumentation · Full control

Agents collect locally
2

Aggregate Intelligently

Colony receives and stores data from all agents across your distributed infrastructure

→ DuckDB storage for fast analytical queries
→ Cross-agent correlation discovers dependencies
→ Encrypted mesh connects fragmented infrastructure
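To make "fast analytical queries" concrete, here is the shape of a cross-agent latency rollup. The schema and data are invented for illustration, and Python's built-in sqlite3 stands in for DuckDB:

```python
import sqlite3

# One spans table aggregated from every agent in the mesh (illustrative).
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE spans (agent TEXT, service TEXT, duration_ms REAL)")
db.executemany(
    "INSERT INTO spans VALUES (?, ?, ?)",
    [("laptop", "api", 140), ("k8s-prod", "api", 2300),
     ("k8s-prod", "api", 2100), ("laptop", "db", 90)],
)
# Correlate across agents: same service, very different latency by location.
rows = db.execute(
    "SELECT agent, service, AVG(duration_ms) FROM spans "
    "GROUP BY agent, service ORDER BY 3 DESC"
).fetchall()
```

A single analytical store is what lets one query compare a laptop dev environment against a production cluster.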
MCP Server exposes tools
3

Query with AI

Colony exposes MCP server for universal AI integration

→ Works with any MCP client: Claude Desktop, VS Code, Cursor, custom apps
→ Bring your own LLM: Anthropic, OpenAI, or local Ollama
→ Natural language queries: "Why is checkout slow?" instead of PromQL
→ AI orchestrates tool calls: Queries metrics, traces, topology automatically
→ Real-time data: Live observability, not stale dashboards
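The tool-orchestration step can be sketched as a loop in which each tool result determines the next call. Everything here (tool names, return values) is invented to show the control flow, not Coral's actual MCP tool surface:

```python
# Stub tools standing in for MCP tool calls (illustrative data).
def query_metrics(service):
    return {"p95_ms": 2300, "baseline_ms": 150}

def query_traces(service):
    return {"slowest_span": "db.QueryOrders", "share_of_time": 0.95}

TOOLS = {"query_metrics": query_metrics, "query_traces": query_traces}

def run_investigation(service):
    """Simulate the orchestration loop: the model asks for tools one at
    a time, and each result informs the next call."""
    evidence = {}
    # Step 1: the model checks metrics first.
    evidence["metrics"] = TOOLS["query_metrics"](service)
    # Step 2: metrics show a spike, so the model drills into traces.
    if evidence["metrics"]["p95_ms"] > 10 * evidence["metrics"]["baseline_ms"]:
        evidence["traces"] = TOOLS["query_traces"](service)
    return evidence

result = run_investigation("api")
```

The real loop is driven by the LLM choosing tools, but the shape is the same: gather evidence, branch on what it shows, stop when the cause is pinned down.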
Insights delivered
4

Act on Insights

Get actionable recommendations in natural language, execute with approval

→ Root cause analysis in <1 second
→ Actionable recommendations with evidence
→ Human-approved execution for safety
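"Human-approved execution" reduces, at its simplest, to a gate between recommendation and action. A minimal sketch, with the function name and prompt format as illustrative assumptions:

```python
def propose_fix(sql, approve):
    """Run a recommended command only after explicit human approval.
    `approve` is a callable so the gate works both in tests and
    interactively (e.g. pass `lambda msg: input(msg) == "y"`)."""
    prompt = f"Run this?\n  {sql}\n[y/N] "
    if approve(prompt):
        return ("executed", sql)
    return ("skipped", sql)

fix = "CREATE INDEX idx_orders_user_id ON orders(user_id);"
status, _ = propose_fix(fix, approve=lambda _: False)  # denied by default
```

Defaulting to "skipped" keeps the AI advisory: it can recommend a schema change, but nothing runs without a human saying yes.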

See It In Action: Live Debugging

When basic metrics aren't enough, Coral automatically escalates to live instrumentation

$ coral ask "Why is the payment API slow?"
🤖 Analyzing payment service metrics...
P95 latency: 2.3s (baseline: 150ms)
Root cause unclear from metrics. Attaching live probes...
✓ Uprobe attached: payment.ProcessPayment() [offset 0x4a20]
✓ Uprobe attached: payment.ValidateCard() [offset 0x4c80]
✓ Uprobe attached: db.QueryTransactions() [offset 0x3f10]
Collecting traces for 30 seconds...
Analysis:
• ProcessPayment(): 2.1s avg (2,847 calls)
└─ db.QueryTransactions(): 2.0s (95% of time)
└─ Query plan: Sequential scan (234,891 rows)
└─ Missing index on transactions.user_id
• ValidateCard(): 12ms avg (normal)
Root Cause: Missing database index causing slow queries

Recommendation:
CREATE INDEX idx_transactions_user_id ON transactions(user_id);
Detaching probes...
✓ Cleanup complete (zero overhead restored)
What just happened? Coral used eBPF metrics to detect the issue, then automatically attached live uprobes to running code (Level 3 integration). After collecting data, it identified the exact bottleneck and recommended a fix, all without redeploying or restarting anything.
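The effect of the recommended index is easy to reproduce locally. Using Python's built-in sqlite3 as a stand-in for the production database (table and index names copied from the transcript above), the query plan flips from a full scan to an index search:

```python
import sqlite3

db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE transactions (id INTEGER, user_id INTEGER)")

def plan(sql):
    # Column 3 of each EXPLAIN QUERY PLAN row holds the readable detail.
    return " ".join(row[3] for row in db.execute("EXPLAIN QUERY PLAN " + sql))

query = "SELECT * FROM transactions WHERE user_id = 42"
before = plan(query)  # full table scan over every row
db.execute("CREATE INDEX idx_transactions_user_id ON transactions(user_id)")
after = plan(query)   # index lookup on user_id
```

The exact plan wording varies by engine, but the before/after shift from "scan" to "search using index" is the same behavior the transcript diagnoses.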

Want to See the Complete Architecture?

View the detailed system architecture diagram with complete data flow

🧠

Colony

Central coordinator with MCP server, DuckDB storage, and AI orchestration

πŸ‘οΈ

Agents

Local observers using eBPF, OTLP, and shell commands to gather telemetry

⚙️

SDK (Optional)

Advanced features like live probes and runtime instrumentation

All connected via an encrypted WireGuard mesh that works across any network boundary.

🚧 Early Development

Coral is an experimental project currently in active development.

Stay tuned for future updates.

Contact