AI Pilot

The AI that knows your infrastructure
before you type a word.

Ask questions, request actions, or delegate complete workflows. AI Pilot reads your full cloud state at session start and operates with your explicit approval — every step, every time.

93+ tools 5-layer governance Consent before action Full audit trail Multi-model
📹 Video file: assets/aipilot-hero.mp4

Rename your existing AI Pilot screen recording to aipilot-hero.mp4 and place it in the assets/ folder. It will autoplay muted on loop.
Prefer a static screenshot instead? Replace the video tag with <img src="assets/aipilot-workspace.webp">.

How it works

Three operational modes

Choose the level of autonomy for each task. Switch modes mid-session without losing context.

Ask Mode

Read-only. No exceptions.

Questions and analysis only — nothing is touched

AI Pilot reads your infrastructure and answers questions about what it sees. No commands run, no configurations change. Use this to investigate, diagnose, and understand your environment.

Why is CPU at 95% on prod-web-01?
What changed on this server in the last 48 hours?
Explain this error in my Nginx access log.
Plan Mode

See every step before it runs.

Full execution plan generated and presented for review

Request an action and AI Pilot maps out the complete sequence before anything executes. You review each step, approve or modify, then confirm. Nothing runs until you say so.

Plan a zero-downtime migration from DigitalOcean to AWS Frankfurt.
Show me how you'd harden this server's SSH configuration.
Plan nightly automated backups for all three production servers.
Agent Mode

Full execution, with gates.

End-to-end task completion — destructive steps require explicit confirmation

AI Pilot handles the full task sequence. Standard operations proceed. Destructive or irreversible steps stop and require you to type the exact action name before proceeding — no clicking through.

Provision a 4-core Ubuntu 22.04 on GCP London with Node.js, PostgreSQL and Nginx.
Deploy main to prod-api-01, zero-downtime, rollback if health checks fail.
Set up CI/CD for repo api-service across all three staging servers.
In-product experience

A dedicated workspace — not a floating button.

AI Pilot has its own full-screen environment inside CloudAIPilot, with persistent conversation history and session context wired to every server, service, alert, and metric you have connected. Your entire cloud, in one conversation.

AI Pilot workspace interface showing chat, sidebar and infrastructure context

Persistent session history

Conversations are saved and searchable. Refer back to past decisions and executions.

Model + context selector

Switch models or scope the AI to a specific server, team, or project mid-session.

Live infrastructure state

Every message is answered with current data — not cached snapshots.

93+ tools, always available

All tool domains are active unless you've disabled them in governance settings.

Infrastructure awareness

It arrives briefed.

Before your first message, AI Pilot has already read your infrastructure state — live, from your connected cloud accounts. Every answer is grounded in what's actually running right now.

All connected servers

Specs, region, OS, installed packages, running services, SSL certificate status, and current health.

Real-time & historical metrics

CPU, memory, disk, network, I/O — live and up to 30 days back. Ask about a spike from last Tuesday.

Logs and events

Nginx, Apache, application logs, and system events — searchable and correlated with infra state.

Deployment and change history

What deployed, when, by whom — plus config changes, backup outcomes, and alert history.

Active alerts and their context

Current alert state correlated with the events and deployments that likely caused them.

Cost data by service and provider

Current spend, trends, and waste signals from all connected cloud accounts.

AI Pilot Ready
You
What's the current status of my production stack?
AI Pilot
3 production servers online. prod-web-01 (GCP eu-west1, 4 vCPU) — memory at 92%, worth watching. prod-api-01 (AWS us-east-1) is healthy, last deployed 6h ago. prod-db-01 (DigitalOcean NYC3) — disk at 78%, no immediate concern but trending up. No active critical alerts. Last backup completed 3h 12m ago across all three.
⚠ mem-high: prod-web-01 ✓ no active alerts ✓ backup ok
You
What's causing the memory issue on prod-web-01?
AI Pilot
The memory climb started at 14:32 — 18 minutes after the user-service container was updated to v2.2.0. That version's baseline footprint is 340MB larger than v2.1.3. Current available memory: 180MB. I can restart the container to recover ~1.2GB, or roll back to v2.1.3. Which would you prefer?
Execution model

Plans before acting. Confirms before executing.

No AI action runs without your review. Every execution starts with a proposal — a numbered sequence of exactly what will happen, in what order, and why.

1
You send a command
Natural language, in any mode. No special syntax required.
2
AI builds a proposal
Numbered steps, estimated time, risk annotations on each action.
3
You review & approve
Accept as-is, reject individual steps, or ask for modifications.
4
Type to confirm destructive steps
Deletions, restarts, and migrations require typing the exact action name. Not a checkbox.
5
Execute & audit
Real-time progress streaming. Every step signed and written to Activity Center.
AI Pilot execution plan proposal

Step-by-step execution proposal before anything runs

AI Pilot confirmation gate for destructive action

Type-to-confirm gate on destructive operations

AI Governance

Purpose-built governance —
not an afterthought.

Most AI tools add a confirmation dialog and call it safe. CloudAIPilot's AI governance layer was designed from the ground up for cloud infrastructure operations — where a single wrong action can take down a production service, corrupt a database, or delete years of data. After extensive research and real-world testing across production environments, we built a five-stage evaluation architecture that sits between AI intent and your infrastructure. Every tool call passes through it. No exceptions.

01

Master Kill Switch

One toggle disables all AI-initiated actions across the entire platform. Reads and questions remain available. No execution can occur while the master switch is off — no overrides, no exceptions, not even for admins.

02

Domain Toggles

Each operational domain — Server Operations, App Deployments, Database Operations, Network Changes, Storage Operations — has an independent on/off switch. Disable database tools entirely while keeping server tools active. Supports zero-trust segmentation for multi-role teams.

03

RBAC Integration

AI tool access is bound to the same role permissions as manual actions. A team member with read-only access cannot trigger AI operations they couldn't perform manually. There is no privilege escalation path through the AI layer — the governance system checks the acting user's role, not the AI's capabilities.

04

Production Protection Flags

Individual servers, databases, and resources can be flagged as production-protected. Protected resources receive stricter confirmation requirements, are excluded from bulk operations, and surface additional warnings on any change — even from admin roles.

05

Confirmation Gates

Destructive operations — deletes, restarts, data migrations, configuration overwrites — require typing the exact action name before execution begins. This is not a checkbox or a "yes, proceed" button. It requires deliberate, specific input that cannot be automated, scripted, or bypassed.

AI governance settings panel showing kill switch and domain toggles
Capabilities

93+ tools across 9 domains

Every tool is purpose-built for cloud infrastructure, tested against real environments, and governed by the access policies you configure. None of them run without your approval.

Server Operations

18 tools

Provision, configure, restart, resize, migrate, SSH access, package management, service control, and user management.

Resize prod-web-01 to 8GB RAM without downtime
Install and configure Redis 7 on the staging server

Application Deployment

14 tools

Git-based deploy, rollback, build pipelines, env var management, zero-downtime strategies, and deployment history.

Deploy release/2.4 to staging, promote if smoke tests pass
Roll back prod-api-01 to the last stable deployment

Docker & Containers

12 tools

Image management, Compose stacks, container log inspection, service updates, volume management, and registry auth.

Update the API container to v2.1.3 and restart the stack
Show the last 50 log lines from the worker container

Database Operations

11 tools

Query, optimize, backup, restore, and monitor PostgreSQL, MySQL, and MongoDB. Slow query analysis and index review.

Show the 5 slowest queries on prod-db-01 in the last 24h
Restore prod-db-01 to last night's 00:00 snapshot

Security & Hardening

10 tools

Firewall audit, SSH key management, open port scanning, SSL certificate monitoring, fail2ban, and sudoers review.

Audit all firewalls and flag unnecessary open ports
Check SSL certificate expiry for all my domains

Monitoring & Diagnostics

9 tools

Read metrics, correlate events with deployment history, investigate alerts, tail logs, and run diagnostic sequences.

What caused the memory spike on prod-api-01 at 2:30 AM?
Is there a pattern in the 502 errors from last week?

Backups & Recovery

7 tools

Trigger manual backups, restore from point-in-time, verify backup file integrity, and manage retention policies.

Restore the database to yesterday at midnight
Verify all backups from the last 7 days are intact

Git & CI/CD

8 tools

Git provider integration, branch-based deployment, webhook triggers, pipeline management, and deployment history tracking.

Set up CI/CD for repo api-service across all staging servers
Configure auto-deploy on every push to the main branch

FinOps & Cost Analysis

6 tools

Query spend by service and provider, identify waste patterns, compare instance pricing, and generate cost reports.

What are my top 3 cost drivers this month?
Which servers could be downsized based on actual usage?
Model support

The same governance layer. Your choice of model.

AI Pilot routes requests through whichever model you've configured. Every model connects to the same tool library and governance layer. Switch mid-session or set a default per-organization.

Claude (Anthropic)

Strong infrastructure reasoning, careful multi-step planning, and long-context analysis across complex operations.

OpenAI

Strong general reasoning and fast responses for routine operations and quick queries.

Google Gemini

Available for teams with existing Gemini API access. Connects to the same tool and governance layer.

Ollama

Run models locally on your own hardware. No data leaves your environment. Zero external API calls.

Enterprise only

Your data is not used to train AI models. CloudAIPilot connects to AI providers through their official APIs only — not consumer products. Per their published API usage policies, Anthropic, OpenAI, and Google do not train their models on data submitted through API calls. This is a contractual commitment in their API terms, not just a preference setting.

CRITICAL · prod-api-01 · Memory 96% just now
prod-api-01 · AWS us-east-1 · t3.medium · 4GB RAM
AI Pilot · Diagnosing

Memory spike started at 14:32 — 18 minutes after user-service was updated to v2.2.0. That version's container baseline is 340MB larger than v2.1.3. Current free memory: 180MB. Two options: restart the container to recover ~1.2GB of leaked heap, or roll back to v2.1.3 entirely.

Restart user-service container (estimated recovery: 1.2GB)
Monitor memory for 5 minutes post-restart
If memory doesn't stabilise: roll back to v2.1.3
Incident response

From alert to diagnosis in seconds.

When an alert fires, AI Pilot can be the first responder. It reads the full alert context — correlated metrics, deployment timeline, recent changes, log entries — and presents a specific diagnosis and remediation plan.

Not generic advice. Infrastructure-specific analysis grounded in what's actually running in your environment.

Correlates with deployment history

Knows what changed and when — so it can point to the likely cause immediately, not generically.

Proposes concrete steps

Not "check your memory usage" — specific actions targeting your specific servers and services.

Requires your approval

Presents the plan. Nothing executes until you confirm — even at 3 AM when you're half-asleep.

"The fastest way to understand it is to use it."

Try AI Pilot free.

Available on every plan, including the Forever Free tier. Connect your cloud accounts in minutes.

Sign up free · Connect your existing AWS, GCP, Azure or DigitalOcean account · AI usage is billed from your Cloud Wallet