4
IFO4
Sign InJoin IFO4
AgentaaS OS
AI/ML Cost Governance
Phase 1
Step 1/13
00:00
0/700
✕ Exit
AgentaaS OS
IFO4 PLAYGROUND
P1
1
Inventory LLM API Usage
2
Map Model-to-Team
P2
3
Calculate Cost-Per-Token by Model
4
Find Production vs Test Split
P3
5
Create AI FinOps RACI
6
Set Model Selection Policy
7
Define Budget Gates
P4
8
Implement Response Caching
9
Switch Experiments to Smaller Models
10
Set Token Budget Limits
P5
11
Build AI Spend Dashboard
12
Create Monthly Review Process
P6
13
AI FinOps Executive Summary
Initiatives4
Capital Under Change$5.2M
Health Score44/100
Waste %42.1%
Value at Risk$14.8M
Phase 1: Discovery
Inventory LLM API Usage
ANALYTICS15 pts
Spend Trend
SITUATION

Run the AI spend audit in AgentaaS OS. API call logs aggregated over 30 days show 847 million tokens consumed across 6 models. The top cost driver is GPT-4 Turbo (512M tokens, $153,600). Claude 3 Opus accounts for $48,000 (160M tokens). Gemini Ultra: $28,800 (96M tokens). 40% of calls are from Jupyter notebooks that were never promoted to production.

Health
44/100
Waste
42.1%
Spend
$240K/mo
Savings
$28K
AGENT INSIGHT

Cost Optimizer: model tier matching for experiments reduces monthly spend from $240K to $144K. Production models (GPT-4 Turbo) remain unchanged. Experiment models forced to Haiku and GPT-3.5.

DECISION REQUIRED

The audit shows 40% of spend ($96K) is from experiment notebooks using frontier models. What is the highest-ROI immediate action?

Hint: AI FinOps: match model capability to task maturity. Experiments use small models; production uses the right model for the job.