Gaining Fleet-Wide Visibility with the VCF Operations API

Published on 13 December 2025 at 00:25

Information

While the SDDC Manager acts as the "brain" for infrastructure orchestration, VMware Cloud Foundation (VCF) 9.0 Operations (formerly part of the Aria Suite) is the "nervous system". It provides the deep visibility, predictive analytics, and performance monitoring required to manage a modern private cloud at scale.

For developers, the VCF Operations API (often referred to as the suite-api) is the key to bypassing the dashboard and pulling real-time infrastructure data directly into custom applications, reporting tools, or Slack-based alerting bots.

The "Suite-API" Architecture

The Operations API is a comprehensive RESTful interface hosted at: https://<operations-fqdn>/suite-api/api

Unlike the SDDC Manager API, which focuses on lifecycle tasks like "creating a domain," the Operations API is built for data consumption and remediation. It allows you to query thousands of objects—from physical hosts and switches to virtual machines and containers—across your entire VCF fleet.

Core API Capabilities

  • Resource Management: List every object managed by VCF and retrieve its unique UUID.

  • Metric Retrieval: Access live and historical performance data (CPU, Memory, IOPS).

  • Alert & Symptoms: Programmatically acknowledge alerts or retrieve a list of active "symptoms" affecting your clusters.

  • Super Metrics: Define and calculate custom KPIs using the API to aggregate data across multiple domains.


Authentication: Acquiring an Ops Token

VCF Operations requires a specific token-based authentication flow.

  1. POST Request: Send your credentials to the /api/auth/token/acquire endpoint.

  2. Response: The system returns an ops-token.

  3. Usage: In all subsequent requests, use the header: Authorization: OpsToken <vROps_token>.

Note: These tokens typically expire after 6 hours.


Practical Use Case: Pulling Real-Time Metrics

One of the most common developer tasks is pulling specific metrics for a resource. For example, to check the IOPS of a specific Datastore:

  1. Find the Resource UUID: Query /api/resources using the datastore name as a filter.

  2. Identify Stat Keys: Use /api/resources/{id}/statkeys to find the exact key for IOPS (e.g., datastore|iops).

  3. Fetch Stats: Call the stats endpoint for that specific key.

Example: Fetching Stats with Python

import requests

# Step 1: Query metrics for a specific Resource UUID
resource_id = "c40bb51e-34f7-4864-b540-bbb8420d6288"
stats_url = f"https://<ops-fqdn>/suite-api/api/resources/{resource_id}/stats?statKey=cpu|usage_average"

headers = {
"Authorization": f"OpsToken {ops_token}",
"Accept": "application/json"
}

response = requests.get(stats_url, headers=headers)
print(response.json())

 

Advanced Reporting: Automated Chargeback

VCF 9.0 places a heavy emphasis on Cost Management. Using the /api/reports and /api/chargeback endpoints, you can automate the generation of monthly billing reports. This allows you to programmatically calculate the cost of a specific Workload Domain based on its CPU and Memory consumption.

Best Practices

  • Avoid "Over-Querying": Use the statKey parameter to request only the specific metrics you need, rather than pulling the entire object state.

  • Use Collector Groups: For large-scale environments, use the /api/collectorgroups endpoint to ensure your API queries are distributed across the operations cluster.

  • Batch Reports: Schedule heavy report generations during off-peak hours using the POST /api/reports endpoint to avoid impacting UI performance.

Conclusion

The VCF Operations API transforms the platform from a monitoring tool into an automation-ready data lake. Whether you are building a custom billing engine or a proactive auto-scaling script, the suite-api provides the granular visibility needed to run a professional-grade private cloud.

 


Add comment

Comments

There are no comments yet.