Skip to main content

GPU API

Submit compute jobs to our GPU cluster and monitor their status.

Current Hardware

GPUVRAMCount
NVIDIA GeForce RTX 507012 GB1

More GPUs will be added to the cluster over time.


Submit a GPU Job

POST /api/gpu/jobs

Requires: Authorization: Bearer YOUR_JWT_TOKEN

Request Body

FieldTypeRequiredDescription
commandstringYesCommand or script to execute
gpuTypestringNoPreferred GPU type (default: any)
dockerImagestringNoDocker image for the job container

Example

curl -X POST https://intelligence.cognitera.ai/api/gpu/jobs \
-H "Authorization: Bearer YOUR_JWT_TOKEN" \
-H "Content-Type: application/json" \
-d '{
"command": "python train.py --epochs 10 --lr 0.001",
"dockerImage": "nvidia/cuda:12.0-base"
}'

Response

{
"id": "job-uuid",
"status": "queued",
"queuePosition": 3
}

Get Job Status

GET /api/gpu/jobs/:id

Requires: Authorization: Bearer YOUR_JWT_TOKEN

Response

{
"id": "job-uuid",
"status": "running",
"gpuType": "NVIDIA GeForce RTX 5070",
"gpuSecondsUsed": 145.3,
"createdAt": "2026-03-19T12:00:00Z",
"startedAt": "2026-03-19T12:01:00Z",
"completedAt": null,
"result": null,
"errorMessage": null
}

Job Statuses

StatusDescription
queuedWaiting for an available GPU
runningCurrently executing on a GPU
completedFinished successfully
failedExecution failed (see errorMessage)
cancelledCancelled by user or system

List Jobs

GET /api/gpu/jobs

Returns the 50 most recent GPU jobs for your account.


Cluster Status

GET /api/gpu/status

Requires: Authorization: Bearer YOUR_JWT_TOKEN

Check real-time GPU availability.

Response

{
"cluster": {
"totalGpus": 1,
"availableGpus": 1,
"busyGpus": 0,
"gpus": [
{
"name": "NVIDIA GeForce RTX 5070",
"available": true,
"utilization": 0,
"memoryUsed": 512,
"memoryTotal": 12288,
"temperature": 42
}
]
},
"queueDepth": 0
}