Austin, Texas · Apple Silicon Cloud
128GB of unified memory. No CUDA tax.
Bare-metal M4, M4 Pro, and M4 Max for iOS build farms, MLX inference, and anything else Apple Silicon is best at. No ticket queues. No VM abstraction. Just the machine, and someone who answers when you write.
MLX, Ollama, Xcode 16, fastlane, and Tuist: pre-installed, supported, and at home on our Minis
What it's for
Run Llama 3.3 70B, Mistral Large, or your own fine-tune on an M4 Max in production. MLX and Ollama pre-imaged. Per-hour pricing. Swap a model in a minute.
See AI configurations →
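What a swap looks like in practice: a minimal sketch using the ollama Python client against a pre-imaged Mini, assuming the default Ollama port. The hostname and model tags are placeholders, not real endpoints.

```python
# Minimal sketch: swapping the served model on a rented Mini.
# Hostname and model tags are placeholders -- point the client at your own box.
from ollama import Client

client = Client(host="http://your-mini.example.com:11434")

client.pull("llama3.3:70b")          # fetch (or reuse cached) weights
reply = client.chat(
    model="llama3.3:70b",            # change the tag here to swap models
    messages=[{"role": "user", "content": "Say hello from the Mini."}],
)
print(reply["message"]["content"])
```

Changing the tag in the chat call is the whole swap; the pre-imaged install means there is nothing to set up beyond the pull.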
Bare-metal Minis you can SSH into. Xcode 16, fastlane, Tuist, and your provisioning profiles, ready in under ten minutes. GitHub Actions and Bitrise runners included.
See CI configurations →
Why Deliany
One tenant per Mini. Every clock cycle, every joule, every thermal watt — yours.
East Austin facility with dual-carrier transit, N+1 UPS, and 36-hour diesel backup. US data residency, period.
A shared Slack Connect channel for every account. No tier-1 scripts. No bots. No runaround.
Hourly or monthly. Every tier on one page. No "contact sales" wall. No surprise invoices.
Measured on our fleet · March 2026
An H100 80GB runs a 70B model faster per-token, but can't hold it at fp16 and rents for roughly 3× more. If your workload is memory-bound or cost-sensitive, an M4 Max at $1.19/hr with 128GB of unified memory is often the better machine. Batched throughput numbers available on request.
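The fp16 claim is back-of-the-envelope arithmetic: 70B parameters at two bytes each is roughly 140GB of weights before any KV cache. A quick sketch of the weights-only footprint at common precisions (our assumption; real headroom also needs cache and activations):

```python
# Weights-only footprint of a 70B-parameter model at common precisions.
# KV cache and activations need headroom on top of these numbers.
PARAMS = 70e9

for precision, bytes_per_param in [("fp16", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)]:
    gb = PARAMS * bytes_per_param / 1e9
    print(f"{precision:>5}: {gb:5.0f}GB"
          f"  fits H100 80GB: {gb <= 80}"
          f"  fits M4 Max 128GB: {gb <= 128}")
```

At 8-bit the weights alone land around 70GB, which leaves almost nothing on an 80GB card but comfortable room for cache in 128GB of unified memory.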
Start in 20 minutes
Pick a tier, tell us what you're running, and we'll have a bare-metal Mini online before your next stand-up.