Prometheus · AI Compute Platform
AWS Bedrock, Google Vertex AI, OpenAI and self-hosted Taiwan compute in one place — a single gateway to the best model, with AI compute from 8% off on cloud, usage-based billing and no lock-in.
We pool purchasing power across providers and pass the discount straight to you. During Beta you get our best pricing, with no minimum spend.
A unified, OpenAI-compatible interface. Switching providers takes one line of base_url — your code stays the same and billing is automatically consolidated.
A cluster of Mac Studio M3 Ultra and MacBook Pro M5 Max units hosted in a Taiwan data center. Low-latency local inference, data that stays onshore, and alignment with medical compliance requirements.
An ISO 27001-certified architecture. Self-hosted Gemma can be configured so data is never written to logs — ideal for pharma, hospitals and other settings with strict security requirements.
See token usage, cost and model breakdown for every API key at a glance. Manage multiple keys across teams and split billing by department.
Need medical-domain fine-tuning? Aiii offers a custom Gemma fine-tuning service — your training data stays in your environment and model weights can be kept private.
During Beta, cloud models are 5–8% off and self-hosted Taiwan models are even lower. Full pricing will be announced at general availability.
Prepaid credits billed by actual token usage. No monthly fee, no minimum spend; during Beta the minimum top-up is NT$30,000.
A GPU cluster built by Aiii, using Apple M3 Ultra and M5 Max as inference nodes, hosted in a Taiwan data center. Ideal for healthcare, pharma and government use cases with data-sovereignty requirements.
Connect to the API fast, test multiple models and cut your cloud bill. Get started from NT$30,000 during Beta, with zero learning curve thanks to the OpenAI-compatible format.
Unified billing, per-department API key routing and exportable usage reports. No need to apply for separate AWS / GCP / OpenAI accounts — manage it all in one place.
Self-hosted Taiwan compute keeps data onshore and supports ISO 27001 / HIPAA compliance requirements. Pair it with the Aiii MCP Engine to deploy compliant medical AI directly.
Limited to the Beta period. After you apply, a consultant will reach out within 1–2 business days; once we confirm your needs, we'll enable API access and offer an exclusive discounted top-up plan.
We'll email you within 1–2 business days and enable API access once we confirm your needs.
For anything urgent, contact [email protected]
base_url and api_key — and the rest of your code stays the same. The OpenAI SDK in any language works directly.