Hyperbolic CLI
A developer-first CLI to browse, rent, and manage Hyperbolic GPUs directly from your terminal. Rent on-demand GPU VMs and bare-metal clusters from $1.49 per hour.Installation
Install the Hyperbolic CLI using Homebrew:Quick start
Account setup
Before using the CLI, ensure you have:- Created an account at app.hyperbolic.ai and verified your email
- Added funds in the Billing section (we recommend starting with $25)
- Added your SSH public key in Settings > SSH Public Key
Commands and examples
- Browse On-Demand GPUs
- Rent On-Demand VM
- Rent Bare Metal
- Manage Instances
View available GPUs on the On-Demand GPU platform:
Links
- Documentation: docs.hyperbolic.ai
- Dashboard: app.hyperbolic.ai
- License: MIT
VS Code / Cursor extension
Manage GPU instances from Hyperbolic directly within VS Code or Cursor. Create instances, connect via SSH, run Jupyter notebooks, and serve large language models with vLLM - all without leaving your editor.Why use the extension?
- Stay in the flow - Provision powerful GPUs without leaving your workspace
- One-click everything - Rent, connect, run Jupyter, and serve models from a unified sidebar
- Speed up iterations - Launch an H100 in under 60 seconds and immediately start working
Installation
Install from the On-Demand GPU platform:- VS Code: Search for “Hyperbolic AI Cloud” or visit the VS Code Marketplace
- Cursor: Search for “Hyperbolic AI Cloud” in the extensions panel
Prerequisites
Install these required extensions before using Hyperbolic:| Extension | Purpose | Extension ID |
|---|---|---|
| Jupyter | Running notebooks | ms-toolsai.jupyter |
| Remote - SSH (VS Code) | SSH connections | ms-vscode-remote.remote-ssh |
| Remote - SSH (Cursor) | SSH connections | anysphere.remote-ssh |
Configuration
After installation, configure your settings:1
Open Settings
Open Command Palette (
Ctrl+Shift+P / Cmd+Shift+P) and type “Preferences: Open Settings (UI)”2
Search for Hyperbolic
Type “hyperbolic” in the search bar
3
Configure credentials
Set the following:
- API Key - Your Hyperbolic API key
- SSH Private Key Path - Path to your SSH private key (e.g.,
~/.ssh/id_ed25519) - Hugging Face Token - (Optional) For accessing gated models
Jupyter Notebook support
When creating a new instance, you can expose a port for Jupyter. The extension handles all setup automatically:- Automatic installation of Python and pip if needed
- Jupyter Notebook installation and configuration
- Port management and process cleanup
- Secure access via SSH tunnel or public URL
vLLM model serving
Serve large language models using vLLM directly from your GPU instance. Features:- Model selection - Choose any Hugging Face model compatible with vLLM
- Gated model support - Automatic authentication with Hugging Face
- Custom tokenizers - Optional tokenizer mode configuration (e.g., “mistral” for Mistral models)
- Setup options - Live terminal setup with monitoring or automated setup
- Public URLs - Automatic URL generation for API access
1
Select an instance
Select an active GPU instance or create a new one with exposed ports.
2
Run the command
Open Command Palette and run
Hyperbolic: Serve vLLM Model3
Configure the model
- Enter the Hugging Face model name (e.g.,
deepseek-ai/DeepSeek-R1-0528) - Configure optional tokenizer mode if needed
- Specify if the model is gated (requires HF authentication)
- Choose setup method (live terminal or automated)
4
Use the API
The extension handles all installation and configuration. Once running, access via REST API.
Security settings
The extension provides configurable SSH security options:- Open VS Code/Cursor settings (
Cmd+,orCtrl+,) - Search for “hyperbolic ssh”
- Enable “SSH Strict Host Key Checking”
- Development/testing: Keep disabled for convenience
- Production/sensitive work: Enable strict host key checking
Requirements
- VS Code 1.85.0 or higher (or compatible Cursor version)
- Valid Hyperbolic API key
- SSH client installed on your system
- SSH private key configured
- Jupyter Extension installed
- Remote - SSH Extension installed
- Python 3.x (installed automatically on remote instances if needed)
- Hugging Face account and token (optional, for gated models)
MCP Server
The Hyperbolic MCP Server enables AI agents and LLMs to interact with Hyperbolic’s GPU cloud. Connect Claude Desktop to view and rent available GPUs, SSH into instances, and run GPU-powered workloads autonomously.Installation
Follow the installation instructions in the GitHub repository.Running the server
Start the server locally:Connecting with Claude Desktop
Add the server to your Claude Desktop configuration:- Restart Claude Desktop
- Start a new conversation
- Interact with the server using natural language
Available tools
The MCP server provides tools for GPU management and SSH operations.- GPU Management
- SSH Tools
list-available-gpusLists all available GPUs on the Hyperbolic network.Example: “Show me all available GPUs on Hyperbolic.”
rent-gpu-instanceRents a GPU instance from a specific cluster.Parameters:
terminate-gpu-instanceTerminates a rented GPU instance.Parameters:
list-user-instancesLists all your active GPU instances.Example: “Show me all my active GPU instances.”
get-cluster-detailsGets detailed information about a specific cluster.Parameters:
rent-gpu-instanceRents a GPU instance from a specific cluster.Parameters:
cluster_name- Name of the cluster (e.g., “extrasmall-chamomile-duck”)node_name- Name of the node (e.g., “prd-acl-msi-02.fen.intra”)gpu_count- Number of GPUs to rent
terminate-gpu-instanceTerminates a rented GPU instance.Parameters:
instance_id- ID of the instance to terminate
list-user-instancesLists all your active GPU instances.Example: “Show me all my active GPU instances.”
get-cluster-detailsGets detailed information about a specific cluster.Parameters:
cluster_name- Name of the cluster
Security notes
- The server requires your Hyperbolic API token and optionally an SSH private key
- Credentials can be provided through the Claude Desktop config or a
.envfile - The server runs locally and doesn’t expose credentials externally
- SSH private keys must not be password-protected (the server cannot handle protected keys)
Troubleshooting
| Issue | Solution |
|---|---|
| API errors | Check that your API token is correct and not expired |
| Insufficient credits | Ensure you have sufficient credits on your Hyperbolic account |
| SSH connection fails | Verify your private key path is correct and has proper permissions |
| Network issues | Check your network connection to the Hyperbolic API |
SDKs
Official Python and TypeScript SDKs are coming soon.
- Python SDK - Native Python client for inference and GPU management
- TypeScript SDK - Full TypeScript support with type definitions
- OpenAI SDK - Hyperbolic’s inference APIs are OpenAI-compatible. See Integrations for migration guides.
- REST API - Direct HTTP requests to our endpoints. See Inference API Reference for details.

