IONOS Cloud AI Model Hub

Your gateway to a secure multimodal AI platform

  • One platform for the most powerful AI models
  • Fair and transparent token-based pricing
  • No vendor lock-in with open source

Unleash the full power of AI securely and efficiently

The AI platform for accessing leading open source generative models for generating texts and images.

Top-level data security

Committed to reliability

  • Safeguard your business data within the secure IONOS Cloud
  • No third-party access to your personal data
  • Reliable hosting in IONOS Cloud data centers

Wide variety of use cases

Add AI to your applications

  • Analyse images, generate, summarise and translate text, or power conversational Q&A
  • Power semantic search, build recommendation engines, and cluster data
  • Generate visuals, mockups, and creative assets from text

Smarter answers

Vector search & RAG

  • Built-in vector database for similarity-based searches
  • Retrieval-augmented generation (RAG) tailors responses to your data
  • Supports multilingual embedding models for broader coverage

Serverless and scalable

Adapts to your needs

  • Pay only for what you use — per token
  • Runs smoothly from early tests to large-scale production
  • No need to manage infrastructure or operations

Leading models

Options for every use-case

  • Up to date with the latest and most reliable AI models
  • Best-of-breed international or European sovereign options
  • Access to open-source models with no licence fees

Seamless integration

Familiar interfaces

  • OpenAI-compatible REST API for a quick start
  • Works with standard frameworks and coding languages
  • Playground in the DCD for model testing

IONOS Cloud stands for secure and sustainable cloud solutions

Bring AI to life in few simple steps

  1. Query:
    Your query is sent to our powerful vector database to identify relevant information.

  2. Retrieve document:
    The vector database accesses the data and retrieves the most relevant documents for your query.

  3. LLM prompt:
    The retrieved documents are passed to your chosen large language model (LLM) to create a tailored response.

  4. Response:
    The response generated by the LLM is returned and ready for immediate use.

Fair pricing based on usage

Benefit from a token-per-use pricing model.
Large language models
Standard
Model

Llama 3.1 8B Instruct
Teuken-7B Instruct
Mistral Nemo Instruct New

Price per 1 million input tokens
$0.17
Price per 1 million output tokens
$0.17
Plus
Model
Code Llama 13b Instruct HF
Price per 1 million input tokens
$0.50
Price per 1 million output tokens
$0.50
Model

Mistral Small 24B Instruct New

Price per 1 million input tokens
$0.11
Price per 1 million output tokens
$0.33
Premium
Model

gpt-oss-120b New

Price per 1 million input tokens
$0.17
Price per 1 million output tokens
$0.71
Model
Llama 3.3 70B Instruct
Price per 1 million input tokens
$0.71
Price per 1 million output tokens
$0.71
Model
Llama 3.1 405B Instruct
Price per 1 million input tokens
$1.93
Price per 1 million output tokens
$1.93
Text-to-image

Stable Diffusion XL, FLUX.1 [schnell] New

Price per image
$0.032
Embedding Models
Convert any text or document into vector embeddings to power a full suite of AI applications, from similarity search and recommendation engines to RAG systems, whether you use our seamlessly integrated vector databases or your own external solution.
paraphrase-multilingual-mpnet-base-v2
Price per 1 million tokens
$0.01
bge-large-en-v1.5
Price per 1 million tokens
$0.015
bge-m3
Price per 1 million tokens
$0.02
Data collections storage
Storage
Price per 1 million tokens

$0.011 / 30 days

Notice: The Stable Diffusion XL model will be removed from the AI Model Hub on January 12, 2026. A token typically represents a unit of text processed by an AI model during inference. It can be a word, character, or another unit, depending on the model and language.

The best AI for your applications in the IONOS Cloud

You're always a step ahead with the IONOS Cloud AI Model Hub.

Use case

Enrich text with AI-generated images

Generate engaging visuals from text prompts for better communication and impact.

  • Derive book covers based on a book’s abstract
  • Create eye-catching visuals to boost ad appeal
  • Increase visibility and engagement on social media

Use case

Create knowledge databases for patents

Build an application that provides patents or other confidential information within your company.

  • Import and store patents as plain text in ChromaDB
  • Find relevant material with a search string
  • Keep track of your data, always

Use case

Take your chatbot to the next level with AI

Use your company's internal knowledge base for chatbot-supported customer inquiries.

  • Export articles as text and import them into a vector database
  • Enhance customer engagement with quick, personalized responses
  • Let the IONOS Cloud AI Model Hub's LLM handle valuable conversations

Documentation and guides

Getting started

Get a complete overview and maximize your data with the IONOS Cloud AI Model Hub's first steps for successful integration and use.

API interface

Integrate the desired LLMs into your applications with the IONOS Cloud AI Model Hub API and optimize them. Find out everything you need to know here.

Enrich your apps

  • Register for the IONOS Cloud Public Cloud
  • Get full access to the IONOS Cloud AI Model Hub API
  • Connect powerful AI models to your application

The IONOS Cloud in practice

Powerful and future-proof cloud infrastructure that many customers trust.

A variety of AI models with endless possibilities

Discover the advantages of our open-source-based AI solutions.

Generate high-quality content in seconds with an open-source language model from our AI Model Hub.

  • Texts written like a native
  • Creative writing and storytelling
  • Code generation

Reduce lengthy articles, paragraphs, or documents to their essential points with LLMs.

  • Summaries
  • Abstracts
  • Briefs

Use LLMs to select similar texts based on content or context and categorize them effectively.

  • Sentiment analysis
  • Topic labelling
  • Tagging content

With the AI Model Hub, you can choose relevant texts based on predefined criteria and fine-tune the accuracy of search results.

  • Implement a relevance ranking
  • Enhanced query understanding with LLMs

Store and process text elements more efficiently than with SQL or NoSQL databases using a vector database.

  • Identify relevant content for queries and searches
  • Provide content to an LLM for processing

Benefit from precise and customized image generation with AI, creating unique visuals in no time.

  • Convert existing texts into visual representations
  • Create images based on text inputs
Questions about the IONOS Cloud?
Sales support
1-484-424-7378

Our product experts are here from 9:00am–5:00pm, Monday to Friday.