Deploy GPT-OSS

Deploy and host GPT-OSS on Railway.

Deploy GPT-OSS

GPT-OSS Model

ollama/ollama

Just deployed

/root/.ollama

Caddy

Err0r430/railway-dockerfiles

Just deployed

Deploy and Host GPT-OSS on Railway

The GPT-OSS line is OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.

About Hosting GPT-OSS

Hosting GPT-OSS is possible on railway, however this model runs at a very low token per second rate as it is CPU bound. This template will be kept up to date for ideal optimization.

Important hosting information

Please keep these in mind if you are considering hosting GPT-OSS for yourself.

Any AI model requires a large amount of resources to run. Because of this, at the current moment Railway's hobby plan can not operate in a satisfactory manner. For optimal operation, we suggest the following:

  • 20g volume storage.
  • 30g of ram.
  • 30v CPU.

(The above numbers provide slight padding in case models run high.)

Important pricing information

This model idle sits at roughly 14g of ram and low cpu. During request it will ramp to upwards of 28g of ram, 26-28vCPU.

Your price per month of hosting GPT-OSS will range from $300-$900 per month of raw resource usage alone. If you plan on deploying GPT-OSS please be aware of the costs behind it.

Common Use Cases

  • Completely private and secured AI model.
  • Controlled model with similar performance to OpenAI o3-mini.
  • Process large amounts of data without relying on OpenAI's servers.

Dependencies for GPT-OSS Hosting

  • 20g volume storage.
  • 30g of ram.
  • 30v CPU.

Deployment Dependencies

  • 20g volume storage.
  • 30g of ram.
  • 30v CPU.

Why Deploy GPT-OSS on Railway?

Railway is a singular platform to deploy your infrastructure stack. Railway will host your infrastructure so you don't have to deal with configuration, while allowing you to vertically and horizontally scale it.

By deploying GPT-OSS on Railway, you are one step closer to supporting a complete full-stack application with minimal burden. Host your servers, databases, AI agents, and more on Railway.


Template Content

More templates in this category

View Template
Chat Chat
Chat Chat, your own unified chat and search to AI platform.

View Template
openui
Deploy OpenUI: AI-powered UI generation with GitHub OAuth and OpenAI API.

View Template
firecrawl
firecrawl api server + worker without auth, works with dify