Deploy GPT-OSS
Deploy and host GPT-OSS on Railway.
GPT-OSS Model
ollama/ollama
Just deployed
/root/.ollama
Caddy
Err0r430/railway-dockerfiles
Just deployed
Deploy and Host GPT-OSS on Railway
The GPT-OSS line is OpenAI’s open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases.
About Hosting GPT-OSS
Hosting GPT-OSS is possible on railway, however this model runs at a very low token per second rate as it is CPU bound. This template will be kept up to date for ideal optimization.
Important hosting information
Please keep these in mind if you are considering hosting GPT-OSS for yourself.
Any AI model requires a large amount of resources to run. Because of this, at the current moment Railway's hobby plan can not operate in a satisfactory manner. For optimal operation, we suggest the following:
- 20g volume storage.
- 30g of ram.
- 30v CPU.
(The above numbers provide slight padding in case models run high.)
Important pricing information
This model idle sits at roughly 14g of ram and low cpu. During request it will ramp to upwards of 28g of ram, 26-28vCPU.
Your price per month of hosting GPT-OSS will range from $300-$900 per month of raw resource usage alone. If you plan on deploying GPT-OSS please be aware of the costs behind it.
Common Use Cases
- Completely private and secured AI model.
- Controlled model with similar performance to OpenAI o3-mini.
- Process large amounts of data without relying on OpenAI's servers.
Dependencies for GPT-OSS Hosting
- 20g volume storage.
- 30g of ram.
- 30v CPU.
Deployment Dependencies
- 20g volume storage.
- 30g of ram.
- 30v CPU.
Why Deploy GPT-OSS on Railway?
Railway is a singular platform to deploy your infrastructure stack. Railway will host your infrastructure so you don't have to deal with configuration, while allowing you to vertically and horizontally scale it.
By deploying GPT-OSS on Railway, you are one step closer to supporting a complete full-stack application with minimal burden. Host your servers, databases, AI agents, and more on Railway.
Template Content
GPT-OSS Model
ollama/ollama