hugging-face-transformer
A Hugging Face Transformer Railway Template
Hugging Face Transformer Example
This example starts a FastAPI server that runs Hugging Face Transformers.
It runs an embedding model on the CPU, which works well on Railway since resources can scale per request.
Note: This won't work for GPU workflows and will crash.
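For reference, here is a minimal sketch of what such a FastAPI embedding server could look like. The model name, route, and request shape are assumptions for illustration, not taken from the template's actual main.py.

```python
# Hypothetical sketch of a FastAPI embedding server (not the template's actual main.py).
from fastapi import FastAPI
from pydantic import BaseModel
from transformers import pipeline

app = FastAPI()

# Load a small, CPU-friendly embedding model once at startup.
# The model name is an assumption for illustration.
extractor = pipeline(
    "feature-extraction",
    model="sentence-transformers/all-MiniLM-L6-v2",
)

class EmbedRequest(BaseModel):
    text: str

@app.post("/embed")
def embed(req: EmbedRequest):
    # The pipeline returns one vector per token; mean-pool them into a single embedding.
    token_vectors = extractor(req.text)[0]
    dim = len(token_vectors[0])
    embedding = [
        sum(tok[i] for tok in token_vectors) / len(token_vectors)
        for i in range(dim)
    ]
    return {"embedding": embedding, "dimensions": dim}
```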
⬆️ Deploy
or
Install the Railway CLI, then run railway up
✨ Features
- Transformers
- FastAPI
- Hypercorn
- Python 3.11
Uses Nixpacks to deploy on Railway, which runs Python 3.11 by default.
💁♀️ How to use locally
- Clone locally and install packages with pip using `pip install -r requirements.txt`
- Run locally using `hypercorn main:app --reload`
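Once the server is running, you can exercise it with a small client. This is a hedged sketch that assumes the /embed route from the server sketch above and Hypercorn's default bind address of 127.0.0.1:8000; adjust both to match the actual app.

```python
# Hypothetical client for local testing; assumes the /embed route sketched above
# and Hypercorn's default bind address of 127.0.0.1:8000.
import json
import urllib.request

payload = json.dumps({"text": "Hello from Railway!"}).encode("utf-8")
req = urllib.request.Request(
    "http://127.0.0.1:8000/embed",
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    result = json.loads(resp.read())

print(f"Received a {result['dimensions']}-dimensional embedding")
```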
🧩 Background
The team at JigsawStack is launching an embedding model and we're experimenting with different infrastructure for scalable and affordable CPU usage. Check out the embedding model here
Common issues
- When deployed on Railway, your instance might land in a Metal region, which is currently in beta and appears to be noticeably slower than non-beta regions. Switching to US West (Oregon, USA) gave the best performance in testing. Metal regions don't support volume attachments, which could be the cause, since data can't be cached.
- You can attach a volume if you're switching models in a single instance; this allows for better caching and faster switches (see the sketch after this list).
- If you need a specific Python version, you can set `NIXPACKS_PYTHON_VERSION` in the Variables tab to the desired version.
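As a rough illustration of the caching point above, the Hugging Face cache can be pointed at the volume's mount path so downloaded weights persist across restarts and model switches. The /data mount path below is an assumption; use whatever mount path you configure for the volume.

```python
# Hypothetical sketch: persist downloaded model weights on an attached volume.
# The /data mount path is an assumption; match it to your volume's mount path.
import os

# Must be set before transformers/huggingface_hub read the cache location.
os.environ.setdefault("HF_HOME", "/data/huggingface")

from transformers import pipeline

extractor = pipeline(
    "feature-extraction",
    model="sentence-transformers/all-MiniLM-L6-v2",
)
```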
Template Content
transformer-fastapi
JigsawStack/huggingface-transformers-railway-template