All Templates / AI/ML
Sentence Transformers Model
FastAPI server to serve sentence-transformer embeddings
Sentence Transformers API
morgangallant/sentence-transformers-railway
Just deployed
Given any sentence transformer model, this template will deploy a quick and easy FastAPI server to serve embeddings for that model.
Setup requires you to choose which model to run, and choose the number of workers. Be careful using models that are too large, since a) they'll be expensive since the weights will be loaded into memory N times (i.e. depending on number of workers you chose during setup), and b) they'll be running on CPU so inference isn't overly quick compared to GPUs.
Usage is simple, just post {"texts":["hello","world"]} to /embed on the exposed server, and you'll get back the embeddings in JSON.
Template Content
Sentence Transformers API
morgangallant/sentence-transformers-railwayDetails
Morgan Gallant's Projects
Created on Sep 24, 2023
25 total projects
13 active projects
0% success on recent deploys
Python, Dockerfile
AI/ML
More templates in this category
Chat Chat
Chat Chat, your own unified chat and search to AI platform.
Harry Yep's Projects
103
openui
Deploy OpenUI: AI-powered UI generation with GitHub OAuth and OpenAI API.
zexd's Projects