Deploying AI to production is expensive and painful. But it shouldn't be. We believe integrating AI should be predictable, easy, and freely scalable.
We built Mighty Inference Server to take the pain out of model hosting and performance tuning, to automatically give you the highest possible throughput, to empower you to scale with predictable costs, on your terms in your stack.
docker pull maxdotio/mighty docker run -p 5050:5050 -t maxdotio/mighty >Mighty server for embeddings is listening on http://localhost:5050Step 2. Infer!
curl http://localhost:5050/\?text=Hello+mighty. >{ > "took":8, > "text":"Hello mighty.", > "shape":[5,384], > "outputs":[[0.066911593079,0.162673592567,...,-0.233405888080]] >}
curl http://max.io/mighty-mac.tar.gz --output mighty-mac.tar.gz tar -zxf mighty-mac.tar.gzStep 2. Run
mighty/mighty >Mighty server for embeddings is listening on http://localhost:5050Step 3. Infer!
curl http://localhost:5050/\?text=Hello+mighty. >{ > "took":8, > "text":"Hello mighty.", > "shape":[5,384], > "outputs":[[0.066911593079,0.162673592567,...,-0.233405888080]] >}
sudo apt-get install curl hwloc libhwloc-dev curl http://max.io/mighty-linux.tar.gz --output mighty-linux.tar.gz tar -zxf mighty-linux.tar.gzStep 2. Run
mighty/mighty >Mighty server for embeddings is listening on http://localhost:5050Step 3. Infer!
curl http://localhost:5050/\?text=Hello+mighty. >{ > "took":8, > "text":"Hello mighty.", > "shape":[5,384], > "outputs":[[0.066911593079,0.162673592567,...,-0.233405888080]] >}
curl http://max.io/mighty-windows.zip unzip mighty-windows.zipStep 2. Run
mighty\mighty.ps1 >Mighty server for embeddings is listening on http://localhost:5050Step 3. Infer!
curl http://localhost:5050/\?text=Hello+mighty. >{ > "took":8, > "text":"Hello mighty.", > "shape":[5,384], > "outputs":[[0.066911593079,0.162673592567,...,-0.233405888080]] >}
Mighty | Build your own | Paid APIs | |
---|---|---|---|
Low-Latency | - | - | |
Production Ready | - | ||
Unlimited Requests | - | ||
Unlimited Scalability | - | ||
Cost | $99 per month | Months of developer time | Pay-per-use (expensive!) |