Meta Llama 3.1 405B Instruct, the best open source model, is now just $1.79 per 1M tokens.
Deep Infra Inc.
Technology, Information and Internet
Palo Alto, California 567 followers
Fast ML inference. Run top AI models using a simple API.
About us
Let Deep Infra run your ML infrastructure. Access our top AI models through a simple API, or deploy your own model with us.
- Website
- https://deepinfra.com
- Industry
- Technology, Information and Internet
- Company size
- 2-10 employees
- Headquarters
- Palo Alto, California
- Type
- Privately Held
- Founded
- 2022
Locations
- Primary: Palo Alto, California 94306, US
Updates
- We released Whisper Large V3 at 0.045c per minute of transcribed audio.
  openai/whisper-large-v3 - Demo - DeepInfra (deepinfra.com)
- Llama 3.1 405B, 70B and 8B Instruct models are available in full precision on day 1 at Deep Infra Inc.
  meta-llama/Meta-Llama-3.1-405B-Instruct - Demo - DeepInfra (deepinfra.com)
- The latest Gemma 2 open models are now available at DeepInfra. https://lnkd.in/g-4bz6NZ
  google/gemma-2-27b-it - Demo - DeepInfra (deepinfra.com)
- We went looking and we found the Nemotron! The best open source LLM, and the best model overall, that allows generating synthetic data! As always, with a well-thought-out price of $4.20 per Mtoken!
  nvidia/Nemotron-4-340B-Instruct - Demo - DeepInfra (deepinfra.com)
- Qwen2 7B Instruct is available with a 32K context window.
  Qwen/Qwen2-7B-Instruct - Demo - DeepInfra (deepinfra.com)
- Qwen2 72B Instruct excels in language understanding, multilingual capabilities, coding, mathematics, and reasoning. It's now available @ $0.59/$0.79 per Mtoken.
  Qwen/Qwen2-72B-Instruct - Demo - DeepInfra (deepinfra.com)
- Microsoft/Phi-3 Medium is available @ 14c per Mtoken. It's a 14-billion-parameter language model trained on high-quality data for instruction following and safety.
  microsoft/Phi-3-medium-4k-instruct - Demo - DeepInfra (deepinfra.com)
- We released the new Mistral 7B Instruct v0.3 and Openchat 3.6 8B at $0.07 and $0.08 per Mtoken, respectively. Try them out 👇 Openchat: https://lnkd.in/gGQS-aWy
  mistralai/Mistral-7B-Instruct-v0.3 - Demo - DeepInfra (deepinfra.com)