NLP-as-a-Service – The Next Evolution in AI
One of CoreWeave’s founding principles is to provide more accessible cloud infrastructure for developers and founders, purpose-built for compute-intensive workloads.
CoreWeave’s platform has continuously evolved to address one common pain point shared by all of our clients: legacy cloud providers make it extremely difficult to scale because they offer limited high-performance compute options at monopolistic prices.
CoreWeave is excited to announce a massive step forward for visionary businesses that are building products on top of large language models, one that makes it even easier to deploy NLP services on CoreWeave Cloud.
In partnership with our friends at Anlatan, the creators of NovelAI, we launched GooseAI: a fully managed inference service delivered by API. With feature parity to other well-known APIs, GooseAI delivers a plug-and-play solution for serving open-source language models at over 70% cost savings, simply by changing one line of code.
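To make the "one line of code" idea concrete, here is a minimal sketch of what such a switch might look like. It assumes an OpenAI-style completions endpoint; the base URLs, the `gpt-neo-20b` engine ID, and the `build_completion_request` helper are illustrative assumptions rather than details taken from this post, so check the GooseAI documentation before relying on them. The request is only built, not sent.

```python
import json
import urllib.request

# Assumed base URLs; neither is stated in this post, so verify them
# against each provider's documentation.
OPENAI_BASE = "https://api.openai.com/v1"
GOOSEAI_BASE = "https://api.goose.ai/v1"

# The one line that changes when switching providers.
API_BASE = GOOSEAI_BASE


def build_completion_request(api_base, api_key, engine, prompt, max_tokens=32):
    """Build (but do not send) an OpenAI-style completion request.

    Hypothetical helper for illustration: everything except `api_base`
    stays the same regardless of which provider is targeted.
    """
    url = f"{api_base}/engines/{engine}/completions"
    body = json.dumps({"prompt": prompt, "max_tokens": max_tokens}).encode("utf-8")
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# "gpt-neo-20b" is an assumed engine ID for GPT-NeoX-20B.
request = build_completion_request(
    API_BASE, "YOUR_API_KEY", "gpt-neo-20b", "Once upon a time"
)
print(request.full_url)
```

With a real API key, passing `request` to `urllib.request.urlopen` would send it. The point of the sketch is that the payload, headers, and calling code are untouched; only `API_BASE` differs between providers.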
In 2021 we built a state-of-the-art NVIDIA A100 cluster for distributed training and partnered closely with EleutherAI to train the world’s largest publicly accessible language model: GPT-NeoX-20B. Our team had heard repeated frustrations that large models were too expensive to deploy at scale and too hard to access, so this investment in the AI community was a no-brainer.
Since then, we have been building a dead simple solution for anyone looking to deploy GPT-NeoX-20B and other models like it. As of February 2nd, you can start using our GPT-NeoX-20B beta on GooseAI.
Here’s what you need to know:
- GooseAI is an industry-leading, fully managed inference service delivered via API
- Feature parity with industry-standard APIs, like OpenAI, at over 70% lower cost
- State-of-the-art open-source NLP models, including EleutherAI’s GPT-NeoX-20B, available out of the box
- All the advantages of CoreWeave Cloud with zero infrastructure overhead, including the industry’s fastest spin-up times and most responsive auto-scaling