Inflection AI, Intel Launch Enterprise AI System
October 7, 2024 | IntelEstimated reading time: 3 minutes
Inflection AI and Intel announced a collaboration to accelerate the adoption and impact of AI for enterprises as well as developers. Inflection AI is launching Inflection for Enterprise, an industry-first, enterprise-grade AI system powered by Intel® Gaudi® and Intel® Tiber™ AI Cloud (AI Cloud), to deliver empathetic, conversational, employee-friendly AI capabilities and provide the control, customization and scalability required for complex, large-scale deployments. This system is available presently through the AI Cloud and will be shipping to customers as an industry-first AI appliance powered by Gaudi 3 in Q1 2025.
Building an AI system typically demands substantial infrastructure--extensive model development and training, and collaboration among engineers, data scientists and application developers. With Inflection for Enterprise, built on Inflection 3.0, enterprise customers can now harness a comprehensive AI solution that empowers their workforce with a virtual AI co-worker specifically trained on their unique company data, policies and culture. The partnership with Intel brings unmatched performance through the Intel Gaudi 3 AI accelerator, which offers industry-leading price/performance for efficient, high impact results. Intel’s technology ensures flexibility and scalability for high-impact results. Additionally, the AI Cloud streamlines the building, testing and deployment of AI applications in a unified environment, accelerating time to market. With the value and benefits this service offers, Intel and Inflection AI are also collaborating to deploy Inflection for Enterprise within Intel with the anticipation that Intel will be an early customer of the solution.
“Every CEO and CTO we speak to is frustrated that existing AI tools on the market aren’t truly enterprise-grade,” said Inflection AI COO Ted Shelton. “Enterprise organizations need more than generic off-the-shelf AI, but they don’t have the expertise to fine-tune a model themselves. We’re proud to offer an AI system that solves these problems, and with the performance gains we see from running on Intel Gaudi, we know it can scale to meet the needs of any enterprise.”
How It Works: Inflection AI fine-tunes its model to be native to each organization, expediting user adoption and improving the usefulness of use cases through alignment with the company’s tone, purpose, and unique product, service, and operating information. Inflection 3.0 enables enterprise customers with faster time-to-value through employee-friendly generative AI experiences, while offering price, performance and security/compliance advantages.
- Removing Barriers to GenAI – Built on AI Cloud, Inflection for Enterprise provides application templates designed to let businesses skip hardware testing and model building and avoid capital expenses to scale quickly. In Q1 2025 customers will also have the option to purchase Inflection for Enterprise on a complete turnkey AI appliance. Leveraging Gaudi 3, customers of this appliance can benefit from up to 2x improved price performance as well as 128GB of high-bandwidth memory capacity further optimizing their GenAI performance compared with current competitive offerings.
- Optimized Price/Performance – While Inflection AI’s Pi consumer application was previously run on Nvidia GPUs, Inflection 3.0 will be powered by Gaudi 3 with instances on-premises or in the cloud powered by AI Cloud. This not only cuts down on time to deploy but also total cost of ownership.
- Fine-Tuned for Enterprises – Leveraging the fine-tuning and reinforcement learning from human feedback (RLHF) expertise that powered Inflection AI’s Pi, Inflection for Enterprise models are unique to each business’ ethos and way of operating. Modeled on data and insights from a company’s history, policies, content, tone, products and operating information, Inflection AI helps drive productivity and alignment across an organization.
- Enhanced Ownership and Security – Inflection for Enterprise allows enterprises to own their intelligence in its entirety. Fine-tuned models are the customer’s alone and are never shared outside their organization. Additionally, customers can host and run the model on their preferred architecture, whether hosted on-premises, in the cloud, or hybrid.
Suggested Items
Marcy's Musings: Charting the Future
09/17/2024 | Marcy LaRont -- Column: Marcy's MusingsI’m sure we all remember the days when driving somewhere new meant pulling out our handy atlas, or writing down all the specific instructions on how to get there before we left on our trips. Now, modern navigation systems are so sophisticated that they talk you through the process, reroute when you make a wrong turn, and tell you exactly what time you’ll arrive. One of the most beneficial aspects of these maps is hearing your next required move before you get there so you don’t miss a turn or go in the wrong direction. Wouldn’t it be nice if our technology roadmaps did the same, helping prevent missteps and avoid hazards? But deciding where to go and how to get there is completely in our own hands, as is ensuring we actually take the twists and turns we have so carefully laid out in our roadmaps. Therein, I believe, lies the biggest challenge of all.
2016 Semiconductor Sales to Go Negative
03/28/2016 | Semico ResearchASPs in January recovered on lower revenues, which were down 6% year over year. Although ASPs rose 4.0% in January, they are still historically low. Semico president Jim Feldhan commented, "In the past 8 months, the industry has seen ASPs in the $0.41 range 5 times. One has to go back to May 2009 to find a lower price, and 2009 was not a good year!"