Network Security Internet Technology Development Database Servers Mobile Phone Android Software Apple Software Computer Software News IT Information

In addition to Weibo, there is also WeChat

Please pay attention

WeChat public account

Shulou

Nvidia announces the new version of TensorRT-LLM: reasoning ability soars 5 times, graphics card above 8GB can run locally, and Chat API of OpenAI is supported.

2025-01-19 Update From: SLTechnology News&Howtos shulou NAV: SLTechnology News&Howtos > IT Information >

Share

Shulou(Shulou.com)11/24 Report--

CTOnews.com November 16 news, Microsoft Ignite 2023 conference has begun today, Nvidia executives attended the meeting and announced the update of TensorRT-LLM, adding support for OpenAI Chat API.

CTOnews.com reported in October that Nvidia launched Tensor RT-LLM open source libraries for data centers and Windows PC. The biggest feature is that if the Windows PC is equipped with Nvidia GeForce RTX GPU,TensorRT-LLM, the LLM can run four times faster on the Windows PC.

At today's Ignite 2023 conference, Nvidia announced an update to TensorRT-LLM, adding Chat API support for OpenAI, and enhanced DirectML capabilities to improve the performance of AI models such as Llama 2 and Stable Diffusion.

TensorRT-LLM can be done locally through Nvidia's AI Workbench, and developers can use this unified, easy-to-use toolkit to quickly create, test, and customize pre-trained generative AI models and LLM on PC or workstations. Nvidia also launched a pre-emptive experience registration page for this purpose.

Nvidia will release an update to TensorRT-LLM 0.6.0 later this month with a fivefold improvement in reasoning performance and support for other mainstream LLM such as Mistral 7B and Nemotron-3 8B.

Users can run on GeForce RTX 30 series and 40 series GPU with more than 8GB video memory, and some portable Windows devices can also use fast and accurate local LLM functions.

Related readings:

"Nvidia launches Tensor RT-LLM to make large language models run four times faster on PC platforms with RTX."

Welcome to subscribe "Shulou Technology Information " to get latest news, interesting things and hot topics in the IT industry, and controls the hottest and latest Internet news, technology news and IT industry trends.

Views: 0

*The comments in the above article only represent the author's personal views and do not represent the views and positions of this website. If you have more insights, please feel free to contribute and share.

Share To

IT Information

Wechat

© 2024 shulou.com SLNews company. All rights reserved.

12
Report