Nvidia's KV Cache Transform Coding (KVTC) compresses the LLM key-value (KV) cache by 20x with no model changes, cutting GPU memory costs and reducing time-to-first-token by up to 8x in multi-turn AI applications.
This illustrates a widespread problem affecting large language models (LLMs): even when an English-language version passes a safety test, it can still hallucinate dangerous misinformation in other ...
How LinkedIn replaced five feed retrieval systems with a single LLM — and what engineers building recommendation pipelines can learn from the redesign.
Search.co introduces a next-generation AI-powered enterprise search platform designed to unify data, eliminate silos, ...
Facebook, Instagram, and WhatsApp parent Meta has released a new generation of its open source Llama large language model (LLM), aiming to capture a larger share of the generative AI market by taking on ...
The new feed system will analyze what users read, like, and discuss to connect related topics and push insightful posts to wider audiences.
In the ecosystem, the recent announcement of OLMo, billed as an open-source, state-of-the-art large language model, has sparked discussion. While proprietary models and corporations are ...
I gave AI my files. It gave me three subscriptions back.
Fractal, a publicly listed global enterprise AI company serving Fortune 500® organizations, today announced the launch of LLM Studio, an enterprise platform that helps organizations build and run ...
The latest CNFinBench evaluation included a range of models representing the forefront of global artificial intelligence (AI) capabilities, including GPT-4o and Claude Sonnet 4, as well as mainland ...
Companies investing in generative AI find that testing and quality assurance are two of the most critical areas for improvement. Here are four strategies for testing LLMs embedded in generative AI ...