Microsoft LASERs away LLM inaccuracies

During the January Microsoft Research Forum, Dipendra Misra, a senior researcher at Microsoft Research Lab NYC and AI Frontiers, explained how Layer-Selective Rank Reduction (or LASER) can make large language models more accurate. With LASER, researchers can “intervene” and replace one weight matrix with an approximate smaller one. Weights are the contextual connections models make. … Read more

AWS re:Invent: Everything Amazon’s announced, from new AI tools to LLM updates and more

Amazon’s big AWS cloud event has begun, with a clear focus on the use of AI to keep its lead intact. Amazon Web Services took to Las Vegas for its AWS re:Invent event, which kicked off November 27 and runs until December 1. Amazon delivered a rapid-fire series of announcements and unveilings of recent things … Read more

Valued at $1B, Kai-Fu Lee’s LLM startup unveils open source model

Kai-Fu Lee, the computer scientist known in the West for his bestseller AI Superpowers and in China for his bets on artificial intelligence unicorns, has a new venture — and a great ambition. In late March, Lee launched a company called 01.AI with the vision to develop a homegrown large language model for the Chinese market. … Read more

Cerebras and Abu Dhabi’s M42 made an LLM dedicated to answering medical questions

The applications of artificial intelligence in health care are numerous. But they are largely dominated by older AI technology; newer things such as so-called generative AI and large language models (LLMs) are the craze of the moment, but they are deemed too risky to be used to any great extent in health … Read more

Arthur releases open source tool to help companies find the best LLM for a job

Arthur, a machine learning monitoring startup, has benefited from the interest in generative AI this year, and it has been developing tools to help companies work with LLMs more effectively. Today it’s releasing Arthur Bench, an open source tool to help users find the best LLM for a particular set of … Read more

Anthropic launches improved version of its entry-level LLM

Anthropic, the AI startup co-founded by ex-OpenAI execs, has launched an updated version of its faster, cheaper text-generating model available through an API, Claude Instant. The updated Claude Instant, Claude Instant 1.2, incorporates the strengths of Anthropic’s recently announced flagship model, Claude 2, showing “significant” gains in areas such as … Read more