Meta, which developed Llama, one of the largest foundational open source large language models, believes it will need far more computing power to train models in the future.
Mark Zuckerberg said during Meta's second-quarter earnings call on Tuesday that training Llama 4 will require about 10 times more computing power than training Llama 3, and that he would rather build that capacity early than fall behind the competition.
"The amount of computing required to train Llama 4 will likely be almost 10 times more than what we needed to train Llama 3, and future models will continue to grow beyond that," Zuckerberg said.
"It's hard to predict how this will trend over future generations. But at this point, given the long lead times for spinning up new inference projects, I'd rather take the risk of building capacity before it is needed than too late."
Meta released Llama 3 in April in 8 billion and 70 billion parameter versions. Last week, the company released an upgraded version of the model, Llama 3.1 405B, which has 405 billion parameters, making it Meta's largest open source model to date.
Meta's Chief Financial Officer Susan Li also said the company is considering various data center projects and building out capacity to train future AI models. She said Meta expects this investment to increase capital expenditures in 2025.
Training large language models is a costly endeavor. Meta's capital expenditures grew nearly 33% to $8.5 billion in the second quarter of 2024, up from $6.4 billion a year earlier, driven by investments in servers, data centers, and network infrastructure.
According to The Information, OpenAI spent $3 billion on training models and another $4 billion renting servers from Microsoft at discounted rates.
"As we scale up our generative AI training capacity to advance our foundation models, we'll continue to build our infrastructure in a way that gives us flexibility in how we use it over time. This will allow us to direct training capacity toward generative AI inference, or toward our core ranking and recommendation work, when we expect that to be more valuable," Li said.
During the call, Meta also discussed usage of its consumer-facing Meta AI and said India is the largest market for its chatbot. But Li noted that the company does not expect its generative AI products to contribute meaningfully to revenue.