Elon Musk's updated Grok AI claims to be better at coding and math

Elon Musk’s answer to ChatGPT is getting an replace to make it higher at math, coding and extra. Musk’s xAI has launched Grok-1.5 to early testers with “improved capabilities and reasoning” and the flexibility to course of longer contexts. The corporate claims it now stacks up in opposition to GPT-4, Gemini Professional 1.5 and Claude 3 Opus in a number of areas.

Going by xAI’s numbers, Grok-1.5 seems to be a big enchancment over Grok-1. It shot as much as 50.6 % within the MATH benchmark, over double the earlier rating. It additionally climbed to 90 % and 74.1 % in GSM8K (math phrase issues) and HumanEval (coding), respectively, in comparison with 62.9 % and 63.2 % earlier than. These numbers are inside shouting distance of Gemini Professional 1.5, GPT-4 and Claude 3 Opus — in reality, the HumanEval coding rating beats all rivals besides Claude 3 Opus.

xAI

It could additionally course of lengthy contexts of as much as 128K tokens inside its context window, which means it will possibly amalgamate knowledge from extra sources to grasp a scenario. “This permits Grok to have an elevated reminiscence capability of as much as 16 instances the earlier context size, enabling it to make the most of data from considerably longer paperwork,” the corporate stated.

xAI did not element Grok’s progress in different areas, although, the place it nonetheless could also be lagging (tutorial scores, multimodal and others). And Grok-1.5 might not hold its place for lengthy. ChatGPT 5 is ready to reach someday this summer season, promising a function set that “makes it really feel like you might be speaking with an individual relatively than a machine,” based on OpenAI.

At present, Grok is simply obtainable for customers of the Premium+ tier on X (previously Twitter), although Elon Musk not too long ago promised to open it as much as X’s common Premium customers. The corporate additionally not too long ago open sourced its Grok chatbot, after Musk sued OpenAI and Sam Altman for allegedly abandoning its non-profit mission.

Source link

Aaron Sorkin is working on a Jan. 6-focused follow-up to The Social Network

Apple has reportedly resumed talks with OpenAI to build a chatbot for the iPhone

Drake deletes AI-generated Tupac track after Shakur’s estate threatened to sue

Leave A Reply Cancel Reply

The Full Star Wars Saga Celebrates 25 Years of Lego

SoraMala Opens Unique Presale With Team Set to Release the First Webtoon AI Meme

Aaron Sorkin is working on a Jan. 6-focused follow-up to The Social Network

Biden Signs TikTok Ban Bill Into Law. Here’s What Happens Next.

Sale or No Sale, TikTok Will Never Be the Same

Most Popular

Meet the A.I. Jane Austen: Meta Weaves A.I. Throughout Its Apps

Epic Games asks Supreme Court to reconsider Apple antitrust ruling

Lateral Movement: What Every Business Should Know

Our Picks

The Full Star Wars Saga Celebrates 25 Years of Lego

SoraMala Opens Unique Presale With Team Set to Release the First Webtoon AI Meme

Aaron Sorkin is working on a Jan. 6-focused follow-up to The Social Network

Elon Musk’s updated Grok AI claims to be better at coding and math

Related Posts

Leave A Reply Cancel Reply