Great news for xAI: Grok is now pretty good at answering questions about Baldur’s Gate

In this photo illustration, the logo of 'OpenAI' is displayed on a mobile phone screen in front of a computer screen displaying the photograph of Elon Musk. Image Credits: Muhammed Selim Korkutata/Anadolu / Getty Images

10:26 AM PST · February 20, 2026

Different AI labs have different priorities. OpenAI has traditionally focused on consumer users, for instance, while its rival Anthropic tends to target enterprises. Elon Musk’s xAI, we learned recently, has been placing particular emphasis on video-game walkthroughs.

On Friday, Business Insider’s Grace Kay published a detailed and far-reaching report about xAI, the AI startup recently acquired by SpaceX, with particular emphasis on how Musk is making life hard for employees. But this particular anecdote stood out:

In one case last year, a model release was delayed for several days because Musk was dissatisfied with how the chatbot answered detailed questions about the video game “Baldur’s Gate,” according to people familiar with the matter. High-level engineers were pulled from other projects to improve the responses before launch, they said.

Of course, you can imagine the frustration of any respected and experienced engineer who shows up to work thinking he’ll be tackling fundamental problems of cognition and machine intelligence, only to be sidetracked into helping a 54-year-old man beat his video game. But the anecdote raises an even more pressing question: Did Musk end up getting the gaming skills he wanted?

To answer that question, our resident RPG-enthusiast Ram Iyer put together a set of five general questions about Baldur’s Gate, which we ran against xAI and the three major models in a kind of quasi-benchmark that I’ve decided to call BaldurBench.

In the interest of journalistic transparency, I’ve made all the chat transcripts public, so you can see them here: Grok, ChatGPT, Claude, and Gemini.

First, the good news: Grok actually gives pretty good information. Its responses were a bit heavy with gamer jargon (“save-scumming” instead of saving and “DPS” instead of damage), but the answers were both useful and well-informed, provided you knew what it was talking about. Grok also really loves tables and theorycraft, which is about what you would expect.

There are lots of Baldur’s Gate guides out there and the models were mostly drawing from the same ones, so the biggest differences were stylistic. ChatGPT prefers bulleted lists and sentence fragments, while Gemini loves to bold important words.

The biggest surprise was Claude, which was particularly concerned about giving me information that would spoil my experience of the game. When I asked about good party compositions, it closed the guidance by saying “don’t stress too much and just play what sounds fun to you.” Thanks, Claude!

It’s important to bear in mind that this is a subject area where we know (thanks to Business Insider’s reporting) that xAI has specifically focused on reaching parity. So we shouldn’t read too much into the fact that, after the reported sprint, Grok’s advice turned out about the same as the other models’. Still, it’s good to know xAI can make it work if it tries.

Russell Brandom has been covering the tech industry since 2012, with a focus on platform policy and emerging technologies. He previously worked at The Verge and Rest of World, and has written for Wired, The Awl and MIT’s Technology Review. He can be reached at russell.brandom@techcrunch.com or on Signal at 412-401-5489.
