Elon Musk’s xAI is working on making Grok multimodal

Elon Musk grins in a photo illustration, lifting his arms over his head triumphantly
Illustration by Kristen Radtke / The Verge; Getty Images

Elon Musk’s AI company, xAI, is making progress on adding multimodal inputs to its Grok chatbot, according to public developer documents. That means users may soon be able to upload photos to Grok and receive text-based answers.

The feature was first teased last month in an xAI blog post, which said Grok-1.5V will offer “multimodal models in a number of domains.” The latest update to the developer documents appears to show progress on shipping a new model.

In the developer documents, a sample Python script demonstrates how developers can use the xAI software development kit (SDK) library to generate a response based on both text and images. This script reads an image file, sets up a text prompt, and uses the xAI SDK to generate a…
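The sample script itself is not reproduced here, so what follows is only a rough sketch of the first two steps described above: reading an image file and pairing it with a text prompt. It uses the Python standard library alone and makes no xAI SDK call. The function names and the payload fields (`prompt`, `image_base64`) are assumptions made for illustration and are not taken from xAI's documentation.

```python
import base64
import tempfile
from pathlib import Path


def load_image(image_path: str) -> bytes:
    """Read the raw bytes of an image file from disk."""
    return Path(image_path).read_bytes()


def build_multimodal_request(image_bytes: bytes, prompt: str) -> dict:
    """Bundle a text prompt and an image into one request payload.

    The field names are hypothetical; a real SDK may expect a
    different structure entirely.
    """
    return {
        "prompt": prompt,
        # Base64 turns binary image data into plain ASCII text,
        # a common way to carry images inside text-based requests.
        "image_base64": base64.b64encode(image_bytes).decode("ascii"),
    }


if __name__ == "__main__":
    # Write a few placeholder bytes so the sketch runs without a real photo.
    with tempfile.TemporaryDirectory() as tmp_dir:
        image_path = Path(tmp_dir) / "photo.jpg"
        image_path.write_bytes(b"placeholder image bytes")

        request = build_multimodal_request(
            load_image(str(image_path)),
            "What is in this image?",
        )
        print(sorted(request.keys()))
```

In a real integration, a payload like this would be handed to whatever call the SDK provides for generating a response. That step is left out because the documents are only summarized here.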

