Elon Musk's artificial intelligence company xAI, a rival to OpenAI, has unveiled its first version of chatbot Grok, just couple of weeks after it introduced Grok-1.5, Mashable reported
The Tesla CEO said Grok-1.5V is its "first-generation multimodal model."
Grok-1.5V can not only "understand" images but also process visual information such as "documents, diagrams, charts, screenshots and photographs."
According to the company, Grok AI can reason through complicated texts, science diagrams, charts, screenshots, and images in addition to responding to your provided pictures and screenshots.
Furthermore, Grok-1.5V will acquire "real-world spatial understanding" to enhance its comprehension of the real world as portrayed in the photographs that its users upload.
The company in a statement said, "Advancing both our multimodal understanding and generation capabilities are important steps in building beneficial AGI that can understand the universe. In the coming months, we anticipate to make significant improvements in both capabilities, across various modalities such as images, audio, and video."
Examples of what Grok can do include converting diagram into python code, turning a child’s drawing into a story, pinpointing largest objects amongst a group of many and telling a driver if they have enough space to drive around an obstacle.
Grok-1.5V will be available to early testers and select users soon.
Scientists think that ice giant's weakness of belts is connected to its magnetic field
European space officials declare Ariane 6 maiden trip a success despite encountering a glitch
WhatsApp's new feature has already started rolling out globally and will be available to all users in coming weeks
Black hole has mass equivalent to two billion suns, feeds on surrounding matter
Platform also deletes over 20 million accounts suspected of belonging to individuals under the age of 13
Millisecond pulsar spins hundreds of times per second, is first of its kind found in Glimpse-C01 star cluster