We thoroughly tested the Grok 3 model and came away surprised by its capabilities as it is a model that outperforms o3-mini, ...
With Grok-3, xAI aims to outsmart the competition. We pit it against GPT-4o, Gemini, DeepSeek, and Claude 3.5 Sonnet to see ...
A red team got xAI's latest model to reveal its system prompt, provide instructions for making a bomb, and worse. Much worse.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results