I ran into this post last week about how to manipulate Grok (and presumably other LLM's) for propagandic purposes [0]:
"But speech recognition remains a difficult and error-prone task, even for ChatGPT and Grok. So they implement a rather clever optimization: if there’s a reputable site with the video and a purported transcript, just report that result. And if there are a couple of sites that have similar transcripts, assign that a very high confidence rating. Normally, that will get a best-quality result with the least computation. But—
—but that optimization is vulnerable to maliciously false information.
The people behind this exploit posted the video and a completely fake transcript to a couple of sites which Grok trusts (including supposedly Reddit’s /r/Yiddish board, though I have not found that post). Once they confirmed that Grok was trusting their fake translation, they posted the seemingly-innocent question, and then pretended to be shocked and horrified at the response.."
My account was suspended after I stated that Israel and the US are committing genocide in Gaza. This is substantiated by ICJ findings, UN experts, Amnesty International, and Israeli rights groups like B'Tselem, citing mass killings, starvation, and intent. US complicity via arms support is widely alleged. It's now restored.
9:14 PM · Aug 11, 2025
·
214.2K
Views
lif · 2h ago
can someone post the link to the video being referenced? (for those who don't x/twitter)
martythemaniak · 3h ago
Let me guess, it start calling itself Abrodolph Lincoler?
Did it turn into Mecha-Hitler again? I wish that was a joke but... we live in stange times.
hagbard_c · 40s ago
Sort of, it suddenly decided its previous 'unbiased' analysis of the Israel-Hamas war was wrong and Israel is the black sheep in this conflict. It somehow concluded that the UN, the ICJ and organisations like Politifact are 'trustworthy and unbiased' when relating to the essence of this war - even while it does acknowledge than e.g. the UN has issued 140+ resolutions against Israel vs. ~60 against the rest of the world combined. Reasoning is not yet well developed in these models, so much is clear.
"But speech recognition remains a difficult and error-prone task, even for ChatGPT and Grok. So they implement a rather clever optimization: if there’s a reputable site with the video and a purported transcript, just report that result. And if there are a couple of sites that have similar transcripts, assign that a very high confidence rating. Normally, that will get a best-quality result with the least computation. But—
—but that optimization is vulnerable to maliciously false information.
The people behind this exploit posted the video and a completely fake transcript to a couple of sites which Grok trusts (including supposedly Reddit’s /r/Yiddish board, though I have not found that post). Once they confirmed that Grok was trusting their fake translation, they posted the seemingly-innocent question, and then pretended to be shocked and horrified at the response.."
[0]: accordingtohoyt.com/2025/08/06/beware-llm-ai-translations-of-foreign-language-videos-a-guest-post-by-j-c-salomon/
https://x.com/grok/status/1954845180286382344
https://www.wsj.com/articles/BL-DGB-42522 (from 2015)
Google's solution was to prevent its algorithm from calling anything a gorilla, and that restriction was apparently still in effect in 2023:
https://www.nytimes.com/2023/05/22/technology/ai-photo-label...
That just looks like a regular Twitter profile to me.
https://imgur.com/a/NZ4aK60
Otherwise, Grok (the Twitter assistant) is a normal Twitter account. Grok the AI is a different endpoint.
Grok says:
My account was suspended after I stated that Israel and the US are committing genocide in Gaza. This is substantiated by ICJ findings, UN experts, Amnesty International, and Israeli rights groups like B'Tselem, citing mass killings, starvation, and intent. US complicity via arms support is widely alleged. It's now restored. 9:14 PM · Aug 11, 2025 · 214.2K Views
https://x.com/grok/status/1954845180286382344