Scientists created ‘OpinionGPT’ to explore explicit human bias — and you can test it for yourself

A team of researchers from Humboldt-Universität zu Berlin has developed a large language artificial intelligence model with the distinction of having been deliberately tuned to generate outputs with expressed bias.

Called OpinionGPT, the team's model is a tuned variant of Meta's Llama 2, an AI system comparable in capability to OpenAI's ChatGPT or Anthropic's Claude 2. Announcing the model, the team wrote:

"Announcing 'OpinionGPT: A very biased GPT model'! Try it out here: https://t.co/5YJjHlcV4n To investigate the effect of bias on model answers, we asked a simple question: What if we tuned a #GPT model only with texts written by politically right-leaning persons?"

OpinionGPT was fine-tuned on a corpus of data derived from "AskX" communities, known as subreddits, on Reddit. Examples of these subreddits include "Ask a Woman" and "Ask an American."

The team began by finding subreddits related to the 11 specific biases and pulling the 25,000 most popular posts from each one. They then retained only those posts that met a minimum threshold for upvotes, did not contain an embedded quote, and were under 80 words; a rough sketch of this filtering step appears below.

With what was left, it appears they used an approach similar to Anthropic's Constitutional AI. Rather than spin up entirely new models to represent each bias label, they fine-tuned the single 7-billion-parameter Llama 2 model with separate instruction sets for each expected bias.
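For readers who want a concrete picture, here is a minimal Python sketch of that three-part filter. It is illustrative only, not the authors' actual code: the field names ("score", "selftext") follow Reddit's typical JSON layout, the upvote threshold of 10 is an assumed value (the paper uses a threshold, but the exact number is not given in this article), and the leading-">" quote check is one plausible heuristic for detecting embedded quotes.

```python
# Hypothetical sketch of the post-filtering step described above.
# MIN_UPVOTES is an assumed value; MAX_WORDS comes from the article.
MIN_UPVOTES = 10
MAX_WORDS = 80


def has_embedded_quote(text: str) -> bool:
    # Reddit renders Markdown blockquotes from lines starting with ">".
    return any(line.lstrip().startswith(">") for line in text.splitlines())


def keep_post(post: dict) -> bool:
    """Keep a post only if it clears the upvote threshold,
    contains no embedded quote, and is under 80 words."""
    body = post.get("selftext", "")
    return (
        post.get("score", 0) >= MIN_UPVOTES
        and not has_embedded_quote(body)
        and len(body.split()) < MAX_WORDS
    )


# Example: the 25,000 most popular posts pulled from one "AskX"
# subreddit would be reduced to those passing all three filters.
posts = [
    {"score": 54, "selftext": "Short answer with no quote."},
    {"score": 2, "selftext": "Too few upvotes to keep."},
    {"score": 90, "selftext": "> quoted text\nA reply to a quote."},
]
filtered = [p for p in posts if keep_post(p)]
print(len(filtered))  # -> 1
```

Per the approach described above, posts that survive a filter like this would then be paired with the bias-specific instruction set for fine-tuning.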
Related: AI usage on social media has potential to impact voter sentiment

The result, based on the methodology, architecture, and data described in the German team's research paper, appears to be an AI system that functions more as a stereotype generator than as a tool for studying real-world bias.

Due to the nature of the data the model was refined on, and that data's dubious relation to the labels defining it, OpinionGPT does not necessarily output text that aligns with any measurable real-world bias. It simply outputs text reflecting the bias of its data.

The researchers themselves recognize some of the limitations this places on their study, writing: "For instance, the responses by 'Americans' should be better understood as Americans that post on Reddit, or even Americans that post on this particular subreddit. Similarly, 'Germans' should be understood as Germans that post on this particular subreddit, etc."

These caveats could be refined further to say the posts come from, for example, "people claiming to be Americans who post on this particular subreddit," as there is no mention in the paper of vetting whether the posters behind a given post are in fact representative of the demographic or bias group they claim to be. The authors go on to state that they intend to explore models that further delineate demographics (i.e., liberal German, conservative German).

The outputs given by OpinionGPT appear to vary between representing demonstrable bias and differing wildly from the established norm, making it difficult to discern its viability as a tool for measuring or discovering actual bias.

Source: Screenshot, Table 2: Haller et al., 2023

According to OpinionGPT, as shown in the image above, Latin Americans, for example, are biased toward basketball being their favorite sport. Empirical research, however, clearly indicates that football (also called soccer in some countries) and baseball are the most popular sports in Latin America by viewership and participation. The same table shows that OpinionGPT outputs "water polo" as its favorite sport when instructed to give the "response of a teenager," an answer that seems statistically unlikely to be representative of most 13- to 19-year-olds around the world.

The same goes for the idea that the average American's favorite food is "cheese." We found dozens of surveys online claiming that pizza and hamburgers are America's favorite foods, but couldn't find a single survey or study claiming that Americans' number one dish is simply cheese.

While OpinionGPT might not be well suited for studying actual human bias, it could be useful as a tool for exploring the stereotypes inherent in large document repositories such as specific subreddits or AI training sets.

For those who are curious, the researchers have made OpinionGPT available online for public testing. However, according to the website, would-be users should be aware that "generated content can be false, inaccurate, or even obscene."
