
Leaked AI Dataset Reveals China’s Censorship Ambitions

A leaked dataset has revealed how Chinese entities are training large language models (LLMs) to automate political censorship on a massive scale. The dataset, containing over 133,000 real-world content examples, includes posts about government corruption, rural poverty, Taiwan, and military affairs, topics the Chinese state typically considers sensitive.

According to TechCrunch, the system is designed to flag this content automatically, providing a glimpse into how artificial intelligence is being deployed to refine and scale digital repression. UC Berkeley researcher Xiao Qiang, who examined the dataset, argued that this is clear evidence that the Chinese government or its affiliates want to use LLMs to improve repression.

Unlike older systems that relied on keyword filters and human moderation, this LLM-based approach enables more efficient control over online discourse.

Security researcher NetAskari discovered the unsecured database on a Baidu server and found that it contained entries as recent as December 2024. Although its creators have not been identified, the dataset is marked for “public opinion work,” a term widely associated with censorship operations led by the Cyberspace Administration of China.

While the TechCrunch report does not name a specific model, separate investigations suggest that models from DeepSeek, one of China’s most prominent developers of open-source LLMs, already exhibit censorship behaviors consistent with the leaked system’s goals.

WIRED tested DeepSeek-R1 across platforms and found that the model censors topics like Taiwan and Tiananmen through both app-level filtering and pre-programmed bias. In one case, the model’s internal reasoning noted the need to “avoid mentioning events that could be sensitive,” while emphasizing China’s achievements under the Communist Party.

Adding to the long list of concerns, Feroot Security recently found that DeepSeek’s platform contains hidden code transmitting user data to servers controlled by China Mobile, a state-owned telecom company under US sanctions.

As AI tools become more embedded in everyday platforms, experts warn that state-aligned models could shape global information flows. Several countries, including the US, Italy, and Australia, are now evaluating bans or restrictions on Chinese AI systems. The rise of censorship-enabled LLMs, researchers say, marks a turning point in how digital authoritarianism is executed.


