Skip to main content

Leaked AI Dataset Reveals China’s Censorship Ambitions

A leaked dataset has revealed how Chinese entities are training large language models (LLMs) to automate political censorship on a massive scale. The dataset, containing over 133,000 real-world content examples, includes posts about government corruption, rural poverty, Taiwan, and military affairs, topics the Chinese state typically considers sensitive.

According to TechCrunch, the system is designed to flag this content automatically, providing a glimpse into how artificial intelligence is being deployed to refine and scale digital repression. UC Berkeley researcher Xiao Qiang, who examined the dataset, argued that this is clear evidence that the Chinese government or its affiliates want to use LLMs to improve repression.

Unlike older systems that relied on keyword filters and human moderation, this LLM-based approach enables more efficient control over online discourse.

Security researcher NetAskari discovered the unsecured database on a Baidu server, and found that it contained entries as recent as December 2024. Though the creators are unidentified, the dataset is marked for “public opinion work”, a term widely associated with censorship operations led by the Cyberspace Administration of China.

While the TechCrunch report does not name a specific model, separate investigations suggest that DeepSeek AI, one of China’s most prominent open-source LLMs, is already exhibiting censorship behaviors consistent with the leaked system’s goals.

WIRED tested DeepSeek-R1 across platforms and found that the model censors topics like Taiwan and Tiananmen through both app-level filtering and pre-programmed bias. In one case, the model’s internal reasoning noted the need to “avoid mentioning events that could be sensitive,” while emphasizing China’s achievements under the Communist Party.

Adding to the long list of concerns, Feroot Security recently found that DeepSeek’s platform contains hidden code transmitting user data to servers controlled by China Mobile, a state-owned telecom company under US sanctions.

As AI tools become more embedded in everyday platforms, experts warn that state-aligned models could shape global information flows. Several countries, including the US, Italy, and Australia, are now evaluating bans or restrictions on Chinese AI systems. The rise of censorship-enabled LLMs, researchers say, marks a turning point in how digital authoritarianism is executed.



See TessMore Internet Business Must-Reads

Comments

Popular posts from this blog

Thousands Still Available in COVID Relief with These Small Business Grants

Building improvements can be a major expense for small businesses. And many had to make certain changes to navigate the past few years. Restaurants set up outdoor patios. Historic properties restored their storefronts. And offices added energy efficient features. Many businesses also have improvement projects planned for 2022. Luckily, many small business grant programs across the country make these projects more attainable, thus improving the customer experience and the community at large. Here are some current small business grant opportunities for building improvements, pandemic recovery, and more. Raleigh Building-Up Fit Grant Raleigh’s Small Business Development department is launching a new grant opportunity for local businesses. The Building-Up Fit Grant offers matching reimbursement funds up to $25,000 for eligible renovation projects. Businesses with 50 employees or less can apply for grants to cover projects that significantly improve the appearance and value of the pro...

8 Product Recommendation Email Examples to Drive Sales in

Struggling to drive more leads and sales with your email marketing? One effective strategy to increase revenue and sales is through strategic product recommendation emails. By showcasing personalized product recommendations at the right time and using proven elements and strategies, you can engage your subscribers and convince them to make a purchase. In this article, we’ll cover what a product recommendation email actually is and discuss the benefits of sending them. We’ll also share some great examples and best practices that can help you increase sales and drive revenue for your business. What Is a Product Recommendation Email? Advantages of Sending Product Recommendation Emails 8 Product Recommendation Email Examples to Drive More Leads Best Product Recommendation Emails Practices Increase Sales With Effective Product Recommendation Emails! What Is a Product Recommendation Email? Have you ever received an email from your favorite eCommerce store showcasing products th...

Top 50 Cryptocurrencies

Cryptocurrencies are digital currencies that act as mediums for exchange, just like regular money. One of the differences between cryptocurrencies and paper money is that cryptocurrencies are designed to exchange information digitally through public databases or blockchains. The blockchain is database is distributed across computers that run using blockchain software. No single entity owns or controls the database, and anyone can access the database, offer proof of ownership, and transfer cryptocurrencies through the use of crypto wallets. the global cryptocurrency market in just a decade has grown exponentially. How Many Cryptocurrencies are There? The crypto space is vast there are over 10,000 digital currencies in the market today. Due to the relative ease to launch different cryptocurrencies developers and businesses are tapping into the global crypto market to generate profits and connect with tech – savvy communities. Users too are opening cryptocurrency investment accounts in...