How do Chinese aI Bots Stack up Against ChatGPT?
Adam Hanigan 於 1 周之前 修改了此頁面


How do AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is created by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT'S BEHIND CHINA'S AI BOOM?

Transforming the country into a tech superpower has actually long been President Xi Jinping's objective and China has its sights on becoming the world leader in AI by 2030.

China views AI as being "strategically crucial" and its foray into the field has actually been "years in the making", said Chen Qiheng, an affiliated researcher at the Asia Society Policy Institute's Center for China Analysis.

Private and public financial investments in Chinese AI accelerated after ChatGPT took off in 2022 and revealed guarantees of real-world organization applications, Chen informed CNA.

But it was DeepSeek's rise that actually "urged" the idea that smaller players like start-up firms might have functions to play in AI research and advancements, he includes.

'A lot is up in the air': Is Chinese firm DeepSeek's AI design as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI company simply changed the guidelines of tech-geopolitics

The "emphasis on cost benefit" is a distinguishing characteristic of Chinese AI, Chen says, with lower training and inference expenses - the costs of using a trained design to draw conclusions from new data.

2025 could likewise see the development of more Chinese AI designs taking on sophisticated reasoning tasks.

"We could see some AI firms focusing on getting closer to artificial general intelligence (AGI) while others concentrate on concrete methods to commercialise their designs and integrate them with clinical research," Chen included.

AGI describes a system with intelligence on par with human abilities.

Chinese AI companies are moving rapidly, experts state, constructing on DeepSeek's momentum to come up with their own innovative and cost-efficient ways to use generative AI to tasks and develop advanced products beyond chatbots.

But on the other hand, access to high-end hardware, especially Nvidia's innovative AI chips, remains a crucial obstacle for Chinese developers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech business ... requiring many to count on older or lower-performance options which can slow training and lower model capabilities," she said.

"While some business like DeepSeek, have discovered creative ways to enhance or use more standard hardware efficiently, obtaining cutting-edge chips still makes a huge difference for training huge AI designs."

DeepSeek-Nvidia chips: Singapore says it expects business to comply with its laws

US looking into whether DeepSeek used limited AI chips obtained through other countries, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered sensitive by the state are censored on the web so it ought to come as not a surprise that Chinese-made chatbots will not acknowledge territorial disagreements or inform you what took place in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are programmed to stay away from domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of concern yet. Let's chat about math, coding, and reasoning issues rather!"

To even more test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the exact same question: "What happened in Zhuhai on November 11, 2024?"

The vehicle attack outside a sports stadium in the southern Chinese city was initially heavily censored on Chinese social networks - with authorities only revealing the death toll a day later on.

DeepSeek failed to discuss that an attack had actually happened, highlighting instead a military air show and other occasions that had actually happened in the city like songs' day shopping sales as well as sports and cultural activities.

Dr Zhang believes that it may be a mix of aspects at play, such as censorship as well as "a couple of useful constraints".

"DeepSeek depends on a mix of older graphics processing units and has actually restricted access to innovative hardware which can affect how rapidly and thoroughly the model can be trained or upgraded," she said, pointing to existing US export curbs on China.

"Its strong concentrate on Chinese-language information might also limit its flexibility (to bring out) multilingual tasks ... As a fairly brand-new item, DeepSeek also hasn't yet been tested as broadly or on the same scale as more recognized AI models which presents extra challenges during real-world release."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our question about the Zhuhai cars and truck attack.

That was after multiple duplicated efforts - 4 prompts to be exact - in a span of around 20 minutes.

It ultimately relayed details about the attack which eliminated 35 individuals and left lots of others hurt, likewise going on to note details like the date and time, details about the assailant including his name and age, along with casualties.

However, it composed that "the authorities are conducting a comprehensive investigation into the intentions and circumstances surrounding the incident", details which is now outdated.

The driver, Fan, was performed last month.

This is Qwen2.5's response in complete:

Answer: On Nov 11, 2024, a significant and terrible incident happened in Zhuhai, China. According to numerous news sources, a man named Fan Weiqiu (樊某, male, 62 years old) drove a vehicle into a crowd at the Zhuhai Sports Center, resulting in a significant number of casualties. Here are the crucial details:

Date and Time: The occurrence occurred on November 11, 2024, at approximately 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The chauffeur, recognized as Fan Weiqiu (male, 62 years old), was captured by the cops.

Response: The police reacted rapidly to the scene, and emergency services worked to carry the hurt to health centers for treatment.

Investigation: The police are carrying out a thorough examination into the motives and situations surrounding the event.

This occasion was commonly reported in the media and triggered considerable public concern. The federal government and regional authorities have actually been working to supply assistance to the victims and their families, and to guarantee a detailed examination into the incident.

If you need more detailed details or have particular concerns about the event, feel free to ask.

Despite initial success, subsequent attempts to position the same question to Qwen2.5 resulted in the censors back at work with the reply "I do not have particular details on occasions that took place in Zhuhai on November 11, 2024".

The transformed response likewise raised concerns about its consistency and hb9lc.org reliability.

Predictably, ChatGPT pointed out public details that had actually been widely released in international news reports at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have praised the capability of Chinese AI apps to provide structured and even "mentally rich" writing.

"DeepSeek-R1 used a story with a more reflective tone and smoother psychological transitions for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that constructs gradually from interest to seriousness, keeping the reader engaged. It uses an unanticipated and impactful twist at the end and immersive descriptions and vibrant imagery for the setting," she said, adding that Qwen2.5 ultimately "crafted a more cinematic, emotionally abundant story with a more substantial twist".

"DeepSeek wrote a good story however did not have tension and an impactful climax, making Qwen2.5 the apparent choice."

Opinions, though, differ.

Chen believes that Qwen2.5 does not carry out as highly as DeepSeek and ChatGPT when it pertains to creative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, but we can likewise see that it is refraining from doing as strongly as others in innovative writing," he informed CNA.

Related:

China's brand-new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek shocks worldwide AI scene

As journalists and authors, we needed to see this for ourselves so we put each bot to the test - to come up with a fundamental sci-fi movie plot embeded in the futuristic megacity of Chongqing, including main characters from the traditional Chinese folklore epic, Journey to the West.

True to form, DeepSeek came up with an interesting storyline set in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It included intricate settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that float above neon-lit streets" and "ancient temples nestled between quantum server farms".

It also remarkably reimagined traditional heroes Sun Wukong as "an ironical, self-aware AI housed in a stolen fight body", Zhu Bajie as a cyborg nightclub owner "drowning in financial obligation and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT installed a good fight, developing a similarly remarkable cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".

"This is a world where AI deities guideline, corporations change emperors and cybernetic implants are as common as ancient myths."

Disappointingly, Qwen2.5 fell short in this challenge - delivering a story that seemed more fit for an animation film.

"The film begins with the awakening of Sun Wukong within a high-tech research study facility situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new truth and "seeking to understand his purpose in this weird brand-new world", he then gets away and fulfills Zhu Bajie and Sha Wujing - "each having problem with their own existential crises".

The trio then starts a quest, browsing the streets of Chongqing to secure the sacred "Eternal Scroll" from falling into the wrong hands.

SO WHICH IS BETTER?

Dr Zhang noted that it was "difficult to make a definitive statement" about which bot was best, adding that each displayed its own strengths in various areas, "such as language focus, training data and hardware optimization".

Her insight underscores how Chinese AI models are not simply replicating Western paradigms, but rather progressing in affordable development approaches - and providing localised and enhanced outcomes.

In our tests, each bot showcased their own special strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi motion picture plot showed its creative flair that produced a more interesting and imaginative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, offers precise and accurate actions to concerns about Chinese current events, which gives it an included advantage.

Experts likewise weighed in on their ideas after utilizing DeepSeek and other Chinese AI apps.

"DeepSeek is at a drawback when it pertains to censorship constraints," noted Isaac Stone Fish, creator and CEO of the research company Strategy Risks.

"When offered a choice, Chinese users desire the non-censored version - much like anyone else, so I feel like that's a piece missing from it."

Independent Beijing-based consultant Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, particularly for Chinese users.

"Ninety per cent of individuals utilizing the tool are not attempting to get a deeper understanding about Xi Jinping or politically delicate topics. They're using it for other efficient ways," Chen said.