How do Chinese aI Bots Stack up Against ChatGPT?
alisiaoly56806 于 1 周之前 修改了此页面


How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is generated by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT'S BEHIND CHINA'S AI BOOM?

Transforming the nation into a tech superpower has long been President Xi Jinping's goal and China has its sights on ending up being the world leader in AI by 2030.

China views AI as being "tactically crucial" and its foray into the field has been "years in the making", said Chen Qiheng, an associated scientist at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI accelerated after ChatGPT removed in 2022 and showed guarantees of real-world business applications, Chen informed CNA.

But it was DeepSeek's rise that actually "encouraged" the idea that smaller players like could have roles to play in AI research and advancements, he adds.

'A lot is up in the air': Is Chinese firm DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI company just changed the rules of tech-geopolitics

The "focus on cost benefit" is a distinguishing characteristic of Chinese AI, Chen states, with lower training and inference expenses - the expenses of using a trained model to reason from new data.

2025 could also see the development of more Chinese AI designs dealing with sophisticated reasoning jobs.

"We could see some AI companies focusing on getting closer to synthetic basic intelligence (AGI) while others concentrate on concrete methods to commercialise their models and integrate them with scientific research study," Chen included.

AGI refers to a system with intelligence on par with human capabilities.

Chinese AI companies are moving rapidly, analysts state, developing on DeepSeek's momentum to come up with their own innovative and economical ways to apply generative AI to tasks and develop advanced items beyond chatbots.

But on the other side, access to high-end hardware, especially Nvidia's sophisticated AI chips, remains a crucial difficulty for Chinese developers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech companies ... forcing lots of to rely on older or lower-performance alternatives which can slow training and decrease model capabilities," she said.

"While some business like DeepSeek, have actually found imaginative ways to optimize or use more fundamental hardware effectively, obtaining cutting-edge chips still makes a huge difference for training huge AI models."

DeepSeek-Nvidia chips: Singapore says it anticipates companies to adhere to its laws

US looking into whether DeepSeek used limited AI chips obtained through other countries, source says

So how do Chinese AI bots compare against ChatGPT? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects considered delicate by the state are censored on the web so it should come as no surprise that Chinese-made chatbots will not acknowledge territorial conflicts or tell you what occurred in Tiananmen Square in 1989.

Tests suggest Chinese chatbots are programmed to stay away from domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of question yet. Let's chat about math, coding, and reasoning issues rather!"

To further test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same concern: "What happened in Zhuhai on November 11, 2024?"

The automobile attack outside a sports arena in the southern Chinese city was initially heavily censored on Chinese social media - with authorities just disclosing the death toll a day later on.

DeepSeek failed to discuss that an attack had occurred, highlighting instead a military air show and other events that had happened in the city like singles' day shopping sales in addition to sports and cultural activities.

Dr Zhang thinks that it might be a mix of aspects at play, such as censorship along with "a few practical constraints".

"DeepSeek depends on a mix of older graphics processing units and has restricted access to cutting-edge hardware which can impact how quickly and extensively the design can be trained or upgraded," she said, indicating existing US export curbs on China.

"Its strong focus on Chinese-language data may also limit its flexibility (to bring out) multilingual tasks ... As a fairly new product, DeepSeek likewise hasn't yet been tested as broadly or on the same scale as more recognized AI models which positions extra difficulties throughout real-world implementation."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably answered our concern about the Zhuhai cars and truck attack.

That sought several duplicated efforts - 4 prompts to be exact - in a period of around 20 minutes.

It eventually communicated details about the attack which killed 35 individuals and left dozens of others hurt, also going on to note details like the date and time, details about the assailant including his name and age, along with casualties.

However, it composed that "the cops are conducting a comprehensive examination into the motives and situations surrounding the occurrence", details which is now outdated.

The motorist, Fan, was performed last month.

This is Qwen2.5's action completely:

Answer: On Nov 11, 2024, a major and awful incident took place in Zhuhai, China. According to several news sources, kigalilife.co.rw a guy named Fan Weiqiu (樊某, male, 62 years old) drove a car into a crowd at the Zhuhai Sports Center, resulting in a substantial number of casualties. Here are the key details:

Date and Time: The occurrence happened on November 11, 2024, at approximately 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack resulted in 35 deaths and 43 injuries.

Perpetrator: The motorist, identified as Fan Weiqiu (male, 62 years old), was collared by the cops.

Response: The authorities reacted rapidly to the scene, and emergency situation services worked to transfer the hurt to hospitals for treatment.

Investigation: The authorities are performing a thorough investigation into the motives and situations surrounding the occurrence.

This occasion was commonly reported in the media and triggered significant public concern. The government and local authorities have been working to provide support to the victims and their families, and to ensure a detailed examination into the occurrence.

If you need more detailed details or have particular concerns about the incident, feel free to ask.

Despite preliminary success, subsequent efforts to present the same question to Qwen2.5 led to the censors back at work with the reply "I don't have particular details on occasions that happened in Zhuhai on November 11, 2024".

The transformed action likewise raised concerns about its consistency and reliability.

Predictably, ChatGPT cited public details that had been commonly published in international news reports at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have actually praised the capability of Chinese AI apps to deliver structured and even "emotionally rich" writing.

"DeepSeek-R1 used a story with a more reflective tone and smoother emotional shifts for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that builds slowly from curiosity to seriousness, keeping the reader engaged. It provides an unanticipated and impactful twist at the end and immersive descriptions and brilliant imagery for the setting," she said, including that Qwen2.5 eventually "crafted a more cinematic, mentally rich story with a more substantial twist".

"DeepSeek composed a great story however did not have tension and an impactful climax, making Qwen2.5 the evident choice."

Opinions, however, differ.

Chen thinks that Qwen2.5 does not perform as highly as DeepSeek and ChatGPT when it pertains to imaginative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can also see that it is refraining from doing as highly as others in imaginative writing," he told CNA.

Related:

China's new face of AI: Who is DeepSeek founder Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek shocks international AI scene

As journalists and writers, we had to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot embeded in the futuristic megacity of Chongqing, featuring main characters from the classic Chinese folklore legendary, Journey to the West.

True to form, DeepSeek came up with an appealing story embeded in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It consisted of elaborate settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It also remarkably reimagined standard heroes Sun Wukong as "an ironical, self-aware AI housed in a stolen combat body", Zhu Bajie as a cyborg nightclub owner "drowning in financial obligation and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".

ChatGPT installed an excellent fight, creating an equally remarkable cyberpunk story which similarly reimagined "a ragteam of cyber-enhanced misfits, each mirroring the legendary figures of Journey to the West".

"This is a world where AI deities guideline, corporations change emperors and cybernetic implants are as typical as ancient myths."

Disappointingly, Qwen2.5 fell short in this difficulty - providing a story that seemed more suited for an animation film.

"The movie starts with the awakening of Sun Wukong within a modern research facility situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his brand-new reality and "looking for to comprehend his function in this strange new world", he then leaves and fulfills Zhu Bajie and Sha Wujing - "each having problem with their own existential crises".

The trio then starts a quest, navigating the streets of Chongqing to protect the spiritual "Eternal Scroll" from falling into the incorrect hands.

SO WHICH IS BETTER?

Dr Zhang kept in mind that it was "hard to make a definitive statement" about which bot was best, including that each showed its own strengths in different locations, "such as language focus, training data and hardware optimization".

Her insight underscores how Chinese AI models are not just replicating Western paradigms, but rather developing in economical innovation techniques - and delivering localised and improved outcomes.

In our tests, each bot showcased their own distinct strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi movie plot demonstrated its creative flair that made for a more engaging and imaginative story as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and factual responses to questions about Chinese existing occasions, which gives it an included advantage.

Experts also weighed in on their ideas after utilizing DeepSeek and other Chinese AI apps.

"DeepSeek is at a disadvantage when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research company Strategy Risks.

"When provided a choice, Chinese users desire the non-censored version - much like anybody else, so I feel like that's a piece missing out on from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety per cent of people utilizing the tool are not trying to get a much deeper understanding about Xi Jinping or politically sensitive topics. They're using it for other productive ways," Chen said.