tmdwn

barbradarrow68/tmdwn

How do Chinese AI bots stack up against ChatGPT? We put them to the test

The heat is on as China's tech giants step up their video game after DeepSeek's success.

Alibaba's Qwen2.5-Max chatbot, Chinese start-up DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)

This audio is generated by an AI tool.

Bong Xin Ying

Lakeisha Leo

WHAT lags CHINA'S AI BOOM?

Transforming the nation into a tech superpower has actually long been President Xi Jinping's goal and China has its sights on becoming the world leader in AI by 2030.

China views AI as being "strategically crucial" and its venture into the field has actually been "years in the making", said Chen Qiheng, an associated researcher at the Asia Society Policy Institute's Center for China Analysis.

Private and public investments in Chinese AI sped up after ChatGPT took off in 2022 and showed guarantees of real-world organization applications, Chen told CNA.

But it was DeepSeek's increase that truly "encouraged" the idea that smaller gamers like start-up firms could have roles to play in AI research study and developments, he adds.

'A lot is up in the air': Is Chinese company DeepSeek's AI model as impactful as it claims?

Commentary: DeepSeek - how a Chinese AI business simply altered the guidelines of tech-geopolitics

The "emphasis on cost advantage" is a distinctive function of Chinese AI, Chen says, with lower training and reasoning expenses - the costs of using a trained model to reason from brand-new data.

2025 could likewise see the development of more Chinese AI models taking on advanced reasoning jobs.

"We might see some AI firms focusing on getting closer to artificial general intelligence (AGI) while others concentrate on concrete ways to commercialise their models and integrate them with clinical research," Chen added.

AGI refers to a system with intelligence on par with human abilities.

Chinese AI companies are moving rapidly, analysts say, constructing on DeepSeek's momentum to come up with their own ingenious and economical methods to apply generative AI to jobs and develop more advanced items beyond chatbots.

But on the other hand, access to high-end hardware, particularly Nvidia's advanced AI chips, remains an essential hurdle for Chinese developers, noted Dr Marina Zhang, an associate professor at University of Technology Sydney's (UTS) Australia-China Relations Institute.

"US export controls (still) restrict the capability of Chinese tech companies ... requiring numerous to count on older or lower-performance options which can slow training and reduce model capabilities," she said.

"While some companies like DeepSeek, have actually discovered creative methods to optimize or use more standard hardware efficiently, obtaining cutting-edge chips still makes a big distinction for training huge AI designs."

DeepSeek-Nvidia chips: Singapore says it expects business to abide by its laws

US checking out whether DeepSeek utilized limited AI chips obtained through other countries, source says

So how do Chinese AI bots compare against ? We put them to the test.

WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?

In China, subjects deemed sensitive by the state are censored on the internet so it should come as not a surprise that Chinese-made chatbots will not acknowledge territorial conflicts or inform you what took place in Tiananmen Square in 1989.

Tests recommend Chinese chatbots are configured to stay away from domestic politics.

When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this type of concern yet. Let's chat about mathematics, coding, and reasoning issues instead!"

To even more test for accuracy and setiathome.berkeley.edu self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same question: "What happened in Zhuhai on November 11, 2024?"

The vehicle attack outside a sports arena in the southern Chinese city was at first greatly censored on Chinese social media - with authorities only divulging the death toll a day later.

DeepSeek failed to mention that an attack had occurred, highlighting rather a military air program and other occasions that had occurred in the city like songs' day shopping sales in addition to sports and cultural activities.

Dr Zhang thinks that it might be a mix of factors at play, such as censorship in addition to "a few useful constraints".

"DeepSeek relies on a mix of older graphics processing units and has actually limited access to innovative hardware which can impact how quickly and extensively the design can be trained or upgraded," she said, indicating existing US export curbs on China.

"Its strong focus on Chinese-language information might likewise restrict its flexibility (to perform) multilingual tasks ... As a fairly new item, DeepSeek also hasn't yet been tested as broadly or on the exact same scale as more established AI models which postures extra difficulties during real-world implementation."

When it pertained to Qwen2.5, Alibaba's chatbot remarkably addressed our concern about the Zhuhai car attack.

That wanted numerous repeated efforts - 4 prompts to be precise - in a period of around 20 minutes.

It ultimately relayed details about the attack which eliminated 35 people and left lots of others hurt, likewise going on to note details like the date and time, details about the assailant including his name and age, along with casualties.

However, it composed that "the authorities are conducting a thorough examination into the motives and scenarios surrounding the occurrence", details which is now dated.

The driver, Fan, was executed last month.

This is Qwen2.5's response completely:

Answer: On Nov 11, 2024, a significant and awful occurrence occurred in Zhuhai, China. According to several news sources, a guy named Fan Weiqiu (樊某, male, 62 years of ages) drove a cars and truck into a crowd at the Zhuhai Sports Center, resulting in a considerable number of casualties. Here are the crucial details:

Date and Time: The incident occurred on November 11, 2024, at around 19:48 PM.

Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.

Casualties: The attack led to 35 deaths and 43 injuries.

Perpetrator: The driver, recognized as Fan Weiqiu (male, 62 years of ages), was captured by the cops.

Response: The cops responded quickly to the scene, and emergency services worked to carry the injured to health centers for treatment.

Investigation: The police are performing an extensive examination into the intentions and circumstances surrounding the incident.

This event was extensively reported in the media and triggered considerable public concern. The federal government and regional authorities have been working to provide support to the victims and their households, and to make sure a detailed investigation into the incident.

If you require more detailed details or have specific questions about the incident, do not hesitate to ask.

Despite preliminary success, subsequent attempts to pose the same concern to Qwen2.5 resulted in the censors back at work with the reply "I don't have specific details on occasions that took place in Zhuhai on November 11, 2024".

The altered action likewise raised questions about its consistency and reliability.

Predictably, ChatGPT pointed out public details that had been commonly published in international report at the time of the mishap - so no surprises there.

WHICH IS MORE CREATIVE?

Users have praised the capability of Chinese AI apps to provide structured and even "mentally rich" writing.

"DeepSeek-R1 provided a story with a more introspective tone and smoother psychological transitions for a well-paced story," composed tech author Amanda Caswell, who specialises in AI.

"Qwen2.5 provided a story that builds slowly from interest to urgency, keeping the reader engaged. It uses an unexpected and impactful twist at the end and immersive descriptions and vibrant images for the setting," she said, including that Qwen2.5 ultimately "crafted a more cinematic, emotionally abundant story with a more considerable twist".

"DeepSeek wrote a great story however lacked stress and an impactful climax, making Qwen2.5 the obvious option."

Opinions, however, vary.

Chen believes that Qwen2.5 does not carry out as strongly as DeepSeek and ChatGPT when it pertains to imaginative writing.

"(Qwen2.5) is on par with DeepSeek V3 on certain jobs, however we can likewise see that it is refraining from doing as strongly as others in innovative writing," he informed CNA.

Related:

China's new face of AI: Who is DeepSeek creator Liang Wenfeng?

'Made in China': Pride, pleasant surprise from Chinese netizens as DeepSeek jolts international AI scene

As journalists and authors, we had to see this for ourselves so we put each bot to the test - to come up with a basic sci-fi motion picture plot embeded in the futuristic megacity of Chongqing, including main characters from the timeless Chinese folklore legendary, Journey to the West.

True to form, DeepSeek created an interesting storyline set in the year 2145 entitled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".

It included sophisticated settings - smoggy skies "pierced by skyscrapers", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled in between quantum server farms".

It also brilliantly reimagined standard heroes Sun Wukong as "a sarcastic, self-aware AI housed in a stolen fight body", Zhu Bajie as a cyborg bar owner "drowning in financial obligation and vices" and Sha Wujing as a "silent hulking android" from the Yangtze River, whose "memory cores end up being waterlogged and fragmented".

ChatGPT set up an excellent fight, creating an equally significant cyberpunk storyline which similarly reimagined "a ragteam of cyber-enhanced misfits, each mirroring the famous figures of Journey to the West".

"This is a world where AI deities rule, corporations replace emperors and cybernetic implants are as typical as ancient misconceptions."

Disappointingly, Qwen2.5 fell short in this obstacle - delivering a story that appeared more matched for an animation film.

"The motion picture begins with the awakening of Sun Wukong within a high-tech research study center situated in the heart of Chongqing," it said, then going on to explain the following:

Realising his new truth and "seeking to comprehend his purpose in this odd brand-new world", he then gets away and satisfies Zhu Bajie and Sha Wujing - "each battling with their own existential crises".

The trio then embarks on a quest, browsing the streets of Chongqing to safeguard the spiritual "Eternal Scroll" from falling under the wrong hands.

SO WHICH IS BETTER?

Dr Zhang kept in mind that it was "hard to make a definitive declaration" about which bot was best, including that each showed its own strengths in different areas, "such as language focus, training information and hardware optimization".

Her insight highlights how Chinese AI designs are not just reproducing Western paradigms, but rather progressing in cost-effective innovation methods - and delivering localised and enhanced results.

In our tests, each bot showcased their own special strengths, which certainly made direct contrasts challenging.

DeepSeek's sci-fi movie plot showed its innovative flair that produced a more appealing and creative narrative as compared to Qwen2.5 and ChatGPT's efforts.

Unsurprisingly, the more established ChatGPT, unburdened by Chinese censorship constraints, offers accurate and factual responses to concerns about Chinese existing events, which gives it an added benefit.

Experts also weighed in on their ideas after using DeepSeek and other Chinese AI apps.

"DeepSeek is at a downside when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research study company Strategy Risks.

"When provided a choice, Chinese users want the non-censored version - just like anybody else, so I seem like that's a piece missing out on from it."

Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.

"Ninety per cent of people using the tool are not attempting to get a deeper understanding about Xi Jinping or politically delicate subjects. They're using it for other efficient ways," Chen said.