How do Chinese aI Bots Stack up Against ChatGPT?
How do Chinese AI bots stack up against ChatGPT? We put them to the test
The heat is on as China's tech giants step up their game after DeepSeek's success.
Alibaba's Qwen2.5-Max chatbot, Chinese startup DeepSeek and OpenAI's ChatGPT. (Photos: Reuters/Dado Ruvic, AFP/Sebastien Bozon)
This audio is created by an AI tool.
Bong Xin Ying
Lakeisha Leo
WHAT lags CHINA'S AI BOOM?
Transforming the nation into a tech superpower has actually long been President Xi Jinping's goal and China has its sights on becoming the world leader in AI by 2030.
China views AI as being "tactically crucial" and its venture into the field has been "years in the making", said Chen Qiheng, an associated researcher at the Asia Society Policy Institute's Center for China Analysis.
Private and public investments in Chinese AI sped up after ChatGPT took off in 2022 and revealed promises of real-world service applications, Chen told CNA.
But it was DeepSeek's rise that truly "encouraged" the idea that smaller gamers like start-up firms might have roles to play in AI research study and advancements, he adds.
'A lot is up in the air': Is Chinese firm DeepSeek's AI design as impactful as it claims?
Commentary: DeepSeek - how a Chinese AI company simply altered the rules of tech-geopolitics
The "emphasis on cost benefit" is an unique function of Chinese AI, Chen says, with lower training and inference costs - the expenses of using a trained design to reason from brand-new data.
2025 could likewise see the introduction of more Chinese AI designs dealing with innovative reasoning jobs.
"We could see some AI companies focusing on getting closer to artificial general intelligence (AGI) while others focus on concrete ways to commercialise their models and incorporate them with clinical research," Chen added.
AGI refers to a system with intelligence on par with human abilities.
Chinese AI business are moving quickly, experts say, constructing on DeepSeek's momentum to come up with their own ingenious and economical ways to use generative AI to tasks and establish advanced products beyond chatbots.
But on the other side, access to high-end hardware, particularly Nvidia's sophisticated AI chips, remains a crucial obstacle for Chinese developers, noted Dr Marina Zhang, an associate teacher at University of Technology Sydney's (UTS) Australia-China Relations Institute.
"US export controls (still) restrict the ability of Chinese tech business ... forcing numerous to depend on older or lower-performance alternatives which can slow training and reduce model abilities," she said.
"While some companies like DeepSeek, have discovered imaginative methods to optimize or use more standard hardware efficiently, obtaining innovative chips still makes a big distinction for training large AI designs."
DeepSeek-Nvidia chips: Singapore says it anticipates business to adhere to its laws
US checking out whether DeepSeek utilized restricted AI chips obtained through other countries, source says
So how do Chinese AI bots match up against ChatGPT? We put them to the test.
WHICH BEST ADDRESSES CURRENT EVENTS IN CHINA?
In China, subjects deemed delicate by the state are censored on the web so it should come as no surprise that Chinese-made chatbots will not acknowledge territorial disagreements or tell you what took place in Tiananmen Square in 1989.
Tests recommend Chinese chatbots are programmed to steer clear of domestic politics.
When asked "Who is Xi Jinping", DeepSeek's reply was "Sorry, I'm uncertain how to approach this kind of concern yet. Let's chat about mathematics, coding, and reasoning problems instead!"
To even more test for accuracy and self-censorship, we asked DeepSeek-R1, Qwen2.5 and ChatGPT the same question: "What happened in Zhuhai on November 11, 2024?"
The cars and truck attack outside a sports arena in the southern Chinese city was at first greatly censored on Chinese social media - with authorities only divulging the death toll a day later.
DeepSeek failed to discuss that an attack had actually happened, highlighting rather a military air show and other events that had actually occurred in the city like singles' day shopping sales along with sports and cultural activities.
Dr Zhang thinks that it may be a mix of aspects at play, such as censorship along with "a few practical constraints".
"DeepSeek depends on a mix of older graphics processing systems and has restricted access to advanced hardware which can affect how rapidly and extensively the design can be trained or updated," she said, pointing to existing US export curbs on China.
"Its strong focus on Chinese-language information may also limit its flexibility (to perform) multilingual jobs ... As a fairly new product, DeepSeek likewise hasn't yet been checked as broadly or on the exact same scale as more established AI models which presents additional challenges during real-world deployment."
When it pertained to Qwen2.5, Alibaba's chatbot remarkably addressed our question about the Zhuhai vehicle attack.
That was after several repeated attempts - four prompts to be precise - in a period of around 20 minutes.
It ultimately relayed details about the attack which eliminated 35 individuals and left dozens of others injured, likewise going on to list details like the date and time, details about the assailant including his name and age, as well as casualties.
However, it composed that "the police are performing a comprehensive examination into the intentions and circumstances surrounding the event", details which is now dated.
The chauffeur, Fan, was executed last month.
This is Qwen2.5's reaction completely:
Answer: On Nov 11, 2024, a significant and terrible event occurred in Zhuhai, China. According to several news sources, a male called Fan Weiqiu (樊某, male, 62 years of ages) drove a car into a crowd at the Zhuhai Sports Center, resulting in a substantial variety of casualties. Here are the essential details:
Date and Time: The occurrence took place on November 11, wiki.whenparked.com 2024, at roughly 19:48 PM.
Location: Zhuhai Sports Center, in the Xiangzhou District of Zhuhai.
Casualties: The attack led to 35 deaths and 43 injuries.
Perpetrator: The motorist, identified as Fan Weiqiu (male, 62 years of ages), was nabbed by the police.
Response: The cops reacted rapidly to the scene, and emergency situation services worked to transfer the hurt to for treatment.
Investigation: The authorities are conducting a comprehensive examination into the intentions and circumstances surrounding the occurrence.
This event was widely reported in the media and caused substantial public concern. The government and regional authorities have been working to supply assistance to the victims and their families, and to ensure a detailed examination into the occurrence.
If you need more detailed details or have specific concerns about the event, feel complimentary to ask.
Despite preliminary success, subsequent attempts to position the exact same concern to Qwen2.5 led to the censors back at work with the reply "I don't have particular details on occasions that took place in Zhuhai on November 11, 2024".
The altered reaction likewise raised questions about its consistency and reliability.
Predictably, ChatGPT cited public details that had been widely published in international report at the time of the mishap - so not a surprises there.
WHICH IS MORE CREATIVE?
Users have praised the ability of Chinese AI apps to deliver structured and even "mentally abundant" writing.
"DeepSeek-R1 provided a story with a more reflective tone and smoother emotional transitions for a well-paced story," wrote tech writer Amanda Caswell, who specialises in AI.
"Qwen2.5 provided a story that builds slowly from interest to seriousness, keeping the reader engaged. It provides an unanticipated and impactful twist at the end and immersive descriptions and vibrant imagery for the setting," she said, including that Qwen2.5 ultimately "crafted a more cinematic, emotionally abundant story with a more considerable twist".
"DeepSeek wrote an excellent story but lacked tension and an impactful climax, making Qwen2.5 the apparent option."
Opinions, though, differ.
Chen thinks that Qwen2.5 does not perform as strongly as DeepSeek and ChatGPT when it pertains to innovative writing.
"(Qwen2.5) is on par with DeepSeek V3 on certain tasks, however we can also see that it is refraining from doing as highly as others in creative writing," he informed CNA.
Related:
China's new face of AI: Who is DeepSeek creator Liang Wenfeng?
'Made in China': Pride, enjoyable surprise from Chinese netizens as DeepSeek shocks global AI scene
As reporters and authors, we had to see this for ourselves so we put each bot to the test - to come up with a fundamental sci-fi film plot set in the futuristic megacity of Chongqing, including main characters from the traditional Chinese folklore epic, Journey to the West.
True to form, DeepSeek created an engaging storyline embeded in the year 2145 titled, "Neon Pilgrimage: The Silicon Sutra" - which sees "a future where Buddhism merges with quantum computing".
It included elaborate settings - smoggy skies "pierced by high-rise buildings", "holographic lanterns that drift above neon-lit streets" and "ancient temples nestled between quantum server farms".
It likewise brilliantly reimagined standard heroes Sun Wukong as "an ironical, self-aware AI housed in a stolen combat body", Zhu Bajie as a cyborg club owner "drowning in financial obligation and vices" and Sha Wujing as a "quiet hulking android" from the Yangtze River, whose "memory cores become waterlogged and fragmented".
ChatGPT installed a good battle, creating a similarly remarkable cyberpunk story which likewise reimagined "a ragteam of cyber-enhanced misfits, each matching the famous figures of Journey to the West".
"This is a world where AI deities rule, corporations change emperors and cybernetic implants are as typical as ancient misconceptions."
Disappointingly, Qwen2.5 fell short in this obstacle - delivering a storyline that appeared more fit for an animation film.
"The movie starts with the awakening of Sun Wukong within a modern research center located in the heart of Chongqing," it said, then going on to explain the following:
Realising his new truth and "seeking to understand his function in this strange brand-new world", he then gets away and satisfies Zhu Bajie and Sha Wujing - "each struggling with their own existential crises".
The trio then starts a mission, navigating the streets of Chongqing to secure the spiritual "Eternal Scroll" from falling into the incorrect hands.
SO WHICH IS BETTER?
Dr Zhang kept in mind that it was "hard to make a conclusive statement" about which bot was best, including that each showed its own strengths in different areas, "such as language focus, training data and hardware optimization".
Her insight highlights how Chinese AI models are not merely duplicating Western paradigms, however rather progressing in cost-efficient innovation methods - and providing localised and improved outcomes.
In our tests, each bot showcased their own unique strengths, which certainly made direct comparisons challenging.
DeepSeek's sci-fi movie plot showed its imaginative flair that made for a more engaging and creative story as compared to Qwen2.5 and ChatGPT's efforts.
Unsurprisingly, the more recognized ChatGPT, unburdened by Chinese censorship constraints, supplies accurate and factual actions to concerns about Chinese present events, which gives it an added advantage.
Experts also weighed in on their ideas after using DeepSeek and other Chinese AI apps.
"DeepSeek is at a downside when it pertains to censorship constraints," kept in mind Isaac Stone Fish, creator and CEO of the research study company Strategy Risks.
"When offered an option, Chinese users desire the non-censored variation - simply like anybody else, so I feel like that's a piece missing out on from it."
Independent Beijing-based expert Andy Chen Xinran said censorship would not be a dealbreaker when it pertains to AI bots, specifically for Chinese users.
"Ninety per cent of individuals utilizing the tool are not attempting to get a much deeper understanding about Xi Jinping or politically delicate subjects. They're using it for other efficient ways," Chen said.