IT Brief New Zealand logo
Technology news for New Zealand's largest enterprises
Story image

Microsoft reaches historic AI milestone in Chinese-English language translation

By Julia Gabel
Tue 20 Mar 2018
FYI, this story is more than a year old

Microsoft has reached a historic milestone in its development in the Artificial Intelligence (AI) space. 

A team of Microsoft researchers believe they have created the first machine translation system that can translate sentences of news articles from Chinese to English with the same quality and accuracy as a person.

The researchers claim to have achieved a “human parity” on a commonly used test set of news stories that were moderated by external bilingual human evaluators who compared Microsoft’s results to two independently produced human reference translations.

The technical fellow in charge of Microsoft’s speech, natural language, and machine translation efforts, Xuedong Huang calls it a major milestone in one of the most challenging natural language processing tasks.

“Hitting human parity in a machine translation task is a dream that all of us have had,” Huang says. “We just didn’t realise we’d be able to hit it so soon.”

“The pursuit of removing language barriers to help people communicate better is fantastic,” he adds. “It’s very, very rewarding.”

Xuedong Huang, technical fellow in charge of Microsoft’s speech, natural language and machine translation efforts

Despite the excitement surrounding this breakthrough in AI in language translation, Microsoft researchers warn that this it doesn't mean all problems in machine learning translation are solved.  

The assistant managing director of Microsoft Research Asia and head of a natural language processing group that worked on the project, Ming Zhou, says that although the team is thrilled to achieve the human parity milestone, there are still many challenges ahead, like test the system on real-time news stories.

Partner research manager of Microsoft’s machine translation team Arul Menezes says the team set out to prove that its systems could perform as well as a person when it used a language pair on a test set that includes the more commonplace vocabulary of general interest news stories.

Arul Menezes, partner research manager of Microsoft’s machine translation team

“Given the best-case situation as far as data and availability of resources goes, we wanted to find out if we could actually match the performance of a professional human translator,” says Menezes.

Menezes says the research team can apply the technical breakthroughs they made here to Microsoft’s commercial translation products.  

Menezes says that this will pave the way for more accurate and natural-sounding translations across other languages and for texts with more complex or niche vocabulary.

Three research teams were behind the human parity milestone. The teams were based in Microsoft’s Beijing, Redmond and Washington labs, and worked together to add a number of other training methods that would make the system more fluent and accurate.

Principal research manager with Microsoft Research Asia in Beijing Tie-Yan Liu led a machine learning team that worked on the project. Liu says, “Much of our research is really inspired by how we humans do things.”

Techniques and methods

Dual Learning

One method the researchers used is dual learning, a method Microsoft describes “fact-checking the system’s work”: Every time the researchers sent a sentence through the system to be translated from Chinese to English, the research team also translated it back from English to Chinese. 

Microsoft says this method is similar to what people might do to make sure that their automated translations were accurate, and it allowed the system to refine and learn from its own mistakes. 

Developed by the Microsoft research team, dual learning can be used to improve results in other AI tasks, the company claims. 

Deliberation Networks 

Another method used in the project is Deliberation Networks, which Microsoft says is similar to how people revise their own writing by going through it over and over again. 

The researchers taught the system to repeat the process of translating the same sentence over and over, gradually refining and improving the response.

Joint Training

Joint Training is a technique used in the research to iteratively boost the English-to-Chinese and Chinese-to-English translation systems. 

With Joint Training, the English-to-Chinese translation system translates new English sentences into Chinese in order to obtain new sentence pairs. 

These pairs are then used to augment the training dataset that is going in the opposite direction, from Chinese to English. The same procedure is then applied in the other direction. As they converge, the performance of both systems improves.

Zhou says he expects these methods and techniques to be useful for improving machine translation in other languages and situations as well. He said they also could be used to make other AI breakthroughs beyond translation.

“This is an area where machine translation research can apply to the whole field of AI research,” he adds.

No right answer

The test set the researchers used contains around 2000 sentences from a sample of online newspapers that had already been professionally translated. 

Microsoft ran multiple evaluation rounds on the test set, randomly selecting hundreds of translations for evaluation each time. 

To verify that Microsoft’s machine translation was as good as a person’s translation, the company hired a group of outside bilingual language consultants to compare Microsoft’s results against manually produced human translations.

With other tasks, such as speech recognition, Microsoft says it’s reasonably straightforward to tell if a system is working as well as a person because the ideal result will be the exact same for a person and a machine. 

Researchers call that a pattern recognition task.

However, with translation, there’s more nuance.

Even two fluent human translators might translate the exact same sentence slightly differently, and neither would be wrong. That’s because there’s more than one way to saying the same thing correctly. 

“Machine translation is much more complex than a pure pattern recognition task,” Zhou adds.  

“People can use different words to express the exact same thing, but you cannot necessarily say which one is better.”

The researchers say that complexity is what makes machine translation such a challenging problem, but also such a rewarding one.

Liu says no one knows whether machine translation systems will ever get good enough to translate any text in any language pair with the accuracy and lyricism of a human translator. 

But, he says these recent breakthroughs allow the teams to move on to the next big steps toward that goal and other big AI achievements, such as reaching human parity in speech-to-speech translation.

“What we can predict is that definitely, we will do better and better,” Liu adds.

These recent breakthroughs build on Microsoft’s previous work in language translation, including in New Zealand. 

Dr. Te Taka Keegan, a senior lecturer in the Computer Science Department at The University of Waikato, is known for weaving together his love for te reo Māori and his love for computers. 

In 2005, Keegan worked with Microsoft to translate the company’s Office 2003 and Windows XP in Māori. 

Today, Keegan continues to consult for Microsoft around ambitious AI, The Translation Hub, which Microsoft hopes will one day offer real-time translations of te reo to English, and vice versa. 

Keegan’s work of interweaving te reo Māori into technology was recognised by the government last year. Presented to Keegan by Prime Minister at the time Bill English, the Prime Minister’s Supreme Award is New Zealand’s top teaching excellence honour. 

Dr Te Taka Keegan, senior lecturer in the Computer Science Department at The University of Waikato, and former Prime Minister Bill English. 

Keegan commented on the award, “Microsoft should share in this celebration and in my award, they really should.”

“They were very open to the idea to not only adapt the keyboard but also to translate Office and Windows into Māori. Because of the work we did together, all schools in New Zealand can now offer computing facilities in te reo Māori to children.”

“That’s an awesome thing.’

Related stories
Top stories
Story image
Cybersecurity
Could New Zealanders initiate a cyber attack from within?
The threat landscape is significantly increasing worldwide, and the opportunities it presents are a growing concern in Aotearoa.
Story image
PIJF
The path to bolstering supply chain security in New Zealand
A significant amount of today's business and leisure activity relies on IT supply chains. From complex international freight trades to local small business distribution channels, any supply chain that involves IT infrastructure serves as a crucial tool in our daily lives. 
Story image
Ingram Micro
Ingram Micro NZ bolsters MSI product range with new offerings
The inclusion of MSI Mobile Workstation and Business & Productivity laptops rounds out the MSI product portfolio.
Story image
Digital Transformation
SAP partners with New Zealand Rugby for digital transformation
The multi-year partnership will see SAP advance NZR with its organisational operations, team performance, fan experience and sustainability goals.
Story image
Red Sift
Entrust expands strategic partnership with Red Sift
Entrust has expanded its strategic partnership with Red Sift to make it easier for businesses to adopt Brand Indicators for Message Identification (BIMI) standards for email identification and security.
Story image
DevOps
Deloitte expands cloud observability practice with Dynatrace
Deloitte is expanding its cloud observability practice, including DevOps principles, AI/ML, cloud complexity management and software engineering.
Story image
Data Protection
Information management capabilities to meet privacy requirements
Organisations with customers or operations across more than one country face a spate of new and proposed privacy and data protection laws.
Story image
Sustainability
Aligned Data Centers increases sustainability-linked loan
Aligned Data Centers has increased its sustainability-linked loan from $375 million to $1.75 billion to speed up the next phase of its strategic growth.
Story image
Ransomware
CERT NZ releases first Cyber Security Insights for 2022
CERT NZ has released Quarter One: Cyber Security Insights 2022, which offers an overview of reports about cybersecurity incidents affecting New Zealanders.
Darktrace
Threat actors are exploiting weaknesses in interconnected IT/OT ecosystems. Darktrace illuminates your entire business and takes targeted action to stop emerging attacks.
Link image
Story image
Digital Transformation
Harnessing digital innovations to maximise loyalty programmes and improve CX
When it comes to the retail sector, merchants have never had access to so many different client touchpoints and data to understand their customers better.
Story image
Microsoft
Microsoft, Cloudian partnership offers data center flexibility
Cloudian’s HyperStore object storage platform is now integrated and validated to work with Microsoft SQ Server 2022, offering more flexible and scalable data centers.
Story image
Cybercrime
The ups and downs and runarounds of catching cybercriminals in NZ
We're becoming more and more aware of cybercrimes but how many criminals actually get caught? The New Zealand police explain why the answer is complicated.
Story image
Digital Signage
MAXHUB's Digital Signage range to bolster boardroom productivity
The new MAXHUB Digital Signage technology is purpose-built to make every kind of team meeting more effective.
Story image
TUANZ
TUANZ to address rural connectivity at 2022 symposium
TUANZ is hosting the Rural Connectivity Symposium for the first time in person since 2019, providing a forum to discuss the state of rural connectivity.
Story image
Rackspace
Skills shortages hold orgs back from capitalising on cloud 2.0
Organisations are becoming more comfortable with sophisticated 'cloud 2.0' technologies, even as they confront difficulties in hiring and retaining IT talent.
Story image
Training
Infosec unveils role-guided cybersecurity training roadmaps
Infosec Skills Roles maps hands-on training and certifications to the 12 most in-demand cybersecurity roles to maximise training efficiency.
Story image
Digital Transformation
Digital transformation increasing business complexities
A new survey suggests businesses must re-examine their digital transformation approach to better help employees adapt to change.
Story image
Cybersecurity
What every CISO must answer to enable a best-in-class security operations program
It has been widely reported recently that South Australian government employees have been the victims of a cyberattack.
Story image
Informatica
Informatica, Oracle enter strategic global cloud partnership
Oracle named Informatica as a preferred partner for enterprise cloud data integration and data governance for data warehouse and lakehouse solutions on OCI. 
Story image
Silver Peak
The path to an adaptive, modern network
Managing and securing the network looks different than it did just two years ago—especially given that most of these networks are made up of multi-generations of infrastructure stitched together over time.
Story image
DaaS
NetApp launches Spot PC, a new Desktop-as-a-Service solution
This is a new managed cloud DaaS solution with security, automation, observability and optimisation capabilities, designed for the needs of today.
Story image
Check Point
Check Point and CCTV expert join forces to boost protection
The partnership will involve Check Point Quantum IoT Protect Nano Agent being embedded in Provision-ISR’s CCTV cameras for on-device runtime protection.
Story image
Symbio
Symbio consolidates TNZI business to support APAC expansion
Symbio has recently announced the consolidation of its international business (TNZI) under the Symbio brand to support its Asia Pacific expansion strategy.
Story image
Cybersecurity
Accenture - a collective security approach a driving factor for cyber resilience
With the approaching Davos World Economic Forum upon us, it is even more imperative to discuss the impact of cybersecurity on business operations leading into the future.
Story image
Managed service provider
Barracuda MSP Day 2022 highlights MSP opportunities
Barracuda Networks has released a report showing global services-related MSP revenue is set to increase by more than a third in 2022 compared to 2021.
Exabeam
Find out how a behavioural analytics-driven approach can transform security operations with the new Exabeam commissioned Forrester study.
Link image
Story image
Identity and Access Management
The post-pandemic workforce requires secure IAM capabilities
HID Global discusses what identity and access management means for organisations in today's convoluted digital world.
Story image
Hybrid Cloud
Barracuda expands cloud-native SASE platform
"The expansion of Barracuda's cloud-native SASE platform for hybrid deployment models and IIoT environments solves a number of challenges."
Story image
Infosys
Consumer relationships with digital services continues to change
Two years of pandemic-induced reliance on technology for work has altered our relationship with digital apps and services, new research has found.
Story image
Microsoft
Microsoft previews Power Platform website design offering
Microsoft has announced the preview of Power Pages, the fifth product in its Power Platform family, designed for low-code makers and professional developers.
Story image
NVIDIA
NVIDIA announces a spate of new innovations at Computex 2022
NVIDIA has announced its latest innovations in data center, robotics, content creation, and gaming in a virtual keynote address on the opening day of Computex 2022 in Taipei.
Story image
Chorus
Chorus and Nokia launches first trial of 25G PON broadband
Chorus and Nokia have announced the successful demonstration of 25 gigabit per second fibre (Gbps) broadband technology at the Chorus Fibre Lab in Auckland. 
Story image
Customer experience
The importance of service level management to customer experience
Staffing shortages have impacted site reliability engineers in particular since they are under extreme pressure to ensure that digital assets perform at optimum levels 24/7.
Story image
Contact Centre
Leveraging technology in contact centres to reduce attrition rates
Many organisations worldwide have accelerated DX to better respond to changing market drivers and business environments after the disruption of the pandemic.
Story image
Sift
Sift shares crucial advice for preventing serious ATO breaches
Are you or your business struggling with Account Takeover Fraud (ATO)? One of the latest ebooks from Sift can provide readers with the tools and expertise to help launch them into the new era of account security.
Story image
Transport
Third-party automotive apps bear significant privacy risks
Mobile applications for connected cars provide various features to make life easier for motorists, but they can also be a source of risk.
Story image
Malware
Fortinet introduces self-learning AI in latest offering
Fortinet is introducing self-learning AI capabilities in its new network detection and response offering, FortiNDR.
Story image
Cyber attacks
Devastating cyber attacks expected to hit energy sector
Energy executives anticipate life, property, and environment-compromising cyber attacks on the sector within the next two years.
Story image
Data Center
Preventing downtime costs and damage with Distributed Infrastructure Management
Distributed Infrastructure Management (DIM) can often be a lifeline for many enterprises that work with highly critical ICT infrastructure and power sources.
Story image
BYOD / Bring Your Own Device
How zero trust can lead the battle against ransomware
SecOps teams champion a zero trust strategy to support the fight against the escalating risk of cybercrime and help monitor threat actors across a network.
Story image
Kubernetes
Sysdig unveils new Kubernetes troubleshooting and cloud innovations
Sysdig has introduced two new innovations that look to help bolster cloud services and simplify Kubernetes troubleshooting.
Story image
Artificial Intelligence
Gartner reveals top three tech trends for banks this year
Gartner says generative artificial intelligence, autonomic systems and privacy-enhancing computation are gaining traction in banking and investment services.
Story image
Microsoft
Elevation of Privilege the top 2021 Microsoft vulnerability
BeyondTrust has released its 2022 Microsoft Vulnerabilities Report, finding that Elevation of Privilege is the top vulnerability category for the second consecutive year.