The new software development toolkit for quantum natural language processing tested and benchmarked on System Model H1 technology
Telling Alexa to play “Schrodinger’s Cat” by Tears for Fears. Asking Siri for directions to a quantum-themed bar or restaurant. A smart phone autocorrecting a word in a text message.
These are everyday applications of natural language processing – NLP for short – a field of artificial intelligence that focuses on training computers to understand words and conversations with the same reasoning as humans.
NLP technologies have advanced rapidly in recent years with the help of increasingly powerful computing clusters that can run language models that examine reams of text and count how often certain words appear. These models train devices to retrieve information, annotate text, translate words from one language to another, answer questions, and perform other tasks.
The next step is to “teach” computers to infer meaning, understand nuance, and grasp the context of conversations. To do that, however, requires massive computational resources and multiple algorithms or data structures.
A United Kingdom-based quantum computing company believes the answer lies with qubits, superposition, and entanglement.
Cambridge Quantum recently released lambeq, a new open-source software development toolkit, that enables researchers to convert sentences into quantum circuits that can be run on quantum computers. It is the first toolkit developed specifically for quantum natural language processing – or QNLP - and was tested on System Model H1 technology before it was released.
The software takes the text, parses it, and then uses linguistics and mathematics to differentiate between a verb, noun, preposition, adjectives, etc., and label them to understand the relationships between words.
Cambridge Quantum researchers tested 30 sentences on the System Model H1, which was able to classify words correctly 87 percent of the time.
“We deem that a success,” said Konstantinos Meichannetzidis, a member of the CQ team. “We found that our software works well with the Honeywell technology and were able to benchmark the performance of this quantum device.”
The lambeq project also represented a first for Honeywell Quantum Solutions. It was the first QNLP problem run on the System Model H1 hardware.
“We are really excited to be a part of this work and contribute to the development of this important toolkit,” said Tony Uttley, president of Honeywell Quantum Solutions. “Applications like this help us test our system and understand how well it performs solving different problems.”
(Honeywell Quantum Solutions and Cambridge Quantum have a long-standing history of partnering together on research and other projects that benefit end-customers. The two entities announced in June they are seeking regulatory approval to combine to form a new company.)
Why QNLP?
For humans, decoding conversations to understand meaning is a complex process. We infer meaning through tone of voice, body language, context, location, and other factors. For computers, which do not rely on heuristics, decoding language is even more complex.
The only way to create some sort of “meaning-aware” NLP is to explicitly encode compositional, semantic sentence structure into language models. To do this on a classical computer, however, requires massive computational resources, which are costly, and would likely still take months to process.
Quantum computers, on the other hand, run calculations and crunch data very differently.
They harness unique properties of quantum physics, specifically superposition and entanglement, to store and process information. Because of that, these systems can examine problems with multiple states and evaluate a large space of possible answers simultaneously.
What this means in terms of natural language processing is that quantum computers are likely to go beyond counting how often certain words appear or are used together. As noted above, quantum computers can identify words, label them as a noun, verb, preposition, etc., and understand the relationship between words. (lambeq uses the Distributional Compositional Categorical – or DisCoCat – model to do this.)
This enables the computer to infer meaning, and also provides insight into how and why the computer made connections between words. The latter is important for validating data and also expanding the use of QNLP in regulated sectors such as finance, legal, and medicine where transparency is critical.
Built upon previous work
The Cambridge Quantum team has long explored how quantum computing can advance natural language processing, and has published extensively on the topic.
In December 2020, researchers released two foundational papers that demonstrated that QNLP is inherently meaning-aware and can successfully interpret questions and respond.
Earlier this year, the team performed the first NLP experiment conducted on a quantum computer by converting more than 100 sentences into quantum circuits using an IBM technology. Researchers successfully trained two NLP models to classify words in sentences.
The release of lambeq and the testing of the open-source toolkit on the Honeywell System Model H1 represents the next steps in their QNLP efforts.
“Our team has been involved in foundational work that explores how quantum computers can be used to solve some of the most intractable problems in artificial intelligence,” said Bob Coecke, Cambridge Quantum’s chief scientist.
“In various papers published over the course of the past year,” Coecke added, “We have not only provided details on how quantum computers can enhance NLP but also demonstrated that QNLP is ‘quantum native,’ meaning the compositional structure governing language is mathematically the same as that governing quantum systems. This will ultimately move the world away from the current paradigm of AI that relies on brute force techniques that are opaque and approximate.”