Using Machine Learning to Translate Brain Signals to Text
Developing brain-computer interfaces (BCIs) that can decode and translate brain signals depends on our ability to decode those signals in ways that are useful to people, such as those living with paralysis. <!--more-->
Can we use the neural activity recorded while people imagine handwriting to decode letters, words, and even whole sentences?
In a paper published in Nature, researchers from Stanford investigated whether complex gestures and motor functions such as handwriting can be decoded from neural activity.
Complex human movements such as handwriting have long been hard to decode, because pulling useful information out of such high-dimensional neural data isn't easy. The researchers tackled this problem using machine learning, decoding these complex neural signals and converting them into letters and words.
Table of contents
- Overview
- Neural representation of handwriting
- Decoding handwritten sentences
- Results
- Wrapping Up
- References
Before we dive in, it is crucial to understand these three terms:
1. Principal Component Analysis (PCA)
It is a dimensionality reduction technique that projects a large, high-dimensional dataset onto a small number of new axes called principal components.
Though reduced to a smaller set of dimensions, the principal components should still retain most of the important information in the original dataset. This step is performed because datasets with many dimensions are hard to compute with. Read more on it here.
2. t-distributed Stochastic Neighbor Embedding (t-SNE)
It is a statistical technique suited to visualizing high-dimensional data. It works by embedding high-dimensional data into a low-dimensional space, typically a two-dimensional (2D) or three-dimensional (3D) map. Read more on it here.
3. Brain-computer interface (BCI)
It is a computer-based system that recognizes the intent of a user through the use of brain signals. It decodes the brain's neural activities and translates them into outputs that replace, restore, enhance, or improve human functions. Read more on it here.
Now that we understand these terminologies, let's get started!
Overview
Recently, there has been a lot of interest in researching how our brains represent the actions we perform in our lives, e.g., walking, speaking, and writing.
As noted earlier, progress here depends on decoding these signals in ways that are useful to people such as those with paralysis. If you'd like to see more of this research, check out Braingate, a research collaboration that focuses on developing BCI technologies to restore mobility and communication.
In this article, we will explore how machine learning was used to decode and translate the brain's neural signals into handwritten text. It is the first study of its kind.
Previous BCI research has focused on restoring gross motor skills such as reaching and grasping. Far less attention has been paid to using BCIs for complex behavioral tasks such as handwriting. This study aims to change that.
The researchers in this study worked with a participant with a spinal-cord injury. He was paralyzed from the neck down, unable to move his hands. The goal was to have him imagine writing with a pen on a piece of paper, as if his hand weren't paralyzed.
The figure shows two tiny arrays of electrodes implanted in the part of the patient's brain that controls the hands and arms, relaying information to an algorithm. The algorithm then translates these signals into letters that appear on a screen.
Let's try to understand how they did this.
Neural representation of handwriting
To visualize the neural activity recorded from the patient's brain during attempted handwriting, two machine learning techniques were used.
These techniques are essential because neural signals are complex and high-dimensional, and they contain background noise that is irrelevant to the task. The two techniques reduce the signals to the dimensions that matter and make them possible to visualize.
- Principal Component Analysis (PCA) was the first dimensionality reduction method. It takes the data and extracts the components with the highest variance. The goal of PCA in this experiment was to extract the top three neural components associated with handwriting, discarding background noise and anything else irrelevant to the task (a minimal code sketch follows the figure below).
The above example shows the neural activity in the top 3 principal components for three example letters (d, e, and m).
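To make this concrete, below is a minimal sketch of that kind of reduction using scikit-learn. The array shapes, channel count, and random data are illustrative assumptions, not the paper's actual recordings or pipeline.

```python
import numpy as np
from sklearn.decomposition import PCA

# Stand-in for binned neural data: 200 time bins of firing rates
# across 192 electrode channels (shapes assumed for illustration).
rng = np.random.default_rng(0)
neural_data = rng.standard_normal((200, 192))

# Keep only the top 3 principal components, as in the figure above.
pca = PCA(n_components=3)
top_components = pca.fit_transform(neural_data)  # shape: (200, 3)

# How much variance each retained component captures.
print(pca.explained_variance_ratio_)
```

On real recordings, the first few components would capture far more of the variance than they do on this random stand-in data.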
- The second method was t-distributed stochastic neighbor embedding (t-SNE), a non-linear dimensionality reduction technique. It showed that the data clusters into distinct groups representing different letters of the alphabet (a-z), meaning the data is indeed separable. This helped distinguish the neural activity associated with different letters, e.g., the letter 'a' versus the letter 'l' (see the sketch after the figure below).
The figure represents the 2D visualization of the neural activity made using t-SNE.
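As a rough sketch of the same idea, the snippet below embeds per-trial neural feature vectors into 2D with scikit-learn's t-SNE. The trial count, feature dimension, and random data are assumptions for illustration only.

```python
import numpy as np
from sklearn.manifold import TSNE

# Stand-in for per-trial neural features: 100 attempted-letter
# trials, each summarized as a 192-dimensional vector (assumed).
rng = np.random.default_rng(0)
trial_features = rng.standard_normal((100, 192))

# Embed into 2D; with real data, trials of the same letter would
# form clusters like those in the figure above.
embedding = TSNE(n_components=2, perplexity=30).fit_transform(trial_features)
print(embedding.shape)  # (100, 2)
```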
Decoding handwritten sentences
Now that they had successfully mapped neural activity to individual letters, the researchers faced a new challenge: could they replicate that success by decoding entire handwritten sentences in real time, rather than single letters?
Let's see how they went about this.
They achieved this by using a Recurrent Neural Network (RNN). You can read more about RNNs here.
The RNN is trained to convert the brain's neural activity into probabilities. These probabilities describe the likelihood of each character being written within a given window of time. In other words, for each slice of neural data, it assigns a probability to every character in the set.
They used a set of 31 characters: the 26 lower-case letters of the alphabet, the question mark (?), comma (,), apostrophe ('), full stop (~), and space (>).
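To illustrate the idea, here is a minimal PyTorch sketch of an RNN that maps binned neural activity to per-time-step probabilities over the 31 characters. The architecture, layer sizes, and channel count are assumptions for illustration; the paper's actual network is different and more sophisticated.

```python
import torch
import torch.nn as nn

N_CHANNELS = 192  # electrode channels (assumed for illustration)
N_CHARS = 31      # 26 letters plus ? , ' ~ (full stop) and > (space)

class HandwritingDecoder(nn.Module):
    """Toy decoder: a GRU reads binned firing rates and emits a
    score for every character at every time bin."""
    def __init__(self, hidden_size=256):
        super().__init__()
        self.rnn = nn.GRU(N_CHANNELS, hidden_size, num_layers=2,
                          batch_first=True)
        self.head = nn.Linear(hidden_size, N_CHARS)

    def forward(self, x):
        # x: (batch, time_bins, channels) of binned neural activity
        hidden, _ = self.rnn(x)
        return self.head(hidden)  # (batch, time_bins, N_CHARS) logits

model = HandwritingDecoder()
fake_activity = torch.randn(1, 100, N_CHANNELS)  # 100 fake time bins
char_probs = model(fake_activity).softmax(dim=-1)
print(char_probs.shape)  # torch.Size([1, 100, 31])
```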
They performed the decoding in two different ways: with an online system and an offline system.
In the online system, they used a threshold: if the probability of a character was high enough at a given moment, that character was emitted and added to the sentence on the screen.
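A simplified sketch of that thresholding step might look like the following. The threshold value and the rule for skipping immediate repeats are assumptions; the paper's real-time decoder is considerably more careful.

```python
import numpy as np

CHARS = list("abcdefghijklmnopqrstuvwxyz") + ["?", ",", "'", "~", ">"]
THRESHOLD = 0.9  # assumed confidence cutoff

def online_decode(prob_sequence):
    """Emit a character whenever its probability crosses the
    threshold, skipping immediate repeats so one long stroke
    doesn't emit the same letter over and over (a simplification
    that would also collapse genuine double letters)."""
    decoded = []
    for probs in prob_sequence:      # one probability vector per time bin
        best = int(np.argmax(probs))
        if probs[best] >= THRESHOLD:
            if not decoded or decoded[-1] != CHARS[best]:
                decoded.append(CHARS[best])
    return "".join(decoded)

# Example: three time bins where 'h', 'h', and 'i' cross the threshold.
fake_probs = np.full((3, 31), 0.001)
fake_probs[0, 7] = fake_probs[1, 7] = 0.95  # 'h'
fake_probs[2, 8] = 0.95                     # 'i'
print(online_decode(fake_probs))            # -> "hi"
```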
In the offline system, they used a large-vocabulary language model to simulate an auto-correct feature. It fixed errors by predicting how likely each letter was to come next in a sequence of letters.
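To give a flavor of that correction step, the toy example below rescues a decoding error with a character-bigram model. This is a deliberately tiny stand-in: the paper used a large-vocabulary language model, not a bigram model trained on one sentence.

```python
from collections import Counter
import math

# Toy training text standing in for a large corpus (assumption).
CORPUS = "hello world how are you today hello there"

bigrams = Counter(zip(CORPUS, CORPUS[1:]))
unigrams = Counter(CORPUS)

def log_prob(sentence, vocab_size=27):
    """Log-probability of a sentence under an add-one-smoothed
    character-bigram model."""
    total = 0.0
    for a, b in zip(sentence, sentence[1:]):
        total += math.log((bigrams[(a, b)] + 1) /
                          (unigrams[a] + vocab_size))
    return total

# Rescore decoder candidates: the model prefers the string that
# looks like real text, auto-correcting the decoder's mistake.
candidates = ["hello world", "hellp world"]
print(max(candidates, key=log_prob))  # -> "hello world"
```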
Results
This research reports results from 5 trial days with one study participant, who already had the system implanted in his brain.
- As the patient repeatedly imagined writing letters while the computer processed the signals, he was able to type about 90 characters per minute, roughly double the previous BCI typing record.
- Since the brain's baseline neural signals drift from day to day, the handwriting decoder had to be retrained with new data every single day. This drift has to be accounted for when working with models like this: if the training signals look different each day, the model's accuracy degrades.
- With the offline language model, the character and word error rates dropped to 0.89% and 3.4% respectively. The word error rate is comparable to state-of-the-art speech recognition systems, which range between 4-5%, putting it well within the usable range.
- As the participant imagined writing the letters and symbols, the sensors implanted in his brain picked up patterns of electrical activity, which the algorithm interpreted to trace the paths of the imagined letters and symbols.
Impressive, right?
- Finally, this research showed that even when someone is paralyzed, the brain's signals for movement remain intact. Tapping into them can help build BCI systems that solve a wide range of tasks.
You can find the full code for the paper on GitHub.
Wrapping Up
I'd like to point out that this system won't be available to the general public any time soon, because the procedure requires implanting electrodes in the brain.
It is also still in the experimental phase: these results come from a single participant, and more tests on more patients are needed to see how well this novel system generalizes.
Still, the findings from this experiment show huge promise for impactful real-world uses. The researchers' biggest hope is that their work will one day help people with conditions such as paralysis to communicate faster and more easily, giving them fuller access to technology meant to make lives better.
If you're interested in learning more about similar projects, Braingate is a good place to start.
References
- High-performance brain-to-text communication via handwriting
- Brain-Computer Interface Turns Mental Handwriting into Text on Screen
Peer Review Contributions by: Collins Ayuya