How machine learning can bridge the communication gap

Amazon Web Services has developed a machine learning model to translate sign languages into text in a showcase of assistive technology

Aaron Tan, Informa TechTarget

Published: 21 May 2020 7:23

In October 2019, an Amazon employee in Melbourne, Australia, bumped into another person while cycling on the road. As she was assuring that person that she would help, she realised he was deaf and mute and had no idea what she was saying.

That awkward situation could have been avoided if assistive technology was on hand to facilitate communication between the two parties. Following the incident, a team led by Santanu Dutt, head of technology for Southeast Asia at Amazon Web Services, got down to work.

Within 10 days or so, Dutt’s team had built a machine learning model that was trained on sign languages. Using images of a person gesturing in sign language that were captured from a camera, the model could recognise and translate gestures into text. The model also could convert spoken words into text for a deaf-mute person to see.

Dutt said the model can also be customised to translate speech into sign languages as the machine learning services and application programming interfaces (APIs) are available and open – although he has not seen that demand yet. “But once you write a small bit of code, training the machine learning model is easy,” he said.

But there is still work to be done. As the training was performed with signs gestured against a white background, the efficacy of the model in its current form would be limited in actual use.

“Our team had limited time to showcase this and we wanted to bump up something to showcase for experimental purposes,” said Dutt, adding that organisations can use tools such as Amazon SageMaker to edit and train the model with more images and videos to recognise a larger variety of environments.

As the training process is intensive, Dutt said organisations with limited resources can use Amazon SageMaker Ground Truth to build training datasets for such machine learning models quickly. Besides automatic labelling, Ground Truth also provides access to human labellers through the Amazon Mechanical Turk crowdsourcing service.

This will also help to improve the model’s accuracy rate. “The more data you have, the more accurate the model gets,” said Dutt, adding that developers can set confidence levels and reject results that fall below a certain level of accuracy.

Dutt said AWS’s public sector team has engaged non-profit organisations in Australia to conduct a proof of concept that makes use of the machine learning model, as well as those in other countries through credits that offset the cost of using AWS services to train and deploy the model.

How machine learning can bridge the communication gap

Amazon Web Services has developed a machine learning model to translate sign languages into text in a showcase of assistive technology

Read more about AI and machine learning in APAC

Read more on Artificial intelligence, automation and robotics

PC market struggles in 2022 but future positivity remains

Loyola Medicine Adds American Sign Language Feature for Telehealth

AWS re:Invent 2021: Four key takeaways for ASEAN firms

4 trends spurring the evolution of network hardware