Seemore
An app to make the lives of visually impaired people a little more ordinary.
- Introduction
- Problem Statement
- Our Solution
- Key Features
- Application Workflow
- Credits
- Future Work
- Contributors
- References
- License
Introduction
Despite rapid advances in tools and technology, few applications have been developed to aid those with visual impairments. Modern data-modelling techniques can give even basic computers a degree of “intelligence,” and the ubiquity of smartphones makes it possible to extend that intelligence to help blind users navigate their surroundings and go about their daily lives. By utilising the power of deep learning, made accessible even on low-end devices through a clear user interface, our application seeks to close the gap between visually impaired users and the visible world.
This app enables the community of blind and visually impaired people to correctly identify objects they come across in everyday life without the need for sighted assistance.
Problem Statement
Vision impairment poses an enormous global financial burden: the annual global cost of productivity losses associated with vision impairment is estimated at US$411 billion. The main challenges faced by blind people include:
- Navigating around places
- Finding reading material
Our Solution
A voice- and gesture-based app to make the lives of visually impaired people a little more ordinary. The app helps users gain independence without relying on external devices that may not be accessible to most people.
All features are accessible via swipe/hold gestures and voice commands: simply say “seemore” followed by the feature you want to activate. The app announces results to the user through speech.
Key Features
⭐️ Voice Commands
Press the mic button and use the command “Seemore” to activate. Then, use one of the following commands:
- “SOS”
- “detect object”
- “currency”
- “read text”
to access the corresponding feature.
The app uses the speech_to_text package in Flutter to recognize the user’s command.
Implemented by @Ajith Manivannan
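For illustration, here is a minimal Python sketch of the wake-word routing described above; the real app implements this in Flutter/Dart with the speech_to_text package, and `route_command` is a hypothetical name, not from the codebase.

```python
# Minimal sketch of the wake-word routing logic (the real app does this in
# Flutter/Dart); the function name is illustrative, not from the codebase.
COMMANDS = {"sos", "detect object", "currency", "read text"}

def route_command(transcript):
    """Return the feature to activate, or None if the phrase is not a command."""
    text = transcript.lower().strip()
    # The wake word "seemore" must come first, e.g. "seemore read text".
    if not text.startswith("seemore"):
        return None
    command = text[len("seemore"):].strip()
    return command if command in COMMANDS else None

print(route_command("Seemore detect object"))  # -> "detect object"
```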
⭐️ SOS - Quickly send alerts to your emergency contacts.
Quickly send alerts to your emergency contacts with a touch-and-hold gesture on the center of the screen or by using the “SOS” command.
The app uses the Twilio API to send an SMS to emergency contacts to indicate that immediate help is required.
Implemented by @N Lirajkhanna
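As a rough sketch, sending such an alert with the Twilio Python helper library might look like the following; the credentials, phone numbers, and message text are placeholders, not values from the app.

```python
# Minimal sketch of the SOS alert using the Twilio Python helper library.
# Credentials, numbers, and the message body below are placeholders.
from twilio.rest import Client

ACCOUNT_SID = "ACXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX"  # from the Twilio console
AUTH_TOKEN = "your_auth_token"
TWILIO_NUMBER = "+15005550006"                       # Twilio-provided number

def send_sos(emergency_contacts):
    """Send an SOS SMS to every registered emergency contact."""
    client = Client(ACCOUNT_SID, AUTH_TOKEN)
    for number in emergency_contacts:
        client.messages.create(
            body="SOS: I need immediate help.",
            from_=TWILIO_NUMBER,
            to=number,
        )

# send_sos(["+911234567890"])
```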
⭐️ Object Detection - Detects the object in front of you and the distance you are from it.
Swipe right or use the command “detect object” to detect the object in front of you and estimate your distance from it.
- v1: The app uses the yolov3-tiny model for object detection, which has a tested mean average precision (mAP) of 33.1 at 220 FPS.
- v2: The app uses the yolov5s model for object detection, which has significantly improved both detection accuracy and speed.
We use simple camera calibration to calculate the distance between the user and the object detected. With the current version of the app, we can detect up to 80 different everyday objects.
Implemented by @Nilavan
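The calibration mentioned above is the standard triangle-similarity method: photograph a reference object of known width at a known distance once to recover the focal length in pixels, then invert the relation for each detected bounding box. A minimal Python sketch, with illustrative measurements:

```python
# Sketch of triangle-similarity distance estimation from detector bounding
# boxes; the reference object and widths below are illustrative.

def calibrate_focal_length(known_distance_cm, known_width_cm, perceived_width_px):
    """One-time calibration: F = (P * D) / W, from a reference object of
    known width photographed at a known distance."""
    return (perceived_width_px * known_distance_cm) / known_width_cm

def distance_to_object(focal_length_px, known_width_cm, perceived_width_px):
    """Estimate distance by similar triangles: D = (W * F) / P."""
    return (known_width_cm * focal_length_px) / perceived_width_px

# Example: calibrate on an A4 sheet (21 cm wide) seen 100 cm away at 420 px,
# then estimate the distance to a detected bottle (~7 cm wide) at 60 px.
f = calibrate_focal_length(100.0, 21.0, 420.0)   # -> 2000 px
print(distance_to_object(f, 7.0, 60.0))          # -> ~233 cm
```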
⭐️ Currency Detection - Detects currency denominations.
Swipe left or use the command “currency” to detect currency denominations.
- v1: The app uses feature matching from OpenCV and was trained on limited data due to time constraints.
Implemented by @N Lirajkhanna
- v2: The app uses feature extraction methods from OpenCV whose outputs feed a machine learning model, trained on a much larger dataset and optimized to achieve an accuracy of over 90%.
Implemented by @Nilavan
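For a sense of how the v1 feature-matching approach works, here is a minimal OpenCV sketch using ORB descriptors and brute-force matching; the file paths, match threshold, and denomination map are illustrative, and the exact features the app uses may differ.

```python
# Minimal OpenCV feature-matching sketch in the spirit of v1; paths and the
# distance threshold are illustrative.
import cv2

def count_good_matches(query_path, reference_path):
    """Match ORB descriptors between a captured note and a reference note."""
    query = cv2.imread(query_path, cv2.IMREAD_GRAYSCALE)
    reference = cv2.imread(reference_path, cv2.IMREAD_GRAYSCALE)

    orb = cv2.ORB_create(nfeatures=1000)
    _, desc_q = orb.detectAndCompute(query, None)
    _, desc_r = orb.detectAndCompute(reference, None)

    # Hamming distance suits ORB's binary descriptors; crossCheck keeps only
    # mutually-best matches.
    matcher = cv2.BFMatcher(cv2.NORM_HAMMING, crossCheck=True)
    matches = matcher.match(desc_q, desc_r)
    return len([m for m in matches if m.distance < 50])

# Pick the denomination whose reference image yields the most good matches:
# denominations = {"10": "refs/10.jpg", "20": "refs/20.jpg"}
# best = max(denominations,
#            key=lambda d: count_good_matches("note.jpg", denominations[d]))
```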
⭐️ Read Text - Reads the text for you.
Swipe up or use the command “read text” to read the detected text.
This has been implemented using an optical character recognition (OCR) tool that recognizes and reads out text embedded in images.
Implemented by @TM Vishnu Mukundan
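A minimal sketch of this pipeline using pytesseract (the Python wrapper around the Tesseract OCR engine) might look as follows; pyttsx3 stands in here for the app’s actual text-to-speech output, and the file path is illustrative.

```python
# Sketch of the read-text pipeline: OCR an image, then speak the result.
# pyttsx3 is a stand-in for the app's actual text-to-speech engine.
import cv2
import pytesseract
import pyttsx3

def read_text_aloud(image_path):
    """Recognize text embedded in an image and read it out loud."""
    image = cv2.imread(image_path)
    # Grayscale input tends to help Tesseract's recognition.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    text = pytesseract.image_to_string(gray)

    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()
    return text

# read_text_aloud("sign.jpg")
```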
Application Workflow
Credits
This software uses the following open source packages:
Future Work
Although the features we set out to build have been successfully implemented, the following areas can be improved in future versions of the app.
- Accuracy of detection models can be improved. We can use better and more efficient models trained on a wider variety of data to make them more robust.
- Extend object detection to more classes. The current version of the app can detect up to 80 different everyday objects. Our goal is to extend this to most objects we come across.
- Implement object detection in real time instead of capturing an image. This can drastically improve the “independence” of the visually impaired. At present, we send an image to the API and it returns the result. Our next goal is to let the user simply keep the camera open while the app announces detected objects at any time.
Contributors
- A Nilavan
- Backend development
- Object detection (v1 & v2)
- Currency detection (v2)
- Ajith Manivannan
- Frontend development
- Speech-to-text & text-to-speech
- N Lirajkhanna
- SOS feature
- Currency detection (v1)
- Backend deployment
- TM Vishnu Mukundan
- Text detection (OCR)
References
- Detecting Objects in Flutter
- Find distance from camera to object/marker using Python and OpenCV
- How to OCR with Tesseract, OpenCV and Python
- How to Send an SMS With Python Using Twilio
- Adding speech-to-text and text-to-speech support in a Flutter app
License
This project is licensed under the MIT License - see the LICENSE.md file for details.