Team: semaphores
Solution: see-maphores
Description: a real-time webcam application that detects and analyses objects, generates descriptions of the surroundings, and reads the generated descriptions aloud in the user's preferred language.
Technologies and AIs have always been marketed as a magical cure-all, bringing convenience and efficiency to every aspect of our daily lives. As magical as this might sound, it only applies to mainstream users, leaving behind the visually impaired, the elderly, the cognitively disabled, and others. How might we make technologies more accessible and inclusive for them? How might we use technologies to target their specific needs? How might we include them in our race toward the Smart Nation and minimize the marginalizing effects of today's mainstream technologies?
Out of all the marginalized groups, we identified the visually impaired as our target audience, not only because of the scale of visual impairment in Singapore and worldwide, but also because its impacts extend directly to all walks of life.
We developed a real-time webcam application that detects and analyses objects, generates descriptions of the surroundings, and reads the generated descriptions aloud in the user's preferred language.
We integrated the following to build our solution (a sketch of the combined pipeline follows the list):
- YOLOv5 (You Only Look Once) object-detection model
- Tesseract OCR / EasyOCR: optical character recognition engines
- gTTS (Google Text-to-Speech)
- Google Translate API
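For illustration, here is a minimal sketch of how these pieces can be wired together. It assumes YOLOv5 loaded via torch.hub, the googletrans package as a lightweight stand-in for the Google Translate API, and playsound for audio playback; the function and variable names are ours for illustration, not taken from the original project.

```python
import cv2
import torch
import easyocr
from googletrans import Translator
from gtts import gTTS
from playsound import playsound

# Load the models once at startup: YOLOv5 for object detection,
# EasyOCR for text recognition, googletrans for translation.
model = torch.hub.load('ultralytics/yolov5', 'yolov5s')
reader = easyocr.Reader(['en'])
translator = Translator()

def describe_frame(frame):
    """Detect objects and text in a frame and return a spoken-style description."""
    # YOLOv5 and EasyOCR expect RGB images; OpenCV captures in BGR.
    rgb = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)
    detections = model(rgb)
    objects = detections.pandas().xyxy[0]['name'].tolist()

    # Pick up any readable text in the scene (signs, labels, documents).
    words = reader.readtext(rgb, detail=0)

    description = ''
    if objects:
        description += 'I can see: ' + ', '.join(sorted(set(objects))) + '. '
    if words:
        description += 'Text nearby reads: ' + ' '.join(words) + '.'
    return description or 'Nothing recognisable in view.'

def speak(text, lang='en'):
    """Translate the description into the user's preferred language and read it aloud."""
    if lang != 'en':
        text = translator.translate(text, dest=lang).text
    gTTS(text=text, lang=lang).save('description.mp3')
    playsound('description.mp3')

# Main loop: grab a webcam frame, describe it, and speak the result.
cap = cv2.VideoCapture(0)
try:
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        speak(describe_frame(frame), lang='en')  # e.g. lang='es' for Spanish
finally:
    cap.release()
```

In a real deployment the detection and OCR steps would run continuously while the speech output plays asynchronously, so the camera loop is not blocked while a description is being read out.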
Challenges: our biggest difficulty was integrating the different functions (object detection, OCR, translation, and text-to-speech) into a single working pipeline.
Accomplishments: we successfully combined all of these functions into one solution.
What we learned: how to integrate separate functions into a pipeline, and how to work with several different AI models.
What's next:
- Use a better dataset for object detection
- Use a more accurate OCR engine
- Incorporate a location tracker to tell users where they are located
- Integrate the solution into hardware, such as AR glasses