If you find our work useful please consider citing it.
@inproceedings{diganth2018pi,
title={PI VISION: Portable Infinitesimal Video Input Signal Processed Intelligent Object Naming},
author={Diganth, P and Ankolekar, Manasa K and Neha, BS and Mishra, Geetishree},
booktitle={2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT)},
pages={1--5},
year={2018},
organization={IEEE}
}
This project is based on SqueezeDet by @BichenWuUCB. To the work done by bichenWuUCB, our team added a new feature of determining the distance of the object in real time using the principle of triangular similarity. Using distance as a parameter we could vary the intensity of the haptic motor to alert the user about how near he/she is to the object. We also Integrated Google Assistant which will help the user to get the information he needs and is triggered using a push button. The whole project was carried out using a Raspberry Pi 3B/3B+. This project was solely based to help the visually impaired people to get through everyday life in a easy way.
- easydict
- joblib
- numpy
- python openCV
- pillow
- tensorflow (tensorflow 1.4 recommended)
The following instructions are written for Linux-based distros.
-
Clone the PI VISION repository:
git clone https://github.com/diganthp/pi-vision.git
Let's call the top level directory of pi-vision
$SQDT_ROOT
. -
(Optional) Setup your own virtual environment.
- The following assumes
python
is the Python2.7 executable. Navigate to your user home directory, and create the virtual environment there.
cd ~ virtualenv env --python=python
- Launch the virtual environment.
source env/bin/activate
- The following assumes
-
Use pip to install required Python packages:
pip install -r requirements.txt
-
Download SqueezeDet model parameters from here, untar it, and put it under
$SQDT_ROOT/data/
If you are using command line, type:cd $SQDT_ROOT/data/ wget https://www.dropbox.com/s/a6t3er8f03gdl4z/model_checkpoints.tgz tar -xzvf model_checkpoints.tgz rm model_checkpoints.tgz
-
Now we can run the demo. To run the video demo,
cd $SQDT_ROOT/ python ./src/demo.py
If the installation is correct, the detector should run the video with object detection as shown the picture below:
A huge thanks to @BichenWuUCB and team for their work on squeezeDet
- easydict
- joblib
- numpy
- python openCV
- pillow
- tensorflow (tensorflow 1.1 above recommended)
- python 3.4 (This version of python is recommended since there is no support for tensor flow 1.1 for python 3.5 or python 3.6)
- Google Assistant Installation on Raspberry Pi. Installation of the Google Assistant can be at Google Developers - Installation of the Google Assistant on Raspberry Pi 3
If the tensorflow support for python 3.5 and 3.6 is released please feel free to raise an issue.
- Raspberry Pi 3B.
- LED.
- Push Button.
- Earphones/Headphones with 3.5mm jack (The Bluetooth on the Pi 3B doesn't support audio files).
- Power Bank (Recommended above 5000 mAh).
- Mini USB Microphone.
- SD card. (Recommended 8gb class 10 SD card)
- Pi Camera v2.1 (alternatively USB Web Camera can also be used).
- Mini Haptic Motor.
- Mini Cooling fan (Raspberry Pi 3B on load touches temperature > 90 degree Celsius. Hence, it needs constant cooling).
All products except the Mini Haptic Motor can be brought using Amazon.com (for people residing in America or any other foreign countries) or Amazon.in (for people residing in India and nepal)
Mini cooling fan is brought from Adafruit (for people residing in America or any other foreign countries). In India it can be brought from Thingbits.
cd $SQDT_ROOT/
python ./src/demo.py
If the installation is correct, the detector should run the video with object detection as shown the picture below:
The pinhole (monocular) camera generates a one-to-one relationship between the object and the image. Using this principle, we can deduce a relationship between known parameters, focal length(f), known width of the object(W), the apparent width in pixels (P) and unknown parameter, distance from the camera to the object(d).
The distance acts as the parameter in varying the intensity and alerting the user about the distance of the object. The adafruit mini haptic motor disc has two wires for Vcc and ground. Its intensity can be varied by using PWM signals as shown in the figure below.
With all the advantages of the Google Assistant, using it with the original settings poses a few complications. The Google Assistant can be triggered using “OK GOOGLE/ HEY GOOGLE”. In a noisy surrounding, this might lead to false triggering. Also, due to high levels of background noise, it might not get triggered at all. To solve this problem, we trigger the assistant with a push button instead of a voice command. We have also added custom commands like Volume up, Volume down, Min. Volume, Max Volume, Power off and Reboot. The setup of the Google assistant is as shown in the picture below: