Input: photo from a camera (phone camera)
Logic: face emotion analysis api, local computer
Output: Speaker that plays the music recommended from the computer
o Phone detects when person enters the home and this triggers the picture event
o Take a picture through the phone
o Send the picture to the API
o API gets picture and returns a response that contains the predicted emotion
o Computer chooses the appropriate music based on the API emotion response
o Speaker gets music recommendation from computer
o Speaker plays music
Tbd