SVA / Fall 2016
Intendo is a device that leverages artificial intelligence to interpret human language in terms of emotions.
Working with my classmates Janel Wong and Tyler Gumb, I designed, prototyped, and wrote the code for Intendo.
The project's goals are two-fold:
Our initial concept was of a device the user wishes into and receives a unique token based on your wish. We imagined use in a public space. We wanted the piece to be whimsical and artistic.
We experimented with the effects of heat on thermal paper using cigarette butts and soldering irons, thinking we could use a heat pad in the device to produce a unique pattern for each users receipt. The intensity of heat required to achieve this was quite high so we decided to explore other possibilities for creating this unique token.
We initally got speech recognition working in Processing using this method where the speech processing was done by a google API in chrome and sent to Processing via Websockets.
For an initial proof of concept and to get a feel for the interaction for whatever wish the user said the printer responded by randomly printing from an array of responses we prepared.
One key component of the interaction we spent a lot of time considering was indicating to a passerby how to interact with the device. In this first iteration we put an IR sensor and coded the printer to print a phrase asking for wishes whenever someone stepped in front of the device. We thought there was room here for some magic and to give the impression of the device being alive and having a personality.
The use of heat within the device to effect the thermal paper of the receipt printer started to seem less likely to be able to accomplish. There were many concerns with how the paper left the printer and lining it up exactly to be subjected to a high enough level of heat for long enough and in a way that produced unique results each time seemed a little too much for our time frame, so we started considering alternatives. One consideration for a more practical application of the device was as a means to collect public sentiment data on something, so a device to encourage customers or visitors to provide feedback. We considered the possibility of collecting the responses in a database for analytics to be done later.
We were still debating what token—if any—the user should receive after interacting with the device. One thought was they receive an output that was instructive on how your speech was interpreted and classified by the computer. The idea being to encourage people to interact with the device to learn about sentiment analysis and receive a token on how their speech was analyzed.
Working off this concept and the idea that the responses would be grouped by their overall sentiment we came up with a new form for the device. We thought to replace the IR sensor initiated printing phrases asking for wishes with a LED that changes color and fades in and out to entice the user to press the button. When the user presses the button to initiate the recording and speaks when they release the button the recording is sent to IBM's Watson Sentiment Analysis API which will categorize the statement. The device then prints your statement on one of five sheets of cascading paper corresponding to the sentiment that it is grouped by.
The appeal of this form for us was that the five cascading sheets serve as a sort of physical bar chart immediately visible when a visitor comes up to the device of the overall sentiment of other users.
Josh Zheng's Medium posts (AlchemyLanguage Sentiment Analysis in Python and How To Build a Candy Machine With Feelings) were extremely helpful for us as we started to explore and setup the code for our device to get the text from the speech and then get the sentiment from that text.
We set up the five printers and tested coding the python to route the phrase to one printer at a time.
Once that was working reliably, we built two wooden enclosures: one for the input containing the button and microphone, which also housed the printer which delivered the token for the user; and one for the output of the 5 printers corresponding to the 5 emotions.
We tidied up the wiring and mounted Intendo in a public space (the lobby of our department) for testing.