I am bad with faces. I mean really bad with faces. My brain just doesn’t seem to be very good at mapping what someone looks like with their name. This often makes things difficult for me at networking events and conferences.
A passive mobile application that scans the environment around the user for faces. When it detects a face that it recognizes the application speaks the person’s name to the user via their bluetooth earpiece. Ideally this solution would also involve a discrete camera that could operate without being obvious to the people it is operating on. The point is to serve as a passive aid to memory while not changing the behavior of the people you are interacting with.
There are a couple concept applications out there along this line including Recognizr and Comverse Social AR. Both of these applications have the same problem, which is that you have to hold up your phone to take a photo of the person you want to identify, then wait for the result to come back. That is intrusive enough that a simple “I’m sorry, remind me what your name is…” would be a better option.
- Facial recognition. There are many providers of facial detection and recognition APIs, so it should be possible to license this piece. Unfortunately most of the providers don’t seem to be very good at licensing their SDK to people. I get the idea that these are all very small companies that spun out of someone’s PhD research.
- PittPatt never replied to my email.
- Luxand put me on their marketing email list, but never sent me an evaluation key.
- Betaface actually gave me a chance to evaluate their SDK. It works quite well. I wasn’t a fan of their licensing terms, but you might have different needs than I did.
- Ayonix got back to me right away but never provided the promised evaluation link.
- I don’t remember if I contacted Seeing Machines or not.
- Bluetooth camera – I bought an OptiEye. It works pretty well. If you ask them nicely they will send you the protocol documentation. The specs claim a four hour battery life, which is plenty for most networking events.
- Text to speech – I haven’t done any research here. Many applications do it, though, so I would imagine SDKs are available. If nothing else the user could record the names and the software could just play back the recordings.
- Mobile computer – Both Android and iPhone allow communication over RFCOMM, which is what the OptiEye uses. Existing devices are also too weak in the CPU department to do much visual processing on the phone, but they could stream video or individual frames up to a server for further processing.
What do you think? Dream product? Interesting project? Terrible idea?