The impending launch of the iPad has had me thinking a lot about where computers in general are going. Mobile computers are moving into larger form-factors and they are bringing their mobile operating systems with them. In addition to the iPad there are also about half a dozen tablets and a few netbooks coming out that run Android. I believe these devices are the start of a new wave that will eventually replace Windows and OSX machines for the vast majority of computer users.
This sort of platform has existed before, but previous attempts never really worked out. This time I think they have a real shot at it, and I think two factors will make the difference this time around. Both platforms have large libraries of apps, and both platforms greatly reduce the cost of owning a computer.
Previous attempts to build small, lightweight operating systems always included a big push to sign up developers. Then they failed to attract significant developer attention because they had no installed base. Then no one could figure out why they should buy one because they had no applications and the installed base never materialized. Both Apple and Google have used the mobile web to bootstrap both their user-bases. Many people are willing to buy the phones because they can read web-pages from anywhere. Nobody balks at developing for these platforms because there are millions of them out there. Then another wave of people are happy to buy the phones because of all the cool apps. As a result each platform have tens of thousands of applications when their mid-sized devices launch.
The cost of ownership factor is also a pretty big deal. For years I have had computer-savvy friends describe to me how they will no longer support relatives on Windows and have purchased iMacs for their parents. It is much easier for a normal person to break a Windows machine than to break a Macintosh, so the unfortunately tech-savvy guy in the family ends up spending more time supporting a casual computer user on Windows. Android and iPhone OS push this much further by removing most of the remaining pitfalls. This doesn’t matter much for powerusers, but for the average computer user it is a big deal.
Some previous attempts at midsized computers (e.g. Magic Cap and Newton) had a similarly low cost of ownership. Some previous attempts at midsized computers (e.g. Ultra-Mobile PCs, Windows-based Tablet Computers, and Netbooks) had a huge software library to draw on but a cost of ownership that was actually higher than their desk-bound brothers. These new mobile-derived operating systems are the first time we’ve seen both factors in the same devices. I think this could be as disruptive as the original Personal Computer revolution.
What do you think? Will these new mid-sized computers cause massive upheaval, or will they fall down the same dark hole as their predecessors and never be heard from again?
I am bad with faces. I mean really bad with faces. My brain just doesn’t seem to be very good at mapping what someone looks like with their name. This often makes things difficult for me at networking events and conferences.
A passive mobile application that scans the environment around the user for faces. When it detects a face that it recognizes the application speaks the person’s name to the user via their bluetooth earpiece. Ideally this solution would also involve a discrete camera that could operate without being obvious to the people it is operating on. The point is to serve as a passive aid to memory while not changing the behavior of the people you are interacting with.
There are a couple concept applications out there along this line including Recognizr and Comverse Social AR. Both of these applications have the same problem, which is that you have to hold up your phone to take a photo of the person you want to identify, then wait for the result to come back. That is intrusive enough that a simple “I’m sorry, remind me what your name is…” would be a better option.
- Facial recognition. There are many providers of facial detection and recognition APIs, so it should be possible to license this piece. Unfortunately most of the providers don’t seem to be very good at licensing their SDK to people. I get the idea that these are all very small companies that spun out of someone’s PhD research.
- PittPatt never replied to my email.
- Luxand put me on their marketing email list, but never sent me an evaluation key.
- Betaface actually gave me a chance to evaluate their SDK. It works quite well. I wasn’t a fan of their licensing terms, but you might have different needs than I did.
- Ayonix got back to me right away but never provided the promised evaluation link.
- I don’t remember if I contacted Seeing Machines or not.
- Bluetooth camera – I bought an OptiEye. It works pretty well. If you ask them nicely they will send you the protocol documentation. The specs claim a four hour battery life, which is plenty for most networking events.
- Text to speech – I haven’t done any research here. Many applications do it, though, so I would imagine SDKs are available. If nothing else the user could record the names and the software could just play back the recordings.
- Mobile computer – Both Android and iPhone allow communication over RFCOMM, which is what the OptiEye uses. Existing devices are also too weak in the CPU department to do much visual processing on the phone, but they could stream video or individual frames up to a server for further processing.
What do you think? Dream product? Interesting project? Terrible idea?
At TED 2010 Blaise Aguera y Arcas from Microsoft demoed live integration of video into the existing structure-from-motion dataset in Photosynth. Though his demo showed a video feed moving around a scene the same data could just as easily be turned around to find the precise position of the camera in real-time. That capability is a key part of building a head-mounted augmented reality system.
Two weeks later Google announced that they are incorporating user photos into Google Street View. This requires essentially the same data as Photosynth. Google has the added advantage that they can combine it with the Street View images and LIDAR data they are already collecting. Though they haven’t demonstrated real-time capability with this data they certainly have all the pieces they need to make this happen.
Access to the data required to perform pose recognition with cameras is a novelty at the moment, but if mobile augmented reality takes off in a big way it will become a key component of that system. In my opinion this component is too important to be left in the hands of one company. A much more desirable situation would be to have an OpenStreetMap-type project to accumulate and curate a freely available dataset to provide structure from motion and pose recognition for use in mobile augmented reality and whatever other uses someone can dream up.
OpenStreetMap is a project that sprung up to provide access to data that was free from the costs and restrictions that come with commercial data. It uses a Creative Commons license to make the data free for use by anyone for most any purpose. Although OpenStreetMap came about in response to the restrictions on commercial data sources, the same approach could be taken for 3D structure and image data even though commercial sources for that data do not yet exist. If OpenStreetMap had existed when car navigation systems became feasible in the late nineties it is likely that many commercial products could have been developed on open data at far lower cost and in much more variety.
All such a project needs is a small number of dedicated people to get it started. Download a copy of Bundler (an open source structure from motion library based on the same research that spawned Photosynth) and seek out publicly available photograph libraries. Then talk a cloud computing provider into sponsoring the project by hosting the data and build things up from there. The project won’t have many users for a few years, but as the accuracy and coverage of the dataset grows the set of applications based on this open data will grow too. Somebody just has to get the ball rolling.
I have a bunch of ideas like this one rattling around in my head. Some of them could be products or businesses, and some are just cool projects. I have looked into them all to some degree but probably never start real work on them. I’m going to post them here in an attempt to spawn a discussion and encourage you to share your thoughts in the comments. Feel free to do whatever you like with these ideas.
The 2010 LOGIN conference is accepting session proposals for just 11 more days. January 25 is the last day to submit something, so hurry up and get your talk proposal sent in!
This rounds out my trilogy of year end posts. Here is what I think will happen in the coming year. I would love to hear your thoughts on these predictions:
- Star Trek Online will be the only significant MMO launch in 2010. It will do well enough to make Atari and Cryptic plenty of money, but will not do nearly as well as World of Warcraft, so many people will consider it a failure. (Those people are dumb.)
- At least three of the major unreleased MMO projects will be cancelled. I have a guess about which two are most likely, but I’ll keep that to myself. Qualifying projects include:
- Guild Wars 2
- Whatever Carbine is working on
- That console MMO Turbine hasn’t said much about
- Whatever Trion is working on
- The Sci-Fi channel tie-in MMO that Trion has said they’re working on
- Whatever Zenimax is working on that may or may not be Fallout
- Whatever 38 Studios is working on
- The Agency
- Whatever Gazillion’s Gargantuan studio is working on
- Star Wars: The Old Republic
- The second MMO that CCP is working on down in Atlanta where all those White Wolf people are. Hmm. What could it be?
- Whatever Red 5 is working on
- DC Online
- That other MMO I know NCSoft is working on that is completely under the radar
- Whatever Slipgate Ironworks is working on
- The second MMO Blizzard has in the works
- Jumpgate: Evolution
- Project Natal and the Playstation Motion Controller will both come out. Natal will do fairly well. Both controllers will allow some new kinds of games, but we won’t see any compelling examples of those games until 2011.
- Unemployment will peak and then start to fall.
- The compass+GPS augmented reality products will begin to shift to general location-awareness and away from their Augmented Reality roots. They will de-emphasize magic lens and start to emphasize aggregation of nearby content.
- No consumer-level see-through displays will come out in 2010. Significant progress toward them will be made, but nothing will be released.
- Neither Google nor Apple will release any kind of AR-focused hardware
- The use of “wave it in front of your webcam” type AR in advertising will peak with an AR-enhanced ad in the Superbowl. The backlash will begin. By the end of the year the advertising world will have moved on.
- Apple will release its tablet and a new iPhone (faster and more storage) but won’t release anything that is specifically an AR product.
- Apple will address the pain caused by its app-store approval process, at least in part. I have no idea what their specific solution will be, but they aren’t going to let their developer community grow to hate them.
- Android will continue to pick up steam. By the end of the year Android will boast 50,000 applications.
- People will spend the entire year trying to find something really useful to do with Google Wave. They won’t succeed in 2010.
- Google will make Wave interoperate with email. This will make it useful as an email client if nothing else.
Ok, that’s the last of this kind of post for at least a year. If only I could get back to posting regular stuff again.