You are currently browsing the archives for the user centrism category.



Social Graph is Plural

“Social Graph” is not just a singular noun.

“The Social Graph” is a popular misnomer that has plagued the social networking portability conversation ever since Brad Fitzpatrick catalyzed the blogosphere with a vision about the Global Social Graph.

But in fact, “The Social Graph” has little real value outside of computer science elegance. Nobody but Big Brother, the TSA, the CIA, and [insert surveillance agency of your jurisdiction here], actually want that single, monolithic view of all the relationships in the world. That’s The Social Graph.

In contrast, my social graph is hugely valuable to me. Your social graph matters to you. And it might be interesting to discover where our graph (plural) overlap. But neither of us actually care about The Social Graph.

A few fishAt the VRM Workshop 2008, here at Harvard’s Berkman Center for Internet and Society, it came out that “social graph” is actually plural.

Like fish.

The Social Graph is a misleading distraction, a handy buzzword we can all slip into our cocktail conversations. But the real value is in the personal, independent social graph we all have. Plural.

If you think about it, that’s the only way you can really make sense of it in our user-centric, user-driven world.

R-cards “ah-hah!” at IIW

At last month’s Internet Identity Workshop and the subsequent DataSharing Summit, Markus S and Drummond Reed unpacked several ideas about r-cards, which, to a certain extent, are an evolution of the Information Card at the heart of CardSpace.

Going into IIW, I understood r-cards simply as a hybrid of InfoCard’s managed and personal card models. Managed cards are issued by another party–all the data associated/transmitted with that card is controlled by that managing party, while personal cards are self-asserted, allowing individuals to serve as their own card provider, controlling all of the associated data. R-cards then, allow a managing party to co-control a card with the user–with some data controlled by the managing party and some controlled by the user.

However, during the IIW demo of r-card, I had an epiphany about how powerful the r-card is, once we actually allow the user to manage the personal claims through multiple, dereferenceable links.

One issue that came up during the demo was that if the “personal” side of the r-card is manually entered claims, such as contact information, then the user is creating a management nightmare: duplicate claims would need to be entered and maintained across many different r-cards. The more r-cards, the worse the problem.

The “obvious” solution discussed at the session was to allow the user to specify specific claims that are served by other IdPs, such as a Personal Address Manager. And for completeness sake, let’s note that such claims could be mashed up from multiple other IdPs, not just a single one. Thus, any number of claims from a particular IdP could act as a sort of sub-card, combining with other subcards at presentation time.

The net result of this is a realization that that perhaps the most interesting thing about r-cards is their use as dynamic cards or aggregate cards or mashup identity cards.

That’s pretty cool in itself.

However, it also struck me that this also potentially fixes usability problems around authorizing a bunch of vendor’s (M) access to identity claims at a variety of different identity providers (N). This potentially requires N points of authorization and authentication for each M vendors (or relying parties). Sub-cards (or r-cards) may combine that task at the point of presentation for much greater user understanding and simplicity.

Since the Card Selector is itself a trusted point of authorization, we should be able to use the “mashup” gesture as explicit authorization for relying parties to access the claims specified in the sub-cards. That is, the UI of creating the r-card/mashup card/dynamic card also explicitly approves access to specific claims from multiple IdPs, since after all, the selector is where you select which claims to present to relying parties.

This adjustment to the Information Card ceremony greatly simplifies the user experience, while retaining all the power of distributed claims at appropriate IdPs. For example, it would allow me to specify my Passport # to United Airlines, as a verifiable claim served by the US Secretary of State IdP (which should be trusted by UA), streamlining any international travel I might do, while retaining my contact info at my Personal Address Manager. All with the same authorization ceremony I use with any information card relying party.

This realization was, for me, the most surprising insight into the power of the r-card. In fact, I’m wondering if the name “r-card” captures it best.

Zen and Technology

I’m not sure how I found it, but today I discovered a bit of a gem in the blogosphere: ValleyZen.

For a quick taste, check out the interview with Drue Kataoka on View from the Bay. It is amazing how a few simple words can have such a profound visceral impact.

Drue’s suggestions resonate with my user-centric world-view:

  1. SIMPLIFY
    Focus on what’s important. Eliminate what’s not.
  2. IMMEDIACY
    React to the moment — not to your fears and concerns.
  3. BREAK YOUR RHYTHM
    Surprise yourself and those around you.
  4. BE CALM
    Find Tranquility in Action.
  5. GREEN FROM THE INSIDE OUT
    Begin with your own personal ecosystem.

Take time for yourself, reconnect and put things in perspective, and engage the world on your own terms, in the moment, sustainably.

When redefining technology in personal terms, Drue’s take on Zen packs a powerful punch.

Law enforcement v Minimal disclosure

The Washington Post today exposed considerable excesses by “fusion” centers organized post 9/11.

Intelligence centers run by states across the country have access to personal information about millions of Americans, including unlisted cellphone numbers, insurance claims, driver’s license photographs and credit reports, according to a document obtained by The Washington Post.

Dozens of the organizations known as fusion centers were created after the Sept. 11, 2001, terrorist attacks to identify potential threats and improve the way information is shared. The centers use law enforcement analysts and sophisticated computer systems to compile, or fuse, disparate tips and clues and pass along the refined information to other agencies. They are expected to play important roles in national information-sharing networks that link local, state and federal authorities and enable them to automatically sift their storehouses of records for patterns and clues.

The list of information resources was part of a survey conducted last year, officials familiar with the effort said. It shows that, like most police agencies, the fusion centers have subscriptions to private information-broker services that keep records about Americans’ locations, financial holdings, associates, relatives, firearms licenses and the like.

Centers serving New York and other states also tap into a Federal Trade Commission database with information about hundreds of thousands of identity-theft reports, the document and police interviews show.

Pennsylvania buys credit reports and uses face-recognition software to examine driver’s license photos, while analysts in Rhode Island have access to car-rental databases. In Maryland, authorities rely on a little-known data broker called Entersect, which claims it maintains 12 billion records about 98 percent of Americans.

In its online promotional material, Entersect calls itself “the silent partner to municipal, county, state, and federal justice agencies who access our databases every day to locate subjects, develop background information, secure information from a cellular or unlisted number, and much more.”

“There is never ever enough information when it comes to terrorism” said Maj. Steven G. O’Donnell, deputy superintendent of the Rhode Island State Police. “That’s what post-9/11 is about.”

The last statement pretty much sums up current institutional thinking on individual liberty and national security: in the fight against terrorism, we have a moral obligation to do everything we can. Everything.

It’s scary how much that position echoes that of fascism. As promoted by Mussolini, fascism builds a moral framework based on the primacy of the state. Fasciste means a bundle of sticks, symbolizing that the group is stronger than any individual. Fascism extends that thinking, declaring that each individual’s rights exist only insofar as they support the state. Or to restate, in the defense of the state, there are no individual rights.

Which, if you think about it, is exactly what anti-terrorist programs assert when claiming that terrorism trumps the rights and privileges of the suspect or accused. Due process, protection from unreasonable searches, freedom of speech. All of these have rights have been trampled on in the name of the War on Terror. The fusion centers are just one more institution created by the mindset that brought us illegal wiretaps, extraordinary extradition, secret prison camps, extra-territorial detention, and torture.

I understand law enforcement’s position. It is easier to enforce laws when you know everything about everyone, just like in a police state (see The Lives of Others for an Academy Award-winning story of pre-information age East Germany’s police state). But it is impossible for a police state to generate the economic and social well-being that emerges in a free society… and it is that well-being which, ultimately, is the core of U.S. global power. Simply put, undermining freedom undermines US security.

In contrast, consider the subtle brilliance of Kim Cameron’s Laws of Identity, in particular, law 2:

2. Minimal Disclosure for a Constrained Use

The solution that discloses the least amount of identifying information and best limits its use is the most stable long-term solution.

We should build systems that employ identifying information on the basis that a breach is always possible. Such a breach represents a risk. To mitigate risk, it is best to acquire information only on a “need to know” basis, and to retain it only on a “need to retain” basis. By following these practices, we can ensure the least possible damage in the event of a breach.

At the same time, the value of identifying information decreases as the amount decreases. A system built with the principles of information minimalism is therefore a less attractive target for identity theft, reducing risk even further.

By limiting use to an explicit scenario (in conjunction with the use policy described in the Law of Control), the effectiveness of the “need to know” principle in reducing risk is further magnified. There is no longer the possibility of collecting and keeping information “just in case” it might one day be required.

The concept of “least identifying information” should be taken as meaning not only the fewest number of claims, but the information least likely to identify a given individual across multiple contexts. For example, if a scenario requires proof of being a certain age, then it is better to acquire and store the age category rather than the birth date. Date of birth is more likely, in association with other claims, to uniquely identify a subject, and so represents “more identifying information” which should be avoided if it is not needed.

In the same way, unique identifiers that can be reused in other contexts (for example, drivers’ license numbers, Social Security Numbers, and the like) represent “more identifying information” than unique special-purpose identifiers that do not cross context. In this sense, acquiring and storing a Social Security Number represents a much greater risk than assigning a randomly generated student or employee number.

Numerous identity catastrophes have occurred where this law has been broken.

We can also express the Law of Minimal Disclosure this way: aggregation of identifying information also aggregates risk. To minimize risk, minimize aggregation.

Whether or not you think the War on Terror is being handled well, it is a demonstrable fact that human systems fail. People make mistakes.
And that means we can guarantee that institutions–even when acting in our own best interest–will make mistakes, like the admitted errors of the FBI, as reported by the NYT:

F.B.I. Made ‘Blanket’ Demands for Phone Records

WASHINGTON — Senior officials of the Federal Bureau of Investigation repeatedly approved the use of “blanket” records demands to justify the improper collection of thousands of phone records, according to officials briefed on the practice.

Under the USA Patriot Act, the F.B.I. received broadened authority to issue the national security letters on its own authority — without the approval of a judge — to gather records like phone bills or e-mail transactions that might be considered relevant to a particular terrorism investigation. The Justice Department inspector general found in March 2007 that the F.B.I. had routinely violated the standards for using the letters and that officials often cited “exigent” or emergency situations that did not really exist in issuing them to phone providers and other private companies.

F.B.I. Says Records Demands Are Curbed

WASHINGTON — The Federal Bureau of Investigation improperly obtained personal information on Americans in numerous terrorism investigations in 2006, but internal practices put in place since then appear to have helped curtail the problems, Bush administration officials said Wednesday.

The Justice Department’s inspector general is expected to issue a report in coming weeks that updates the findings of a major investigation last year into the F.B.I.’s use of so-called national security letters, which allow investigators to obtain telephone, e-mail and financial information on people involved in investigations without a court warrant.

Last year’s report caused an uproar in Congress when it was disclosed that the F.B.I., under powers granted by the USA Patriot Act, had misused its authority to gather records in thousands of instances from 2003 to 2005. The new report from the inspector general will examine the bureau’s use of the records demands in 2006.

At the end of the day, this isn’t about any particular individual, nor even any particular violation of our constitutional rights.

It’s about addressing the systemic problems of the information age. There will always be threats to national security. There will always be the drive to get as much data as possible into the hands of a few, elite law enforcement agencies, capable of acting in the “public good”. And there will always be those individuals who break the rules, whether for good intent or malicious device. We don’t need conspiracy theories to point out the dangers of centralizing all the information about everybody.

What we need is an open-eyed approach to building information systems on user-centric principles, such as Cameron’s seven Laws of Identity. Do that and a vast number of systemic risks of the information age go away.

NY State strikes out against unapproved privacy invasion by online advertisers

Kudos to Assemblyman Richard L. Brodsky in the NY State Assembly for taking on GoogleClick and the rest of the back-end invisible online tracking services.

The NYT reports A Push to Limit the Tracking of Web Surfers’ Clicks:

AFTER reading about how Internet companies like Google, Microsoft and Yahoo collect information about people online and use it for targeted advertising, one New York assemblyman said there ought to be a law.

 

Michelle V. Agins/The New York Times

Assemblyman Richard L. Brodsky, the sponsor of a New York bill to limit how companies collect data on computer users.

So he drafted a bill, now gathering support in Albany, that would make it a crime — punishable by a fine to be determined — for certain Web companies to use personal information about consumers for advertising without their consent.

And because it would be extraordinarily difficult for the companies that collect such data to adhere to stricter rules for people in New York alone, these companies would probably have to adjust their rules everywhere, effectively turning the New York legislation into national law.

“Should these companies be able to sell or use what’s essentially private data without permission? The easy answer is absolutely not,” said the assemblyman who sponsored the bill, Richard L. Brodsky, a Democrat who has represented part of Westchester County since 1982.

“A law like this essentially takes some of the gold away from marketers,” said Joseph Turow, a professor at the Annenberg School for Communication at the University of Pennsylvania. “But it’s the right thing to do. Consumers have no idea how much information is being collected about them, and the advertising industry should have to deal with that.”

This is an absolute no brainer.

If you don’t have permission, don’t track users.

In the physical world, we have developed fairly robust rules of social etiquette and even laws regulating this sort of behavior. Can you imagine how creepy it would be if some stranger surreptitiously followed you around, noting where you go, what products you buy, even what sections in the supermarket you lingered in? Yech! Get that stalker away from me.

new carAnd yet, that is exactly what most (if not all) online ad networks do to maximize their ability to sell high margin ads targeting Internet users. It makes sense. If they can tell from your clickstream behavior that you are likely looking for a new car, then they can create a lot of value by showing you new car ads. Value for advertisers and value for you… after all, you ARE in the market for a new car, right?

When it works, it’s cool. But what about when you don’t want it working? When you want a little discretion as you window shop? When you’d like some privacy? Unfortunately, it doesn’t work that way. Users can’t tell when the ad networks are, or are not, tracking–it is all invisible to them–and there’s no equivalent of a “do not call list” to turn off such tracking.

The right answer is to move toward user-centric advertising, where the user is explicitly in control of all the data used to offer them ads and can even limit the types of ads shown. This resonates with Esther Dyson’s testimony to the Federal Trade Commission at their Townhall on behavioral targeting and her subsequent article at Huffington Post, where she proposed using a “Disclosure 2.0″ approach to this issue.

handshakeIf advertisers and ad networks can create real value with behavioral tracking and targeting, then full disclosure–and even full user control–will only enhance trust and deepen the relationships between businesses and customers. The long term value of a customer depends on building viable, healthy relationships. Relationships depend on trust. By engaging openly and responsively with their customers–with full disclosure and as much user control as possible–companies can craft entirely new, more trusting and more profitable relationships while customers feel more comfortable about their personal boundaries, have increased confidence in their vendors, and get easier access to better products and services.

Advertisers need to get this. Ad networks and search engines need to support it. And it may be that regulators need to enforce it.

[This is precisely the sort of payoff for vendors that Project VRM is working towards.]

NewsGang talks data portability. Next up: Service Portability.

data and globeExcellent chat today by Steve Gillmor, Chris Saad, Mary Hodder, Karoli Kuns, Robert W. Anderson, Matt Terenzio, and Bruce Lerner about data portability. They get to the nitty gritty about data portability, licensing, and social networks. Perhaps the best Gang I’ve ever heard.

So, Steve, if you’re listening, take this to the next level and talk about service portability.

It’s great to be able to move my data from service to service. Data portability is a good thing–and we absolutely must address the licensing and privacy issues that cloud that horizon. We also need to be able to move our services from provider to provider.

We can do that today with domain names that we own. We can move our blog or our website or our email from one hosting provider to another. The next step is to extend that to user-controlled services that expose data on our terms, under our control.

Data portability lets everyone pass data around so different service providers can do smart things with that data. Ok. But we learned long ago that software systems are more robust, more scalable, and more maintainable when rather than exposing the data, you expose functions that use that data.

email imageI don’t want people who email me to have direct access to my email data file a server somewhere. That would be insane. I want them to have a well-defined, constrained, complete service interface for sending me email, no matter which service provider I choose. An interface that lets them reach me, but keeps them from reading and deleting other email.

Similarly, we need to take user data, place it in a personal datastore (yea! portability!), then provide specific, well-defined access services to third party service providers, using that data, where the user controls those services completely: what services are available, who can access them, and even who the underlying service host is. This is how email works. How websites and blogs work. Next is to take this to user-centric services with complete, seamless data and service portability across the entire cloud.

We know that we need to be able to move our email service from one service provider to another. We know that we need to be able to move our websites to the host of our choice. We know that we need to be able to move our cell phone number from one carrier to another. And we know that we need to be able to change our attorney of record, our doctor, our insurance provider, etc.

We also need to be able to move our MySpace profile and Facebook page anywhere, anytime, on our terms… not just the friends list, but the entire visual gestalt. We need to be able to move our IM and our Twitter services. We need to be able to move our search history from one search provider to another. Pick any service you have come to depend on and understand that dependence creates the need for liberation, the need to get that service on your terms with the provider you prefer, under your complete control.

Without complete portability–services and data portability–innovative service providers will corner markets with data silos and service lock in. Only with transparent, seamless portability, can we leverage the open market and open network to drive to the most desirable and most useful services.

Hey YouThe user-centric identity community is way ahead of the curve on this one, and I’m looking forward to the data portability movement re-discovering the architectural realizations learned the hard way by OpenID, CardSpace, Liberty Alliance, and Higgins, just as the identity community begins to extend from the hard core technology built for identity and starts working towards the applications that will connect ultimately to real value for real users. And it has all been learned and continues to be built through collaborative efforts toward real portability and interoperability at the heart of the infrastructure. In particular, XDI has made great progress hashing out exactly the sort of licensed-based identity-authorized data access that Steve talked about in the podcast. ProjectVRM is driving a user-centric approach to commerce in this conversation and I encourage folks to join us all at the next IIW unconference and to keep an eye open for a VRM workshop sometime later in the year.

Europe continues to lead privacy conversation with IP ruling

The EU is years ahead of the US in user rights and privacy. For a VRM example, see the UK’s Buyer-Centric Commerce Forum.

Now, according to the Washington Post, an EU judge has pushed the privacy envelope even further, saying “IP addresses are personal data“:

BRUSSELS — IP addresses, strings of numbers that identify computers on the Internet, should generally be regarded as personal information, the head of the European Union’s group of data privacy regulators said Monday.

This will be interesting to watch…

The Killer App Proceeds From the User

Alex Iskold of Blue Organizer asks “What is the Killer App?” for the Semantic Web in an article that nicely condenses the current best of class in the major contending promises of what Tim Berner’s Lee has recently dubbed the Giant Global Graph:

  • Natural Language Understanding
    • No longer a need for cryptic “Googlese” to get the computer to give you want you want.
  • The Genie in the Bottle
    • The magically perfect assistant who can answer any question or satisfy any need you might have.
  • Semantic Knowledge Bases
    • Structured databases that have deep understanding of the meaning behind the data, rather than just the characters and numbers used to represent the data. Think Freebase and Twine.
  • Semantic Search
    • Natural language understanding driving search results, so you can ask questions like “What clubs does Tiger use?” rather than Googleses keyword queries. Hakia, Powerset, and Cognition are all in this space.
  • Social Graph
  • Shortcuts

It’s nice walk through the space and particularly interesting how Alex responds to the current state-of-the-art in each. I’ll summarize here, so I can respond in turn (check out the full article for Alex’s actual statements):

  • Natural Language Understanding
    • Huge, hairy problem. No solution in site.
  • The Genie in the Bottle
    • Even harder. Needs magic that isn’t even conceptually well understood.
  • Semantic Knowledge Bases
    • More detailed data is good, but does it really help users? Not emotionally catalytic enough for people to actually get excited and jump on board.
  • Semantic Search
    • Doesn’t look like the killer app so far, because none of the “semantic” approaches seem to improve much on Google.
  • Social Graph
    • This is just a subset of the semantic web and therefore not its killer app.
  • Shortcuts
    • An up and coming category, these embedded shortcuts remove search as the killer navigation online. However, it is still young, misunderstood, and also lacks emotional umph.

First, the most intriguing item is that Alex is candid enough to be critical of the category in which he places his own company’s flagship product. Perhaps AdaptiveBlue has turned the corner on their conceptualization of the market and are rapidly, fiercely developing their next innovation, their next rev, the thing that just might become the killer app of the Semanic Web. That makes me curious, indeed.

Second, I like the break down, but naturally have some slightly different opinions. SwitchBook is still largely in stealth mode–we have yet to publish much on what we are doing even though we are relatively open in face-to-face meetings. However, from my posts here you can guess that it involves search, user-centrism, and particularly the principles underlying VRM.

So let’s look at Alex’s breakdown again:

Natural Language Understanding

Definitely a huge problem. Not only do you have to deal with the incredible elasticity of language, once you’ve mapped the natural language into some sort of internal representation, you still have to figure out what the heck you are going to do with it.

In other words, “understanding” is context specific not just in terms of words having different meaning in different places–Jaguar could mean a car, a cat, or an operating system depending on whose brochure or website you find it on–but it also has different meaning based on what you (as a system, as a service) are going to do with that understanding.

  • Are you going to return web pages that contain Jaguar with the same meaning?
  • Are you going to offer alternatives to the term Jaguar, like a thesaurus?
  • Are you going to translate Jaguar into other languages?
  • Are you going to sell Jaguar compatible products?
  • Are you going to reason over the threats and opportunities of Jaguars?

All of these require fundamentally different internal representations of the “understanding” of the natural language from the user.

As Jaron Lanier will tell you, language is an interface by which people remotely control the world outside their mind. We use it to communicate with others to get what we want and to understand how to respond to others (which is basically figuring out how to eventually get others to give us what we want). As such, its primary use, its raison-d’etre, is to influence the world around us. So, what we really want isn’t to understand the language, but to understand (1) what a speaker wants and (2) how to influence the world.

It turns out, people are incredibly adaptive at both of these tasks. Language is just one of the interfaces we use and we are capable of learning entirely new tools quickly when they demonstrate a more efficient, more effective way to get what we want. The humble spreadsheet is one of my favorite examples of this. I believe that more people “program” in MS Excel than in any classic programming language: we write mini-programs using functions like sum() and average() and put data in and look at the results. Who would’ve thought that entry-level clerical workers, accountants, and soccer moms around the world would’ve learned to program? And yet, they do. In my opinion, Excel is probably the most widely used programming environment in the corporate world.

Could you imagine trying to replace that with Natural Language? I can only imagine that a natural language version of Excel would be more convoluted and harder to use, but maybe that’s just because I lack imagination.

The Genie in the Bottle

This is more interesting. I agree that this goal is arbitrarily far away–no one will crack this nut entirely until we have both omniscience and omnipotence programmed into our software (and that is essentially never). However, by understanding clearly exactly what the Genie would do if he or she could, then you have a starting point for building innovative solutions.

Consider the development of online virtual worlds. Many people also said that the fictional Star Trek holodeck is arbitrarily far into the future, that, like the Genie, it requires so much advanced technology as to effectively be magic. And yet, Janet Murray’s Hamlet on the Holodeck gave us a realistic assessment of the current state of the art and how we might eventually get there. Sure, we are still arbitrarily far away from the uber virtual experience of the Holodeck. But Second Life, World of Warcraft, and Grand Theft Auto have all broken incredible ground in making a simpler, more feasible version of that experience available today to tens of millions of people.

So, what we can learn from the Genie is how to think about the “perfect” Search service. Imagine for a moment the absolutely perfect search service. Think bigger than natural language search. Think bigger than talking to your computer and getting what you want.

The perfect Search is when you only just barely have to indicate your intention and your search result appears. Somehow, magically, the system just knows what you want and when you are ready to actually act on that desire, the system has already brought your desire to you. No more running to the vending machine to get a soda from an arbitrarily limited selection in fixed volume and vendor-mandated packaging. The system knows you are getting thirsty, knows what you want (not just from history information but even from sensing your current blood-sugar and taste craving) and how you want it, and the moment you commit to getting that soda, it appears at your desk–perhaps even without you knowing exactly which soda you wanted today. All of this done discretely, unobtrusively, privately, and with the utmost discretion so neighbors or co-workers don’t see what you’d rather they don’t. The action, ultimately, is always driven by your committed intention. Not your attention, not some statistically predicted estimate of your desire, but your actual, expressed commitment to realize a particular desire. Express an intention and magically, it is fulfilled.

That’s the Genie.

While it isn’t yet available, bits and pieces of it are becoming available, just as online text MUDs and World of Warcraft are bits and pieces of 30 years working towards the ultimate virtual reality. By placing the committed intention of the user at the core of value creation, at the heart of the system design, I believe the Genie provides an almost the ideal model for conceptualizing the Holy Grail of Search.

Semantic Knowledge Bases

Essentially, I agree with Alex, this is a technology looking for a problem. “Better” data and more “powerful” ways to interact with and reason over that data should provide better results and is, therefore, a Good thing–assuming there are no other costs. Unfortunately, the semantic web has significant transitional and ongoing costs to turn the free-form, anyone-can-post-anything World Wide Web, into a system where participating as a first class of citizen requires using RDF or microformats or some other arcane technology to transform formerly arbitrary scribblings–and marketing and online stores and customer service and media outlets and whatever–into semantically structured information. It requires an imposition of structure that is inherently limiting and counter to the user-centric architecture of the open web.

Nobody wants to pay that cost unless the immediate value to them is obviously much greater. And so far, the value is uncertain and far into the future.

Semantic Search

Alex suggests that because none of the semantic search companies is better than Google that semantic search isn’t the killer app. Well, Google uses a lot of semantics in its Search. Most users just don’t know it. They’ve used Latent Semantic Indexing for years and AdSense is all about wicked smart semantic analysis of web page content for matching ads from the Google ad universe. In fact, one of the more interesting semantic tricks Google does is one you can see for yourself. Try typing “jaguar” (or some other ambiguous term) into Google’s query box.

You’ll find that alternative meanings of “jaguar” all show up in the early results. Jaguar as a car. Jaguar as the cat. Even the Jaguar quantum chemistry package from Schrodinger, which has no reason being in the top ten at Google. Google does this because it knows that from the limited query box, it can’t figure out which Jaguar you really mean. But it also knows that users will filter out the misses and get excited about the hits. They design for the “Ah-hah” moment. As long as one in ten (or so) results matches the user’s intended meaning of Jaguar, then Google gets credit for finding the “right” jaguar. Brilliant.

So, I argue that any search that isn’t semantic is a dinosaur waiting for the undertaker. Maybe it isn’t a killer app as a distinct service, but it is already an integral part of the #1 killer app of the Web, Search.

Social Graph

On this one, Alex fails to explain clearly enough why he doesn’t like it. Any killer app is going to be a “subset” of the entire market. Email isn’t the totality of the Internet, but it is the killer app that first broke down the isolated IT networks and marched like Sherman all the way through to the consumer market to give the sexier World Wide Web a fighting chance at establishing the Internet as much a fundamental part of the civilized world as electricity, running water, and paved roads.

Actually, I think the social graph might be the killer app of the Semantic Web. It doesn’t deliver the full value of the Semantic Web, but it provides such immediate, obvious value for so many people that once the privacy controls are worked out, many many people are going to be surfing the Semantic Web without knowing it as they seamlessly mingle across their social internetwork through the former silos of Facebook, MySpace, Plaxo, and others. If it can be a killer app without people giving it credit, then the Social Graph is definitely a contender.

Shortcuts

This is absolutely illuminating. I like AdaptiveBlue’s product a lot, and others in this category have potential. However, I usually find the disjoint interactions confusing. Shortcuts, by nature, interfere with the “normal” web experience and are inherently intrusive. I happen to have Snap installed on my machine and I’m still surprised and often annoyed when it pops-up “previews” of links I’m doodling my cursor on.

I do that… I doodle mouse and doodle click. I have the same problem at the New York Times’ website, actually. They allow you to look up the meaning of any word on a page just by double-clicking on the word. Problem is, I doodle-click meaninglessly, sort of a virtual twiddling of my thumbs as I browse. And -whoops- I just triggered a new page download I don’t really want. It is a mess.

So, shortcuts have a long way to go to be less intrusive and to find the right “intuitive” connection with the user. Ultimately, I am a huge fan of augmenting the traditional “browse”-based experience of the web, rather than replacing it wholesale. People like the web. They like their services. They like the freedom of going anywhere that supports http and html. And yet, many of those websites don’t have the technical wherewithal to get “semantic”.

BlueOrganizer does a nice job, for example, of connecting IMDB listings of movies with NetFlix so it is easy for you to go from the Internet’s unofficial authority on movies to the leading movie-on-demand service. All without NetFlix or IMDB needing to do anything. That sort of user centrism is critical to the next evolution of the web and it’s the semantics of what is already on the web pages that make that possible. Shortcuts are just one effort to do something with that semantic data. Perhaps as they grow up, they will become more useful to more people.

Closing

Again, despite my initial hopes, I have written WAY too much, which is a pathological flaw I seem to have. Thanks for hanging in there.

My point in responding to Alex’s post is simply this: any killer app needs to start and end with the User. This is so true it has become a software development truism that everybody knows is important, but few know how to translate into their feature development schedule. Technology alone–like Natural Language Understanding–will never be a killer app. Only when someone figures out how to make it electric for users–exciting and immediate and so obviously valuable–can any innovation become a killer app.

With all due respect to the folks who love this term, the Semantic Web is one of those bundling concepts that is about as useful as the term “Electric Appliance.” It is useful in describing a category of product, but completely useless in helping retailers make decisions about what products people are going to buy this season. Until companies move beyond that catch all descriptor into product discussions that connect with what users already understand and want… none of the “semantic” offerings can possibly breakthrough to being a true killer app.