December 27, 2012

Is a Picture worth a 1000 words?

I recently watched Peter Jacksons's Hobbit, which has been shot in 48 fps. That means that the 170 minute long movie uses nearly 489,000 frames. And this is only the first part of a trilogy. At this rate, the full series will easily use up more than 1 million frames. 

Now, the entire book is only 96,000 odd words, i.e., the word count in the entire book is less than the number of pictures/frames used in the 1st movie. And many would still prefer to read the book over watching the movie. 

Clearly, all pictures are not worth 1000 words, especially if the words are written by JRRT. 

April 17, 2011

Interfaith Message Processor

I am reading "Where Wizards Stay Up Late", a book that tracks the Origins of the Internet from ARPAnet. A key step in the development of the Internet was the IMP (Interface Message Processors) Project floated by ARPA. The company that won the contract to develop the processors was Massachusetts based BBN. 

That brings to one of my favorite bits from the book:

When news reached Massachusetts senator Edward Kennedy's office just before Christmas that a million-dollar ARPA contract had been awarded to some local boys, Kennedy sent a telegram thanking BBN for its ecumenical efforts and congratulating the compan on its contract to build the "Interfaith Message Processor"

:)

October 26, 2010

Mertado Launches Embedded Shopping

Other retailers pride themselves on being a one-stop shop. However, at Mertado, we are pretty excited about our launch of no-stop shop: with our Embedded Shops you can discover cool new products every day without interrupting what you are doing on the web.

Here's what the press is saying:

New York Times says that we have two advantages over other solutions: "...For one thing, it’s integrated with Facebook (Mertado has both a Facebook application and a standalone website that uses Facebook Connect), so it can recommend products based on your profile, and you can recommend deals to your Facebook friends.And the company isn’t aggregating deals found on other sites — Mertado is actually the seller, so it can make sure there’s a good mix of decent products, and that customer complaints get resolved."

Techcrunch talked about how Mertado is enabling Developers Bake a Retail Store into Games.

 

Check out a demo of how this works:

Mertado Embedded Shopping Demo from Mertado on Vimeo.

October 23, 2010


October 05, 2010

Here's a piece I did on using Predictive Analytics on web data that was published in Inc. Magazine sometime back:

January 18, 2010

Foursquare Serendipity

I was hanging out in the Mission yesterday with a friend and checked-in to Dog Eared Books on Foursquare. I was intrigued to see an ad for a "special nearby": Foursquare told me that if I check-in nearby at Her Majesty's Secret Beekeeper, I would be treated to a free honey stick. We decided to check out the place, and sure enough got to sample some great honey, and have a great chat with the owner Cameo Wood. Cameo is an early adopter of a lot of technology, and had been a beta-tester for Foursquare before she jumped onto their advertising program.

Now, I am not the kind of person who would ever have walked into a honey & beekeeping supplies store. But I really enjoyed this serendipitous experience and am now happy to promote this shop to other friends who might be interested. 

Of course, Foursquare's primary value is as a social tool to meet people. But as a side-effect, it also connects people with places that they wouldn't have discovered otherwise. And that makes it a very powerful advertising medium for any local business. I had read about Foursquare for Business before, but it was great experiencing it first hand. 


October 08, 2009

It's Y!ou. Or is it?

Timesindiayahoo   F


What do you feel about Yahoo!'s new ad push? The ads made a big splash in India a few days ago, and now it turns out that New York is also plastered with them. 

I feel that the branding of internet products is somewhat like branding in the service industry: the brand is all about the customer experience and not about the tagline (here's a good article about how the brand is not about the slogan). For instance, it doesn't matter to me if a bank comes up with the coolest slogan, if that slogan has nothing to do with my experience banking there, or if I have a poor experience. In some sense, web companies are also service providers, and if they fulfill a great service promise, they win with users, and they lose if they don't.

In that light, I don't understand how Yahoo! is about me. It seems that they want to latch on to a hot new theme, without doing anything to fulfill the promise of that experience. What do you feel?

June 22, 2009

Cross posts from Inc

I've started writing a Byline with inc.com, around the theme of analyzing technology trends for small businesses & entrepreneurs. Here are the first two articles that have been published:

  1. Lessons from Web 2.0, Fast Track Innovation Process:  This article starts with premise that Web 2.0 companies have some natural advantages that help them innovate fast, and then examines how other businesses can leverage some of these ideas.
  2. The Long Tail & The Black Swans: I examine how the Black Swan, or the unexpected hit, can influence the analysis of whether & how you should use a long-tail business model.

Would love to get your feedback.

March 08, 2009

Twitter's billion $ search opportunity - Architect it right

When people say "Google Beater" these days, there's a reasonable chance that they're referring to "real time search", or to put it more simply, Twitter search. Google's Eric Schmidt engaged in a war of words with Twitter recently, and Techcrunch has declared that it's time to start thinking of Twitter as a search engine.

Twitter has distinguished itself from other communication channels in a few ways which have led to its importance as a source of search-able data: 

  1. Messages are broadcast, and not shared in a close group: Twitter updates are public by default and appear in a "public timeline". Twitter's community has evolved in a way that most users want their updates to be public.
  2. One-way following is possible: Unlike other social networks that only allow a two-way "friendship" mechanism,  Twitter allows any user to "follow" any other user's public timeline. This has helped make twitter a broadcast mechanism - many journalists and other trusted figures have tens of thousands of "followers" on Twitter. 
  3. Anyone can "reply" to any user's post: Twitter doesn't have any restrictions on who you can reply to (unless, of course, you indulge in spamming, which's detected by Twitter and you get blacklisted).
  4. Messages are broadcast in real time: Twitter updates show up in your followers' feed instantly
  5. Useful messages get relayed many times over: Twitter's community has created a mechanism of re-tweeting a message, which helps relay important messages to large groups of people.
  6. Twitter users tag their posts for searchability: The use of "hashtags" on Twitter is a means by which the community can help twitter searchers find specific information around one theme - e.g., tweets about the Mumbai Attacks were tagged #mumbai
Due to these features Twitter has become the primary destination to broadcast real time information, be it about a big global event like the Mumbai blasts with an audience of millions, or a talk at a conference which's being discussed over tweets by a few tens or hundreds of people. Journalists, bloggers & marketing professionals are falling over each other to engage in conversations and help shape opinion over Twitter.

A few searches on Twitter help illustrate this:
On some of the most news-y and hotly discussed topics, Twitter search results update faster than you can keep pace with. A Twitter search is an invaluable resource for anyone engaged in understanding news and opinions.

However, since Twitter wasn't built for search (the search app was built outside Twitter by Summize, which Twitter acquired), searching on Twitter still has some holes. Doing a text match on all tweets and sorting them by time is useful, but users are surely looking for some notion of having "better" tweets bubble to the top of search results, rather than just looking at the most recent tweets. Search, in general, is about relevance and importance, and Twitter isn't architect-ed right for either:
  • Importance of the tweet: The ranking of search results should be influenced by some measure of "importance" of the post (in some sense, this is the page rank equivalent of Google). In addition to the recency of a status update, a few measures of importance on twitter are
    • the number of retweets,
    • the number of replies received,
    • the number of "favorites" received by the message
    • and the follower count of the person posting.
          Twitter's architecture today doesn't allow its search to leverage some of these measures of importance other than the date/time, number of favorites and the follower count:
    1. Twitter's retweet mechanism doesn't reference the status id of the message you're re-tweeting. For example, when you look at my retweet - http://twitter.com/vijaycs42/status/1275845600 - you won't be able to find the status-id of the message I am retweeting. This means that Twitter can't easily  identify the messages that are getting re-tweeted the most.
    2. @replies on Twitter are not associated with the tweet being replied to: which means, Twitter doesn't have a simple way of figuring out the posts (not the people) that received the most replies.
  • Relevance to the query: The fundamental problem is that each tweet has only 140 characters, and that's not much data to work with. And these 140 characters have no structure - unlike web pages that have structured fields like "title" and "anchor text". Hence, searching on Twitter is sort of like the early days of image search and video search, when there wasn't much text associated with the content (today, of course, the abundance of tags and user comments helps find the best images and videos easily). Try searching for the keyword "twitter" on twitter search- while all the posts mention the word, how many of those posts are really about Twitter? (e.g. I saw a post that said "I swear if anyone spoils Watchmen for me on Twitter, I am gonna go postal"). Luckily for Twitter, they do have a lot of additional text and some structure
    1. The extra text comes from replies and retweets,
    2. and the little bit of structure comes from the #tags
But, as I mentioned above, @replies and retweets are not associated well with tweets, in Twitter's current architecture. Fixing that will not only give Twitter a better importance signal, but also help improve text relevance by making a lot more text available for search. Twitter can gather even more data around tweets that contain a URL by indexing all or some of the contents of the url.

In some sense, both the above themes are about how it's much easier (and much more valuable) to accurately search conversations (a bunch of related tweets) than to search individual tweets. Facebook has probably got a lead over Twitter in this aspect, because they naturall group conversations together.

Another thing that bothers me about Twitter search is that it relies heavily on hash tags.  For example, try and search for the TV series Lost. If you didn't know that lost has a hashtag of #lost, you're very likely to just get lost in your search. And today hash-tags are very arbitrary. For example, the hash-tag for the Mumbai attacks was #mumbai. Now there's no way to make an association between the words #mumbai and "mumbai attacks" - so you have to know to search for both if you want to retrieve all the results about the incident. Over time there will be situations when different people are using different hash tags to refer to the same incident. Twitter should think about a better tagging mechanism that scales nicely . Perhaps use the Flickr or Del.icio.us solution of allowing users to add arbitrary tags to posts; of course, that solution also has a downside - it distracts from the main functionality of Twitter, which's to allow users to tweet with minimal friction.

Twitter is already one of the most useful services out there in terms of the social function it serves. Now, it also has the potential to become one of the most useful search services. If it manages to fulfill that potential, it will certainly be worth a billion dollars, or maybe several. But getting there would need them to re-architect Twitter to make the search functionality more powerful.

December 08, 2008

Kosmix Beta is live

I haven't posted anything on this blog for a bit, and for a good reason: along with a lot of folks from Kosmix, I've been busy building out our universal search product. It just went live today, along with an announcement of a new round of funding for kosmix. You can read more about it on the Kosmix blog

Do give it a spin. And I'd love to hear your feedback.