<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Saurabh Sahni &#187; search</title>
	<atom:link href="http://www.saurabhsahni.com/tag/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.saurabhsahni.com</link>
	<description>Mashing up the web</description>
	<lastBuildDate>Fri, 01 Jul 2011 16:35:26 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.1.3</generator>
		<item>
		<title>Natural Language Image Search with Yahoo Boss and Google App Engine</title>
		<link>http://www.saurabhsahni.com/2008/08/natural-language-image-search-with-boss-and-app-engine/</link>
		<comments>http://www.saurabhsahni.com/2008/08/natural-language-image-search-with-boss-and-app-engine/#comments</comments>
		<pubDate>Tue, 26 Aug 2008 18:50:08 +0000</pubDate>
		<dc:creator>saurabhsahni</dc:creator>
				<category><![CDATA[Hacks]]></category>
		<category><![CDATA[Yahoo!]]></category>
		<category><![CDATA[appengine]]></category>
		<category><![CDATA[mashup]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[yahoo boss]]></category>

		<guid isPermaLink="false">http://www.saurabhsahni.com/?p=25</guid>
		<description><![CDATA[Natural language processing is partly used in text search today, but its use in image search is mostly unexplored. I did a quick hack: askBoss, which retrieves images in response to questions posed in natural language. askBoss attempts to enhance image results for queries around factual question answering. It uses the Yahoo Boss (Search) APIs through the Boss Mashup [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Natural_language_processing">Natural language processing</a> is partly <a href="http://www.ask.com">used</a> in text search today, but its use in image search is mostly unexplored. I did a quick hack: <a href="http://ask-boss.appspot.com">askBoss</a>, which retrieves images in response to questions posed in natural language. askBoss attempts to enhance image results for queries around factual <a href="http://en.wikipedia.org/wiki/Question_answering">question answering</a>. It uses the <a href="http://developer.yahoo.com/search/boss/">Yahoo Boss (Search) APIs</a> through the <a href="http://developer.yahoo.com/search/boss/mashup.html">Boss Mashup Framework</a> and is deployed on <a href="http://appengine.google.com/">Google App Engine</a>.</p>
<p>This hack is an extension of Vik Singh&#8217;s <a href="http://bossy.appspot.com/">qna service</a>, which <a href="http://zooie.wordpress.com/2008/08/04/yahoo-boss-google-app-engine-integrated/">finds answers</a> using the popular phrases in the top search results for a query. I run an image search for the best answers and blend those images with the regular image search results. The hack is a <strong>basic prototype</strong>, and natural language image search is triggered only for questions (queries including who/what/which).</p>
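<p>The trigger and blending steps can be sketched roughly as below. This is only an illustration and not the actual askBoss code: the function names <code>is_question</code> and <code>blend</code> are hypothetical, the image lists are assumed to be already-ranked lists of URLs, and simple interleaving is just one assumed way of blending the two result sets. No Boss API calls are shown.</p>

```python
import re
from itertools import zip_longest

# Question words that trigger natural language image search (from the post).
QUESTION_WORDS = {"who", "what", "which"}


def is_question(query):
    """Return True when the query contains who/what/which as a whole word."""
    words = re.findall(r"[a-z]+", query.lower())
    return any(word in QUESTION_WORDS for word in words)


def blend(answer_images, regular_images, limit=10):
    """Interleave two ranked URL lists, answer images first, dropping duplicates.

    Interleaving is an assumption made for this sketch; the post only says
    that the two result sets are blended.
    """
    blended = []
    seen = set()
    for pair in zip_longest(answer_images, regular_images):
        for url in pair:
            if url is not None and url not in seen:
                seen.add(url)
                blended.append(url)
    return blended[:limit]
```

<p>With this sketch, a query such as "who is batman in the dark knight" would take the blended path, while a plain keyword query would fall back to the regular image results only.</p>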
<p><strong>Try askBoss: <a href="http://ask-boss.appspot.com">http://ask-boss.appspot.com</a></strong></p>
<p>Below is a quick comparison of search results obtained by askBoss, Google image search and Y! image search for the query <strong>who is batman in the dark knight?</strong></p>
<p><a href="http://ask-boss.appspot.com/qa?query=who+is+batman+in+the+dark+knight">askBoss results</a>: who is batman in the dark knight?</p>
<p><a href="http://ask-boss.appspot.com/qa?query=who+is+batman+in+the+dark+knight"><img class="alignnone size-medium wp-image-34" title="askboss: who is batman in the dark knight" src="http://www.saurabhsahni.com/wp-content/uploads/2008/08/askboss-darkknight.png" alt="askboss: who is batman in the dark knight" width="700" /></a></p>
<p><a href="http://images.google.com/images?q=who+is+batman+in+the+dark+knight">Google image search results</a>: who is batman in the dark knight?</p>
<p><a href="http://images.google.com/images?q=who+is+batman+in+the+dark+knight"><img class="alignnone size-medium wp-image-35" title="google: who is batman in the dark knight" src="http://www.saurabhsahni.com/wp-content/uploads/2008/08/google-darkknight.png" alt="google: who is batman in the dark knight" width="700" /></a></p>
<p><a href="http://images.search.yahoo.com/search/images?p=who+is+batman+in+the+dark+knight">Yahoo Image search results</a>: who is batman in the dark knight?</p>
<p><a href="http://images.search.yahoo.com/search/images?p=who+is+batman+in+the+dark+knight"><img class="alignnone size-medium wp-image-36" title="yahoo: who is batman in the dark knight" src="http://www.saurabhsahni.com/wp-content/uploads/2008/08/yahoo-darkknight.png" alt="yahoo: who is batman in the dark knight" width="700" /></a></p>
<p>With the <a href="http://developer.yahoo.com/search/boss/">Yahoo Boss APIs</a> and a deployment platform like <a href="http://appengine.google.com/">Google App Engine</a>, building a decent search service is pretty easy. I was able to finish this hack within a few hours by using the Boss Mashup Framework and App Engine. Apart from the <a href="http://bossy.appspot.com">qna service</a>, other <a href="http://www.ysearchblog.com/archives/000613.html">popular</a> Boss API/App Engine <a href="http://www.techcrunch.com/2008/08/07/some-mashups-harnessing-the-power-of-yahoos-boss/">integrations</a> include <a href="http://www.4hoursearch.com/">4hoursearch</a>, aka YUIL.</p>
<p><strong>Update: askBoss was covered by <a href="http://www.techcrunch.com/2008/09/03/yahoo-boss-used-to-create-powerset-for-images-and-more/">TechCrunch</a> and the <a href="http://www.ysearchblog.com/archives/000621.html">Yahoo Search Blog</a>.</strong></p>
]]></content:encoded>
			<wfw:commentRss>http://www.saurabhsahni.com/2008/08/natural-language-image-search-with-boss-and-app-engine/feed/</wfw:commentRss>
		<slash:comments>302</slash:comments>
		</item>
		<item>
		<title>The new search era, where are we?</title>
		<link>http://www.saurabhsahni.com/2008/06/the-new-search-era-where-are-we/</link>
		<comments>http://www.saurabhsahni.com/2008/06/the-new-search-era-where-are-we/#comments</comments>
		<pubDate>Tue, 24 Jun 2008 17:43:57 +0000</pubDate>
		<dc:creator>saurabhsahni</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Trends]]></category>
		<category><![CDATA[Web 2.0]]></category>
		<category><![CDATA[Yahoo!]]></category>
		<category><![CDATA[search]]></category>

		<guid isPermaLink="false">http://www.saurabhsahni.com/?p=10</guid>
		<description><![CDATA[I have been damn lazy about writing this post, but recently, after reading a RWW post, 11 Search Trends That May Disrupt Google, I decided to gather my thoughts here. Adding to the RWW post, let me try to bring up some pluses and minuses of today's search era, ruled by Google &#38; partly Yahoo/MS. [...]]]></description>
			<content:encoded><![CDATA[<p>I have been damn lazy about writing this post, but recently, after reading a <a href="http://www.readwriteweb.com/">RWW</a> post, <a href="http://www.readwriteweb.com/archives/11_search_trends.php">11 Search Trends That May Disrupt Google</a>, I decided to gather my thoughts here.</p>
<p>Adding to the <a href="http://www.readwriteweb.com/archives/11_search_trends.php">RWW post</a>, let me try to bring up some pluses and minuses of today&#8217;s search era, ruled by Google &amp; partly Yahoo/MS.</p>
<h3>Things the popular search engines still do not handle well:</h3>
<ul>
<li><strong><a href="http://en.wikipedia.org/wiki/Natural_language_processing">Natural language Processing</a></strong>:
<ul>
<li>We would like questions like &#8220;<a href="http://www.google.co.in/search?q=Which+is+the+world's+tallest+mountain+peak%3F">Which is the world&#8217;s tallest mountain peak?</a>&#8221; to be answered directly in search. <a href="http://www.google.co.in/search?q=Which+is+the+world's+tallest+mountain+peak%3F">Google</a>/<a href="http://search.yahoo.com/search?p=Which+is+the+world%27s+tallest+mountain+peak%3F">Yahoo</a> could not answer it, but to my surprise <a href="http://www.ask.com/web?q=Which+is+the+world%27s+tallest+mountain+peak%3F">ask.com did</a>! Still, we have to wait for a breakthrough.</li>
<li>Ignoring stop words, doing word stemming, etc. can change the meaning of a query significantly. For example, searching for <a href="http://www.google.com/search?q=apples">Apples</a> on Google returns results mainly for Apple Inc.</li>
</ul>
</li>
<li><strong>Multi-lingual search</strong>: With the web driven by a focus on the US market, the problems of the rest of the world (especially the eastern world) do not really get sufficient attention. Today&#8217;s web search experience does not have multi-lingual features!<br />
I spent a couple of years during my masters at <a href="http://www.dil.iitb.ac.in">Media Lab Asia</a>, <a href="http://www.iitb.ac.in">IIT Bombay</a>, under <a href="http://www.cse.iitb.ac.in/~krithi">Prof. Krithi</a>, with people working on multi-lingual search for the project <a href="http://www.aaqua.org">aaqua.org</a>. Multi-lingual search works pretty well there. Try searching for &#8220;<a href="http://aaqua.persistent.co.in/aaqua/forum/searchprocess?key=onion">onion</a>&#8221; or &#8220;<a href="http://aaqua.persistent.co.in/aaqua/forum/newsearch">कांदा</a>&#8221;: you get identical results <img src='http://www.saurabhsahni.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> . Such a search experience on the whole web would be awesome!</li>
<li><strong>Treatment of semantic data</strong>: A lot of standard formats have emerged, such as RDF, microformats and RSS, but they are still treated in almost the same way as other web pages.</li>
<li><strong>Personalization &amp; Data mining</strong>: There are a few signs of Google personalizing the results. But nothing significant yet!</li>
<li><strong>Multimedia search</strong>: None of the search engines is doing a great job here, which is attributed to complex and computationally expensive image processing. But pretty significant research on this is under way at Google, Yahoo and Microsoft. A recent publication at WWW08 from Googlers suggested a concept, ImageRank, similar to PageRank, which can actually work well.</li>
</ul>
<h3><strong>Some of the cool innovations</strong> in today&#8217;s search:</h3>
<ul>
<li> <strong>Improved UI/visualizations</strong>: UI innovations are the most prominent of all. Here are a few examples:
<ul>
<li><strong>Yahoo Search: &#8220;<a href="http://search.yahoo.com/search?p=get+smart">Get Smart</a>&#8221;</strong>
<p style="text-align: center;"><a href="http://search.yahoo.com/search?p=get+smart"><img class="aligncenter" title="getsmart-yahoo" src="http://www.saurabhsahni.com/wp-content/uploads/2008/06/getsmart-yahoo.jpg" alt="" /></a></p>
</li>
<li><strong>Google: &#8220;<a href="http://www.google.co.in/search?q=get+smart%2C+Bangalore">Get Smart, Bangalore</a>&#8221;</strong>
<p style="text-align: center;"><a href="http://www.google.co.in/search?q=get+smart%2C+Bangalore"><img class="aligncenter" title="getsmart-blr-google" src="http://www.saurabhsahni.com/wp-content/uploads/2008/06/getsmart-blr-google.jpg" alt="" /></a></p>
</li>
<li><strong><a href="http://www.guruji.com/movietiming?hl=en&amp;q=get+smart%2C+Bangalore">Get Smart, Bangalore</a> on <a href="http://www.guruji.com/">Guruji.com</a>, an India-focused search engine</strong>
<p style="text-align: center;"><a href="http://www.guruji.com/movietiming?hl=en&amp;q=get+smart%2C+Bangalore"><img class="aligncenter" title="getsmart-blr-guruji" src="http://www.saurabhsahni.com/wp-content/uploads/2008/06/getsmart-blr-guruji.jpg" alt="" /></a></p>
</li>
</ul>
<ul>
<li><strong><a href="http://search.yahoo.com/search?p=saurabh+sahni">Searching a name</a> on Yahoo with LinkedIn enhancement (<a href="http://gallery.search.yahoo.com/">SearchMonkey</a>)</strong>
<p style="text-align: center;"><a href="http://search.yahoo.com/search?p=saurabh+sahni"><img class="aligncenter" title="ss-linkedin-searchmonkey" src="http://www.saurabhsahni.com/wp-content/uploads/2008/06/ss-linkedin-searchmonkey.jpg" alt="" /></a></p>
</li>
</ul>
<ul>
<li><strong><a href="http://in.search.yahoo.com/search?p=MS+dhoni">MS Dhoni</a> on Yahoo India</strong>
<p style="text-align: center;"><a href="http://in.search.yahoo.com/search?p=MS+dhoni"><img class="aligncenter" title="msdhoni-glue" src="http://www.saurabhsahni.com/wp-content/uploads/2008/06/msdhoni-glue.jpg" alt="" /></a></p>
</li>
</ul>
</li>
<li> <strong>Openness/APIs</strong>: <a href="http://code.google.com/apis/ajaxsearch/">Google</a>/<a href="http://developer.yahoo.com/search/">Yahoo</a> have been pretty open in terms of providing search APIs, applications, etc. Want to experience Google search in a terminal? Try out <a href="http://www.goosh.org">www.goosh.org</a></li>
<li><strong>Specialized searches</strong>: Local/maps and other focused searches like publication search, patent search, etc. are doing pretty well. Directions are now available in India too, with Yahoo Maps the only provider <img src='http://www.saurabhsahni.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </li>
</ul>
<h3>What else can be tried on search?</h3>
<p>There are a couple of things which I think can work for search, but we need to overcome spam problems for them:</p>
<ul>
<li><strong>WikiSearch</strong>: Allow users to tag/rank search results. Something like digg/delicious for keywords&#8230;</li>
<li><strong>Push-based update notification model</strong>: Search results are not up to date. Even for popular pages they lag by a few days. Could a push-based model, something like blog.gs, help?</li>
</ul>
<p><strong><span style="color: red;">Update: Nov 20: Google has released <a href="http://googleblog.blogspot.com/2008/11/searchwiki-make-search-your-own.html">SearchWiki</a>, my first suggestion/prediction comes true <img src='http://www.saurabhsahni.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </span></strong></p>
<p><em>Disclaimer: All opinions are solely mine and do not necessarily reflect the opinions of my employer.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.saurabhsahni.com/2008/06/the-new-search-era-where-are-we/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

