<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Karmona Pragmatic Blog &#187; Search</title>
	<atom:link href="http://blog.karmona.com/index.php/category/search/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.karmona.com</link>
	<description>Pragmatic Software Management, Internet Trends, Life and more...</description>
	<lastBuildDate>Wed, 01 Feb 2012 20:26:45 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Disable Google SearchWiki</title>
		<link>http://blog.karmona.com/index.php/2009/06/14/disable-google-searchwiki/</link>
		<comments>http://blog.karmona.com/index.php/2009/06/14/disable-google-searchwiki/#comments</comments>
		<pubDate>Sun, 14 Jun 2009 17:04:19 +0000</pubDate>
		<dc:creator>Moti Karmona &#124; מוטי קרמונה</dc:creator>
				<category><![CDATA[Conspiracy]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://blog.karmona.com/?p=436</guid>
		<description><![CDATA[Fashionably late*, Google Search&#8217;s global preferences page now includes the option to disable the SearchWiki &#8220;horror&#8221;&#8230; Simply click on the checkbox next to SearchWiki and you will &#8220;Hide the ability to share, promote, remove, comment, or add your own results&#8221; All good now :) * Friendly reminder: Marissa Mayer promised that Google Search Wiki would soon [...]]]></description>
			<content:encoded><![CDATA[<p class="MsoNormal"><span><span><a href="http://blog.karmona.com/wp-content/uploads/2009/06/070514_banksy10_p646.jpg"><img class="alignleft size-medium wp-image-437" title="The Elephant in the Room | Banksy" src="http://blog.karmona.com/wp-content/uploads/2009/06/070514_banksy10_p646-300x187.jpg" alt="The Elephant in the Room | Banksy" width="180" height="112" align="left" /></a>Fashionably late*, Google Search&#8217;s</span></span><span><span> g</span></span><span><span><a href="http://www.google.com/preferences"><span>lobal preferences page</span></a></span></span><span><span> now </span></span><span><span>includes the option to disable</span></span><span><span> the </span></span><span><a title="Random Thoughts on Google SearchWiki" href="http://blog.karmona.com/index.php/2008/11/21/random-thoughts-on-google-searchwiki/">SearchWiki</a> &#8220;horror&#8221;&#8230;</span></p>
<p class="MsoNormal"><span><span>Simply click on the checkbox next to SearchWiki and you will </span><em>&#8220;Hide the ability to share, promote, remove, comment, or add your own results&#8221; </em></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span>All good now :)</span></span></p>
<p class="MsoNormal"><span><span><br />
</span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span>* Friendly reminder: </span></span><span><span><a title="Marissa Mayer" href="http://en.wikipedia.org/wiki/Marissa_Mayer"><span>Marissa Mayer</span></a> <a title="Marissa Mayer At Le Web: The (Almost) Complete Interview" href="http://www.techcrunch.com/2008/12/10/marissa-mayer-at-le-web-the-almost-complete-interview/"><span>promised</span></a> that Google Search Wiki would <a title="Google Search Wiki To Soon Include An Off Button. Thank You, Marissa." href="http://www.techcrunch.com/2008/12/10/google-search-wiki-to-soon-include-an-off-button-thank-you-marissa/"><span>soon</span></a> have a toggle button that allow people to turn it off (“early Q1.”/2009) </span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
<p class="MsoNormal"><span><span> </span></span></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.karmona.com/index.php/2009/06/14/disable-google-searchwiki/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Social Search Model</title>
		<link>http://blog.karmona.com/index.php/2009/01/09/social-search-model/</link>
		<comments>http://blog.karmona.com/index.php/2009/01/09/social-search-model/#comments</comments>
		<pubDate>Fri, 09 Jan 2009 21:57:46 +0000</pubDate>
		<dc:creator>Moti Karmona &#124; מוטי קרמונה</dc:creator>
				<category><![CDATA[Disruptive Technology]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://blog.karmona.com/?p=390</guid>
		<description><![CDATA[Two months ago, Brynn M. Evans and Ed H. Chi have published a very interesting article &#8211; Towards a Model of Understanding Social Search. They have ran a small survey using Amazon Mechanical Turk (which is a pretty cool concept for itself) asking ~150 users to describe their search experience. IMHO, their data analysis resulted [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.karmona.com/wp-content/uploads/2009/01/mechanical_turk.jpg"><img class="alignleft size-thumbnail wp-image-403" title="Mechanical Turk" src="http://blog.karmona.com/wp-content/uploads/2009/01/mechanical_turk-150x150.jpg" alt="Mechanical Turk" width="150" height="150" align="left" /></a>Two months ago, <a title="Brynn Marie Evans" href="http://brynnevans.com/">Brynn M. Evans</a> and <a title="Ed H. Chi" href="http://www-users.cs.umn.edu/~echi/">Ed H. Chi</a> have <a title="CSCW2008 Paper on &quot;Towards a Model of Understanding Social Search&quot;" href="http://asc-parc.blogspot.com/2008/10/cscw2008-paper-on-towards-model-of.html">published</a> a very interesting article &#8211; <a title="Towards a Model of Understanding Social Search" href="http://brynnevans.com/papers/social-search-cscw08-preprint.pdf">Towards a Model of Understanding Social Search</a>.</p>
<p>They have ran a small <a title="Using Mechanical Turk for research" href="http://brynnevans.com/blog/2008/07/09/using-mechanical-turk-for-research/">survey</a> using <a title="Amazon Mechanical Turk | Wikipedia" href="http://en.wikipedia.org/wiki/Amazon_Mechanical_Turk">Amazon Mechanical Turk</a> (which is a pretty <a title="Mechanical Turk: The Demographics" href="http://behind-the-enemy-lines.blogspot.com/2008/03/mechanical-turk-demographics.html">cool concept</a> for itself) asking ~150 users to describe their search experience.</p>
<p>IMHO, <a title="User Needs during Social Search" href="http://asc-parc.blogspot.com/2008/10/user-needs-during-social-search.html">their data analysis</a> resulted in a very intriguing social search model.</p>
<p><span style="color: #888888;"><cite title="Brynn M. Evans, Ed H. Chi. - Towards a Model of Understanding Social Search | 2008">&#8220;As we outlined through the model, social inputs may help users throughout the search process. Before searching, social interactions may help establish the requirements for the actual search task. During search, especially for self-motivated informational searches, users may talk to others for advice, feedback, and brainstorming to improve their search schema and query keyword selections. After search, users may still wish to engage with others to collect additional feedback or to share knowledge gained during the search.&#8221;</cite></span><br />
(<a title="Brynn Marie Evans" href="http://brynnevans.com/">Brynn M. Evans</a> and <a title="Ed H. Chi" href="http://www-users.cs.umn.edu/~echi/">Ed H. Chi</a> &#8211; <a title="Towards a Model of Understanding Social Search" href="http://brynnevans.com/papers/social-search-cscw08-preprint.pdf">Towards a Model of Understanding Social Search</a> | 2008)</p>
<p style="text-align: center;"><a href="http://blog.karmona.com/wp-content/uploads/2009/01/social_search_model_big.jpg"><img class="aligncenter size-full wp-image-398" title="Social Search Model" src="http://blog.karmona.com/wp-content/uploads/2009/01/social_search_model.jpg" alt="Social Search Model" width="500" height="794" /></a></p>
<p>P.S. It seems like <a title="Delver - Search your World" href="http://www.delver.com">we</a> are on the right direction&#8230; :)</p>
<p>__________________</p>
<p><strong>Quick </strong>(a.k.a. too late) <strong>Y.A.S.B.P.</strong> (a.k.a. <strong>Y</strong>et <strong>A</strong>nother <strong>S</strong>hameless <strong>B</strong>log <strong>P</strong>lagiarism) <strong>update: </strong>I had a strange (yet familiar) feeling that I have read about this research somewhere before but I didn&#8217;t remember where so after I submitted this post, I have found that <a title="The Alter Egozi" href="http://alteregozi.com/">Ofer</a> had picked it up way before me and as always, wrote a much better <a title="Social Search, or Search Socially" href="http://alteregozi.com/2008/11/16/social-search-or-search-socially/">post</a> about it&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.karmona.com/index.php/2009/01/09/social-search-model/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>The Social Graph Challenge</title>
		<link>http://blog.karmona.com/index.php/2008/12/30/the-social-graph-challenge/</link>
		<comments>http://blog.karmona.com/index.php/2008/12/30/the-social-graph-challenge/#comments</comments>
		<pubDate>Tue, 30 Dec 2008 20:28:49 +0000</pubDate>
		<dc:creator>Moti Karmona &#124; מוטי קרמונה</dc:creator>
				<category><![CDATA[Delver]]></category>
		<category><![CDATA[Disruptive Technology]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Social Network]]></category>

		<guid isPermaLink="false">http://blog.karmona.com/?p=336</guid>
		<description><![CDATA[I was analyzing, dreaming, monitoring, crawling, debugging, reading, breathing, cursing, scaling, visualizing and learning the social graph for the last couple of months and I thought it might be a good idea to write a little something about The Social Graph Challenge with a pragmatic twist on few other common concepts.   &#8212;&#8212;&#8212; Blitz Introduction to The Social Graph &#8212;&#8212;&#8212; The [...]]]></description>
			<content:encoded><![CDATA[<p><img class="size-thumbnail wp-image-342" title="The Story Behind The Delver Kid Image" src="http://blog.karmona.com/wp-content/uploads/2008/12/more-kids-150x150.jpg" alt="The Story Behind The Delver Kid Image" width="150" height="150" align="left" />I was analyzing, dreaming, monitoring, crawling, debugging, reading, breathing, cursing, scaling, visualizing and learning the social graph for the last couple of months and I thought it might be a good idea to write a little something about <strong>The </strong><strong>Social Graph Challenge</strong> with a pragmatic twist on few other <a title="Brad's Thoughts on the Social Graph" href="http://www.bradfitz.com/social-graph-problem/">common</a> <a title="Pragmatic Twist on Social Graph Concepts and Issues | ReadWriteWeb" href="http://www.readwriteweb.com/archives/social_graph_concepts_and_issues.php">concepts</a>.</p>
<p> </p>
<p style="text-align: center; "><strong>&#8212;&#8212;&#8212; Blitz Introduction to The Social Graph </strong>&#8212;&#8212;&#8212;</p>
<p>The social graph is just a simplified mathematic <a title="Graph Theory" href="http://en.wikipedia.org/wiki/Graph_theory">abstraction</a> when nodes are people and edges are relations between them.</p>
<p>In the last decade the internet have became more social than was ever expected it to be with the rapid growth and adaptation of social networks, social media and user-generated contributions and interactions. </p>
<p>Nowadays, there is a growing feeling that it is feasible to model and map the social web into a real-life social graph replication.</p>
<p style="text-align: center; "><img class="size-full wp-image-355 aligncenter" style="border: 0px initial initial;" title="Delver Starfish" src="http://blog.karmona.com/wp-content/uploads/2008/12/starfish.jpg" alt="Delver Starfish" width="195" height="130" /></p>
<p style="text-align: center; "><strong>&#8212;&#8212;&#8212; Pragmatic Overview on The Social Graph Challenge &#8212;&#8212;&#8212;</strong></p>
<p style="text-align: center; "><strong><span style="font-weight: normal; "><a title="Modeling the Social Graph" href="#Modeling">Modeling</a> | <a title="Building the Social Graph" href="#Building">Building</a> | <a title="Processing the Social Graph" href="#Processing">Processing</a> | <a title="The Social Graph Size" href="#Size">Size</a> | <a title="Two Cents on Social Graph Architecture" href="#Architecture">Architecture</a></span></strong></p>
<p style="text-align: left;"><strong>(1)<a name="Modeling"></a> Modeling the Social Graph</strong></p>
<p><strong>*** Vocabulary </strong></p>
<p>To better understand how complicated it is to create a vocabulary for expressing metadata about people, their interests, relationships and activities you should simply pay a quick visit to the <a title="The FOAF Project" href="http://www.foaf-project.org/">FOAF Project</a> <a title="FOAF Technical Spec" href="http://xmlns.com/foaf/spec/">technical specification page</a></p>
<p>The FOAF (&#8220;Friend of a Friend&#8221;) <a title="The FOAF Project" href="http://www.foaf-project.org/">Project</a>  has the most comprehensive model available today and it is still lacking some basic modeling granularity e.g. time awareness metadata, no privacy model, <a title="FOAF Relationship Model | Term-Knows" href="http://xmlns.com/foaf/spec/#term_knows ">poor relationship model</a> </p>
<p><strong>*** The Social Cloud</strong></p>
<p>It is common mistake to forget that people are more than just flat internet identities (e.g. <a title="Moti Karmona | Linkedin" href="http://www.linkedin.com/in/karmona">Linked profile</a>) and to complete the profile modeling we must add all their content to the graph e.g. Personal Blog, Flickr images, YouTube Videos, Delicious bookmarks, Tweets, Blog Comments etc.</p>
<p>Modeling all these content and consumption types will yield a broader definition (a.k.a. The Social Cloud) with even more complex modeling challenges.</p>
<p style="text-align: center; "><img class="size-full wp-image-345 aligncenter" style="border: 0px initial initial;" title="More Delver Kids" src="http://blog.karmona.com/wp-content/uploads/2008/12/kids.jpg" alt="More Delver Kids" width="276" height="107" /></p>
<p><strong>(2)<a name="Building"></a> Building the Social Graph</strong></p>
<p><strong>*** The Paradigm Shift</strong></p>
<p>While conventional internet crawlers, follow hyperlinks within web pages and <a title="Lynx, a text-mode web browser" href="http://www.delorie.com/web/lynxview.cgi?url=http://blog.karmona.com">treat pages as plain-text</a>, social crawlers should have social-&#8221;awareness&#8221;:</p>
<ul>
<li>Identify and extract people identities fragments (e.g. social network profiles, blog authors)</li>
<li>Identify relationships (e.g. social networks connections, blog-roll fans)</li>
<li>Identify relations between content and people (author, bookmark, reference etc.)</li>
</ul>
<p><strong>*** The Standards Dilemma – No Silver Bullet</strong></p>
<p>Beside <a title="FOAF on Wikipedia" href="http://en.wikipedia.org/wiki/FOAF_(software)">FOAF</a>, there are several open standard like <a title="RSS | Wikipedia" href="http://en.wikipedia.org/wiki/RSS_(file_format)">RSS</a>, <a title="ATOM | Wikipedia" href="http://en.wikipedia.org/wiki/Atom_(standard)">ATOM</a> for content syndication and <a title="Microformats | Wikipedia" href="http://en.wikipedia.org/wiki/Microformats">microformats</a> like <a title="HCard | Wikipedia" href="http://en.wikipedia.org/wiki/HCard">HCard</a>, <a title="XFN | Wikipedia" href="http://en.wikipedia.org/wiki/XHTML_Friends_Network">XFN</a> for profiles and network discovery,  that seems promising and can help with the identification quest but although this is being pushed by giants (e.g. <a title="Google Social Graph API" href="http://code.google.com/intl/iw/apis/socialgraph/">Google Social Graph API</a>) the adaptation is <a title="List of FOAF Containers | Open Social Directory" href="http://web.archive.org/web/20080205184017/http://www.opensocialdirectory.org/wiki/List_of_FOAF_Containers">still</a> <a title="Foaf Sites" href="http://esw.w3.org/topic/FoafSites">low</a> and have many correctness and corruptions issues - e.g. <a title="Claimed to be WordPress using XFN" href="http://socialgraph-resources.googlecode.com/svn/trunk/samples/findyours.html?q=http://wordpress.com">all these people</a> claimed to be WordPress.com using the XFN (rel=&#8221;me&#8221;) microformat </p>
<p><strong>*** The Promise of Structured Sources (a.k.a. The structure myth)</strong></p>
<p>The <strong>Myth</strong>: Most social Media sites (e.g. <a title="Moti Karmona | Facebook" href="http://www.facebook.com/profile.php?id=673836059">FaceBook</a>, <a title="Moti Karmona | LinkedIn" href="http://www.linkedin.com/in/karmona">LinkedIn</a>, <a title="Moti Karmona | MySpace" href="http://www.myspace.com/moti_karmona">MySpace</a>, <a title="Moti Karmona | Flickr" href="http://flickr.com/people/moti_karmona/">Flickr</a> etc.) have a public available structured profile pages so in principle all need to be done is some XPath magic on HTML DOM to finish the parsing task.</p>
<p><strong>But</strong>… Most of the work isn&#8217;t parsing but data modeling which require deep understanding of each site user model and usage</p>
<ul>
<li>Many Social Media sites have <a title="EULA | Wikipedia" href="http://en.wikipedia.org/wiki/EULA">EULA</a> restrictions which prohibit <span style="text-decoration: underline;">any</span> access or use to the site content but if you are lucky you will get some offical API&#8217;s instead.</li>
<li>Social Media sites have many (~weekly) structural changes in their CSS/HTML.</li>
<li>Social Media sites have many changes (~monthly) in their data privacy policy and have complex privacy model which create inconsistency in profile, network and content presentation.</li>
</ul>
<p><strong>*** Few more Challenges with Social Crawling:</strong></p>
<ul>
<li><strong>Privacy-Ownership-Control </strong>- The <a title="The Data Portability Project" href="http://www.dataportability.org/">data</a> is the property of the <a title="A Bill of Rights for Users of the Social Web" href="http://opensocialweb.org/2007/09/05/bill-of-rights/">users</a></li>
<li><strong>Unstructured Source</strong>s &#8211; It isn&#8217;t a trivial task to extract social entities from unstructured sources (e.g. blogs) and might require offline semantic processing on your collected data.</li>
<li><strong>Cross Network Relations</strong> &#8211; How to find those important hidden cross network relations e.g. between the biggest reliable network graph (e.g. <a title="Moti Karmona | Facebook" href="http://www.facebook.com/profile.php?id=673836059">FaceBook</a>) and the richest content contributions (e.g. <a title="State of the Blogosphere 2008 | Karmona.com" href="http://blog.karmona.com/index.php/2008/09/22/technorati-state-of-the-blogosphere/">Blogosphere</a>, YouTube, <a title="Moti Karmona | Flickr" href="http://flickr.com/people/moti_karmona/">Flickr</a> etc.)</li>
<li><strong>Identify Social Signs</strong> (e.g. Social Widgets, Comments, Blogroll etc.)</li>
<li><strong>Social Graph Update Mechanism</strong> and crawlers distribution</li>
<li>Profiles <a title="Google URL Canonization" href="http://code.google.com/intl/iw/apis/socialgraph/docs/canonical.html ">Canonization</a> </li>
<li>&#8230;</li>
</ul>
<p style="text-align: center; "><img class="size-full wp-image-346 aligncenter" style="border: 0px initial initial;" title="Delver Rodents" src="http://blog.karmona.com/wp-content/uploads/2008/12/rodents.jpg" alt="Delver Rodents" width="222" height="82" /></p>
<p><strong>(3)<a name="Processing"></a> Processing the Social Graph</strong></p>
<p><strong>*** The Identity Crisis</strong></p>
<ul>
<li><strong>Filtering Impersonation</strong> e.g. <a title="Sites claimed to be TechCrunch using XFN" href="http://socialgraph-resources.googlecode.com/svn/trunk/samples/findyours.html?q=techcrunch.com">all these site</a> use XFN (<em>rel=&#8221;me&#8221;</em>) to &#8220;say&#8221; they are <a title="TechCrunch" href="http://www.techcrunch.com">TechCrunch</a></li>
<li><strong>Identify </strong>and have different modeling for <strong>n</strong><strong>on-individual identities</strong> (groups, shared authorship) e.g. <a title="Knitter Blogs" href="http://zimmermaniacs.blogspot.com/">Knitters Blog</a> with 629 knitting contributors :)</li>
<li>Strive to merge identities  (a.k.a. profile fusion) when possible e.g. Moti Karmona in <a title="Moti Karmona | LinkedIn" href="http://www.linkedin.com/in/karmona">LinkedIn</a> and Moti Karmona in <a title="Moti Karmona | Facebook" href="http://www.facebook.com/profile.php?id=673836059">FaceBook </a>could be two instances (/profiles) of the same person and merging this profiles will enable:
<ul>
<li>Cross network connectedness =&gt; Bridging between network richness (e.g. <a title="Moti Karmona | Facebook" href="http://www.facebook.com/profile.php?id=673836059">FaceBook</a>) to content richness (e.g. <a title="State of the Blogosphere 2008 | Karmona.com" href="http://blog.karmona.com/index.php/2008/09/22/technorati-state-of-the-blogosphere/">Blogosphere</a>)</li>
<li>Richer people representation using identities aggregation =&gt; Richer networks</li>
</ul>
</li>
<li><strong>The Fusion </strong><strong>Challenge</strong>: You can pay a short visit to the <a title="Social Aggregators | Mashable" href="http://mashable.com/2007/07/17/social-network-aggregators/">nearest social aggregator directory</a> but you can&#8217;t get away from some more complex algorithms for <a title="Disambiguating Web Appearances of People in a Social Network | Ron Bekkerman" href="http://www.www2005.org/cdrom/docs/p463.pdf">disambiguating web appearances of people</a> with more common names like <a title="Common name like James Smith" href="http://blog.karmona.com/index.php/2008/07/07/mary-and-james-smith/">James Smith</a> who doesn’t &#8220;play&#8221; in the social aggregation playground (like 98.7% of the graph).</li>
</ul>
<p><strong>*** </strong><strong>Graph Enrichment </strong></p>
<ul>
<li><strong>I</strong><strong>mplicit Relations</strong> - Enrich the network with “implicit” relationships (Colleagues, Graduates, Neighbors) e.g. I have a LinkedIn profile and all my connections are hidden for public crawlers but the fact I work in <a title="Delver - Search Your World" href="http://www.delver.com">Delver</a>  is public so if <a title="Delver - Search Your World" href="http://www.delver.com">Delver</a> is startup company with less than ~50 people than there is a good chance I know all the other workers in <a title="Delver - Search Your World" href="http://www.delver.com">Delver</a> =&gt; This simple heuristic rule can create an implicit relation between me and other workers of <a title="Delver - Search Your World" href="http://www.delver.com">Delver</a> without me explicitly claim that I know them (as I did in <a title="Moti Karmona | Facebook" href="http://www.facebook.com/profile.php?id=673836059">FaceBook</a>)</li>
<li>Generating the <strong>inverted relations </strong>when needed Followed vs. Follower</li>
<li>Deeper, <strong>s</strong><strong>emantic extraction </strong>of social entities <strong>u</strong><strong>n-structured content</strong></li>
</ul>
<p style="text-align: center; "><img class="size-full wp-image-347 aligncenter" style="border: 0px initial initial;" title="Delver Faces" src="http://blog.karmona.com/wp-content/uploads/2008/12/faces.jpg" alt="Delver Faces" width="328" height="113" /></p>
<p><strong>(4)<a name="Size"></a> The Social Graph Size</strong></p>
<p>Let&#8217;s have some quick (and very dirty) guesstimates:</p>
<p><a title="Internet Statistics" href="http://www.internetworldstats.com/stats.htm">World Population</a> is approx. ~6.7 Billion / <strong>22</strong>% Internet penetration =&gt; <strong>1.5 Billion internet users</strong> </p>
<p>Let&#8217;s say 65% of these users have some kind of presence in Social Media (~20% have more than one) =&gt; <strong>~1 Billion Profiles <span style="font-weight: normal;">x</span><span style="font-weight: normal;"> ~</span>10<span style="font-weight: normal;"> content items per profile</span></strong></p>
<p>+ <strong>1 Billion Profiles Nodes <span style="font-weight: normal;">x ~<strong>100 </strong><a title="Dunbars Friends " href="http://blog.karmona.com/index.php/2008/07/07/dunbars-friends/">network relations per profile</a>  =&gt; ~<strong>110 Billion Graph Edges + ~10 Billion Graph Nodes</strong></span></strong></p>
<p>It is highly depended on graph implementation but with this numbers, you can easily find yourself with <strong>~1-2 Terabytes of graph metadata alone</strong> (<span style="text-decoration: underline;">without </span>contents and profiles<span style="color: #ff0000;">*</span>) </p>
<p style="text-align: center; "><img class="size-full wp-image-348 aligncenter" style="border: 0px initial initial;" title="Delver Diving Suite " src="http://blog.karmona.com/wp-content/uploads/2008/12/diving-suite.jpg" alt="Delver Diving Suite " width="235" height="157" /></p>
<p><strong>(5)<a name="Architecture"></a> Two Cents on Social Graph Architecture</strong></p>
<p>Updating and querying gigantic, dynamic, distributed, directed, cyclic, colored, weighted graph have &#8220;some&#8221; algorithmic, computational complexity &#8211; a little more complex than a blog post could cover…;-)</p>
<p>You can take a quick look at the tiny 15 Giga, 25 million nodes <a title="LinkedIn Architecture" href="http://hurvitz.org/blog/2008/06/linkedin-architecture">graph implementation in LinkedIn</a> to get a glimpse to the technological challenge … </p>
<p><span style="color: #ff0000;">*</span> Note: Indexing content and profiles data (e.g. for Building a <a title="Delver.com - Search Your World" href="http://www.delver.com">Social Search Engine</a>) is an architecture challenge equivalent to any modern search engine with ~10 Billion documents <a title="Teh Size of the Internet" href="http://blog.karmona.com/index.php/2007/09/26/the-size-of-the-internet/">index</a></p>
<p style="text-align: center; "><img class="size-full wp-image-349 aligncenter" style="border: 0px initial initial;" title="The Delver Kid" src="http://blog.karmona.com/wp-content/uploads/2008/12/delver-kid.jpg" alt="The Delver Kid" width="192" height="198" /></p>
<p>This is only the tip of the <a title="Delver - Search Your World" href="http://www.delver.com">iceberg</a> but it is more than enough for one blog post ;)</p>
<p>_________</p>
<p>Credit: <span style="text-decoration: underline;">A</span><span style="text-decoration: underline;">ll </span>the images were taken from <a title="Tamar Hak" href="http://tamarhak.com">Tamar Hak</a>&#8216;s <span style="text-decoration: underline;">amazing </span>artwork &#8211; creating The Delver Kid image.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.karmona.com/index.php/2008/12/30/the-social-graph-challenge/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Top Search Terms &#124; 2008</title>
		<link>http://blog.karmona.com/index.php/2008/12/14/top-search-terms-2008/</link>
		<comments>http://blog.karmona.com/index.php/2008/12/14/top-search-terms-2008/#comments</comments>
		<pubDate>Sun, 14 Dec 2008 21:52:41 +0000</pubDate>
		<dc:creator>Moti Karmona &#124; מוטי קרמונה</dc:creator>
				<category><![CDATA[Google]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://blog.karmona.com/?p=312</guid>
		<description><![CDATA[Google, Yahoo, Ask and Lycos have released* their top search terms for the past year (2008) and I have aggregated it to your convenience in one happy table below. I don&#8217;t have anything smart to say about it but I did manage to pull out five intriguing  insights. My Five Cents: As done last year, it seems like Y!  have removed all [...]]]></description>
			<content:encoded><![CDATA[<p><img class="alignleft size-thumbnail wp-image-317" title="Madonna - Britney" src="http://blog.karmona.com/wp-content/uploads/2008/12/madonna-britney-150x150.jpg" alt="Madonna - Britney" width="150" height="150" align="left" /></p>
<p><span><a title="Google Top Search Terms | 2008" href="http://www.google.com/intl/en/press/zeitgeist2008/">Google</a>, <a title="Yahoo top Search Terms | 2008" href="http://buzz.yahoo.com/yearinreview2008/">Yahoo</a>, <a title="Ask Top Search Terms | 2008" href="http://about.ask.com/en/docs/2008/topqueries.shtml" class="broken_link">Ask </a>and <a title="Lycos Top Search Terms | 2008" href="http://50.lycos.com">Lycos</a> have released* their top search terms for the past year (2008) and I have aggregated it to your convenience in one happy table below.</span></p>
<p><span>I don&#8217;t have anything smart to say about it but I did manage to pull out five intriguing  insights.</span></p>
<p><span><strong>My Five Cents:</strong></span></p>
<ul>
<li>As done last year, it seems like Y!  have removed all the  <span style="color: #ff0000;">navigational </span>queries from their report (I wonder <a title="Internet Conspiracy of the Day" href="http://blog.karmona.com/index.php/2008/12/03/internet-conspiracy-of-the-day/">why</a> ;)</li>
</ul>
<ul>
<li>&#8220;Poker&#8221; is the &#8220;Top Search Term Of The Year&#8221;  for for the 3rd consecutive year on Lycos&#8230; (what is Lycos? :)</li>
</ul>
<ul>
<li>Though she didn&#8217;t make it to the White House, US vice-presidential candidate Sarah Palin captured the zeitgeist of internet users in 2008 while Obama in the 6th place.</li>
</ul>
<ul>
<li>IMHO, Ask.com is just being too honest in their report &#8211; 50% of Ask search terms are <span style="color: #ff0000;">navigational </span>queries and the rest are boring.</li>
</ul>
<ul>
<li>Britney Spears has been the most popular search term at Yahoo for seven of the past eight years! </li>
</ul>
<p><strong>Top Search Terms | 2008</strong></p>
<table style="border: 1px solid #352b8c;" border="1" cellspacing="1" cellpadding="1">
<caption></caption>
<thead>
<tr style="background-color: #fbfcc3;">
<td><strong>#</strong></td>
<td><strong><a title="Google Top Search Terms | 2008" href="http://www.google.com/intl/en/press/zeitgeist2008/">Google</a></strong></td>
<td><strong><a title="Yahoo top Search Terms | 2008" href="http://buzz.yahoo.com/yearinreview2008/">Yahoo</a></strong></td>
<td><strong><a title="Ask Top Search Terms | 2008" href="http://about.ask.com/en/docs/2008/topqueries.shtml" class="broken_link">Ask</a></strong></td>
<td><strong><a title="Lycos Top Search Terms | 2008" href="http://50.lycos.com">Lycos</a></strong></td>
</tr>
</thead>
<tbody>
<tr>
<td>1</td>
<td>sarah palin</td>
<td>Britney Spears </td>
<td>Dictionary</td>
<td>Poker</td>
</tr>
<tr>
<td>2</td>
<td>beijing 2008</td>
<td>WWE</td>
<td><strong><span style="color: #ff0000;">MySpace</span></strong></td>
<td>Paris Hilton</td>
</tr>
<tr>
<td>3</td>
<td><strong><span style="color: #ff0000;">facebook </span></strong>login</td>
<td>Barack Obama</td>
<td><strong><span style="color: #ff0000;">Google</span></strong></td>
<td><strong><span style="color: #ff0000;">YouTube</span></strong></td>
</tr>
<tr>
<td>4</td>
<td><strong><span style="color: #ff0000;">tuenti</span></strong></td>
<td>Miley Cyrus</td>
<td><strong><span style="color: #ff0000;">YouTube</span></strong></td>
<td>Golf</td>
</tr>
<tr>
<td>5</td>
<td>heath ledger</td>
<td>RuneScape</td>
<td><strong><span style="color: #ff0000;">Facebook</span></strong></td>
<td>Sarah Palin</td>
</tr>
<tr>
<td>6</td>
<td>obama</td>
<td>Jessica Alba</td>
<td>Coupons</td>
<td>Britney Spears</td>
</tr>
<tr>
<td>7</td>
<td><strong><span style="color: #ff0000;">nasza klasa</span></strong></td>
<td>Naruto</td>
<td>Cars</td>
<td>Clay Aiken</td>
</tr>
<tr>
<td>8</td>
<td><strong><span style="color: #ff0000;">wer kennt wen</span></strong></td>
<td>Lindsay Lohan</td>
<td><strong><span style="color: #ff0000;">Craigslist</span></strong></td>
<td>Pamela Anderson</td>
</tr>
<tr>
<td>9</td>
<td>euro 2008</td>
<td>Angeline Jolie</td>
<td>Online degrees</td>
<td><strong><span style="color: #ff0000;">Facebook</span></strong></td>
</tr>
<tr>
<td>10</td>
<td>jonas brothers</td>
<td>American Idol</td>
<td>Credit score</td>
<td>Holly Madison</td>
</tr>
</tbody>
</table>
<p> </p>
<p> </p>
<p>Update (18 Dec. 2008): Top 10 search queries that people used on Delicious in 2008 are: news, blogs, reference, wiki, restaurants, hotels, css, web 2.0, artists, music&#8230; I think it is loud-and-clear that the biggest bookmarking site isn&#8217;t fulfilling its search potential (!)</p>
<p> </p>
<p>____________________</p>
<p>* Note:  Microsot (Live) didn&#8217;t released the updated list until now and <a title="AOL Top Search Terms | 2008" href="http://about-search.aol.com/hotsearches2008/index.html">AOL</a> didn&#8217;t break out overall terms so wasn&#8217;t included here.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.karmona.com/index.php/2008/12/14/top-search-terms-2008/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Random Thoughts on Google SearchWiki</title>
		<link>http://blog.karmona.com/index.php/2008/11/21/random-thoughts-on-google-searchwiki/</link>
		<comments>http://blog.karmona.com/index.php/2008/11/21/random-thoughts-on-google-searchwiki/#comments</comments>
		<pubDate>Fri, 21 Nov 2008 13:55:33 +0000</pubDate>
		<dc:creator>Moti Karmona &#124; מוטי קרמונה</dc:creator>
				<category><![CDATA[Conspiracy]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Search]]></category>

		<guid isPermaLink="false">http://blog.karmona.com/?p=262</guid>
		<description><![CDATA[I don&#8217;t know if you have noticed but Google launched Search Wiki yesterday. With Google SearchWiki, signed-in Google users can now customize their search experience by re-ranking, deleting, adding and commenting on search results.  So&#8230; The re-ranking changes you make are private and only affect your own searches.  Your comments are visible to the public  Random [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.karmona.com/wp-content/uploads/2008/11/doh-homer-simpson.jpg"><img class="alignleft size-thumbnail wp-image-263" style="float: left;" title="Doh Homer Simpson" src="http://blog.karmona.com/wp-content/uploads/2008/11/doh-homer-simpson-150x150.jpg" alt="" width="150" height="150" /></a>I don&#8217;t know if you have noticed but <a title="Official Google Blog: SearchWiki: make search your own" href="http://googleblog.blogspot.com/2008/11/searchwiki-make-search-your-own.html">Google launched Search Wiki</a> yesterday.</p>
<p>With Google SearchWiki, signed-in Google users can now customize their search experience by re-ranking, deleting, adding and commenting on search results. </p>
<p>So&#8230;</p>
<ul>
<li>The <span style="text-decoration: underline;">re-ranking changes</span> you make are <span style="text-decoration: underline;">private</span> and <span style="text-decoration: underline;">only</span> affect your <span style="text-decoration: underline;">own</span> searches. </li>
<li>Your <span style="text-decoration: underline;">comments</span> are <span style="text-decoration: underline;">visible</span> to the <span style="text-decoration: underline;">public</span> </li>
</ul>
<p><a href="http://blog.karmona.com/wp-content/uploads/2008/11/google-searchwiki.jpg"><img class="alignleft size-full wp-image-267" title="Google SearchWiki" src="http://blog.karmona.com/wp-content/uploads/2008/11/google-searchwiki.jpg" alt="" width="500" height="162" /></a></p>
<p>Random thoughts on Google Search Wiki:</p>
<ul>
<li>You need to be very brave to change usability patterns in your world-leading-search-cash-cow (!)</li>
<li>Why wasn&#8217;t it tested as yet another interesting Google Lab project?</li>
<li>The arrows &#8220;soup&#8221; is really too much for the lonely-searcher &#8211;&gt; way too many arrows if all you wanted is just search.</li>
<li>The comments I saw until now are mainly spam or not interesting.</li>
<li>The most important feature in Search Wiki is a way to turn it off but it is still missing&#8230;</li>
<li>It is a good time to change your default search engine ;)</li>
<li>Is it only me or Search Wiki have the <a title="Google Labs | Lively" href="http://googleblog.blogspot.com/2008/11/lively-no-more.html">lively</a> smell all over it?</li>
</ul>
<p>I must be <a title="The Dunning-Kruger Effect" href="http://blog.karmona.com/index.php/2008/11/15/the-dunning-kruger-effect/">missing</a> something since the Google guys are very far from being stupid (to say the least) and it will be a very interesting to see if Google will change the search experience yet again with this move.</p>
<p> </p>
<p>Update (10 Dec. 2008) : <a title="Marissa Mayer" href="http://en.wikipedia.org/wiki/Marissa_Mayer">Marissa Mayer</a> <a title="Marissa Mayer At Le Web: The (Almost) Complete Interview" href="http://www.techcrunch.com/2008/12/10/marissa-mayer-at-le-web-the-almost-complete-interview/">promised</a> that Google Search Wiki would <a title="Google Search Wiki To Soon Include An Off Button. Thank You, Marissa." href="http://www.techcrunch.com/2008/12/10/google-search-wiki-to-soon-include-an-off-button-thank-you-marissa/">soon</a> have a toggle button that allow people to turn it off (“early Q1.&#8221;) – I can&#8217;t wait… :)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.karmona.com/index.php/2008/11/21/random-thoughts-on-google-searchwiki/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Yahoo Open Strategy</title>
		<link>http://blog.karmona.com/index.php/2008/10/28/yahoo-open-strategy/</link>
		<comments>http://blog.karmona.com/index.php/2008/10/28/yahoo-open-strategy/#comments</comments>
		<pubDate>Tue, 28 Oct 2008 23:54:59 +0000</pubDate>
		<dc:creator>Moti Karmona &#124; מוטי קרמונה</dc:creator>
				<category><![CDATA[Development]]></category>
		<category><![CDATA[Disruptive Technology]]></category>
		<category><![CDATA[Internet]]></category>
		<category><![CDATA[Search]]></category>
		<category><![CDATA[Software]]></category>

		<guid isPermaLink="false">http://blog.karmona.com/?p=235</guid>
		<description><![CDATA[Yahoo have released the Y!OS (Yahoo Open Strategy) 1.0 platform. This is a cool set of simple APIs that can give you access to everything you ever wanted in Y! but was afraid to ask for&#8230; Yahoo! Social Platform (YSP) // The Yahoo Social Platform is a set of RESTful APIs for Profiles, Connections, Updates, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://blog.karmona.com/wp-content/uploads/2008/10/yos_diagram.jpg"><img class="alignleft size-thumbnail wp-image-236" style="float: left;" title="Y!OS" src="http://blog.karmona.com/wp-content/uploads/2008/10/yos_diagram-150x150.jpg" alt="" width="150" height="150" /></a>Yahoo have <a title="Y!OS 1.0 Launch" href="http://developer.yahoo.net/blog/archives/2008/10/yos_10_launch.html">released</a> the <a title="Y!OS Introduction" href="http://developer.yahoo.com/yos/intro/index.html">Y!OS</a> (Yahoo Open Strategy) 1.0 platform.</p>
<p>This is a cool set of simple APIs that can give you access to everything you ever wanted in Y! but was afraid to ask for&#8230;</p>
<p><strong>Yahoo! Social Platform</strong> (<a title="YSP | Y! Social Platform" href="http://developer.yahoo.com/social/">YSP</a>)<br />
<span style="color: #008000;">// The Yahoo Social Platform is a set of RESTful APIs for Profiles, Connections, Updates, Contacts and Status.</span></p>
<p><strong>Yahoo! Query Language</strong> (<a title="YQL | Y! Query Language" href="http://developer.yahoo.com/yql/docs/">YQL</a>)<br />
<span style="color: #008000;">// The Yahoo Query Language is a web service that functions much like SQL (see example below)</span></p>
<p><strong>OAuth Authentication</strong><br />
<span style="color: #008000;">// OAuth is the authentication and authorization standard Yahoo has decided to use when giving third parties access to Yahoo user data.</span></p>
<p><strong>Yahoo! Applications Platform </strong>(<a title="YAP | Y! Application Platform" href="http://developer.yahoo.com/yap/guide/">YAP</a>)<br />
<span style="color: #008000;">// Currently very limited and in a restricted sandbox.</span></p>
<p>________________________________</p>
<p><strong>Example: How to use YQL APIs to access MyBlogLog profiles?</strong></p>
<p>Simply ask for all the community members of MyBlogLog community with this YQL:</p>
<p><em>select * from mybloglog.members.find where community_id in (select id from mybloglog.community.find where name=&#8221;Karmona Pragmatic Blog&#8221;) </em></p>
<p>And once you have the IDs you can ask for my personal profile by:</p>
<p><em>select * from mybloglog.member where member_id =&#8221;2008070609482910&#8243; </em><span style="color: #008000;"></span><br />
<a title="YQL Console" href="http://developer.yahoo.com/yql/console/"></a></p>
<p>Well… together with the existing <a title="Y! BOSS API" href="http://developer.yahoo.com/search/boss/">BOSS API,</a> this set of APIs is a powerful enablers to the Y! development network and I am sure some cool stuff are going to emerge from this innovative move…</p>
<p>Amazing!!!</p>
<p>________________________________</p>
<p>* You can have more YQL experiments using the <a title="YQL Console" href="http://developer.yahoo.com/yql/console/">YQL Console</a></p>
<p>** <a title="BOSS HACK Day | 2008" href="http://developer.yahoo.net/blog/archives/2008/10/boss_hack_day_h.html">Boss Hack Day</a> is coming to Tel-Aviv | November 6, 2008 @ Feature (!!!)</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.karmona.com/index.php/2008/10/28/yahoo-open-strategy/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

