Surprise Surprise… My new facebook username / vanity URL is http://www.facebook.com/karmona
Go and get yours before someone else will @ http://www.facebook.com/username/, although it is probably too late ;)
Good Luck!
Surprise Surprise… My new facebook username / vanity URL is http://www.facebook.com/karmona
Go and get yours before someone else will @ http://www.facebook.com/username/, although it is probably too late ;)
Good Luck!
→ 2 CommentsTags: Conspiracy · Internet · Social Network
I was analyzing, dreaming, monitoring, crawling, debugging, reading, breathing, cursing, scaling, visualizing and learning the social graph for the last couple of months and I thought it might be a good idea to write a little something about The Social Graph Challenge with a pragmatic twist on few other common concepts.
——— Blitz Introduction to The Social Graph ———
The social graph is just a simplified mathematic abstraction when nodes are people and edges are relations between them.
In the last decade the internet have became more social than was ever expected it to be with the rapid growth and adaptation of social networks, social media and user-generated contributions and interactions.
Nowadays, there is a growing feeling that it is feasible to model and map the social web into a real-life social graph replication.

——— Pragmatic Overview on The Social Graph Challenge ———
Modeling | Building | Processing | Size | Architecture
*** Vocabulary
To better understand how complicated it is to create a vocabulary for expressing metadata about people, their interests, relationships and activities you should simply pay a quick visit to the FOAF Project technical specification page
The FOAF (“Friend of a Friend”) Project has the most comprehensive model available today and it is still lacking some basic modeling granularity e.g. time awareness metadata, no privacy model, poor relationship model
*** The Social Cloud
It is common mistake to forget that people are more than just flat internet identities (e.g. Linked profile) and to complete the profile modeling we must add all their content to the graph e.g. Personal Blog, Flickr images, YouTube Videos, Delicious bookmarks, Tweets, Blog Comments etc.
Modeling all these content and consumption types will yield a broader definition (a.k.a. The Social Cloud) with even more complex modeling challenges.

*** The Paradigm Shift
While conventional internet crawlers, follow hyperlinks within web pages and treat pages as plain-text, social crawlers should have social-”awareness”:
*** The Standards Dilemma – No Silver Bullet
Beside FOAF, there are several open standard like RSS, ATOM for content syndication and microformats like HCard, XFN for profiles and network discovery, that seems promising and can help with the identification quest but although this is being pushed by giants (e.g. Google Social Graph API) the adaptation is still low and have many correctness and corruptions issues - e.g. all these people claimed to be WordPress.com using the XFN (rel=”me”) microformat
*** The Promise of Structured Sources (a.k.a. The structure myth)
The Myth: Most social Media sites (e.g. FaceBook, LinkedIn, MySpace, Flickr etc.) have a public available structured profile pages so in principle all need to be done is some XPath magic on HTML DOM to finish the parsing task.
But… Most of the work isn’t parsing but data modeling which require deep understanding of each site user model and usage
*** Few more Challenges with Social Crawling:

(3) Processing the Social Graph
*** The Identity Crisis
*** Graph Enrichment

Let’s have some quick (and very dirty) guesstimates:
World Population is approx. ~6.7 Billion / 22% Internet penetration => 1.5 Billion internet users
Let’s say 65% of these users have some kind of presence in Social Media (~20% have more than one) => ~1 Billion Profiles x ~10 content items per profile
+ 1 Billion Profiles Nodes x ~100 network relations per profile => ~110 Billion Graph Edges + ~10 Billion Graph Nodes
It is highly depended on graph implementation but with this numbers, you can easily find yourself with ~1-2 Terabytes of graph metadata alone (without contents and profiles*)

(5) Two Cents on Social Graph Architecture
Updating and querying gigantic, dynamic, distributed, directed, cyclic, colored, weighted graph have “some” algorithmic, computational complexity – a little more complex than a blog post could cover…;-)
You can take a quick look at the tiny 15 Giga, 25 million nodes graph implementation in LinkedIn to get a glimpse to the technological challenge …
* Note: Indexing content and profiles data (e.g. for Building a Social Search Engine) is an architecture challenge equivalent to any modern search engine with ~10 Billion documents index

This is only the tip of the iceberg but it is more than enough for one blog post ;)
_________
Credit: All the images were taken from Tamar Hak‘s amazing artwork – creating The Delver Kid image.
→ 2 CommentsTags: Delver · Disruptive Technology · Search · Social Network
Dunbar’s number is the supposed cognitive limit to the number of individuals with whom any one person can maintain stable social relationships.
In a 1992 article, Dunbar used the correlation observed for non-human primates* to predict a social group size for humans and using a “simple” regression equation on data for 38 primate genera, Dunbar predicted a human “mean group size” of 150 (with 95% confidence interval of 100 to 230).
Dunbar’s Friends is my definition (and trademark ;-) to those few “real”, trusted and known people in your huge** online social network***.
—
* Primatologists have noted that, due to their highly social nature, non-human primates have to maintain personal contact with the other members of their social group, usually through grooming (and not “poking” as you might be expecting :-). The number of social group members a primate can track appears to be limited by the volume of the neocortex region of their brain.
** Did you know that Robert Scoble is following 21,060 people in Twitter, 2,992 in FriendFeed and only 71 “lousy” friends in Flickr…
*** Social Network for Dummies – Lee and Sachi LeFever (a.k.a. the CommonCraft‘s family :) have created a wonderful video explaining social network in plain English.
→ 1 CommentTags: Internet · People · Social Network
As I posted when I just started this blog – Almost 2 month ago I have decided to leave a promising (& cozy) career @ Mercury to join a challenging new-born start-up… called Delver.
This was an offer I couldn’t refused…
Delver is a venture-backed internet startup developing cutting-edge web application in the domain of Internet Search and Meta Social Networks.
It is more challenging, inspiring, interesting and exciting than I expected, imagined or I can put into words so you will have to join to understand… We are looking for top-notch algorithm researchers, .Net coders, mySQL DBAs, “Hackers”, QA experts and out-of-the-box thinkers… to join our unique development team (e.g. Pitz, Boo and Gabel)
Stay tuned (…)
→ No CommentsTags: Delver · Internet · Recruiting · Social Network
According to Alexa (which isn’t the most accurate thing in our planet :-), Orkut users are mainly Brazilian, leading with 71% share!!!, India with 13.2% in the honorable 2nd place, US goes 3rd with 3.3% and taking the 4th place is Pakistan (?) with only 2%…
I was really relieved to see that according to Google fight (which is the 2nd most accurate thing in our planet after alexa :-) Brazilian still like Football more than Orkut.
Loren Baker tried to explain the Orkut phenomena with: “Orkut sounds like Yakult or “iogurte” (yogurt)… Everyone drinks it in Brazil when they’re kids…” a.k.a. “Are you stupid?” – one of the amusing comments below his post…
Google’s “Black Sheep”:
So Google’s 3-year-old social network Orkut isn’t behaving according to the Google family expectation a.k.a. “why can’t you be more like your big sister Gmail?!”
Who knows… Maybe the new SocialStreem “treatment” (& Blogger integration) will help Orkut to grow-up and maybe it will only help it to grow-up in Brazil…
→ No CommentsTags: Google · Internet · Social Network