SEM Wired

Archive for the ‘ search ’ Category

What is the Canonical Link Tag?

A few months ago Google introduced the Canonical Link Tag.

This tag is supposed to solve the duplicate content issue on different URL, which can negatively affect those page’s ranking. It just tells search engine the preferred version of the content, in order to be ranked better by the engines themselves.

The Canonical Link Tag should reside in the HEAD section of the page, and the href attribute should point to the URL of the chosen page. That should be enough.

<link rel="canonical" href="http://www.yourURL.com/">

Canonical Link Tag vs. Redirect

Upsides
It is way better than a 301 redireccion: it’s search engine to be redirected, not the users hence not affecting the user experience though making sure rankings wouldn’t be affected.

A 301 redireccion would actually affects search engine who have to update their rankings according to the quality of the content of those pages

Downsides
However, while the 301 redirects visitors and search engines from different domains, the Canonical link tag can be used only into a single domai, its folders and subdomains. That’s the only downside. Still it’s a pretty tool.

Possible application of the Canonical Link Tag

Usually PHP pages create dynamic content with random urls, lacking of informative content related in any way to the content. They usually have the visitor’s session ID and merge content from different sources. The Canonical Link Tag can preserve rankings of the original page which featured the content.

To learn more about the Canonical Link Tag, check out this post from Matt Cutts or take a look into the Google Webmaster Center.

A few days aho I wrote a post about the rising importance of real-time search and Google and Bing crawling content from Twitter.
microsoft-launches-bingtweets
Now Microsoft has just released an interesting search tool called BingTweets: it’s a real time Twitter search engine with a pretty crowded front panel showing all cross references between standard and dynamic real-time search.

It’s definetely something Google should think about. It’s also interesting how the mixed the standard search results with the Twitter search results, giving a complete feedback over search on different sources.

What people has predicted since Twitter start hitting the news has become true: Twitter is not turning into a search engine but its content is considered valuable by search engines.

Despite all the Twitter-mania-hype (absolutely unnecessary but media needs something to blabber about right?) I guess Twitter is slowly finding out its real nature: a real-time updated source of “what’s going on” in the world, reflecting the current vibe of the net, an extremely dynamic and constantly updated source of useless content. That’s it. There’s nothing really valuable in most twitts: the thing is the reflect what’s going on in the world.

That’s why a real-time search is growing day by day. It’s the vibrating pulse of the net, and being able to rummage through all this garbage gives the advantage to know the present and the very next future of the net.

how-to-calculate-web-page-loading-time

Page loading time is a crucial factor when it comes to estimate page’s usability as well as search engine’s rankings and Google’s Adwords quality score.

according to this statement, Google is going to update its assessment policy . Page load time it is soon due to become an important factor requiring specific assessment and optimization from webmasters and SEOs.

In order to calculate page loading time, you might try Pingdom. Pingdom is an online utility which calculates pages text, images and scripts, giving you also some interesting suggestions about improvement.

However, besides any Google statement about quality score or rankings, it’s always good to think about you user’s experience on your website: nobody likes websites taking too long to load.

Considering both usability and rankings, this should be some general guidelines to be followed one assessing and optimizing page’s loading time:

    Text and page structure should load and be completely visible within the first 3 seconds
    Any other element of the page must be visible within the first 8 seconds

These are only thumb rules of course. Always cehck your page weight and loading times. Working on this side of on-page optimization can lead to interesting discoveries sometimes.

It seems that search engine are digging through Twitter more and more

. In the past few days a came across a few episodes which made me think about how much search engine are considering Twitter content as valuable enough to be included into their own search results.

Let’s take a look at Bing first. Try looking for Al Gore this query on Bing: the very first result will be it’s Twitter page with is real time twits. This entry comes also before his personal website. This has been also confirmed by a public statement from Bing published on its community page.

Something interesting happens with Google as well. A few days ago I posted a link to one of my websites I found and I wanted to share with my Twitter followers. Analytics record show the following as the very first visit on the website right after the link was published on twitter:

216.200.55.83 invx.com [01/Jul/2009:02:49:01 +0200] “GET /a HTTP/1.1″ 404 136 “-” “Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)” “-”

Clearly the page was just being crawled by Google. The link has not apperead anywhere else and it is not a fronpage or entry page for search engines on the website.

This it’s just another proof on how important Twitter is becoming in terms on search value. It’s not going to take long before Google itself will either buy or create its own real time search tool.