multichannel merchant
RSS Feeds Advertising | Contact Us | DIRECT | E-Newsletters | Subscribe
advanced
search
 

The Red Envelope, please
Aug 1, 2007 12:00 PM


JobZone
Search and post jobs for the Multichannel Merchant. Including jobs for brand & agency marketers, e-commerce, catalog marketers, ops & fulfillment, direct marketing and more.  
Click here to access JobZone

Find any supplier you need - agencies, CRM, fulfillment, lists, e-commerce, paper, printers, telemarketing, and more.
Featured Categories
Fulfillment
Warehousing
Lists & Data
Telemarketing
Merch. Order Processing
Shipping & Distribution
Print, Production & Paper
Lists and Data Processing
:: view all categories
toolbox
ListFinder
Get free access to more than 50,000 list data cards - one of the most comprehensive databases in the industry.
>> Search Now

sponsored content

The category and subcollection pages are not making it into the search engines at all — not because of their spider unfriendly URLs, but because they are being specifically blocked through “disallow” directives in the site's robots.txt file. Robots.txt is the place where you can give commands to Googlebot and the other spiders, such as “stay away from this directory” or “stay away from this file type.”

Looking at the site's robots.txt file, I see that category pages, sub-collection pages and product information pages are all being disallowed. You might wonder why this is the case. Wouldn't Red Envelope want these pages indexed and ranked, despite any inherent search engine unfriendliness? Well, in this case, the answer is no.

That's because Red Envelope has alternate pages for the spiders to index, using a technology called SearchDex. SearchDex autogenerates thousands (more precisely, 2,250, according to Google search results) of sitemap pages. These doorway pages (such as the one at http://www.redenvelope.com/giftcatalog/Ccat10095.jsp) are built specifically to lead spiders to product-level content pages.

These SearchDex “Ccat” pages may appear to be full of meaty, keyword-rich content, but upon closer examination of the content, it is quite apparent that the content was not written by a human. For example, consider this fine prose in the second sentence of the first paragraph on the aforementioned Ccat10095.jsp: “Our men's accessories range from men's fashion accessories to men's leather accessories, which are reasonably priced and unique to RedEnvelope.com.” Yuck!

Or consider the lead sentences on http://www.redenvelope.com/giftcatalog/Ccat10629.jsp: “Great Christmas presents make holidays magic. Unique Christmas presents from our collection of our newest gifts this Christmas season will bring good cheer.”

It's throwaway copy from the reader's perspective, but certainly dense with keywords: 10 occurrences of “accessories” and “men's” in the first paragraph of the former example, seven occurrences of either “Christmas presents” or “Christmas present” in the second example.

There are names for this black hat SEO tactic, none of which are complimentary: “keyword stuffing,” “spamglish,” and “doorway page” are just three that come to mind. This is a search engine ban waiting to happen.

Also in the aforementioned paragraph on Ccat10629.jsp, the words “my Christmas presents” are actually wrapped within heading tags, yet that fact is hidden from the user. The headings are given the exact same font, style, and treatment as the rest of the paragraph copy, so they are indistinguishable from the surrounding text and buried within the paragraph.

Links within the paragraph copy are hidden in the same way. Clearly, this was done only for search engines and not for humans. This is definitely the sort of thing that Google's automated algorithms seek to detect and penalize.

The title tags are similarly keyword-stuffed. A good rule of thumb with title tags is not to repeat a word three times and not to repeat more than two words. In the title tag of the aforementioned page (Ccat10629.jsp), “Christmas presents” is repeated twice, “gifts” is repeated three times, and then “gift.” Furthermore, the title spans 17 words — too long. I would go for a dozen words or less.

Looking again at the cached version of the home page (the one that Googlebot was given), I see that the majority of links on that page are wasted, because they link to category and subcollection pages that are being disallowed. Where are the links to the SearchDex pages?

There's only one SearchDex link — to the top page of the SearchDex sitemap. And that's quietly tucked away in the copyright line at the bottom of the page, since the linked page is not really meant for human consumption, only for spiders. There are no graphics on this sitemap page; it is a page chock-full of text links to various SearchDex Ccat pages.

Links contained on the home page along with their anchor text count heavily toward SEO. For instance, the “jewelry” text link would, in normal circumstances, help the linked page rank well for jewelry-related searches. That's because the search engines associate the anchor text with the page being linked to. Not so here; these navigation links are of no value because of the disallow.

If the “jewelry” category page weren't disallowed, it would be unlikely to rank well due to the lack of text content on the page. Contained on this page, however, are text links to search results pages. Search results pages can make for good search engine fodder, but fewer than 100 of these search results pages are making it into Google's index. And nearly all of those are in the supplemental index — an indicator that they are unlikely to rank well in all but obscure queries.

The logo in the top left on all the pages across the site (with the exception of the SearchDex pages) links to the home page — but using the spider unfriendly URL complete with session ID, rather than http://www.redenvelope.com/. Thus these links pass PageRank to a different version of the home page rather than reinforcing the PageRank of the true home page.

Although you wouldn't be able to tell this from the Google toolbar (due to the unique session ID-containing URL you will have been redirected to upon visiting), RedEnvelope.com has a respectable home page PageRank. By using the “PageRank Lookup” tool from SEOChat.com, I was able to determine that the home page scores a 6 out of 10.

But since PageRank is on a logarithmic scale, 6 is not as good as you may think — a 7 or an 8 would be much better. SEOChat's “PageRank Search” tool reveals something rather alarming: The majority of the SearchDex auto-generated pages score 0 out of 10, particularly at the product level. A number of Ccat pages have a PageRank 2 or 3, and only a few have a PageRank 4.

Yahoo Site Explorer (siteexplorer.search.yahoo.com) reveals quite a healthy set of inlinks — from blogs, shopping sites, news articles, directories, and so on. Yahoo counts nearly 20,000 inlinks (excluding internal links). With some re-architecting of the site, this “link juice” could really be much more effectively leveraged across Red Envelope's site. That — along with rewriting the URLs to eliminate session IDs and “stop characters” (ampersands, equal signs, question marks) from the URLs; discontinuing the questionable SEO tactics of doorway pages and hidden links; and adding meaty content — should have a profound impact on Red Envelope's rankings and search traffic.

Would you like to have your Website critiqued by our panel of pros?

If so, e-mail Mark Del Franco (mark.delfranco@penton.com) with the name and URL of your Website and some basic information about your target audience.

Please include “Website Critique” in the subject line of your e-mail.



Back to Top

BROWSE ISSUES
August 1, 2008 Cover July 1, 2008 Cover June 1, 2008 Cover May 1, 2008 Cover April 1, 2008 Cover March 1, 2008 Cover February 1, 2008 Cover
  August 1, 2008 July 1, 2008 June 1, 2008 May 1, 2008 April 1, 2008 March 1, 2008 February 1, 2008


BROWSE E-NEWSLETTERS
   
  View Sample
Subscribe
View Sample
Subscribe
View Sample
Subscribe
View Sample
Subscribe
View Sample
Subscribe
View Sample
Subscribe
 

BROWSE BACK ISSUES