04.10.07


Pull Your Site Out Of The Supplemental Index

By Richard Hearne

I met Krishna De in Cork last month. She gave a fantastic presentation on marketing and leveraging the Internet to achieve your business goals.

In fact, without prejudice to any of the other speakers, I found that Krishna's topic area was of the most interest to me. Krishna also availed of my offer for a free site review. So without further ado…

KrishnaDe.com

I have to say I have always admired Krishna's website. It is well polished from the get-go. The homepage just speaks 'professionalism' to me.

If I were to find any fault it would be with the footer - I can't easily distinguish between text and links. But that would just be nit-picking.

More than meets the eye

It was only when I sent in a spider that the true size of Krishna's site became apparent. I knew that her blog had been online for a number of years, so I expected it to be quite extensive. But I hadn't expected this:

Crawler 1: 2,306 internal pages
Crawler 2: 2,604 pages (some external)

A look at Google's index shows that Krishna's site has a high number of pages in the supplemental index:

Pages Indexed: 1,330
Pages Supplemental: 964

That's a particularly high ratio of supplemental to indexed pages, and to me this is the most pressing issue for Krishna.
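As a rough check on that proportion, the two figures can be compared directly. This is only a sketch: it isn't stated whether the 1,330 indexed pages already include the 964 supplemental ones, so both readings are computed, and each is labelled as an assumption.

```python
# Figures quoted above from Google's index for the site.
pages_indexed = 1330
pages_supplemental = 964

# Reading 1 (assumption): the 1,330 figure already includes the supplemental pages.
share_if_included = pages_supplemental / pages_indexed

# Reading 2 (assumption): the 1,330 figure excludes them, so the total is the sum.
share_if_separate = pages_supplemental / (pages_indexed + pages_supplemental)

print(f"Supplemental share, reading 1: {share_if_included:.1%}")  # 72.5%
print(f"Supplemental share, reading 2: {share_if_separate:.1%}")  # 42.0%
```

On either reading, a large share of the site's pages sits in the supplemental index.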

A robot's eye view

Here's Krishna's robots.txt file:

User-agent: *
Disallow: /_mm/
Disallow: /_notes/
Disallow: /_baks/
Disallow: /MMWIP/
Disallow: /audio-for/
Disallow: /private/
Disallow: /onlinebrand/

User-agent: googlebot
Disallow: *.csi



When I look at some of the files that have made their way into the supplemental index I can see immediately that many should not be indexed in the first place.

HOLD PRESS - I've just noticed that Krishna's site has been hacked:


Those links at the top of the page shouldn't be there. That's taken from Google's cached version of the page. Here's the original page. This type of hacking is normally carried out by altering the .htaccess file to cloak your pages for Googlebot. Normal visitors are shown the original page, while Google sees the version with the injected links.
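One way to look for this kind of cloaking yourself is to fetch the same URL twice, once with a browser-style User-Agent and once with a Googlebot-style one, and compare the links in each response. The sketch below is a rough illustration using only the Python standard library; the function names are my own, and the User-Agent strings are examples. Cloaking that keys on the crawler's IP address rather than its User-Agent will not show up in a check like this.

```python
from html.parser import HTMLParser
from urllib.request import Request, urlopen

# Example User-Agent strings (assumptions; any browser-like value would do).
BROWSER_UA = "Mozilla/5.0 (Windows NT 10.0; Win64; x64)"
GOOGLEBOT_UA = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag it sees."""

    def __init__(self):
        super().__init__()
        self.links = set()

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.add(value)


def extract_links(html: str) -> set:
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


def fetch(url: str, user_agent: str) -> str:
    """Download a page while presenting the given User-Agent header."""
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


def links_only_shown_to_crawler(browser_html: str, crawler_html: str) -> set:
    """Links present in the crawler's copy but absent from the visitor's copy."""
    return extract_links(crawler_html) - extract_links(browser_html)
```

In use, you would call fetch() on your own page with each User-Agent and pass the two results to links_only_shown_to_crawler(). An empty result does not prove the site is clean, but a non-empty one deserves a closer look.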

I've seen this hack a lot recently. The best medicine is to make sure that your software is up-to-date. There have been issues with WordPress, and that's why the WordPress guys are very much on the ball with updates. You have to carefully check your server to see what else has been left behind. The first file I would check is .htaccess, although in this case I have a feeling there may be a bit more going on.
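As a starting point for that .htaccess check, the sketch below flags rewrite conditions that test the visitor's User-Agent against crawler names, which is one common way this kind of cloaking is set up. The list of crawler names and the function name are assumptions for illustration; this is a heuristic, not a complete audit, and a hit can be perfectly legitimate.

```python
import re

# Heuristic patterns (assumptions): a RewriteCond on the User-Agent header
# that mentions a well-known crawler name.
USER_AGENT_COND = re.compile(r"RewriteCond\s+%\{HTTP_USER_AGENT\}", re.IGNORECASE)
CRAWLER_NAMES = re.compile(r"googlebot|slurp|msnbot", re.IGNORECASE)


def suspicious_lines(htaccess_text: str) -> list:
    """Return (line number, line) pairs worth reviewing by hand."""
    hits = []
    for number, line in enumerate(htaccess_text.splitlines(), start=1):
        stripped = line.strip()
        if stripped.startswith("#"):
            continue  # skip comments
        if USER_AGENT_COND.search(stripped) and CRAWLER_NAMES.search(stripped):
            hits.append((number, stripped))
    return hits
```

Running it over the text of each .htaccess file on the server gives a short list of lines to inspect. Anything that sends crawlers to a different script than ordinary visitors is a candidate for the injected links.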



About the Author:
Richard Hearne is the founder of Red Cardinal, a dedicated search marketing consultancy. A frequent contributor to Google's Webmaster Group, Richard regularly advises clients on Internet marketing strategy and Search Engine optimisation campaigns. Richard's thoughts and research can be found on his search marketing blog.

About SmallSiteNews
News, Tips, and Resources for Small Web Site Developers

-- SmallSiteNews is an iEntry, Inc. publication --
iEntry, Inc. 2549 Richmond Road, Lexington, KY 40509
2007 iEntry, Inc. All Rights Reserved | Privacy Policy | Legal
