Getting Indexed by Google
May 19, 2007 – 1:07 pmHyper Dog Blog posted an interesting article with 9 common web mistakes that prevent Google from indexing your site.
Here they are - as presented on their blog.
1. JavaScript Menus
Search Engines do not follow links reliably in JavaScript, if at all.2. Imagemaps
Search Engines cannot see the image, and so cannot classify the relevance or topic of the link. Lesser search engine robots do not even attempt to follow imagemap links.3. Image Links / Rollover links
These links frequently contain JavaScript, but also are difficuly for search engines to classify.4. JavaScript popups
Search Engines do not follow JavaScript reliably, and do not seem to like popups at all!5. “Jump menus”
These pulldown menus are usually submitting a form. If the form target is sent GET requests, there is a chance that the links will be followed in some manner, but again - this isn’t reliable navigation for Search Engines.6. NOSCRIPT embedded links
We were told that content in NOSCRIPT tags is for those visitors that have JavaScript off. But if you were told this means search engines, you were told wrong! This HTML tags has been abused by spammers early on, and search engines do not reliably follow navigation within these tags.7. Frames - they’re rarely done in a search friendly manner
More on the “right way” in a later post. Frames are challenging for search engines, and we have recently seen Google penalizing framee-based sites, perhaps due to the usability challenges they can present.8. Java
Java cannot be executed by search engines. Many early rollover effects relied on Java, but the navigation cannot be read by search engine robots.9. Flash
Flash navigation cannot be followed by search engines. Splash pages can become a deadend for search engines, and alternatives to Flash navigation should always be given.
Do you have your own theories (or proof) about what works and doesn’t to get indexed? Share them in the comments section below.