Buck Anderson - DNN Professor Member Login
May 21, 2012

Article Search  
Article Index

Click to grab the RSS feed for the DNN Professor Article IndexDotNetNuke Tutorials and Articles

We cover a lot of different DNN topics on this site - this article index gives you a pretty good idea of the DotNetNuke tutorials, XMod tutorials, DNN training, DNN hosting, and Internet Marketing subjects we cover for beginners and advanced DotNetNuke users.

The following is a complete index of all articles on this site - presented in the most current to oldest article sequence. You can view categories and search the index by subject.

Check back often for new articles, tips on our Business Directory, or have articles conveniently delivered to your favorite program by clicking the RSS feed icon above.


Share    
27
Ventrian News Articles Module Search Engine Optimization Tip

If you are like me, you are constantly paying attention to Google Analytics and the search engine placement of your articles and content.

During a recent Google search expedition, I noticed that the print pages of my articles produced with the Ventrian News Article module were being linked on Google and not my article detail pages.

Since a print page does not have menu navigation, readers could not navigate the rest of the site.

My wife Mara and I share common articles on her better living and gardening tips site, MaraKknows.com and my outdoor recreational site, JustNorth Outdoors.

Here is an example of Google search results:

Compare Google links using Ventrian NewsArticles module

 

I was able to correct the situation and have Google index the correct article detail pages by adding a line to Mara's robots.txt file.

What on Earth is a robots.txt File?

A robots.txt is a file placed on your server to tell the various search engine spiders not to crawl or index certain sections or pages of your site. You can use it to prevent indexing totally, prevent certain areas of your site from being indexes or to issue individual indexing instructions to specific search engines.

The file itself is a simple text file, which can be created in Notepad. It need to be saved to the root directory of your site, that is the directory where your home page or index page is.

By adding the following line to Mara's robots.txt file:

Disallow: /DesktopModules/DnnForge%20-%20NewsArticles/Print.aspx

It prevents the Google robots from indexing the print page and Google should begin indexing the correct pages.

Here is an example of a robots.txt file:

User-Agent: *
Allow: /
Disallow: /language/language/
Disallow: /desktopmodules/
Disallow: /Terms.aspx
Disallow: /Privacy.aspx
Disallow: /*/ctl/
Disallow: /Host/
Disallow: /Admin/
Disallow: /Portals/1/Logs/
Disallow: /App_Browser/
Disallow: /App_Code/
Disallow: /App_Data/
Disallow: /App_GlobalResources/
Disallow: /HttpModules/
Disallow: /bin/
Disallow: /Components/
Disallow: /Config/
Disallow: /Controls/
Disallow: /Documentation/
Disallow: /DesktopModules/
Disallow: /Providers/
Disallow: /Install/
Disallow: /Resources/
Disallow: /js/
Disallow: /Portals/
Disallow: /DesktopModules/PropertyAgent/Print.aspx
Disallow: /DesktopModules/DnnForge%20-%20NewsArticles/Print.aspx

Sitemap: http://maraknows.com/SiteMap.aspx
Sitemap: http://maraknows.com/DesktopModules/DnnForge%20-%20NewsArticles
/Rss.aspx?TabID=56&ModuleID=379
Sitemap: http://maraknows.com/portals/0/MaraKnowsQuickTips.xml

Another method to avoid the search robots from indexing the print page is to add a meta tag to the header of the /DesktopModules/DnnForge%20-%20NewsArticles/Print.aspx page.

Example:

<META NAME="ROBOTS" CONTENT="NOINDEX, NOFOLLOW">

Indicates that a robot should neither index a document, nor analyze it for links.

Wishing you success,
Buck Anderson
DNN Professor

DNNprofessor.com
http://twitter.com/DNNprofessor


Share    

Post Rating

Comments

Mark Holburn
# Mark Holburn
Wednesday, October 20, 2010 3:23 AM
We have produced a web site for someone using the Ventrian News Module and we want to stop receiving emails every time a comment is posted on the site as we don't need to know about them and about 10 comments/emails come in every day. How do we stop the Host email address from receiving these?
Ryan
# Ryan
Wednesday, November 24, 2010 7:03 AM
Which directory on the filesystem correspond to: "It need to be saved to the root directory of your site, that is the directory where your home page or index page is"

I have two portals on my site with the following paths:
c:\inetpub\wwwroot\DotNetNuke\Portals\0
c:\inetpub\wwwroot\DotNetNuke\Portals\1

I tried to place the robots.txt in the \0 and \1 directories but I get a 404 error.

Thanks
Buck
# Buck
Wednesday, November 24, 2010 8:36 AM
Ryan,

The robots.txt file must be in the top root folder of your site not in the portal directories.

You may need to FTP to your site to add this file. It would be placed in the same location as web.config.

Your robots.txt URL should be www.yoursite.com/robots.txt

Join the Discussion!


Buck Anderson - DNN ProfessorLet us know what you think. Or ask us anything. Or offer your own sage advice.

The only rule: RESPECT THIS HOUSE! Postings that contain abusive language and/or personal attacks will be cheerfully VAPORIZED. One cross word and – POOF! – your well-thought-out post will be gone in a puff of smoke.

          Buck

RSS comment feed RSS feed for comments on this post | Permalink URL

Note: For security, public comments require an Email address (Email will not be published and is also used for your Gravatar image)

Post Comment

Only registered users may post comments.


Copyright 2004 - 2012 by Jive Media Group LLC. All Rights Reserved.  | Terms of Use | Privacy Statement | Email White Listing | Site Map