User-agent: * Disallow: / # but allow only important bots User-agent: Googlebot User-agent: Googlebot-Image User-agent: Mediapartners-Google User-agent: msnbot User-agent: msnbot-media User-agent: Slurp User-agent: Yahoo-Blogs User-agent: MJ12bot User-agent: Yahoo-MMCrawler User-agent: NostoCrawlerBot User-agent: Zend_Http_Client User-agent: Pingdom.com_bot_version_1.4_(http://www.pingdom.com/) ####################################### ############### SITEMAP ############### ####################################### Sitemap: http://www.peterharrington.co.uk/sitemap.xml ####################################### ################ PAGES ################ ####################################### ####################################### ##### MAGENTO DIRECTORIES & FILES ##### ####################################### ##### Directories ##### Disallow: /404/ Disallow: /app/ Disallow: /cgi-bin/ Disallow: /downloader/ Disallow: /includes/ Disallow: /lib/ Disallow: /magento/ Disallow: /pkginfo/ Disallow: /report/ Disallow: /stats/ Disallow: /var/ # Disallow: /search* Disallow: */account/* ##### Paths (clean URLs) ##### Disallow: /index.php/ Disallow: /catalog/product_compare/ Disallow: /catalog/category/view/ Disallow: /catalog/product/view/ Disallow: /catalog/product/gallery/ #Disallow: /catalogsearch/ Disallow: /checkout/ Disallow: /control/ # Disallow: /contacts/ Disallow: /customer/ Disallow: /customize/ Disallow: /newsletter/ Disallow: /poll/ Disallow: /review/ Disallow: /sendfriend/ Disallow: /tag/ Disallow: /wishlist/ ##### Files ##### Disallow: /cron.php Disallow: /cron.sh Disallow: /error_log Disallow: /install.php Disallow: /LICENSE.html Disallow: /LICENSE.txt Disallow: /LICENSE_AFL.txt Disallow: /STATUS.txt ####################################### ######## QUERY STRING BLOCKER ######### ####################################### #Uncomment if the site is a brand new un-cached site. # Disallow: /*?* ####################################### #### WORDPRESS DIRECTORIES & FILES #### ####################################### ##### Uncomment if using Wordpress in subdirectory ##### Allow: /blog/wp-content/uploads/ #Disallow: /blog/wp-content/upgrade/ #Disallow: /blog/wp-admin/ #Disallow: /blog/wp-includes/ ####################################### ########### SCREAMING FROG ############ ####################################### #User-agent: Screaming Frog SEO Spider #Allow: / #Disallow: /*.gif$ #Disallow: /*.jpg$ #Disallow: /*.png$ #Disallow: /*.bmp$ #Disallow: /*.xml$ #Disallow: /*.css$ #Disallow: /*.js$root User-agent:* Disallow: /lib/ Disallow: /*.php$ Disallow: /pkginfo/ Disallow: /report/ Disallow: /var/ Disallow: /catalog/ Disallow: /customer/ Disallow: /sendfriend/ Disallow: /review/ Disallow: /*SID= Disallow: /checkout/ Disallow: /onestepcheckout/ Disallow: /customer/ Disallow: /customer/account/ Disallow: /customer/account/login/ Disallow: /catalogsearch/ Disallow: /catalog/product_compare/ Disallow: /catalog/category/view/ Disallow: /catalog/product/view/ Disallow: /*?dir* Disallow: /*?dir=desc Disallow: /*?dir=asc Disallow: /*?limit=all Disallow: /*?mode* Disallow: /app/ Disallow: /bin/ Disallow: /dev/ Disallow: /lib/ Disallow: /phpserver/ Disallow: /pub/ Disallow: /tag/ Disallow: /review/ Disallow: /blog/wp-content/plugins/ Disallow: /blog/wp-admin/ Disallow: /blog/readme.html Disallow: /blog/refer/ Allow: /blog/wp-content/uploads/ Allow: /blog/