Skip to main content

Google Filetype Operator is Broken


Google has a useful filetype search operator that helps you find non-HTML documents like Microsoft Word, PDFs or even Powerpoint presentations on the web.

Say you are searching for the operating manual of Canon EOS 5D Digital SLR Camera, you could type the following query in Google:

eos 5d site:canon.com filetype:pdf

I use this very frequently but today came across a funny bug in Google filetype operator.

Basically, if a website name ends with one of the searchable file extensions like doc or pdf, Google would confuse that website for a document. For instance, the following query will show all Microsoft Word documents stored on google.com

site:google.com filetype:doc

But if you look at the results page [screenshot above], none of the search results are actually Word documents - they are just names of Google groups that end with doc.

Update: Emma wrote to Matt Cutts about this Google filetype bug and here's the response of Matt on his blog
Emma, interesting post. I think it's known/expected that filetype:doc will return urls that end in .doc, even if the file isn't a Word file. That's why you should never name your web documents something.exe . :) I'll ask someone to be sure though in case that's new behavior.
Thanks Emma and Matt.

Popular posts from this blog

How to Download Contacts from Facebook To Outlook Address Book

Facebook users are not too pleased with the "walled garden" approach of Facebook. The reason is simple - while you can easily import your Outlook address book and GMail contacts into Facebook, the reverse path is closed. There's no "official" way to export your Facebook friends email addresses or contact phone numbers out as a CSV file so that you can sync the contacts data with Outlook, GMail or your BlackBerry. Some third-party Facebook hacks like "Facebook Sync" (for Mac) and "Facebook Downloader" (for Windows) did allow you to download your Facebook friends' names, emails, mobile phone number and profile photo to the desktop but they were quickly removed for violation of Facebook Terms of Use. How to Download Contacts from Facebook There are still some options to take Friends data outside the walls of Facebook wall. Facebook offers the Takeout option allowing you to download all Facebook data locally to the disk (include

PhishTank Detects Phishing Websites by Digg Style Voting

OpenDNS, a free service that helps anyone surf the Internet faster with a simple DNS tweak , will announce PhishTank today. PhishTank is a free public database of phishing URLs where anyone can submit their phishes via email or through the website. The submissions are verified by the other community members who then vote for the suspected site. This is such a neat idea as sites can be categorized just based on user feedback without even having to manually verify each and every submission. PhishTank employs the "feedback loop" mechanism where users will be kept updated with the status' of the phish they submit either via email alerts or a personal RSS feed . Naturally, once the PhishTank databases grows, other sites can harness the data using open APIs which will remain free. OpenDNS would also use this data to improve their existing phishing detection algorithms which are already very impressive and efficient. PhishTank | PhishTank Blog [Thanks Allison] Related: Google

Digital Inspiration

Digital Inspiration is a popular tech blog by  Amit Agarwal . Our popular Google Scripts include  Gmail Mail Merge  (send personalized emails with Gmail ),  Document Studio (generate PDFs from Google Forms ) and   File Upload Forms ( receive files  in Google Drive). Also see  Reverse Image Mobile Search , Online Speech Recognition and Website Screenshots , the most useful websites on the Internet.