Google Can Now Index Scanned PDFs

Google just doesn’t seem to cease in rolling out really cool stuff. Now, they have developed technology that allows them to index scanned PDF files. Google is already able to read documents saved as PDFs, but will now also have access to scanned PDFs.

There are tons of government and scholarly articles on the web whose only search engine friendly properties are metadata. Now, using Optical Character Recognition (OCR) technology, Google can scan these files and convert them to text files that are readable by the search engines.

Google also recently made Flash files readable, as well as files hidden behind forms. Now PDFs like the first result listed when you search for repairing aluminum wiring.

Any companies that have a lot of content printed out and haven’t felt like utilizing other technologies to convert their scans to text may now consider putting these on their sites and letting Google do the work for them. It’s always good from an SEO standpoint to add new content, not to mention how useful it may be.

Viewing 3 Comments

Trackbacks

close Reblog this comment
blog comments powered by Disqus