Oct 31, 2008 6:21 AM

Google Begins Indexing Scanned Documents

Google has begun to index documents posted online that contain images of text using Optical Character Recognition (OCR) technology, it announced yesterday on its blog. Previously only docs converted to PDFs with text were indexed and included in results. Since scanned docs are only a picture of text, they are typically more difficult to interpret, […]

Google has begun to index documents posted online that contain images of text using Optical Character Recognition (OCR) technology, it announced yesterday on its blog.

Previously only docs converted to PDFs with text were indexed and included in results. Since scanned docs are only a picture of text, they are typically more difficult to interpret, and the pages can include wrinkle, smudges or stains.

This advancement opens up a whole new collection of information, including many government and academic documents once hidden from the public searches.

The news comes a few days after Google settled its book-scan suit, giving it the go-ahead to continue its book search project.

TopicsGoogle Search

Bill Atkinson, Macintosh Pioneer and Inventor of Hypercard, Dies at 74

Atkinson’s gleeful brilliance helped people draw on computer screens and access information via links.

Steven Levy

24 Best Deals on Father's Day Gifts

Get Dad a WIRED-approved weather station, pair of headphones, or treadmill for less.

Louryn Strampe

The Best Backpacking Tents for Getting Away From It All

The right shelter makes all the difference in the backcountry. Here are the best tents we’ve tested and love.

Scott Gilbertson

Tech Up Your Sourdough With These Upper-Crust Baking Gadgets

Sourdough bread is one of the most wonderful things you can make with your hands, but it can be fussy and hard to perfect. Now technology takes out most of the pesky guesswork.

Joe Ray

Which Samsung Galaxy Phone Should You Buy?

From flagship and budget to flipping and folding, Samsung’s Galaxy range spans the breadth of the smartphone cosmos. WIRED’s here to help you make your choice.

Julian Chokkattu

Everything You Need to Know About MicroSD Express

What is the latest MicroSD iteration, and why does your Nintendo Switch need it?

Brad Bourque

The Best Weighted Blankets

If you’re looking for the sensation of a hug, these weighted blankets—plus weighted robes, eye masks, and more—will snuggle you back.

Nena Farrell

Uber Just Reinvented the Bus … Again

Beyond the jokes about its new shuttle service are serious questions about what it will mean for struggling transit systems, air quality, and congestion.

Sophie Hurwitz

The 46 Best Shows on Netflix Right Now

Dept. Q, Sirens, and Black Mirror are just a few of the shows you need to watch on Netflix this month.

Matt Kamen

The 46 Best Movies on Netflix Right Now

Lost in Starlight, Kill Boksoon, and The Old Guard are just a few of the movies you should watch on Netflix this month.

Matt Kamen

The Mystery of iPhone Crashes That Apple Denies Are Linked to Chinese Hacking

Plus: A 22-year-old former intern gets put in charge of a key anti-terrorism program, threat intelligence firms finally wrangle their confusing names for hacker groups, and more.

Dhruv Mehrotra

Samsung Teases Z Fold Ultra, Bing Gets AI Video, and Nothing Sets A Date—Your Gear News of the Week

Plus: Ruark has new speakers, Photoshop comes to Android and summer's finest music player gets updated.

Julian Chokkattu