這台怪物據說是 Google Books Search 用來把書頁內容掃進資料庫以供搜尋的究極兵器。在這篇報導中介紹了這台怪兵器的厲害之處:
According an essay written by Joseph Huttner, “Google recently implemented a convoy of new machines from Kirtas Technologies called the APT BookScan 1200, which is capable of scanning up to 1200 pages per hour (Kirtas Technologies). The machine works by utilizing a 16-megapixel digital camera to photograph each page, then transfers the image to local storage. Then, a robotic arm gently turns the page of the book, and the process is completed for each additional page. The digital images are then ‘cleaned’ to remove smudges and other errors, cropped, and centered. At this point, the image can be posted online, but “searching through the text is impossible”.
文章雖說在拍完後仍無法對內文進行搜尋,但圖檔很快就會送到 Google 的 OCR 軟體去辨識成數位化的文字資料,這樣就可以搜尋了。BTW,Google 說最近會推出支援中文的 Google Book Search。到時有多少台灣和中國的出版社想跟 Google 玩呢?值得觀察。
這台之前在吉米丘的電子報中看過。
連結在此:http://forums.jcms.com.tw/viewtopic.php?t=6407
真是部有趣的機器阿,一小時1200頁真快!!
[…] 連結在此,這應該是封面掃瞄時出了問題,結果封面就整個爛掉了。看來用高速掃瞄器掃書時,最好還是有人檢查一下掃瞄結果才好,不能全丟給機器。 […]
[…] 關鍵技術是什麼?" ( http://bit.ly/beYiA0 ) 那福忠顯然不知道這個東西… https://wp.tenz.net/archives/196 18 hrs […]
[…] 關鍵技術是什麼?" ( bit.ly/beYiA0 ) 那福忠顯然不知道這個東西… wp.tenz.net/archives/196 //標題下已經寫了:本篇文章價值 0 元…XD […]