From: Karl Voit
Subject: Re: How to get a paper scanner into org-mode workflow
Date: Tue, 1 Jan 2013 23:46:26 +0100
Message-ID: <2013-01-01T23-26-39@devnull.Karl-Voit.at>
Reply-To: news1142@Karl-Voit.at
To: emacs-orgmode@gnu.org

* Torsten Wagner wrote:
> Hi,

Hi Torsten!

> I plan to buy a document scanner with ADF and duplex function to scan all
> incoming/intermediate/outgoing papers, convert them into PDF and link those
> into my org-files.

I definitely recommend buying the Fujitsu ScanSnap S1500. I have the Mac
version.

> I was wondering if someone did something like this already?

I am in the process of digitizing all of my physical life so far to
complete my lifelogging habit. I also removed the spines of many books
so that I could scan them for full-text search and archiving purposes!

> I use Linux and hence I am looking for a Linux friendly solution.

Although I prefer GNU/Linux myself, at least the Mac software bundle of
the Fujitsu is really worth the money. Its scanning and OCR features are
just what you need, with almost no hassle (except for the awfully bad
idea of using JPEG as the only scanning format).

Before the Fujitsu, I bought an HP OfficeJet Pro 8500A Plus, which I am
using as a (pretty decent) printer, because its scanning hardware and
software are *really* complicated/bad compared to the Fujitsu solution:
reliability, usability, feature set, speed - BIG difference.

I played around with Linux scanning/OCR solutions as well, but I could
not get reasonably fast, reliable, and suitable OCR results at all :-(

Note: The scan workflow on my Mac mini at home is more or less the only
reason why I have not installed GNU/Linux on this thing yet. (Most of
the time I use my Debian Wheezy notebook to get things done
efficiently.)

> I want to make this finally as smooth, easy and painless as possible to
> make sure I do not pile up papers because I am to lazy to scan them on a
> day by day basis.

Sure. BTDT.

> Ideally, I want to fill the ADF unit, press a button, receive a single PDF
> with all pages scanned duplex in a preset folder, and keep a path to that
> file in org-modes killring to place it as a link at the right place. OCR
> would be nice too to get a full searchable system.

OCR is a must if you invest time and money in such a workflow.

Except for the kill-ring part, I have a similar workflow: paper in
scanner, press a single button on the scanner, PDF (with OCR) appears on
the Mac, I enter the filename (usually starting with YYYY-MM-DD... or
YYYY-MM-DDThh.mm...), and it's done.

Most things I keep in ~/archive/YYYY/ and some things I file in
~/institutions/COMPANYNAME/

Because of the date/time-stamp part of the file name, I can index all
those files very easily using my Memacs[1] file-name module. Using a
yasnippet/custom-Org-link combination, I can link to those files
quickly.

So much for the Org-mode related stuff :-)

> CC. Any recommendation on a well working scanner under Linux. At the moment
> my favorite is the Fujitsu ScanSnap S1500. Seems to work out of the box and
> comes with a good performance.

Oh, you already did good research on this topic :-)

If you try out the Mac SW bundle (I guess Windows is similar), you are
not going to use Linux for this, I am afraid :-)

But if you do get different results and thus a workflow with GNU/Linux
which is as smooth as the Mac one, I am *very* curious about your
experience, notes, and lessons learned!

PS: Because of a cross-country skiing vacation I am looking forward to,
I will probably not be able to read this ML until next weekend or so.

  1. https://github.com/novoid/Memacs

-- 
Karl Voit
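
The filing convention described in the mail (names starting with
YYYY-MM-DD or YYYY-MM-DDThh.mm, filed under ~/archive/YYYY/) could be
sketched roughly as below. This is only an illustration under stated
assumptions, not the author's setup and not part of Memacs: the function
names archive_path and org_link, the regular expression, and the example
file name are all hypothetical. The link text uses the plain Org
[[file:PATH][DESCRIPTION]] form rather than the custom link type
mentioned above.

```python
import re
from pathlib import Path

# Matches names that start with YYYY-MM-DD, optionally followed by Thh.mm
# (the two prefixes mentioned in the mail). The pattern is an assumption.
DATESTAMP = re.compile(r"^(\d{4})-(\d{2})-(\d{2})(?:T(\d{2})\.(\d{2}))?")


def archive_path(filename, archive_root="~/archive"):
    """Return archive_root/YYYY/filename for a date-stamped file name.

    Raises ValueError if the name does not start with a date stamp.
    """
    match = DATESTAMP.match(filename)
    if match is None:
        raise ValueError(f"no leading date stamp in {filename!r}")
    year = match.group(1)
    return Path(archive_root) / year / filename


def org_link(path, description=None):
    """Format a plain Org-mode file link: [[file:PATH][DESCRIPTION]]."""
    posix = Path(path).as_posix()
    label = description or Path(path).name
    return f"[[file:{posix}][{label}]]"


if __name__ == "__main__":
    # Hypothetical file name, as it might be typed after a scan.
    target = archive_path("2013-01-01T23.46_invoice.pdf")
    print(target.as_posix())
    print(org_link(target, "Invoice"))
```

The sketch only computes the target path and the link text; actually
moving the file and expanding "~" (for example with Path.expanduser) is
left out on purpose.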