From mboxrd@z Thu Jan 1 00:00:00 1970 From: Achim Gratz Subject: Re: [OT] Scanning for archiving Date: Sat, 05 Nov 2011 21:34:22 +0100 Message-ID: <87fwi2ux4h.fsf@Rainer.invalid> References: Mime-Version: 1.0 Content-Type: text/plain Return-path: Received: from eggs.gnu.org ([140.186.70.92]:36844) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RMmwt-0004n2-8A for emacs-orgmode@gnu.org; Sat, 05 Nov 2011 16:34:44 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1RMmwr-0003oI-Pv for emacs-orgmode@gnu.org; Sat, 05 Nov 2011 16:34:43 -0400 Received: from lo.gmane.org ([80.91.229.12]:54011) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RMmwr-0003oD-EN for emacs-orgmode@gnu.org; Sat, 05 Nov 2011 16:34:41 -0400 Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1RMmwq-0006mf-5Y for emacs-orgmode@gnu.org; Sat, 05 Nov 2011 21:34:40 +0100 Received: from p57aaaf55.dip.t-dialin.net ([87.170.175.85]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sat, 05 Nov 2011 21:34:40 +0100 Received: from Stromeko by p57aaaf55.dip.t-dialin.net with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Sat, 05 Nov 2011 21:34:40 +0100 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org Marcelo de Moraes Serpa writes: > I just bought a scanner and started to scan important documents as a > backup, and archiving them with meaningful metadata in orgmode files. > Then a question came to mind - what dpi to use? I'm not really savvy > when it comes to scanning or printing, and I want like a dpi that > allows me to reprint the document at an acceptable quality later if > necessary, but that also doesn't take that much space (600dpi pdfs > take around 5MB). Fax in fine mode has about 200dpi resolution. The raw scan should be in higher resolution (usually 2x-4x the target resolution depending on the document quality). The file to be archived then needs to be compressed (lossless compression is preferred, e.g. TIFF or PNG) and the bit depth reduced (black and white, usually). When making PDF files you need to make sure that the image data doesn't get re-coded (often into much inferior JPEG). For documents containing (color) images it is often preferrable to separately treat text and images. The best compression would be achieved if the whole text was extracted via OCR, but that is probably a lot more effort than you're willing to spend. Regards, Achim. -- +<[Q+ Matrix-12 WAVE#46+305 Neuron microQkb Andromeda XTk Blofeld]>+ Samples for the Waldorf Blofeld: http://Synth.Stromeko.net/Downloads.html#BlofeldSamplesExtra