From: Alan L Tyree
Subject: Making ePub books
Date: Sun, 11 Dec 2011 17:59:30 +1100
Message-ID: <1323586770.20628.12@windy>
To: emacs-orgmode@gnu.org

Debian Squeeze; org 7.7; emacs 23.2.1

I am back to trying to make ePub books from org articles/books. I am working on a book which currently produces about 100 pages in LaTeX export. It will be about 200 pages when finished.

ePub uses XHTML for the main content. So, I export the org file to HTML. It verifies as a valid XHTML 1.0 file at the W3C validation site: http://validator.w3.org/

OK. Then wrap it up in the mess that is the ePub specification. It actually reads OK in FBReader and in Iceweasel with the ePub add-on, BUT it does not validate. There are several problems, but most of the errors involve the "name" attribute. For example:

1 History

ePub does not like the name in there. Wipe out all the name="xxx" and the problem goes away. Everything else still works.

I know that I can do a post-export cleanup of the XHTML file, but I wonder if this is set in some variable that I cannot find.

And, as a general question, why have both name="sec-1" and id="sec-1" in the same element?

I would like to automate everything to go from org to ePub. It doesn't seem too hard, but I'm a legal academic, not a programmer :-). Any pointers appreciated.

Cheers,
Alan

-- 
Alan L Tyree http://www2.austlii.edu.au/~alan
Tel: 04 2748 6206 sip:172385@iptel.org
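
The post-export cleanup mentioned above could be sketched roughly as follows. This is only an illustration, not part of org-mode: the function name strip_name_attributes and the sample markup are hypothetical, and the sketch assumes the offending name="..." attributes sit on <a> tags that also carry an id. It deliberately leaves other tags alone, since name is still meaningful on elements such as <meta>.

```python
import re

# Opening <a ...> tags only; \b keeps this from matching <abbr> and similar.
A_TAG = re.compile(r'<a\b[^>]*>')
# A name="..." attribute together with the whitespace in front of it.
NAME_ATTR = re.compile(r'\s+name="[^"]*"')


def strip_name_attributes(xhtml: str) -> str:
    """Remove name="..." from <a> tags that also have an id attribute.

    Hypothetical helper: assumes double-quoted attribute values,
    as in the name="sec-1" id="sec-1" pair quoted in the message.
    """
    def clean(match):
        tag = match.group(0)
        # Only drop name when an id remains to serve as the link target.
        if re.search(r'\sid="', tag):
            return NAME_ATTR.sub('', tag)
        return tag

    return A_TAG.sub(clean, xhtml)


# Assumed sample markup; the real export may differ.
sample = '<a name="sec-1" id="sec-1"></a>1 History'
print(strip_name_attributes(sample))
```

To apply it to an exported file, read the file into a string, pass it through the function, and write the result back out before packaging the ePub. A regular expression is used here instead of an XML parser only to keep the sketch short and to avoid touching the DOCTYPE or entities; a proper XML tool may be the safer choice for a whole book.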