From mboxrd@z Thu Jan 1 00:00:00 1970 From: Alan L Tyree Subject: ePub construction Date: Wed, 14 Dec 2011 09:45:35 +1100 Message-ID: <1323816335.4781.0@windy> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([140.186.70.92]:54252) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rab66-0004Yv-M7 for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 17:45:19 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Rab65-0003ts-4E for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 17:45:18 -0500 Received: from mail-ww0-f49.google.com ([74.125.82.49]:52309) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rab64-0003tm-UX for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 17:45:17 -0500 Received: by wgbdt11 with SMTP id dt11so276074wgb.30 for ; Tue, 13 Dec 2011 14:45:15 -0800 (PST) Content-Disposition: inline List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: emacs-orgmode@gnu.org G'day, Here are the modifications that I needed to make to the exported html=20 file (using the standard exporter) in order to get a valid ePub=20 document. http://threepress.org/document/epub-validate provides an=20 on-line validation service. My document is relatively simple, but does have footnotes, citations=20 and formatting commands. I will be happy to add a fuller description to worg if someone will=20 point me to some instructions. 1. Make the html file have utf-8 encoding; there are probably several=20 ways to do this, but I used the emacs23 menu MULE options on the org=20 document. 2 Change the language statement by removing the "lang=3D" attribute: From: To: 3 Change anchor descriptions associated with section headings by removing "name=3D" entries, for example, From: To: NOTE: this can now be done by customising org-export-html-headline- anchor-format (Thanks Bastien!!) 4. Change anchor class=3D"footref" by replacing "name=3D" with "id=3D". For example: from: to: 5. Change anchor class=3D"footnum" by replacing "name=3D" with "id=3D". For example: from: 1 to: 1 Cheers, Alan --=20 Alan L Tyree http://www2.austlii.edu.au/~alan Tel: 04 2748 6206 sip:172385@iptel.org From mboxrd@z Thu Jan 1 00:00:00 1970 From: Nick Dokos Subject: Re: ePub construction Date: Tue, 13 Dec 2011 20:42:38 -0500 Message-ID: <10534.1323826958@alphaville.americas.hpqcorp.net> References: <1323816335.4781.0@windy> Reply-To: nicholas.dokos@hp.com Return-path: Received: from eggs.gnu.org ([140.186.70.92]:51763) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Radrr-00060q-3S for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 20:42:50 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Radrl-0007e3-GX for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 20:42:47 -0500 Received: from g5t0006.atlanta.hp.com ([15.192.0.43]:25042) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Radrk-0007Yd-Mp for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 20:42:41 -0500 In-Reply-To: Message from Alan L Tyree of "Wed, 14 Dec 2011 09:45:35 +1100." <1323816335.4781.0@windy> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Alan L Tyree Cc: nicholas.dokos@hp.com, emacs-orgmode@gnu.org Alan L Tyree wrote: > G'day, > > Here are the modifications that I needed to make to the exported html > file (using the standard exporter) in order to get a valid ePub > document. http://threepress.org/document/epub-validate provides an > on-line validation service. > > My document is relatively simple, but does have footnotes, citations > and formatting commands. > > I will be happy to add a fuller description to worg if someone will > point me to some instructions. > > > 1. Make the html file have utf-8 encoding; there are probably several > ways to do this, but I used the emacs23 menu MULE options on the org > document. > > 2 Change the language statement by removing the "lang=" attribute: > > From: xml:lang="en"> > > To: > > 3 Change anchor descriptions associated with section headings by > removing "name=" entries, for example, > > From: > > To: > > NOTE: this can now be done by customising org-export-html-headline- > anchor-format (Thanks Bastien!!) > > 4. Change anchor class="footref" by replacing "name=" with "id=". For > example: > > from: > > to: > > 5. Change anchor class="footnum" by replacing "name=" with "id=". For > example: > > from: 1 > > to: 1 > Interesting - I downloaded the epub validator, ran Calibre on a silly little org-produced HTML file and duplicated your "lang" problem. AFAICT, all of the problems you mention above are legal HTML4, so either the epub spec (which I have not looked at: do you have a pointer?) is made to trip people up by enforcing restrictions that they dreamed up, or the validator is not quite as smart as it should be. Just for kicks, I ran it against the Calibre Quick Start Guide (the only "real" ebook I had access to) and it gave me an error :-) ,---- | Epubcheck Version 3.0b3 | | ERROR: Calibre Quick Start Guide - John Schember.epub/OEBPS/content.opf(5,57): attribute "file-as" not allowed here; expected attribute "id", "opf:file-as", "opf:role" or "xml:lang" | | Check finished with warnings or errors! `---- so, presuming that the ebook is "fine", I'm not sure how seriously to take the validator. Nick PS In case it's not obvious, all I know about ebooks is what is contained in this email. Caveat emptor. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Alan L Tyree Subject: Re: ePub construction Date: Wed, 14 Dec 2011 14:14:02 +1100 Message-ID: <1323832442.2949.1@misty> References: <10534.1323826958@alphaville.americas.hpqcorp.net> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([140.186.70.92]:54594) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RafHj-0005QY-8E for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 22:13:36 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1RafHh-0006jF-QK for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 22:13:35 -0500 Received: from mail-qy0-f169.google.com ([209.85.216.169]:35279) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RafHh-0006jA-Nj for emacs-orgmode@gnu.org; Tue, 13 Dec 2011 22:13:33 -0500 Received: by qcsd17 with SMTP id d17so266869qcs.0 for ; Tue, 13 Dec 2011 19:13:33 -0800 (PST) In-Reply-To: <10534.1323826958@alphaville.americas.hpqcorp.net> (from nicholas.dokos@hp.com on Wed Dec 14 12:42:38 2011) Content-Disposition: inline List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: nicholas.dokos@hp.com Cc: emacs-orgmode@gnu.org On 14/12/11 12:42:38, Nick Dokos wrote: > Alan L Tyree wrote: >=20 > > G'day, > >=20 > > Here are the modifications that I needed to make to the exported > html=20 > > file (using the standard exporter) in order to get a valid ePub=20 > > document. http://threepress.org/document/epub-validate provides an=20 > > on-line validation service. > >=20 > > My document is relatively simple, but does have footnotes,=20 > citations >=20 > > and formatting commands. > >=20 > > I will be happy to add a fuller description to worg if someone will=20 > > point me to some instructions. > >=20 > >=20 > > 1. Make the html file have utf-8 encoding; there are probably > several=20 > > ways to do this, but I used the emacs23 menu MULE options on the=20 > org >=20 > > document. > >=20 > > 2 Change the language statement by removing the "lang=3D" attribute: > >=20 > > From: > xml:lang=3D"en"> > >=20 > > To: > >=20 > > 3 Change anchor descriptions associated with section headings by > > removing "name=3D" entries, for example, > >=20 > > From: > >=20 > > To: > >=20 > > NOTE: this can now be done by customising org-export-html-headline- > > anchor-format (Thanks Bastien!!) > >=20 > > 4. Change anchor class=3D"footref" by replacing "name=3D" with "id=3D". > For > > example: > >=20 > > from: > >=20 > > to: > >=20 > > 5. Change anchor class=3D"footnum" by replacing "name=3D" with "id=3D". > For > > example: > >=20 > > from: 1 > >=20 > > to: 1 > >=20 >=20 > Interesting - I downloaded the epub validator, ran Calibre on a silly > little org-produced HTML file and duplicated your "lang" problem. >=20 > AFAICT, all of the problems you mention above are legal HTML4, so > either > the epub spec (which I have not looked at: do you have a pointer?) is > made to trip people up by enforcing restrictions that they dreamed=20 > up, > or the validator is not quite as smart as it should be. Yes, the org export file validates at w3c as a valid XHTML1.0 file. The=20 ePub spec is here: http://idpf.org/epub but I admit fatigue in trying=20 to wade through it, so I don't know what XHTML1.0 constructions it=20 objects to. It seems like the kind of document that nobody has ever=20 read: sort of like EULAs and express warranties. I just made the changes in response to the validation errors. >=20 > Just for kicks, I ran it against the Calibre Quick Start Guide (the > only > "real" ebook I had access to) and it gave me an error :-) >=20 > ,---- > | Epubcheck Version 3.0b3 > |=20 > | ERROR: Calibre Quick Start Guide - John > Schember.epub/OEBPS/content.opf(5,57): attribute "file-as" not=20 > allowed > here; expected attribute "id", "opf:file-as", "opf:role" or=20 > "xml:lang" > |=20 > | Check finished with warnings or errors! > `---- >=20 > so, presuming that the ebook is "fine", I'm not sure how seriously to > take the validator. There is something to that. Calibre explicitly disclaims any intent to=20 produce an epub file that validates. In fact, the FAQ seems to suggest=20 that a Calibre produced file is *better* than one that validates: http://manual.calibre-ebook.com/faq.html#the-epub-i-produced-with-app-is-no= t-valid The problem is that some marketing sites require validation, although I=20 am not sure how much that is enforced. >=20 > Nick >=20 > PS In case it's not obvious, all I know about ebooks is what is > contained in this email. Caveat emptor. Ahhh, what about statutory warranties :-). Cheers, Alan >=20 --=20 Alan L Tyree http://www2.austlii.edu.au/~alan Tel: 04 2748 6206 From mboxrd@z Thu Jan 1 00:00:00 1970 From: Olaf Dietsche Subject: Re: ePub construction Date: Wed, 14 Dec 2011 09:53:35 +0100 Message-ID: <87fwgn3568.fsf@rat.lan> References: <1323832442.2949.1@misty> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Return-path: Received: from eggs.gnu.org ([140.186.70.92]:37423) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rakb3-0004ru-Nv for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 03:53:57 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Rakb1-0000c4-Qq for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 03:53:53 -0500 Received: from www85.your-server.de ([213.133.104.85]:58764) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rakb1-0000bD-In for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 03:53:51 -0500 List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Alan L Tyree Cc: nicholas.dokos@hp.com, emacs-orgmode@gnu.org Alan L Tyree writes: > On 14/12/11 12:42:38, Nick Dokos wrote: >> >> Interesting - I downloaded the epub validator, ran Calibre on a silly >> little org-produced HTML file and duplicated your "lang" problem. >> >> AFAICT, all of the problems you mention above are legal HTML4, so >> either >> the epub spec (which I have not looked at: do you have a pointer?) is >> made to trip people up by enforcing restrictions that they dreamed >> up, >> or the validator is not quite as smart as it should be. > > Yes, the org export file validates at w3c as a valid XHTML1.0 file. The > ePub spec is here: http://idpf.org/epub but I admit fatigue in trying > to wade through it, so I don't know what XHTML1.0 constructions it > objects to. It seems like the kind of document that nobody has ever > read: sort of like EULAs and express warranties. I took a quick look through and right to the beginning "1.3: Relationship to Other Specifications" it says: "This specification combines subsets and applications of other specifications." Next, in "1.3.4: Relationship to XHTML and DTBook" it talks about "..., the Preferred Vocabularies do not include all XHTML 1.1 elements and attributes." So, even if you have a valid XHTML file, only a subset of it might be valid for an epub book. Same goes for CSS. Lazy as I am, I haven't looked through the other epub parts, since this seems not relevant here. Another interesting epub checker might be , which uses "... schemas that were developed by IDPF and DAISY. ..." according to their website. Regards, Olaf From mboxrd@z Thu Jan 1 00:00:00 1970 From: Nick Dokos Subject: Re: ePub construction Date: Wed, 14 Dec 2011 08:58:58 -0500 Message-ID: <20896.1323871138@alphaville.dokosmarshall.org> References: <1323832442.2949.1@misty> <87fwgn3568.fsf@rat.lan> Reply-To: nicholas.dokos@hp.com Return-path: Received: from eggs.gnu.org ([140.186.70.92]:38703) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RapMR-0002Yl-FD for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 08:59:12 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1RapMM-0004Mc-Tz for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 08:59:07 -0500 Received: from g6t0187.atlanta.hp.com ([15.193.32.64]:30011) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RapMM-0004MA-Qb for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 08:59:02 -0500 In-Reply-To: Message from Olaf Dietsche of "Wed, 14 Dec 2011 09:53:35 +0100." <87fwgn3568.fsf@rat.lan> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Olaf Dietsche Cc: nicholas.dokos@hp.com, emacs-orgmode@gnu.org, Alan L Tyree Olaf Dietsche wrote: > ... > Another interesting epub checker might be > , which uses "... schemas that were > developed by IDPF and DAISY. ..." according to their website. > I'm pretty sure it's the same checker as the online checker that Alan pointed to. Nick From mboxrd@z Thu Jan 1 00:00:00 1970 From: suvayu ali Subject: Re: ePub construction Date: Thu, 15 Dec 2011 00:24:16 +0100 Message-ID: References: <1323816335.4781.0@windy> Mime-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Return-path: Received: from eggs.gnu.org ([140.186.70.92]:38828) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RayBj-0003hT-CQ for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 18:24:40 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1RayBi-0002JV-BC for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 18:24:39 -0500 Received: from mail-ey0-f169.google.com ([209.85.215.169]:61733) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1RayBi-0002JO-4Y for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 18:24:38 -0500 Received: by eaai13 with SMTP id i13so1607279eaa.0 for ; Wed, 14 Dec 2011 15:24:37 -0800 (PST) In-Reply-To: <1323816335.4781.0@windy> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Alan L Tyree Cc: emacs-orgmode@gnu.org Hi Alan, On Tue, Dec 13, 2011 at 23:45, Alan L Tyree wrote: > I will be happy to add a fuller description to worg if someone will > point me to some instructions. Here is the latest announcement from Jason: -- Suvayu Open source is the future. It sets us free. From mboxrd@z Thu Jan 1 00:00:00 1970 From: Alan L Tyree Subject: Re: ePub construction Date: Thu, 15 Dec 2011 10:56:14 +1100 Message-ID: <1323906974.10491.3@windy> References: <1323816335.4781.0@windy> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable Return-path: Received: from eggs.gnu.org ([140.186.70.92]:40756) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rayfz-0002Dp-Qy for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 18:55:57 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1Rayfy-0001By-FG for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 18:55:55 -0500 Received: from mail-vw0-f41.google.com ([209.85.212.41]:42043) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1Rayfy-0001Br-Cw for emacs-orgmode@gnu.org; Wed, 14 Dec 2011 18:55:54 -0500 Received: by vbbfn1 with SMTP id fn1so1297842vbb.0 for ; Wed, 14 Dec 2011 15:55:53 -0800 (PST) In-Reply-To: (from fatkasuvayu+linux@gmail.com on Thu Dec 15 10:24:16 2011) Content-Disposition: inline List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: suvayu ali Cc: emacs-orgmode@gnu.org On 15/12/11 10:24:16, suvayu ali wrote: > Hi Alan, >=20 > On Tue, Dec 13, 2011 at 23:45, Alan L Tyree > wrote: > > I will be happy to add a fuller description to worg if someone will > > point me to some instructions. >=20 > Here is the latest announcement from Jason: >=20 > Thanks for that - I had seen the thread but didn't recognise its=20 significance. Duh! Alan >=20 > --=20 > Suvayu >=20 > Open source is the future. It sets us free. >=20 --=20 Alan L Tyree http://www2.austlii.edu.au/~alan Tel: 04 2748 6206 sip:172385@iptel.org