From mboxrd@z Thu Jan 1 00:00:00 1970 From: Sebastian Rose Subject: Re: Re: Custom entry IDs in HTML export Date: Thu, 16 Apr 2009 15:14:39 +0200 Message-ID: <87vdp4u4e8.fsf@kassiopeya.MSHEIMNETZ> References: <87myb7w2s9.fsf@CPU107.opentrends.net> <6BF0FCBC-4343-4B8C-9A16-F4B9AC9B0F48@gmail.com> <87eiwiluft.fsf@gollum.intra.norang.ca> <87y6uqwsjw.fsf@kassiopeya.MSHEIMNETZ> <871vsfjkm3.fsf@CPU107.opentrends.net> <1FEE16B4-2913-487C-8822-094FF4EC725C@gmail.com> <878wm1ugml.fsf@kassiopeya.MSHEIMNETZ> <43758593-D9D0-43BC-B4D9-14E036C66271@gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Return-path: Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1LuRND-0003jZ-Ps for emacs-orgmode@gnu.org; Thu, 16 Apr 2009 09:11:23 -0400 Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1LuRN6-0003eh-V2 for emacs-orgmode@gnu.org; Thu, 16 Apr 2009 09:11:21 -0400 Received: from [199.232.76.173] (port=50252 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1LuRN6-0003e8-Lu for emacs-orgmode@gnu.org; Thu, 16 Apr 2009 09:11:16 -0400 Received: from mail.gmx.net ([213.165.64.20]:41394) by monty-python.gnu.org with smtp (Exim 4.60) (envelope-from ) id 1LuRN5-0008SS-1e for emacs-orgmode@gnu.org; Thu, 16 Apr 2009 09:11:15 -0400 In-Reply-To: <43758593-D9D0-43BC-B4D9-14E036C66271@gmail.com> (Carsten Dominik's message of "Thu, 16 Apr 2009 13:28:52 +0200") List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Carsten Dominik Cc: org-mode mailing list , Bernt Hansen Hm - counter arguments? The only counter argument is, that hand made IDs for links are prone to error. But that risk should be up to the user. I actually changed my mind a little in this concern. If the user clicks a section link in the toc to jump to a section, he can bookmark the page with exactly that jump target. If the jump target (the ID) is human readable, the bookmark is more verbose. Just one wish: The containers should reflect that change (HRID =3D human readable id):

headline

sections content...
That way the script would keep working with older pages. Automatic IDs and human readable ones could be mixed. The '' anchors are scanned anyway, as are all jump targets in the page. Maybe this is even the point to re-work the parser of org-info.js to become independent of the TOC at all. The script could search for headings instead. That's more work, but the script would then work for all HTML pages with a structure similar to the org-export's one:
content
=20 but I could postpone this, if you fullfill my wish above. Best wishes Sebastian Carsten Dominik writes: > On Apr 16, 2009, at 10:50 AM, Sebastian Rose wrote: > >> Carsten Dominik writes: >>> Hi Sebastian, >>> >>> I kind of like the idea to have a property that can be >>> used to set an ID, as an alternative to the <> >>> notation. Actually, using a property seems a lot cleaner, >>> thanks for coming up with this idea, Daniel. >>> >>> I can also follow the reasoning that it is useful to have >>> the table of contents link to the human-readable id, because >>> it provides a general, simple workflow to retrieve a link that >>> will persist through changes of the document. This workflow >>> was described also by Bernt earlier in this thread. >>> >>> Finally, I also agree that the main id in the

tag >>> should be the automatically generated one because this is >>> best for automatic processing and because of all the arguments >>> you have presented. >>> >>> Would it cause problems for org-info.js if the toc points to >>> a user specified anchor in the headline, instead of the main >>> ID that is inside the

tag? THis would really be the only >>> required change. >> >> >> I'll have to test this before I can give a final answer to this >> question. >> >> But regardless of the results, I will adjust the script to reflect that >> change. The script should not rule the HTML export and it will be an >> easy thing to do. > > But I do want to hear any counter arguments you might have.... > > - Carsten > >> >> Sebastian >> >> >> >>> - Carsten >>> >>> >>> On Mar 30, 2009, at 1:49 PM, Daniel Clemente wrote: >>> >>>> El dv, mar 27 2009, Sebastian Rose va escriure: >>>>> >>>>> What we have now, just as Carstens said: >>>>> >>>>> # <> >>>>> * Section B >>>>> >>>>> Creates this headline in HTML: >>>>> >>>>>

2 Section B >>>>>

>>>>> >>>>> This is enough for all the use cases I can think of. >>>>> >>>> >>>> Yes, this is enough except for two things: >>>> 1. The TOC still links to #sec-2 and the user can't change that >>>> 2. Your syntax doesn't fold very well in the outliner. I mean: if you = use >>>> >>>>> # <> >>>>> * Section B >>>> >>>> then the comment appears at the end of the previous section, and you c= an >>>> miss >>>> it when you are viewing the heading =E2=80=9ESection B=E2=80=9C. I wo= uld swap both lines >>>> (solution 1): >>>> >>>>> * Section B >>>>> # <> >>>> >>>> But since there are already LOGBOOK drawers under the heading, it woul= d be a >>>> lot clearer to use a property, like EXPORT_ID (solution 2): >>>> >>>>> * Section B >>>>> :PROPERTIES: >>>>> :EXPORT_ID: human-readable >>>>> :END: >>>> >>>> >>>> In this way, the TOC can reliably find the EXPORT_ID, and then generat= e: >>>>>

2 Section B >>>>>

>>>> >>>> (You could also leave *just* the human-readable id, but having two is = not >>>> bad. >>>> >>>> >>>> I would prefer solution 1, but I don't because I'm not sure that the T= OC can >>>> find the ID if it is written as a comment anywhere under the heading = (and >>>> together with other things). >>>> >>>> Solution 2 involves thus: a new property to specify the human- >>>> readable entry ID, which will be used to link to the entry. The automa= tic ID >>>> (#sec-2) will still work for all entrys. >>>> >>>> >>>>> >>>>> * Distinguishing automatic and human readable IDs >>>>> >>>>> One thing I like is, that we now _can_ distinguish the >>>>> `human-readable-target' (human readable) from the `sec-2' (not human >>>>> readable and not context related) using a regular expression. >>>>> >>>>> In org-info.js, I can now prefere the human readable ID in from an >>>>> automatic created one, and thus use that to create the links for `l' >>>>> and `L'. The same holds true for other programming languages and >>>>> parsers. >>>>> >>>>> If we open the

's ID for user defined values (bad), we can not >>>>> distinguish those ID's using a regular expression and there is no way >>>>> to detect the human readable one. There will be no way to _know_ that >>>>> the 's ID is the prefered one used for human readable links. >>>>> >>>> >>>> Solution 2 doesn't break the parsing techniques you use; in fact it ca= n also >>>> make clearer which ID is the human readable one and which one not. >>>> >>>> >>>> This is not extremely important; just useful: >>>> - for pages with many incoming links from external sites >>>> - to ensure link integrity (now you can't assure that links will still= work >>>> in >>>> 1 year ... or in some weeks) >>>> - to avoid that HTML visitors get directed to a wrong section and can'= t find >>>> what they searched >>>> >>>> >>>> Greetings, >>>> Daniel >>>> >>>> >>>> _______________________________________________ >>>> Emacs-orgmode mailing list >>>> Remember: use `Reply All' to send replies to the list. >>>> Emacs-orgmode@gnu.org >>>> http://lists.gnu.org/mailman/listinfo/emacs-orgmode >>> >> >> -- >> Sebastian Rose, EMMA STIL - mediendesign, Niemeyerstr.6, 30449 Hannover >> Tel.: +49 (0)511 - 36 58 472 >> Fax: +49 (0)1805 - 233633 - 11044 >> mobil: +49 (0)173 - 83 93 417 >> Email: s.rose@emma-stil.de, sebastian_rose@gmx.de >> Http: www.emma-stil.de > --=20 Sebastian Rose, EMMA STIL - mediendesign, Niemeyerstr.6, 30449 Hannover Tel.: +49 (0)511 - 36 58 472 Fax: +49 (0)1805 - 233633 - 11044 mobil: +49 (0)173 - 83 93 417 Email: s.rose@emma-stil.de, sebastian_rose@gmx.de Http: www.emma-stil.de