emacs-orgmode@gnu.org archives
 help / color / mirror / code / Atom feed
From: Aankhen <aankhen@gmail.com>
To: amscopub-mail@yahoo.com, Org-mode ml <emacs-orgmode@gnu.org>
Subject: Re: Feature request: modify italic regexp list to include non-breaking space and other characters
Date: Thu, 21 Apr 2011 00:48:21 +0530	[thread overview]
Message-ID: <BANLkTinX2Mkm3zVb5=rPb9vdQQ119PtCtQ@mail.gmail.com> (raw)
In-Reply-To: <824926.32909.qm@web120711.mail.ne1.yahoo.com>

On Wed, Apr 20, 2011 at 20:34,  <amscopub-mail@yahoo.com> wrote:
> Sample code:
>  Using /a/’s and /b/’s, write add /x/ + 2.
>           ^         ^                ^ ^
>       Right single quotation mark    Non-breaking space
>
> Expected HTML export:
>  Using <i>a</i>’s and <i>b</i>’s, write <i>x</i> + 2.
>
> Actual HTML export:
>  Using /a/’s and /b/’s, write add /x/ + 2.

Interestingly, the development version treats this differently, as it
considers all the text from the first slash to the last part of the
emphasis:

,----
| <p>Using <i>a/’s and /b/’s, write add /x</i> + 2.
| </p>
`----

(This behaviour occurs in the original Org buffer as well, in case
anyone’s wondering.  The exported HTML was just the easiest way to
show it.)

> If it's not clear, the sample code uses the unicode character "right single character" instead of an apostrophe and "non-breaking space" instead of regular white space.
>
> It makes sense to use these characters this way, however, orgmode neither displays the italic expressions correctly nor exports them correctly.
>
> I believe older versions of orgmode worked in the expected way.
>
> How can I modify the regexp list? Bold characters are also affected.

These two variables are used to configure the regexp:

,----[ C-h v org-emphasis-alist RET ]
| org-emphasis-alist is a variable defined in `org.el'.
| Its value is (("*" bold "<b>" "</b>")
|  ("/" italic "<i>" "</i>")
|  ("_" underline "<span style=\"text-decoration:underline;\">" "</span>")
|  ("=" org-code "<code>" "</code>" verbatim)
|  ("~" org-verbatim "<code>" "</code>" verbatim)
|  ("+"
|   (:strike-through t)
|   "<del>" "</del>"))
|
|
| Documentation:
| Special syntax for emphasized text.
| Text starting and ending with a special character will be emphasized, for
| example *bold*, _underlined_ and /italic/.  This variable sets the marker
| characters, the face to be used by font-lock for highlighting in Org-mode
| Emacs buffers, and the HTML tags to be used for this.
| For LaTeX export, see the variable `org-export-latex-emphasis-alist'.
| For DocBook export, see the variable `org-export-docbook-emphasis-alist'.
| Use customize to modify this, or restart Emacs after changing it.
|
| You can customize this variable.
`----

,----[ C-h v org-emphasis-regexp-components RET ]
| org-emphasis-regexp-components is a variable defined in `org.el'.
| Its value is (" 	('\"{" "- 	.,:!?;'\")}\\" " 	
\n,\"'" "." 1)
|
|
| Documentation:
| Components used to build the regular expression for emphasis.
| This is a list with five entries.  Terminology:  In an emphasis string
| like " *strong word* ", we call the initial space PREMATCH, the final
| space POSTMATCH, the stars MARKERS, "s" and "d" are BORDER characters
| and "trong wor" is the body.  The different components in this variable
| specify what is allowed/forbidden in each part:
|
| pre          Chars allowed as prematch.  Beginning of line will be
allowed too.
| post         Chars allowed as postmatch.  End of line will be allowed too.
| border       The chars *forbidden* as border characters.
| body-regexp  A regexp like "." to match a body character.  Don't use
|              non-shy groups here, and don't allow newline here.
| newline      The maximum number of newlines allowed in an emphasis exp.
|
| Use customize to modify this, or restart Emacs after changing it.
|
| You can customize this variable.
|
| [back]
`----

I’d say that ‘pre’/‘post’ should really contain [[:space:]], but then
Org’s syntax table seems to treat the non-breaking space as
punctuation, so that wouldn’t help.  You could try adding the
character itself to both of those categories for a fix.  You’ll need
to restart Emacs afterwards (unless you used the Customize interface)
so that ‘org-emph-re’ is updated accordingly.

> Using 7.4.

This little problem aside, you might want to upgrade (if not to the
development version, at least to 7.5).

Aankhen

  reply	other threads:[~2011-04-20 19:18 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-04-20 15:04 Feature request: modify italic regexp list to include non-breaking space and other characters amscopub-mail
2011-04-20 19:18 ` Aankhen [this message]
2011-04-22 12:31   ` Uriel Avalos

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.orgmode.org/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='BANLkTinX2Mkm3zVb5=rPb9vdQQ119PtCtQ@mail.gmail.com' \
    --to=aankhen@gmail.com \
    --cc=amscopub-mail@yahoo.com \
    --cc=emacs-orgmode@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).