emacs-orgmode@gnu.org archives
 help / color / mirror / code / Atom feed
From: Sebastian Rose <sebastian_rose@gmx.de>
To: Carsten Dominik <carsten.dominik@gmail.com>
Cc: emacs-orgmode@gnu.org
Subject: Re: Insert link with "foreign" character - cannot save
Date: Sun, 22 Nov 2009 02:12:55 +0100	[thread overview]
Message-ID: <87r5rrbbvs.fsf@gmx.de> (raw)
In-Reply-To: <87einseqhe.fsf@gmx.de> (Sebastian Rose's message of "Sat, 21 Nov 2009 00:17:01 +0100")



OK, this is a severe Emacs 23 bug.


I can reproduce the errors like this:


sh$ mkdir test
sh$ cd test
sh$ touch ümläute.org
sh$ ls
ümläute.org
sh$ convmv --notest -f utf-8 -t iso-8859-1 ümläute.org 
Your Perl version has fleas #37757 #49830 
mv "./ümläute.org"	"./�ml�ute.org"
Ready!
sh$ ls
?ml?ute.org


Now I open an Org-file in Emacs 23, and do

  C-u C-c C-l

Navigate to that file and

BAMM - cannot save my work anymore!!!



Another nice test: Try to recode an UTF-8 encoded file name with German
Umlauts unsing

  M-x recode-file-name 

It does not work either.



I'll file an Emacs bug report as this is no bug in Org-mode.




  Sebastian





Sebastian Rose <sebastian_rose@gmx.de> writes:
> Carsten Dominik <carsten.dominik@gmail.com> writes:
>
>> Hi Mattias,
>>
>> I tried that, and my buffer swiched to unicode encoding automatically.
>>
>> Unfortunately I don't know much about coding systems, and so I do
>> not know how to fix this.
>>
>> Anyone????
>>
>> - Carsten
>>
>> On Nov 12, 2009, at 10:28 AM, Mattias Jämting wrote:
>>
>>> (I'm using English Windows Vista x64, Emacs 23.1 and Org-mode 6.32b)
>>>
>>> So i'm doing C-u C-c C-l to browse for a file in order to insert a link to
>>> it.
>>>
>>> The path and/or the filename contains for instance an ö (an o with two dots
>>> above it, also the swedish word for "island"), which gets translated in my
>>> org-file as \366.
>>>
>>> When I try to save the file I see the message:
>>>
>>> These default coding systems were tried to encode text
>>> in the buffer `jwd.org':
>>>   (utf-8-dos (79 . 4194294))
>>> However, each of them encountered characters it couldn't encode:
>>>   utf-8-dos cannot encode these:  These default coding systems were tried
>>> to encode text
>>> in the buffer `jwd.org':
>>>   (utf-8-dos (79 . 4194294))
>>> However, each of them encountered characters it couldn't encode:
>>>   utf-8-dos cannot encode these:  \366
>>>
>>> Next I tried to hack myself a fix :-)
>>>
>>> I added (?\366 . "%F6") to org-link-escape-chars and ran make on it again,
>>> but it didn't seem to work.
>
>
> It's true, you cannot encode the bytes with dec. values above 127 in
> utf-8 (see `man utf-8', unicode.org, whatever).
>
>
>
> Seems your filenames are not utf-8 encoded.
>
> Here it works, because on current Linux distros (and MAC OS??) filenames
> are utf-8 encoded:
>
>   C-u C-c C-f
>   nä TAB RET
>
>   [[file:nächtes-nötiges.org][Umlaute in Dateinamen]]
>
>
> Maybe, if the file is from an old system, rename it (twice, to give it
> the original name) and try again. Would that work?
>
>
>
>
> There's an interesting discussion going on on emacs-devel, that might be
> related (but it's not about filenames). You may read the entire thread
> here:
>
>   http://lists.gnu.org/archive/html/emacs-devel/2009-11/msg00661.html
>
>
> The fazit so far, as I understood it, is, that Emasc 23 distinguishes
> single and multibyte strings. Better not use array functions to handle
> strings (which are multibyte internaly) in Emacs 23.
>
> The OP did
>
>   (setq nl "\n")
>   (aset nl 0 ?ñ
>   (insert nl)
>
> which sets the first _byte_ of an array to 241, which in turn has no
> valid representation on screen as character in Unicode (see `man urf-8'
> and unicode.org). Thus Emacs insert \361 - for some reason :)
>
>
>
>    Sebastian
>
>
> _______________________________________________
> Emacs-orgmode mailing list
> Remember: use `Reply All' to send replies to the list.
> Emacs-orgmode@gnu.org
> http://lists.gnu.org/mailman/listinfo/emacs-orgmode

  reply	other threads:[~2009-11-22  1:13 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-11-12  9:28 Insert link with "foreign" character - cannot save Mattias Jämting
2009-11-20 17:03 ` Carsten Dominik
2009-11-20 23:17   ` Sebastian Rose
2009-11-22  1:12     ` Sebastian Rose [this message]
2009-11-22  1:32       ` Sebastian Rose
2009-12-07 23:36         ` Mattias Jämting
2009-12-08 16:37           ` Carsten Dominik
  -- strict thread matches above, loose matches on Subject: below --
2009-11-12  9:49 Mattias Jämting

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.orgmode.org/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87r5rrbbvs.fsf@gmx.de \
    --to=sebastian_rose@gmx.de \
    --cc=carsten.dominik@gmail.com \
    --cc=emacs-orgmode@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).