emacs-orgmode@gnu.org archives
* [tip/offtopic] A function to describe the characters of a word at point
@ 2022-07-13 10:49 Juan Manuel Macías
  2022-07-14 15:42 ` Marcin Borkowski
  0 siblings, 1 reply; 4+ messages in thread
From: Juan Manuel Macías @ 2022-07-13 10:49 UTC (permalink / raw)
  To: orgmode

Sorry for the slight offtopic.

Since Unicode and character issues come up here from time to time, I'm
sharing this 'homemade' function that I wrote a long time ago for my
work, in case someone finds it useful. It shows a brief descriptive list
of all characters in a word at point. Each character includes the
Unicode name, code, and canonical decomposition. Example:

ἄρχοντα >>

ἄ (#1f04) ... GREEK SMALL LETTER ALPHA WITH PSILI AND OXIA ... decomp: #1f00 #301
ρ (#3c1) ... GREEK SMALL LETTER RHO ... decomp: #3c1
χ (#3c7) ... GREEK SMALL LETTER CHI ... decomp: #3c7
ο (#3bf) ... GREEK SMALL LETTER OMICRON ... decomp: #3bf
ν (#3bd) ... GREEK SMALL LETTER NU ... decomp: #3bd
τ (#3c4) ... GREEK SMALL LETTER TAU ... decomp: #3c4
α (#3b1) ... GREEK SMALL LETTER ALPHA ... decomp: #3b1


#+begin_src emacs-lisp
  (defun describe-chars-word-at-point ()
    "Describe each character of the word at point."
    (interactive)
    (let ((word (current-word t t))
          (buf-name "*chars in word*"))
      (unless word
        (user-error "No word at point"))
      ;; Start from a fresh buffer on every call.
      (when (get-buffer buf-name)
        (kill-buffer buf-name))
      (with-current-buffer (get-buffer-create buf-name)
        ;; Iterate over the characters directly; no temporary buffer or
        ;; global variable is needed.
        (insert
         (mapconcat
          (lambda (char)
            (let* ((decomp (get-char-code-property char 'decomposition))
                   ;; Skip a leading formatting tag (a symbol such as
                   ;; `compat') so that only code points are formatted.
                   (codes (if (symbolp (car decomp)) (cdr decomp) decomp)))
              (format "%c (#%x) ... %s ... decomp: %s"
                      char char
                      (or (get-char-code-property char 'name) "")
                      (mapconcat (lambda (cod) (format "#%x" cod)) codes " "))))
          word "\n"))
        (view-mode 1))
      (pop-to-buffer buf-name
                     '((display-buffer-below-selected display-buffer-at-bottom)
                       (inhibit-same-window . t)
                       (window-height . fit-window-to-buffer)))))
#+end_src
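
The per-character line can also be produced as a plain string, which is
handy when calling it from other Lisp code. What follows is only a
sketch: the name my/char-summary is hypothetical and not part of the
function above, and it assumes that a leading symbol in the
decomposition list (a formatting tag such as `compat') should simply be
skipped.

#+begin_src emacs-lisp
  (defun my/char-summary (char)
    "Return a one-line description of CHAR.
  Hypothetical helper, not part of the original function."
    (let* ((decomp (get-char-code-property char 'decomposition))
           ;; The list may start with a formatting tag (a symbol);
           ;; keep only the code points that follow it.
           (codes (if (symbolp (car decomp)) (cdr decomp) decomp)))
      (format "%c (#%x) ... %s ... decomp: %s"
              char char
              (or (get-char-code-property char 'name) "(unnamed)")
              (mapconcat (lambda (code) (format "#%x" code)) codes " "))))

  ;; One line per character of a string:
  (mapconcat #'my/char-summary "ἄρχοντα" "\n")
#+end_src

Because the helper returns a string instead of filling a buffer, the
same line could be sent to the echo area, a tooltip or a test.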



* Re: [tip/offtopic] A function to describe the characters of a word at point
  2022-07-13 10:49 [tip/offtopic] A function to describe the characters of a word at point Juan Manuel Macías
@ 2022-07-14 15:42 ` Marcin Borkowski
  2022-07-14 22:30   ` Samuel Wales
  2022-07-15  0:56   ` Juan Manuel Macías
  0 siblings, 2 replies; 4+ messages in thread
From: Marcin Borkowski @ 2022-07-14 15:42 UTC (permalink / raw)
  To: Juan Manuel Macías; +Cc: orgmode


On 2022-07-13, at 12:49, Juan Manuel Macías <maciaschain@posteo.net> wrote:

> Sorry for the slight offtopic.

Not off-topic at all, as far as I'm concerned!  (Though sending this to
help-gnu-emacs might be an even better idea.)  I use `C-u C-x =' pretty
often, so I fully understand why someone might want to code something
like this.  Very nice, thanks for sharing!

You might want to extend it and create a minor mode which would display
data about the current character in the echo area, Eldoc-style, or in
a tooltip when you hover the mouse pointer over a character.  Depending
on what exactly you need, these ideas might be more or less useful, of
course.

Also, since the answer to quite a few org-related issues seems to be
"just insert a zero-width space", making those stand out (like
non-breaking spaces already are) could also be useful.  FWIW, I have
this function in my init.el:

(defun insert-zero-width-space ()
  "Insert Unicode character \"zero-width space\"."
  (interactive)
  (insert "​"))

(of course, the 0-width space is invisible between the quotes).
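
One possible way to make zero-width spaces stand out is a buffer-local
display table entry. This is a rough sketch under assumptions: the
function name my/show-zero-width-spaces is hypothetical, and the
replacement glyph (␣) and the `escape-glyph' face are arbitrary choices.

#+begin_src emacs-lisp
  (defun my/show-zero-width-spaces ()
    "Display U+200B in the current buffer as a visible, highlighted glyph.
  Hypothetical helper; the glyph and face are arbitrary choices."
    (interactive)
    (unless buffer-display-table
      ;; Start from the standard display table when there is one, so
      ;; that its entries keep applying in this buffer.
      (setq buffer-display-table
            (if standard-display-table
                (copy-sequence standard-display-table)
              (make-display-table))))
    ;; Show each zero-width space as ␣ in the `escape-glyph' face.
    (aset buffer-display-table #x200b
          (vector (make-glyph-code ?␣ 'escape-glyph))))
#+end_src

The buffer text is unchanged; only the way U+200B is drawn in that
buffer differs, so export and search behave as before.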

Best,
mbork



-- 
Marcin Borkowski
http://mbork.pl



* Re: [tip/offtopic] A function to describe the characters of a word at point
  2022-07-14 15:42 ` Marcin Borkowski
@ 2022-07-14 22:30   ` Samuel Wales
  2022-07-15  0:56   ` Juan Manuel Macías
  1 sibling, 0 replies; 4+ messages in thread
From: Samuel Wales @ 2022-07-14 22:30 UTC (permalink / raw)
  To: Marcin Borkowski; +Cc: Juan Manuel Macías, orgmode

good idea for command.  i like the additional ideas too like the help
text [i have that put in echo area even in gui].

for even more blue sky stuff, i was thinking along the lines of
information about characters, such as en/locale meanings for cjk.  or
furigana [ruby text] for the echo area.  requires lookup though.  (to
go along with meanings for input method. :))


-- 
The Kafka Pandemic

A blog about science, health, human rights, and misopathy:
https://thekafkapandemic.blogspot.com



* Re: [tip/offtopic] A function to describe the characters of a word at point
  2022-07-14 15:42 ` Marcin Borkowski
  2022-07-14 22:30   ` Samuel Wales
@ 2022-07-15  0:56   ` Juan Manuel Macías
  1 sibling, 0 replies; 4+ messages in thread
From: Juan Manuel Macías @ 2022-07-15  0:56 UTC (permalink / raw)
  To: Marcin Borkowski; +Cc: Samuel Wales, orgmode

Hi, Marcin and Samuel, thanks for your comments,

Marcin Borkowski writes:

> You might want to extend it and create a minor mode which would display
> data about the current character in the echo area, Eldoc-style, or in
> a tooltip when you hover the mouse pointer over a character.  Depending
> on what exactly you need, these ideas might be more or less useful, of
> course.

I have also written a smaller function that shows brief information about
a single character at point, something much simpler and less verbose
than describe-char. But it had never occurred to me to do something
eldoc-like with it. In my case, although I prefer quick information in
those contexts (describe-char also has its relaxing moments), I don't
feel such urgency :-).

In any case, something quick and dirty, just as a proof of concept,
could be this:

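(define-minor-mode char-info-at-point-mode
  "Show information about the character at point in the echo area."
  :init-value nil
  :lighter " chinfo"
  (if char-info-at-point-mode
      (add-hook 'post-command-hook #'char-name-at-point nil t)
    (remove-hook 'post-command-hook #'char-name-at-point t)))

(defun char-name-at-point ()
  "Display name, code point and decomposition of the character at point."
  (interactive)
  ;; `char-after' returns nil at the end of the buffer.  Do nothing there:
  ;; an error in `post-command-hook' would remove this function from it.
  (let ((char (char-after)))
    (when char
      (let* ((dec (get-char-code-property char 'decomposition))
             ;; Skip a leading formatting tag (a symbol such as `compat').
             (chars (if (symbolp (car dec)) (cdr dec) dec)))
        ;; Pass the text as arguments so that a literal % in it is not
        ;; read as a format directive.
        (message "%s / #%x / decomp: %s %s"
                 (or (get-char-code-property char 'name) "")
                 char
                 (concat chars)
                 (mapconcat (lambda (cod)
                              (format "#%x" cod))
                            chars "\s+\s"))))))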
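
The Eldoc-style idea could perhaps be sketched by registering a function
in `eldoc-documentation-functions' instead of `post-command-hook'. This
assumes Emacs 28 or later (or a recent eldoc from GNU ELPA); the names
my/char-info-eldoc and my/char-info-eldoc-mode are hypothetical.

#+begin_src emacs-lisp
  (defun my/char-info-eldoc (&rest _ignored)
    "Return a short description of the character at point, or nil.
  Hypothetical helper meant for `eldoc-documentation-functions'."
    (let ((char (char-after)))
      ;; nil at the end of the buffer tells ElDoc there is nothing to show.
      (when char
        (format "%s / #%x"
                (or (get-char-code-property char 'name) "(unnamed)")
                char))))

  (define-minor-mode my/char-info-eldoc-mode
    "Show the name and code point of the character at point through ElDoc."
    :lighter " chinfo"
    (if my/char-info-eldoc-mode
        (progn
          (add-hook 'eldoc-documentation-functions #'my/char-info-eldoc nil t)
          (eldoc-mode 1))
      (remove-hook 'eldoc-documentation-functions #'my/char-info-eldoc t)))
#+end_src

With this approach ElDoc decides when and where to show the text, so it
should cooperate with other documentation sources in the same buffer.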

Best regards,

Juan Manuel 




Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs/org-mode.git
