* Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) [not found] ` <3E7CF23D-CE62-4581-80D6-E6266CEE4DF5@traductaire-libre.org> @ 2024-01-04 11:34 ` Ihor Radchenko 2024-01-04 12:19 ` Jean-Christophe Helary 0 siblings, 1 reply; 5+ messages in thread From: Ihor Radchenko @ 2024-01-04 11:34 UTC (permalink / raw) To: Jean-Christophe Helary Cc: Eli Zaretskii, stefankangas, vincent.b.1, emacs-devel, rms, emacs-orgmode Jean-Christophe Helary <jean.christophe.helary@traductaire-libre.org> writes: > I’m not sure this is the place to discuss translation theory... Let me steer this discussion to non-texi manuals - Org mode manual. We also had a few requests about translations, although Org mode needs are more general compared to a very focused task of translating texi manuals: 1. Org mode manual itself, just like the other Emacs manuals may need to be translated. However, Org mode manual source is written in Org mode format, not texi. It is just transformed to texi as one of the export options. 2. Org mode website has several translations (French, Japanese, and Mandarin). The website sources are written in Org mode, but the work of translation was done be enthusiasts, and we have no good system to maintain the main English version of the website in sync with the translations. 3. A number of Org mode users are using Org sources to publish multi-lingual books. Such publishing is a bit more challenging compared to translation as it requires special book layout. Therefore, there is a demand to have a toolset for translating Org mode documents. So far, I have been thinking about some way to support translations with ideas rather similar to raised in this topic: 1. Mark paragraphs/headings/other structural elements with an ID 2. Split translation into chunks linked to the original source IDs and their text hash 3. When exporting from Org source to target format (texi, html, pdf, etc), allow choosing the target language(s) and ensure consistency between translations and their possibly changed sources. However, I did not know about po4a. It is possible that many of the necessary design decisions are already in place in po4a. > ... > I think po-mode already does that and all the "out of emacs" tools do > too. May you share the most important features of po-mode that you find essential to your translation work? >> @node Display Margins >> @subsection Displaying in the Margins >> @cindex display margins >> @cindex margins, display > > As long as the above strings are to be translated, they need an ID too. > >> @c para 1234 >> >> @c para 1235 >> >> @c para 1236 > > Is the numbering automatic? > > What if you add paragraphs in the middle, like Vincent did? Will the > author have to check the IDs and add a number that’s not used already? > > There are plenty of issues with static IDing parts of a document. May you please elaborate on the issues with static IDs? -- Ihor Radchenko // yantar92, Org mode contributor, Learn more about Org mode at <https://orgmode.org/>. Support Org development at <https://liberapay.com/org-mode>, or support my work at <https://liberapay.com/yantar92> ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) 2024-01-04 11:34 ` Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) Ihor Radchenko @ 2024-01-04 12:19 ` Jean-Christophe Helary 2024-02-07 3:12 ` Richard Stallman 0 siblings, 1 reply; 5+ messages in thread From: Jean-Christophe Helary @ 2024-01-04 12:19 UTC (permalink / raw) To: Ihor Radchenko Cc: Eli Zaretskii, Stefan Kangas, Vincent Belaïche, emacs-devel, rms, emacs-orgmode Ihor, Thank you for chiming in. > On Jan 4, 2024, at 20:34, Ihor Radchenko <yantar92@posteo.net> wrote: > > Jean-Christophe Helary <jean.christophe.helary@traductaire-libre.org> > writes: >> I’m not sure this is the place to discuss translation theory... > > Let me steer this discussion to non-texi manuals - Org mode manual. Sure. > We also had a few requests about translations, although Org mode needs > are more general compared to a very focused task of translating texi > manuals: > > 1. Org mode manual itself, just like the other Emacs manuals may need to > be translated. However, Org mode manual source is written in Org mode > format, not texi. It is just transformed to texi as one of the export > options. That’s correct and that texi file seems to choke po4a, which is the reason my script removes it and uses the .org original instead. I use the txt filter in po4a to convert it to po: https://git.sr.ht/~brandelune/emacs_documentation_repository/tree/main/item/emacs_docs2po.sh#L71 The result is here: https://git.sr.ht/~brandelune/emacs_documentation_repository/tree/main/item/doc/misc/org.org.fr.po And looks like this: --- #. type: Plain text #: /Users/suzume/Documents/Repositories/Projet OmegaT de Documentation Emacs - #: Sources/doc/misc/org.org:24 msgid "" "Org Mode is an authoring tool and a TODO lists manager for GNU Emacs. It " "relies on a lightweight plain-text markup language used in files with the " "=.org= extension." msgstr "" #. type: Plain text #: /Users/suzume/Documents/Repositories/Projet OmegaT de Documentation Emacs - #: Sources/doc/misc/org.org:31 msgid "" "As an authoring tool, Org helps you write structured documents and provides " "exporting facilities. Org files can also be used for literate programming " "and reproducible research. As a TODO lists manager, Org helps you organize " "your tasks in a flexible way, from daily needs to detailed project-planning, " "allowing logging, multiple views on your tasks, exporting your agendas, etc." msgstr "" #. type: Plain text #: /Users/suzume/Documents/Repositories/Projet OmegaT de Documentation Emacs - #: Sources/doc/misc/org.org:38 msgid "" "Org mode is implemented on top of Outline mode, which makes it possible to " "keep the content of large files well structured. Visibility cycling and " "structure editing help to work with the tree. Tables are easily created " "with a built-in table editor. Plain text URL-like links connect to " "websites, emails, Usenet messages, BBDB entries, and any files related to " "the projects." msgstr "" --- In OmegaT it looks like this (all the po decorations are hidden - protected - in the editor, only the translatable parts are displayed): --- ¶ Org Mode is an authoring tool and a TODO lists manager for GNU Emacs. It relies on a lightweight plain-text markup language used in files with the =.org= extension. ¶ As an authoring tool, Org helps you write structured documents and provides exporting facilities. Org files can also be used for literate programming and reproducible research. As a TODO lists manager, Org helps you organize your tasks in a flexible way, from daily needs to detailed project-planning, allowing logging, multiple views on your tasks, exporting your agendas, etc. ¶ Org mode is implemented on top of Outline mode, which makes it possible to keep the content of large files well structured. Le cycle de visualisation et la modification de la structure aident à travailler avec l'arborescence. Les tableaux sont facilement créés avec un éditeur de tableau intégré. Les liens de type URL en texte brut se connectent aux sites Web, aux e-mails, aux messages Usenet, aux entrées BBDB et à tous les fichiers liés aux projets. ¶ --- You’ll notice that a few sentences have been translated in the last paragraph. In other tools that support the PO format, similarly the PO decorations would be hidden from view. > 2. Org mode website has several translations (French, Japanese, and > Mandarin). The website sources are written in Org mode, but the work > of translation was done be enthusiasts, and we have no good system to > maintain the main English version of the website in sync with the > translations. I think you should take a look at how Debian does that, even if their site sources are not in org. The process is the same. Debian has a huge amount of experience in documentation/web localization. > 3. A number of Org mode users are using Org sources to publish > multi-lingual books. Such publishing is a bit more challenging > compared to translation as it requires special book layout. Indeed. > Therefore, there is a demand to have a toolset for translating Org mode > documents. > > So far, I have been thinking about some way to support translations with > ideas rather similar to raised in this topic: > > 1. Mark paragraphs/headings/other structural elements with an ID > > 2. Split translation into chunks linked to the original source IDs and > their text hash > > 3. When exporting from Org source to target format (texi, html, pdf, > etc), allow choosing the target language(s) and ensure consistency > between translations and their possibly changed sources. > > However, I did not know about po4a. It is possible that many of the > necessary design decisions are already in place in po4a. po4a has been around for a while and is used in a lot of free documentation projects. A lot of tools that free software translators use fully support the PO format and the translation teams are used to the process of team translating/proofreading/validating/committing. I am sure Martin Quinson would appreciate development of a dedicated org mode filter. Right now, as I wrote above, the plain text filter is sufficient. Also, po4a is not only a filter, it is a tool equivalent to gettext but focused on documentation formats. You should check its documentation to see how your workflow could benefit from its design. >> ... >> I think po-mode already does that and all the "out of emacs" tools do >> too. > > May you share the most important features of po-mode that you find > essential to your translation work? I don’t use po-mode. po-mode is not sufficient (now) for most of the professional translation that I handle. >>> @node Display Margins >>> @subsection Displaying in the Margins >>> @cindex display margins >>> @cindex margins, display >> >> As long as the above strings are to be translated, they need an ID too. >> >>> @c para 1234 >>> >>> @c para 1235 >>> >>> @c para 1236 >> >> Is the numbering automatic? >> >> What if you add paragraphs in the middle, like Vincent did? Will the >> author have to check the IDs and add a number that’s not used already? >> >> There are plenty of issues with static IDing parts of a document. > > May you please elaborate on the issues with static IDs? For one, the fact that the numbering ends up not being sequential after a while. If you have a tool that does all the grunt work, then any kind of ID system should work. In PO, the message is the ID. -- Jean-Christophe Helary @jchelary@emacs.ch https://traductaire-libre.org https://mac4translators.blogspot.com https://sr.ht/~brandelune/omegat-as-a-book/ ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) 2024-01-04 12:19 ` Jean-Christophe Helary @ 2024-02-07 3:12 ` Richard Stallman 2024-02-07 23:10 ` Ihor Radchenko 0 siblings, 1 reply; 5+ messages in thread From: Richard Stallman @ 2024-02-07 3:12 UTC (permalink / raw) To: Jean-Christophe Helary; +Cc: yantar92, vincent.b.1, emacs-devel, emacs-orgmode [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > 1. Org mode manual itself, just like the other Emacs manuals may need to > be translated. However, Org mode manual source is written in Org mode > format, not texi. It is just transformed to texi as one of the export > options. Using Org format for the source of a document, and converting it to texi for processing (for instance, through TeX) would be a fine method to use, if it worked. But it doesn't actually work. The problem is that it cannot work, given Org format as it exists now. It cannot properly represent manuals that use the GNU standard style because it does not have ways to represent all the distinctions that are needed. Texinfo implements semantic markup, and is designed for multiple output formats, including Info, HTML, and DVI (whence also PDF). Two different markup constructs that look the same in one output format may look different in another. All these design decisions were the result of long thought. For instance, italic face can be the result of @dfn, @var, @emph, @cite and @i. Each of them has a meaning -- which one to use is not a matter of taste. @cite is used around the title of a book. @dfn is used around the first use of a term, to say it is being defined. @emph is used to indicate emphasis. @var is used around a metasyntactic variable. @i means "use italic for this" and does not say why. It is a escape for situations that Texinfo has no specific way to describe. These five commands produce the same output in DVI/PDF, but different output in Info. I think there must be 15 different Texinfo commands to generate fixed-width bold text in DVI/PDF. To represent a manual source in Org format and do the job right is not possible with Org format as it is now. To make it possible requires extending Org format so it can make all the distinctions Texinfo can make. I would like Org format to be extended in this way. Texinfo format is often misunderstood. People write GNU manuals without knowing of all these constructs, so they use the wrong construct. If Org format were extended to do _the whole job_, we could convert all manuals to Org format once and for all. A few years ago I raised this idea, but nobody wanted to work on it. -- Dr Richard Stallman (https://stallman.org) Chief GNUisance of the GNU Project (https://gnu.org) Founder, Free Software Foundation (https://fsf.org) Internet Hall-of-Famer (https://internethalloffame.org) ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) 2024-02-07 3:12 ` Richard Stallman @ 2024-02-07 23:10 ` Ihor Radchenko 2024-02-09 3:51 ` Richard Stallman 0 siblings, 1 reply; 5+ messages in thread From: Ihor Radchenko @ 2024-02-07 23:10 UTC (permalink / raw) To: rms; +Cc: Jean-Christophe Helary, vincent.b.1, emacs-devel, emacs-orgmode Richard Stallman <rms@gnu.org> writes: > Using Org format for the source of a document, and converting it to > texi for processing (for instance, through TeX) would be a fine method > to use, if it worked. But it doesn't actually work. > > The problem is that it cannot work, given Org format as it exists now. > It cannot properly represent manuals that use the GNU standard style > because it does not have ways to represent all the distinctions that > are needed. > ... This is not true. We can generate a precise TeXinfo markup from Org files even now using export snippets: @@texinfo:<direct texinfo code>@@ So, Org mode manual in particular can be adapted if absolutely necessary. Another question is that having native Org mode constructs for manual-specific markup would make things less verbose and ad-hoc. But that question is out of scope of this particular discussion, which is focused on translation tools. > I would like Org format to be extended in this way. > > Texinfo format is often misunderstood. People write GNU manuals > without knowing of all these constructs, so they use the wrong > construct. If Org format were extended to do _the whole job_, we > could convert all manuals to Org format once and for all. > > A few years ago I raised this idea, but nobody wanted to work on it. Your idea is not forgotten. But the resources for Org mode development are limited. In particular, I am currently not prioritizing adding brand-new features in favour of helping contributors, improving the existing functionality, and fixing the existing problems. If somebody is interested to work on adding new Org mode features like custom inline markup, please contact Org mode mailing list. We can guide the volunteers through the contribution process and help with any questions when they arise. See https://orgmode.org/worg/org-contribute.html -- Ihor Radchenko // yantar92, Org mode contributor, Learn more about Org mode at <https://orgmode.org/>. Support Org development at <https://liberapay.com/org-mode>, or support my work at <https://liberapay.com/yantar92> ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) 2024-02-07 23:10 ` Ihor Radchenko @ 2024-02-09 3:51 ` Richard Stallman 0 siblings, 0 replies; 5+ messages in thread From: Richard Stallman @ 2024-02-09 3:51 UTC (permalink / raw) To: Ihor Radchenko Cc: jean.christophe.helary, vincent.b.1, emacs-devel, emacs-orgmode [[[ To any NSA and FBI agents reading my email: please consider ]]] [[[ whether defending the US Constitution against all enemies, ]]] [[[ foreign or domestic, requires you to follow Snowden's example. ]]] > This is not true. We can generate a precise TeXinfo markup from Org > files even now using export snippets: @@texinfo:<direct texinfo code>@@ I know that it is possible to convert Org format to Texinfo. I am raising a different issue. I think we are miscommunicating. The question is not whether Org can be translated to Texingo. The question is whether Org format can represent all the distinctions that Texinfo represents. Or, equivalently, is it possible to translate Texinfo text to Org format and then back to Texinfo format without losing any information? Last I looked, that was not possible, because of the issue I explained in my previous message. If things have improved and Org format is now capable of doing this, I will consider that good news. > Another question is that having native Org mode constructs for > manual-specific markup would make things less verbose and ad-hoc. To handle GNU manuals using Org format would not require a feature like that. -- Dr Richard Stallman (https://stallman.org) Chief GNUisance of the GNU Project (https://gnu.org) Founder, Free Software Foundation (https://fsf.org) Internet Hall-of-Famer (https://internethalloffame.org) ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2024-02-09 3:52 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <PAXP192MB1608F5D8D0A89A5571C67D07849FA@PAXP192MB1608.EURP192.PROD.OUTLOOK.COM> [not found] ` <8A22A273-11F5-449C-A2A4-2E8C68CCF1FE@traductaire-libre.org> [not found] ` <83ttnvzwlw.fsf@gnu.org> [not found] ` <5D3D13BC-E621-4C40-88D7-3E5805F631AB@traductaire-libre.org> [not found] ` <83sf3fzvk4.fsf@gnu.org> [not found] ` <CADwFkmnvx3=kevSGb9qb9_yXs7s__crsOfpb39+7MtU6QYDM9A@mail.gmail.com> [not found] ` <835y0azhvo.fsf@gnu.org> [not found] ` <E9136A1F-CCD1-4A31-8018-CC4D6703BC85@traductaire-libre.org> [not found] ` <83wmsqy14p.fsf@gnu.org> [not found] ` <3E7CF23D-CE62-4581-80D6-E6266CEE4DF5@traductaire-libre.org> 2024-01-04 11:34 ` Translation of the Org mode manual (was: Translation of manuals (was: SES manual French translation)) Ihor Radchenko 2024-01-04 12:19 ` Jean-Christophe Helary 2024-02-07 3:12 ` Richard Stallman 2024-02-07 23:10 ` Ihor Radchenko 2024-02-09 3:51 ` Richard Stallman
Code repositories for project(s) associated with this public inbox https://git.savannah.gnu.org/cgit/emacs/org-mode.git This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).