emacs-orgmode@gnu.org archives
 help / color / mirror / code / Atom feed
From: Matt <matt@excalamus.com>
To: "Leo Butler" <Leo.Butler@umanitoba.ca>
Cc: "galaxy being" <borgauf@gmail.com>,
	"emacs-orgmode mailinglist" <emacs-orgmode@gnu.org>
Subject: Re: Template for ob- packages?
Date: Sat, 18 Feb 2023 18:02:16 -0500	[thread overview]
Message-ID: <18666c38a2e.c2cfedda2107471.4355804631611432902@excalamus.com> (raw)
In-Reply-To: <87fsb3ly8d.fsf@t14.reltub.ca>

[-- Attachment #1: Type: text/plain, Size: 314 bytes --]


 ---- On Fri, 17 Feb 2023 17:32:18 -0500  Leo Butler  wrote --- 

 > Matt, thanks for sharing those notes. I would suggest that they be added
 > to worg in their current state.

Thanks for your vote of confidence.

I've attached the source, in case someone wants to do that.  Otherwise, I'll get to it when I can.

[-- Attachment #2: 2021-11-03-org_babel.org --]
[-- Type: application/octet-stream, Size: 13343 bytes --]

#+TITLE: Org Babel API
#+DATE: 2021-11-03
#+TYPE: post
#+TOC: t

* TODO COMMENT
- [ ] explain each function
- [ ] explain terms
- [ ] remove cruft functions from =ob-template.el= (or just refactor
  that example)

* COMMENT Setup
Provide the Org source directory path in order to make this file
portable.

#+begin_src emacs-lisp :results none
(setq-local org-dir "/home/ahab/.emacs.d/straight/repos/org/lisp")
#+end_src

#+begin_src sh :var ORGDIR=(identity org-dir) :results output
echo "$ORGDIR"
#+end_src

#+RESULTS:
: /home/ahab/.emacs.d/straight/repos/org/lisp

* Overview
This document explains the Org Babel API with a focus on maintenance
and extension.  Babel is part of the Org package for Emacs.  It allows
for embedded code blocks within a text document[fn:1].

At the time of writing, the Org Babel API is only described in
[[https://git.sr.ht/~bzg/worg/tree/master/item/org-contrib/babel/ob-template.el][=ob-template.el=]], the [[https://lists.gnu.org/archive/html/emacs-orgmode/2015-09/msg00487.html][email thread]] linked to in [[https://orgmode.org/worg/org-contrib/babel/languages/index.html#develop][the Worg docs]], and, of
course, the source code itself.

The functions that need to be (or even can be) defined aren't really
laid out anywhere, nor are the ways they interact explained in one
place. This document aims to change that.

*NOTE* At some point I'll merge this into Worg[fn:2].  However, this
is still a draft and I don't feel it's right to host it there
yet. These are my active notes and people probably expect Worg to be
somewhat more complete.

* The Babel Codebase
Org Babel is not a single file or package. It's a suite of
files. Babel files begin with "ob-", for "Org Babel", and are found in
the =lisp/= directory of the Org project[fn:3].  The primary file is
=ob-core.el=.

#+begin_src sh :var ORGDIR=(identity org-dir) :results output
ls $ORGDIR | grep "ob-" | grep "el$" | head -n 10 && echo "..."
#+end_src

#+RESULTS:
#+begin_example
ob-awk.el
ob-calc.el
ob-C.el
ob-clojure.el
ob-comint.el
ob-core.el
ob-css.el
ob-ditaa.el
ob-dot.el
ob-emacs-lisp.el
...
#+end_example

At the time of writing, Babel is useable with more than 40 different
languages.

#+begin_src sh :var ORGDIR=(identity org-dir) :results output
ls $ORGDIR | grep "ob-" | grep "el$" | wc -l
#+end_src

#+RESULTS:
: 48

Babel is no longer maintained by the original author.  Instead,
extensions have individual maintainers. Each extension varies in
quality and consistency. Some see regular updates whereas others don't
seem to have changed much.

It's [[https://www.gnu.org/software/emacs/manual/html_node/elisp/Coding-Conventions.html][convention]] in Elisp to choose a prefix to distinguish functions
and variables between packages.  The name "Org Babel" implies the
package prefix "org-babel-"[fn:4].  This is the prefix used within
Babel *as well as other Org extensions (e.g. Org Links, Org Export
Framework, etc.)*. That is, symbols defined in =ob-<lang>.el= don't
use a "ob-<lang>-" prefix–they use "org-babel-".  It's impossible to
distinguish Org Babel symbols from other Org symbols by name alone.

* Babel Basics
Babel works by defining certain functions that =ob-core.el= expects.

If we compare =ob-template.el= and =ob-core.el=, we get the following
in no particular order:

| function name                             | ob-core.el | ob-template.el |
|-------------------------------------------+------------+----------------|
| [[org-babel-execute:template][=org-babel-execute:template=]]              | X          | X              |
| [[org-babel-template-initiate-session][=org-babel-template-initiate-session=]]     | X          | X              |
| [[org-babel-expand-body:template][=org-babel-expand-body:template=]]          | X          | X              |
| [[org-babel-prep-session:template][=org-babel-prep-session:template=]]         | X          | X              |
| [[=org-babel-template-var-to-template=][=org-babel-template-var-to-template=]]      |            | X              |
| [[=org-babel-template-table-or-string=][=org-babel-template-table-or-string=]]      |            | X              |
| [[org-babel-variable-assignments:template][=org-babel-variable-assignments:template=]] | X          |                |
| [[org-babel-process-params][=org-babel-process-params=]]                | X          |                |

| variable name                            | ob-core.el | ob-template.el |
|------------------------------------------+------------+----------------|
| [[=org-babel-tangle-lang-exts=][=org-babel-tangle-lang-exts=]]             |            | X              |
| [[org-babel-default-header-args:template][=org-babel-default-header-args:template=]] | X          | X              |

Functions from =ob-core.el= not in =ob-template.el=, but still used in
some =ob-template.el= derived languages (e.g. =ob-shell.el=):

- [[org-babel-header-args:template][=org-babel-header-args:template=]]
- [[org-babel-load-session:template][=org-babel-load-session:template=]]
- [[org-babel-load-session:template][=org-babel-variable-assignments:template=]]

Important functions to understand for any extension that uses a comint
(all of them?):

- [[org-babel-comint-wait-for-output][=org-babel-comint-wait-for-output=]]

* Function: =org-babel-execute:template= <<org-babel-execute:template>>
*Arguments*: /body/, /params/

*Returns*: emacs-lisp containing the results of evaluating /body/ in
/template/ language.

The function called to actually evaluate a code block.  It is defined
in an "ob-" file and gets called by =org-babel-execute-src-block=.

The =org-babel-execute:template= function will evaluate the body of
the source code and return the results as emacs-lisp depending on the
value of the =:results header= argument:

- =:results output= means that the output to STDOUT will be captured
  and returned
- =:results value= means that the value of the last statement in the
  source code block will be returned

The [[https://git.sr.ht/~bzg/worg/tree/master/item/org-contrib/babel/ob-template.el][ob-template.el]] file claims that "the most common first step in
this function is the expansion of the PARAMS argument using
=org-babel-process-params=."  This was probably true when that was
written. At the time of writing, however, only four of the files in the
org-babel suite use it.

#+begin_quote
Please feel free to not implement options which aren't appropriate
for your language (e.g. not all languages support interactive
"session" evaluation).  Also you are free to define any new header
arguments which you feel may be useful -- all header arguments
specified by the user will be available in the PARAMS variable.
#+end_quote

* TODO Function: =org-babel-template-initiate-session= <<org-babel-template-initiate-session>>
* TODO Function: =org-babel-expand-body:template= <<org-babel-expand-body:template>>
Called by [[org-babel-execute:template][=org-babel-execute:template=]].

* TODO Function: =org-babel-prep-session:template= <<org-babel-prep-session:template>>
* TODO Function: =org-babel-template-var-to-template= <<org-babel-template-var-to-template>>
This is cruft.  It's only defined in =ob-template.el=. All it does is this:

#+begin_src emacs-lisp :eval never
(defun org-babel-template-var-to-template (var)
  "Convert an elisp var into a string of template source code
specifying a var of the same value."
  (format "%S" var))
#+end_src

Surely this doesn't deserve a whole function.

* TODO Function: =org-babel-template-table-or-string= <<org-babel-template-table-or-string>>
This is cruft. It is defined in =ob-template.el= but not used
anywhere, in that file or elsewhere.

* Function: =org-babel-process-params= <<org-babel-process-params>>
  "Expand variables in PARAMS and add summary parameters."

* TODO Variable: =org-babel-tangle-lang-exts= <<org-babel-tangle-lang-exts>>
Gets called by =ob-tangle.el= in the
=org-babel-effective-tangled-filename= function.
* TODO Variable: =org-babel-default-header-args:template= <<org-babel-default-header-args:template>>
* TODO Function: =org-babel-header-args:template= <<org-babel-header-args:template>>
* Function: =org-babel-load-session:template= <<org-babel-load-session:template>>
Called by the =ob-core.el= function =org-babel-load-in-session=. That
function handles header args, mainly no-web expansion. It then starts
the comint process and opens the associated process buffer.  The
=org-babel-load-in-session= function is, basically, called by the
"org-metaup-hook" which is attached to M-up by default.  Otherwise,
this function is never called. Also, AFAICT, this behavior is not
documented anywhere.

For ob-shell.el, this function is implemented, but only for the
"shell" language. For example, press M-up when in the following
block. The body will be inserted in a new process buffer "my-shell".

: #+begin_src shell :session my-shell
: echo "hello, world!"
: #+end_src
:
: #+begin_src sh :session my-sh
: echo "hello, world!"
: #+end_src
:
: #+begin_src bash :session my-bash
: echo "hello, world!"
: #+end_src

* Function: =org-babel-variable-assignments:template= <<org-babel-variable-assignments:template>>
Used during tangle (ob-tangle.el) and eval.

It is used as an argument for [[https://git.savannah.gnu.org/cgit/emacs/org-mode.git/tree/lisp/ob-core.el#n810][=org-babel-expand-body:generic=]].

It is used in =org-babel-prep-session:template=

#+begin_quote
"Return list of template statements assigning the block's variables."
#+end_quote

* TODO Function: org-babel-comint-wait-for-output <<org-babel-comint-wait-for-output>>
* Undocumented behavior
Don't be surprised if there is undocumented behavior defined within an
ob file.

The following were undocumented features of =ob-shell.el=:

  - :stdin passes org reference (#+name) as stdin
  - :cmdline with :shebang runs body with given command-line arg(s)
  - :cmdline without :shebang runs body with shell-file-name and command-line arg(s)
  - org-babel-load-session:shell (see [[org-babel-load-session:template][org-babel-load-session:template]])

* Terminology
To quote [[https://orgmode.org/manual/Structure-of-Code-Blocks.html][(org) Structure of Code Blocks]],

#+begin_example org
A source code block conforms to this structure:

    #+NAME: <name>
    #+BEGIN_SRC <language> <switches> <header arguments>
    <body>
    #+END_SRC
#+end_example

** TODO info
** params
'params' is an alist derived from the src block header.  Element keys
 are referred to as "header args". These are defined in
 =org-babel-common-header-args-w-values=, but are not exclusive to
 that list. Any ":header-arg value" pair defined in the header will
 appear in the list.

** switch
Switches control evaluation, export and tangling of code blocks.

For example, the following block uses the "-n" switch to toggle line
numbers in the HTML output:

#+begin_example org
#+begin_src elisp -n
(message "hello, world!")
#+end_src
#+end_example

#+attr_html: :class pre-top pre-bottom
#+begin_src elisp -n
(message "hello, world!")
#+end_src

** header argument
Header arguments are keywords which control block evaluation or
expansion.  For instance, the =:session <session-name>= argument runs
a block in a separate shell. Another example is the =:var NAME=VALUE=
argument which passes data between blocks such that =NAME= gets
defined as =VALUE= within the block environment.

Header arguments can be more general, however. To quote [[https://git.sr.ht/~bzg/worg/tree/master/item/org-contrib/babel/ob-template.el][ob-template.el]],

#+begin_quote
"you are free to define any new header arguments which you feel may be
useful -- all header arguments specified by the user will be available
in the PARAMS variable."
#+end_quote

Arguments can be system-wide, using =org-babel-default-header-args= or
language specific with =org-babel-default-header-args:<LANG>=.

For common header arguments, see
=org-babel-common-header-args-w-values=.

** expansion
"Expand" and "expansion" are overloaded terms. Generally, they mean to
replace something within a form.

When org-babel "expands" the body of a source block, it resolves any
noweb references (noweb is the functionality that allows different
source blocks to reference one another).

To expand a variable, in the context of Org babel, means to assign a
header variable as a variable within the language runtime.

For example, the table named =tbl= gets "expanded" and assigned to
=data= within the Python runtime.

#+begin_example
#+name: tbl
| 1 | 2 |
| 3 | 4 |

#+BEGIN_SRC python :var data=tbl :results value
return data
#+END_SRC

#+RESULTS:
| 1 | 2 |
| 3 | 4 |
#+end_example

* The =:var= parameter
The =:var= parameter has the form

#+begin_example
(:var name . value)
#+end_example

where "value", the cdr, should be a cons cell whose car is the name of
the variable "as a symbol" and whose value is the value of the
variable.

That is,

#+begin_example
(:var name . (name-symbol . "it's value"))
#+end_example

* References
- https://lists.gnu.org/archive/html/emacs-orgmode/2015-09/msg00487.html

* Footnotes

[fn:1] https://orgmode.org/

[fn:2] https://orgmode.org/worg/worg-about.html

[fn:3] https://git.savannah.gnu.org/cgit/emacs/org-mode.git/tree/lisp

[fn:4] Technically, Emacs doesn't have namespaces. Instead, a prefix
is given to symbols so that various packages don't conflict.

  reply	other threads:[~2023-02-18 23:03 UTC|newest]

Thread overview: 6+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-02-16 21:10 Template for ob- packages? Galaxy Being
2023-02-16 21:16 ` Matt
2023-02-17 17:06   ` Galaxy Being
2023-02-17 22:32   ` Leo Butler
2023-02-18 23:02     ` Matt [this message]
2023-02-16 21:17 ` Thomas S. Dye

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

  List information: https://www.orgmode.org/

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=18666c38a2e.c2cfedda2107471.4355804631611432902@excalamus.com \
    --to=matt@excalamus.com \
    --cc=Leo.Butler@umanitoba.ca \
    --cc=borgauf@gmail.com \
    --cc=emacs-orgmode@gnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
Code repositories for project(s) associated with this public inbox

	https://git.savannah.gnu.org/cgit/emacs/org-mode.git

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).