From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp1 ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms11 with LMTPS id 8KmdIuek2F8nVAAA0tVLHw (envelope-from ) for ; Tue, 15 Dec 2020 11:58:31 +0000 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp1 with LMTPS id QINfHuek2F/EVwAAbx9fmQ (envelope-from ) for ; Tue, 15 Dec 2020 11:58:31 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id 992EC940148 for ; Tue, 15 Dec 2020 11:58:30 +0000 (UTC) Received: from localhost ([::1]:55736 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kp8yJ-000225-O4 for larch@yhetil.org; Tue, 15 Dec 2020 06:58:28 -0500 Received: from eggs.gnu.org ([2001:470:142:3::10]:60462) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kp8qS-0001L0-0W for emacs-orgmode@gnu.org; Tue, 15 Dec 2020 06:50:20 -0500 Received: from mail-pl1-x62f.google.com ([2607:f8b0:4864:20::62f]:43636) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kp8qP-0001bE-CV; Tue, 15 Dec 2020 06:50:19 -0500 Received: by mail-pl1-x62f.google.com with SMTP id x12so10428656plr.10; Tue, 15 Dec 2020 03:50:14 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=references:user-agent:from:to:cc:subject:date:in-reply-to :message-id:mime-version; bh=dtunCi2Kzwys7iOgCUBwv6CqjPGoBKhst9R6v0dkoR8=; b=H2PiH7CcK5AfmIf9u2PI+1ayz6vBcmgwEE0aSSrVwQ29aTICle0a3Iu3qxFejcn6mP 9cb30LQZPA6raXTib5zzSv6NF4pmZTauBhbtgs3AL7ngTzg0RM+hSOENoXygEx38TeSk TLs+WmFP7dPGOiZqKbS7tSGYmASX723qfbLcVAY9ZdpzzewlP7gzhMRzjha+qCOSTnAb zsX7DKfvRs6ZAWvz9nEG0kat9mp6uMrEwyxCtVIJT7DkbKoW2myvd5+fQmfup6cgVZJs IBRuHhHCMPxItLXB7S8vVnIhcYmDisCrTYn3XApoR1urbnpwWBvxu29E1/a1pwVgK6Oo KFfA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:references:user-agent:from:to:cc:subject:date :in-reply-to:message-id:mime-version; bh=dtunCi2Kzwys7iOgCUBwv6CqjPGoBKhst9R6v0dkoR8=; b=OaHdYptSBR3a5sGJ4bf2fJo7cdmPYvlnEVutX1f9mwYefshqcSDvKcdkTMTxSxuEYS NyXlLK2/Pw+sQf1+QFgXQGhKy+CPG2g+jhfBYTK81pEz0u7oC9ziJRqhgbCmoFIHgQbj TNRt+l4+K+1jJ4irHSgl1MKJPdGfawZDyMO1R8mh1dxe8d94J5GmN7v99iDUujva7xWN KyoArMtUJzSKQO3WavHw8w9OIeZVzonF5dzs7NA6v67gl+RA0LsqcvxMMyuTQh2PeWUh ktGzdAqLEBOVhtJ/aA8Dlf6FxJZMQ8Td2Vup2oVH7moVRkxf/zgTx2k+mHQGFcsmkCsp SosQ== X-Gm-Message-State: AOAM533vOiCkCL7itMx7SibWw6BvFwDMJ75te7gYlepty286W4mOhcSd IlwefhLNo6QT1sx6ux/f7R2MmIzk2jQyhw== X-Google-Smtp-Source: ABdhPJzcCWadsm6aMiJrOaCgWJEJ/hNVTVqf7YAGT5SMVjDKKBW+TQ5NNgVqJaF8rATcwUjyo0KnbA== X-Received: by 2002:a17:90b:1c10:: with SMTP id oc16mr29518753pjb.144.1608033014006; Tue, 15 Dec 2020 03:50:14 -0800 (PST) Received: from localhost (180-150-91-8.b4965b.per.nbn.aussiebb.net. [180.150.91.8]) by smtp.gmail.com with ESMTPSA id b19sm23289012pfo.24.2020.12.15.03.50.12 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 15 Dec 2020 03:50:13 -0800 (PST) References: <87pn6kfr19.fsf@gmail.com> <87v9gcz9ge.fsf@wi.uni-muenster.de> <87lfh8fkj1.fsf@gmail.com> <87zh5n1p3s.fsf@wi.uni-muenster.de> <87tuvl3fyc.fsf@gmail.com> <878scvs0z7.fsf@wi.uni-muenster.de> <87wo0fktjs.fsf@gmail.com> <874knjrtg9.fsf@wi.uni-muenster.de> <87tuvjkqzl.fsf@gmail.com> <87sgb28gd8.fsf@wi.uni-muenster.de> <87blexel9f.fsf@gmail.com> <87im94ordh.fsf@gnu.org> <87tusolntx.fsf@wi.uni-muenster.de> User-agent: mu4e 1.4.13; emacs 27.1 From: TEC To: Jens Lechtenboerger Subject: Re: [PATCH] Enhance org-html--build-meta-info Date: Tue, 15 Dec 2020 19:39:35 +0800 In-reply-to: <87tusolntx.fsf@wi.uni-muenster.de> Message-ID: <873607uw3y.fsf@gmail.com> MIME-Version: 1.0 Content-Type: multipart/mixed; boundary="=-=-=" Received-SPF: pass client-ip=2607:f8b0:4864:20::62f; envelope-from=tecosaur@gmail.com; helo=mail-pl1-x62f.google.com X-Spam_score_int: -20 X-Spam_score: -2.1 X-Spam_bar: -- X-Spam_report: (-2.1 / 5.0 requ) BAYES_00=-1.9, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, FREEMAIL_FROM=0.001, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Bastien , org-mode-email Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Migadu-Flow: FLOW_IN X-Migadu-Spam-Score: -1.21 Authentication-Results: aspmx1.migadu.com; dkim=fail (body hash did not verify) header.d=gmail.com header.s=20161025 header.b=H2PiH7Cc; dmarc=fail reason="SPF not aligned (relaxed)" header.from=gmail.com (policy=none); spf=pass (aspmx1.migadu.com: domain of emacs-orgmode-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=emacs-orgmode-bounces@gnu.org X-Migadu-Queue-Id: 992EC940148 X-Spam-Score: -1.21 X-Migadu-Scanner: scn0.migadu.com X-TUID: Ha6/7RKUlZtW --=-=-= Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Thanks for testing Jens. I think I've managed to resolve the issues you've raised. Jens, Bastien, you can find the latest revision of the patches attached :) Jens Lechtenboerger writes: > [title export being dodgy, how about treating like author?] Yep, ~org-element-interpret-data~ is necessary. I found that wrapping it in ~org-html-plain-text~ seems better again though, as it encodes entities like "---" (org) to "—", and doesn't seem to introduce any nested tags. I've also applied this to the author field as a result. Maybe it should be applied to the rest (in ~org-html--build-meta-info~)? I'm not sure. > The keywords export as follows, where the name attribute is missing: > Fixed. > The current lambda functions in org-html-meta-tags all accept three > arguments, where the first one is ignored in all cases. The second > one is used in exactly one case. Why not add four calls to > org-html--build-meta-entry (for author, description, keywords, > generator) in org-html--build-meta-info? I had an idea on this, I think the new form is cleaner. Either have a list where each item generates a meta entry, or a function that generates such a list. No more mixing of the two. How does this look? Timothy. --=-=-= Content-Type: text/x-patch Content-Disposition: attachment; filename=0001-lisp-ox-html.el-make-html-meta-tag-builder-nicer.patch >From 9848af808752bc03404befaab7ab5ebb902aa1d0 Mon Sep 17 00:00:00 2001 From: TEC Date: Mon, 14 Dec 2020 17:41:33 +0800 Subject: [PATCH 1/2] lisp/ox-html.el: make html meta tag builder nicer * lisp/ox-html.el (org-html--build-meta-info): Multi-line repeated structure extracted to new function `org-html--build-meta-entry'. The keyword value formatting is changed from `org-export-data' to `org-html-encode-plain-text' to avoid potentially nesting HTML tags in meta tags and the element, which would violate W3C. --- lisp/ox-html.el | 114 ++++++++++++++++++++++++------------------------ 1 file changed, 56 insertions(+), 58 deletions(-) diff --git a/lisp/ox-html.el b/lisp/ox-html.el index d2f24f5c6..005703f60 100644 --- a/lisp/ox-html.el +++ b/lisp/ox-html.el @@ -1835,78 +1835,76 @@ INFO is a plist used as a communication channel." ;;; Template +(defun org-html--build-meta-entry (label identity &optional content-format &rest content-formatters) + "Construct <meta> tag of form <meta LABEL=\"IDENTITY\" />, or when CONTENT-FORMAT is present: +<meta LABEL=\"IDENTITY\" content=\"{content}\" /> + +Here {content} is determined by applying any CONTENT-FORMATTERS to the CONTENT-FORMAT and encoding +the result as plain text." + (concat "<meta " + (format "%s=\"%s" label identity) + (when content-format + (concat "\" content=\"" + (replace-regexp-in-string + "\"" """ + (org-html-encode-plain-text + (if content-formatters + (apply #'format content-format content-formatters) + content-format))))) + "\" />\n")) + (defun org-html--build-meta-info (info) "Return meta tags for exported document. INFO is a plist used as a communication channel." - (let* ((protect-string - (lambda (str) - (replace-regexp-in-string - "\"" """ (org-html-encode-plain-text str)))) - (title (org-export-data (plist-get info :title) info)) - ;; Set title to an invisible character instead of leaving it - ;; empty, which is invalid. - (title (if (org-string-nw-p title) title "‎")) - (author (and (plist-get info :with-author) - (let ((auth (plist-get info :author))) + (let* ((title (org-html-plain-text + (org-element-interpret-data (plist-get info :title)) info)) + ;; Set title to an invisible character instead of leaving it + ;; empty, which is invalid. + (title (if (org-string-nw-p title) title "‎")) + (author (and (plist-get info :with-author) + (let ((auth (plist-get info :author))) ;; Return raw Org syntax. - (and auth (org-element-interpret-data auth))))) - (description (plist-get info :description)) - (keywords (plist-get info :keywords)) - (charset (or (and org-html-coding-system - (fboundp 'coding-system-get) - (coding-system-get org-html-coding-system - 'mime-charset)) - "iso-8859-1"))) + (and auth (org-html-plain-text + (org-element-interpret-data auth) info))))) + (charset (or (and org-html-coding-system + (fboundp 'coding-system-get) + (symbol-name + (coding-system-get org-html-coding-system + 'mime-charset))) + "iso-8859-1"))) (concat (when (plist-get info :time-stamp-file) (format-time-string (concat "<!-- " (plist-get info :html-metadata-timestamp-format) " -->\n"))) - (format - (if (org-html-html5-p info) - (org-html-close-tag "meta" "charset=\"%s\"" info) - (org-html-close-tag - "meta" "http-equiv=\"Content-Type\" content=\"text/html;charset=%s\"" - info)) - charset) "\n" + + (if (org-html-html5-p info) + (org-html--build-meta-entry "charset" charset) + (org-html--build-meta-entry "http-equiv" "Content-Type" + (concat "text/html;charset=" charset))) + (let ((viewport-options (cl-remove-if-not (lambda (cell) (org-string-nw-p (cadr cell))) (plist-get info :html-viewport)))) - (and viewport-options - (concat - (org-html-close-tag - "meta" - (format "name=\"viewport\" content=\"%s\"" - (mapconcat - (lambda (elm) (format "%s=%s" (car elm) (cadr elm))) - viewport-options ", ")) - info) - "\n"))) + (if viewport-options + (org-html--build-meta-entry "name" "viewport" + (mapconcat + (lambda (elm) (format "%s=%s" (car elm) (cadr elm))) + viewport-options ", ")))) + (format "<title>%s\n" title) - (org-html-close-tag "meta" "name=\"generator\" content=\"Org mode\"" info) - "\n" - (and (org-string-nw-p author) - (concat - (org-html-close-tag "meta" - (format "name=\"author\" content=\"%s\"" - (funcall protect-string author)) - info) - "\n")) - (and (org-string-nw-p description) - (concat - (org-html-close-tag "meta" - (format "name=\"description\" content=\"%s\"\n" - (funcall protect-string description)) - info) - "\n")) - (and (org-string-nw-p keywords) - (concat - (org-html-close-tag "meta" - (format "name=\"keywords\" content=\"%s\"" - (funcall protect-string keywords)) - info) - "\n"))))) + + (when (org-string-nw-p author) + (org-html--build-meta-entry "name" "author" author)) + + (when (org-string-nw-p (plist-get info :description)) + (org-html--build-meta-entry "name" "description" (plist-get info :description))) + + (when (org-string-nw-p (plist-get info :keywords)) + (org-html--build-meta-entry "keywords" (plist-get info :keywords))) + + (org-html--build-meta-entry "name" "generator" "Org Mode")))) (defun org-html--build-head (info) "Return information for the .. of the HTML output. -- 2.29.2 --=-=-= Content-Type: text/x-patch Content-Disposition: attachment; filename=0002-lisp-ox-html.el-make-html-meta-tags-customizable.patch >From 3fdc205a549fe315b3096afb72a87868ef9c57d5 Mon Sep 17 00:00:00 2001 From: TEC Date: Mon, 14 Dec 2020 17:50:15 +0800 Subject: [PATCH 2/2] lisp/ox-html.el: make html meta tags customizable * lisp/ox-html.el (org-html-meta-tags): Introduce this as a new option which can be modified to set the meta tags added in HTML exports. (org-html--build-meta-info): Make use of `org-html-meta-tags' instead of hardcoded meta tags. This is leveraging the earlier restructuring of `org-html--build-meta-info' into a much DRYer form, such that this modification has a negligible impact on complexity and readability. --- lisp/ox-html.el | 47 +++++++++++++++++++++++++++++++++++++---------- 1 file changed, 37 insertions(+), 10 deletions(-) diff --git a/lisp/ox-html.el b/lisp/ox-html.el index 005703f60..6a74cdca8 100644 --- a/lisp/ox-html.el +++ b/lisp/ox-html.el @@ -1425,6 +1425,22 @@ not be modified." ;;;; Template :: Styles +(defcustom org-html-meta-tags #'org-html-meta-tags-default + "A list where each item is a list of arguments to be passed +to `org-html--build-meta-entry'. Any nil items are ignored. + +Also accept a function which gives such a list when called with with +signature (TITLE AUTHOR INFO) where TITLE and AUTHOR are strings, +and INFO a communication plist." + :group 'org-export-html + :package-version '(Org . "9.5") + :type '(choice + (repeat + (list (string :tag "Meta label") + (string :tag "label value") + (string :tag "Content value"))) + function)) + (defcustom org-html-head-include-default-style t "Non-nil means include the default style in exported HTML files. The actual style is defined in `org-html-style-default' and @@ -1835,6 +1851,22 @@ INFO is a plist used as a communication channel." ;;; Template +(defun org-html-meta-tags-default (_title author info) + "Generate a list items, each of which is a list of arguments that can +be passed to `org-html--build-meta-entry', to generate meta tags to be +included in the HTML head. + +The documents's TITLE, AUTHOR, and communication plist INFO may be used." + (list + (when (org-string-nw-p author) + (list "name" "author" author)) + (when (org-string-nw-p (plist-get info :description)) + (list "name" "description" + (plist-get info :description))) + (when (org-string-nw-p (plist-get info :keywords)) + (list "name" "keywords" (plist-get info :keywords))) + '("name" "generator" "Org Mode"))) + (defun org-html--build-meta-entry (label identity &optional content-format &rest content-formatters) "Construct tag of form , or when CONTENT-FORMAT is present: @@ -1895,16 +1927,11 @@ INFO is a plist used as a communication channel." (format "%s\n" title) - (when (org-string-nw-p author) - (org-html--build-meta-entry "name" "author" author)) - - (when (org-string-nw-p (plist-get info :description)) - (org-html--build-meta-entry "name" "description" (plist-get info :description))) - - (when (org-string-nw-p (plist-get info :keywords)) - (org-html--build-meta-entry "keywords" (plist-get info :keywords))) - - (org-html--build-meta-entry "name" "generator" "Org Mode")))) + (mapconcat + (lambda (args) (apply #'org-html--build-meta-entry args)) + (delq nil (if (functionp org-html-meta-tags) + (funcall org-html-meta-tags title author info) + org-html-meta-tags)) "")))) (defun org-html--build-head (info) "Return information for the .. of the HTML output. -- 2.29.2 --=-=-=--