From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from mp2 ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by ms11 with LMTPS id 2POMFeOMYF8ICgAA0tVLHw (envelope-from ) for ; Tue, 15 Sep 2020 09:44:03 +0000 Received: from aspmx1.migadu.com ([2001:41d0:2:4a6f::]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits)) by mp2 with LMTPS id YPncD+OMYF+yJwAAB5/wlQ (envelope-from ) for ; Tue, 15 Sep 2020 09:44:03 +0000 Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by aspmx1.migadu.com (Postfix) with ESMTPS id A86F294053F for ; Tue, 15 Sep 2020 09:44:02 +0000 (UTC) Received: from localhost ([::1]:57988 helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1kI7VH-0004OM-Tr for larch@yhetil.org; Tue, 15 Sep 2020 05:44:00 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:39358) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1kI7T3-0004OF-9u for emacs-orgmode@gnu.org; Tue, 15 Sep 2020 05:41:41 -0400 Received: from dal3relay209.mxroute.com ([64.40.27.209]:33403) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.90_1) (envelope-from ) id 1kI7T0-0004v5-TE for emacs-orgmode@gnu.org; Tue, 15 Sep 2020 05:41:40 -0400 Received: from filter003.mxroute.com ([168.235.111.26] 168-235-111-26.cloud.ramnode.com) (Authenticated sender: mN4UYu2MZsgR) by dal3relay209.mxroute.com (ZoneMTA) with ESMTPSA id 17491241a63000352a.001 for (version=TLSv1/SSLv3 cipher=ECDHE-RSA-AES128-GCM-SHA256); Tue, 15 Sep 2020 09:41:36 +0000 X-Zone-Loop: 8da332cb9e850403a061b6161852ffb4512b2882022d X-Originating-IP: [168.235.111.26] Received: from friday.mxlogin.com (friday.mxlogin.com [159.69.65.104]) by filter003.mxroute.com (Postfix) with ESMTPS id E10B062D91 for ; Tue, 15 Sep 2020 09:34:41 +0000 (UTC) Subject: Re: official orgmode parser To: emacs-orgmode@gnu.org References: <68dc1ea1-52e8-7d9e-fb2d-bcf08c111eca@intrepidus.pl> <87d02n2yyr.fsf@gmail.com> From: =?UTF-8?B?UHJ6ZW15c8WCYXcgS2FtacWEc2tp?= Message-ID: <482cea5c-4214-57ac-dfeb-1e305180fee5@intrepidus.pl> Date: Tue, 15 Sep 2020 11:17:57 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.12.0 MIME-Version: 1.0 In-Reply-To: <87d02n2yyr.fsf@gmail.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit X-AuthUser: pk@mmksoft.uk Received-SPF: pass client-ip=64.40.27.209; envelope-from=pk@intrepidus.pl; helo=dal3relay209.mxroute.com X-detected-operating-system: by eggs.gnu.org: First seen = 2020/09/15 05:41:37 X-ACL-Warn: Detected OS = Linux 3.11 and newer [fuzzy] X-Spam_score_int: -18 X-Spam_score: -1.9 X-Spam_bar: - X-Spam_report: (-1.9 / 5.0 requ) BAYES_00=-1.9, NICE_REPLY_A=-0.001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: emacs-orgmode@gnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: emacs-orgmode-bounces+larch=yhetil.org@gnu.org Sender: "Emacs-orgmode" X-Scanner: scn0 Authentication-Results: aspmx1.migadu.com; dkim=none; dmarc=none; spf=pass (aspmx1.migadu.com: domain of emacs-orgmode-bounces@gnu.org designates 209.51.188.17 as permitted sender) smtp.mailfrom=emacs-orgmode-bounces@gnu.org X-Spam-Score: -1.01 X-TUID: gDPp3uouyouG On 9/15/20 11:03 AM, Tim Cross wrote: > > Przemysław Kamiński writes: > >> Hello, >> >> I oftentimes find myself needing to parse org files with some external >> tools (to generate reports for customers or sum up clock times for given >> month, etc). Looking through the list >> >> https://orgmode.org/worg/org-tools/ >> >> and having tested some of these, I must say they are lacking. The >> Haskell ones seem to be done best, but then the compile overhead of >> Haskell and difficulty in embedding this into other languages is a drawback. >> >> I think it might benefit the community when such an official parser >> would exist (and maybe could be hooked into org mode directly). >> >> I was thinking picking some scheme like chicken or guile, which could be >> later easily embedded into C or whatever. Then use that parser in org >> mode itself. This way some important part of org mode would be outside >> of the small world of elisp. >> >> This is just an idea, what do you think? :) >> > > The problem with this idea is maintenance. It is also partly why > external tools are not terribly reliable/good. Org mode is constantly > being enhanced and improved. It is very hard for external tools to keep > pace with org-mode development, so they soon get out of date or stop > working correctly. > > Org mode IS an elsip application. This is the main goal. The reason it > works so well is because elisp is largely a DSL that focuses on text > manipulation and is therefore ideally suited for a text based organiser. > > This means if you want to implement parsing of org files in any > other language, there is a lot of fundamental functionality which willl > need to be implemented that is not necessary when using elisp as it is > already built-in. Not only that, it is also 'battle hardened' and well > tested. The other problem would be in selecting another language which > behaves consistently across all the platforms Emacs and org-mode is > supported on. As org-mode is a stnadard part of Emacs, it also needs to > be implemented in something which is also available on all the platforms > emacs is on without needing the user to install additional software. > > The other issue is that you would need another skill in order to > maintain/extend org-mode. In addition to elisp, you will also need to > know whatever the parser implementation language is. > > A third negative is that if the parser was in a different language to > elisp, the interface between the rest of org mode (in elisp) and the > parser would become an issue. At the moment, there are far fewer > barriers as it is all elisp. However, if part of the system is in > another language, you are now restricted to whatever defined interface > exists. This would likely also have performance issues and overheads > associated with translating from one format to another etc. > > So, in short, the chances of org mode using a parser written in > something other than elisp is pretty close to 0. This leaves you with 2 > options - > > 1. Implement another external tool which can parse org-files. As > metnioned above, this is a non-trivial task and will likely be difficult > to maintain. Probably not the best first choice. > > 2. Provide some details about your workflow where you believe you need > to use external tools to process the org-files. It is very likely there > are alternative approaches to give you the result you want, but without > the need to do external parsing of org-files. There isn't sufficient > details in the examples you mention to provide any specific details. > However, I have used org-mode for reporting, invoicing, time tracking, > documentation, issue/request tracking, project planning and project > management and never needed to parse my org files with an external tool. > I have exported the data in different formats which have then been > processed by other tools and I have tweaked my setup to support various > enterprise/corporate standards or requirements (logos, corporate > colours, report formats, etc). Sometimes these tweaks are trivial and > others require more extensive effort. Often, others have had to do > something the same or similar and have working examples etc. > > So my recommendation is post some messages to this list with details on > what you need to try and do and see what others can suggest. I would > keep each post to a single item rather than one long post with multiple > requests. From watching this list, I've often see someone post a "How > can I ..." question only to get the answer "Oh, that is already > built-in, just do .....". Org is a large application with lots of > sophisticated power that isn't always obvious from just reading the > manual. > > So, I keep clock times for work in org mode, this is very handy. However, my customers require that I use their service to provide the times. They do offer API. So basically I'm using elisp to parse org, make API calls, and at the same time generate CSV reports with a Python interop with org babel (because my elisp is just too bad to do that). If I had access to some org parser, I'd pick a language that would be more comfortable for me to get the job done. I guess it can all be done in elisp, however this is just a tool for me alone and I have limited time resources on hacking things for myself :) Another one is generating total hours report for day/week/month to put into my awesomewm toolbar. I ended up using orgstat https://github.com/volhovM/orgstat however the author is creating his own DSL in YAML and I guess things were much better off if it all stayed in some Scheme :) Anyways, my parser needs aren't that sophisticated: just parse the file, return headings with clock drawers. I tried the common lisp library but got frustrated after fiddling with it for couple of hours. Best, Przemek