From mboxrd@z Thu Jan 1 00:00:00 1970 From: "Thomas S. Dye" Subject: Re: Reproducible Research Template Date: Wed, 5 Jan 2011 06:24:01 -1000 Message-ID: References: <1294170927.2599.110.camel@Yates> Mime-Version: 1.0 (Apple Message framework v936) Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes Content-Transfer-Encoding: 7bit Return-path: Received: from [140.186.70.92] (port=35669 helo=eggs.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1PaW9g-0006fE-Bw for emacs-orgmode@gnu.org; Wed, 05 Jan 2011 11:24:09 -0500 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1PaW9f-0008Pn-9B for emacs-orgmode@gnu.org; Wed, 05 Jan 2011 11:24:08 -0500 Received: from oproxy3-pub.bluehost.com ([69.89.21.8]:57644) by eggs.gnu.org with smtp (Exim 4.71) (envelope-from ) id 1PaW9f-0008PJ-32 for emacs-orgmode@gnu.org; Wed, 05 Jan 2011 11:24:07 -0500 In-Reply-To: <1294170927.2599.110.camel@Yates> List-Id: "General discussions about Org-mode." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org Errors-To: emacs-orgmode-bounces+geo-emacs-orgmode=m.gmane.org@gnu.org To: Andy Choens Cc: Org-mode ml On Jan 4, 2011, at 9:55 AM, Andy Choens wrote: > I am developing a reproducible research template for R. I am trying > to implement most of a research "compendium" in org. I say "most" > because I am going to allow the actual data to exist outside of org, > simply because most of the data I work with is relational or very > large, which makes storage in plain text problematic or impossible. > > Has anyone ever implemented a reproducible research template in org > that I can look at? I looked at the stuff on Worg and there are > examples of a project, but not a template. > > In a nutshell, I am trying to do two things. > 1) Provide a structure for reproducible research / programming ; > 2) Provide a small set of helper functions. > > But, I don't want the helper functions to get in the way. I have > considered two options: > 1) Store the example code / template stuff in subheadings > 2) Store the example code / template stuff in an external file. > > Using subheadings is tempting, but I'm afraid that org-babel-execute- > buffer would cause problems for users who don't use all of the > template functions. > > Using an external file, similar to the Lobrary of Babel is also > tempting. It would allow me to make a cleaner template for structure > and still allow users to access any helper functions. Is there a way > to link to an external file, other than the Library of Babel? If so, > how do I do this? > > Does anyone have any opinions about hiding/linking/importing example > code in a template? > > I certainly appreciate any thoughts. > > --andy Aloha Andy, Great idea. I'll be interested to see where you go with this. You can link to an external file other than the one holding the library of babel using the library of babel facility. I use this in my config file: #+source: load-local-lob #+begin_src emacs-lisp :tangle yes (org-babel-lob-ingest "~/org/local-lob.org") #+end_src I put functions there that are useful to me in more than one Org-mode buffer, but that are not likely to be useful to other Org-mode users (and therefore fit for the library of babel). Most of my projects store data in a MySQL database. My projects define queries that relate tables to one another, but the results are typically something that Org-mode understands---a flat table or a single value. The reproducible research functions that I write break the analysis workflow down into separate tasks so that the results of a SQL query are written to the Org-mode buffer. Subsequent steps in the workflow refer to the named results block. This way, the Org-mode project contains the actual data used in the analysis without the need to reproduce a relational structure. The functions that access the SQL database go in the local library of babel because I don't want to give users of a RR document direct access to my database. If I manipulate query results after they are written into the Org-mode buffer, I also like to write out intermediate results in many situations. My goal in writing RR code is not speed or compactness, but maximum transparency. Because Org-mode is language agnostic it is frequently the case that the files created by others contain code in languages I don't understand. If these are long and complex, then I'm at a loss to what is actually going on. If they are short, and intermediate results are written out in the buffer, then it is easier for me to follow the analysis. I look forward to learning from your work with RR templates. All the best, Tom