From mboxrd@z Thu Jan 1 00:00:00 1970
From: Feng Shu
Subject: Re: html to org-mode
Date: Sat, 04 Jan 2014 12:56:32 +0800
Message-ID: <87ha9kuyyn.fsf@news.tumashu-localhost.org>
Mime-Version: 1.0
Content-Type: text/plain
In-Reply-To: (John Kitchin's message of "Fri, 3 Jan 2014 21:40:14 -0500")
List-Id: "General discussions about Org-mode."
To: emacs-orgmode@gnu.org

John Kitchin writes:

> Hi everyone,
>
> I was playing around with org-rss today, and it is pretty cool.
> I would like to customize the way the subheading bodies look though,
> primarily to unescape some html things like &lt;, to get rid of all
> the html tags, convert <a href> to org-mode links, to download <img src>
> so they can be displayed, etc...
>
> for example a body of an rss entry looks like:
>
> Philip Herron: Cython Book
> http://redbrain.co.uk/?p=147
> http://redbrain.co.uk/cython-book/

> Hey all i thought i should really share that i actually wrote a book on
> Cython. The book has detailed examples and even shows you how you can
> extend native C/C++ applications in python by doing it for Tmux.
> <a href="http://bit.ly/195ahQs">http://bit.ly/195ahQs</a>

> <a href="http://redbrain.co.uk/wp-content/uploads/2013/12/photo.jpg"><img
> class="aligncenter size-full wp-image-148" alt="photo"
> src="http://redbrain.co.uk/wp-content/uploads/2013/12/photo.jpg"
> width="640" height="480" /></a>The code can be found: <a
> href="https://github.com/redbrain/cython-book">https://github.com/redbrain/cython-book</a>

> Tue, 10 Dec 2013 14:45:08 +0000
>
> I would like this simplified to something like:
> Philip Herron: Cython Book
>
> http://redbrain.co.uk/?p=147
>
> http://redbrain.co.uk/cython-book/
> Hey all i thought i should really share that i actually wrote a book
> on Cython. The book has detailed examples and even shows you how you
> can extend native C/C++ applications in python by doing it for Tmux.
> http://bit.ly/195ahQs
>
> [[feed-images/photo.jpg]]
>
> The code can be found: https://github.com/redbrain/cython-book
>
> basically, get the html code as close to org as reasonable. i found a
> way to get an html parse tree (libxml-parse-html-region start end),
> but I can't figure out how to convert that to the text I want.
>
> Has anyone done anything like this?
>
> John

Maybe eww can help you: it renders HTML with shr.el, which works on the
parse tree that libxml-parse-html-region returns.

> -----------------------------------
> John Kitchin
> Associate Professor
> Doherty Hall A207F
> Department of Chemical Engineering
> Carnegie Mellon University
> Pittsburgh, PA 15213
> 412-268-7803
> http://kitchingroup.cheme.cmu.edu
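
If you would rather walk the parse tree by hand, here is a minimal sketch
of one way to turn it into Org text. It is untested against real feeds and
rests on some assumptions: Emacs 24.4 or later built with libxml2, where
each node looks like (TAG ATTRS . CHILDREN), ATTRS is an alist such as
((href . "...")), and text children are plain strings. The names
my/dom-to-org and my/html-region-to-org are made up for this example; they
are not part of Org or Emacs. Images are only linked by their src URL,
nothing is downloaded, and whitespace inside text nodes is left alone.

```elisp
;; Sketch only: `my/dom-to-org' and `my/html-region-to-org' are
;; hypothetical helpers, not functions provided by Org or Emacs.

(defun my/dom-to-org (node)
  "Return NODE, a tree from `libxml-parse-html-region', as Org text.
Assumes the (TAG ATTRS . CHILDREN) node format of Emacs 24.4+.
Only <a>, <img>, <p> and <br> get special handling."
  (if (stringp node)
      ;; Text node: the parser has already decoded entities like &lt;.
      node
    (let* ((tag (car node))
           (attrs (cadr node))
           ;; Convert the children first, then wrap them per tag.
           (body (mapconcat #'my/dom-to-org (cddr node) "")))
      (cond
       ;; Drop content that should never show up in the entry body.
       ((memq tag '(comment script style head)) "")
       ((eq tag 'a)
        (let ((href (cdr (assq 'href attrs))))
          (cond ((null href) body)
                ;; Link text is empty or equals the URL: a bare URL is
                ;; already a link in Org.
                ((member body (list "" href)) href)
                ;; Link wrapping an image: keep just the image link.
                ((string-prefix-p "[[" body) body)
                (t (format "[[%s][%s]]" href body)))))
       ((eq tag 'img)
        (let ((src (cdr (assq 'src attrs))))
          (if src (format "[[%s]]" src) "")))
       ((eq tag 'br) "\n")
       ((eq tag 'p) (concat body "\n\n"))
       ;; Any other tag: keep the text, forget the markup.
       (t body)))))

(defun my/html-region-to-org (start end)
  "Replace the HTML between START and END with simplified Org text."
  (interactive "r")
  (let ((dom (libxml-parse-html-region start end)))
    ;; Leave the region untouched if the parser returned nothing.
    (when dom
      (delete-region start end)
      (goto-char start)
      (insert (my/dom-to-org dom)))))
```

To try it, mark the HTML body of one entry and run
M-x my/html-region-to-org. Getting [[feed-images/photo.jpg]] instead of
the remote URL would need an extra download step in the img branch, which
this sketch leaves out.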