Xidel - HTML/XML data extraction tool

"Xidel is a command line tool to download and extract data from html/xml pages." Neat -- I usually wind up writing some Python to do this kind of thing.

to html software to-package xml xpath ... on 03 October 2014

British Library: Free Data Services

The BL catalogue available in convenient formats (note it's also downloadable from archive.org). I should make ALMS use this as a source; it can't possibly be worse than Amazon's data!

to bibliography books british-library data library xml ... on 01 July 2014

Ian Bicking: a blog :: lxml: an underappreciated web scraping library

Useful overview of lxml, which is the module I really ought to use for random XML/HTML parsing.

to html lxml parsing python xml ... on 30 October 2010

XML Alternatives

to software xml ... on 05 June 2005

Tags related to xml

- xml
 
1 bibliography
1 books
1 british-library
1 data
2 html
1 library
1 lxml
1 parsing
1 python
2 software
1 to-package
4 xml
1 xpath