« earlier | later » Page 1 of 1
Xidel - HTML/XML data extraction tool
"Xidel is a command line tool to download and extract data from html/xml pages." Neat -- I usually wind up writing some Python to do this kind of thing.
to html software to-package xml xpath ... on 03 October 2014
British Library: Free Data Services
The BL catalogue available in convenient formats (note it's also downloadable from archive.org). I should make ALMS use this as a source; it can't possibly be worse than Amazon's data!
to bibliography books british-library data library xml ... on 01 July 2014
Ian Bicking: a blog :: lxml: an underappreciated web scraping library
Useful overview of lxml, which is the module I really ought to use for random XML/HTML parsing.
« earlier | later » Page 1 of 1
- xml | |
1 | bibliography |
1 | books |
1 | british-library |
1 | data |
2 | html |
1 | library |
1 | lxml |
1 | parsing |
1 | python |
2 | software |
1 | to-package |
4 | xml |
1 | xpath |
tasty by Adam Sampson.