How to consume XML from RESTful web services using Django / Python?

Question

Should I use PyXML or what's in the standard library?

vezult · Accepted Answer · 2009-04-30 01:26:40Z

10

ElementTree is provided as part of the standard Python libs. ElementTree is pure python, and cElementTree is the faster C implementation:

# Try to use the C implementation first, falling back to python
try:
    from xml.etree import cElementTree as ElementTree
except ImportError, e:
    from xml.etree import ElementTree

Here's an example usage, where I'm consuming xml from a RESTful web service:

def find(*args, **kwargs):
    """Find a book in the collection specified"""

    search_args = [('access_key', api_key),]
    if not is_valid_collection(kwargs['collection']):
        return None
    kwargs.pop('collection')
    for key in kwargs:
        # Only the first keword is honored
        if kwargs[key]:
            search_args.append(('index1', key))
            search_args.append(('value1', kwargs[key]))
            break

    url = urllib.basejoin(api_url, '%s.xml' % 'books')
    data = urllib.urlencode(search_args)
    req = urllib2.urlopen(url, data)
    rdata = []
    chunk = 'xx'
    while chunk:
        chunk = req.read()
        if chunk:
            rdata.append(chunk)
    tree = ElementTree.fromstring(''.join(rdata))
    results = []
    for i, elem in enumerate(tree.getiterator('BookData')):
        results.append(
               {'isbn': elem.get('isbn'),
                'isbn13': elem.get('isbn13'),
                'title': elem.find('Title').text,
                'author': elem.find('AuthorsText').text,
                'publisher': elem.find('PublisherText').text,}
             )
    return results

edited Apr 30, 2009 at 1:26

answered Apr 30, 2009 at 1:19

vezult

5,2431 gold badge28 silver badges41 bronze badges

Sign up to request clarification or add additional context in comments.

3 Comments

rick Over a year ago

vezult, how come sometimes you use elem.get() and sometimes you use elem.find().text?

vezult Over a year ago

@rick: elem.get() is fetching the value of an element attribute, while elem.find() is searching for elements contained within the elem element.

jfs Over a year ago

tree = ElementTree.parse(urllib2.urlopen(url, data)) should work without rdata list.

Justin · Accepted Answer · 2009-04-30 01:17:45Z

3

I always prefer to use the standard library when possible. ElementTree is well known amongst pythonistas, so you should be able to find plenty of examples. Parts of it have also been optimized in C, so it's quite fast.

http://docs.python.org/library/xml.etree.elementtree.html

answered Apr 30, 2009 at 1:17

Justin

1551 silver badge6 bronze badges

Comments

Henrik Lied · Accepted Answer · 2010-02-27 13:43:16Z

0

There's also BeautifulSoup, which has an API some might prefer. Here's an example on how you can extract all tweets that have been favorited from Twitter's Public Timeline:

from BeautifulSoup import BeautifulStoneSoup
import urllib

url = urllib.urlopen('http://twitter.com/statuses/public_timeline.xml').read()
favorited = []

soup = BeautifulStoneSoup(url)
statuses = soup.findAll('status')

for status in statuses:
    if status.find('favorited').contents != [u'false']:
        favorited.append(status)

answered Feb 27, 2010 at 13:43

Henrik Lied

1352 silver badges8 bronze badges

3 Comments

mlissner Over a year ago

Alas, BeautifulSoup is no longer maintained. I would avoid it, and lean towards lxml or ElementTree.

mrkzq Over a year ago

@mlissner I can't see where does it says on BS4 website that it is no longer maintained. Is that really the case?

mlissner Over a year ago

At one point the maintainer was threatening to step down, but it seems that reality never came to pass.

Collectives™ on Stack Overflow

How to consume XML from RESTful web services using Django / Python?

3 Answers 3

3 Comments

Comments

3 Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

3 Comments

Comments

3 Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related