Converting XML To JSON Using Python?


Answer :

xmltodict (full disclosure: I wrote it) can help you convert your XML to a dict+list+string structure, following this "standard". It is Expat-based, so it's very fast and doesn't need to load the whole XML tree in memory.



Once you have that data structure, you can serialize it to JSON:



import xmltodict, json

o = xmltodict.parse('<e> <a>text</a> <a>text</a> </e>')
json.dumps(o) # '{"e": {"a": ["text", "text"]}}'


There is no "one-to-one" mapping between XML and JSON, so converting one to the other necessarily requires some understanding of what you want to do with the results.



That being said, Python's standard library has several modules for parsing XML (including DOM, SAX, and ElementTree). As of Python 2.6, support for converting Python data structures to and from JSON is included in the json module.



So the infrastructure is there.



You can use the xmljson library to convert using different XML JSON conventions.



For example, this XML:



<p id="1">text</p>


translates via the BadgerFish convention into this:



{
'p': {
'@id': 1,
'$': 'text'
}
}


and via the GData convention into this (attributes are not supported):



{
'p': {
'$t': 'text'
}
}


... and via the Parker convention into this (attributes are not supported):



{
'p': 'text'
}


It's possible to convert from XML to JSON and from JSON to XML using the same
conventions:



>>> import json, xmljson
>>> from lxml.etree import fromstring, tostring
>>> xml = fromstring('<p id="1">text</p>')
>>> json.dumps(xmljson.badgerfish.data(xml))
'{"p": {"@id": 1, "$": "text"}}'
>>> xmljson.parker.etree({'ul': {'li': [1, 2]}})
# Creates [<ul><li>1</li><li>2</li></ul>]


Disclosure: I wrote this library. Hope it helps future searchers.



Comments

Popular posts from this blog

Converting A String To Int In Groovy

"Cannot Create Cache Directory /home//.composer/cache/repo/https---packagist.org/, Or Directory Is Not Writable. Proceeding Without Cache"

Android SDK Location Should Not Contain Whitespace, As This Cause Problems With NDK Tools