Python forum post:

Thu Feb 17 17:43:42 HKT 2005

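This is a simple test of parsing an XML file with minidom. When the XML file contains Chinese characters, minidom's parse method raises an exception.
If the file contains no Chinese characters, it parses correctly. How can I parse an XML file that contains Chinese?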

The code in my test.py is as follows:

# -*- coding: gb2312 -*-
import xml.dom.minidom
dom = xml.dom.minidom.parse('brz_sys.xml')
remote_ip_list = dom.getElementsByTagName('breeze_sys_config')[0].getElementsByTagName('remote_ip')

for remote_ip_node in remote_ip_list:
    print remote_ip_node.getElementsByTagName('ip')[0].firstChild.toxml()   

The content of the XML file is as follows:

        10.71.105.27
        root
        中文
        #

The exception raised is as follows:
Traceback (most recent call last):
  File "D:\py\test.py", line 6, in ?
    dom = xml.dom.minidom.parse('brz_sys.xml')
  File "c:\python24\lib\xml\dom\minidom.py", line 1915, in parse
    return expatbuilder.parse(file)
  File "C:\Python24\lib\xml\dom\expatbuilder.py", line 924, in parse
    result = builder.parseFile(fp)
  File "C:\Python24\lib\xml\dom\expatbuilder.py", line 207, in parseFile
    parser.Parse(buffer, 0)
xml.parsers.expat.ExpatError: not well-formed (invalid token): line 9, column 11
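
One possible cause, not confirmed anywhere in this post, is that brz_sys.xml is saved in GB2312/GBK while the parser reads it as UTF-8, so the bytes of the Chinese text look like an invalid token. Under that assumption, a minimal sketch of a workaround is to decode the bytes yourself and give minidom UTF-8 instead. The helper name parse_gb_xml, the "gbk" default, and the <note> element in the sample are hypothetical; only breeze_sys_config, remote_ip and ip come from the code above. The sketch is written for current Python rather than Python 2.4.

```python
import re
import xml.dom.minidom


def parse_gb_xml(raw_bytes, source_encoding="gbk"):
    """Hypothetical helper: decode legacy-encoded XML bytes, then parse as UTF-8."""
    # Assumption: the file really is GB2312/GBK; gbk is a superset of gb2312.
    text = raw_bytes.decode(source_encoding)
    # Remove any existing XML declaration, because its encoding label
    # would no longer match the re-encoded bytes.
    text = re.sub(r"^\s*<\?xml[^>]*\?>", "", text, count=1)
    # Without a declaration the parser assumes UTF-8, which now matches.
    return xml.dom.minidom.parseString(text.encode("utf-8"))


# Stand-in for the real file content; <note> is an invented element name.
sample = (
    u'<?xml version="1.0" encoding="gb2312"?>'
    u"<breeze_sys_config><remote_ip><ip>10.71.105.27</ip>"
    u"<note>\u4e2d\u6587</note></remote_ip></breeze_sys_config>"
).encode("gbk")

dom = parse_gb_xml(sample)
ip_text = dom.getElementsByTagName("ip")[0].firstChild.data
note_text = dom.getElementsByTagName("note")[0].firstChild.data
```

For the real file, the bytes would come from open('brz_sys.xml', 'rb').read() in place of the sample. If the file turns out to be in some other encoding, the decode step will either fail or produce wrong characters, so the encoding should be checked first.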


Subject: [python-chinese] xml.dom.minidom raises an exception when parsing an XML file that contains Chinese. How can this be solved?