Python论坛  - 讨论区

标题:Re: [python-chinese] 如何处理非well-formed 的XML?

2005年08月30日 星期二 12:48

Jerry jetport at gmail.com
Tue Aug 30 12:48:43 HKT 2005

谢谢,limodou & Wang
可惜MSXML 需要增加lib
我尝试用limodou大哥的方法, 用 FIXED=''TRUE" 替换 FIXED 处理

-- 
If U can see it, then U can do it
If U just believe it, there's nothing to it
I believe U can fly 
From Jetport at gmail.com

[导入自Mailman归档:http://www.zeuux.org/pipermail/zeuux-python]

2005年08月31日 星期三 09:48

Bruce Wang number5 at gmail.com
Wed Aug 31 09:48:00 HKT 2005

On 8/30/05, Jerry <jetport at gmail.com> wrote:
> 谢谢,limodou & Wang
> 可惜MSXML 需要增加lib
> 我尝试用limodou大哥的方法, 用 FIXED=''TRUE" 替换 FIXED 处理
> 
也许你可以试一下 BeautifulSoup [1]
他可以处理非 well-formed xml 或者html

[1] http://www.crummy.com/software/BeautifulSoup/
-- 
simple is good

[导入自Mailman归档:http://www.zeuux.org/pipermail/zeuux-python]

2005年08月31日 星期三 14:30

Jerry jetport at gmail.com
Wed Aug 31 14:30:41 HKT 2005

> On 8/30/05, Jerry <jetport at gmail.com> wrote:
> > 谢谢,limodou & Wang
> > 可惜MSXML 需要增加lib
> > 我尝试用limodou大哥的方法, 用 FIXED=''TRUE" 替换 FIXED 处理
> >
> 也许你可以试一下 BeautifulSoup [1]
> 他可以处理非 well-formed xml 或者html
> 
> [1] http://www.crummy.com/software/BeautifulSoup/
 不错,贴一下BeautifulSoup的应用例子:
 
Duncan Gough's Fotopic backup <http://www.suttree.com/code/fotopic/> uses 
Beautiful Soup to scrape the Fotopic website. 
 
Iñigo Serna's googlenews.py<http://inigo.katxi.org/devel/misc/googlenews.py>uses
Beautiful Soup to scrape Google News (it's in the parse_entry and
parse_category functions) 
 
The Weather Office Screen Scraper
<http://potheads.ca/~cgm/weatheroffice/>uses Beautiful Soup to scrape
the Canadian government's weather office site.

 
News Clues <http://www.cs.duke.edu/~pbh/newsclues.html> uses Beautiful Soup 
to parse RSS feeds. 
 
BlinkFlash <http://blinkflash.sourceforge.net/> uses Beautiful Soup to 
automate form submission for an online service. 
 
The linky <http://www.voidspace.org.uk/python/programs.shtml#linky> link 
checker uses Beautiful Soup to find a page's links and images that need 
checking. 
Matt Croydon<http://www.postneo.com/2005/03/28/mobile-screen-scraping-with-beautifulsoup-and-python-for-series-60>got
Beautiful Soup
1.x to work on his Nokia Series 60 smartphone. C.R.
Sandeep<http://sandeep.weblogs.us/archives/024473.html>wrote a
real-time currency converter for the Series 60 using Beautiful Soup
(source not available yet). 
Here's a short script<http://jacobian.org/recipes/archives/2005/03/21/fixing-tracks-bought-from-allofmp3com/>from
jacobian.org <http://jacobian.org> to fix the metadata on music files 
downloaded from allofmp3.com <http://allofmp3.com>. 
The Python Community Server <http://www.pycs.net/> uses Beautiful Soup in 
its spam detector. 



-- 
If U can see it, then U can do it
If U just believe it, there's nothing to it
I believe U can fly 
From Jetport at gmail.com
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.exoweb.net/pipermail/python-chinese/attachments/20050831/48de25da/attachment.htm

[导入自Mailman归档:http://www.zeuux.org/pipermail/zeuux-python]

如下红色区域有误,请重新填写。

    你的回复:

    请 登录 后回复。还没有在Zeuux哲思注册吗?现在 注册 !

    Zeuux © 2025

    京ICP备05028076号