Friday, March 12, 2004, 18:35
I am trying to process a huge Chinese document. The single document is in pure text format and it's nearly 4 MB. I always get an "incomplete multibyte sequence" error when I try to convert the sentences to Unicode. I think the reason is that the Chinese document uses both ASCII punctuation and 2-byte Chinese punctuation. For example, the same document can contain both the ASCII comma "," and the full-width comma "，", and both < > and 《》. Is there any way I can get around this? Don't ask me to fix the Chinese document!
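A minimal sketch of one way to simply get past the bad spots, rather than fix them (assuming the file is mostly GBK-encoded; the file name here is invented), is to decode with a lenient error handler, which substitutes or drops any broken multibyte sequence instead of raising an error:

data = open('huge_chinese.txt', 'rb').read()
text = data.decode('gbk', 'replace')    # or 'ignore' to silently drop bad bytes
print 'decoded', len(text), 'characters'

(Python 2 syntax, matching the rest of the thread. Whether 'replace' or 'ignore' is acceptable depends on whether losing the odd character matters for the later sentence splitting.)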
Friday, March 12, 2004, 21:51
Hello Anthony,

Can you try processing it line by line, rather than all at once?

--
Best regards,
Zoom.Quiet

/=======================================\
]Time is unimportant, only life important![
\=======================================/
Saturday, March 13, 2004, 00:37
My program blocks forever on a line like the following: s.accept(). Ctrl-C does not break out of it, and closing the socket from another thread does not work either. How can I end my program "politely"? Thanks.
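A minimal sketch of one common way out (this is an illustration, not code from the thread; the port number and loop are invented): give the listening socket a timeout, so that accept() returns control periodically and can check a shutdown flag that another thread or a signal handler sets.

import socket

s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
s.bind(('', 8888))          # example port
s.listen(5)
s.settimeout(1.0)           # accept() now raises socket.timeout after 1 second

running = True              # another thread sets this to False to stop us
while running:
    try:
        conn, addr = s.accept()
    except socket.timeout:
        continue            # nobody connected yet; re-check the flag
    conn.close()            # handle the client here
s.close()

Another approach is to have the shutting-down thread make a throwaway connection to the listening port, so that accept() returns once and the loop can notice the flag.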
Saturday, March 13, 2004, 02:52
Yes, I can read it sentence by sentence, but the problem is how I can tell in advance where the Unicode problem is going to occur.

--- "Zoom.Quiet" <zoomq at infopro.cn> wrote:
> Can you try processing it line by line, rather than all at once?
Saturday, March 13, 2004, 04:02
It's a problem with either the Unicode converter or the source text. To find out which, analyze the bytes that cause the problem: what are their decimal or hex values? Then we can see whether it is in fact a valid or an invalid sequence.

Also, please fix your Chinese emails. If Yahoo is the problem, please consider switching to another free email service. Thanks!

John
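A quick sketch of how to get those values (the file name is invented, and GBK is only a guess at the encoding; Python 2.3+ exposes the failure position on the exception object): catch the UnicodeDecodeError and dump the offset and the surrounding bytes in hex.

data = open('huge_chinese.txt', 'rb').read()
try:
    text = data.decode('gbk')
except UnicodeDecodeError, e:
    print 'decode failed at byte offset', e.start, '-', e.reason
    context = data[max(0, e.start - 10):e.start + 10]
    print ' '.join(['%02X' % ord(c) for c in context])

The hex dump around e.start shows whether the offending bytes are a truncated GBK pair, a stray single byte, or text in some other encoding.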
Saturday, March 13, 2004, 06:17
Can you see the Chinese characters now?

Yes, what you say makes very good sense. The following two lines attempt to break the Chinese sentence apart at the punctuation marks:

str = "世界名著《红楼梦》的作者曹雪芹是前清有名的才子。"
alist = re.split('《|》|。', str)

It works fine, and alist contains the three chunks of the sentence, as expected. But if I convert str to Unicode before I call re.split, like so:

str = unicode(str, 'gbk')

then the regular expression passed to re.split won't match anything. I tried converting the punctuation in the regular expression to Unicode as well, like so:

leftbk = '《'
rightbk = '》'
fullstop = '。'

pattern = '\'' + leftbk + '|' + rightbk + '|' + fullstop + '\''
alist = re.split(pattern, str)

It does not work. I am kind of at my wit's end.

--- John Li <johnli at ahlt.net> wrote:
> It's a problem with either the Unicode converter or the source text.
Saturday, March 13, 2004, 13:00
> Can you see the Chinese characters now?

Thanks!

----------------------------------------
import wx, re

str = '世界名著《红楼梦》的作者曹雪芹是前清有名的才子。'
str = unicode(str, 'gbk')

leftbk = unicode('《', 'gbk')
rightbk = unicode('》', 'gbk')
fullstop = unicode('。', 'gbk')

pattern = leftbk + u'|' + rightbk + u'|' + fullstop
alist = re.split(pattern, str)

for x in alist:
    print x.encode('gbk')

--John
Saturday, March 13, 2004, 13:04
> import wx, re

Sorry! There is no need to import wx.

John
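For what it's worth, the same split can be written a touch more compactly with a character class instead of the a|b|c alternation; this is just a variation on John's sketch, with the same assumption that the source file is saved as GBK:

# -*- coding: gbk -*-
import re

text = unicode('世界名著《红楼梦》的作者曹雪芹是前清有名的才子。', 'gbk')
seps = unicode('[《》。]', 'gbk')     # one character class instead of 《|》|。
for chunk in re.split(seps, text):
    print chunk.encode('gbk')

The key point is unchanged: once the text is Unicode, the pattern must be Unicode too, so that each punctuation mark is a single character rather than a pair of GBK bytes.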
Saturday, March 13, 2004, 16:18
Yes, John, thank you very much for your hint and the sample code. I did not realize that I needed to use u'|' for the "or" operator.
Sunday, March 14, 2004, 00:46
> Yes, John, thank you very much for your hint and the sample code. I did not realize that I needed to use u'|' for the "or" operator.

Actually, I don't think that's essential; I was just being careful. I think that if any one of the strings is unicode, then the rest will be automatically converted to unicode, so that the resulting string is unicode.
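That automatic conversion is Python 2's implicit coercion, and a small caveat is worth adding (this example is an illustration, not from the thread): mixing str and unicode only works when the byte string is pure ASCII, which '|' is; a raw GBK byte string mixed into a unicode expression raises UnicodeDecodeError instead.

left = unicode('《', 'gbk')
pattern = left + '|' + unicode('》', 'gbk')   # '|' is plain ASCII, so it is coerced silently
print type(pattern)                           # <type 'unicode'>

try:
    left + '》'              # raw GBK bytes are not ASCII, so coercion fails
except UnicodeDecodeError:
    print 'mixing unicode with non-ASCII bytes raises UnicodeDecodeError'

So writing u'|' explicitly, as John did, is harmless belt-and-braces; the case that really needs care is concatenating unicode with non-ASCII byte strings.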
Sunday, March 14, 2004, 02:07
> I think that if any one of the strings is unicode, then the rest will be automatically converted to unicode, so that the resulting string is unicode.

Hi John, are you serious about the above?