Python论坛的帖子：

Fri Sep 24 11:09:45 HKT 2004

就像今天有位兄弟提出的那样各种邮件客户端在回复邮件时title里加的字符位置各不相同
那么有什么方法来快速的规类呢？

在python的标准库里有一个difflib
其中的get_close_matches很符合我们的需求
get_close_matches( word, possibilities[, n[, cutoff]]) 

Return a list of the best ``good enough'' matches. word is a sequence
for which close matches are desired (typically a string), and
possibilities is a list of sequences against which to match word
(typically a list of strings).
Optional argument n (default 3) is the maximum number of close matches
to return; n must be greater than 0.

Optional argument cutoff (default 0.6) is a float in the range [0, 1].
Possibilities that don't score at least that similar to word are
ignored.

The best (no more than n) matches among the possibilities are returned
in a list, sorted by similarity score, most similar first.

例如：
>>> difflib.get_close_matches("helo", ['test', 'Helo', 're:helo',
'Re:helo', 'RE:   helo', 'helo Re:'], 5)
['Helo', 're:helo', 'Re:helo', 'helo Re:']
>>> 

-- 
I'm the one, powered by nEO

标题：[python-chinese] [TIPS]如何找出近似的内容