Python论坛  - 讨论区

标题:[python-chinese] [TIPS]如何找出近似的内容

2004年09月24日 星期五 11:09

gentoo cn gentoo.cn at gmail.com
Fri Sep 24 11:09:45 HKT 2004

就像今天有位兄弟提出的那样各种邮件客户端在回复邮件时title里加的字符位置各不相同
那么有什么方法来快速的规类呢?

在python的标准库里有一个difflib
其中的get_close_matches很符合我们的需求
get_close_matches( word, possibilities[, n[, cutoff]]) 

Return a list of the best ``good enough'' matches. word is a sequence
for which close matches are desired (typically a string), and
possibilities is a list of sequences against which to match word
(typically a list of strings).
Optional argument n (default 3) is the maximum number of close matches
to return; n must be greater than 0.

Optional argument cutoff (default 0.6) is a float in the range [0, 1].
Possibilities that don't score at least that similar to word are
ignored.

The best (no more than n) matches among the possibilities are returned
in a list, sorted by similarity score, most similar first.

例如:
>>> difflib.get_close_matches("helo", ['test', 'Helo', 're:helo',
'Re:helo', 'RE:   helo', 'helo Re:'], 5)
['Helo', 're:helo', 'Re:helo', 'helo Re:']
>>> 

-- 
I'm the one, powered by nEO


[导入自Mailman归档:http://www.zeuux.org/pipermail/zeuux-python]

2004年09月24日 星期五 11:40

March Liu March.Liu at gmail.com
Fri Sep 24 11:40:54 HKT 2004

不错,这个话题值得讨论一下


On Fri, 24 Sep 2004 11:09:45 +0800, gentoo cn <gentoo.cn at gmail.com> wrote:
> 就像今天有位兄弟提出的那样各种邮件客户端在回复邮件时title里加的字符位置各不相同
> 那么有什么方法来快速的规类呢?
> 
> 在python的标准库里有一个difflib
> 其中的get_close_matches很符合我们的需求
> get_close_matches( word, possibilities[, n[, cutoff]])
> 
> Return a list of the best ``good enough'' matches. word is a sequence
> for which close matches are desired (typically a string), and
> possibilities is a list of sequences against which to match word
> (typically a list of strings).
> Optional argument n (default 3) is the maximum number of close matches
> to return; n must be greater than 0.
> 
> Optional argument cutoff (default 0.6) is a float in the range [0, 1].
> Possibilities that don't score at least that similar to word are
> ignored.
> 
> The best (no more than n) matches among the possibilities are returned
> in a list, sorted by similarity score, most similar first.
> 
> 例如:
> >>> difflib.get_close_matches("helo", ['test', 'Helo', 're:helo',
> 'Re:helo', 'RE:   helo', 'helo Re:'], 5)
> ['Helo', 're:helo', 'Re:helo', 'helo Re:']
> >>>
> 
> --
> I'm the one, powered by nEO
> _______________________________________________
> python-chinese list
> python-chinese at lists.python.cn
> http://python.cn/mailman/listinfo/python-chinese
> 



-- 
欢迎访问:http://blog.csdn.net/ccat

刘鑫
March.Liu


[导入自Mailman归档:http://www.zeuux.org/pipermail/zeuux-python]

如下红色区域有误,请重新填写。

    你的回复:

    请 登录 后回复。还没有在Zeuux哲思注册吗?现在 注册 !

    Zeuux © 2025

    京ICP备05028076号