[问题] 常规表达式找副词

楼主: yuseke (大将)   2015-12-08 17:34:57
as title
这两天在看RE的部分
根据PYTHON 基金会的网站:
https://docs.python.org/2/library/re.html
7.2.5.7. Finding all Adverbs and their Positions¶
If one wants more information about all matches of a pattern than the matched
text, finditer() is useful as it provides instances of MatchObject instead of
strings. Continuing with the previous example, if one was a writer who wanted
to find all of the adverbs and their positions in some text, he or she would
use finditer() in the following manner:
>>> text = "He was carefully disguised but captured quickly by police."
>>> for m in re.finditer(r"\w+ly", text):
... print '%02d-%02d: %s' % (m.start(), m.end(), m.group(0))
07-16: carefully
40-47: quickly
关于""""for m in re.finditer(r"\w+ly", text):""""
这个部份我有一个疑问,
可是并不是所有的adv都有ly字尾.......
这种情形该怎么处理呢?
作者: tiefblau (tiefblau)   2015-12-08 17:39:00
他只是在举例而已......
楼主: yuseke (大将)   2015-12-08 17:43:00
那如果我真的遇到这个问题的时候该怎么办呢?
作者: uranusjr (←這人是超級笨蛋)   2015-12-08 17:45:00
只能用字典来做, 或者因为有些副词和其他词性拼法相同,大概只能用 machine learning 来做; 自然语言分析很难的
作者: bibo9901 (function(){})()   2015-12-08 20:56:00
用NLTK做POS tagging 然后查 wordnet (?)

Links booklink

Contact Us: admin [ a t ] ucptt.com