python读取配置文件不更改大小写,Python：使用prefixStringSuffix替换字符串，保留原始大小写，但在搜索匹配项时忽略大小写...

最新推荐文章于 2022-10-31 14:54:53 发布

赵智沉

最新推荐文章于 2022-10-31 14:54:53 发布

阅读量153

点赞数

文章标签： python读取配置文件不更改大小写

So what I'm trying to do is replace a string "keyword" with

"keyword"

in a larger string.

Example:

myString = "HI there. You should higher that person for the job. Hi hi."

keyword = "hi"

result I would want would be:

result = "HI there. You should higher that person for the job.

Hi hi."

I will not know what the keyword until the user types the keyword

and won't know the corpus (myString) until the query is run.

I found a solution that works most of the time, but has some false positives,

namely it would return "higher"which is not what I want. Also note that I

am trying to preserve the case of the original text, and the matching should take

place irrespective of case. so if the keyword is "hi" it should replace

HI with HI and hi with hi.

The closest I have come is using a slightly derived version of this:

http://code.activestate.com/recipes/576715/

but I still could not figure out how to do a second pass of the string to fix all of the false positives mentioned above.

Or using the NLTK's WordPunctTokenizer (which simplifies some things like punctuation)

but I'm not sure how I would put the sentences back together given it does not

have a reverse function and I want to keep the original punctuation of myString. Essential, doing a concatenation of all the tokens does not return the original

string. For example I would not want to replace "7 - 7" with "7-7" when regrouping the tokens into its original text if the original text had "7 - 7".

Hope that was clear enough. Seems like a simple problem, but its a turned out a little more difficult then I thought.

解决方案

This ok?

>>> import re

>>> myString = "HI there. You should higher that person for the job. Hi hi."

>>> keyword = "hi"

>>> search = re.compile(r'\b(%s)\b' % keyword, re.I)

>>> search.sub('\\1', myString)

'HI there. You should higher that person for the job. Hi hi.'

The key to the whole thing is using word boundaries, groups and the re.I flag.

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python读取配置文件不更改大小写,Python：使用prefixStringSuffix替换字符串，保留原始大小写，但在搜索匹配项时忽略大小写...

So what I'm trying to do is replace a string "keyword" with"keyword"in a larger string.Example:myString = "HI there. You should higher that person for the job. Hi hi."keyword = "hi"result I would wan...
复制链接

扫一扫

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。