Tokyo Dystopia 全文搜索

最新推荐文章于 2023-08-12 17:52:40 发布

zuroc

最新推荐文章于 2023-08-12 17:52:40 发布

阅读量111

点赞数

分类专栏：网络

网络专栏收录该内容

75 篇文章 0 订阅

订阅专栏

[url]http://d.hatena.ne.jp/perezvon/20080921/1222016246[/url]

>>> from tokyodystopia import TokyoDystopia
>>> db = TokyoDystopia("/tmp/test.db", 255)
>>> db.put(0, u"仙台".encode("utf8"), " ")
1
>>> db.put(1, u"仙台広島".encode("utf8"), " ")
1
>>> db.put(2, u"広島山形湘南山形".encode("utf8"), " ")
1
>>> db.search(u"仙台".encode("utf-8"))
2
[0L, 1L]
>>> db.search(u"広島".encode("utf-8"))
2
[1L, 2L]
>> db.close()

The function `tcidbsearch2' searches with a compound expression. In the compound expression, tokens are separated by one or more white space characters. If one token is specified, records including the specified pattern are searched for. Upper or lower case is not distinguished. Accent marks and diacritical marks are ignored. If two or more tokens are specified, records including all of the patterns are searched for. The compound expression includes the following sub expressions.

* A B : searches for records including the two tokens.
* A && B : searches for records including the two tokens.
* A || B : searches for records including the one or both of the two tokens.
* "A B..." : searches for records including the phrase.
* ［A］ : searches for records including words exactly matching the token.
* ［A*］ : searches for records including words beginning with the token.
* ［*A］ : searches for records including words ending with the token.
* ［［A : searches for records beginning with the token.
* A］］ : searches for records ending with the token.

Note that the priority of "||" is higher than the one of "&&".

zuroc

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
Tokyo Dystopia 全文搜索

[url]http://d.hatena.ne.jp/perezvon/20080921/1222016246[/url]>>> from tokyodystopia import TokyoDystopia>>> db = TokyoDystopia("/tmp/test.db", 255)>>> db.put(0, u"仙台".encode("utf8"), "
复制链接

扫一扫