ElasticSearch入门笔记

最新推荐文章于 2024-01-21 15:51:30 发布

gaohe7091

最新推荐文章于 2024-01-21 15:51:30 发布

阅读量710

点赞数

分类专栏： ElasticSearch 文章标签： ElasticSearch

ElasticSearch 专栏收录该内容

1 篇文章 0 订阅

订阅专栏

出处：http://www.nosqldb.cn/1368777378160.html

[隐藏]

ElasticSearch 是构建在Apache Lucene之上的的搜索引擎服务，开源（Apache2协议），分布式，RESTful。安装方便，使用简单。

官方站点：http://www.elasticsearch.com/
中文站点：http://es-cn.medcl.net/

1.安装

必须先安装Java环境，并设置 JAVA_HOME => C:\Program Files\Java\jdk1.6.0_18

elasticsearch-rtf 中文入门集成包 https://github.com/medcl/elasticsearch-rtf
使用git签出，下载到本地。windows下，执行bin下面的elasticsearch.bat。linux下，执行bin下面或者service下面elasticsearch。

Pyes https://github.com/aparo/pyes 更多客户端

Bottle http://bottlepy.org/docs/dev/

2.角色关系对照

elasticsearch 跟 MySQL 中定义资料格式的角色关系对照表如下

MySQL elasticsearch
database index
table type

table schema mapping
row document
field field

3.索引映射

 
     #创建索引 
     
 $ curl  
     -XPUT http: 
     //localhost: 
     9200 
     /test-index 
     
 
     
 
     #创建Mapping 
     
 $ curl  
     -XPUT http: 
     //localhost: 
     9200 
     /test-index 
     /test-type 
     /_mapping  
     -d  
     '{
     "properties" : {
         "name" : { "type" : "string" }
     }
 }' 
    

 
     @route 
     ( 
     '/indexsetting/' 
     ) 
     
 
     def indexmapping 
     ( 
     ): 
     
      
     """索引映射""" 
     
     conn  
     = ES 
     ( 
     '127.0.0.1:9200' 
     ) 
     
     conn. 
     debug_dump  
     =  
     True 
     
      
     try: 
     
          
     #删除索引 
     
         conn. 
     delete_index 
     ( 
     "test-index" 
     ) 
     
      
     except: 
     
          
     pass 
     
      
     #创建索引 
     
     conn. 
     create_index 
     ( 
     "test-index" 
     ) 
     
     mapping  
     =  
     { 
     
            u 
     'id':  
     { 
     'store':  
     'yes' 
     , 
     
                      
     'type': u 
     'integer' 
     } 
     , 
     
            u 
     'author':  
     { 
     'boost':  
     1.0 
     , 
     
                         
     'index':  
     'not_analyzed' 
     , 
     
                         
     'store':  
     'yes' 
     , 
     
                         
     'type': u 
     'string' 
     } 
     , 
     
            u 
     'published':  
     { 
     'boost':  
     1.0 
     , 
     
                            
     'index':  
     'not_analyzed' 
     , 
     
                            
     'store':  
     'yes' 
     , 
     
                            
     'type': u 
     'datetime' 
     } 
     , 
     
            u 
     'url':  
     { 
     'store':  
     'yes' 
     , 
     
                      
     'type': u 
     'string' 
     } 
     , 
     
            u 
     'title':  
     { 
     'boost':  
     1.0 
     , 
     
                         
     'index':  
     'analyzed' 
     , 
     
                         
     'store':  
     'yes' 
     , 
     
                         
     'type': u 
     'string' 
     } 
     , 
     
            u 
     'content':  
     { 
     'boost':  
     1.0 
     , 
     
                         
     'index':  
     'analyzed' 
     , 
     
                         
     'store':  
     'yes' 
     , 
     
                         
     'type': u 
     'string' 
     , 
     
                         
     "term_vector" : 
     "with_positions_offsets" 
     } 
     
             
     } 
     
      
     #索引映射 
     
     conn. 
     put_mapping 
     ( 
     "test-type" 
     ,  
     { 
     'properties':mapping 
     } 
     , 
     [ 
     "test-index" 
     ] 
     ) 
     
      
     return  
     "索引映射" 
    

4.索引

 
     #索引 
     
 $ curl  
     -XPUT http: 
     //localhost: 
     9200 
     /test-index 
     /test-type 
     / 
     1  
     -d  
     '{
     "user": "kimchy",
     "post_date": "2009-11-15T13:12:00",
     "message": "Trying out elasticsearch, so far so good?"
 }' 
     
 
     
 
     #获取 
     
 $ curl  
     -XGET http: 
     //localhost: 
     9200 
     /test-index 
     /test-type 
     / 
     1 
     
 
     
 
     #删除 
     
 $ curl  
     -XDELETE  
     'http://localhost:9200/test-index/test-type/1' 
    

 
     @route 
     ( 
     '/indextest/' 
     ) 
     
 
     def indexTest 
     ( 
     ): 
     
      
     """索引测试""" 
     
     conn  
     = ES 
     ( 
     '127.0.0.1:9200' 
     ) 
     
      
     for item  
     in Data 
     ( 
     ). 
     getData 
     ( 
     ): 
     
          
     #添加索引 
     
         conn. 
     index 
     (item 
     , 
     "test-index" 
     ,  
     "test-type" 
     ,item 
     [ 
     'id' 
     ] 
     ) 
     
 
     
      
     #索引优化 
     
     conn. 
     optimize 
     ( 
     [ 
     "test-index" 
     ] 
     ) 
     
      
     #删除索引内容 
     
     conn. 
     delete 
     ( 
     "test-index" 
     ,  
     "test-type" 
     ,  
     2668090 
     ) 
     
      
     #更新索引内容 
     
     model  
     = conn. 
     get 
     ( 
     "test-index" 
     ,  
     "test-type" 
     ,  
     2667371 
     ) 
     
     model 
     [ 
     "title" 
     ] 
     = 
     "标题修改测试" 
     
     conn. 
     update 
     (model 
     , 
     "test-index" 
     ,  
     "test-type" 
     , 
     2667371 
     ) 
     
 
     
      
     #刷新索引 
     
     conn. 
     refresh 
     ( 
     [ 
     "test-index" 
     ] 
     ) 
     
 
     
     q  
     = MatchAllQuery 
     ( 
     ) 
     
     results  
     = conn. 
     search 
     (query  
     = q 
     ,indices 
     = 
     "test-index" 
     ,doc_types 
     = 
     "test-type" 
     ) 
     
 
     #    for r in results: 
     
 
     #        print r 
     
      
     return template 
     ( 
     'default.tpl' 
     , 
     list 
     =results 
     ,count 
     = 
     len 
     (results 
     ) 
     ) 
    

5.搜索

 
     #lucene语法方式的查询 
     
 $ curl  
     -XGET http: 
     //localhost: 
     9200 
     /test-index 
     /test-type 
     /_search? 
     q=user:kimchy 
     
 
     
 
     #query DSL方式查询 
     
 $ curl  
     -XGET http: 
     //localhost: 
     9200 
     /test-index 
     /test-type 
     /_search 
     -d  
     '{
     "query" : {
         "term" : { "user": "kimchy" }
     }
 }' 
     
 
     
 
     #query DSL方式查询 
     
 $ curl  
     -XGET http: 
     //localhost: 
     9200 
     /test-index 
     /_search? 
     pretty= 
     true  
     -d  
     '{
     "query" : {
         "range" : {
             "post_date" : {
                 "from" : "2009-11-15T13:00:00",
                 "to" : "2009-11-15T14:30:00"
             }
         }
     }
 }' 
     
 
     
 
     #查找全部索引内容 
     
 $ curl  
     -XGET http: 
     //localhost: 
     9200 
     /test-index 
     /test-type 
     /_search? 
     pretty= 
     true 
    

 
     @route 
     ( 
     '/search/' 
     ) 
     
 
     @route 
     ( 
     '/search/<searchkey>' 
     ) 
     
 
     def search 
     (searchkey 
     =u 
     "关键算法" 
     ): 
     
      
     """索引搜索""" 
     
     conn  
     = ES 
     ( 
     '127.0.0.1:9200' 
     ) 
     
 
     
      
     #TextQuery会对searchkey进行分词 
     
     qtitle  
     = TextQuery 
     ( 
     "title" 
     , searchkey 
     ) 
     
     qcontent  
     = TextQuery 
     ( 
     "content" 
     , searchkey 
     ) 
     
      
     #发布时间大于"2012-9-2 22:00:00" 
     
     qpublished 
     =RangeQuery 
     (ESRangeOp 
     ( 
     "published" 
     ,  
     "gt" 
     , 
     datetime 
     ( 
     2012 
     ,  
     9 
     ,  
     2 
     ,  
     22 
     ,  
     0 
     ,  
     0 
     ) 
     ) 
     ) 
     
 
     
     h  
     = HighLighter 
     ( 
     [ 
     '<b>' 
     ] 
     ,  
     [ 
     '</b>' 
     ] 
     , fragment_size 
     = 
     500 
     ) 
     
      
     #多字段搜索(must=>and,should=>or)，高亮，结果截取（分页），排序 
     
     q  
     =Search 
     (BoolQuery 
     (must 
     = 
     [qpublished 
     ] 
     ,should 
     = 
     [qtitle 
     ,qcontent 
     ] 
     ) 
     ,highlight 
     =h 
     , start 
     = 
     0 
     , size 
     = 
     3 
     , sort 
     = 
     { 
     'id':  
     { 
     'order':  
     'asc' 
     } 
     } 
     ) 
     
     q. 
     add_highlight 
     ( 
     "title" 
     ) 
     
     q. 
     add_highlight 
     ( 
     "content" 
     ) 
     
     results  
     = conn. 
     search 
     (query  
     = q 
     ,indices 
     = 
     "test-index" 
     ,doc_types 
     = 
     "test-type" 
     ) 
     
 
     
      
     list 
     = 
     [ 
     ] 
     
      
     for r  
     in results: 
     
          
     if 
     (r._meta. 
     highlight. 
     has_key 
     ( 
     "title" 
     ) 
     ): 
     
             r 
     [ 
     'title' 
     ] 
     =r._meta. 
     highlight 
     [u 
     "title" 
     ] 
     [ 
     0 
     ] 
     
          
     if 
     (r._meta. 
     highlight. 
     has_key 
     ( 
     "content" 
     ) 
     ): 
     
             r 
     [ 
     'content' 
     ] 
     =r._meta. 
     highlight 
     [u 
     "content" 
     ] 
     [ 
     0 
     ] 
     
          
     list. 
     append 
     (r 
     ) 
     
      
     return template 
     ( 
     'search.tpl' 
     , 
     list 
     = 
     list 
     ,count 
     =results. 
     total 
     ) 
     
 
     </searchkey 
     > 
    

6.设置

 
     #创建索引，并设置分片和副本参数 
     
 $ curl  
     -XPUT http: 
     //localhost: 
     9200 
     /elasticsearch 
     /  
     -d  
     '{
     "settings" : {
         "number_of_shards" : 2,
         "number_of_replicas" : 3
     }
 }' 
    

7.其他

 
     #分词 
     
 curl  
     -XGET  
     'http://localhost:9200/test-index/_analyze?text=中华人民共和国'

例子程序下载

来源：http://www.qwolf.com/?p=1387

看了这篇文章的人还看了：

分布式搜索elasticsearch java API 之（五）------搜索

elasticsearch aggregations java api使用示例（分组统计）类似facet

elasticsearch aggregations java api使用示例（分组统计)二

elasticsearch 0.9.0.3 suggest使用详解、搜索提示用法详解（二）java api调用示例

分布式搜索elasticsearch java API 之（一）------与集群交互

分布式搜索elasticsearch java API 之（二）------put Mapping定义索引字段属性

分布式搜索elasticsearch java API 之（三）------索引数据

分布式搜索elasticsearch java API 之（四）------删除索引数据

分布式搜索elasticsearch java API 之（六）------批量添加删除索引

[隐藏]

ElasticSearch 是构建在Apache Lucene之上的的搜索引擎服务，开源（Apache2协议），分布式，RESTful。安装方便，使用简单。

官方站点：http://www.elasticsearch.com/
中文站点：http://es-cn.medcl.net/

1.安装

必须先安装Java环境，并设置 JAVA_HOME => C:\Program Files\Java\jdk1.6.0_18

elasticsearch-rtf 中文入门集成包 https://github.com/medcl/elasticsearch-rtf
使用git签出，下载到本地。windows下，执行bin下面的elasticsearch.bat。linux下，执行bin下面或者service下面elasticsearch。

Pyes https://github.com/aparo/pyes 更多客户端

Bottle http://bottlepy.org/docs/dev/

2.角色关系对照

elasticsearch 跟 MySQL 中定义资料格式的角色关系对照表如下

MySQL elasticsearch
database index
table type

table schema mapping
row document
field field

3.索引映射

 
      #创建索引 
      
 $ curl  
      -XPUT http: 
      //localhost: 
      9200 
      /test-index 
      
 
      
 
      #创建Mapping 
      
 $ curl  
      -XPUT http: 
      //localhost: 
      9200 
      /test-index 
      /test-type 
      /_mapping  
      -d  
      '{
     "properties" : {
         "name" : { "type" : "string" }
     }
 }' 
     

 
      @route 
      ( 
      '/indexsetting/' 
      ) 
      
 
      def indexmapping 
      ( 
      ): 
      
      
      """索引映射""" 
      
     conn  
      = ES 
      ( 
      '127.0.0.1:9200' 
      ) 
      
     conn. 
      debug_dump  
      =  
      True 
      
      
      try: 
      
          
      #删除索引 
      
         conn. 
      delete_index 
      ( 
      "test-index" 
      ) 
      
      
      except: 
      
          
      pass 
      
      
      #创建索引 
      
     conn. 
      create_index 
      ( 
      "test-index" 
      ) 
      
     mapping  
      =  
      { 
      
            u 
      'id':  
      { 
      'store':  
      'yes' 
      , 
      
                      
      'type': u 
      'integer' 
      } 
      , 
      
            u 
      'author':  
      { 
      'boost':  
      1.0 
      , 
      
                         
      'index':  
      'not_analyzed' 
      , 
      
                         
      'store':  
      'yes' 
      , 
      
                         
      'type': u 
      'string' 
      } 
      , 
      
            u 
      'published':  
      { 
      'boost':  
      1.0 
      , 
      
                            
      'index':  
      'not_analyzed' 
      , 
      
                            
      'store':  
      'yes' 
      , 
      
                            
      'type': u 
      'datetime' 
      } 
      , 
      
            u 
      'url':  
      { 
      'store':  
      'yes' 
      , 
      
                      
      'type': u 
      'string' 
      } 
      , 
      
            u 
      'title':  
      { 
      'boost':  
      1.0 
      , 
      
                         
      'index':  
      'analyzed' 
      , 
      
                         
      'store':  
      'yes' 
      , 
      
                         
      'type': u 
      'string' 
      } 
      , 
      
            u 
      'content':  
      { 
      'boost':  
      1.0 
      , 
      
                         
      'index':  
      'analyzed' 
      , 
      
                         
      'store':  
      'yes' 
      , 
      
                         
      'type': u 
      'string' 
      , 
      
                         
      "term_vector" : 
      "with_positions_offsets" 
      } 
      
             
      } 
      
      
      #索引映射 
      
     conn. 
      put_mapping 
      ( 
      "test-type" 
      ,  
      { 
      'properties':mapping 
      } 
      , 
      [ 
      "test-index" 
      ] 
      ) 
      
      
      return  
      "索引映射" 
     

4.索引

 
      #索引 
      
 $ curl  
      -XPUT http: 
      //localhost: 
      9200 
      /test-index 
      /test-type 
      / 
      1  
      -d  
      '{
     "user": "kimchy",
     "post_date": "2009-11-15T13:12:00",
     "message": "Trying out elasticsearch, so far so good?"
 }' 
      
 
      
 
      #获取 
      
 $ curl  
      -XGET http: 
      //localhost: 
      9200 
      /test-index 
      /test-type 
      / 
      1 
      
 
      
 
      #删除 
      
 $ curl  
      -XDELETE  
      'http://localhost:9200/test-index/test-type/1' 
     

 
      @route 
      ( 
      '/indextest/' 
      ) 
      
 
      def indexTest 
      ( 
      ): 
      
      
      """索引测试""" 
      
     conn  
      = ES 
      ( 
      '127.0.0.1:9200' 
      ) 
      
      
      for item  
      in Data 
      ( 
      ). 
      getData 
      ( 
      ): 
      
          
      #添加索引 
      
         conn. 
      index 
      (item 
      , 
      "test-index" 
      ,  
      "test-type" 
      ,item 
      [ 
      'id' 
      ] 
      ) 
      
 
      
      
      #索引优化 
      
     conn. 
      optimize 
      ( 
      [ 
      "test-index" 
      ] 
      ) 
      
      
      #删除索引内容 
      
     conn. 
      delete 
      ( 
      "test-index" 
      ,  
      "test-type" 
      ,  
      2668090 
      ) 
      
      
      #更新索引内容 
      
     model  
      = conn. 
      get 
      ( 
      "test-index" 
      ,  
      "test-type" 
      ,  
      2667371 
      ) 
      
     model 
      [ 
      "title" 
      ] 
      = 
      "标题修改测试" 
      
     conn. 
      update 
      (model 
      , 
      "test-index" 
      ,  
      "test-type" 
      , 
      2667371 
      ) 
      
 
      
      
      #刷新索引 
      
     conn. 
      refresh 
      ( 
      [ 
      "test-index" 
      ] 
      ) 
      
 
      
     q  
      = MatchAllQuery 
      ( 
      ) 
      
     results  
      = conn. 
      search 
      (query  
      = q 
      ,indices 
      = 
      "test-index" 
      ,doc_types 
      = 
      "test-type" 
      ) 
      
 
      #    for r in results: 
      
 
      #        print r 
      
      
      return template 
      ( 
      'default.tpl' 
      , 
      list 
      =results 
      ,count 
      = 
      len 
      (results 
      ) 
      ) 
     

5.搜索

 
      #lucene语法方式的查询 
      
 $ curl  
      -XGET http: 
      //localhost: 
      9200 
      /test-index 
      /test-type 
      /_search? 
      q=user:kimchy 
      
 
      
 
      #query DSL方式查询 
      
 $ curl  
      -XGET http: 
      //localhost: 
      9200 
      /test-index 
      /test-type 
      /_search 
      -d  
      '{
     "query" : {
         "term" : { "user": "kimchy" }
     }
 }' 
      
 
      
 
      #query DSL方式查询 
      
 $ curl  
      -XGET http: 
      //localhost: 
      9200 
      /test-index 
      /_search? 
      pretty= 
      true  
      -d  
      '{
     "query" : {
         "range" : {
             "post_date" : {
                 "from" : "2009-11-15T13:00:00",
                 "to" : "2009-11-15T14:30:00"
             }
         }
     }
 }' 
      
 
      
 
      #查找全部索引内容 
      
 $ curl  
      -XGET http: 
      //localhost: 
      9200 
      /test-index 
      /test-type 
      /_search? 
      pretty= 
      true 
     

 
      @route 
      ( 
      '/search/' 
      ) 
      
 
      @route 
      ( 
      '/search/<searchkey>' 
      ) 
      
 
      def search 
      (searchkey 
      =u 
      "关键算法" 
      ): 
      
      
      """索引搜索""" 
      
     conn  
      = ES 
      ( 
      '127.0.0.1:9200' 
      ) 
      
 
      
      
      #TextQuery会对searchkey进行分词 
      
     qtitle  
      = TextQuery 
      ( 
      "title" 
      , searchkey 
      ) 
      
     qcontent  
      = TextQuery 
      ( 
      "content" 
      , searchkey 
      ) 
      
      
      #发布时间大于"2012-9-2 22:00:00" 
      
     qpublished 
      =RangeQuery 
      (ESRangeOp 
      ( 
      "published" 
      ,  
      "gt" 
      , 
      datetime 
      ( 
      2012 
      ,  
      9 
      ,  
      2 
      ,  
      22 
      ,  
      0 
      ,  
      0 
      ) 
      ) 
      ) 
      
 
      
     h  
      = HighLighter 
      ( 
      [ 
      '<b>' 
      ] 
      ,  
      [ 
      '</b>' 
      ] 
      , fragment_size 
      = 
      500 
      ) 
      
      
      #多字段搜索(must=>and,should=>or)，高亮，结果截取（分页），排序 
      
     q  
      =Search 
      (BoolQuery 
      (must 
      = 
      [qpublished 
      ] 
      ,should 
      = 
      [qtitle 
      ,qcontent 
      ] 
      ) 
      ,highlight 
      =h 
      , start 
      = 
      0 
      , size 
      = 
      3 
      , sort 
      = 
      { 
      'id':  
      { 
      'order':  
      'asc' 
      } 
      } 
      ) 
      
     q. 
      add_highlight 
      ( 
      "title" 
      ) 
      
     q. 
      add_highlight 
      ( 
      "content" 
      ) 
      
     results  
      = conn. 
      search 
      (query  
      = q 
      ,indices 
      = 
      "test-index" 
      ,doc_types 
      = 
      "test-type" 
      ) 
      
 
      
      
      list 
      = 
      [ 
      ] 
      
      
      for r  
      in results: 
      
          
      if 
      (r._meta. 
      highlight. 
      has_key 
      ( 
      "title" 
      ) 
      ): 
      
             r 
      [ 
      'title' 
      ] 
      =r._meta. 
      highlight 
      [u 
      "title" 
      ] 
      [ 
      0 
      ] 
      
          
      if 
      (r._meta. 
      highlight. 
      has_key 
      ( 
      "content" 
      ) 
      ): 
      
             r 
      [ 
      'content' 
      ] 
      =r._meta. 
      highlight 
      [u 
      "content" 
      ] 
      [ 
      0 
      ] 
      
          
      list. 
      append 
      (r 
      ) 
      
      
      return template 
      ( 
      'search.tpl' 
      , 
      list 
      = 
      list 
      ,count 
      =results. 
      total 
      ) 
      
 
      </searchkey 
      > 
     

6.设置

 
      #创建索引，并设置分片和副本参数 
      
 $ curl  
      -XPUT http: 
      //localhost: 
      9200 
      /elasticsearch 
      /  
      -d  
      '{
     "settings" : {
         "number_of_shards" : 2,
         "number_of_replicas" : 3
     }
 }' 
     

7.其他

 
      #分词 
      
 curl  
      -XGET  
      'http://localhost:9200/test-index/_analyze?text=中华人民共和国'

例子程序下载

来源：http://www.qwolf.com/?p=1387

看了这篇文章的人还看了：

分布式搜索elasticsearch java API 之（五）------搜索

elasticsearch aggregations java api使用示例（分组统计）类似facet

elasticsearch aggregations java api使用示例（分组统计)二

elasticsearch 0.9.0.3 suggest使用详解、搜索提示用法详解（二）java api调用示例

分布式搜索elasticsearch java API 之（一）------与集群交互

分布式搜索elasticsearch java API 之（二）------put Mapping定义索引字段属性

分布式搜索elasticsearch java API 之（三）------索引数据

分布式搜索elasticsearch java API 之（四）------删除索引数据

分布式搜索elasticsearch java API 之（六）------批量添加删除索引

gaohe7091

关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
ElasticSearch入门笔记

出处：http://www.nosqldb.cn/1368777378160.html
复制链接

扫一扫

专栏目录

ElasticSearch入门笔记

1.安装

2.角色关系对照

3.索引映射

4.索引

5.搜索

6.设置

7.其他

1.安装

2.角色关系对照

3.索引映射

4.索引

5.搜索

6.设置

7.其他

“相关推荐”对你有帮助么？