Commonly Used Third-Party Open-Source Libraries (thirdparty/common)

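Relying on third-party open-source libraries is routine in both work and study. We all know the phrase "standing on the shoulders of giants", and there is good reason to admire what open source and sharing have made possible.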

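Partly to take stock, and partly because good things are worth sharing, I maintain a page on GitHub, https://github.com/fandywang/thirdparty_intro, that lists the third-party libraries I follow most closely. The list below is updated continuously: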

Google open-source libraries

  • zh-google-styleguide - Chinese translation of the Google open-source project style guides.
  • protobuf - Protocol Buffers - Google's data interchange format.
  • gflags - Commandline flags module for C++.
  • glog - Logging library for C++.
  • gtest - Google C++ Testing Framework.
  • googlemock - Google C++ Mocking Framework.
  • leveldb - A fast and lightweight key/value database library by Google.
  • cpy-leveldb - Python bindings for LevelDB using the LevelDB C API.
  • The Chromium Projects - The Chromium projects include Chromium and Chromium OS, the open-source projects behind the Google Chrome browser and Google Chrome OS, respectively.

C++ base libraries

  • toft - C++ Base Library for Linux server side development, by chen3feng.
  • thirdparty - Put thirdparty library here for toft and foxy.
  • folly - Folly is an open-source C++ library developed and used at Facebook.

Algorithms and data structures

  • darts-clone - A clone of the Darts (Double-ARray Trie System).
  • Darts - Double-ARray Trie System. A Chinese translation of the documentation is also available.
  • sparsehash - An extremely memory-efficient hash_map implementation.
  • cityhash - The CityHash family of hash functions.
  • stringencoders - A collection of high performance c-string transformations, frequently 2x faster than standard implementations (if they exist at all).
  • Numpy - NumPy is the fundamental package for scientific computing with Python.

Natural language processing libraries

  • NLTK - NLTK -- the Natural Language Toolkit -- is a suite of open source Python modules, data sets and tutorials supporting research and development in Natural Language Processing. NLTK Book
  • jieba - "Jieba" Chinese word segmentation.
  • gensim - Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.
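  • LTP - The Language Technology Platform (LTP) is a complete, open Chinese natural language processing system developed over ten years by the Research Center for Social Computing and Information Retrieval at Harbin Institute of Technology.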
  • Stanford CoreNLP - Stanford CoreNLP provides a set of natural language analysis tools which can take raw English language text input and give the base forms of words, their parts of speech, whether they are names of companies, people, etc., normalize dates, times, and numeric quantities, and mark up the structure of sentences in terms of phrases and word dependencies, and indicate which noun phrases refer to the same entities.
  • openNLP - The Apache OpenNLP library is a machine learning based toolkit for the processing of natural language text.
  • SRILM - SRILM is a toolkit for building and applying statistical language models (LMs), primarily for use in speech recognition, statistical tagging and segmentation, and machine translation.
  • IRS