自定义博客皮肤VIP专享

*博客头图:

格式为PNG、JPG,宽度*高度大于1920*100像素,不超过2MB,主视觉建议放在右侧,请参照线上博客头图

请上传大于1920*100像素的图片!

博客底图:

图片格式为PNG、JPG,不超过1MB,可上下左右平铺至整个背景

栏目图:

图片格式为PNG、JPG,图片宽度*高度为300*38像素,不超过0.5MB

主标题颜色:

RGB颜色,例如:#AFAFAF

Hover:

RGB颜色,例如:#AFAFAF

副标题颜色:

RGB颜色,例如:#AFAFAF

自定义博客皮肤

-+

用思想、知识和音乐去影响和改变

  • 博客(11)
  • 资源 (6)
  • 收藏
  • 关注

转载 Record linkage

Record linkage (RL) refers to the task of finding records in a data set that refer to the same entity across different data sources (e.g., data files, books, websites, databases). Record linkage i

2016-01-25 22:29:37 2577

转载 entity resolution

StanfordEntityResolution FrameworkOverviewPapersSoftwarePeopleNewsJan. 2012: Our paper on Pay-As-You-Go ER [11] has been accepted to the IEEE Transact

2016-01-24 22:52:59 1591

转载 穷人的语义处理工具箱之一:语义版Jaccard

|为什么我们是ML界的穷人如果对工业界里的机器学习(ML)从业者进行阶级划分的话,划线标准不是你用的算法的学名听起来多酷炫,无论你手里抡着的是最潮的深度高达1000层的深度学习核炸弹,还是听起来有点掉渣的大刀长矛朴素贝叶斯,如果没有大量数据,尤其是能跑监督学习算法的带标签训练数据,你就是ML界标准的底层渣男渣女或者渣娘炮。再加上计算资源,如果贵公司有上千台GPU服务器集群可供阁

2016-01-13 00:01:13 591 1

转载 Dubbo 简介

Dubbo是一个分布式服务框架,致力于提供高性能和透明化的RPC远程服务调用方案,以及SOA服务治理方案。简单的说,dubbo就是个服务框架,如果没有分布式的需求,其实是不需要用的,只有在分布式的时候,才有dubbo这样的分布式服务框架的需求,并且本质上是个服务调用的东东,说白了就是个远程服务调用的分布式框架(告别Web Service模式中的WSdl,以服务者与消费者的方式在dubbo上注册

2016-01-10 21:52:03 709

转载 中文文本处理简要介绍

本文作者李绳,博客地址 http://acepor.github.io/。作者自述:一位文科生曾励志成为语言学家出国后阴差阳错成了博士候选人三年后交完论文对学术彻底失望回国后误打误撞成了数据科学家作为一个处理自然语言数据的团队,我们在日常工作中要用到不同的工具来预处理中文文本,比如 Jieba 和 Stanford NLP software。出于准确性和效率的考

2016-01-08 15:31:27 619

转载 实战中的 Promise 和 Future

上一章介绍了 Future 类型,以及如何用它来编写高可读性、高组合性的异步执行代码。Future 只是整个谜团的一部分: 它是一个只读类型,允许你使用它计算得到的值,或者处理计算中出现的错误。 但是在这之前,必须得有一种方法把这个值放进去。 这一章里,你将会看到如何通过 Promise 类型来达到这个目的。类型 Promise之前,我们把一段顺序执行的代码块传递给了 scala.co

2016-01-06 20:04:53 1297

转载 sbt 编译spark 的wordcount 程序

直接执行 sbt 会在当前目录下面创建 target 目录sbt 的目录格局一般为 lib/ (该目录下存储与编译相关的 jar 文件) project/ src/main/scala/ src/main/test/scala复制 jar 文件 spark-assembly *hadoop2.5.1.jar 到 lib 目录下[root@localhost wo

2016-01-06 19:34:51 939

转载 SBT-Simple Build Tool入门

sbt的源文件目录结构sbt的命令行模式配置文件 .sbt.sbt vs .scala 构建语句定义什么是构建语句?如何定义 build.sbt 配置项配置项之间必须用空行分割Keys类型内建 Keys自定义 Keys任务Keys 和 配置 Keys定义 task 和 settings任务和配置项的类型Keys 在 sbt 的交互模式在

2016-01-06 19:30:21 903

转载 hive tez-use

文章目录1. DAG task git获取源码 一、安装必要软件1. 1、安装java2. 2、安装apache maven3. 3、Protocol Buffers 2.5.04. 4、编译tez5. github获取某个release版本6. 5、 change hadoop-version为你的版本6.1. Build tez 二、tez install

2016-01-06 00:32:25 2945

转载 Apache Tez:一个运行在YARN之上支持DAG作业的计算框架

【Apache Tez是什么?】Tez(Hortonworks介绍:这里,源代码下载:这里,今天刚刚发布的源代码)是Apache最新开源的支持DAG作业的计算框架,它直接源于MapReduce框架,核心思想是将Map和Reduce两个操作进一步拆分,即Map被拆分成Input、Processor、Sort、Merge和Output, Reduce被拆分成Input、Shuffl

2016-01-05 22:19:17 2239

转载 MySQL 去除字段中的换行和回车符

UPDATE SMS_PROVINCE SET PROVINCENAME = REPLACE(REPLACE(PROVINCENAME, CHAR(10),''), CHAR(13),'');环境: RHEL5+MySQL解决方法:UPDATE tablename SET  field = REPLACE(REPLACE(field, CHAR(10), ”), CHAR(

2016-01-05 20:29:35 3294

httpclient tutorial httpclient 指南

httpclient 指南 包括了详细的调用和常用代码 The Hyper-Text Transfer Protocol (HTTP) is perhaps the most significant protocol used on the Internet today. Web services, network-enabled appliances and the growth of network computing continue to expand the role of the HTTP protocol beyond user-driven web browsers, while increasing the number of applications that require HTTP support. Although the java.net package provides basic functionality for accessing resources via HTTP, it doesn't provide the full flexibility or functionality needed by many applications. HttpClient seeks to fill this void by providing an efficient, up-to-date, and feature-rich package implementing the client side of the most recent HTTP standards and recommendations. Designed for extension while providing robust support for the base HTTP protocol, HttpClient may be of interest to anyone building HTTP-aware client applications such as web browsers, web service clients, or systems that leverage or extend the HTTP protocol for distributed communication.

2018-03-08

mask rcnn paper

We present a conceptually simple, flexible, and general framework for object instance segmentation. Our approach efficiently detects objects in an image while simultaneously generating a high-quality segmentation mask for each instance. The method, called Mask R-CNN, extends Faster R-CNN by adding a branch for predicting an object mask in parallel with the existing branch for bounding box recognition. Mask R-CNN is simple to train and adds only a small overhead to Faster R-CNN, running at 5 fps. Moreover, Mask R-CNN is easy to generalize to other tasks, e.g., allowing us to estimate human poses in the same framework. We show top results in all three tracks of the COCO suite of challenges, including instance segmentation, bounding-box object detection, and person keypoint detection. Without tricks, Mask R-CNN outperforms all existing, single-model entries on every task, including the COCO 2016 challenge winners. We hope our simple and effective approach will serve as a solid baseline and help ease future research in instance-level recognition. Code will be made available.

2018-03-07

Applying Deep Learning To Answer Selection

Applying Deep Learning To Answer Selection- A Study And An Open Task

2018-03-07

Learning Phrase Representations using RNN Encoder–Decoder

Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation

2018-03-07

BPTT BackPropagation Through Time.pdf

BPTT paper This report provides detailed description and necessary derivations for the BackPropagation Through Time (BPTT) algorithm. BPTT is often used to learn recurrent neural networks (RNN). Contrary to feed-forward neural networks, the RNN is characterized by the ability of encoding longer past information, thus very suitable for sequential models. The BPTT extends the ordinary BP algorithm to suit the recurrent neural architecture.

2018-03-07

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

提示
确定要删除当前文章?
取消 删除