
无知的我 (Ignorant Me)

Get into the field first, learn how deep it goes, then decide

  • Blog (2055)
  • Resources (190)
  • Forum (28)
  • Favorites
  • Following

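Original: Graduate thesis defense matters at Beijing Institute of Technology

(Academic master's students) First you have to finish your thesis and get your advisor's approval. Review: both internal and external reviewers must be at associate professor level or above (this nearly did me in). Materials: one copy of the thesis, one review form (paper and electronic), and the defense honorarium form (have the reviewer sign it when you submit for review, especially external reviewers). The review score is not important; 80-plus is enough. As long as the review conclusion is "A: approved for defense", you are all set. Defense committee: for regular master's and full-time professional degrees, the committee must consist of five relevant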

2014-06-12 14:52:34 4339

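Original: Beijing Institute of Technology requirements on publishing academic papers for master's degree applicants

To further raise the level of graduate education at our university, ensure the quality of degrees conferred, encourage original innovation by graduate students, and guide and promote the publication in high-level academic journals of papers reflecting the results of their thesis work, the following rules on publishing academic papers are set out for graduate students: I. Basic requirements for publishing academic papers when applying for a degree. 1. Applicants for doctoral and master's degrees must meet the paper publication requirements of their discipline for doctoral and master's applicants; see Attachments 1 and 2 for details. 2. Publishing one paper in SCIENCE or NATURE is sufficient to meet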

2013-11-05 17:50:02 6390

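Repost: Several problems IT guys tend to run into

While chatting with a senior person in IT, he listed the problems IT guys most easily run into, and looking around me I found he was right: 1. Easily ending up a long-term bachelor (plenty of people around me are well into their thirties without even a girlfriend). Get out more and make more friends. Don't stay holed up at home on weekends or holidays. Even if you do stay in, learn how to meet women through social apps. 2. Hair thinning at a young age and eventually going bald (this one is scary). This is fairly rare; stay up late less and keep a healthier routine and it generally won't happen. 3. Easily gaining weight and getting a beer belly. This is mainly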

2013-10-12 10:54:25 3135

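Repost: A programmer's guide to leveling up technical skills

Source: http://coolshell.cn/articles/4990.html On June 12, 月光博客 published 《写给新手程序员的一封信》 ("A letter to novice programmers"), translated from "An open letter to those who want to start programming". My friend (his ID on this site is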

2011-07-18 16:10:10 2864 5

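Repost: My views on pursuing a graduate degree in computer science

Based on my own understanding, here are my views on graduate study and on schools of software; if anything is off, just laugh it off. If you have real development experience, feel your skills have hit a plateau, and urgently need to improve on the theory side, then the School of Computer Science is the only choice, because only it can take you to the next level in theory. The School of Software's curriculum simply is not designed to lead you in that direction. Of course, whether you actually reach the next level still depends entirely on you. Note that work experience does not necessarily equal development experience; I have met many people who have worked for 2-3 years yet have no development experience at all. You say: "They all have strong development skills, they just don't much like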

2011-05-05 21:39:00 2949 8

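Original: Zynq-Linux porting study notes 47: mounting a file system on SPI flash in the PL

1. Background: On the ZYNQ, the PL uses AXI_QUAD_SPI IP cores to attach 8 external SPI flash chips. The VIVADO design is shown below: The colored lines in the figure above are the IP cores' interrupts routed to the ZYNQ PL-PS interrupts; because there are so many devices, interrupt cascading is used here. The IP core address assignment is as follows: 2. Kernel configuration: The following kernel file needs to be modified to add the compatible SPI flash part, here the n25q256. The complete code is as follows: /* * MTD SPI driver for ST M25Pxx (and sim...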

2021-01-12 13:45:03 79

Original: A. Bovine Dilemma

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Argus was charged with guarding Io, which is not an ordinary cow. Io is quite an explorer, and she wanders off rather frequently, making Ar

2021-01-02 12:25:53 73

Original: A. Robot Program

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. There is an infinite 2-dimensional grid. The robot stands in cell (0,0) and wants to reach cell (x,y). Here is a list of possibl...

2021-01-02 10:48:15 41

Original: A. Subtract or Divide

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Ridbit starts with an integer n. In one move, he can perform one of the following operations: divide n by one of its proper divisors, o...

2021-01-02 10:20:54 49

Original: A. Favorite Sequence

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. Polycarp has a favorite sequence a[1…n] consisting of n integers. He wrote it out on the whiteboard as follows: he wrote the number a1 to the left sid

2020-12-20 16:10:08 63

Original: A. Strange Functions

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. Let’s define a function f(x) (x is a positive integer) as follows: write all digits of the decimal representation of x backwards, then get rid of the

2020-12-20 16:07:10 30

Original: B. Unique Bid Auction

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. There is a game called “Unique Bid Auction”. You can read more about it here: https://en.wikipedia.org/wiki/Unique_bid_auction (though you don’t have t

2020-12-20 16:03:33 35

Original: Intel NIC firmware flashing tool for Linux

eeupdate64e can be used to update Intel NIC firmware. Tool download: https://download.csdn.net/download/jj12345jj198999/13651571 The commands for programming the MAC addresses are as follows: ./eeupdate64e /nic=1 /mac=00a0c907YY01 ./eeupdate64e /nic=2 /mac=00a0c910YY01 ./eeupdate64e /nic=3 /mac=00a0c910YY02 ./eeupdate64e /nic=4 /mac=00

2020-12-11 19:38:29 156

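Original: Accessing a CPLD over LPC under linux

For background, see: https://blog.csdn.net/weixin_44056231/article/details/84954521 1. On linux, list the LPC device name with the command lspci -nn and note the value circled in red; the figure below is an illustration. 2. Modify the value at the corresponding position in the driver: change the part in red to match the value listed in the figure above. 3. Configure the cpld address: the x86 CPU connects to the cpld over lpc, so an address must be assigned to identify the cpld device. Address 0x378 is used here; it has to match the lpc registers before it can be used; you can...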

2020-12-08 15:12:26 125

Original: A. Special Permutation

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. You are given one integer n (n>1). Recall that a permutation of length n is an array consisting of n distinct integers from 1...

2020-11-29 21:31:24 72

Original: A. Marketing Scheme

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. You got a job as a marketer in a pet shop, and your current task is to boost sales of cat food. One of the strategies is to sell cans of fo

2020-11-22 21:08:58 49

Original: A. Buying Torches

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. You are playing a very popular game called Cubecraft. Initially, you have one stick and want to craft k torches. One torch can be crafted ..

2020-11-22 20:51:25 43

Original: A. XORwice

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. In order to celebrate Twice's 5th anniversary, Tzuyu and Sana decided to play a game. Tzuyu gave Sana two integers a and b and a really ...

2020-11-22 20:44:53 42

Original: B. Symmetric Matrix

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Masha has n types of tiles of size 2×2. Each cell of the tile contains one integer. Masha has an infinite number of tiles of each type....

2020-11-17 21:14:57 123

Original: A. Digit Game

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Everyone knows that agents in Valorant decide, who will play as attackers, and who will play as defenders. To do that Raze and Breach decid

2020-11-17 20:52:18 35

Original: A. Fence

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Yura is tasked to build a closed fence in shape of an arbitrary non-degenerate simple quadrilateral. He's already got three straight fence

2020-11-17 20:44:32 51

Original: A. Reorder

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. For a given array a consisting of n integers and a given integer m find if it is possible to reorder elements of the array a in such a ...

2020-11-17 20:39:35 53

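Original: Zynq-Linux porting study notes 46: porting the I2C driver for optical modules

1. Background: Our boards recently started using optical modules from 中航光电. The documentation shows that these modules can all report status and be switched on and off over I2C, as described below. Note in particular that all the optical modules share the same I2C read/write address, so they cannot sit on one bus; they must either be controlled separately or be switched through a switch chip. To achieve this, the zynq's I2C controllers can be used to operate the optical modules. Since the ZYNQ PS has only two I2C controllers, when there are more than 2 optical modules it is simpler to use I2C IP cores in the PL. 2. Hardware reference design: ...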

2020-11-11 19:54:04 201

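Original: Zynq-Linux porting study notes 45: reducing physical-address access time under linux

1. Background: When running linux programs on the zynq, the arm sometimes needs to fetch a value from a register of an IP core attached to the axi bus. The simplest approach is to use devmem to translate between virtual and physical addresses and then access the physical address of that register directly. If you only read a register once or twice, the cost is generally acceptable, but for something like a fifo that has to be read over a long period, the cost of doing the translation on every access becomes very noticeable. An earlier post recorded a similar optimization (suggestions on virtual-to-physical address translation); this adds a few more points. 2. Code before optimization: The code before optimization is very simple, for example GetGpioReg/SetGpioReg below, both of which ...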

2020-10-31 20:47:25 149

Original: C. Dominant Piranha

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. There are n piranhas with sizes a1,a2,…,an in the aquarium. Piranhas are numbered from left to right in order they live in the ...

2020-10-26 20:34:04 123

Original: B. Yet Another Bookshelf

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. There is a bookshelf which can fit n books. The i-th position of bookshelf is ai=1 if there is a book on this position and ai=0 o...

2020-10-26 20:02:21 86

Original: A. Boring Apartments

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. There is a building consisting of 10000 apartments numbered from 1 to 10000, inclusive. Call an apartment boring, if its nu...

2020-10-26 19:52:04 171

Original: A1. Prefix Flip (Easy Version)

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. This is the easy version of the problem. The difference between the versions is the constraint on n and the required number of operations...

2020-10-08 21:49:39 49

Original: B. Array Cancellation

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. You're given an array a of n integers, such that a1+a2+⋯+an=0. In one operation, you can choose two different indices i and...

2020-10-08 21:40:07 49

Original: 2020-10-08

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. Ashishgup and FastestFinger play a game. They start with a number n and play in turns. In each turn, a player can make any one of the fo...

2020-10-08 21:25:21 37

Original: A. Ahahahahahahahaha

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Alexandra has an even-length array a, consisting of 0s and 1s. The elements of the array are enumerated from 1 to n. She wants to remo...

2020-10-08 21:13:38 54 1

Original: A. Three Indices

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. You are given a permutation p1,p2,…,pn. Recall that sequence of n integers is called a permutation if it contains all integers ...

2020-10-08 21:06:23 65

Original: C. Mere Array

time limit per test: 2 seconds; memory limit per test: 256 megabytes; input: standard input; output: standard output. You are given an array a1,a2,…,an where all ai are integers and greater than 0. In one operation, you can choose two differe...

2020-10-08 20:35:37 43

Original: C. Yet Another Array Restoration

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. We have a secret array. You don't know this array and you have to restore it. However, you know some facts about this array: The array con

2020-10-08 20:26:19 67

Original: A. Cubes Sorting

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. For god's sake, you're boxes with legs! It is literally your only purpose! Walking onto buttons! How can you not do the one thing you were de

2020-10-08 20:14:06 71

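Original: Shell script for automatically configuring IP addresses by slot number under linux [second edition]

The board has 7 network interfaces. By default, slot number * 10 + chassis number is used as the last field of each interface's IP address, and during configuration the slot and chassis numbers are also used as the board's hostname to tell boards apart. If the board's slot and chassis are the same as in the existing configuration, nothing needs to be configured; otherwise the board is configured again. #!/bin/bash vpx_eth_name=("enp2s0f0" "enp2s0f1" "enp7s0" "enp9s0f0" "enp9s0f1" "enp9s0f2" "enp9s0f3") net_vpx_autocfg(){ echo "vpx_net_aut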
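
The addressing rule described in this excerpt (slot number * 10 + chassis number as the last field of the IP address, with reconfiguration skipped when nothing changed) can be sketched in Python as below. This is only an illustration, not the author's shell script: the `192.168.N` subnets, the hostname format, and the function names `host_octet` and `plan_config` are assumptions made for the example; only the interface names come from the excerpt.

```python
# Interface names taken from the excerpt's vpx_eth_name array.
VPX_ETH_NAMES = ["enp2s0f0", "enp2s0f1", "enp7s0",
                 "enp9s0f0", "enp9s0f1", "enp9s0f2", "enp9s0f3"]


def host_octet(slot, chassis):
    """Last field of the IP address: slot number * 10 + chassis number."""
    octet = slot * 10 + chassis
    if not 1 <= octet <= 254:
        raise ValueError(f"slot {slot}, chassis {chassis} gives unusable octet {octet}")
    return octet


def plan_config(slot, chassis, current=None):
    """Return the desired configuration, or None if it matches `current`.

    The hostname format and the 192.168.N subnets are hypothetical;
    the excerpt does not show the real values.
    """
    octet = host_octet(slot, chassis)
    plan = {
        "hostname": f"slot{slot}-chassis{chassis}",
        "addresses": {
            name: f"192.168.{index + 1}.{octet}"
            for index, name in enumerate(VPX_ETH_NAMES)
        },
    }
    # Skip reconfiguration when the board is still in the same slot and chassis.
    return None if plan == current else plan
```

Calling `plan_config(3, 2)` yields addresses ending in `.32` for all seven interfaces; passing that result back in as `current` returns `None`, which mirrors the "same slot and chassis, nothing to do" branch.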

2020-10-07 17:50:40 76

Original: A. Floor Number

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Vasya goes to visit his classmate Petya. Vasya knows that Petya's apartment number is n. There is only one entrance in Petya's house and .

2020-10-04 16:32:30 146

Original: C. A Cookie for You

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. Anna is a girl so brave that she is loved by everyone in the city and citizens love her cookies. She is planning to hold a party with cooki

2020-09-21 21:31:03 63

Original: A. Subset Mex

time limit per test: 1 second; memory limit per test: 512 megabytes; input: standard input; output: standard output. Given a set of integers (it can contain equal elements). You have to split it into two subsets A and B (both of them can contain equal e...

2020-09-21 21:21:27 96 2

Original: A. Sign Flipping

time limit per test: 1 second; memory limit per test: 256 megabytes; input: standard input; output: standard output. You are given n integers a1,a2,…,an, where n is odd. You are allowed to flip the sign of some (possibly all or none) of them. Y...

2020-09-21 20:59:34 40

OpenGL-based car walkthrough scene

An OpenGL-based car walkthrough scene. The car model is created in 3dmax and imported into the scene, and the keyboard controls the car's movement.

2012-11-06

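ast2500 development manual

The archive contains the ast2500 development manual, which mainly covers how to use the MDS tool, to help users develop ast2500 applications.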

2020-05-04

All Baidu Baike entries

All Baidu Baike entries, including the latest ones, in dic format; after extracting, open with notepad.

2013-09-24

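Linux porting materials for zynq development boards

Linux porting materials for zynq development boards, including material downloaded from the official site and documents I wrote myself.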

2016-12-20

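intel_i350 firmware flashing tool

An intel firmware flashing tool. Under linux it can program the mac address of i210 and i350 NICs and can also flash firmware. Very handy.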

2020-12-11

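Complete artificial intelligence course slides and homework

Summary of the artificial intelligence course. In the blink of an eye, the first semester of my first graduate year is almost over, and the artificial intelligence course that accompanied me all semester ended today with its final exam. Looking back on what it was like to study artificial intelligence this semester, there really is something to say. I remember first hearing the name AI in my second undergraduate year from a friend at Beihang's School of Software. He badly wanted to do AI research at Microsoft and enthusiastically told me how great the field was. At the time I had no real impression of it; I just thought it probably had to do with robots, and that the goal of AI was to build robots exactly like humans. Looking back, that idea was childish and laughable. In my third year the School of Software happened to offer the course, so I took it out of curiosity. Unfortunately the lecturer, Hu Jingjing, taught it in an extremely dry way; there was no textbook, and every class consisted of reading the PPT aloud,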

2013-01-11

The Number Sense, clear PDF edition, recommended by Liu Weipeng (刘未鹏)

The Number Sense, clear PDF edition, recommended by Liu Weipeng (刘未鹏).

2011-07-12

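mellanox NIC driver.zip

This resource is a mellanox NIC driver adapted for 银河麒麟 (Kylin). First install the dependency packages in the deb folder, then install the deb packages in kylin4.0.2.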

2020-03-10

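ACM programming problem templates and implementations of classic algorithms and data structures

ACM programming problem templates covering a variety of algorithms, sourced from the 王道 forum. Hopefully useful to those doing ACM.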

2011-03-06

Sending packets with winpcap

Title: [Original] Network protocol development with WinPCap - 炮王(超级打炮机): sending packets (03). Author: 加百力. Time: 2009-05-18, 16:55:36. Link: http://bbs.pediy.com/showthread.php?t=89175 [Article title]: Network protocol development with WinPCap - 炮王(超级打炮机): sending packets (03) [Article author]: 加百力 [Software name]: WinPCap [Download]: http://www.winpcap.org/ [Internal ID]: MASSADA 0029 [Language]: C++ [Tools]: VS2005 [Platform]: Windows

2011-04-09

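Renren "artsy youth" search engine

This software is a chrome browser extension and needs the beta or dev version of chrome to run. If you don't have one, run Chrome_16.0.912.63_XiaZaiBa.exe to install it. In other words, you need to install the beta or dev version of chrome first, and only then can you install the extension. Installing the extension: drag 文艺青年搜索引擎_V2.crx into an open chrome window and click Continue; the extension icon then appears in the upper right, and clicking it opens the engine. Usage note: using a throwaway 校内网 account is recommended, because to increase the chance that albums are visible the software automatically adds the universities being searched, which fills your news feed with a pile of "became a fan of such-and-such university" items.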

2012-01-06

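linux interrupt handler example, with driver and application

This example provides detailed interrupt-handling code, including the driver and the application. Users can call the interface directly to handle interrupts.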

2018-08-19

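Sina Weibo backup tool, exports to multiple formats

Instructions: This program is portable software; extract it and it is ready to use. First try running "SinaWeiboBackup.jar" directly. If double-clicking the jar file opens it in an archive tool instead of running it, extract it first and then try double-clicking "启动.bat" to run it. If neither method works, your computer lacks a java runtime. Download the jre from http://www.java.com/zh_CN/ and the program will run after installation. Changelog: ----------2011-05-13 V1.5---------- * Fixed page scraping failures caused by changes to Sina's pages. * Optimized some workflows ----------2011-05-02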

2011-06-05

mtd_tools for arm-linux-gnueabihf (gcc-linaro-5.2)

mtd_tools built for gcc-linaro-5.2-2015.11-2-x86_64_arm-linux-gnueabihf.tar, fully compiled; just extract the tar package to use it. Make sure your compiler matches before use.

2018-04-14

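ANSJ word segmentation JAR

The ANSJ word segmentation JAR. It can no longer be found on Github; Sun Jian (孙健) sent it to me over QQ, for which I am very grateful. For more on ANSJ see http://blog.csdn.net/blogdevteam/article/details/8148451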

2013-08-27

Categorized Baidu Baike entries

Baidu Baike entries I scraped with my own crawler, already sorted into categories.

2013-09-26

Scraped Sohu news data

Scraped Sohu news data, 12 categories in total, already classified.

2014-03-05

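ANSJ word segmentation tool, version 1.3

For a detailed description see http://ansjsun.github.io/ansj_seg/ This is the latest ANSJ version. It must be used together with the separate tree-split jar, which can be downloaded from: http://maven.ansj.org/org/ansj/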

2014-02-14

Scraped list of all albums on Xiami

A scraped list of all albums on Xiami, including composer, album name, and so on.

2014-01-03

Scraped person-name data from Douban movies

Person-name data scraped from Douban movies, covering the entire site.

2014-01-03

JAVA JSON jar

A JAVA JSON jar. It is called as follows: JSONObject record = new JSONObject(lines); JSONArray dataarray = record.getJSONArray("data"); JSONObject _dataarray = dataarray.getJSONObject(0); String content = _dataarray.get("co").toString(); String title = _dataarray.get("t").toString();

2013-11-26

Stanford Algorithms: Design and Analysis open course, part 2

Stanford Algorithms: Design and Analysis open course, part 2. Course URL: https://class.coursera.org/algo2-002/class

2013-10-08

Baidu entry crawler

A crawler for Baidu entries that scrapes each entry's summary.

2013-10-09

IKAnalyzer word segmentation tool

The IKAnalyzer word segmentation tool, including the lucene library and Baidu Baike entries.

2013-09-26

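hadoop mapreduce eclipse plugin

While learning hadoop recently, I found that Hadoop does not provide a compiled hadoop-eclipse plugin, so I built the hadoop-eclipse-plugin-1.1.0.rar plugin myself. Hadoop 1.1.0 is a beta release; anyone interested can download and install it to get a feel for the MapReduce programming framework.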

2013-09-23

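Latest version of the ANSJ word segmentation tool

The latest ANSJ. Add the two jars to your project, create a library folder and put the two dictionary files in it, then add library.properties to the src directory. Thanks to Sun Jian (孙健).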

2013-09-16

Machine learning slides and course project

Machine learning slides and course project from a graduate course taught by Li Kan (李侃).

2013-09-08

Natural language processing slides

Natural language processing slides from coursera, in English.

2013-09-05

Coding the Matrix: complete course notes

Taught by Professor Philip Klein of Brown University; covers applications of matrix analysis, with programming in python.

2013-09-03

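Some word segmentation tools I collected

Some word segmentation tools I collected, including the Chinese Academy of Sciences' ictclas, ansj, fudanNLP, and others; they should help people working on natural language processing.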

2013-08-28

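Stanford Algorithms: Design and Analysis [part 1] course notes

Course URL: https://class.coursera.org/algo-004/class/index , Algorithms: Design and Analysis, Part 1 by Tim Roughgarden. The instructor explains things very clearly and the content is very concrete.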

2013-08-14

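Stanford machine learning open course notes + personal notes

Includes the pdf and ppt lecture notes of the Stanford machine learning open course on coursera, plus my own study notes; see my blog: http://blog.csdn.net/jj12345jj198999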

2013-06-27

OpenCV-based stereo ranging system

An OpenCV-based stereo (binocular) ranging system, developed with QT on ubuntu; original work by classmates dyb, 大哥, and 周博.

2013-06-03

web intelligence and big data

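Lecture notes and assignments from the open course offered by the Indian Institute of Technology on https://www.coursera.org/ , including mapreduce, Bayesian classification, Bayesian belief networks, and more.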

2013-06-03

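Computer vision slides

Computer vision slides, all in English. The lectures cover the material fairly thoroughly, and the final was an open-book exam.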

2013-05-13

Introduction to Algorithms slides

Introduction to Algorithms slides

2013-04-21

Implementing mapreduce with mincemeat

I have been taking the web intelligence and big data course on coursera lately. Last Friday the Indian instructor set a homework assignment that asks for a mapreduce program implemented in python. The description is as follows: Programming Assignment for HW3 Homework 3 (Programming Assignment A) Download data files bundled as a .zip file from hw3data.zip Each file in this archive contains entries that look like: journals/cl/SantoNR90:::Michele Di Santo::Libero Nigro::Wilma Russo:::Programmer-Defined Control Abstractions in Modula-2. that represent bibliographic information about publications, formatted as follows: paper-id:::author1::author2::…. ::authorN:::title Your task is to compute how many times every term occurs across titles, for each author. For example, for the author Alberto Pettorossi the following terms occur in titles with the indicated cumulative frequencies (across all his papers): program:3, transformation:2, transforming:2, using:2, programs:2, and logic:2. Remember that an author might have written multiple papers, which might be listed in multiple files. Further notice that ‘terms’ must exclude common stop-words, such as prepositions etc. For the purpose of this assignment, the stop-words that need to be omitted are listed in the script stopwords.py. In addition, single letter words, such as "a" can be ignored; also hyphens can be ignored (i.e. deleted). Lastly, periods, commas, etc. need to be ignored; in other words, only alphabets and numbers can be part of a title term: Thus, “program” and “program.” should both be counted as the term ‘program’, and "map-reduce" should be taken as 'map reduce'. Note: You do not need to do stemming, i.e. "algorithm" and "algorithms" can be treated as separate terms. The assignment is to write a parallel map-reduce program for the above task using either octo.py, or mincemeat.py, each of which is a lightweight map-reduce implementation written in Python. These are available from http://code.google.com/p/octopy/ and mincemeat.py-zipfile respectively.
I strongly recommend mincemeat.py which is much faster than octo.py even though the latter was covered first in the lecture video as an example. Both are very similar. Once you have computed the output, i.e. the terms-frequencies per author, go attempt Homework 3 where you will be asked questions that can be simply answered using your computed output, such as the top terms that occur for some particular author. Note: There is no need to submit the code; I assume you will experiment using octo.py to learn how to program using map-reduce. Of course, you can always write a serial program for the task at hand, but then you won’t learn anything about map-reduce. Lastly, please note that octo.py is a rather inefficient implementation of map-reduce. Some of you might want to delve into the code to figure out exactly why. At the same time, this inefficiency is likely to amplify any errors you make in formulating the map and reduce functions for the task at hand. So if your code starts taking too long, say more than an hour to run, there is probably something wrong.
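
The map and reduce steps that the assignment describes can be sketched in plain Python as below. This is a serial illustration of the logic only, not the mincemeat.py or octo.py API and not the poster's solution. The `STOPWORDS` set is a small hypothetical stand-in for the course's stopwords.py, the function names are made up for the example, and hyphens are treated as separators, following the assignment's "map-reduce" to 'map reduce' example.

```python
import re
from collections import Counter, defaultdict

# Hypothetical stand-in for the course's stopwords.py (not reproduced here).
STOPWORDS = {"a", "an", "and", "in", "of", "the", "for", "to", "on"}


def map_line(line):
    """Map step: yield (author, term) pairs for one record.

    Records look like: paper-id:::author1::author2:::title
    """
    _paper_id, authors, title = line.strip().split(":::", 2)
    # Hyphens become spaces; everything except letters, digits and spaces is dropped.
    cleaned = re.sub(r"[^a-z0-9 ]", "", title.lower().replace("-", " "))
    # Single-character words and stop-words are not counted as terms.
    terms = [t for t in cleaned.split() if len(t) > 1 and t not in STOPWORDS]
    for author in authors.split("::"):
        for term in terms:
            yield author, term


def reduce_pairs(pairs):
    """Reduce step: accumulate term counts per author."""
    counts = defaultdict(Counter)
    for author, term in pairs:
        counts[author][term] += 1
    return counts


def term_frequencies(lines):
    """Run the map step over every line and reduce the combined output."""
    return reduce_pairs(pair for line in lines for pair in map_line(line))
```

With a real map-reduce library the map function would run per input file and the reduce function per author key; keeping the two steps as separate functions here is meant to make that split visible.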

2013-04-18

Data Analysis slides

Slides for the Data Analysis open course on coursera. Online course URL: https://www.coursera.org/course/dataanalysis

2013-04-05

Web Intelligence and Big Data slides

Slides for the open course on coursera. The URL is https://class.coursera.org/bigdata-002/class/index

2013-04-05

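SQL Server 2005 Data Mining and Business Intelligence Complete Solution, companion disc

SQL Server 2005 Data Mining and Business Intelligence Complete Solution. This is the book my teacher recommended; worth a look if you are learning data mining.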

2013-03-19

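Felven's message board

Posted 2020-01-02, last reply 2020-01-02

My current girlfriend or a rich girl: how should I choose?

Posted 2014-01-15, last reply 2014-04-25

You even need a passcode to use the toilet at work, unbelievable

Posted 2013-11-12, last reply 2013-11-16

How do IT people emigrate?

Posted 2013-10-12, last reply 2013-11-16

Even CSDN has been cloned

Posted 2013-08-21, last reply 2013-11-16

A question about building on top of weka

Posted 2013-05-23, last reply 2013-10-12

How to implement a calendar control on windows phone8

Posted 2013-01-30, last reply 2013-09-27

Question about cameras for stereo ranging with openCV

Posted 2013-04-30, last reply 2013-05-23

Graduate course schedule, School of Computer Science, Beijing Institute of Technology

Posted 2012-11-06, last reply 2013-01-30

How to get network utilization programmatically under Linux

Posted 2011-08-05, last reply 2012-11-23

Does anyone know how to use Funambol on windows phone?

Posted 2012-06-13, last reply 2012-11-06

Help: accessing the UI from a background process on windows phone

Posted 2012-05-17, last reply 2012-05-17

Help: an example of calling JS code from C#

Posted 2012-03-02, last reply 2012-03-02

Why do managers always want you to work overtime?

Posted 2012-02-28, last reply 2012-02-29

Question about interaction between .net and mysql

Posted 2012-01-30, last reply 2012-01-31

Can anyone name some good software companies in Nanjing or Suzhou?

Posted 2011-12-21, last reply 2011-12-22

Why is the book Why's (Poignant) Guide to Ruby not sold anywhere?

Posted 2011-08-18, last reply 2011-12-21

Stackoverflow, the well-known overseas site, spent two years selecting the 11 most influential books for programmers

Posted 2011-08-18, last reply 2011-09-14

Looking for advice on a C code editor

Posted 2009-07-03, last reply 2011-08-29

Help: a question about SQL injection detection

Posted 2011-08-18, last reply 2011-08-22

Help: converting GB2312 to Chinese characters

Posted 2011-07-04, last reply 2011-07-05

Courses I took in four years at the School of Software, Beijing Institute of Technology

Posted 2011-06-14, last reply 2011-07-04

On truth-value evaluation of the where clause in SQL statements

Posted 2011-06-14, last reply 2011-06-15

Tell us which companies you interviewed with when you graduated

Posted 2011-04-16, last reply 2011-04-18

Companies I have interviewed and taken written tests with [continuously updated]

Posted 2011-03-30, last reply 2011-04-13

What to watch out for in Huawei's second-round interview

Posted 2011-03-30, last reply 2011-03-31

How to develop a text-mode interface in C++?

Posted 2009-07-04, last reply 2009-07-06

Please help: a virtual base class question in C++

Posted 2009-03-28, last reply 2009-03-28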
