数据挖掘:是福是祸

 

"Pizza Palace, may I take your order?" a woman's voice echoes from a video clip that plays in the Giovanini Commons of the University of Notre Dame's Mendoza College of Business.

"Is this Mr. Kelly?" the woman continues.

"Yes," a man replies.

"Thank you for calling again, sir," the woman responds. "I show your national identification number as 6102049998-45-54610 — is that correct?"



"Uh, yes..." the man answers.

"Thank you Mr. Kelly," the woman says. "I see you live at 736 Montrose Court, but you're calling from your cell phone. Are you at home?"

"Uh," the man replies. "I'm just leaving work, but I'm..."

"Oh, we can deliver to Bob's Auto Supply," the woman says. "That's at 175 Lincoln Avenue, yes?"

"No!" the man says, frustrated. "I'm on my way home! How do you know all this stuff?"

"We just got wired into the system, sir," the woman says.

The video clip — a cautionary dramatization circulated by the American Civil Liberties Union — was enough to draw a few laughs from the audience during the final event of the 12th annual Ethics Week.

"What they're talking about here is a situation where information has been (collected) from a lot of different sources," said Barry Keating, professor of finance at Notre Dame, during the session, which focused on data mining. "And it has been put together and used by a single company."

But take a deeper look at the concept of data mining, and the ethical dilemmas surrounding it can be pretty serious.

"Data mining can be used for the good of mankind," said Keating. "But it can also be used perhaps to the detriment of mankind."

What is data mining?

In a nutshell, data mining can be defined as extracting useful information from large data sets.

Data mining, Keating said, plays a key role in today's society since it's used in fraud detection all the time.

This includes detection of credit card fraud, money laundering, phone fraud, securities fraud and medical fraud.

Data mining, Keating added, can be applied in a variety of fields, including scientific areas of astronomy and drug discovery, governmental areas of law enforcement and profiling tax cheaters, and business areas of advertising, manufacturing and customer relationship management.

Keating cited specific examples, such as in e-commerce, where a person buys a book at amazon.com and receives recommendations of other books he or she is likely to buy.

But with the beneficial uses of data mining come some key concerns.

Ethical issues

When it comes to data mining, ethical issues may be more accurately described as problems in data security and privacy preservation, Keating said.

In fact, the most fundamental ethical issue deals with the basic storage and retrieval of personal data, he said.

"Where did you get the data?" Keating said as an example. "Whose data is it? Did somebody allow you to use their own data?"

For Keating, a big test of ethics and data mining is probably going to surround a search engine people use every day — Google.

Take, for example, how Google unveiled a new service last year that will allow you to store and access your medical records on the Web.

And just this month, Google released software that allows users of mobile phones and other wireless devices to automatically share their whereabouts with family and friends. This allows users in 27 countries to be able to broadcast their location to others constantly, using Google Latitude.



Through sheer speed of collection, Google will test the limits of what our society can tolerate, Keating said.

"Google really is going to test the limits of the ethical dimensions of data mining," he said.

The question is, do we know what we're getting ourselves into by giving up so much information about ourselves?

"I wonder whether everybody knows what they're willingly giving up," Keating said. "I wonder if I know what I'm willingly giving up. Am I giving up too much?"

Python网络爬虫与推荐算法新闻推荐平台:网络爬虫:通过Python实现新浪新闻的爬取,可爬取新闻页面上的标题、文本、图片、视频链接(保留排版) 推荐算法:权重衰减+标签推荐+区域推荐+热点推荐.zip项目工程资源经过严格测试可直接运行成功且功能正常的情况才上传,可轻松复刻,拿到资料包后可轻松复现出一样的项目,本人系统开发经验充足(全领域),有任何使用问题欢迎随时与我联系,我会及时为您解惑,提供帮助。 【资源内容】:包含完整源码+工程文件+说明(如有)等。答辩评审平均分达到96分,放心下载使用!可轻松复现,设计报告也可借鉴此项目,该资源内项目代码都经过测试运行成功,功能ok的情况下才上传的。 【提供帮助】:有任何使用问题欢迎随时与我联系,我会及时解答解惑,提供帮助 【附带帮助】:若还需要相关开发工具、学习资料等,我会提供帮助,提供资料,鼓励学习进步 【项目价值】:可用在相关项目设计中,皆可应用在项目、毕业设计、课程设计、期末/期中/大作业、工程实训、大创等学科竞赛比赛、初期项目立项、学习/练手等方面,可借鉴此优质项目实现复刻,设计报告也可借鉴此项目,也可基于此项目来扩展开发出更多功能 下载后请首先打开README文件(如有),项目工程可直接复现复刻,如果基础还行,也可在此程序基础上进行修改,以实现其它功能。供开源学习/技术交流/学习参考,勿用于商业用途。质量优质,放心下载使用。
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值