
数据集
文章平均质量分 78
不务正业的猿
桃李不言,下自成蹊。
展开
-
世界卫生统计数据
原文:World Health Statistics 2020|Complete|Geo-AnalysisComplete dataset with brief explanation.This dataset covers the most recent and updated health statistics of the world (countries recognized by WHO- all), BUT the data could not be directly used as原创 2021-01-14 21:48:51 · 1006 阅读 · 0 评论 -
COVID-19数据集-2020
原文:COVID-19 Open Research Dataset Challenge (CORD-19)An AI challenge with AI2, CZI, MSR, Georgetown, NIH & The White HouseIn response to the COVID-19 pandemic, the White House and a coalition of leading research groups have prepared the COVID-19原创 2021-01-01 22:53:24 · 1813 阅读 · 7 评论 -
Flowers Recognition(花卉识别数据集)
原文:Flowers RecognitionThis dataset contains labeled 4242 images of flowers.This dataset contains 4242 images of flowers.The data collection is based on the data flicr, google images, yandex images.You can use this datastet to recognize plants fro原创 2020-12-31 18:52:40 · 4506 阅读 · 10 评论 -
美国房屋销售数据集
原文:House Sales in King County, USAPredict house price using regressionThis dataset contains house sale prices for King County, which includes Seattle. It includes homes sold between May 2014 and May 2015.It's a great dataset for evaluating simple r原创 2020-12-31 18:21:24 · 1348 阅读 · 1 评论 -
Graduate Admission(研究生入学相关数据集)
原文:Graduate Admission 2Predicting admission from important parametersThis dataset is created for prediction of Graduate Admissions from an Indian perspective.The dataset contains several parameters which are considered important during the applicat原创 2020-12-31 18:12:27 · 1699 阅读 · 1 评论 -
World Development Indicators(世界发展指标相关数据集)
原文:World Development IndicatorsExplore country development indicators from around the worldThe World Development Indicators from the World Bank contain over a thousand annual indicators of economic development from hundreds of countries around the wo原创 2020-12-31 18:03:31 · 4614 阅读 · 0 评论 -
1872年到2020年的国际足球成绩
原文:International football results from 1872 to 2020An up-to-date dataset of over 40,000 international football resultsWell, what happened was that I was looking for a semi-definite easy-to-read list of international football matches and couldn't find原创 2020-12-29 18:13:05 · 417 阅读 · 1 评论 -
Goodreads-books(好书籍相关数据集)
原文:Goodreads-bookscomprehensive list of all books listed in goodreadsThe primary reason for creating this dataset is the requirement of a good clean dataset of books. Being a bookie myself (see what I did there?) I had searched for datasets on books原创 2020-12-29 17:54:01 · 1784 阅读 · 0 评论 -
FiveThirtyEight Comic Characters Dataset(五分之八漫画人物数据集)
原文:FiveThirtyEight Comic Characters DatasetExplore Data from FiveThirtyEightComic CharactersThis folder contains data behind the storyComic Books Are Still Made By Men, For Men And About Men.The data comes fromMarvel WikiaandDC Wikia. Charact...原创 2020-12-20 23:00:13 · 2549 阅读 · 0 评论 -
Telco Customer Churn(电信客户流失相关数据集)
原文:Telco Customer ChurnFocused customer retention programs"Predict behavior to retain customers. You can analyze all relevant customer data and develop focused customer retention programs." [IBM Sample Data Sets]Each row represents a customer, each原创 2020-12-20 22:51:25 · 2324 阅读 · 2 评论 -
Daily News for Stock Market Prediction(股市预测日报)
原文:Daily News for Stock Market PredictionUsing 8 years daily news headlines to predict stock market movementActually, I prepare this dataset for students on my Deep Learning and NLP course.But I am also very happy to see kagglers play around with i原创 2020-12-07 17:11:48 · 491 阅读 · 0 评论 -
Students Performance in Exams(学生考试成绩相关数据集)
原文:Students Performance in ExamsMarks secured by the students in various subjectsMarks secured by the studentsThis data set consists of the marks secured by the students in various subjects.This data set includes scores from three exams and a var原创 2020-12-03 17:24:51 · 5221 阅读 · 0 评论 -
Stanford Dogs Dataset(斯坦福狗数据集)
原文:Stanford Dogs DatasetOver 20,000 images of 120 dog breedsThe Stanford Dogs dataset contains images of 120 breeds of dogs from around the world. This dataset has been built using images and annotation from ImageNet for the task of fine-grained imag原创 2020-11-30 17:56:23 · 5851 阅读 · 1 评论 -
120年奥运史:运动员和成绩(相关数据集)
原文:120 years of Olympic history: athletes and resultsbasic bio data on athletes and medal results from Athens 1896 to Rio 2016This is a historical dataset on the modern Olympic Games, including all the Games from Athens 1896 to Rio 2016. I scraped th原创 2020-11-27 16:42:39 · 2039 阅读 · 0 评论 -
人脸数据集(电视剧生活大爆炸)
原文:Semi-supervised Learning with Constraints for Person Identification in Multimedia DataWe address the problem of person identification in TV series. We propose a unified learning framework for multi-class classification which incorporates labeled and原创 2020-11-20 18:19:45 · 540 阅读 · 0 评论 -
Red Wine Quality(红酒品质相关数据集)
原文:Red Wine QualityThe two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. For more details, consult the reference [Cortez et al., 2009]. Due to privacy and logistic issues, only physicochemical (inputs) and sensory原创 2020-11-20 18:17:46 · 4444 阅读 · 0 评论 -
Labeled Faces in the Wild
原文:Labeled Faces in the WildLabeled Faces in the Wild is a public benchmark for face verification, also known as pair matching. No matter what the performance of an algorithm on LFW, it should not be used to conclude that an algorithm is suitable for a原创 2020-11-15 20:45:07 · 731 阅读 · 0 评论 -
Chess Game Dataset (国际象棋游戏数据集)
原文:Chess Game Dataset (Lichess)20,000+ Lichess Games, including moves, victor, rating, opening details and moreGeneral InfoThis is a set of just over 20,000 games collected from a selection of users on the site Lichess.org, and how to collect more.原创 2020-11-15 20:42:33 · 1372 阅读 · 0 评论 -
The BioID Face Database
原文:The BioIDFace DatabaseThe BioID Face Database has been recorded and is published to give all researchers working in the area of face detection the possibility to compare the quality of their face detection algorithms with others. It may be used for.原创 2020-11-13 17:33:40 · 464 阅读 · 0 评论 -
500首歌曲数据集
原文:500 Greatest Songs of All TimeRolling Stone’s definitive list of the 500 greatest songs of all time.Rolling Stone is an American monthly magazine that focuses on popular culture. It was founded in San Francisco, California, in 1967 by Jann Wenner,原创 2020-11-13 17:32:33 · 1201 阅读 · 0 评论 -
Indian Food Recipes Dataset(印度食品配方数据集)
原文:When I browsed for a Food Recipes (Especially Indian Food) Dataset, I could not find one (that I could use) online. So, I decided to create one.The dataset has following fields (self-explanatory) - ['RecipeName', 'TranslatedRecipeName', 'Ingredients原创 2020-11-12 19:33:57 · 842 阅读 · 0 评论 -
基于深度多任务学习的人脸标志点检测-相关数据集
原文:Facial Landmark Detection by Deep Multi-task LearningFacial landmark detection of face alignment has long been impeded by the problems of occlusion and pose variation. Instead of treating the detection task as a single and independent problem, we in原创 2020-11-12 19:32:12 · 239 阅读 · 0 评论 -
Color FERET Database(面部识别技术)数据集
原文:The DOD Counterdrug Technology Program sponsored the Facial Recognition Technology (FERET) program and development of theFERETdatabase. The National Institute of Standards and Technology (NIST) is serving as Technical Agent for distribution of the F..原创 2020-11-08 22:10:38 · 2560 阅读 · 1 评论 -
哥伦比亚大学公众人物脸部数据集
原文:IntroductionThe PubFig database is a large, real-world face dataset consisting of58,797images of200people collected from the internet. Unlike most other existing face datasets, these images are taken in completely uncontrolled situations with no...原创 2020-11-07 21:57:21 · 1761 阅读 · 0 评论 -
美国大选2020推特相关数据
原文:US Election 2020 TweetsOct 15th 2020 - Nov 4th 2020, 1.09M TweetsThe 2020 US election is happening on the 3rd November 2020 and the resulting impact to the world will no doubt be large, irrespective of which candidate is elected! After reading the原创 2020-11-06 18:05:06 · 953 阅读 · 1 评论 -
FDDB数据集(人脸检测)
原文:Face Detection Data Set and Benchmark HomeWelcome to the Face Detection Data Set and Benchmark (FDDB), a data set of face regions designed for studying the problem of unconstrained face detection. This data set contains the annotations for 5171 face原创 2020-11-06 17:53:34 · 2631 阅读 · 0 评论 -
MegaFace完整数据集(65G)
之前分享过一部分MegaFace数据集,有部分网友留言问有没有完整的数据集。我这里统一回复一下,完整的数据集,我这边是有的,但当时考虑到百度网盘上传有限制,单文件上传最大只能传20G的。而考虑到MegaFace不算完全是公开数据集,切分文件太花时间了,所以就没计划上传全部。如果有网友确实很需要,我自己是在广州天河附近上班,如果有需要的,可以在我公众号里留言给我。本人公众号:MegaFace 是由华盛顿大学(University of Washington)计算机科学与工程实验室于2015原创 2020-11-05 15:07:51 · 2931 阅读 · 0 评论 -
Large-scale CelebFaces Attributes (CelebA) Dataset
原文:CelebFaces Attributes Dataset (CelebA)is a large-scale face attributes dataset with more than200Kcelebrity images, each with40attribute annotations. The images in this dataset cover large pose variations and background clutter. CelebA has large d...原创 2020-11-05 14:50:26 · 879 阅读 · 0 评论 -
UNCOVER COVID-19 Challenge(COVID-19相关数据集)
原文:UNCOVER COVID-19 ChallengeUnited Network for COVID Data Exploration and ResearchChallenge DescriptionThe Roche Data Science Coalition (RDSC) is requesting the collaborative effort of the AI community to fight COVID-19. This challenge presents a原创 2020-11-04 15:54:24 · 1110 阅读 · 0 评论 -
COVID-19 in India(印度COVID-19相关数据)
原文:COVID-19 in IndiaDataset on Novel Corona Virus Disease 2019 in IndiaCoronaviruses are a large family of viruses which may cause illness in animals or humans. In humans, several coronaviruses are known to cause respiratory infections ranging from t原创 2020-11-04 15:51:25 · 3185 阅读 · 0 评论 -
气候变化数据集-Climate Change Earth Surface Temperature Data
原文:Climate Change: Earth Surface Temperature DataExploring global temperatures since 1750Some say climate change is the biggest threat of our age while others say it’s a myth based on dodgy science. We are turning some of the data over to you so you原创 2020-11-01 21:45:09 · 5236 阅读 · 4 评论 -
Mushroom Classification(蘑菇分类数据集)
原文:Mushroom ClassificationSafe to eat or deadly poison?Although this dataset was originally contributed to the UCI Machine Learning repository nearly 30 years ago, mushroom hunting (otherwise known as "shrooming") is enjoying new peaks in popularity.原创 2020-11-01 21:43:00 · 6607 阅读 · 1 评论 -
IBM HR Analytics Employee Attrition & Performance(IBM HR Analytics员工流失与绩效数据集)
原文:IBM HR Analytics Employee Attrition & PerformancePredict attrition of your valuable employeesUncover the factors that lead to employee attrition and explore important questions such as ‘show me a breakdown of distance from home by job role and原创 2020-10-31 12:51:16 · 2259 阅读 · 1 评论 -
Data Science for COVID-19 (DS4C)(COVID-19数据科学(DS4C))
原文:COVID-19 has infected more than 10,000 people in South Korea.KCDC (Korea Centers for Disease Control & Prevention) announces the information of COVID-19 quickly and transparently.We make a structured dataset based on the report materials of KC原创 2020-10-31 12:49:17 · 369 阅读 · 0 评论 -
Titanic(泰坦尼克号数据集)
原文:OverviewThe data has been split into two groups: training set (train.csv) test set (test.csv) The training setshould be used to build your machine learning models. For the training set, we provide the outcome (also known as the “ground tr.原创 2020-10-30 16:19:08 · 3502 阅读 · 3 评论 -
Data Science Cheat Sheets
原文:Data Science Cheat SheetsQuick help to make a data scientist's life easierA collection of cheat sheets for various data-science related languages and topics.译:数据科学备忘单快速帮助让数据科学家的生活更轻松各种数据科学相关语言和主题的备忘单集合。大家可以到官网地址下载数据集,我自己也在百度网盘分享了一份。可关注本人原创 2020-10-28 10:52:42 · 269 阅读 · 0 评论 -
Fashion MNIST
原文:Fashion MNISTAn MNIST-like dataset of 70,000 28x28 labeled fashion imagesFashion-MNIST is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale im原创 2020-10-27 09:02:10 · 470 阅读 · 0 评论 -
Fruits 360(水果数据集)
原文:Fruits 360A dataset with 90483 images of 131 fruits and vegetablesThe following fruits and are included:Apples (different varieties: Crimson Snow, Golden, Golden-Red, Granny Smith, Pink Lady, Red, Red Delicious), Apricot, Avocado, Avocado rip.原创 2020-10-26 16:49:50 · 12290 阅读 · 47 评论 -
Pokemon with stats(口袋妖怪统计数据集)
原文:Pokemon with stats721 Pokemon with stats and typesThis data set includes 721 Pokemon, including their number, name, first and second type, and basic stats: HP, Attack, Defense, Special Attack, Special Defense, and Speed. It has been of great use w原创 2020-10-25 22:49:20 · 2196 阅读 · 0 评论 -
Pima Indians Diabetes Database(Pima印第安人糖尿病数据库)
原文:Pima Indians Diabetes DatabasePredict the onset of diabetes based on diagnostic measuresThis dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict原创 2020-10-25 22:46:44 · 5740 阅读 · 4 评论