The Computational Network Toolkit (CNTK) by Microsoft Research, is a unified deep-learning toolkit that describes neural networks as a series of computational steps via a directed graph.
PredictionIO, a machine learning server for software developers and data engineers.
Continually updated Data Science Python Notebooks: Spark, Hadoop MapReduce, HDFS, AWS, Kaggle, scikit-learn, matplotlib, pandas, NumPy, SciPy, and various command lines.
"I learned Python by hacking first, and getting serious later. I wanted to do this with Machine Learning. If this is your style, join me in getting a bit ahead of yourself."
Implementations of Machine Learning models from scratch in Python with a focus on transparency. Aims to showcase the nuts and bolts of ML in an accessible way.
Face recognition library that recognize and manipulate faces from Python or from the command line
Universe is a software platform for measuring and training an AI's general intelligence across the world's supply of games, websites and other applications.
Code samples for my book "Neural Networks and Deep Learning" [DEEP LEARNING]
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
Machine Learning library for PHP. Algorithms, Cross Validation, Neural Network, Preprocessing, Feature Extraction and much more in one library.
DeepMind Lab is a 3D learning environment based on id Software's Quake III Arena via ioquake3 and other open source software. Its primary purpose is to act as a testbed for research in artificial intelligence, especially deep reinforcement learning.
A machine learning library by Airbnb designed from the ground up to be human friendly.
Darknet is an open source neural network framework written in C and CUDA. It is fast, easy to install, and supports CPU and GPU computation.
Microsoft's fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
A fast parallel implementation of Connectionist Temporal Classification (CTC), on both CPU and GPU.
Tool that automatically creates and optimizes machine learning pipelines using genetic programming. Consider it your personal data science assistant, automating a tedious part of machine learning.
The Deep Learning GPU Training System (DIGITS) is a web application for training deep learning models.
Kaldi is a toolkit for speech recognition written in C++ and licensed under the Apache License v2.0. Kaldi is intended for use by speech recognition researchers.
ML engine that supports distributed learning on Hadoop, Spark or your laptop via APIs in R, Python, Scala, REST/JSON.
A dynamic neural network library working well with networks that have dynamic structures that change for every training instance. Written in C++ with bindings in Python.
This package contains the matlab implementation of the algorithms described in the book Pattern Recognition and Machine Learning by C. Bishop.
Distributed, masterless, high performance, fault tolerant data processing. Written entirely in Clojure.
A JavaScript implementation of descriptive, regression, and inference statistics. Implemented in literate JavaScript with no dependencies, designed to work in all modern browsers (including IE) as well as in Node.js.
C, C++, and Python tools for named entity recognition and relation extraction
a suite of libraries for interpreting machine learning models according to their algebraic structure.
A machine learning API and server written in C++11. It makes state of the art machine learning easy to work with and integrate into existing applications.
Text REtrieval and Annotation Toolkit, definitely the most comprehensive toolkit I’ve encountered so far for Ruby
IPython notebooks from Data School's video tutorials on scikit-learn.
TensorDebugger (TDB) is a visual debugger for deep learning. It features interactive, node-by-node debugging and visualization for TensorFlow.
Lambda Architecture Framework using Apache Spark and Apache Kafka with a specialization for real-time large-scale machine learning.
Spearmint is a package to perform Bayesian optimization according to the algorithms outlined in the paper: Practical Bayesian Optimization of Machine Learning Algorithms. Jasper Snoek, Hugo Larochelle and Ryan P. Adams. Advances in Neural Information Processing Systems, 2012.
Fast multilayer perceptron neural network library for iOS and Mac OS X. MLPNeuralNet predicts new examples by trained neural network. It is built on top of the Apple's Accelerate Framework, using vectorized operations and hardware acceleration if available.
framework for torch which provides a set of abstractions aiming at encouraging code re-use as well as encouraging modular programming
Machine Learning framework for rapid development of Machine Learning and Statistical applications
A visualization library for quick and easy generation of common plots in data analysis and machine learning.
A web-based application for quick, scalable, and automated hyperparameter tuning and stacked ensembling
A Recurrent Neural Network library that extends Torch's nn. RNNs, LSTMs, GRUs, BRNNs, BLSTMs, etc.
A python framework to transform natural language questions to queries in a database query language
Python implementation of many of the artificial intelligence algorithms described on the book "Artificial Intelligence, a Modern Approach". It focuses on providing an easy to use, well documented and tested library.
A genomics processing engine and specialized file format built using Apache Avro, Apache Spark and Parquet. Apache 2 licensed.
A library that does auto linking and extraction of usernames, lists and hashtags in tweets
A general classifier module to allow Bayesian and other types of classifications.
An advanced neural network and machine learning framework. Encog contains classes to create a wide variety of networks, as well as support classes to normalize and process data for these neural networks. Encog trains using multithreaded resilient propagation. Encog can also make use of a GPU to further speed processing time. A GUI based workbench is also provided to help model and train neural networks.
A toolbox framework of AI modules written in Swift: Graphs/Trees, Linear Regression, Support Vector Machines, Neural Networks, PCA, KMeans, Genetic Algorithms, MDP, Mixture of Gaussians.
Autograd automatically differentiates native Torch code. Inspired by the original Python version.
FACTORIE is a toolkit for deployable probabilistic modeling, implemented as a software library in Scala. It provides its users with a succinct language for creating relational factor graphs, estimating parameters and performing inference.
ViZDoom allows developing AI bots that play Doom using only the visual information (the screen buffer). It is primarily intended for research in machine visual learning, and deep reinforcement learning, in particular.
an IPython-based environment for conducting data-driven research in a consistent and reproducible way. REP is not trying to substitute scikit-learn, but extends it and provides better user experience.
The Gesture Recognition Toolkit (GRT) is a cross-platform, open-source, C++ machine learning library designed for real-time gesture recognition.
Automated machine learning for production and analytics. Lets you focus on the fun parts of ML, while outputting production-ready code, and detailed analytics of your dataset and results. Includes support for NLP, XGBoost, LightGBM, and soon, deep learning.
MeTA : ModErn Text Analysis is a C++ Data Sciences Toolkit that facilitates mining big text data.
A javascript library containing a collection of least squares fitting methods for finding a trend in a set of data.
A deep learning library designed for streamlining research and development using the Torch7 distribution. It emphasizes flexibility through the elegant use of object-oriented design patterns.
Automated machine learning, data formatting, ensembling, and hyperparameter optimization for competitions and exploration- just give it a .csv file!
library for approximate unsupervised inference in Bayesian Hidden Markov Models (HMMs) and explicit-duration Hidden semi-Markov Models (HSMMs), focusing on the Bayesian Nonparametric extensions, the HDP-HMM and HDP-HSMM, mostly with weak-limit approximations.
The first neural network / machine learning library written in Swift. This is a project for AI algorithms in Swift for iOS and OS X development. This project includes algorithms focused on Bayes theorem, neural networks, SVMs, Matrices, etc..
Python Natural Language Processing Library. General purpose NLP library for Python. Also contains some specific modules for parsing common NLP formats, most notably for FoLiA, but also ARPA language models, Moses phrasetables, GIZA++ alignments.
C++ Neural Network library for Node.js. It has advantage on large dataset and multi-threaded training.
Text and supporting code for Think OS: A Brief Introduction to Operating Systems.
A set of tools for creating and testing machine learning features, with a scikit-learn compatible API
Tools for writing, submitting, debugging, and monitoring Storm topologies in pure Python.
VIGRA is a generic cross-platform C++ computer vision and machine learning library for volumes of arbitrary dimensionality with Python bindings.
a machine learning framework featuring logistic regression, support vector machines, decision trees and random forests.
CPU and GPU-accelerated matrix library intended to support large-scale exploratory data analysis.
The Push programming language and the PushGP genetic programming system implemented in Clojure
An optimization library for Torch. SGD, Adagrad, Conjugate-Gradient, LBFGS, RProp and more.
JRuby Mahout is a gem that unleashes the power of Apache Mahout in the world of JRuby.
Simple machine learning library, including Perceptron, Regression, Support Vector Machine, Decision Tree and more, it's easy to use and easy to learn for beginners.
A high performance software library developed by Intel and optimized for Intel's architectures. Library provides algorithmic building blocks for all stages of data analytics and allows to process data in batch, online and distributed modes.
Two data science utilities in R from Microsoft: 1) Interactive Data Exploration, Analysis, and Reporting (IDEAR) ; 2) Automated Modeling and Reporting (AMR).
deeplearn-rs provides simple networks that use matrix multiplication, addition, and ReLU under the MIT license.
Linear / Logistic regression, Neural Networks, Collaborative Filtering and Gaussian Multivariate Distribution
Clojure wrapper for Encog (v3) (Machine-Learning framework that specializes in neural-nets)
A C library for product recommendations/suggestions using collaborative filtering (CF).
Optimization library focused on machine learning, pythonic implementations of gradient descent, LBFGS, rmsprop, adadelta and others
Functionally composable Machine Learning library using Numenta’s Cortical Learning Algorithm
Massively parallel self-organizing maps: accelerate training on multicore CPUs, GPUs, and clusters, has python API.
The ClearNLP project provides software and resources for natural language processing. The project started at the Center for Computational Language and EducAtion Research, and is currently developed by the Center for Language and Information Research at Emory University. This project is under the Apache 2 license.
a service for deployment Apache Spark MLLib machine learning models as realtime, batch or reactive web services.
a service for deployment Apache Spark MLLib machine learning models as realtime, batch or reactive web services.
a service for deployment Apache Spark MLLib machine learning models as realtime, batch or reactive web services.
A complete object-oriented environment for machine learning in Matlab.
A package for unsupervised learning in Torch. Provides modules that are compatible with nn (LinearPsd, ConvPsd, AutoEncoder, ...), and self-contained algorithms (k-means, PCA).
A simple Machine Learning Framework written in Swift. Currently features Simple Linear Regression, Polynomial Regression, and Ridge Regression.
A sentence aligner, a friendly tool for extracting parallel sentences from comparable corpora.
Swift Language Bindings of TensorFlow. Using native TensorFlow models on both macOS / Linux.
Example code and materials that illustrate applications of SAS machine learning techniques.
A library for machine learning that builds predictions using a linear regression.
An all-in-one NuPIC Hierarchical Temporal Memory visualization and debugging super-tool!
Kanji / Hiragana / Katakana to Romaji Converter. Edict Dictionary & parallel sentences Search. Sentence Similarity between two JP Sentences. Sentiment Analysis of Japanese Text. Run Cabocha(ISO--8859-1 configured) in Python.
Source code and supporting content for my Ruby Manor presentation on Data Visualisation with Ruby
C++ library, command line tools, and Python binding for extracting and working with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way.
Python binding to C++ library for extracting and working with with basic linguistic constructions such as n-grams and skipgrams in a quick and memory-efficient way.
Example code and materials that illustrate techniques for integrating SAS with other analytics technologies in Java, PMML, Python and R.
Python interface for converting Penn Treebank trees to Stanford Dependencies.
Python bindings for ZPar, a statistical part-of-speech-tagger, constiuency parser, and dependency parser for English.
An Objective-C multilayer perceptron library, with full support for training through backpropagation. Implemented using vDSP and vecLib, it's 20 times faster than its Java equivalent. Includes sample code for use from Swift.
Unicode-aware regular-expression based tokenizer for various languages. Tool and C++ library. Supports FoLiA format.
It implemented 3 layers neural network ( Input Layer, Hidden Layer and Output Layer ) and it named Back Propagation Neural Network (BPN). This network can be used in products recommendation, user behavior analysis, data mining and data analysis.
Memory-based NLP suite developed for Dutch: PoS tagger, lemmatiser, dependency parser, NER, shallow parser, morphological analyzer.
Library of SAS Enterprise Miner process flow diagrams to help you learn by example about specific data mining topics.
Cephes mathematical functions library, wrapped for Torch. Provides and wraps the 180+ special mathematical functions from the Cephes mathematical library, developed by Stephen L. Moshier. It is used, among many other places, at the heart of SciPy.
This is an objected-oriented framework that facilitates the integration of probabilistic models for sequences over a user defined alphabet.
Simple, concise implementations of machine learning techniques and utilities in Clojure.
it implemented multi-perceptrons neural network (???????????) based on Back Propagation Neural Network (BPN) and designed unlimited-hidden-layers.
Python implementation of cover trees, near-drop-in replacement for scipy.spatial.kdtree
Community based data collection, packed in gem. Get list of pretty much anything (stop words, countries, non words) in txt, json or hash. Demo/Search for a list
An image/graph library for Torch. This package provides routines to construct graphs on images, segment them, build trees out of them, and convert them back to images.
Some experiments with the coordinate descent algorithm used in the (Sparse) Group Lasso model
It implemented K-Means the clustering and classification algorithm. It could be used in data mining and image compression.
Python binding to Frog, an NLP suite for Dutch. (pos tagging, lemmatisation, dependency parsing, NER)
Easy to install and use deep learning Faster R-CNN face detection for images and video in a docker container.
Implementation of image to image (pix2pix) translation from the paper by isola et al.[DEEP LEARNING]
Neuron is simple class for time series predictions. It's utilize LNU (Linear Neural Unit), QNU (Quadratic Neural Unit), RBF (Radial Basis Function), MLP (Multi Layer Perceptron), MLP-ELM (Multi Layer Perceptron - Extreme Learning Machine) neural networks learned with Gradient descent or LeLevenberg–Marquardt algorithm.
Example code and materials that illustrate using neural networks with several hidden layers in SAS.
Receiver Operating Characteristics and functions for evaluation probabilistic binary classifiers
It is a non-supervisor and self-learning algorithm (adjust the weights) in neural network of Machine Learning.
It implemented Fuzzy C-Means (FCM) the fuzzy clustering / classification algorithm on Machine Learning. It could be used in data mining and image compression.
A Python extension module wrapping the full TiMBL C++ programming interface. Timbl is an elaborate k-Nearest Neighbours machine learning toolkit.
A video/graph library for Torch. This package provides routines to construct graphs on videos, segment them, build trees out of them, and convert them back to videos.
Very simple implementation of neural networks for dummies in python without using any libraries, with detailed comments.
An implementation of Dell Zhang's solution to Wikipedia's Participation Challenge on Kaggle
A header-only C++11 Neural Network library. Low dependency, native traditional chinese document.
OpenGM is a C++ library for graphical modeling, and inference. The Lua bindings provide a simple way of describing graphs, from Lua, and then optimizing them with OpenGM.
code and tools around integral images. A library for finding interest points based on fast integral histograms.