The "Python Machine Learning (1st edition)" book code repository and info resource
Jupyter Notebook
Updated Feb 16, 2018
Machine Learning From Scratch. Bare bones Python implementations of Machine Learning models and algorithms with a foc…
Python
Updated May 3, 2018
📝 An awesome Data Science repository to learn and apply for real world problems.
Updated May 10, 2018
Topic Modelling for Humans
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree al…
extract text from any document. no muss. no fuss.
HTML
Updated May 7, 2018

Machine Learning for Cyber Security
Updated Apr 23, 2018
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Python
Updated May 13, 2018
🍊 📊 💡 Orange: Interactive data analysis
https://orange.biolab.si
automated feature engineering
Python
Updated May 16, 2018
A set of tools for extracting tables from PDF files helping to do data mining on (OCR-processed) scanned documents.
Python
Updated Jan 9, 2018
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publis…
JavaScript
Updated May 1, 2018
📝 Подборка ресурсов по машинному обучению
Updated Nov 30, 2017
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processin…
Java
Updated May 7, 2018
HTML5 based online tool to extract numerical data from plot images.
JavaScript
Updated May 14, 2018
Data collection toolkit for social media analytics
Python
Updated Jan 12, 2018
AIL framework - Analysis Information Leak framework
Practical Data Science with Ruby based tools.
Ruby
Updated Apr 25, 2018
An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a …
Python
Updated May 17, 2018
Companion code for the book "Mastering Social Media Mining with Python"
Python
Updated Nov 11, 2017
ELKI Data Mining Toolkit
pyclustring is a Python, C++ data mining (clustering, oscillatory networks, neural networks, etc.) library.
Extract structured data from PDF invoices
Python
Updated May 15, 2018
CogComp's Natural Language Processing libraries and Demos:
Jekyll-based static site for The Programming Historian
HTML
Updated May 17, 2018
Job data mining repo for lagou.com
Python
Updated Dec 12, 2017
Real-time sentiment analysis in Python using twitter's streaming api
Python
Updated Jan 20, 2018
Data analysis of Zhihu with Python
Python
Updated Apr 7, 2017
Analytic platform for real-time large-scale streams containing structured and unstructured data.
C++
Updated Apr 24, 2018
some python projects
Python
Updated Dec 30, 2017