Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spar…
#
spark
Repositories 3,471
Learn and understand Docker technologies, with real DevOps practice!
Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
汇总java生态圈常用技术框架、开源中间件,系统架构、项目管理、经典架构案例、数据库、常用三方库、线上运维等知识
Updated Mar 9, 2019
Kubernetes中文指南/云原生应用架构实践手册 - https://jimmysong.io/kubernetes-handbook
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools…
A Flexible and Powerful Parameter Server for large-scale machine learning
Alluxio, formerly Tachyon, Unify Data at Memory Speed
alluxio
distributed-storage
big-data
memory-speed
hadoop
spark
virtual-file-system
presto
tensorflow
Java
Updated Apr 3, 2019
List of Data Science Cheatsheets to rule the world
Updated Apr 2, 2019
Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBo…
h2o
machine-learning
data-science
deep-learning
big-data
ensemble-learning
gbm
random-forest
naive-bayes
pca
opensource
distributed
multi-threading
java
python
r
hadoop
spark
gpu
automatic
Java
Updated Apr 3, 2019
PipelineAI: Real-Time Enterprise AI Platform
machine-learning
artificial-intelligence
tensorflow
kubernetes
elasticsearch
cassandra
spark
kafka
netflixoss
presto
airflow
pipeline
docker
redis
neural-network
gpu
microservices
nifi
scikit
prediction
Java
Updated Mar 19, 2019
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Python
Updated Mar 13, 2019
BigDL: Distributed Deep Learning Library for Apache Spark
Scala
Updated Apr 3, 2019
Open-source IoT Platform - Device management, data collection, processing and visualization.
Interactive and Reactive Data Science using Scala and Spark.
Python clone of Spark, a MapReduce alike framework in Python
Python
Updated Jan 23, 2019
酷玩 Spark: Spark 源代码解析、Spark 类库等
Scala
Updated Feb 6, 2019
REST job server for Apache Spark
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Pyt…
machine-learning
data-science
r
python
gradient-boosting-machine
random-forest
deep-learning
xgboost
h2o
spark
R
Updated Sep 15, 2018
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
java
gpu
scientific
nd4j
jvm
dl4j
backend
scala-notebook
spark
artificial-intelligence
scientific-computing
numerical-calculations
Java
Updated Jun 16, 2018
DataStax Spark Cassandra Connector
Scala
Updated Mar 26, 2019
A large-scale entity and relation database supporting aggregation of properties
Microsoft Machine Learning for Apache Spark
Compile-time Language Integrated Queries for Scala
Scala
Updated Apr 3, 2019
spark ml 算法原理剖析以及具体的源码实现分析
Updated Mar 25, 2019
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machin…
Machine Learning Platform and Recommendation Engine built on Kubernetes
machine-learning
deep-learning
deployment
kubernetes
docker
microservices
spark
kafka
kafka-streams
tensorflow
python
java
cloud
aws
gcp
azure
seldon
recommender-system
recommendation-engine
prediction
Java
Updated Jul 28, 2018
The Hunting ELK
A better compressed bitset in Java
Elassandra = Elasticsearch + Apache Cassandra
cassandra
elasticsearch
search
completion
aggregation
nosql
masterless
mission-critical
fuzzy-search
rest-api
lucene
kibana
logstash
spark
Java
Updated Mar 12, 2019