Repositories
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
-
incubator-dolphinscheduler
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available `out of the box`.
-
superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
ozone
Scalable, redundant, and distributed object store for Apache Hadoop
-
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
-
-
orc
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
-
camel
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
-
incubator-nuttx
Apache NuttX is a mature, real-time embedded operating system (RTOS)
-
-
servicecomb-service-center
A standalone service center to allow services to register their instance information and to discover providers of a given service
-
skywalking
APM, Application Performance Monitoring System
-
pulsar
Apache Pulsar - distributed pub-sub messaging system
-
-
incubator-nlpcraft
Apache NLPCraft - API to convert natural language into actions.
-
-
solr
Apache Solr open-source search software
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
-
lucene
Apache Lucene open-source search software
-
-