Stars
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
Fluss is a streaming storage built for real-time analytics.
Page Cache stat: get page cache stats for files on Linux
Open, Multi-modal Catalog for Data & AI
High Performance Inter-Thread Messaging Library
FlatBuffers: Memory Efficient Serialization Library
An open source, standard data file format for graph data storage and retrieval.
World's most powerful open data catalog for building a high-performance, geo-distributed and federated metadata lake.
MSVC's implementation of the C++ Standard Library.
图解计算机网络、操作系统、计算机组成、数据库,共 1000 张图 + 50 万字,破除晦涩难懂的计算机基础知识,让天下没有难懂的八股文!🚀 在线阅读:https://xiaolincoding.com
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
Stream processing platform for developers.
Apache ORC - the smallest, fastest columnar storage for Hadoop workloads
A Cloud Native Batch System (Project under CNCF)
Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is the core components in Katalyst system, including multiple ag…
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
OceanBase is an enterprise distributed relational database with high availability, high performance, horizontal scalability, and compatibility with SQL standards.
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.