Skip to content

Latest commit

 

History

History

query execution

Folders and files

NameName
Last commit message
Last commit date

parent directory

..
 
 

architecture

单机

实现

  • pull
    • volcano
  • push
    • callback
    • consumer-producer / DAG
      • single thread
      • multi thread
        • thread pool
        • no thread pool

编译 vs 解释

语言有llvm 语言没有llvm,且不使用llvm 语言没有llvm,且使用llvm
pull 解释型 解释型 解释型
push 编译型90% 编译型80% 编译型100%

向量化

  • table
  • batch
  • stream

分布式

阶段

  • logical plan
  • standalone physical plan
  • distributed physical plan
    • DAG: stage / pipeline
      • source
      • transform
      • sink
    • operator: writer + reader / sender + receiver
      • merge
      • shuffle / redistribution / repartition
      • broadcast

流程

  • 角色
    • master
    • worker
    • client
  • 流程
    • worker register master
    • client send sql to master
    • master parse sql to distributed physical plan
    • master send distributed physical plan to worker
    • master send begin flag to source
    • worker execute and data flow between distributed physical plan
    • sink send end flag to master
    • master send data to client

paper

article