You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TensorOps - A Multi-Target MLIR Tensor Compiler with Operator Fusion
Architecture
TensorOps Dialect (ten.matmul, ten.relu, ten.add)
|
v
TensorOps -> Linalg (--tensor-to-linalg)
|
+----+----+
v v
Fusion Lowering
(matmul (bufferize -> linalg->loops -> scf->cf
+relu) -> memref->llvm -> func->llvm)
| |
v v
Fused LLVM Dialect -> JIT (lli)
TensorOps
|
v
GPU Lowering
|
+--+--+
v v
NVVM ROCDL
| |
v v
PTX (codegen
| text only)
v
cubin -> CUDA Driver API
(RTX 4080)