[RFC] Create a basic binding for CPP Fusion in python frontend using AI Coding Tools #4077

rdspring1 · 2025-03-14T04:22:37Z

This PR is a rapid prototype of Create a basic binding for CPP Fusion in python frontend using Google Gemini.

How I am using Google Gemini?

Upload header files to Google Gemini
Initial Prompt: Create PYBIND11_MODULE for cpp class in some_file.txt?

Other Prompts

Add docstrings
Can you convert all string names to snake_case?
Can you convert cpp operators to their corresponding python dunder methods?

Python code demo

from nvfuser import fusion

f = fusion.Fusion()
fg = fusion.FusionGuard(f)

tv0 = fusion.TensorViewBuilder().n_dims(1).shape([10]).contiguity(True).build()
tv1 = fusion.TensorViewBuilder().n_dims(1).shape([10]).contiguity(True).build()
f.add_input(tv0)
f.add_input(tv1)

tv2 = fusion.ops.add(tv0, tv1)
f.add_output(tv2)

print("Fusion IR")
f.print_math()
print("=========")

print("TensorView")
print(tv0.to_string(0))
print(tv1.to_string(0))
print(tv2.to_string(0))
print("=========")

print("IterDomain")
print(tv0.axis(0).to_string())
print(tv1.axis(0).to_string())
print(tv2.axis(0).to_string())
print("=========")

print("IterDomain Extent")
print(tv0.axis(0).extent().to_string(0))
print(tv1.axis(0).extent().to_string(0))
print(tv2.axis(0).extent().to_string(0))
print("=========")

Output

Fusion IR
Inputs:
  T0_g_float[iS0{10}]
  T1_g_float[iS1{10}]
Outputs:
  T2_g_float[iS2{10}]

%kernel_math {
T2_g_float[iS2{10}]
   = T0_g_float[iS0{10}]
   + T1_g_float[iS1{10}];
} // %kernel_math 
=========

TensorView
T0_g_float[iS0{10}]
T1_g_float[iS1{10}]
T2_g_float[iS2{10}]
=========

IterDomain
iS0{10}
iS1{10}
iS2{10}
=========

IterDomain Extents
10
10
=========

github-actions · 2025-03-14T04:23:25Z

Review updated until commit b6d7a16

Description

Added direct bindings for CPP Fusion in Python frontend.
Created bindings for IR nodes, operations, and runtime components.
Included demo scripts to demonstrate usage.

Changes walkthrough 📝

Relevant files

Enhancement

ir.cpp `Bindings for IR Nodes` csrc/python_frontend/direct_bindings/ir.cpp Created bindings for base nodes, internal base nodes, and interface nodes. Added docstrings for all bindings.	+1159/-0
ops.cpp `Bindings for Operations` csrc/python_frontend/direct_bindings/ops.cpp Created bindings for unary and binary operations. Added docstrings for all operations.	+630/-0
runtime.cpp `Bindings for Runtime Components` csrc/python_frontend/direct_bindings/runtime.cpp Created bindings for Fusion and FusionExecutorCache. Added methods for execution, compilation, and IR retrieval.	+561/-0
python_bindings.cpp `Initialize Direct Bindings` csrc/python_frontend/python_bindings.cpp Added initialization of direct bindings for fusion.	+6/-0
python_bindings.h `Declare Direct Bindings` csrc/python_frontend/python_bindings.h Added declarations for direct bindings functions.	+9/-0

Documentation

demo.py `Add Fusion Demo` demo.py Added a demo script to demonstrate fusion usage.	+68/-0
pointwise.py `Add Pointwise Demo` pointwise.py Added a script to demonstrate pointwise operations and scheduling.	+40/-0

Configuration changes

CMakeLists.txt `Update Build System` CMakeLists.txt Added source files for direct bindings to the build system.	+3/-0

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

🧪 No relevant tests

⚡ Recommended focus areas for review

Possible Issue

The py::nodelete policy is used for all bindings, which means the memory management is handled by IrContainer. However, there is no explicit check or documentation to ensure that IrContainer is correctly managing the memory. This could lead to memory leaks or undefined behavior if IrContainer does not properly handle the deletion of these objects.

namespace {
void bindBaseNodes(py::module& fusion) {
  // Statement
  py::class_<
      nvfuser::Statement,
      std::unique_ptr<nvfuser::Statement, py::nodelete>>(fusion, "Statement")
      .def(
          "name",
          &nvfuser::Statement::name,
          R"(
Get the unique identifier of this statement.

Returns
-------
int
    The integer that represents this statement's unique identifier.
)")
      .def(
          "is_val",
          &nvfuser::Statement::isVal,
          R"(
Check if this statement is a value.

Returns
-------
bool
    True if this statement is a Val, False otherwise.
)")
      .def(
          "is_expr",
          &nvfuser::Statement::isExpr,
          R"(
Check if this statement is an expression.

Returns
-------
bool
    True if this statement is an Expr, False otherwise.
)")
      .def(
          "fusion",
          &nvfuser::Statement::fusion,
          R"(
Get the fusion this statement belongs to.

Returns
-------
Fusion
    The fusion container that owns this statement.
)")
      .def(
          "same_type",
          &nvfuser::Statement::sameType,
          py::arg("other"),
          R"(
Check if this statement has the same type as another statement.

Parameters
----------
other : Statement
    The statement to compare types with.

Returns
-------
bool
    True if both statements are of the same type, False otherwise.
)")
      .def(
          "__eq__",
          &nvfuser::Statement::sameAs,
          R"(
Check if this statement is equal to another statement.

Parameters
----------
other : Statement
    The statement to compare with.

Returns
-------
bool
    True if the statements are equal, False otherwise.
)")
      .def(
          "__str__",
          [](Statement* self) { return self->toString(); },
          "Convert the IterDomain to a string representation.");

  // Val
  py::class_<
      nvfuser::Val,
      nvfuser::Statement,
      std::unique_ptr<nvfuser::Val, py::nodelete>>(fusion, "Val")
      .def(
          "vtype",
          &nvfuser::Val::vtype,
          R"(
Get the value type of this Val.

Returns
-------
ValType
    The type of value (e.g., Scalar, IterDomain, TensorView, etc.).
)")
      .def(
          "dtype",
          &nvfuser::Val::dtype,
          R"(
Get the data type of this Val.

Returns
-------
DataType
    The data type (e.g., Float, Half, Int, etc.).
)")
      .def(
          "is_symbolic",
          &nvfuser::Val::isSymbolic,
          R"(
Check if this value is symbolic (not a concrete value).

Returns
-------
bool
    True if the value is symbolic, False otherwise.
)")
      .def(
          "is_scalar",
          &nvfuser::Val::isScalar,
          R"(
Check if this value is a scalar.

Returns
-------
bool
    True if the value is a scalar, False otherwise.
)")
      .def(
          "is_const_scalar",
          &nvfuser::Val::isConstScalar,
          R"(
Check if this value is a constant scalar.

Returns
-------
bool
    True if all dependencies are constant scalars, False otherwise.
)")
      .def(
          "is_const_int",
          &nvfuser::Val::isConstInt,
          R"(
Check if this value is a constant integer.

Returns
-------
bool
    True if all dependencies are constant integers, False otherwise.
)")
      .def(
          "is_integral_scalar",
          &nvfuser::Val::isIntegralScalar,
          R"(
Check if this value is an integral scalar.

Returns
-------
bool
    True if the value is an integral scalar, False otherwise.
)")
      .def(
          "is_floating_point_scalar",
          &nvfuser::Val::isFloatingPointScalar,
          R"(
Check if this value is a floating point scalar.

Returns
-------
bool
    True if the value is a floating point scalar, False otherwise.
)")
      .def(
          "is_a_bool",
          &nvfuser::Val::isABool,
          R"(
Check if this value is a boolean.

Returns
-------
bool
    True if the value is a boolean, False otherwise.
)")
      .def(
          "is_const",
          &nvfuser::Val::isConst,
          R"(
Check if this value is a constant with no dependencies.

Returns
-------
bool
    True if the value is a constant scalar with no dependencies, False otherwise.
)")
      .def(
          "is_zero",
          &nvfuser::Val::isZero,
          R"(
Check if this value is zero.

Returns
-------
bool
    True if the value is zero, False otherwise.
)")
      .def(
          "is_zero_int",
          &nvfuser::Val::isZeroInt,
          R"(
Check if this value is the integer zero.

Returns
-------
bool
    True if the value is the integer zero, False otherwise.
)")
      .def(
          "is_one",
          &nvfuser::Val::isOne,
          R"(
Check if this value is one.

Returns
-------
bool
    True if the value is one, False otherwise.
)")
      .def(
          "is_one_int",
          &nvfuser::Val::isOneInt,
          R"(
Check if this value is the integer one.

Returns
-------
bool
    True if the value is the integer one, False otherwise.
)")
      .def(
          "is_true",
          &nvfuser::Val::isTrue,
          R"(
Check if this value is true.

Returns
-------
bool
    True if the value is true, False otherwise.
)")
      .def(
          "is_false",
          &nvfuser::Val::isFalse,
          R"(
Check if this value is false.

Returns
-------
bool
    True if the value is false, False otherwise.
)")
      .def(
          "definition",
          &nvfuser::Val::definition,
          R"(
Get the expression that defines this value.

Returns
-------
Expr
    The expression that produces this value, or None if it's an input.
)")
      .def(
          "uses",
          &nvfuser::Val::uses,
          R"(
Get all expressions that use this value as an input.

Returns
-------
list of Expr
    The expressions that consume this value.
)")
      .def(
          "is_fusion_input",
          &nvfuser::Val::isFusionInput,
          R"(
Check if this value is a fusion input.

Returns
-------
bool
    True if the value is a fusion input, False otherwise.
)")
      .def(
          "is_fusion_output",
          &nvfuser::Val::isFusionOutput,
          R"(
Check if this value is a fusion output.

Returns
-------
bool
    True if the value is a fusion output, False otherwise.
)");

  // Expr
  py::class_<
      nvfuser::Expr,
      nvfuser::Statement,
      std::unique_ptr<nvfuser::Expr, py::nodelete>>(fusion, "Expr")
      .def(
          "input",
          &nvfuser::Expr::input,
          py::arg("index"),
          R"(
Get the input value at the specified index.

Parameters
----------
index : int
    The index of the input to retrieve.

Returns
-------
Val
    The input value at the given index.
)")
      .def(
          "output",
          &nvfuser::Expr::output,
          py::arg("index"),
          R"(
Get the output value at the specified index.

Parameters
----------
index : int
    The index of the output to retrieve.

Returns
-------
Val
    The output value at the given index.
)")
      .def(
          "__eq__",
          &nvfuser::Expr::sameAs,
          py::arg("other"),
          R"(
Check if this expression is equal to another expression.

Parameters
----------
other : Expr
    The expression to compare with.

Returns
-------
bool
    True if the expressions are equal, False otherwise.
)")
      .def(
          "get_op_string",
          &nvfuser::Expr::getOpString,
          R"(
Get the string representation of this expression's operation.

Returns
-------
str
    The name/type of the operation this expression performs.
)");
}

void bindInternalBaseNodes(py::module& fusion) {
  // IterDomain
  py::class_<
      nvfuser::IterDomain,
      nvfuser::Val,
      std::unique_ptr<nvfuser::IterDomain, py::nodelete>>(fusion, "IterDomain")
      .def(
          "__eq__",
          &nvfuser::IterDomain::sameAs,
          py::arg("other"),
          R"(
Check if this IterDomain is equal to another IterDomain.

Parameters
----------
other : IterDomain
    The IterDomain to compare with.

Returns
-------
bool
    True if the domains are equal, False otherwise.
)")
      .def(
          "__str__",
          [](IterDomain* self) { return self->toString(/*indent_size=*/0); },
          "Convert the IterDomain to a string representation.")
      .def(
          "is_reduction",
          &nvfuser::IterDomain::isReduction,
          R"(
Check if this is a reduction domain.

Returns
-------
bool
    True if this is a reduction domain, False otherwise.
)")
      .def(
          "is_iteration",
          &nvfuser::IterDomain::isIteration,
          R"(
Check if this is an iteration domain.

Returns
-------
bool
    True if this is an iteration domain, False otherwise.
)")
      .def(
          "is_broadcast",
          &nvfuser::IterDomain::isBroadcast,
          R"(
Check if this is a broadcast domain.

Returns
-------
bool
    True if this is a broadcast domain, False otherwise.
)")
      .def(
          "is_symbolic",
          &nvfuser::IterDomain::isSymbolic,
          R"(
Check if this is a symbolic domain.

Returns
-------
bool
    True if this is a symbolic domain, False otherwise.
)")
      .def(
          "is_rfactor_product",
          &nvfuser::IterDomain::isRFactorProduct,
          R"(
Check if this domain is an rfactor product.

Returns
-------
bool
    True if this is an rfactor product, False otherwise.
)")
      .def(
          "is_parallelized",
          &nvfuser::IterDomain::isParallelized,
          R"(
Check if this domain is parallelized.

Returns
-------
bool
    True if this domain is parallelized, False otherwise.
)")
      .def(
          "get_parallel_type",
          &nvfuser::IterDomain::getParallelType,
          R"(
Get the parallel type of this domain.

Returns
-------
ParallelType
    The parallel type of this domain.
)")
      .def(
          "get_iter_type",
          &nvfuser::IterDomain::getIterType,
          R"(
Get the iteration type of this domain.

Returns
-------
IterType
    The iteration type of this domain.
)")
      .def(
          "extent",
          &nvfuser::IterDomain::extent,
          R"(
Get the extent of this domain.

Returns
-------
Val
    The extent of this domain.
)")
      .def(
          "has_expanded_extent",
          &nvfuser::IterDomain::hasExpandedExtent,
          R"(
Check if this domain has an expanded extent.

Returns
-------
bool
    True if this domain has an expanded extent, False otherwise.
)")
      .def(
          "expanded_extent",
          &nvfuser::IterDomain::expandedExtent,
          R"(
Get the expanded extent of this domain.

Returns
-------
Val
    The expanded extent of this domain.
)")
      .def(
          "maybe_partial",
          &nvfuser::IterDomain::maybePartial,
          R"(
Check if this domain may be partial.

Returns
-------
bool
    True if this domain may be partial, False otherwise.
)")
      .def(
          "parallelize",
          &nvfuser::IterDomain::parallelize,
          py::arg("parallel_type"),
          R"(
Set the parallel type of this domain.

Parameters
----------
parallel_type : ParallelType
    The type of parallelization to apply (e.g., BIDx, TIDx, etc.).

Notes
-----
This is a key function used in scheduling to specify how the domain should be parallelized
across CUDA threads and blocks.
)");

  py::class_<
      nvfuser::TensorDomain,
      nvfuser::Val,
      std::unique_ptr<nvfuser::TensorDomain, py::nodelete>>(
      fusion, "TensorDomain")
      .def(
          "__str__",
          [](TensorDomain* self) { return self->toString(/*indent_size=*/0); },
          "Convert the TensorDomain to a string representation.")
      .def(
          "get_root_domain",
          &nvfuser::TensorDomain::root,
          R"(
Get the root domain of this tensor.

Returns
-------
list of IterDomain
    The root iteration domains.
)")
      .def(
          "get_allocation_domain",
          &nvfuser::TensorDomain::allocation,
          R"(
Get the allocation domain of this tensor.

Returns
-------
list of IterDomain
    The allocation iteration domains.
)")
      .def(
          "get_loop_domain",
          &nvfuser::TensorDomain::loop,
          R"(
Get the loop domain of this tensor.

Returns
-------
list of IterDomain
    The loop iteration domains.
)")
      .def(
          "get_logical_domain",
          &nvfuser::TensorDomain::logical,
          R"(
Get the logical domain of this tensor.

Returns
-------
list of IterDomain
    The logical iteration domains.
)")
      .def(
          "get_maybe_root_domain",
          &nvfuser::TensorDomain::maybeRoot,
          R"(
Get the root domain if it exists.

Returns
-------
list of IterDomain
    The root iteration domains, or empty list if not available.
)")
      .def(
          "get_maybe_allocation_domain",
          &nvfuser::TensorDomain::maybeAllocation,
          R"(
Get the allocation domain if it exists.

Returns
-------
list of IterDomain
    The allocation iteration domains, or empty list if not available.
)")
      .def(
          "is_maybe_root",
          &nvfuser::TensorDomain::isMaybeRoot,
          py::arg("id"),
          R"(
Check if the given IterDomain is potentially a root domain.

Parameters
----------
id : IterDomain
    The IterDomain to check.

Returns
-------
bool
    True if the domain is potentially a root domain, False otherwise.
)");
}

void bindInterfaceNodes(py::module& fusion) {
  py::class_<
      nvfuser::TensorView,
      nvfuser::Val,
      std::unique_ptr<nvfuser::TensorView, py::nodelete>>(fusion, "TensorView")
      .def(
          "__str__",
          [](TensorView* self) { return self->toString(/*indent_size=*/0); },
          "Convert the TensorView to a string representation.")
      .def(
          "num_dims",
          &nvfuser::TensorView::nDims,
          R"(
Get the number of dimensions in this tensor.

Returns
-------
int
    The number of dimensions.
)")
      .def(
          "domain",
          &nvfuser::TensorView::domain,
          R"(
Get the domain of this tensor.

Returns
-------
TensorDomain
    The tensor domain object that describes the dimensionality and properties
    of this tensor. The tensor domain contains information about:
    - Root domain (original dimensions)
    - Allocation domain (how memory is allocated)
    - Loop domain (how iterations are structured)
    - Logical domain (current transformed state)

Notes
-----
The TensorDomain is a fundamental part of the tensor that manages all aspects
of its dimensional properties and transformations.
)")
      .def(
          "get_logical_domain",
          &nvfuser::TensorView::getLogicalDomain,
          R"(
Get the logical domain of this tensor.

Returns
-------
list of IterDomain
    The logical iteration domains.
)")
      .def(
          "get_maybe_root_domain",
          &nvfuser::TensorView::getMaybeRootDomain,
          R"(
Get the root domain of this tensor if it exists.

Returns
-------
list of IterDomain
    The root iteration domains.
)")
      .def(
          "get_maybe_allocation_domain",
          &nvfuser::TensorView::getMaybeAllocationDomain,
          R"(
Get the allocation domain of this tensor if it exists.

Returns
-------
list of IterDomain
    The allocation iteration domains.
)")
      .def(
          "get_loop_domain",
          &nvfuser::TensorView::getLoopDomain,
          R"(
Get the loop domain of this tensor.

Returns
-------
list of IterDomain
    The loop iteration domains.
)")
      .def(
          "axis",
          &nvfuser::TensorView::axis,
          py::arg("index"),
          py::return_value_policy::reference,
          R"(
Get the iteration domain at the specified axis.

Parameters
----------
index : int
    The axis index.

Returns
-------
IterDomain
    The iteration domain at the specified axis.
)")
      .def(
          "has_reduction",
          &nvfuser::TensorView::hasReduction,
          R"(
Check if this tensor has any reduction axes.

Returns
-------
bool
    True if the tensor has reduction axes, False otherwise.
)")
      .def(
          "has_broadcast",
          &nvfuser::TensorView::hasBroadcast,
          R"(
Check if this tensor has any broadcast axes.

Returns
-------
bool
    True if the tensor has broadcast axes, False otherwise.
)")
      .def(
          "is_fusion_input",
          &nvfuser::TensorView::isFusionInput,
          R"(
Check if this tensor is a fusion input.

Returns
-------
bool
    True if the tensor is a fusion input, False otherwise.
)")
      .def(
          "definition",
          &nvfuser::TensorView::definition,
          py::return_value_policy::reference,
          R"(
Get the expression that defines this tensor.

Returns
-------
Expr
    The defining expression, or None if this is an input.
)")
      .def(
          "cache_before",
          &nvfuser::TensorView::cacheBefore,
          py::arg("op_type") = LoadStoreOpType::Set,
          py::return_value_policy::reference,
          R"(
Create a cache of this tensor before its computation.

Parameters
----------
op_type : LoadStoreOpType, optional
    The type of load/store operation. Default is Set.

Returns
-------
TensorView
    The newly created cache tensor.
)")
      .def(
          "cache_after",
          &nvfuser::TensorView::cacheAfter,
          py::arg("op_type") = LoadStoreOpType::Set,
          py::arg("cache_op") = CacheOp::Unspecified,
          py::arg("propagate_allocation_domain") = true,
          py::arg("cached_uses") = std::vector<Expr*>{},
          py::return_value_policy::reference,
          R"(
Create a cache of this tensor after its computation.

Parameters
----------
op_type : LoadStoreOpType, optional
    The type of load/store operation. Default is Set.
cache_op : CacheOp, optional
    The type of cache operation. Default is Unspecified.
propagate_allocation_domain : bool, optional
    Whether to propagate the allocation domain. Default is True.

Returns
-------
TensorView
    The newly created cache tensor.
)")
      .def(
          "set_memory_type",
          &nvfuser::TensorView::setMemoryType,
          py::arg("memory_type"),
          R"(
Set the memory type of this tensor.

Parameters
----------
memory_type : MemoryType
    The memory type to set (e.g., Global, Shared, Local).
)")
      .def(
          "split",
          static_cast<TensorView* (
              nvfuser::TensorView::*)(int64_t, int64_t, bool)>(
              &nvfuser::TensorView::split),
          py::arg("axis"),
          py::arg("factor"),
          py::arg("inner_split") = true,
          py::return_value_policy::reference,
          R"(
Split an axis into two axes.

Parameters
----------
axis : int
    The axis to split.
factor : int
    The factor to split by.
inner_split : bool, optional
    If True, the factor determines the size of the inner domain.
    If False, the factor determines the size of the outer domain.
    Default is True.

Returns
-------
TensorView
    A TensorView with the split axes in its loop domain.
)")
      .def(
          "merge",
          static_cast<TensorView* (nvfuser::TensorView::*)(int64_t)>(
              &nvfuser::TensorView::merge),
          py::arg("axis"),
          py::return_value_policy::reference,
          R"(
Merge an axis with the following axis into one.

Parameters
----------
axis : int
    The axis to merge.

Returns
-------
TensorView
    A TensorView with the merged axis in its loop domain.
)")
      .def(
          "reorder",
          static_cast<TensorView* (
              nvfuser::
                  TensorView::*)(const std::unordered_map<int64_t, int64_t>&)>(
              &nvfuser::TensorView::reorder),
          py::arg("old2new"),
          R"(
Reorder the axes according to the given mapping.

Parameters
----------
old2new : dict of int to int
    Mapping from old positions to new positions.

Returns
-------
TensorView
    A TensorView with its loop domain reordered.
)")
      .def(
          "rfactor",
          static_cast<TensorView* (
              nvfuser::TensorView::*)(const std::vector<int64_t>&)>(
              &nvfuser::TensorView::rFactor),
          py::arg("axes"),
          py::return_value_policy::reference,
          R"(
Perform an rfactor transformation on the specified axes.

Parameters
----------
axes : list of int
    The axes to apply rfactor to.

Returns
-------
TensorView
    The newly created rfactor tensor.
)");

  py::class_<nvfuser::TensorViewBuilder>(fusion, "TensorViewBuilder")
      .def(py::init<>(), R"(
Create a new TensorViewBuilder.

A builder class for creating TensorViews with specified properties like dimensions,
data type, contiguity, shape, and stride order.

Examples
--------
>>> builder = TensorViewBuilder()
>>> tv = (builder
...       .num_dims(2)
...       .dtype(DataType.Float)
...       .shape([3, 4])
...       .contiguity(True)
...       .build())
)")
      .def(
          "num_dims",
          &nvfuser::TensorViewBuilder::ndims,
          py::arg("num_dimensions"),
          R"(
Set the number of dimensions for the TensorView.

Parameters
----------
num_dimensions : int
    Number of dimensions for the tensor.

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "dtype",
          &nvfuser::TensorViewBuilder::dtype,
          py::arg("dtype"),
          R"(
Set the data type for the TensorView.

Parameters
----------
dtype : DataType
    The data type for the tensor (e.g., DataType.Float, DataType.Half).

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "contiguity",
          static_cast<nvfuser::TensorViewBuilder& (
              nvfuser::TensorViewBuilder::*)(std::vector<std::optional<bool>>)>(
              &nvfuser::TensorViewBuilder::contiguity),
          py::arg("contiguity"),
          R"(
Set the contiguity for each dimension of the TensorView.

Parameters
----------
contiguity : list of Optional[bool]
    List of contiguity flags for each dimension. Use None for unspecified contiguity.

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "contiguity",
          static_cast<nvfuser::TensorViewBuilder& (
              nvfuser::TensorViewBuilder::*)(bool)>(
              &nvfuser::TensorViewBuilder::contiguity),
          py::arg("contiguous"),
          R"(
Set uniform contiguity for all dimensions of the TensorView.

Parameters
----------
contiguous : bool
    If True, make all dimensions contiguous. If False, make all dimensions non-contiguous.

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "shape",
          static_cast<nvfuser::TensorViewBuilder& (
              nvfuser::TensorViewBuilder::*)(std::vector<nvfuser::Val*>)>(
              &nvfuser::TensorViewBuilder::shape),
          py::arg("shape"),
          R"(
Set the shape of the TensorView using Val pointers.

Parameters
----------
shape : list of Val
    List of Val pointers defining the size of each dimension.

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "shape",
          static_cast<nvfuser::TensorViewBuilder& (
              nvfuser::TensorViewBuilder::*)(const std::vector<int64_t>&)>(
              &nvfuser::TensorViewBuilder::shape),
          py::arg("shape"),
          R"(
Set the shape of the TensorView using integer values.

Parameters
----------
shape : list of int
    List of integers defining the size of each dimension.

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "expanded",
          &nvfuser::TensorViewBuilder::expanded,
          py::arg("expanded"),
          R"(
Set whether dimensions are expanded.

Parameters
----------
expanded : list of bool
    List of flags indicating whether each dimension is expanded.

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "stride_order",
          &nvfuser::TensorViewBuilder::strideOrder,
          py::arg("stride_order"),
          R"(
Set the stride order of the dimensions.

Parameters
----------
stride_order : list of int
    List of indices defining the stride ordering of dimensions.
    The ordering is from fastest varying (innermost) to slowest varying (outermost).

Returns
-------
TensorViewBuilder
    The builder instance for method chaining.
)")
      .def(
          "build",
          &nvfuser::TensorViewBuilder::build,
          py::return_value_policy::reference,
          R"(
Build and return the configured TensorView.

Returns
-------
TensorView
    A new TensorView instance with the configured properties.

Notes
-----
- All required properties (dimensions, dtype, shape) must be set before building.
- The build method validates the configuNVFUSER::DIMENSION SEPARATOR POSITION.
)");
}

} // namespace

void bindDirectIr(py::module& fusion) {
  bindBaseNodes(fusion);
  bindInternalBaseNodes(fusion);
  bindInterfaceNodes(fusion);
}

Documentation

The from_pyiterable function is not documented. It is used to convert a Python iterable to a KernelArgumentHolder, but there is no docstring explaining its purpose, parameters, or return value. Adding documentation would improve code readability and maintainability.

KernelArgumentHolder from_pyiterable(
    const py::iterable& iter,
    std::optional<int64_t> device) {
  KernelArgumentHolder args;
  for (py::handle obj : iter) {
    // Allows for a Vector of Sizes to be inputed as a list/tuple
    if (py::isinstance<py::list>(obj) || py::isinstance<py::tuple>(obj)) {
      for (py::handle item : obj) {
        args.push(torch::jit::toIValue(item, c10::AnyType::get()));
      }
    } else {
      args.push(torch::jit::toIValue(obj, c10::AnyType::get()));
    }
  }

  // Transform int64_t device to int8_t
  std::optional<int8_t> selected_device = std::nullopt;
  if (device.has_value()) {
    NVF_CHECK(device.value() < 256, "Maximum device index is 255");
    selected_device = (int8_t)device.value();
  }
  args.setDeviceIndex(selected_device);

Performance

The demo script creates tensors with small sizes (e.g., [2, 4, 8]). This may not be representative of real-world workloads and could lead to inaccurate performance evaluations. Consider using larger tensors to better assess the performance benefits of the fusion.

tv0 = fusion.TensorViewBuilder().num_dims(3).shape([2, 4, 8]).contiguity(True).build()
tv1 = fusion.TensorViewBuilder().num_dims(3).shape([2, 4, 8]).contiguity(True).build()

rdspring1 · 2025-03-14T04:34:49Z

!build

Add TensorViewBuilder

* Use multiple inheritance to disable destructor with std::unique_ptr<nvfuser::Statement, py::nodelete> * Why? Memory management for all Statements is handled by IrContainer

Add schedule

* Return string from fusion and FEC print functions

* Add test_pointwise_manual * Add test_pointwise_auto * Add test_register_sharing_circular_buffering_pointwise

rdspring1 added the Python API Issues related to the Python API label Mar 14, 2025

rdspring1 force-pushed the gemni branch 2 times, most recently from 6c27fdf to 90d9ffe Compare March 14, 2025 04:29

rdspring1 force-pushed the gemni branch 7 times, most recently from 64c17e1 to 4671e35 Compare March 20, 2025 16:16

rdspring1 changed the title ~~[RFC] Create a basic binding for CPP Fusion in python frontend using Gemini~~ [RFC] Create a basic binding for CPP Fusion in python frontend using AI Coding Tools Mar 21, 2025

rdspring1 force-pushed the gemni branch from ed165aa to 870e6dc Compare March 25, 2025 01:43

rdspring1 added 16 commits March 25, 2025 10:59

Create IrContainer

df22980

Add Statement, Expr, Val

c490406

create IterDomain

009ef82

create TensorDomain

62f5803

add parallelize

2c22b3e

Create TensorView

7a32aa1

Add TensorViewBuilder

create Fusion and FusionGuard

8802fe6

Create Add operation

92f9572

add inheritance

92dcb99

create demo

a266578

expand demo

fe22caf

lint and license

1fc2551

Fix segmentation fault

63d23a4

* Use multiple inheritance to disable destructor with std::unique_ptr<nvfuser::Statement, py::nodelete> * Why? Memory management for all Statements is handled by IrContainer

create FusionExecutorCache

52611f6

Add execute to FusionExecutorCache

8dac2aa

basic scheduling

7261942

rdspring1 added 9 commits March 25, 2025 10:59

create sphinx docs

a732619

optimize sphinx

cae0ca9

Add define_scalar and reduction support

d242192

Move DirectFusionDefinition to nvfuser install

4217568

Create tests for DirectFusionDefinition

8a25e36

Add circular buffering support

2e519e2

Add AbstractTensor

a631d6e

Add LdStMatrix support

e0f9f9d

create advance_scheduling.py

ff8a7ff

rdspring1 force-pushed the gemni branch from cad0181 to ff8a7ff Compare March 25, 2025 21:10

rdspring1 added 2 commits March 25, 2025 14:11

update abstract tensor

783b4b4

cherry-pick: create direct_bindings api

dc67fdb

rdspring1 force-pushed the gemni branch from 6acddc9 to ea0d3da Compare March 27, 2025 00:50

cherry-pick: add fusion ir

c4858f6

rdspring1 force-pushed the gemni branch from ea0d3da to c4858f6 Compare March 27, 2025 01:57

rdspring1 added 4 commits March 26, 2025 19:10

cherry-pick: add runtime

947469b

Add DirectFusionDefinition and Ops

afd369d

Add python translate

3b1a9b4

Fix cmake build

792d24e

Add schedule

rdspring1 force-pushed the gemni branch from ae095de to 792d24e Compare March 28, 2025 00:46

rdspring1 added 4 commits March 27, 2025 18:23

remove python_frontend/direct_bindings

2df9ce0

add note

9570c39

Create test_direct_bindings.py

799dfa2

* Return string from fusion and FEC print functions

Create test_schedule.py

ecf6fc7

* Add test_pointwise_manual * Add test_pointwise_auto * Add test_register_sharing_circular_buffering_pointwise

rdspring1 force-pushed the gemni branch from 0de0314 to ecf6fc7 Compare March 28, 2025 17:59

create PythonPrinter

b833265

rdspring1 force-pushed the gemni branch from ec4c953 to cf2e834 Compare March 29, 2025 03:21

metaprogramming

8a5de3c

rdspring1 force-pushed the gemni branch from cf2e834 to 8a5de3c Compare March 29, 2025 03:53

rdspring1 mentioned this pull request Mar 31, 2025

Create direct_bindings_api extension #4156

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC] Create a basic binding for CPP Fusion in python frontend using AI Coding Tools #4077

[RFC] Create a basic binding for CPP Fusion in python frontend using AI Coding Tools #4077

rdspring1 commented Mar 14, 2025 •

edited

Loading

github-actions bot commented Mar 14, 2025 •

edited

Loading

rdspring1 commented Mar 14, 2025

[RFC] Create a basic binding for CPP Fusion in python frontend using AI Coding Tools #4077

Are you sure you want to change the base?

[RFC] Create a basic binding for CPP Fusion in python frontend using AI Coding Tools #4077

Conversation

rdspring1 commented Mar 14, 2025 • edited Loading

How I am using Google Gemini?

Other Prompts

Python code demo

Output

github-actions bot commented Mar 14, 2025 • edited Loading

Description

Changes walkthrough 📝

PR Reviewer Guide 🔍

rdspring1 commented Mar 14, 2025

rdspring1 commented Mar 14, 2025 •

edited

Loading

github-actions bot commented Mar 14, 2025 •

edited

Loading