Allocate cgra to task by guosran · Pull Request #307 · coredac/dataflow

guosran · 2026-03-31T02:02:31Z

AllocateCgraToTask Pass

Summary

Adds a new AllocateCgraToTask compiler pass that maps tasks onto a 2D CGRA grid, replacing the previous MapTaskOnCgra pass. The key addition is multi-CGRA support: a task can now be assigned multiple contiguous grid positions based on a cgra_count attribute already present in the IR (set manually or by an upstream optimization pass).

Changes

New pass

The old 600-line monolithic MapTaskOnCgraPass.cpp is replaced by:

A thin pass wrapper (~50 lines) in lib/TaskflowDialect/Transforms/.
A mapper implementation and shared utilities in a new lib/TaskflowDialect/Util/ library, making the placement logic reusable by other passes.

Multi-CGRA task placement

Previously each task was pinned to a single CGRA cell. Now the placer reads cgra_count per task and finds a connected cluster of that many cells on the grid. Placement shapes are enumerated (rectangles first, then non-rectangular DFS fallback) and ranked by a proximity score.

Placement algorithm

Critical-path-first ordering: tasks with longer downstream dependency chains are placed first, giving their successors the best chance of landing on adjacent cells.
Scoring: a candidate position is scored by Manhattan distance to already-placed SSA producers/consumers and to assigned SRAM locations.
Fixed-point SRAM assignment: after placing all tasks, each MemRef is assigned to the SRAM at the centroid of its accessing tasks. Task placement is then re-run with the updated SRAM positions; this repeats until assignments converge.

Input / output contract

Input: task IR with cgra_count attribute on each TaskflowTaskOp.
Output: task_mapping_info attribute on each task with cgra_positions,
read_sram_locations, and write_sram_locations.

What Is Not In This PR

The ResourceAwareTaskOptimizationPass (which decides how many CGRAs each task should use) is not included. That integration lives on a separate branch.

…ng multi-CGRA task placement

…to latest main

include/TaskflowDialect/Util/CgraPlacementUtils.h

ShangkunLi · 2026-03-31T05:07:55Z

include/TaskflowDialect/TaskflowPasses.td

    Fusion candidates (same-header SSA dependencies) are placed on adjacent
    CGRAs to enable direct data forwarding.


What do fusion candidates mean?

ShangkunLi · 2026-03-31T05:15:10Z

lib/TaskflowDialect/Transforms/AllocateCgraToTaskPass.cpp

+  }
+
+  void runOnOperation() override {
+    runAllocateCgraToTask(getOperation(), kCgraGridRows, kCgraGridCols);


I think we can maintain an Allocation class in the include or lib/TaskflowDialect/Allocation folder. And make this function a virtual function of this pass, which can be overridden by different task allocation algorithms. Please refer to https://github.com/coredac/dataflow/blob/main/include/NeuraDialect/Mapping/Mapping.h and https://github.com/coredac/dataflow/tree/main/include/NeuraDialect/Mapping/HeuristicMapping

I think you can put the definition of this function in the allocation_utils.cpp in this pr. And make another pr for the code refactoring described above.

ShangkunLi · 2026-03-31T05:20:21Z