PoC: modularize executor and simplify Wasm I/O with permissionless and permissioned scenarios #1

s8sato · 2025-06-02T03:54:20Z

Context

Transcript from Jun 05 standup:

I'm suggesting not only decoupling of the executor's roles, but also switching to Wasm component model.
The Wasm component model makes it much easier to develop and maintain Iroha's Wasm implementation: there are no unsafe blocks, and the common interface between host and guest—WIT (Wasm Interface Type)—is straightforward to read and write.
By the way, this interface works across languages, which is a feature absent in the previous model.

Also, I'm introducing tree-like structure from my PR (which hasn't been merged yet). This will bring significant API-breaking changes, but I have legitimate reasons to do so.
Some of these structures implements BitOr and Add, which means aggregation of events, permissions, and even executables, which enables performance improvements.
The structures also preserve keys when converting into each other, which will reduce future misimplementation or development effort.

feat: demonstrate permissionless scenario

This commit corresponds to the point at which authorization functionality—introducing new concepts for events and permissions—remains only a stub. Please begin your review with this relatively simple phase, because jumping straight to the latest commit might be overwhelming. This commit already captures the vision for Wasm I/O (#5358) very well.

Review notes

The concept of fuzzy keys may be confusing. They act as containers for capturing their counterparts (see the fuzzy_key_captures unit test). For example, a read-access intent (ReadSet) is fuzzy, whereas the resulting view (ViewSet) is specific and concrete, derived by inspecting the current state.

Fuzzy keys will be used to implement permissions in the next commit.

Although it’s outside this PoC’s immediate scope, PR #5355 demonstrates how fuzzy keys can enable more fine-grained event filters (designed as receptors).

feat: demonstrate permissioned scenario

I recommend reviewing the previous commit first.

This commit is the latest—and final—step of this PoC, where the instruction lifecycle interacts with the authorizer component, assuming that registrable permissions (#5359) and registrable executables (#5147) have been approved and implemented.

This commit also shows how the tree structures proposed in #5355—where entities share keys—can aid instruction state transitions and batch processing.

Review notes

The concept of flex keys may also be confusing. Flex keys serve to represent the This (or "self") placeholder (see the flex_key_resolves unit test). Imagine a common permission that allows users to transfer their own assets. Without flex keys, you would need separate permissions for withdrawing from Alice, Bob, Carol, and so on. In reality, you only need a single permission that allows withdrawal from this authority.

README

Executor Modularization & WasmInstruction I/O Simplification PoC

This proof-of-concept explores splitting Hyperledger Iroha’s Executor into modular pieces and slimming down WasmInstruction I/O. The inspiration comes from:

Modularize Executor: split Fee Calculator and Authorizer hyperledger-iroha/iroha#5357
Simplify WasmInstruction I/O: enforce batched read/write hyperledger-iroha/iroha#5358 (see state-transition diagram)

These occupy a key position within tracking issue #5356:

Objective

Can we cleanly separate instruction execution into three roles?

WasmInstruction: collects read/write intents without directly mutating state
Authorizer: evaluates intents against permission rules (approve or reject)
Host runtime: initiates the instruction flow, batches intents for the authorizer, and applies them to state

Success means smaller, testable components, fewer FFI round-trips, and clearer extension points.

Repository Structure

.
├── guest/
│   ├── authorizer/          — Wasm component that enforces permissions
│   └── instruction/         — Wasm component that submits read/write intents
├── host/                    — Rust runtime and tests
├── wit/                     — Shared WIT interfaces
└── README.md

Building & Testing

Prerequisites

rustup target add wasm32-wasip2

cargo add wit-bindgen

Guest components

cargo build --target wasm32-wasip2 --manifest-path guest/instruction/Cargo.toml

cargo build --target wasm32-wasip2 --manifest-path guest/authorizer/Cargo.toml

Host tests

cargo test --package host --lib

cargo test --package host --lib -- tests::instruction_flows --exact --show-output

Compare the test steps to the #5358 state-transition diagram for clarity.

Developer Notes

Host vs. guest, imports vs. exports

wasmtime::component::bindgen! is used on the host side to implement import functions.
wit_bindgen::generate! is used on the guest side to implement export functions.

Component model trade-offs

Removes all unsafe blocks around FFI calls, making future development and maintenance easier:

By expressing higher-level semantics than integers and floats, it becomes possible to statically analyse and reason about a component's behaviour - to enforce and guarantee properties just by looking at the surface of the component.
Wasm components typically produce larger binaries than classic modules—keep that in mind.

Future developer experience

Consider guest/instruction/src/lib.rs as a reference implementation of smart contracts and trigger executables. It’s intentionally verbose now; later we can introduce syntax sugars.

This PoC is experimental and exists solely to test the feasibility of the referenced Iroha issues.

Signed-off-by: Shunkichi Sato <[email protected]>

s8sato · 2025-06-12T00:40:02Z

host/src/lib.rs

+    });
+
+    #[test]
+    fn instruction_flows() {


To reviewers,

First of all, please run cargo test -- --show-output at commit 85749d7 as described in the README, and review the output.

0x009922 · 2025-06-13T05:58:18Z

Can't comment yet about the fuzzy/flex keys and the new model of permissions in general, but the use of WASI and WIT is very intriguing. Particularly, WIT is a great candidate for defining the data model agnostically to Rust or Iroha (hyperledger-iroha/iroha#2021).

0x009922 · 2025-06-24T02:05:51Z

WebAssembly Component Model

This looks great.

Pros:

Reduces cost of WASM IO, better performance
Removes our FFI/model macros, which produce a lot of development pain
WIT-based data model & schema enables support for many languages without need for codegen, thus better SDKs

Cons:

May require drastic refactoring nonetheless
May be complicated integrating structures in the query projections/predicates
May have limitations on types such as inability to define newtypes like Json or define associated methods?

Batching IO

It is not always desirable to read everything in the beginning. For example: I want to read info X, and, depending on its value, I would like to read either big chunk Y or a different big chunk Z. Reading both Y and Z unconditionally is redundant.
Smartcontracts are already executed with read-only state and produce writes in batches.
It might be less intuitive how to write smartcontracts with this new model. As an implementer of a smartcontract, I have to reorganise my thinking into "read all I need at the beginning and write all I need in the end".

Permissions

It is hard for me to see the trade-offs of the new and old approaches. I haven't been exploring the permission system closely.

As a note, read keys (fuzzy and flex) could be combined into a notion of dynamic (or read) key with variants:

Fixed(Id) - resolves strictly to the given id (synonyms: static, strict, explicit, simple)
Fuzzy(IdPredicate) - matches keys based on some rule, possibly with a CompoundPredicate in it (synonyms: match, filter). Question: shall it produce only a single result, or I could expect many?
This - substituted with the authority itself
Others special keys could be added if needed

Tree-like structure

This is a drastic breaking change, as you say. It would require rewriting the majority of integration tests, the SDKs, huge chunks of the docs, the Explorer, and probably something else. I would call it a shift to Iroha 3.

Whether this change is worth the effort - I don't know. The PR and the PoC don't show the entire impact of it. And I don't really understand how to work with it, TBH (need to dive deeper).

Summary

I would be hesitant to proceed to such drastic changes. It requires huge effort across many areas, and I am not sure we are capable of doing so. Additionally, I am not sure the effort is worth it. What is the finite problem we are trying to solve? Is there a possibly more pragmatic way to achieve it with less cost?

s8sato · 2025-06-24T07:58:31Z

Thank you for the detailed review with proactive suggestions. I’ll comment on the most concerning breaking API changes and propose alternative approaches that might be more acceptable.

Breaking API changes for events and permissions, no changes to queries and instructions

In fact, this plan does not introduce any new API change proposals. Although I suggested that migrating to Wasm components would alter the current API, the changes to events and permissions were already explicitly stated when the RC2 milestone was established to enable fine-grained access control and batch processing.

As for queries and instructions, I’m not necessarily arguing that their roles in the UI should be replaced by ReadSet and WriteSet. Adopting the bindings generated from WIT almost as-is in the UI would result in breaking API changes, but we can avoid that by providing conversions between those bindings and the existing structures. This involves modifying the conversions defined in host/src/bindings.rs, and most of the work would simply reapply what was already done in #5355. However, from a performance perspective, this conversion might be disadvantageous.

Reconfirming the motivation

Known issues to be resolved:

Bug fix (#5171)

Currently, iroha_core::smartcontracts::wasm::state defines six variants of validate_query(), some of which do not reference the correct state at the right point in time. This plan centralizes read access and limits it to a single point just before instruction execution.

Performance improvement (#4756)

The union of permissions will be passed to the authorizer only once (AllowSet).
Similarly, the union of query results will be passed to the Wasm instruction (ViewSet).

Missing feature blocking production adoption: permissions for querying transactions (#5338)

By making the write range of transactions comparable with the reader’s readable range, this is addressed.

Scope and commitments of this plan

Limit breaking API changes to events and permissions only
Include benchmarks in the PR to demonstrate performance effectiveness

With these conditions in place, may I proceed with the work?

0x009922 · 2025-06-25T01:50:29Z

Thank you for clarification.

I am still skeptical about the point of "reading everything before execution". As I said:

It is not always desirable to read everything in the beginning. For example: I want to read info X, and, depending on its value, I would like to read either big chunk Y or a different big chunk Z. Reading both Y and Z unconditionally is redundant.

I do, however, find it useful to be able to batch queries together (related: hyperledger-iroha/iroha#5044).

Would it be possible to not enforce smartcontracts to define unconditional inputs, but provide a way to read in batches dynamically instead? What are the trade-offs here?

s8sato · 2025-06-25T02:25:55Z

guest/instruction/src/lib.rs

+        unimplemented!("boilerplate");
+    }
+
+    fn write_request(view: ViewSet, args: String) -> WriteSet {


This is a thread for discussing the separation of the read side and the write side: #1 (comment)

We may be able to support adding as many

fn more_read_request(view: ViewSet, args: String) -> ReadSet

as the user needs before moving on to write_request.
Even then, there’s value in batching wherever possible, and in most simple cases a single read will be sufficient.

We may be able to support adding as many

This sounds like adding an arbitrary amount of static methods that users may or may not use. I think this isn't a flexible design choice and a dynamic approach is preferable. It is also still hard to think about smart contract design in the sense of "multi-stage reading".

there’s value in batching wherever possible

Yes, but that's up to the implementer of a smart contract. I think we shall provide an instrument to make batched reads, but not enforce it, and not limit all reads to a fixed point of smartcontract execution. In other words, leave it the same as it is now, but add a mechanism to make batched queries.

I agree that it’s not common to design a smart contract that limits read access to a finite number before writing. The previous design—where reads and writes could be interleaved in any order and as many times as needed—is more typical, so I may need to revise my plan.

The original goal of this design was to minimize the performance overhead of permission checks on every access in Iroha (#4756). Currently, permission tokens are re-aggregated on each check, but in the PoC the union of permissions obtained on the first read access is cached as part of the instruction’s transition state and reused on the next write access.

This caching effect remains effective even as the number of accesses grows and may be sufficiently powerful. However, it would be even more ideal to cache the account permissions on the authorizer side rather than the host side, since that would suppress the problematic serialize/deserialize operations on the FFI boundary.

However, it would be even more ideal to cache the account permissions on the authorizer side rather than the host side

Do you have an idea on how to cache permissions on the authorizer side?

As I understand, that would require a "stateful authorizer (executor, validator) session" during the time of a single smartcontract execution.

Unfortunately, as long as we adhere to the component model, memory sharing isn’t allowed and the authorizer can’t cache, so de/serialization will be inevitable per request.

Design Revision Proposal

For write access, since allowing arbitrary invocations would add complexity without sufficient benefit, we plan to keep it limited to a single invocation. In more complex use cases, it will still be possible to include a subsequent Wasm instruction call within that single write access.

For read access, the outline of the planned changes is as follows:

Instead of having the guest define this read-request:

export read-request: func(args: string) -> read-set;

the host will provide a generic read function:

import read: func(request: read-set) -> result<view-set, unauthorized>;

The guest can use this read function any number of times to implement the write-request function:

export write-request: func(args: string) -> write-set;

Perhaps the host could also provide a generic function for writes without adding extra complexity:

import write: func(request: write-set) -> result<event-set, unauthorized>;

I’ll try out a few PoC implementations.

s8sato added 2 commits June 2, 2025 12:35

feat: demonstrate permissionless scenario

85749d7

Signed-off-by: Shunkichi Sato <[email protected]>

feat: demonstrate permissioned scenario

0980ed0

Signed-off-by: Shunkichi Sato <[email protected]>

s8sato marked this pull request as ready for review June 2, 2025 04:07

This was referenced May 26, 2025

Simplify WasmInstruction I/O: enforce batched read/write hyperledger-iroha/iroha#5358

Open

Modularize Executor: split Fee Calculator and Authorizer hyperledger-iroha/iroha#5357

Open

s8sato commented Jun 12, 2025

View reviewed changes

s8sato commented Jun 25, 2025

View reviewed changes

PoC: modularize executor and simplify Wasm I/O with permissionless and permissioned scenarios #1

Are you sure you want to change the base?

PoC: modularize executor and simplify Wasm I/O with permissionless and permissioned scenarios #1

Uh oh!

Conversation

s8sato commented Jun 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Context

feat: demonstrate permissionless scenario

Review notes

feat: demonstrate permissioned scenario

Review notes

Executor Modularization & WasmInstruction I/O Simplification PoC

Objective

Repository Structure

Building & Testing

Prerequisites

Guest components

Host tests

Developer Notes

Host vs. guest, imports vs. exports

Component model trade-offs

Future developer experience

Uh oh!

s8sato Jun 12, 2025

Choose a reason for hiding this comment

Uh oh!

0x009922 commented Jun 13, 2025

Uh oh!

0x009922 commented Jun 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

WebAssembly Component Model

Batching IO

Permissions

Tree-like structure

Summary

Uh oh!

s8sato commented Jun 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Breaking API changes for events and permissions, no changes to queries and instructions

Reconfirming the motivation

Bug fix (#5171)

Performance improvement (#4756)

Missing feature blocking production adoption: permissions for querying transactions (#5338)

Scope and commitments of this plan

Uh oh!

0x009922 commented Jun 25, 2025

Uh oh!

s8sato Jun 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

0x009922 Jun 26, 2025

Choose a reason for hiding this comment

Uh oh!

s8sato Jun 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

0x009922 Jun 27, 2025

Choose a reason for hiding this comment

Uh oh!

s8sato Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

s8sato Jul 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Design Revision Proposal

Uh oh!

s8sato Jul 9, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

s8sato commented Jun 2, 2025 •

edited

Loading

0x009922 commented Jun 24, 2025 •

edited

Loading

s8sato commented Jun 24, 2025 •

edited

Loading

s8sato Jun 25, 2025 •

edited

Loading

s8sato Jun 26, 2025 •

edited

Loading

s8sato Jul 9, 2025 •

edited

Loading

s8sato Jul 9, 2025 •

edited

Loading