`docs/data-engineering/create-custom-spark-pools.md` (18 additions & 11 deletions)
ms.author: eur
author: eric-urban
ms.topic: how-to
ms.custom:
ms.date: 09/22/2025
---

# How to create custom Spark pools in Microsoft Fabric

This article shows you how to create custom Apache Spark pools in Microsoft Fabric for your analytics workloads. Apache Spark pools let you create tailored compute environments based on your requirements, so you get optimal performance and resource use.

Specify the minimum and maximum nodes for autoscaling. The system acquires and retires nodes as your job's compute needs change, so scaling is efficient and performance improves. Spark pools also adjust the number of executors automatically, so you don't need to set them manually. The system changes executor counts based on data volume and job compute needs, so you can focus on your workloads instead of performance tuning and resource management.
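
If you prefer to script pool creation instead of using the portal, the following Python sketch shows how the same autoscale and dynamic executor allocation settings could be expressed against the Fabric workspace custom pools REST endpoint. Treat the endpoint path, payload shape, pool name, and token handling as assumptions to verify against the current Fabric REST API reference.

```python
# Sketch: create a workspace custom Spark pool with autoscaling and dynamic
# executor allocation through the Fabric REST API. Verify the endpoint and
# body shape against the current REST reference before relying on this.
import requests

workspace_id = "<your-workspace-id>"        # placeholder
access_token = "<bearer-token-for-fabric>"  # placeholder, e.g., from azure.identity

url = f"https://api.fabric.microsoft.com/v1/workspaces/{workspace_id}/spark/pools"

pool_definition = {
    "name": "AnalyticsPool",          # hypothetical pool name
    "nodeFamily": "MemoryOptimized",
    "nodeSize": "Medium",             # 8 CU, 64 GB per node (see the node size table later in this article)
    "autoScale": {
        "enabled": True,
        "minNodeCount": 1,            # minimum nodes kept for the pool
        "maxNodeCount": 10,           # upper bound the system can scale out to
    },
    "dynamicExecutorAllocation": {
        "enabled": True,              # let the system adjust executor counts
        "minExecutors": 1,
        "maxExecutors": 9,
    },
}

response = requests.post(
    url,
    headers={"Authorization": f"Bearer {access_token}"},
    json=pool_definition,
    timeout=30,
)
response.raise_for_status()
print(response.status_code)
```

Here `minNodeCount` and `maxNodeCount` correspond to the autoscale range described above, and `dynamicExecutorAllocation` corresponds to the automatic executor adjustment.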

> [!TIP]
> When you configure Spark pools, node size is determined by **Capacity Units (CU)**, which represent the compute capacity assigned to each node. For more information about node sizes and CU, see the [Node size options](#node-size-options) section in this guide.
## Prerequisites

To create a custom Spark pool, make sure you have admin access to the workspace. The capacity admin must enable the **Customized workspace pools** option in the **Spark Compute** section of **Capacity Admin settings**. For more information, see [Spark Compute Settings for Fabric Capacities](capacity-settings-management.md).
## Create custom Spark pools

These custom pools have a default autopause duration of 2 minutes.
## Node size options

When you set up a custom Spark pool, you choose from the following node sizes:

| Node size | Capacity units (CU) | Memory (GB) | Description |
|-----------|---------------------|-------------|-------------|
| Small | 4 | 32 | For lightweight development and testing jobs. |
| Medium | 8 | 64 | For general workloads and typical operations. |
| Large | 16 | 128 | For memory-intensive tasks or large data processing jobs. |
| X-Large | 32 | 256 | For the most demanding Spark workloads that need significant resources. |

> [!NOTE]
> A capacity unit (CU) in Microsoft Fabric Spark pools represents the compute capacity assigned to each node, not the actual consumption. Capacity units differ from VCore (Virtual Core), which is used in SQL-based Azure resources. CU is the standard term for Spark pools in Fabric, while VCore is more common for SQL pools. When sizing nodes, use CU to determine the assigned capacity for your Spark workloads.
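
As a quick worked example (not from the article itself), the capacity assigned to a pool is the per-node CU multiplied by the number of nodes the pool can scale to, so a Medium pool with an autoscale maximum of 10 nodes is assigned up to 8 × 10 = 80 CU:

```python
# Illustration only: estimate the maximum CU a custom pool can be assigned,
# using the per-node CU values from the node size table above.
NODE_CU = {"Small": 4, "Medium": 8, "Large": 16, "X-Large": 32}

def max_assigned_cu(node_size: str, max_nodes: int) -> int:
    """Upper bound of assigned capacity: per-node CU times the autoscale maximum."""
    return NODE_CU[node_size] * max_nodes

# A Medium pool that can scale out to 10 nodes is assigned up to 80 CU.
print(max_assigned_cu("Medium", 10))  # 80
```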

Workspace roles define what users can do with Microsoft Fabric items. Roles can be assigned to individuals or security groups from the workspace view. For more information about how to manage workspace roles, see [Give users access to workspaces](../fundamentals/give-access-workspaces.md).

## Lakehouse workspace roles and item-specific functions

A user can be assigned to the following roles:

* Admin
* Member
* Contributor
* Viewer

In a lakehouse, users with the *Admin*, *Member*, and *Contributor* roles can perform all CRUD (create, read, update, and delete) operations on all data. A user with the *Viewer* role can only read data stored in tables using the [SQL analytics endpoint](lakehouse-sql-analytics-endpoint.md).
> [!IMPORTANT]
> When accessing data using the SQL analytics endpoint with the *Viewer* role, make sure the SQL access policy is granted to read the required tables.
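
The article doesn't show the grant itself. As a hypothetical sketch, read access on individual tables can be granted with a standard T-SQL `GRANT SELECT` statement run against the SQL analytics endpoint, here from Python with pyodbc; the server name, database, table, and user principal are placeholders.

```python
# Sketch: grant a Viewer read access to a specific table through the
# SQL analytics endpoint. Server, database, table, and user are placeholders.
import pyodbc

conn_str = (
    "Driver={ODBC Driver 18 for SQL Server};"
    "Server=<your-sql-analytics-endpoint-connection-string>;"
    "Database=<your-lakehouse>;"
    "Authentication=ActiveDirectoryInteractive;"
)

with pyodbc.connect(conn_str) as conn:
    cursor = conn.cursor()
    # Standard T-SQL object-level grant; run by a user allowed to grant permissions.
    cursor.execute("GRANT SELECT ON OBJECT::dbo.SalesOrders TO [viewer_user@contoso.com];")
    conn.commit()
```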

The following matrix shows which actions each workspace role can perform on lakehouse items:

| Role | Create | Read | Update | Delete |
|-------------|:------:|:----:|:------:|:------:|
| Admin | ✔ | ✔ | ✔ | ✔ |
| Member | ✔ | ✔ | ✔ | ✔ |
| Contributor | ✔ | ✔ | ✔ | ✔ |
| Viewer | | ✔<sup>1</sup> | | |

<sup>1</sup> A Viewer can only read data stored in tables using the SQL analytics endpoint, provided the SQL access policy is granted.