Reduce tokens in generated system prompt


Motivation: Better inference latency, accuracy

The following code is some minimal code that generates a system prompt:

```python
schema_manager = manager.A2uiSchemaManager(
    version=constants.VERSION_0_9,
    catalogs=[
      # ...
    ],
)

system_prompt = schema_manager.generate_system_prompt(
    role_description="...",
    include_schema=True,
).strip()

print(system_prompt)
```

https://github.com/google/A2UI/blob/73213337efde490041d81039a668c4dd5bf5d4e9/agent_sdks/python/src/a2ui/core/schema/manager.py#L199

The generated system prompt has this structure:

```
<role description>

## Workflow description

The generated response MUST...

---BEGIN A2UI JSON SCHEMA---

### Server to Client Schema:

<server to client schema>

### Common Types Schema:

<common types schema>

### Catalog Schema:

<catalog schema>

---END A2UI JSON SCHEMA---
```

This task relates to both the `<server to client schema>` and `<common types schema>`.

1. `<common types schema>` can be automatically tree-shaken based on the `<catalog schema>` by traversing `$ref`s. Recursively look up `$ref`s, then drop unused types from `<common types schemna>`.
2. `<server to client schema>` needs to be tree-shaken with intent from the user. Perhaps `generate_system_prompt` should be parameterized by a list of allowed messages (e.g. `CreateSurfaceMessage`)?

@nan-yu @jacobsimionato I'm still new to all this but let me know your thoughts!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce tokens in generated system prompt #1004

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Reduce tokens in generated system prompt #1004

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions