
## Bug Report: Cannot use GLM for both main agent and auxiliary title generation simultaneously #1291

@mis3250

Summary

When using GLM as the main agent via the Anthropic-compatible endpoint (/api/anthropic), auxiliary tasks like title generation cannot use GLM's OpenAI-compatible endpoint (/api/paas/v4) even when base_url is explicitly set in config.yaml. This is because GLM_BASE_URL always overrides any base_url specified under auxiliary.title_generation in config.yaml.

Environment

  • Hermes Web UI (latest, cloned from main)
  • GLM main agent endpoint: https://open.bigmodel.cn/api/anthropic
  • OS: Ubuntu (WSL2)

Steps to Reproduce

  1. Set GLM_BASE_URL=https://open.bigmodel.cn/api/anthropic in ~/.hermes/.env (required for main agent to work via Anthropic-compatible endpoint)
  2. Configure auxiliary.title_generation in config.yaml with an explicit OpenAI-compatible base URL:

     ```yaml
     auxiliary:
       title_generation:
         provider: zai
         model: glm-4-flash
         base_url: https://open.bigmodel.cn/api/paas/v4
         api_key: ''
         timeout: 30
         extra_body: {}
     ```
  3. Start Hermes and send a chat message.

Expected Behavior

The base_url explicitly set under auxiliary.title_generation in config.yaml should take priority for auxiliary tasks, allowing the main agent and title generation to use different GLM endpoints.

Actual Behavior

Title generation fails with:

⚠ Auxiliary title generation failed: HTTP 401: 令牌已过期或验证不正确 ("token has expired or verification is incorrect")

This happens because _resolve_zai_base_url() in hermes_cli/auth.py gives GLM_BASE_URL absolute priority:

```python
def _resolve_zai_base_url(api_key, default_url, env_override):
    if env_override:  # GLM_BASE_URL always wins; config.yaml base_url is ignored
        return env_override
    ...
```

So the Anthropic-format endpoint is used for title generation too, which returns 401 since it doesn't accept OpenAI-format requests.
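The misrouting can be demonstrated with a minimal standalone sketch of the resolution order described above (the function here is a hypothetical reproduction that mirrors the snippet from hermes_cli/auth.py, not the actual implementation):

```python
import os

def resolve_zai_base_url(api_key, default_url, env_override):
    # Hypothetical reproduction of the current priority order:
    # the environment override always wins, so any base_url from
    # config.yaml (passed here as default_url) is never reached.
    if env_override:
        return env_override
    return default_url

# Simulate the environment set up for the main agent.
os.environ["GLM_BASE_URL"] = "https://open.bigmodel.cn/api/anthropic"

# base_url from auxiliary.title_generation in config.yaml, silently ignored.
config_base_url = "https://open.bigmodel.cn/api/paas/v4"

url = resolve_zai_base_url(
    api_key="",
    default_url=config_base_url,
    env_override=os.environ.get("GLM_BASE_URL"),
)
print(url)  # https://open.bigmodel.cn/api/anthropic
```

Title generation then sends an OpenAI-format request to the Anthropic-format endpoint, producing the 401 above.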

Root Cause

GLM_BASE_URL is a single global variable controlling the GLM endpoint for the entire application. There is no way to specify different endpoints for the main agent vs auxiliary tasks. The base_url field in auxiliary.title_generation config is silently ignored for the zai provider.

Suggested Fix

In _resolve_zai_base_url(), check for an explicit base_url passed from the auxiliary config before falling back to GLM_BASE_URL:

```python
def _resolve_zai_base_url(api_key, default_url, env_override, explicit_base_url=None):
    if explicit_base_url:  # config.yaml base_url takes priority for aux tasks
        return explicit_base_url
    if env_override:       # GLM_BASE_URL wins for the main agent
        return env_override
    ...
```

Alternatively, introduce a separate GLM_AUX_BASE_URL environment variable for auxiliary tasks.
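A self-contained sketch of the proposed priority order (hypothetical standalone version of the suggested fix; parameter names follow the snippet above and the URLs are the ones from this report):

```python
def resolve_zai_base_url(api_key, default_url, env_override, explicit_base_url=None):
    """Proposed order: explicit config value, then env override, then default."""
    if explicit_base_url:  # base_url from auxiliary.*.config.yaml wins
        return explicit_base_url
    if env_override:       # GLM_BASE_URL still applies to the main agent
        return env_override
    return default_url

ANTHROPIC = "https://open.bigmodel.cn/api/anthropic"
OPENAI_V4 = "https://open.bigmodel.cn/api/paas/v4"

# Main agent: no explicit base_url, so GLM_BASE_URL applies as before.
main = resolve_zai_base_url("", OPENAI_V4, ANTHROPIC)
# Title generation: explicit base_url from auxiliary.title_generation wins.
aux = resolve_zai_base_url("", OPENAI_V4, ANTHROPIC, explicit_base_url=OPENAI_V4)

print(main)  # https://open.bigmodel.cn/api/anthropic
print(aux)   # https://open.bigmodel.cn/api/paas/v4
```

This keeps existing behavior for users who rely only on GLM_BASE_URL while letting auxiliary tasks opt into a different endpoint.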

Impact

Any user running GLM via the Anthropic-compatible endpoint as their main model cannot use GLM for auxiliary tasks (title generation, vision, web extract, etc.) without breaking their main agent configuration.

Metadata

Labels: bug (Something isn't working), help wanted (Extra attention is needed), sprint-candidate (Strong candidate for next sprint)
