Commit 4fb6340

Merge branch 'main' into mason/vercel-marketplace-docs
2 parents cea8947 + 51eb46d commit 4fb6340

5 files changed

+134
-28
lines changed

integrations/computer-use/anthropic.mdx

Lines changed: 0 additions & 6 deletions
Original file line number | Diff line number | Diff line change
@@ -4,12 +4,6 @@ title: "Anthropic"
44

55
[Computer Use](https://docs.claude.com/en/docs/agents-and-tools/tool-use/computer-use-tool) is Anthropic's groundbreaking capability that enables Claude to interact with computers the way humans do—by looking at screens, moving cursors, clicking buttons, and typing text. This powerful feature allows AI agents to control web browsers, navigate interfaces, and perform complex tasks across applications.
66

7-
With Computer Use, Claude can:
8-
- **Navigate websites and applications** by interpreting visual interfaces
9-
- **Click buttons and fill forms** just like a human would
10-
- **Take screenshots** to understand and verify its actions
11-
- **Perform multi-step workflows** that span multiple applications or web pages
12-
137
By integrating Computer Use with Kernel, you can run these AI-powered browser automations on cloud-hosted infrastructure, eliminating the need for local browser management and enabling scalable, reliable AI agents.
148

159
## Quick setup with Computer Use

integrations/computer-use/gemini.mdx

Lines changed: 16 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -2,30 +2,34 @@
22
title: "Gemini"
33
---
44

5-
Google's [Gemini 2.5 Computer Use model](https://blog.google/technology/google-deepmind/gemini-computer-use-model/) is a specialized model built on Gemini 2.5 Pro's capabilities to power agents that can interact with user interfaces.
5+
[Gemini 2.5 Computer Use](https://blog.google/technology/google-deepmind/gemini-computer-use-model/) is Google's groundbreaking capability that enables AI models to interact with computers the way humans do—by looking at screens, moving cursors, clicking buttons, and typing text. This powerful feature allows AI agents to control web browsers, navigate interfaces, and perform complex tasks across applications.
66

77
By integrating Gemini 2.5 Computer Use with Kernel, you can run these AI-powered browser automations on cloud-hosted infrastructure, eliminating the need for local browser management and enabling scalable, reliable AI agents.
88

9-
## Quick setup with our example template
9+
## Quick setup with Computer Use
1010

11-
Get started quickly with our TypeScript template that demonstrates Gemini 2.5 Computer Use with Kernel.
11+
Get started with Gemini Computer Use and Kernel using our pre-configured app template:
1212

13-
Check out the [Open-source Gemini Template](https://github.com/onkernel/ts-stagehand-google-cua-agent) repository for a complete working example that shows how to:
14-
- Set up Gemini 2.5 Computer Use with Kernel
15-
- Use Stagehand for browser automation
16-
- Run AI-powered web interactions on cloud infrastructure
13+
```bash
14+
npx @onkernel/create-kernel-app my-computer-use-app
15+
```
1716

18-
## Benefits of using Kernel with Gemini Computer Use
17+
Choose `TypeScript` as the programming language and then select `gemini-cua` as the template.
18+
19+
Then follow the [Quickstart guide](/quickstart/) to deploy and run your Computer Use automation on Kernel's infrastructure.
20+
21+
## Benefits of using Kernel with Computer Use
1922

2023
- **No local browser management**: Run Computer Use automations without installing or maintaining browsers locally
21-
- **Scalability**: Launch multiple browser sessions in parallel for concurrent automations
22-
- **Stealth mode**: Built-in anti-detection features for web interactions
24+
- **Scalability**: Launch multiple browser sessions in parallel for concurrent AI agents
25+
- **Stealth mode**: Built-in anti-detection features for reliable web interactions
2326
- **Session persistence**: Maintain browser state across automation runs
24-
- **Live view**: Debug your automations with real-time browser viewing
27+
- **Live view**: Debug your Computer Use agents with real-time browser viewing
28+
- **Cloud infrastructure**: Run computationally intensive AI agents without local resource constraints
2529

2630
## Next steps
2731

28-
- Check out [live view](/browsers/live-view) for debugging your automations
32+
- Check out [live view](/browsers/live-view) for debugging your Computer Use automations
2933
- Learn about [stealth mode](/browsers/stealth) for avoiding detection
3034
- Learn how to properly [terminate browser sessions](/browsers/termination)
3135
- Learn how to [deploy](/apps/deploy) your Computer Use app to Kernel

integrations/computer-use/openai.mdx

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,9 @@
22
title: "OpenAI"
33
---
44

5-
[Computer Use](https://openai.com/index/computer-using-agent/) is OpenAI's feature that enables AI models to interact with computers like humans do - through screen observation, cursor movement, and keyboard input. By integrating with Kernel, you can run Computer Use automations with cloud-hosted browsers, allowing your AI agents to navigate websites, fill forms, and interact with web applications autonomously.
5+
[Computer Use](https://openai.com/index/computer-using-agent/) is OpenAI's feature that enables AI models to interact with computers the way humans do—by looking at screens, moving cursors, clicking buttons, and typing text. This powerful feature allows AI agents to control web browsers, navigate interfaces, and perform complex tasks across applications.
6+
7+
By integrating Computer Use with Kernel, you can run these AI-powered browser automations on cloud-hosted infrastructure, eliminating the need for local browser management and enabling scalable, reliable AI agents.
68

79
## Quick setup with our Computer Use example app
810

proxies/overview.mdx

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -84,7 +84,7 @@ When no longer needed, delete the proxy configuration:
8484
<DeleteProxySnippet />
8585

8686
<Info>
87-
Deleting a proxy does not affect existing browser sessions that are currently using it. The configuration is only removed from your organization so it can't be used in future browser sessions.
87+
Deleting a proxy immediately reconfigures associated browsers to route directly to the internet.
8888
</Info>
8989

9090
## Limitations

proxies/residential.mdx

Lines changed: 114 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ Residential proxies route traffic through real residential IP addresses. They su
66

77
## Configuration
88

9-
Residential proxies support multiple targeting parameters:
9+
Create a residential proxy with a target country:
1010

1111
<CodeGroup>
1212

@@ -16,10 +16,9 @@ const kernel = new Kernel();
1616

1717
const proxy = await kernel.proxies.create({
1818
type: 'residential',
19-
name: 'my-la-residential',
19+
name: 'my-us-residential',
2020
config: {
21-
country: 'US',
22-
city: 'los_angeles'
21+
country: 'US'
2322
}
2423
});
2524

@@ -34,10 +33,9 @@ client = kernel.Kernel()
3433

3534
proxy = client.proxies.create(
3635
type='residential',
37-
name='my-la-residential',
36+
name='my-us-residential',
3837
config={
39-
'country': 'US',
40-
'city': 'los_angeles'
38+
'country': 'US'
4139
}
4240
)
4341

@@ -54,4 +52,112 @@ browser = client.browsers.create(
5452
- **`state`** - Two-letter state code. Only supported for US.
5553
- **`city`** - City name (lowercase, no spaces, e.g., `sanfrancisco`, `newyork`).
5654
- **`zip`** - US ZIP code (5 digits). Conflicts with city and state.
57-
- **`asn`** - Autonomous System Number. Conflicts with city and state.
55+
- **`asn`** - Autonomous System Number. Conflicts with city and state.
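
The parameter rules listed above can be checked locally before a proxy is created. The following is a minimal sketch, not part of the Kernel SDK: `targeting_conflicts` is a hypothetical helper name, and it encodes only the rules stated in the list (state is US-only, city is lowercase with no spaces, ZIP is 5 digits, and `zip`/`asn` conflict with city and state). The API may enforce additional rules not shown here.

```python
import re


def targeting_conflicts(config):
    """Hypothetical client-side check; returns a list of problems (empty if none).

    Only the rules stated in the parameter list are encoded here.
    """
    problems = []
    # Narrowing keys that zip and asn are documented to conflict with.
    narrow = [key for key in ("city", "state") if key in config]

    if "state" in config and config.get("country") != "US":
        problems.append("state is only supported for US")

    city = config.get("city")
    if city is not None and (city != city.lower() or " " in city):
        problems.append("city must be lowercase with no spaces")

    zip_code = config.get("zip")
    if zip_code is not None and not re.fullmatch(r"\d{5}", str(zip_code)):
        problems.append("zip must be 5 digits")

    for key in ("zip", "asn"):
        if key in config and narrow:
            problems.append(f"{key} conflicts with {' and '.join(narrow)}")

    return problems


print(targeting_conflicts({"country": "US", "asn": "AS7922", "state": "NY"}))
```

An empty list means none of the listed rules is violated; it does not guarantee that the API will accept the configuration.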
56+
57+
## Advanced Targeting Examples
58+
59+
Kernel recommends using the least-specific targeting configuration that works for your use case. The more specific a configuration, the fewer IPs are available, which increases the chance of a slow connection or no available connection (`no_peer` connection error).
60+
61+
### Target by City
62+
63+
Route traffic through a specific city:
64+
65+
<CodeGroup>
66+
67+
```typescript Typescript/Javascript
68+
const proxy = await kernel.proxies.create({
69+
type: 'residential',
70+
name: 'la-residential',
71+
config: {
72+
country: 'US',
73+
state: 'CA',
74+
city: 'los_angeles'
75+
}
76+
});
77+
```
78+
79+
```Python Python
80+
proxy = client.proxies.create(
81+
type='residential',
82+
name='la-residential',
83+
config={
84+
'country': 'US',
85+
'state': 'CA',
86+
'city': 'los_angeles'
87+
}
88+
)
89+
```
90+
91+
</CodeGroup>
92+
93+
<Note>
94+
If the city name is not matched, the API will return the best 10 city names from the state to help you find the correct city identifier.
95+
</Note>
96+
97+
### Target by State
98+
99+
Route traffic through a specific state:
100+
101+
<CodeGroup>
102+
103+
```typescript Typescript/Javascript
104+
const proxy = await kernel.proxies.create({
105+
type: 'residential',
106+
name: 'ny-residential',
107+
config: {
108+
country: 'US',
109+
state: 'NY'
110+
}
111+
});
112+
```
113+
114+
```Python Python
115+
proxy = client.proxies.create(
116+
type='residential',
117+
name='ny-residential',
118+
config={
119+
'country': 'US',
120+
'state': 'NY'
121+
}
122+
)
123+
```
124+
125+
</CodeGroup>
126+
127+
<Note>
128+
If the state name is not matched, the API will return the most-available 10 states.
129+
</Note>
130+
131+
### Target by ASN
132+
133+
Route traffic through a specific Autonomous System Number (ISP):
134+
135+
<CodeGroup>
136+
137+
```typescript Typescript/Javascript
138+
const proxy = await kernel.proxies.create({
139+
type: 'residential',
140+
name: 'comcast-residential',
141+
config: {
142+
country: 'US',
143+
asn: 'AS7922'
144+
}
145+
});
146+
```
147+
148+
```Python Python
149+
proxy = client.proxies.create(
150+
type='residential',
151+
name='comcast-residential',
152+
config={
153+
'country': 'US',
154+
'asn': 'AS7922'
155+
}
156+
)
157+
```
158+
159+
</CodeGroup>
160+
161+
<Note>
162+
If the ASN is not matched, the API will return the most-available 10 examples.
163+
</Note>
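
One way to act on the least-specific recommendation is to prepare progressively broader configurations up front, so a caller can fall back when a narrow target has no available peers. This is a sketch under assumptions: `broaden` is a hypothetical helper, not a Kernel SDK function, and it drops only `city` and then `state`, leaving every other key untouched.

```python
def broaden(config):
    """Hypothetical helper: list configs from the given one to the broadest.

    Drops 'city' first, then 'state'; all other keys are kept as-is.
    """
    candidates = [dict(config)]
    current = dict(config)
    for key in ("city", "state"):
        if key in current:
            # Build a new dict so earlier candidates are not mutated.
            current = {k: v for k, v in current.items() if k != key}
            candidates.append(current)
    return candidates


for candidate in broaden({"country": "US", "state": "CA", "city": "los_angeles"}):
    print(candidate)
```

Each candidate could be passed as `config` to `proxies.create` in turn. How a `no_peer` error surfaces in the SDK is not shown in this diff, so the retry logic is left to the caller.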
