From 8a8a03a23fd791ba103a80da189c322a627f435c Mon Sep 17 00:00:00 2001 From: angelinalg <122562471+angelinalg@users.noreply.github.com> Date: Thu, 6 Feb 2025 00:08:30 -0800 Subject: [PATCH] [Docs] Mention Anyscale platform provids e2e fault tolerance (#50274) Per request from: @brent-anyscale ## Related issue number [Jira issue](https://anyscale1.atlassian.net/browse/MLDX-468?atlOrigin=eyJpIjoiYjg3ODBkZDQ4MGMyNDUwNmJmMzU3NjQ2OWJkMzgyOGUiLCJwIjoiamlyYS1zbGFjay1pbnQifQ) ## Checks - [x] I've signed off every commit(by using the -s flag, i.e., `git commit -s`) in this PR. - [ ] I've run `scripts/format.sh` to lint the changes in this PR. - [ ] I've included any doc changes needed for https://docs.ray.io/en/master/. - [ ] I've added any new APIs to the API Reference. For example, if I added a method in Tune, I've added it in `doc/source/tune/api/` under the corresponding `.rst` file. - [ ] I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/ - Testing Strategy - [ ] Unit tests - [ ] Release tests - [ ] This PR is not tested :( Signed-off-by: angelinalg --- doc/source/serve/production-guide/fault-tolerance.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/doc/source/serve/production-guide/fault-tolerance.md b/doc/source/serve/production-guide/fault-tolerance.md index 5c46ee166bfe..a412071fa8c1 100644 --- a/doc/source/serve/production-guide/fault-tolerance.md +++ b/doc/source/serve/production-guide/fault-tolerance.md @@ -17,7 +17,9 @@ This section discusses concepts from: (serve-e2e-ft-guide)= ## Guide: end-to-end fault tolerance for your Serve app -Serve provides some [fault tolerance](serve-ft-detail) features out of the box. You can provide end-to-end fault tolerance by tuning these features and running Serve on top of [KubeRay]. +Serve provides some [fault tolerance](serve-ft-detail) features out of the box. Two options to get end-to-end fault tolerance are the following: +* tune these features and run Serve on top of [KubeRay] +* use the [Anyscale platform](https://docs.anyscale.com/platform/services/head-node-ft), a managed Ray platform ### Replica health-checking