
feat(google-vertexai): Support Non-Google and Model Garden models in Vertex AI - Anthropic integration #6999

Open · wants to merge 91 commits into base: main

Conversation

@afirstenberg (Contributor) commented Oct 17, 2024

Fixes #2562
Fixes #6207

The ultimate objective is to add full support in Vertex AI for the various third-party (3P) models that are provided via API, as well as for custom endpoints that users may deploy for their own models.

This will be done through an API object whose methods both turn LangChain objects into model-API-compatible objects (formatted as JSON for the request) and convert the JSON result back into LangChain objects.

In this first batch, we get the Gemini support integrated with this API object and add Anthropic / Claude support.

This work was supported by Google Cloud Credits provided by Google.

Testing
  • … a way that makes sense.
  • Add option for fetch() that determines how to handle missing MediaBlobs.
  • Change default action if invalid blob.
  • Add "ignore" action for invalid blobs.
  • Bug fix for BlobStore.store with how it handles the key (required making some functions and methods async)
# Conflicts (resolved):
#	libs/langchain-google-common/src/chat_models.ts
#	libs/langchain-google-common/src/utils/gemini.ts
@afirstenberg afirstenberg changed the title [WIP] feat(google-vertexai): Support Non-Google and Model Garden models in Vertex AI feat(google-vertexai): Support Non-Google and Model Garden models in Vertex AI - Anthropic integration Nov 2, 2024
@afirstenberg afirstenberg marked this pull request as ready for review November 3, 2024 00:10
@afirstenberg (Contributor, Author):

@bracesproul @jacoblee93 - I believe this is ready for review and, if things look good, integration and release.

This is just the Anthropic updates. As the other supported models are ready, I'll be doing separate PRs for them. (I don't want to hold things up.)

@afirstenberg (Contributor, Author):

Hmmm... I see the docs have failed, but I don't see why.

return this._location ?? this.computedLocation;
}

get computedLocation(): string {
Collaborator:
Any point to having a base implementation here? One less level of indirection to just put it in location above

Contributor (Author):

Mostly this is how location and endpoint evolved and come from a few requirements:

  • The endpoint is usually, but not always, based on the location, so it may need to be specified separately from the location.
  • Other elements of the path may also contain the location, so the location may need to be specified separately from the endpoint.
  • In the simple cases, I didn't want people to have to specify both (usually just the location), but they should be able to if necessary.

Most resources are available at us-central1. But, apparently not all. So I wanted subclasses to be able to set their default location if appropriate.

And I wanted to be able to support default locations. Requiring developers to know more magic values makes for less manageable code, especially if we can avoid it. (We support default values all over the place.)
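The pattern described above can be sketched roughly like this (class names and the `fields` shape are illustrative, not the PR's exact code): an explicit value always wins, and the computed default is a separate getter that subclasses can override.

```typescript
// Sketch of the location / computedLocation pattern under discussion.
// Hypothetical class names; a simplification of the real connection classes.
class GoogleBaseConnection {
  protected _location?: string;

  constructor(fields?: { location?: string }) {
    this._location = fields?.location;
  }

  // Prefer an explicitly configured location, fall back to the computed default.
  get location(): string {
    return this._location ?? this.computedLocation;
  }

  // Base default; subclasses override this when their resources live elsewhere.
  get computedLocation(): string {
    return "us-central1";
  }
}

// A subclass whose resources are only available in another region.
class SpecialRegionConnection extends GoogleBaseConnection {
  get computedLocation(): string {
    return "us-east5";
  }
}
```

This keeps one level of indirection, but it is what lets a subclass change only the default while the explicit-override behavior stays in the base class.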

return this._endpoint ?? this.computedEndpoint;
}

get computedEndpoint(): string {
Collaborator:

See above

Contributor (Author):

See above.

get computedLocation(): string {
switch (this.apiName) {
case "google":
return super.computedLocation;
Collaborator:

Why isn't this just super.location?

Contributor (Author):

computedLocation is typically only called from location. Keeping it calling the same method on the superclass keeps things clean and avoids extra logic.

case "google":
return super.computedLocation;
case "anthropic":
return "us-east5";
Collaborator:

Should we be hardcoding this here?

If it's only available in one region now, would prefer to have this configurable or even not have a default at all and just have it documented

Contributor (Author):

See above about making magic values as unnecessary as possible.

Some of the Anthropic models are available in other regions, but all are available in us-east5. And none, mysteriously, are available in us-central1.

I'd like to keep it as a default, but make it clearly settable.
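The behavior being discussed, a per-API default region that the caller can still override, might look something like the following (a simplified sketch with an illustrative class name, not the PR's exact code):

```typescript
// Sketch: per-API default locations with an explicit override.
// "anthropic" -> "us-east5" reflects the discussion above; all names here
// other than the apiName values are hypothetical.
class ApiConnection {
  apiName: string;
  protected _location?: string;

  constructor(fields?: { location?: string; apiName?: string }) {
    this._location = fields?.location;
    this.apiName = fields?.apiName ?? "google";
  }

  // An explicitly set location always wins over the default.
  get location(): string {
    return this._location ?? this.computedLocation;
  }

  get computedLocation(): string {
    switch (this.apiName) {
      case "anthropic":
        // Per the comment above: all the Claude models are available here.
        return "us-east5";
      case "google":
      default:
        return "us-central1";
    }
  }
}
```

Passing `location` in the constructor would bypass the hardcoded default entirely, which is the "settable" part of the proposal.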

description: tool.description ?? `A function available to call.`,
parameters: jsonSchema,
};
export class GoogleRequestLogger extends GoogleRequestCallbackHandler {
Collaborator:

This is intended for public use? Or just your own debugging?

Contributor (Author):

Initially for my debugging, but there have been cases where people have reported problems that I couldn't duplicate. Being able to easily have them add this logger to the callbacks to see what the request and response were would make it easier to create a test and a fix.

So it is rare that anyone would need to use it, but invaluable to have in place if they do.
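A debugging handler of this kind could be sketched as below. This is a self-contained illustration of the idea (record each outgoing request and incoming response so a user can paste the log into a bug report); the real GoogleRequestLogger extends GoogleRequestCallbackHandler and its method names and types may differ.

```typescript
// Illustrative sketch only: a minimal request/response logger in the spirit
// of GoogleRequestLogger. All names below are hypothetical.
type ConnectionRequest = { url: string; body: unknown };

class RequestLoggerSketch {
  logs: string[] = [];

  // Record the request before it is sent.
  handleRequest(req: ConnectionRequest): void {
    this.logs.push(`request ${req.url}: ${JSON.stringify(req.body)}`);
  }

  // Record the raw response as received.
  handleResponse(res: unknown): void {
    this.logs.push(`response: ${JSON.stringify(res)}`);
  }
}
```

In practice a user would attach the PR's logger via the model's callbacks list, then share the captured request/response pair when reporting an issue.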

* If the content is an array of just text fields, turn them into a string.
* @param fields
*/
function newAIMessageChunk(fields: string | AIMessageFields): AIMessageChunk {
@jacoblee93 (Collaborator) commented Nov 5, 2024:

Think it's worth exposing and reusing some logic from @langchain/anthropic?

https://github.com/langchain-ai/langchainjs/tree/main/libs/langchain-anthropic/src/utils

Could even make it a dep

Contributor (Author):

I considered it, and even tried for a bit, but ultimately decided not to for a few reasons:

  • It adds an unnecessary dependency for people who don't intend to use Claude on GCP, including pulling in the Anthropic library.
  • Some of the types aren't exactly compatible (I don't remember which ones now - I just remember starting down this road and running into issues).

The code you highlighted here has more to do with how my logic was generating the content objects and how there are expectations in parts of the base code that AIMessageChunk.content should be a string. So this normalizes it in those cases. (Which are almost all the cases for AI Messages historically.)
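The normalization being described can be sketched as follows (simplified types, not the PR's actual implementation): if a message's content is an array made up entirely of text parts, collapse it into a plain string, since parts of the base code expect `content` to be a string; mixed content stays structured.

```typescript
// Sketch of collapsing text-only content arrays into a string.
// `ContentPart` is a simplified stand-in for the real content-block types.
type ContentPart = { type: string; text?: string };
type MessageContent = string | ContentPart[];

function normalizeContent(content: MessageContent): MessageContent {
  if (typeof content === "string") {
    return content; // already a plain string
  }
  if (content.every((part) => part.type === "text")) {
    // Every part is text, so concatenate into the historical string form.
    return content.map((part) => part.text ?? "").join("");
  }
  return content; // mixed content (e.g. tool use) stays as an array
}
```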

Contributor (Author):

(To be clear, however, I'm open minded on this. I don't like duplicating code.)

@jacoblee93 (Collaborator) left a comment:

Overall looks fantastic - left some nits/comments and will try myself before cutting a release

@afirstenberg (Contributor, Author):

@jacoblee93 - Thanks! Very much appreciate the feedback.
I've explained my thought process on the points above, but happy to discuss them further.

Any thoughts on the problems with docs?

Labels
auto:enhancement A large net-new component, integration, or chain. Use sparingly. The largest features size:XXL This PR changes 1000+ lines, ignoring generated files.
3 participants