added AzureOpenAI API call #11

Open · wants to merge 3 commits into base: main
Conversation


@joel-lzb joel-lzb commented Nov 22, 2024

Added AzureOpenAI API call functions that still use the existing dependency (the openai library) but allow the use of deployed Azure endpoints. Updated README.md with an env template example.
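
A minimal sketch of the idea, for context (not code from this PR; the deployment name is hypothetical, and AZUREOPENAI_API_KEY / AZURE_OPENAI_ENDPOINT come from the env template added here):

import os
from openai import AzureOpenAI  # same openai dependency, Azure-specific client

client = AzureOpenAI(
    api_key=os.getenv("AZUREOPENAI_API_KEY"),
    api_version="2024-07-01-preview",
    azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
)
completion = client.chat.completions.create(
    model="my-gpt-deployment",  # hypothetical Azure deployment name, not a raw model id
    messages=[{"role": "user", "content": "Hello"}],
)
print(completion.choices[0].message.content)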

Summary by CodeRabbit

Release Notes

  • New Features

    • Introduced support for Azure OpenAI, allowing users to generate responses and embeddings synchronously and asynchronously.
    • Added new environment variables for configuration: AZUREOPENAI_API_KEY and AZURE_OPENAI_ENDPOINT.
  • Bug Fixes

    • Enhanced error handling and retry logic for API requests to ensure robustness.
  • Documentation

    • Updated the README to include new environment variable configurations for Azure OpenAI services.

uses existing OpenAI library, but added appropriate function calls when using AzureOpenAI endpoints
added AzureOpenAI env template

vercel bot commented Nov 22, 2024

The latest updates on your projects.

simpler-llm: ✅ Ready (Nov 22, 2024 2:51am UTC)


coderabbitai bot commented Nov 22, 2024

Walkthrough

The changes introduce support for Azure OpenAI within the LLM module of the project. A new provider, AzureOpenAI, is added to the LLMProvider enumeration, along with a corresponding class AzureOpenAILLM that extends the existing LLM class. This class includes methods for generating responses and embeddings both synchronously and asynchronously, with appropriate error handling and parameter validation. Additionally, the readme.md file is updated to reflect new environment variable configurations necessary for using Azure OpenAI services.

Changes

File Change Summary
SimplerLLM/language/llm.py Added AZUREOPENAI to LLMProvider. Introduced AzureOpenAILLM class with methods generate_response and generate_response_async. Updated create method to handle AZUREOPENAI provider.
SimplerLLM/language/llm_providers/azureopenai_llm.py Added methods for generating responses and embeddings: generate_response, generate_response_async, generate_embeddings, and generate_embeddings_async, all with error handling and retry logic.
readme.md Updated to include new environment variables: AZUREOPENAI_API_KEY and AZURE_OPENAI_ENDPOINT.
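
For orientation, a usage sketch of the new provider through the factory (based on the wiring described above; assumes the environment variables are set and that model_name is your Azure deployment name, which is hypothetical here):

from SimplerLLM.language.llm import LLM, LLMProvider

llm_instance = LLM.create(provider=LLMProvider.AZUREOPENAI, model_name="my-deployment")
print(llm_instance.generate_response(prompt="generate a 5 words sentence"))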

Poem

🐇 In the meadow, bright and wide,
Azure OpenAI is our guide.
With responses swift and true,
New paths of thought for me and you.
Hop along, let's explore this way,
With every change, we leap and play! 🌼


@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 9

🧹 Outside diff range and nitpick comments (6)
SimplerLLM/language/llm_providers/azureopenai_llm.py (4)

1-1: Remove misleading or outdated comment.

The comment # add streaming at the top of the file doesn't reflect the current content, as the code doesn't include streaming functionality. Consider removing or updating the comment to accurately describe the file.


124-125: Clarify the exception message for missing user_input.

The current exception message states "user_input must be provided." Consider specifying which function it pertains to for clearer debugging.

Suggested change:

     if not user_input:
-        raise ValueError("user_input must be provided.")
+        raise ValueError("generate_embeddings: 'user_input' must be provided.")

177-178: Clarify the exception message for missing user_input.

For better debugging, specify which function the exception is raised from.

Suggested change:

     if not user_input:
-        raise ValueError("user_input must be provided.")
+        raise ValueError("generate_embeddings_async: 'user_input' must be provided.")

17-211: Refactor to reduce code duplication across functions.

The functions generate_response, generate_response_async, generate_embeddings, and generate_embeddings_async share similar structures and logic. Consider refactoring common code into helper functions or creating a base class to improve maintainability and reduce redundancy.
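
One possible shape for such a helper, as a sketch only (the constants mirror the module's retry pattern; none of this is in the diff):

import asyncio
import time

MAX_RETRIES = 3   # assumed to match the module's constants
RETRY_DELAY = 2

def with_retries(func, *args, **kwargs):
    # Run func with exponential backoff, re-raising after the final attempt.
    for attempt in range(MAX_RETRIES):
        try:
            return func(*args, **kwargs)
        except Exception:
            if attempt == MAX_RETRIES - 1:
                raise
            time.sleep(RETRY_DELAY * (2 ** attempt))

async def with_retries_async(coro_func, *args, **kwargs):
    # Async counterpart; re-awaits the coroutine function on each attempt.
    for attempt in range(MAX_RETRIES):
        try:
            return await coro_func(*args, **kwargs)
        except Exception:
            if attempt == MAX_RETRIES - 1:
                raise
            await asyncio.sleep(RETRY_DELAY * (2 ** attempt))

Each of the four functions could then keep only its request-specific body.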

SimplerLLM/language/llm.py (2)

78-132: Refactor to eliminate code duplication with OpenAILLM

The AzureOpenAILLM class shares significant code with the OpenAILLM class, particularly in the generate_response, generate_response_async, and append_messages methods. Refactoring common logic into a shared base class or utility functions would enhance maintainability and reduce redundancy.

Consider applying this refactor:

+class BaseOpenAILLM(LLM):
+    def __init__(self, provider, model_name, temperature, top_p, api_key_env_var, api_key):
+        super().__init__(provider, model_name, temperature, top_p, api_key)
+        self.api_key = api_key or os.getenv(api_key_env_var, "")
+
+    def append_messages(self, system_prompt: str, messages: list):
+        model_messages = [{"role": "system", "content": system_prompt}]
+        if messages:
+            model_messages.extend(messages)
+        return model_messages
+
+    def _validate_and_prepare_messages(self, prompt, messages, system_prompt):
+        if prompt and messages:
+            raise ValueError("Only one of 'prompt' or 'messages' should be provided.")
+        if not prompt and not messages:
+            raise ValueError("Either 'prompt' or 'messages' must be provided.")
+        if prompt:
+            return [
+                {"role": "system", "content": system_prompt},
+                {"role": "user", "content": prompt},
+            ]
+        else:
+            return self.append_messages(system_prompt, messages)
+
+    def _prepare_params(self, model_name, temperature, top_p, max_tokens, full_response):
+        params = self.prepare_params(model_name, temperature, top_p)
+        params.update({
+            "api_key": self.api_key,
+            "max_tokens": max_tokens,
+            "full_response": full_response,
+        })
+        return params

-class AzureOpenAILLM(LLM):
+class AzureOpenAILLM(BaseOpenAILLM):
     def __init__(self, provider, model_name, temperature, top_p, api_key):
-        super().__init__(provider, model_name, temperature, top_p, api_key)
-        self.api_key = api_key or os.getenv("AZUREOPENAI_API_KEY", "")
+        super().__init__(provider, model_name, temperature, top_p, "AZUREOPENAI_API_KEY", api_key)
    
     def generate_response(
         self,
         model_name: str = None,
         prompt: str = None,
         messages: list = None,
         system_prompt: str = "You are a helpful AI Assistant",
         temperature: float = 0.7,
         max_tokens: int = 300,
         top_p: float = 1.0,
         full_response: bool = False,
     ):
-        params = self.prepare_params(model_name, temperature, top_p)
-        # Validate inputs
-        if prompt and messages:
-            raise ValueError("Only one of 'prompt' or 'messages' should be provided.")
-        if not prompt and not messages:
-            raise ValueError("Either 'prompt' or 'messages' must be provided.")
-        # Prepare messages based on input type
-        if prompt:
-            model_messages = [
-                {"role": "system", "content": system_prompt},
-                {"role": "user", "content": prompt},
-            ]
-        elif messages:
-            model_messages = self.append_messages(system_prompt, messages)
-        params.update(
-            {
-                "api_key": self.api_key,
-                "messages": model_messages,
-                "max_tokens": max_tokens,
-                "full_response": full_response,
-            }
-        )
+        model_messages = self._validate_and_prepare_messages(prompt, messages, system_prompt)
+        params = self._prepare_params(model_name, temperature, top_p, max_tokens, full_response)
+        params["messages"] = model_messages
         return azureopenai_llm.generate_response(**params)
    
     async def generate_response_async(
         self,
         model_name: str = None,
         prompt: str = None,
         messages: list = None,
         system_prompt: str = "You are a helpful AI Assistant",
         temperature: float = 0.7,
         max_tokens: int = 300,
         top_p: float = 1.0,
         full_response: bool = False,
     ):
-        params = self.prepare_params(model_name, temperature, top_p)
-        # Validate inputs
-        if prompt and messages:
-            raise ValueError("Only one of 'prompt' or 'messages' should be provided.")
-        if not prompt and not messages:
-            raise ValueError("Either 'prompt' or 'messages' must be provided.")
-        # Prepare messages based on input type
-        if prompt:
-            model_messages = [
-                {"role": "system", "content": system_prompt},
-                {"role": "user", "content": prompt},
-            ]
-        elif messages:
-            model_messages = self.append_messages(system_prompt, messages)
-        params.update(
-            {
-                "api_key": self.api_key,
-                "messages": model_messages,
-                "max_tokens": max_tokens,
-                "full_response": full_response,
-            }
-        )
+        model_messages = self._validate_and_prepare_messages(prompt, messages, system_prompt)
+        params = self._prepare_params(model_name, temperature, top_p, max_tokens, full_response)
+        params["messages"] = model_messages
         return await azureopenai_llm.generate_response_async(**params)

92-131: Extract common logic in generate_response methods

The generate_response and generate_response_async methods contain repeated code for input validation and message preparation. To improve code clarity and reduce duplication, consider extracting this logic into a private helper method.

Example refactor:

+    def _prepare_request(
+        self,
+        model_name,
+        prompt,
+        messages,
+        system_prompt,
+        temperature,
+        max_tokens,
+        top_p,
+        full_response,
+    ):
+        params = self.prepare_params(model_name, temperature, top_p)
+        # Validate inputs
+        if prompt and messages:
+            raise ValueError("Only one of 'prompt' or 'messages' should be provided.")
+        if not prompt and not messages:
+            raise ValueError("Either 'prompt' or 'messages' must be provided.")
+        # Prepare messages
+        if prompt:
+            model_messages = [
+                {"role": "system", "content": system_prompt},
+                {"role": "user", "content": prompt},
+            ]
+        else:
+            model_messages = self.append_messages(system_prompt, messages)
+        params.update(
+            {
+                "api_key": self.api_key,
+                "messages": model_messages,
+                "max_tokens": max_tokens,
+                "full_response": full_response,
+            }
+        )
+        return params

     def generate_response(
         self,
         model_name: str = None,
         prompt: str = None,
         messages: list = None,
         system_prompt: str = "You are a helpful AI Assistant",
         temperature: float = 0.7,
         max_tokens: int = 300,
         top_p: float = 1.0,
         full_response: bool = False,
     ):
-        params = self.prepare_params(model_name, temperature, top_p)
-        # Validate inputs
-        # ...
-        # Prepare messages
-        # ...
-        params.update(
-            {
-                "api_key": self.api_key,
-                "messages": model_messages,
-                "max_tokens": max_tokens,
-                "full_response": full_response,
-            }
-        )
+        params = self._prepare_request(
+            model_name,
+            prompt,
+            messages,
+            system_prompt,
+            temperature,
+            max_tokens,
+            top_p,
+            full_response,
+        )
         return azureopenai_llm.generate_response(**params)

     async def generate_response_async(
         self,
         model_name: str = None,
         prompt: str = None,
         messages: list = None,
         system_prompt: str = "You are a helpful AI Assistant",
         temperature: float = 0.7,
         max_tokens: int = 300,
         top_p: float = 1.0,
         full_response: bool = False,
     ):
-        params = self.prepare_params(model_name, temperature, top_p)
-        # Validate inputs
-        # ...
-        # Prepare messages
-        # ...
-        params.update(
-            {
-                "api_key": self.api_key,
-                "messages": model_messages,
-                "max_tokens": max_tokens,
-                "full_response": full_response,
-            }
-        )
+        params = self._prepare_request(
+            model_name,
+            prompt,
+            messages,
+            system_prompt,
+            temperature,
+            max_tokens,
+            top_p,
+            full_response,
+        )
         return await azureopenai_llm.generate_response_async(**params)

Also applies to: 133-171

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL

📥 Commits

Reviewing files that changed from the base of the PR and between 927b416 and d8e109e.

📒 Files selected for processing (3)
  • SimplerLLM/language/llm.py (5 hunks)
  • SimplerLLM/language/llm_providers/azureopenai_llm.py (1 hunks)
  • readme.md (1 hunks)
🔇 Additional comments (6)
SimplerLLM/language/llm_providers/azureopenai_llm.py (2)

141-142: ⚠️ Potential issue

Ensure correct access to embedding data.

The embeddings are assigned using generate_embeddings = response.data. Typically, embeddings are accessed via response.data[0].embedding. Verify that you are extracting the embeddings correctly.
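
For reference, the usual access pattern in the openai v1 SDK looks like this (a sketch; the deployment name is hypothetical):

import os
from openai import AzureOpenAI

client = AzureOpenAI(
    api_key=os.getenv("AZUREOPENAI_API_KEY"),
    api_version="2024-07-01-preview",
    azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
)
response = client.embeddings.create(model="my-embedding-deployment", input="some text")
vectors = [item.embedding for item in response.data]  # the vector lives on .embedding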


2-3: ⚠️ Potential issue

Verify the imports of AsyncAzureOpenAI and AzureOpenAI.

The classes AsyncAzureOpenAI and AzureOpenAI imported from the openai library may not exist. Ensure that these classes are available in the openai package you're using, or adjust the imports accordingly.

Run the following script to check if these classes are available in the openai module:

✅ Verification successful

The imports are correct for the OpenAI SDK version being used

The codebase is using openai==1.25.0 as specified in requirements.txt, and both AsyncAzureOpenAI and AzureOpenAI classes are correctly imported and used throughout the codebase. These classes are part of the official OpenAI v1 Python SDK, which introduced a new Azure-specific client implementation. The usage is consistent with the API version "2024-07-01-preview" being specified in the code.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check for 'AsyncAzureOpenAI' and 'AzureOpenAI' in the openai module.

# List all attributes of the openai module and search for the classes
python -c "import openai; print(dir(openai))" | grep -E 'AzureOpenAI|AsyncAzureOpenAI'

Length of output: 185


Script:

#!/bin/bash
# Let's check the requirements.txt or setup.py files to see what version of openai is being used
fd "requirements.txt|setup.py" --exec cat {}

# Also check for any direct references to these classes in the codebase
rg "AsyncAzureOpenAI|AzureOpenAI" -A 2 -B 2

Length of output: 5093
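
For the record, the verified imports look like this in code (both names ship with openai>=1.0; the repo pins openai==1.25.0):

from openai import AzureOpenAI, AsyncAzureOpenAI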

SimplerLLM/language/llm.py (4)

6-6: Import statement added correctly

The import of the azureopenai_llm module is properly added, ensuring that Azure OpenAI functions are accessible.


19-19: Enum updated with new provider

The LLMProvider enum now includes AZUREOPENAI, allowing the selection of Azure OpenAI as a provider.


59-61: Factory method updated to support Azure OpenAI

The create method correctly handles the AZUREOPENAI provider by returning an instance of AzureOpenAILLM.


666-666: Ensure correct parameter passing in LwhLLM

The line appears to be part of the LwhLLM class. Verify that all required parameters are correctly passed to lwh_llm.generate_response_async.

Run the following script to check for parameter consistency:

✅ Verification successful

Parameter passing to generate_response_async is correct

All required parameters from the generate_response_async function signature are correctly included in the params dictionary being passed, including user_id, messages, system_prompt, max_tokens, temperature, top_p, and full_response. The parameters match both in name and type with the function definition.

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: Check that all calls to `lwh_llm.generate_response_async` pass the correct parameters.

# Search for the function definition
rg -A 10 'def generate_response_async' SimplerLLM/language/llm_providers/lwh_llm.py

Length of output: 392


Script:

#!/bin/bash
# Let's check the context of the line in question and the parameters being passed
rg -B 10 -A 2 'return await lwh_llm\.generate_response_async\(\*\*params\)' SimplerLLM/language/llm.py

Length of output: 534

Comment on lines +50 to +52
AZUREOPENAI_API_KEY="your_azureopenai_api_key_here"
AZURE_OPENAI_ENDPOINT='your_azureopenai_endpoint_url_here'


🛠️ Refactor suggestion

Standardize environment variable naming and add deployment name.

The Azure OpenAI configuration needs some adjustments:

  1. Standardize the naming convention: AZUREOPENAI_API_KEY should be AZURE_OPENAI_API_KEY to match the endpoint variable style
  2. Add the required AZURE_OPENAI_DEPLOYMENT_NAME environment variable which is typically needed for Azure OpenAI API calls
-AZUREOPENAI_API_KEY="your_azureopenai_api_key_here"
+AZURE_OPENAI_API_KEY="your_azure_openai_api_key_here"
 AZURE_OPENAI_ENDPOINT='your_azureopenai_endpoint_url_here'
+AZURE_OPENAI_DEPLOYMENT_NAME='your_azure_openai_deployment_name_here'
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
-AZUREOPENAI_API_KEY="your_azureopenai_api_key_here"
+AZURE_OPENAI_API_KEY="your_azure_openai_api_key_here"
 AZURE_OPENAI_ENDPOINT='your_azureopenai_endpoint_url_here'
+AZURE_OPENAI_DEPLOYMENT_NAME='your_azure_openai_deployment_name_here'

🛠️ Refactor suggestion

Add Azure OpenAI usage example in code samples.

The environment variables are added but there's no example showing how to use Azure OpenAI in the "Creating an LLM Instance" section.

Add this example to the code samples section:

 # For Anthropic Claude 
 #llm_instance = LLM.create(LLMProvider.ANTHROPIC, model_name="claude-3-opus-20240229")
+
+# For Azure OpenAI
+#llm_instance = LLM.create(provider=LLMProvider.AZURE_OPENAI, model_name="your-deployment-model")
 
 response = llm_instance.generate_response(prompt="generate a 5 words sentence")
📝 Committable suggestion
Suggested change
-AZUREOPENAI_API_KEY="your_azureopenai_api_key_here"
-AZURE_OPENAI_ENDPOINT='your_azureopenai_endpoint_url_here'
+# For Anthropic Claude
+#llm_instance = LLM.create(LLMProvider.ANTHROPIC, model_name="claude-3-opus-20240229")
+
+# For Azure OpenAI
+#llm_instance = LLM.create(provider=LLMProvider.AZURE_OPENAI, model_name="your-deployment-model")
+
+response = llm_instance.generate_response(prompt="generate a 5 words sentence")

Comment on lines +67 to +116
async def generate_response_async(
    model_name,
    messages=None,
    temperature=0.7,
    max_tokens=300,
    top_p=1.0,
    full_response=False,
    api_key = None,
):
    start_time = time.time() if full_response else None
    async_openai_client = AsyncAzureOpenAI(
        api_key=api_key,
        api_version="2024-07-01-preview",
        azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT"),
    )

    for attempt in range(MAX_RETRIES):
        try:
            completion = await async_openai_client.chat.completions.create(
                model=model_name,
                messages=messages,
                temperature=temperature,
                max_tokens=max_tokens,
                top_p=top_p,
            )
            generated_text = completion.choices[0].message.content

            if full_response:
                end_time = time.time()
                process_time = end_time - start_time
                return LLMFullResponse(
                    generated_text=generated_text,
                    model=model_name,
                    process_time=process_time,
                    llm_provider_response=completion,
                )
            return generated_text

        except Exception as e:
            if attempt < MAX_RETRIES - 1:
                await asyncio.sleep(RETRY_DELAY * (2**attempt))
            else:
                error_msg = f"Failed after {MAX_RETRIES} attempts"
                if full_response:
                    end_time = time.time()
                    process_time = end_time - start_time
                    error_msg += f" and {process_time} seconds"
                error_msg += f" due to: {e}"
                print(error_msg)
                return None


🛠️ Refactor suggestion

Enhance error handling in generate_response_async.

The asynchronous function generate_response_async uses print statements for error messages and returns None on failure. Align the error handling with the synchronous version by using the logging module and raising exceptions.

Apply similar changes as suggested for generate_response:

  • Use logging.error(error_msg) for error messages.
  • Raise an exception instead of returning None.

Comment on lines +200 to +211
        except Exception as e:
            if attempt < MAX_RETRIES - 1:
                await asyncio.sleep(RETRY_DELAY * (2**attempt))
            else:
                error_msg = f"Failed after {MAX_RETRIES} attempts"
                if full_response:
                    end_time = time.time()
                    process_time = end_time - start_time
                    error_msg += f" and {process_time} seconds"
                error_msg += f" due to: {e}"
                print(error_msg)
                return None

🛠️ Refactor suggestion

Enhance error handling in generate_embeddings_async.

Align the asynchronous embeddings function with the error handling practices used elsewhere:

  • Use logging.error(error_msg) for error messages.
  • Raise exceptions instead of returning None.

Apply the following changes:

+import logging

...

         except Exception as e:
             if attempt < MAX_RETRIES - 1:
                 await asyncio.sleep(RETRY_DELAY * (2**attempt))
             else:
                 error_msg = f"Failed after {MAX_RETRIES} attempts"
                 if full_response:
                     end_time = time.time()
                     process_time = end_time - start_time
                     error_msg += f" and {process_time} seconds"
                 error_msg += f" due to: {e}"
-                print(error_msg)
-                return None
+                logging.error(error_msg)
+                raise Exception(error_msg)

Committable suggestion skipped: line range outside the PR's diff.

Comment on lines +167 to +172
async def generate_embeddings_async(
    model_name,
    user_input=None,
    full_response = False,
    api_key = None,
):

⚠️ Potential issue

Add input validation for model_name in generate_embeddings_async.

Similar to the synchronous version, ensure that model_name is provided and is a non-empty string.

Apply this diff:

 async def generate_embeddings_async(
     model_name,
     user_input=None,
     full_response=False,
     api_key=None,
 ):
+    if not model_name:
+        raise ValueError("model_name must be provided and cannot be empty.")
📝 Committable suggestion
Suggested change
-async def generate_embeddings_async(
-    model_name,
-    user_input=None,
-    full_response = False,
-    api_key = None,
-):
+async def generate_embeddings_async(
+    model_name,
+    user_input=None,
+    full_response=False,
+    api_key=None,
+):
+    if not model_name:
+        raise ValueError("model_name must be provided and cannot be empty.")

Comment on lines +117 to +122
def generate_embeddings(
    model_name,
    user_input=None,
    full_response = False,
    api_key = None
):

⚠️ Potential issue

Add input validation for model_name in generate_embeddings.

The model_name parameter is critical for the API call but lacks validation. Ensure that model_name is provided and is a non-empty string.

Apply this diff to add validation:

 def generate_embeddings(
     model_name,
     user_input=None,
     full_response=False,
     api_key=None
 ):
+    if not model_name:
+        raise ValueError("model_name must be provided and cannot be empty.")
📝 Committable suggestion
Suggested change
-def generate_embeddings(
-    model_name,
-    user_input=None,
-    full_response = False,
-    api_key = None
-):
+def generate_embeddings(
+    model_name,
+    user_input=None,
+    full_response=False,
+    api_key=None
+):
+    if not model_name:
+        raise ValueError("model_name must be provided and cannot be empty.")

Comment on lines +154 to +165
        except Exception as e:
            if attempt < MAX_RETRIES - 1:
                time.sleep(RETRY_DELAY * (2**attempt))
            else:
                error_msg = f"Failed after {MAX_RETRIES} attempts"
                if full_response:
                    end_time = time.time()
                    process_time = end_time - start_time
                    error_msg += f" and {process_time} seconds"
                error_msg += f" due to: {e}"
                print(error_msg)
                return None

🛠️ Refactor suggestion

Improve exception handling and use proper logging in generate_embeddings.

Similar to previous suggestions, replace print(error_msg) with logging.error(error_msg) and raise exceptions instead of returning None.

Apply the following changes:

+import logging

...

         except Exception as e:
             if attempt < MAX_RETRIES - 1:
                 time.sleep(RETRY_DELAY * (2**attempt))
             else:
                 error_msg = f"Failed after {MAX_RETRIES} attempts"
                 if full_response:
                     end_time = time.time()
                     process_time = end_time - start_time
                     error_msg += f" and {process_time} seconds"
                 error_msg += f" due to: {e}"
-                print(error_msg)
-                return None
+                logging.error(error_msg)
+                raise Exception(error_msg)

Committable suggestion skipped: line range outside the PR's diff.

Comment on lines +28 to +29
api_version="2024-07-01-preview",
azure_endpoint=os.getenv("AZURE_OPENAI_ENDPOINT")

💡 Codebase verification

Azure OpenAI API version needs to be updated to a stable version

The code currently uses the API version "2024-07-01-preview", which is a preview release. Preview versions can change or be retired; prefer a stable version such as "2023-05-15", or check the Azure OpenAI documentation for the latest stable version.

  • In SimplerLLM/language/llm_providers/azureopenai_llm.py, update the api_version in all four instances where it's set to "2024-07-01-preview"
🔗 Analysis chain

Verify the correctness of the api_version.

The api_version is set to "2024-07-01-preview". Please confirm that this is the correct API version for Azure OpenAI. The latest available version might be different, which could affect compatibility.

You can run the following script to list available API versions:

🏁 Scripts executed

The following scripts were executed for the analysis:

Script:

#!/bin/bash
# Description: List available API versions for Azure OpenAI.

# Note: Replace '<resource-group>' and '<resource-name>' with your actual resource group and resource name.
az openai account deployment list --resource-group <resource-group> --resource-name <resource-name> --query '[].apiVersion'

Length of output: 181


Script:

#!/bin/bash
# Let's check the Azure OpenAI documentation or configuration files in the codebase
rg -i "api[_-]version.*azure.*openai" -g "*.{py,json,yaml,yml,md,txt}" -B 2 -A 2

Length of output: 82


Script:

#!/bin/bash
# Search for any Azure OpenAI related configuration or constants
ast-grep --pattern 'api_version = "$_"'

Length of output: 41


Script:

#!/bin/bash
# Look for any Azure OpenAI related documentation or README files
fd -e md -e txt -e rst | xargs rg -i "azure.*openai" -B 2 -A 2

Length of output: 2224


Script:

#!/bin/bash
# Let's check the specific file content to understand the context better
cat SimplerLLM/language/llm_providers/azureopenai_llm.py

Length of output: 7566


Script:

#!/bin/bash
# Search for any version-related imports or configurations
rg -i "2024-07-01-preview" -g "*.{py,json,yaml,yml,md,txt}" -B 2 -A 2

Length of output: 2219
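
As a follow-up to the api_version note above, one way to keep the four call sites in sync is a single module-level constant, sketched here (AZURE_OPENAI_API_VERSION is an assumed variable name, not part of the PR):

import os

# Fall back to a stable version; override via the environment when needed.
AZURE_OPENAI_API_VERSION = os.getenv("AZURE_OPENAI_API_VERSION", "2023-05-15")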

Comment on lines +54 to +65
        except Exception as e:
            if attempt < MAX_RETRIES - 1:
                time.sleep(RETRY_DELAY * (2**attempt))
            else:
                error_msg = f"Failed after {MAX_RETRIES} attempts"
                if full_response:
                    end_time = time.time()
                    process_time = end_time - start_time
                    error_msg += f" and {process_time} seconds"
                error_msg += f" due to: {e}"
                print(error_msg)
                return None

🛠️ Refactor suggestion

Improve exception handling and use proper logging.

Currently, errors are being printed using print(error_msg), and the function returns None on failure. Consider using the logging module for error messages and raising exceptions to inform the caller of failures.

Apply the following changes:

  • Import the logging module and configure it as needed.
  • Replace print(error_msg) with logging.error(error_msg).
  • Instead of returning None, raise an exception with the error message.

Example diff:

+import logging

...

         except Exception as e:
             if attempt < MAX_RETRIES - 1:
                 time.sleep(RETRY_DELAY * (2**attempt))
             else:
                 error_msg = f"Failed after {MAX_RETRIES} attempts"
                 if full_response:
                     end_time = time.time()
                     process_time = end_time - start_time
                     error_msg += f" and {process_time} seconds"
                 error_msg += f" due to: {e}"
-                print(error_msg)
-                return None
+                logging.error(error_msg)
+                raise Exception(error_msg)

Committable suggestion skipped: line range outside the PR's diff.
