Is this a new feature, an improvement, or a change to existing functionality?
New Feature
How would you describe the priority of this feature request
High
Please provide a clear description of problem this feature solves
As part of the Sherlock work, we need an example showing how to use Morpheus to execute multiple LLM completion queries inside of a pipeline.
Describe your ideal solution
Purpose
The purpose of this example is to illustrate how a user could build a pipeline that integrates an LLM service into Morpheus. This example is the most basic building block, showing how to use the `LLMEngine` without requiring any external services or pre-existing data.
Scenario
This example will show a single implementation, but the pipeline and its components could be used in many scenarios with different requirements. At a high level, the following illustrates the different customization points for this pipeline and the specific choices made for this example:
LLM Service
This pipeline could support any LLM service that is compatible with our `LLMService` interface. Such services include OpenAI, NeMo, or even models run locally using `llama-cpp-python`.
For this example, we will focus on using NeMo as the LLM service. This provides an opportunity to show the benefits of using NeMo over other LLM services and helps build components around the NeMo ecosystem. Additionally, more complex pipelines could be built using NeMo + Inform without requiring any changes to the pipeline.
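To make the "any compatible service" idea concrete, here is a minimal, dependency-free sketch of a pluggable completion interface. The names (`CompletionService`, `EchoService`, `run_queries`) are hypothetical illustrations, not the actual Morpheus `LLMService` API:

```python
from typing import Protocol


class CompletionService(Protocol):
    """Hypothetical protocol capturing the shape of a pluggable LLM backend."""

    def generate(self, prompt: str) -> str:
        """Return the model's completion for a single prompt."""
        ...


class EchoService:
    """Stand-in backend, useful for testing without any external service."""

    def generate(self, prompt: str) -> str:
        return f"echo: {prompt}"


def run_queries(service: CompletionService, prompts: list[str]) -> list[str]:
    # Any backend (OpenAI, NeMo, llama-cpp-python) could be swapped in here,
    # as long as it satisfies the protocol.
    return [service.generate(p) for p in prompts]
```

Because the pipeline only depends on the interface, switching from NeMo to another provider would not require changes to the pipeline itself.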
Downstream tasks
After the LLM has been run, the output of the model could be used in any number of tasks such as training a model, running analysis, or even simulating an attack.
For this example, we will not have any downstream tasks, to keep the implementation simple and the focus on the `LLMEngine`.
Implementation
This example will be composed of a single `click` command.
Morpheus pipeline
The Morpheus pipeline is built using the following components:

- An `InMemorySourceStage` to hold the LLM queries in a DataFrame
- A `DeserializeStage` to convert the `MessageMeta` objects into the `ControlMessage`s needed by the `LLMEngine`
  - New functionality was added to the `DeserializeStage` to support `ControlMessage`s and add a default task to each message
- An `LLMEngineStage` which wraps the core `LLMEngine` functionality
  - An `ExtracterNode` pulls the questions out of the DataFrame
  - A `PromptTemplateNode` converts the data and a template into the final inputs for the LLM
  - The LLM queries are executed using an `LLMGenerateNode`
  - Finally, the responses are put back into the `ControlMessage` using a `SimpleTaskHandler`
- The pipeline concludes with an `InMemorySinkStage` to store the results
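The dataflow the engine nodes implement can be sketched without any Morpheus dependencies. The functions below mirror the nodes described above (`ExtracterNode`, `PromptTemplateNode`, `LLMGenerateNode`, `SimpleTaskHandler`), but operate on plain dicts rather than real `ControlMessage`/`MessageMeta` objects, and the LLM call is stubbed:

```python
TEMPLATE = "Answer briefly: {question}"


def extract_questions(rows: list[dict]) -> list[str]:
    # ExtracterNode: pull the input column out of the DataFrame rows.
    return [row["question"] for row in rows]


def apply_template(questions: list[str]) -> list[str]:
    # PromptTemplateNode: combine each value with the prompt template.
    return [TEMPLATE.format(question=q) for q in questions]


def generate(prompts: list[str]) -> list[str]:
    # LLMGenerateNode: would call the configured LLM service (e.g. NeMo).
    # Stubbed here so the example runs without external services.
    return [f"<completion for: {p}>" for p in prompts]


def attach_responses(rows: list[dict], responses: list[str]) -> list[dict]:
    # SimpleTaskHandler: write the responses back onto the message payload.
    return [dict(row, response=r) for row, r in zip(rows, responses)]


rows = [{"question": "What is Morpheus?"}]
out = attach_responses(rows, generate(apply_template(extract_questions(rows))))
```

This is only a simulation of the dataflow; the real nodes run asynchronously inside the `LLMEngine` and batch their LLM calls.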
Completion Criteria
The following items need to be satisfied to consider this issue complete:
- A `README.md` containing information on the following:
  - Background information about the problem at hand
  - Information about the specific implementation and the reasoning behind design decisions (use content from this issue)
  - Step-by-step instructions for how to run the Morpheus pipeline, including instructions on using the NeMo service
  - The `README.md` should be linked in the developer docs
- A functioning Morpheus pipeline command which satisfies the following:
  - Runs without error using all default arguments
  - Correctly runs the queries through the specified NeMo model
  - Provides information about the success or failure of the pipeline, including the number of queries run, the throughput, and the total runtime
- Tests which include the following:
  - A test that successfully runs the Morpheus pipeline
  - Outputs from the model can be pre-calculated using the NeMo playground or the NeMo curl API to verify that the responses from Morpheus are correct
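The success/failure reporting called for above could be as simple as a summary line computed from the query count and elapsed wall-clock time. This is a hedged sketch of one possible format, not the pipeline's actual output:

```python
def report(num_queries: int, start: float, end: float) -> str:
    """Summarize the metrics named in the completion criteria:
    query count, total runtime, and throughput (queries/sec)."""
    runtime = end - start
    throughput = num_queries / runtime if runtime > 0 else float("inf")
    return (f"queries={num_queries} runtime={runtime:.2f}s "
            f"throughput={throughput:.1f} q/s")
```

In the real pipeline, `start`/`end` would come from timestamps taken around the pipeline run (e.g. via `time.monotonic()`).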
Dependent Issues
The following issues should be resolved before this can be completed:
- `ExtractorNode` #1277
- `LLMEngineStage` #1278
- `LLMGenerateNode` #1279
- `PromptTemplateNode` #1282
- `SimpleTaskHandler` #1302
- `LLMService` #1303

Additional context

No response