Refactor DSPy adapters to make it more extensible #7996

chenmoneygithub · 2025-03-21T23:16:44Z

We are reworking DSPy adapters for extensibility. For most users this change shouldn't cause backward compatibility issues, but if your workflow explicitly calls child methods of DSPy adapters, you need to make adjustments.

The goal here is with DSPy 3.0, we want Adapter to be a customizable interface rather than some tribal knowledge. We acknowledge that it will be common for users willing to write their own adapter to adjust to their LLMs and workflows. However, the current adapter doesn't have a decent abstraction, and to write a custom adapter users need to understand the source code, and go through a tedious debugging process without guidelines.

In this PR, we are trying to standardize the dspy Adapters, and open a few hooks for people to override during customization. We are aware that there is no single standard that fits all use cases, but trying to hit a stage where we don't over-simplify or over-engineer the base DSPy Adapter/

In a nutshell, we are making the following breakdown of Adapters:

Adapter
- format(): formats the type-based inputs into LM multiturn messages
  - System messages: The high level description of the task, and LM I/O format.
    - Fields description: format_field_description()
    - LM input/output structure description: format_field_structure()
    - task description: format_task_description
  - Few-shot examples (demo): multiturn few-shot examples
    - user message (inputs of demo): format_user_message_content()
    - assistant message (outputs of demo): format_assistant_message_content()
  - Conversation history: multiturn conversation history
    - user message (inputs of history message): format_user_message_content()
    - assistant message (outputs of history message): format_assistant_message_content()
  - Current input: the actual question/input
    - user message: format_user_message_content()
- parse(): parse the LM response to type-based outputs. No sub-hook for parse() because it varies for different adapters.

Note that format_user_message_content() and format_assistant_message_content() are used in multiple places. Users can override any level of hooks for customization.

We will publish a guide on how to customize Adapter with concrete use cases after landing this PR.

okhat · 2025-03-31T20:40:18Z

dspy/adapters/chat_adapter.py

+
+        class MySignature(dspy.Signature):
+            text: str = dspy.InputField(description="The text to analyze")
+            context: str = dspy.InputField(description="The context of the text")


Why this long docstring that will go stale as soon as someone changes the adapter slightly?

simplified for easy maintenance!

okhat · 2025-03-31T20:41:54Z

dspy/adapters/base.py

+        for k, v in signature.input_fields.items():
+            value = inputs[k]
+            formatted_field_value = format_field_value(field_info=v, value=value)
+            messages.append(f"[[ ## {k} ## ]]\n{formatted_field_value}")


This is very specific to chat adapter. Why does it belong in base.py?

Overall, base.py should be very short and lightweight to allow anyone to build any kind of adapter. Right now, this PR is introducing a lot of specific and complicated structure to base.py

okhat · 2025-03-31T20:43:04Z

dspy/adapters/base.py

+        conversation_history = self.format_conversation_history(signature, inputs_copy)
+        if conversation_history:
+            # Conversation history and current input
+            messages.extend(conversation_history)


How do we handle cases where the demos have multi-turn history?

it will be a single turn entity with history as one field, for example:

User message: This is an example of the task, though some input or output fields are not supplied. [[ ## question ## ]] What is the capital of France? [[ ## history ## ]] [{"question": "What is the capital of Germany?", "answer": "Berlin"}] Respond with the corresponding output fields, starting with the field `[[ ## reasoning ## ]]`, then `[[ ## answer ## ]]`, and then ending with the marker for `[[ ## completed ## ]]`. Assistant message: [[ ## reasoning ## ]] Not supplied for this particular example. [[ ## answer ## ]] Paris

chenmoneygithub marked this pull request as draft March 21, 2025 23:17

chenmoneygithub changed the title ~~Refactor DSPy adapters to make it more extensible~~ [WIP] Refactor DSPy adapters to make it more extensible Mar 21, 2025

chenmoneygithub changed the title ~~[WIP] Refactor DSPy adapters to make it more extensible~~ Refactor DSPy adapters to make it more extensible Mar 21, 2025

chenmoneygithub marked this pull request as ready for review March 21, 2025 23:17

okhat reviewed Mar 31, 2025

View reviewed changes

chenmoneygithub added 6 commits April 1, 2025 11:08

init

41b79ae

init

eb7baf0

increment

fc252b5

add docstring

406e9c0

refactor

9005569

fix broken path

59ce841

chenmoneygithub force-pushed the extensible-adapter branch from 5ed7109 to 59ce841 Compare April 1, 2025 02:12

better style

793aa7f

chenmoneygithub requested a review from okhat April 1, 2025 02:30

chenmoneygithub and others added 2 commits April 1, 2025 11:52

increment

64c1c15

Merge branch 'main' into extensible-adapter

062cee9

okhat merged commit a48eff0 into stanfordnlp:main Apr 1, 2025
4 checks passed

Taytay mentioned this pull request Apr 2, 2025

[Question] Is DSPy _designed_ to allow me to export optimized prompt templates/programs? (How to use DSPy with other frameworks) #8043

Closed

2 tasks

okhat mentioned this pull request Apr 8, 2025

Fix JSON Adapter's first attempt, all Adapters for ReAct trajectories #8051

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Refactor DSPy adapters to make it more extensible #7996

Refactor DSPy adapters to make it more extensible #7996

chenmoneygithub commented Mar 21, 2025

okhat Mar 31, 2025

chenmoneygithub Apr 1, 2025

okhat Mar 31, 2025 •

edited

Loading

okhat Mar 31, 2025

chenmoneygithub Apr 1, 2025

Refactor DSPy adapters to make it more extensible #7996

Refactor DSPy adapters to make it more extensible #7996

Conversation

chenmoneygithub commented Mar 21, 2025

okhat Mar 31, 2025

Choose a reason for hiding this comment

chenmoneygithub Apr 1, 2025

Choose a reason for hiding this comment

okhat Mar 31, 2025 • edited Loading

Choose a reason for hiding this comment

okhat Mar 31, 2025

Choose a reason for hiding this comment

chenmoneygithub Apr 1, 2025

Choose a reason for hiding this comment

okhat Mar 31, 2025 •

edited

Loading