Adding support for HYBRID search. #3813

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

petyaslavova wants to merge 5 commits into master from ps_add_hybrid_search

+8,437 −5,239

Collaborator

petyaslavova commented Oct 24, 2025

Pull Request check-list

Please make sure to review and check all of these items:

Do tests and lints pass with this change?
Do the CI tests pass with this change (enable it first in your forked repo and wait for the github action build to finish)?
Is the new or changed code fully tested?
Is a documentation update included (if this change modifies existing APIs, or introduces new ones)?
Is there an example added to the examples folder (if applicable)?

NOTE: these things are not required to open a PR and can be done
afterwards / while the PR is open.

Description of change

Please provide a description of the change here.

petyaslavova requested a review from Copilot

October 24, 2025 09:22

Copilot AI reviewed

View reviewed changes

Contributor

Copilot AI left a comment

Pull Request Overview

This PR adds support for hybrid search functionality to the Redis search client, enabling combined text and vector similarity searches. The implementation introduces new query types, result parsers, and comprehensive test coverage.

Key changes:

New hybrid query classes for combining text search and vector similarity operations
Hybrid search command execution and result parsing
Post-processing configuration for filtering, sorting, and aggregating results

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
tests/test_search.py	Comprehensive test suite covering hybrid search functionality including various query types, filters, and post-processing options
redis/connection.py	Minor whitespace cleanup (removed blank line)
redis/commands/search/hybrid_query.py	New file implementing hybrid query classes and post-processing configuration
redis/commands/search/commands.py	Added hybrid_search method and result parsing logic
redis/commands/search/init.py	Registered hybrid search command parser

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

redis/commands/search/commands.py Outdated Show resolved Hide resolved

tests/test_search.py Outdated Show resolved Hide resolved

tests/test_search.py Outdated Show resolved Hide resolved

petyaslavova added the feature label

petyaslavova force-pushed the ps_add_hybrid_search branch 3 times, most recently from 1e8345a to 3b3b4f5 Compare

October 29, 2025 08:13

htemelski-redis mentioned this pull request

Added hybrid search command redis/go-redis#3573

Open

petyaslavova marked this pull request as draft

October 29, 2025 08:17

petyaslavova force-pushed the ps_add_hybrid_search branch from 3b3b4f5 to 551315b Compare

November 3, 2025 10:24

petyaslavova changed the title ~~Adding support for hybrid search.~~ Adding support for HYBRID search.

petyaslavova requested a review from Copilot

November 3, 2025 12:39

Copilot AI reviewed

View reviewed changes

Contributor

Copilot AI left a comment

Pull Request Overview

Copilot reviewed 6 out of 7 changed files in this pull request and generated 8 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redis/commands/search/hybrid_query.py Outdated Show resolved Hide resolved

tests/test_asyncio/test_search.py Show resolved Hide resolved

tests/test_asyncio/test_search.py Outdated Show resolved Hide resolved

tests/test_search.py Outdated Show resolved Hide resolved

tests/test_search.py Show resolved Hide resolved

tests/test_asyncio/test_search.py Show resolved Hide resolved

tests/test_search.py Show resolved Hide resolved

redis/commands/search/hybrid_query.py Show resolved Hide resolved

petyaslavova force-pushed the ps_add_hybrid_search branch 2 times, most recently from 0ea7eec to fc7c324 Compare

November 4, 2025 10:55

petyaslavova marked this pull request as ready for review

November 4, 2025 10:58

petyaslavova requested a review from Copilot

November 4, 2025 11:36

Copilot AI reviewed

View reviewed changes

Contributor

Copilot AI left a comment

Pull Request Overview

Copilot reviewed 5 out of 6 changed files in this pull request and generated 4 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

redis/commands/search/commands.py Show resolved Hide resolved

redis/commands/search/hybrid_query.py Show resolved Hide resolved

redis/commands/search/commands.py Show resolved Hide resolved

redis/commands/search/commands.py Show resolved Hide resolved

petyaslavova requested a review from vladvildanov

November 4, 2025 12:03

petyaslavova force-pushed the ps_add_hybrid_search branch from ce28c55 to ae531b0 Compare

November 4, 2025 12:51

vladvildanov requested changes

View reviewed changes

redis/commands/search/hybrid_query.py Outdated

    
                  def scorer(self, scorer: str) -> "HybridSearchQuery":

                      """

                      Scoring algorithm for text search query.

                      Allowed values are "TFIDF" or  "BM25"

Collaborator

vladvildanov Nov 4, 2025

Allowed values are type?: 'BM25' | 'TFIDF' | 'DISMAX' | 'DOCSCORE' according to the design. Also maybe it makes sense to keep value as Enum so we don't need to add validation around it?

Collaborator Author

petyaslavova Nov 4, 2025

Tested the other options - they also work + found some other that are not listed in the spec and are supported. I would prefer to leave it as string and even not be too detailed in the docstring - I am not sure what are all of the actually supported values...

redis/commands/search/hybrid_query.py

    
                      """Return the query string of this query object."""

                      return self._query_string

                  def scorer(self, scorer: str) -> "HybridSearchQuery":

Collaborator

vladvildanov Nov 4, 2025

Apart of scorer algorithm, scorer also supports named parameters

SCORER algorithm params...

Collaborator Author

petyaslavova Nov 4, 2025

Not for this version - currently only the scorer name can be provided.

redis/commands/search/hybrid_query.py

    
                      Add search method parameters to the query.

                      Args:

                          method: Vector search method name. Supported values are "KNN" or "RANGE".

Collaborator

vladvildanov Nov 4, 2025

Enum value could be used

Collaborator Author

petyaslavova Nov 4, 2025

Done.

redis/commands/search/hybrid_query.py

    
                      Args:

                          method: Vector search method name. Supported values are "KNN" or "RANGE".

                          kwargs: Search method parameters. Use the param names for keys and the

                              values for the values. Example: {"K": 10, "EF_RUNTIME": 100}.

Collaborator

vladvildanov Nov 4, 2025

K is a required argument, whereas EF_RUNTIME is optional. Makes sense to add some validation around it

Collaborator Author

petyaslavova Nov 4, 2025

I would prefer not to add validation of the inputs, and this way allow more flexibility for future server api changes, but I have added more details in the docstrings

redis/commands/search/hybrid_query.py

    
                          conditions: Filter conditions.

                      """

                      args = [conditions]

                      Filter.__init__(self, "FILTER", *args)

Collaborator

vladvildanov Nov 4, 2025

Filters are way more complex then this.

[FILTER] "<filter-expression>"
       [POLICY [ADHOC/BATCHES/ACORN]]
           [BATCHES BATCH_SIZE <batch-size-value>]

redis/commands/search/hybrid_query.py Outdated

    
                  def combine(

                      self,

                      method: Literal["RRF", "LINEAR"],

Collaborator

vladvildanov Nov 4, 2025

Design also specifies third one: FUNCTION. PRD also mentioned it

Collaborator Author

petyaslavova Nov 4, 2025

It is not supported in this version.

redis/commands/search/hybrid_query.py Outdated

    
                      Args:

                          method: The combine method to use - RRF or LINEAR.

                          kwargs: Additional combine parameters.

Collaborator

vladvildanov Nov 4, 2025

Same problem here, isn't really clear which parameters could be used with which method

redis/commands/search/hybrid_query.py

    
                      ret = ["GROUPBY", str(len(fields)), *fields]

                      for reducer in reducers:

                          ret += ["REDUCE", reducer.NAME, str(len(reducer.args))]

Collaborator

vladvildanov Nov 4, 2025

So it needs to add REDUCE for each reducer? From PRD it seems that it accepts count argument and single REDUCE keyword. Also, it doesn't says about aliases. Correct me if I'm wrong

[GROUPBY count field... REDUCE function count...]

redis/commands/search/hybrid_query.py

    
                              the alias for the projection, and the value is the projection

                              expression itself, for example `apply(square_root="sqrt(@foo)")`.

                      """

                      for alias, expr in kwexpr.items():

Collaborator

vladvildanov Nov 4, 2025

PRD also doesn't specifies an option to have multiple APPLY keywords

Collaborator Author

petyaslavova Nov 4, 2025

But it works... And there is no other way to apply more than one field transformation except to have the apply twice or more. There is a test covering that case.

redis/commands/search/hybrid_query.py

    
                      return self

                  def sort_by(self, *sortby: "SortbyField") -> Self:

Collaborator

vladvildanov Nov 4, 2025

Looks like SortByField is missing WITHCOUNT option

Collaborator Author

petyaslavova Nov 4, 2025

I can't find such an option in the spec.

petyaslavova added 4 commits

November 4, 2025 18:27


          Adding support for hybrid search.

6f674fc


          Adding YIELD_SCORE_AS tests and clearing up support for the keyword

b5fb7d8


          Removing commented test code.

bd777ab


          Applying review comments - part 1

a0985b4

petyaslavova force-pushed the ps_add_hybrid_search branch from ae531b0 to a0985b4 Compare

November 4, 2025 18:04


          Fixing linters

12b153e

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

feature