ChunkFetcher object that can apply limits including max_per_interview #3231

jrochkind · 2025-12-17T18:57:29Z

We fetch chunks with embedding vector similarity, in pgvector, using the neighbor gem. It's just one line of code -- but we start by extracting it to a ChunkFetcher service object so we can add more complicated code, primarily llimits and exclusions.

The ability to say "not including these Chunks" (use case: that I already fetched), or "not including these Intevrviews" (same use case) is pretty easy ActiveRecord.

Harder, we want ot say "give me the top-ranked chunks, but no more than N-per interview." Googling and ChatGPT showed me there's a way to do that with a sub-query Common Table Expression (CTE) using ROW_NUMBER "window_function" with aggregates... phew! Fairly straightforward after I figured it out, then I figured out how to use ActiveRecord to generically wrap a query as a sub-query in larger query, so it could be "that query but limited to top-2 per interviewee".

Basic format of the SQL (as given to me by claude, ha) is:

WITH ranked_chunks AS (
      SELECT 
        chunks.*,
        chunks.embedding <=> ? as distance,
        ROW_NUMBER() OVER (PARTITION BY document_id ORDER BY chunks.embedding <=> ?) as doc_rank
      FROM chunks
      ORDER BY chunks.embedding <=> ?
      LIMIT <big limit to get enough to choose from>
)
    SELECT *
    FROM ranked_chunks
    WHERE doc_rank <= 2
    ORDER BY distance
    LIMIT <actual limit>

You can ask chatgpt or claude for more info on it. :)

Then use it, to expand chunks to Claude

After some experimentation, I think this is a fine point to test at:

Fetch 8 closest-vector chunks
Then fetch 8 more, closest that are only one-per-interview, not including any interviews from first 8

(It's possible point of diminishing returns is even a bit fewer chunks; adding TONS more chunks did not seem to help, see wiki).

… enforced

jrochkind added 6 commits December 17, 2025 15:20

extract ChunkFetcher, so we can later add on more functionality

3744980

add limit_per_document to ChunkFetcher

a657781

add exclude_chunks to ChunkFetcher

12b5c4c

ChunkFetcher can exclude interviews

482b742

change name to max_per_interview

82cdcca

add missing includes

5fe8c76

jrochkind force-pushed the chunk_fetcher branch 2 times, most recently from aba9004 to 073be58 Compare December 17, 2025 20:28

ClaudeInteractor fetch more chunks, but with some interview diversity…

e13dd88

… enforced

jrochkind force-pushed the chunk_fetcher branch from 073be58 to e13dd88 Compare December 17, 2025 20:33

eddierubeiz approved these changes Dec 18, 2025

View reviewed changes

jrochkind marked this pull request as ready for review December 18, 2025 16:44

eddierubeiz merged commit fa6ee2a into master Dec 18, 2025
1 check passed

eddierubeiz deleted the chunk_fetcher branch December 18, 2025 17:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ChunkFetcher object that can apply limits including max_per_interview #3231

ChunkFetcher object that can apply limits including max_per_interview #3231

Uh oh!

jrochkind commented Dec 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ChunkFetcher object that can apply limits including max_per_interview #3231

ChunkFetcher object that can apply limits including max_per_interview #3231

Uh oh!

Conversation

jrochkind commented Dec 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Then use it, to expand chunks to Claude

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jrochkind commented Dec 17, 2025 •

edited

Loading