@ti-chi-bot
Member

This is an automated cherry-pick of #22033

First-time contributors' checklist

What is changed, added or deleted? (Required)

  • Updated getAllMdList to accept multiple TOC files and avoid duplicate links.
  • Modified filterCloudDoc.js to copy multiple TOC files to the temporary directory.
  • Improved regex patterns in merge_by_toc.py and utils.js for better matching of CustomContent tags.
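
For readers following along, the multi-TOC behavior listed above can be sketched roughly as follows. This is a minimal illustration, not the code from #22033: `collectMdLinks` is a hypothetical name, it takes TOC contents as strings rather than file paths, and stripping the leading slash is an assumed normalization.

```javascript
// Hypothetical helper (not the real getAllMdList): collects unique .md link
// targets from several TOC documents passed in as strings.
function collectMdLinks(tocContents) {
  // Matches "](path.md)" or "](path.md#anchor)" and captures the path only.
  const linkPattern = /\]\(([^)\s#]+\.md)(?:#[^)\s]*)?\)/g;
  const seen = new Set();
  for (const content of tocContents) {
    for (const match of content.matchAll(linkPattern)) {
      // Assumption: "/a.md" and "a.md" refer to the same file.
      seen.add(match[1].replace(/^\/+/, ""));
    }
  }
  // A Set keeps insertion order, so the first occurrence of each link wins.
  return [...seen];
}

const tocA = "- [Overview](/overview.md)\n- [Quick Start](/quick-start.md#step-1)";
const tocB = "- [Overview](/overview.md)\n- [Billing](/tidb-cloud/billing.md)";
const links = collectMdLinks([tocA, tocB]);
console.log(links);
```

Because `/overview.md` appears in both sample TOCs, it ends up in the result only once.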

Which TiDB version(s) do your changes apply to? (Required)

Tips for choosing the affected version(s):

By default, CHOOSE MASTER ONLY so your changes will be applied to the next TiDB major or minor releases. If your PR involves a product feature behavior change or a compatibility change, CHOOSE THE AFFECTED RELEASE BRANCH(ES) AND MASTER.

For details, see tips for choosing the affected versions.

  • master (the latest development version)
  • v9.0 (TiDB 9.0 versions)
  • v8.5 (TiDB 8.5 versions)
  • v8.1 (TiDB 8.1 versions)
  • v7.5 (TiDB 7.5 versions)
  • v7.1 (TiDB 7.1 versions)
  • v6.5 (TiDB 6.5 versions)
  • v6.1 (TiDB 6.1 versions)
  • v5.4 (TiDB 5.4 versions)

What is the related PR or file link(s)?

  • This PR is translated from:
  • Other reference link(s):

Do your changes match any of the following descriptions?

  • Delete files
  • Change aliases
  • Need modification after applied to another branch
  • Might cause conflicts after applied to another branch

@ti-chi-bot ti-chi-bot added do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. lgtm size/M Denotes a PR that changes 30-99 lines, ignoring generated files. type/cherry-pick-for-release-8.5 This PR is cherry-picked to release-8.5 from a source PR. labels Nov 10, 2025
@ti-chi-bot
Member Author

@shhdgit This PR has conflicts, so I have put it on hold.
Please resolve them or ask others to resolve them, then comment /unhold to remove the hold label.

@ti-chi-bot

ti-chi-bot bot commented Nov 10, 2025

@ti-chi-bot: If you want to know how to resolve the conflicts, please read the guide in the TiDB Dev Guide.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the ti-community-infra/tichi repository.

@gemini-code-assist

Summary of Changes

Hello @ti-chi-bot, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces significant refactoring to the documentation processing scripts, primarily focusing on improving the handling of Table of Contents (TOC) files and the parsing of custom content tags. The changes aim to enhance the flexibility and reliability of the documentation build process, especially for projects with multiple documentation versions or specific content filtering requirements, by allowing for the processing of multiple TOC files and more resilient regex matching.

Highlights

  • Enhanced TOC File Handling: The getAllMdList function has been updated to accept multiple Table of Contents (TOC) files, allowing for more flexible documentation processing and ensuring that duplicate links are automatically removed.
  • Improved Multi-TOC File Copying: The filterCloudDoc.js script now supports copying multiple specified TOC files to a temporary directory, streamlining the preparation of documentation for different TiDB Cloud versions.
  • Refined Regex Patterns: Regular expression patterns in both merge_by_toc.py and utils.js have been made more robust to accurately match CustomContent tags, even when they contain additional attributes, preventing parsing issues.
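
To make the regex highlight concrete, here is a minimal sketch of a pattern that still matches a CustomContent tag when extra attributes are present. It is not the pattern from merge_by_toc.py or utils.js: `removeCustomContent` and `escapeRegExp` are hypothetical names, and the `platform` and `plan` attributes in the sample are assumptions used only for illustration.

```javascript
// Escapes regex metacharacters so the platform name is matched literally.
function escapeRegExp(text) {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Hypothetical helper: removes every <CustomContent> block whose platform
// attribute equals `platform`, regardless of other attributes on the tag.
function removeCustomContent(markdown, platform) {
  const blockPattern = new RegExp(
    // [^>]* on both sides tolerates additional attributes in any position.
    `<CustomContent\\b[^>]*\\bplatform="${escapeRegExp(platform)}"[^>]*>` +
      // Lazy body match stops at the first closing tag.
      `[\\s\\S]*?</CustomContent>\\n?`,
    "g"
  );
  return markdown.replace(blockPattern, "");
}

const doc = [
  "Intro",
  '<CustomContent platform="tidb">',
  "Self-managed only",
  "</CustomContent>",
  '<CustomContent platform="tidb-cloud" plan="starter">',
  "Cloud only",
  "</CustomContent>",
  "Outro",
].join("\n");

const cloudDoc = removeCustomContent(doc, "tidb");
console.log(cloudDoc);
```

A stricter pattern that expects `platform="..."` to be immediately followed by `>` would miss the second tag in the sample, which is the kind of parsing issue the highlight describes.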

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request refactors TOC file handling to support multiple files and improves regex patterns for more robust matching. The regex enhancements in merge_by_toc.py and utils.js are good improvements. However, there are critical issues with unresolved merge conflicts in both scripts/filterCloudDoc.js and scripts/utils.js that will prevent the scripts from running. Additionally, the refactored getAllMdList function in utils.js has a regression, as it no longer cleans up file paths, which could lead to issues.

Comment on lines 47 to 63
<<<<<<< HEAD
  const allFilePaths = getAllCloudMdList();

  extractFilefromList(allFilePaths, "./", "./tmp");
  copySingleFileSync("TOC-tidb-cloud.md", "./tmp/TOC.md");
=======
  const existingTocFiles = tocFiles.filter((file) => fs.existsSync(file));
  const filteredLinkList = getAllMdList(existingTocFiles);

  extractFilefromList(filteredLinkList, ".", "./tmp");

  tocCopyTargets.forEach(({ src, dest }) => {
    if (fs.existsSync(src)) {
      copySingleFileSync(src, dest);
    }
  });
>>>>>>> 2adf999c1d (refactor: enhance TOC file handling and regex patterns in scripts (#22033))

critical

This file contains unresolved merge conflict markers (<<<<<<< HEAD, =======, >>>>>>>). This is a critical issue that will cause the script to fail. Please resolve the conflict by removing the markers and the obsolete code block.

  const existingTocFiles = tocFiles.filter((file) => fs.existsSync(file));
  const filteredLinkList = getAllMdList(existingTocFiles);

  extractFilefromList(filteredLinkList, ".", "./tmp");

  tocCopyTargets.forEach(({ src, dest }) => {
    if (fs.existsSync(src)) {
      copySingleFileSync(src, dest);
    }
  });
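
As a side note, leftover markers like the ones flagged above can be caught before merge with a small check. The following is a minimal sketch under stated assumptions, not part of this PR or of any existing CI job in the repository: `findConflictMarkers` is a hypothetical helper that scans a string for lines beginning with the seven-character Git conflict markers.

```javascript
// Hypothetical helper: returns the 1-based numbers of lines that start with
// a Git conflict marker (<<<<<<<, =======, or >>>>>>>).
function findConflictMarkers(source) {
  // Exactly seven marker characters, followed by whitespace or end of line.
  const markerPattern = /^(<{7}|={7}|>{7})(\s|$)/;
  const hits = [];
  source.split(/\r?\n/).forEach((line, index) => {
    if (markerPattern.test(line)) {
      hits.push(index + 1);
    }
  });
  return hits;
}

const sample = [
  "<<<<<<< HEAD",
  "const allFilePaths = getAllCloudMdList();",
  "=======",
  "const filteredLinkList = getAllMdList(existingTocFiles);",
  ">>>>>>> 2adf999c1d",
].join("\n");

const markerLines = findConflictMarkers(sample);
console.log(markerLines);
```

A non-empty result means the file still contains conflict markers and should not be merged as is.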

Member

@shhdgit shhdgit left a comment

lgtm.

@ti-chi-bot

ti-chi-bot bot commented Nov 11, 2025

@shhdgit: adding LGTM is restricted to approvers and reviewers in OWNERS files.

In response to this:

lgtm.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

@qiancai qiancai removed the do-not-merge/hold Indicates that a PR should not merge because someone has issued a /hold command. label Nov 11, 2025
@qiancai
Collaborator

qiancai commented Nov 11, 2025

/approve

@ti-chi-bot

ti-chi-bot bot commented Nov 11, 2025

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: qiancai

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@ti-chi-bot ti-chi-bot bot added the approved label Nov 11, 2025
@ti-chi-bot ti-chi-bot bot merged commit 196082e into pingcap:release-8.5 Nov 11, 2025
9 checks passed
@ti-chi-bot ti-chi-bot bot deleted the cherry-pick-22033-to-release-8.5 branch November 11, 2025 08:33
Labels

approved lgtm size/M Denotes a PR that changes 30-99 lines, ignoring generated files. type/cherry-pick-for-release-8.5 This PR is cherry-picked to release-8.5 from a source PR.