-
Notifications
You must be signed in to change notification settings - Fork 13
chore: validate html descriptions, warn if invalid #196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
sirugh
wants to merge
5
commits into
main
Choose a base branch
from
validate-description
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from all commits
Commits
Show all changes
5 commits
Select commit
Hold shift + click to select a range
6b1d689
validate html descriptions, set to empty string if invalid
sirugh 3a5e53d
do not strip html
sirugh c036adb
Merge remote-tracking branch 'origin/main' into validate-description
sirugh 0250f6b
add validation tests, and fix renderer test
sirugh b572809
remove metaDescription validation since it is stripped of tags. Add t…
sirugh File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,77 @@ | ||
| /** | ||
| * Validates HTML syntax by checking for balanced opening and closing tags | ||
| * @param {string} html - The HTML string to validate | ||
| * @returns {Object} - { valid: boolean, reason: string } | ||
| */ | ||
| function validateHtml(html) { | ||
| if (typeof html !== 'string') { | ||
| return { valid: false, reason: 'Input must be a string' }; | ||
| } | ||
|
|
||
| if (html.trim() === '') { | ||
| return { valid: true, reason: 'Empty string is valid' }; | ||
| } | ||
|
|
||
| const stack = []; | ||
| const selfClosingTags = new Set([ | ||
| 'area', 'base', 'br', 'col', 'embed', 'hr', 'img', 'input', | ||
| 'link', 'meta', 'param', 'source', 'track', 'wbr' | ||
| ]); | ||
|
|
||
| // Regular expression to match HTML tags | ||
| const tagRegex = /<\/?([a-zA-Z][a-zA-Z0-9]*)\b[^>]*>/g; | ||
| let match; | ||
| let lineNumber = 1; | ||
| let charPosition = 0; | ||
|
|
||
| while ((match = tagRegex.exec(html)) !== null) { | ||
| const fullTag = match[0]; | ||
| const tagName = match[1].toLowerCase(); | ||
| const isClosingTag = fullTag.startsWith('</'); | ||
| const isSelfClosing = fullTag.endsWith('/>') || selfClosingTags.has(tagName); | ||
|
|
||
| // Calculate position for error reporting | ||
| const beforeMatch = html.substring(0, match.index); | ||
| lineNumber = beforeMatch.split('\n').length; | ||
| charPosition = match.index - beforeMatch.lastIndexOf('\n') - 1; | ||
|
|
||
| if (isSelfClosing) { | ||
| // Self-closing tags don't need to be balanced | ||
| continue; | ||
| } | ||
|
|
||
| if (isClosingTag) { | ||
| // Check if we have a matching opening tag | ||
| if (stack.length === 0) { | ||
| return { | ||
| valid: false, | ||
| reason: `Unexpected closing tag </${tagName}> at line ${lineNumber}, position ${charPosition}` | ||
| }; | ||
| } | ||
|
|
||
| const lastOpenTag = stack.pop(); | ||
| if (lastOpenTag !== tagName) { | ||
| return { | ||
| valid: false, | ||
| reason: `Mismatched tags: expected </${lastOpenTag}> but found </${tagName}> at line ${lineNumber}, position ${charPosition}` | ||
| }; | ||
| } | ||
| } else { | ||
| // Opening tag - push onto stack | ||
| stack.push(tagName); | ||
| } | ||
| } | ||
|
|
||
| // Check if any opening tags weren't closed | ||
| if (stack.length > 0) { | ||
| const unclosedTags = stack.reverse().join(', '); | ||
| return { | ||
| valid: false, | ||
| reason: `Unclosed tags: ${unclosedTags}` | ||
| }; | ||
| } | ||
|
|
||
| return { valid: true, reason: 'HTML is valid' }; | ||
| } | ||
|
|
||
| module.exports = { validateHtml } |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is this log sufficient to inform the user? Hopefully users do not have to experience what I did - tracing backwards from malformed UX through EDS CDN, Azure, and finally realizing it is the data itself that is the problem.
@duynguyen let me know if there's a better way to alert users to bad data.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the log message is good but it will potentially get lost among other logs.
IMO this should be tackled in a generic way, created #197