-
Notifications
You must be signed in to change notification settings - Fork 186
citation file #1165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
citation file #1165
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Co-authored-by: Praateek Mahajan <[email protected]> Signed-off-by: L.B. <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Co-authored-by: Sarah Yurick <[email protected]> Signed-off-by: L.B. <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
Signed-off-by: Lawrence Lane <[email protected]>
- Resolve conflict by removing task-decontamination.md as no longer needed - Include all text curation documentation updates
Signed-off-by: Lawrence Lane <[email protected]>
|
Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually. Contributors can view more details about this message here. |
Signed-off-by: Lawrence Lane <[email protected]>
ayushdg
reviewed
Oct 3, 2025
Signed-off-by: Lawrence Lane <[email protected]>
auto-merge was automatically disabled
October 9, 2025 15:33
Pull Request is not mergeable
sarahyurick
approved these changes
Oct 13, 2025
lbliii
added a commit
to lbliii/NeMo-Curator
that referenced
this pull request
Oct 22, 2025
* text curation updates Signed-off-by: Lawrence Lane <[email protected]> * concepts Signed-off-by: Lawrence Lane <[email protected]> * remove synthetic docs not for this release Signed-off-by: Lawrence Lane <[email protected]> * updates Signed-off-by: Lawrence Lane <[email protected]> * text concepts and getting started changes Signed-off-by: Lawrence Lane <[email protected]> * links, concepts Signed-off-by: Lawrence Lane <[email protected]> * crosslinks Signed-off-by: Lawrence Lane <[email protected]> * quality assessment updates Signed-off-by: Lawrence Lane <[email protected]> * more cleanup Signed-off-by: Lawrence Lane <[email protected]> * semdedup Signed-off-by: Lawrence Lane <[email protected]> * example import cleanup Signed-off-by: Lawrence Lane <[email protected]> * concepts Signed-off-by: Lawrence Lane <[email protected]> * Update docs/about/concepts/text/data-acquisition-concepts.md Co-authored-by: Praateek Mahajan <[email protected]> Signed-off-by: L.B. <[email protected]> * feedback batch 1 Signed-off-by: Lawrence Lane <[email protected]> * feedback batch 2 Signed-off-by: Lawrence Lane <[email protected]> * file_paths="/path/to/jsonl_directory", Signed-off-by: Lawrence Lane <[email protected]> * revert removal of xenna for common crawl executors Signed-off-by: Lawrence Lane <[email protected]> * quickstart installation steps Signed-off-by: Lawrence Lane <[email protected]> * Update docs/about/concepts/text/data-acquisition-concepts.md Co-authored-by: Sarah Yurick <[email protected]> Signed-off-by: L.B. <[email protected]> * data loading concepts updates / simplification Signed-off-by: Lawrence Lane <[email protected]> * data processing feedback Signed-off-by: Lawrence Lane <[email protected]> * read-existing pg updates Signed-off-by: Lawrence Lane <[email protected]> * add-id updates Signed-off-by: Lawrence Lane <[email protected]> * dedup updates Signed-off-by: Lawrence Lane <[email protected]> * feedback Signed-off-by: Lawrence Lane <[email protected]> * citation file Signed-off-by: Lawrence Lane <[email protected]> * fix Signed-off-by: Lawrence Lane <[email protected]> * updates Signed-off-by: Lawrence Lane <[email protected]> --------- Signed-off-by: Lawrence Lane <[email protected]> Signed-off-by: L.B. <[email protected]> Co-authored-by: Praateek Mahajan <[email protected]> Co-authored-by: Sarah Yurick <[email protected]> Signed-off-by: Lawrence Lane <[email protected]>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
No description provided.