explosion spaCy Help-best-practices Discussions
📚 Help: Best practices Discussions
Discuss best practices, tips and tricks
📚 \n character tokenization and sentence segmentation
[feat / sentencizer] Feature: Sentencizer (rule-based sentence segmenter)
📚 Spacy 3 Sentence Segmentation
[feat / sentencizer] Feature: Sentencizer (rule-based sentence segmenter)
📚 dependencyMatcher.pipe
[feat / matcher] Feature: Token, phrase and dependency matcher
📚 Multi-word static vectors
[feat / vectors] Feature: Word vectors and similarity
📚 Best practice to keep the custom tokenizer and vocabulary when resuming training
[training] Training and updating models; [feat / config] Feature: Training config
📚 Speedup initialization of pipeline
[perf / speed] Performance: speed
📚 Multiple NER models with shared transformer
[feat / ner] Feature: Named Entity Recognizer; [feat / transformer] Feature: Transformer
📚 Masking PROPN when training/predicting with textcat
[enhancement] Feature requests and improvements; [training] Training and updating models
📚 Interaction between entity ruler and training, and other conceptual queries
[feat / matcher] Feature: Token, phrase and dependency matcher
📚 Multiple textcat_multilabel components in same spacy v3 pipeline
[feat / textcat] Feature: Text Classifier; [feat / config] Feature: Training config
📚 Best way to do Entity Linking with heterogeneous data
[feat / nel] Feature: Named Entity linking
📚 Real-life project examples from A to Z (free or paid for)
[usage] General spaCy usage
📚 Joining multiple pre-trained SpaCy models into one
[feat / pipeline] Feature: Processing pipeline and components; [feat / config] Feature: Training config
📚 spacy v3 pretrain / static vectors ndim mismatch
[feat / vectors] Feature: Word vectors and similarity; [v3.0] Related to v3.0; [feat / config] Feature: Training config
📚 Variability of words / phrases that are matching with specific label
[usage] General spaCy usage; [feat / ner] Feature: Named Entity Recognizer
📚 Data Augmentation for NER
[feat / ner] Feature: Named Entity Recognizer
📚 What is the best practice for setting custom extensions and force
[feat / doc] Feature: Doc, Span and Token objects
📚 Seeking advice on creating a PDF to text extraction pipeline component
[usage] General spaCy usage; [feat / pipeline] Feature: Processing pipeline and components
📚 Training sharing transformer layer
[feat / transformer] Feature: Transformer
📚 Restrict spaCy from creating NER parts-of-speech Spans in Doc instance.
[feat / pipeline] Feature: Processing pipeline and components
📚 How to configure when model training ends?
[feat / config] Feature: Training config; [faq] Frequently asked questions and solutions.
📚 How to gather timing information on every step of a pipeline?
[feat / pipeline] Feature: Processing pipeline and components; [perf / speed] Performance: speed
📚 Where to register extension attributes that don't belong to a specific component?
[usage] General spaCy usage; [feat / doc] Feature: Doc, Span and Token objects