Out now: spaCy v3.1.0! New pipelines for Catalan & Danish, SpanCategorizer for arbitrary overlapping spans, use predicted annotations during training, bug fixes & more #8632
ines
announced in
News & Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
✨ New features and improvements
SpanCategorizer
component for labeling arbitrary and potentially overlapping spans of text.[training.annotating_components]
config setting.EntityRecognizer
with known incorrect span annotations.README.md
based on the meta inspacy package
.For more details, see the New in v3.1 usage guide.
📦 New trained pipelines
ca_core_news_sm
ca_core_news_md
ca_core_news_lg
ca_core_news_trf
da_core_news_trf
spacy_version
in your model package meta to">=3.0.0,<3.2.0"
. If you run into degraded performance, retrain your pipeline with v3.1.spacy init fill-config
to update a v3.0 config for v3.1.[initialize.vectors]
.warnings.filterwarnings
or the new helper methodspacy.errors.filter_warning(action, error_msg='')
to manage warnings.For more information, see Notes on upgrading from v3.0.
🔴 Bug fixes
spacy ray
command works.debug data
.EntityLinker
robust for nO=None.minn
is not set.debug model
for transformers.ENT_KB_ID
inner
annotation.Doc.from_docs()
for all empty docs.textcat
with listener.ENT_ID
andNORM
toDocBin
strings.Span.as_doc
.Span
attrs writable.debug data
fortextcat
.DocBin
is too large.to/from_bytes
forKnowledgeBase
andEntityLinker
.Span.get_lca_matrix
.attrs.IDS
.spacy.batch_by_words.v1
.EntityRuler
:ent_ids
returns None for phrases.EntityRuler
.Doc
.Span.lemma_
.Example.from_dict
.Language.pipe
return values.Doc.from_docs
.textcat
with <2 labels.👥 Contributors
@aajanki, @adrianeboyd, @bodak, @bryant1410, @dhruvrnaik, @explosion-bot, @fhopp, @frascuchon, @graue70, @gtoffoli, @honnibal, @ines, @jacopofar, @jenojp, @jhroy, @jklaise, @juliensalinas, @kevinlu1248, @ldorigo, @mathcass, @meghanabhange, @michael-k, @narayanacharya6, @NirantK, @nsorros, @polm, @sevdimali, @svlandeg, @themrmax, @xadrianzetx, @yohasebe, @ZeeD
This discussion was created from the release v3.1.0: New pipelines for Catalan & Danish, SpanCategorizer for arbitrary overlapping spans, use predicted annotations during training, bug fixes & more.
Beta Was this translation helpful? Give feedback.
All reactions