-
Notifications
You must be signed in to change notification settings - Fork 330
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Updates to Clemsciences's spacy process (#1239)
* Added spaCy process * Improved spaCy to CLTK wrapper * Use correct Token attributes * Use spaCy download function instead of shell command * Update poetry.lock * Removed unrelated changes * Fixed SpacyWrapper, StanzaWrapper and download_all_models.py * Improved SpacyWrapper * update dependencies * demo notebook not working * Added spaCy process * Improved spaCy to CLTK wrapper * Use correct Token attributes * Use spaCy download function instead of shell command * Removed unrelated changes * Fixed SpacyWrapper, StanzaWrapper and download_all_models.py * Improved SpacyWrapper * Improved SpacyWrapper * Renamed spacy_dep.py to spacy_wrapper.py and lint fixes * add morphology from spacy to cltk Doc * downgrade spacy to 3.6.1 * Download spaCy model if the model is absent * update dev dependencies, improve spacy wrapper * spacy wrapper working * fix .get_dependencies() * make e2e work with LatinCy * re-add Latin Stops Process * add spacy dl to build script * trigger CI rerun * load spacy directly * dl spacy model with subprocess * load model after dl * load w/ spacy (wrapper seems to err * don't check only dl * bump spacy * why old spacy on ci? * deepcopy to copy * basic lat tests pass * rewrite bad UD types from Mood to VerbForm * finish more cleanup or LatinCy release * add citation printing * rename latincy proc --------- Co-authored-by: Clément Besnier <[email protected]> Co-authored-by: Clément Besnier <[email protected]>
- Loading branch information
1 parent
0bca52a
commit cdc278e
Showing
39 changed files
with
2,302 additions
and
1,967 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,6 +1,6 @@ | ||
[tool.poetry] | ||
name = "cltk" | ||
version = "1.1.7" | ||
version = "1.2.0" | ||
description = "The Classical Language Toolkit" | ||
license = "MIT" | ||
authors = ["Kyle P. Johnson <[email protected]>", "Patrick J. Burns <[email protected]>", "John Stewart <[email protected]>", "Todd Cook <[email protected]>", "Clément Besnier <[email protected]>", "William J. B. Mattingly <https://github.com/wjbmattingly>"] | ||
|
@@ -41,7 +41,7 @@ rapidfuzz = "^3.4.0" | |
stanza = "^1.6.0" | ||
nltk = "^3.7" | ||
stringcase = "^1.2" | ||
spacy = "^3.6.1" | ||
spacy = "3.7.2" | ||
PyYAML = "^6.0.0" | ||
scikit-learn = "^1.0.2" | ||
# Note: Adding torch like this should not be necessary, | ||
|
@@ -51,23 +51,23 @@ scikit-learn = "^1.0.2" | |
torch = ">=2.0.0, !=2.0.1, !=2.1.0" | ||
|
||
[tool.poetry.dev-dependencies] | ||
pytest = "^7.1.1" | ||
pytest = "^7.4.3" | ||
nose = "^1.3" | ||
ipython = "^8.2" | ||
pylint = "^2.13.5" | ||
pylint = "^3.0.3" | ||
sphinx = "^4.5.0" | ||
coverage = "^6.3.2" | ||
coverage = "^7.3.4" | ||
pylint-json2html = "^0.4.0" | ||
tox = "^3.24.5" | ||
tox-pyenv = "^1.1" | ||
pytest-cov = "^3.0" | ||
rtd-poetry = "^0.1.0" | ||
sphinx-autodoc-typehints = "^1.17" | ||
pre-commit = "2.18.1" | ||
mypy = "^0.942" | ||
pre-commit = "3.6.0" | ||
mypy = "^1.8.0" | ||
lxml = "^4.9" | ||
black = "^22.3.0" | ||
isort = "^5.10.1" | ||
black = "^23.12.1" | ||
isort = "^5.13.2" | ||
|
||
[tool.poetry.group.dev.dependencies] | ||
jupyter = "^1.0.0" | ||
|
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.