-
Notifications
You must be signed in to change notification settings - Fork 184
Issues: huggingface/text-embeddings-inference
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Nulls instead of vector for Alibaba-NLP/gte-multilingual-base on T4 GPU
#439
opened Nov 25, 2024 by
superchar
2 of 4 tasks
Sagemaker Asynchronous TEI Endpoints Fail On Requests Greater Than 2mb
#433
opened Nov 7, 2024 by
ma3842
2 of 4 tasks
Run TEI model on CPU fails (says Cuda f16 and flash attention is required)
#431
opened Oct 25, 2024 by
Astlaan
2 of 4 tasks
TEI Process dying on Sagemaker Endpoint with g4dn.xlarge
#429
opened Oct 24, 2024 by
BebehCodes
3 of 4 tasks
thread 'tokio-runtime-worker' panicked at /usr/src/backends/src/lib.rs:176:14
#424
opened Oct 14, 2024 by
jackli0127
Inconsistency in how different URL paths are handled (in inference endpoints)
#398
opened Sep 4, 2024 by
MoritzLaurer
4 tasks
dunzhang/stella_en_1.5B_v5 Maximum Token Limit Set to 512 Despite Model Capabilities
#396
opened Sep 4, 2024 by
taoari
2 of 4 tasks
Input validation error:
inputs
must have less than 32000 characters. Given: 67337
#394
opened Sep 3, 2024 by
ffalkenberg
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.