-
Notifications
You must be signed in to change notification settings - Fork 9.8k
Pull requests: ggerganov/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
ggml : add support for dynamic loading of backends
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#10469
opened Nov 23, 2024 by
slaren
Loading…
2 tasks done
vulkan: Handle GPUs with less shared memory
#10468
opened Nov 23, 2024 by
jeffbolznv
Loading…
2 of 4 tasks
vulkan: optimize Q2_K and Q3_K mul_mat_vec
#10459
opened Nov 23, 2024 by
jeffbolznv
Loading…
2 of 4 tasks
server : add speculative decoding support
examples
server
#10455
opened Nov 22, 2024 by
ggerganov
Loading…
2 of 5 tasks
[CANN] Improve the Inferencing Performance for Ascend NPU Device
#10454
opened Nov 22, 2024 by
shen-shanshan
Loading…
2 of 4 tasks
llava: return false instead of exit
examples
#10452
opened Nov 22, 2024 by
tinglou
Loading…
2 of 4 tasks
fix: ggml: fix vulkan-shaders-gen build
#10448
opened Nov 22, 2024 by
sparkleholic
Loading…
2 of 4 tasks
Fix --no-clean for vulkan-shaders-gen
#10445
opened Nov 21, 2024 by
netrunnereve
Loading…
2 of 4 tasks
Integrating llama.cpp with Microsoft Word
#10443
opened Nov 21, 2024 by
GPTLocalhost
Loading…
2 of 4 tasks
vulkan: define all quant data structures in types.comp
#10440
opened Nov 20, 2024 by
jeffbolznv
Loading…
2 of 4 tasks
sycl : offload of get_rows set to 0
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10432
opened Nov 20, 2024 by
Alcpz
Loading…
2 of 4 tasks
bug-fix: snprintf prints NULL in place of the last character
examples
server
#10419
opened Nov 20, 2024 by
kallewoof
Loading…
2 of 4 tasks
server : replace behave with pytest
devops
improvements to build systems and github actions
examples
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
server
#10416
opened Nov 19, 2024 by
ngxson
Loading…
5 tasks done
sycl : permuted mul_mat through oneMKL
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#10408
opened Nov 19, 2024 by
Alcpz
Loading…
2 of 4 tasks
server: Fix the status of finish_reason if the stream value is False
examples
server
#10382
opened Nov 18, 2024 by
SeongBeomLEE
Loading…
2 of 4 tasks
speculative : refactor and add a simpler example
examples
server
testing
Everything test related
#10362
opened Nov 17, 2024 by
ggerganov
Loading…
Add support for Qwen2VL
build
Compilation issues
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
python
python script changes
common: compile shared lib, and export some c functions
#10353
opened Nov 17, 2024 by
KenForever1
Loading…
2 of 4 tasks
Refactor/tinyblas
build
Compilation issues
demo
Demonstrate some concept or idea, not intended to be merged
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Add complete implementation of the classical PCA algorithm with covar…
examples
#10315
opened Nov 15, 2024 by
nPr0nn
Loading…
2 of 4 tasks
chore : Fix the error when compiling rocm build on windows using cmake
documentation
Improvements or additions to documentation
#10310
opened Nov 15, 2024 by
cocochick
Loading…
2 of 4 tasks
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.