Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

cmake: don't fail on GGML_CPU=OFF ggml changes relating to the ggml tensor library for machine learning
#11457 opened Jan 27, 2025 by someone13574 Loading…
ggml : x2 speed for WASM by optimizing SIMD ggml changes relating to the ggml tensor library for machine learning
#11453 opened Jan 27, 2025 by ngxson Loading…
llama: Add support for RWKV v7 architecture Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs python python script changes SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related Vulkan Issues specific to the Vulkan backend
#11452 opened Jan 27, 2025 by MollySophia Draft
2 tasks
Optimized DeepSeek V2/V3 implementation (MLA) python python script changes
#11446 opened Jan 27, 2025 by fairydreaming Loading…
Fix graph for RWKV6Qwen2
#11445 opened Jan 27, 2025 by MollySophia Loading…
docker: add perplexity and bench commands to full image devops improvements to build systems and github actions
#11438 opened Jan 26, 2025 by rare-magma Loading…
docker: allow installing pip packages system-wide devops improvements to build systems and github actions
#11437 opened Jan 26, 2025 by rare-magma Loading…
vulkan: Catch pipeline creation failure and print an error message ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#11436 opened Jan 26, 2025 by jeffbolznv Loading…
Llama-bench: allow benchmarking lora impact examples
#11410 opened Jan 25, 2025 by IMbackK Loading…
ci : allow creating artifacts on PRs on demand artifacts Creates artifacts for pull requests devops improvements to build systems and github actions
#11398 opened Jan 24, 2025 by slaren Draft
llama : add option to override model tensor buffers demo Demonstrate some concept or idea, not intended to be merged need feedback Testing and feedback with results are needed
#11397 opened Jan 24, 2025 by slaren Draft
docs: update fedora cuda guide for 12.8 release documentation Improvements or additions to documentation
#11393 opened Jan 24, 2025 by teihome Loading…
ggml-cpu: Add CPU backend support for KleidiAI library ggml changes relating to the ggml tensor library for machine learning
#11390 opened Jan 24, 2025 by chaxu01 Loading…
convert_hf_to_gguf: fix typo python python script changes
#11377 opened Jan 23, 2025 by stevegrubb Loading…
vulkan: implement initial support for IQ2 and IQ3 quantizations ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#11360 opened Jan 22, 2025 by remyoudompheng Loading…
gguf_convert_endian.py: implement byteswapping for q4_k and q6_k python python script changes
#11349 opened Jan 22, 2025 by AlekseiNikiforovIBM Loading…
cpu_pnp_strategy changes ggml changes relating to the ggml tensor library for machine learning
#11326 opened Jan 21, 2025 by savesanketsw Draft
cmake: refined conditions for math library linking on windows ggml changes relating to the ggml tensor library for machine learning
#11312 opened Jan 20, 2025 by Xarbirus Loading…
ggml: reserve in gguf_writer and added const pointers as params ggml changes relating to the ggml tensor library for machine learning
#11297 opened Jan 18, 2025 by GermanAizek Loading…
Removed const references for simple types and structures less 16 bytes ggml changes relating to the ggml tensor library for machine learning
#11294 opened Jan 18, 2025 by GermanAizek Loading…
llama : second attempt to refactor vision API examples python python script changes server
#11292 opened Jan 18, 2025 by ngxson Draft
1 of 5 tasks
Align structures for 64bit, reorder params and ignore error-warn for Clang 19 ggml changes relating to the ggml tensor library for machine learning
#11291 opened Jan 18, 2025 by GermanAizek Loading…
SYCL: SOFTMAX F16 mask support and other fixes ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language testing Everything test related
#11261 opened Jan 16, 2025 by qnixsynapse Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.