Issues: ggml-org/llama.cpp
- #12033 Eval bug: GGML_ASSERT(hparams.n_embd_head_k % ggml_blck_size(type_k) == 0) failed [bug-unconfirmed] (opened Feb 22, 2025 by AbdullahMPrograms)
- #12026 Misc. bug: Web-UI now unusably slow due to animations [bug-unconfirmed] (opened Feb 22, 2025 by clort81)
- #12021 Eval bug: unknown pre-tokenizer type: 'deepseek-r1-qwen' [bug-unconfirmed] (opened Feb 22, 2025 by wr131)
- #12013 Misc. bug: Concurrency Limitation: Only 6 Inferences Run Simultaneously When Setting --parallel > 6 [bug-unconfirmed] (opened Feb 21, 2025 by karanotsingyu)
- #12010 [CANN] Compile bug: no matching function for call to 'CastIntrinsicsImpl' [Ascend NPU] (opened Feb 21, 2025 by Cikaros)
- #12003 Eval bug: does llama.cpp support Intel AMX instruction? how to enable it [bug-unconfirmed] (opened Feb 21, 2025 by montagetao)
- #11992 Misc. bug: add tool_calls id in response in server [bug-unconfirmed] (opened Feb 21, 2025 by henryclw)
- #11985 Feature Request: add Kernel level verbose option [enhancement] (opened Feb 20, 2025 by 0400H)
- #11983 Misc. bug: llama-cli '--log-disable' parameter omits response [bug-unconfirmed] (opened Feb 20, 2025 by nmandic78)
- #11979 Eval bug: CANNOT LINK EXECUTABLE "./llama-cli": library "libomp.so" not found: needed by main executable [bug-unconfirmed] (opened Feb 20, 2025 by Krallbe68)
- #11976 GGML to GGUF FAIL: Quantized tensor bytes per row (5120) is not a multiple of Q2_K type size (84) (opened Feb 20, 2025 by chokoon123)
- #11975 tensor 'blk.25.ffn_down.weight' has invalid ggml type 42 (NONE) [bug-unconfirmed] (opened Feb 20, 2025 by evaninf)
- #11972 Misc. bug: Sporadic MUL_MAT Failures in test-backend-ops for Nvidia backend [bug-unconfirmed] (opened Feb 20, 2025 by ShanoToni)
- #11970 Misc. bug: The KV cache is sometimes truncated incorrectly when making v1/chat/completions API calls [bug] (opened Feb 20, 2025 by vnicolici)
- #11965 Eval bug: Ram boom after using llama-bench with cuda12.8 and deepseekr1q6 [bug-unconfirmed] (opened Feb 20, 2025 by Xxianna)
- #11957 Misc. bug: Rpc-server does not use opencl backend on Android. [bug-unconfirmed] (opened Feb 19, 2025 by belog2867)
- #11953 Misc. bug: Segmentation fault when importing model to opencl buffer [bug-unconfirmed] (opened Feb 19, 2025 by zhouzengming)