feat: sync llama.cpp to b9101 by github-actions[bot] · Pull Request #339 · mybigday/llama.rn

github-actions · 2026-05-10T04:05:46Z

🤖 Automated llama.cpp sync

This PR was automatically created/updated by the daily sync workflow.

Changes:

Verification:

📋 llama.cpp changes (b9084 → b9101)

389ff61 server : print warning when HTTP timeout exceeded (#22907)
2e97c5f backend sampling: support returning post-sampling probs (#22622)
5d5d2e1 vendor : update cpp-httplib to 0.43.4 (#22888)
2b2babd ggml-virtgpu : include missing mutex header (#22810)
0b04728 sync : ggml
efbada9 ggml : bump version to 0.11.1 (ggml/1484)
f3c3e0e internal AllReduce kernel for CUDA provider (#22299)
5755a10 model : fix model type check for granite/llama3 and deepseek2/glm4.7 lite (#22870)
1e5ad35 model : add sarvam_moe architecture support (#20275)
65d7a8b devops : updated Nix systems (#22869)
00d56b1 docker : upgraded the default intel compute-runtime version (#22567)
5757c4d cmake : update BoringSSL to 0.20260508.0 (#22839)
e20b839 SYCL: reduce allocation overhead during flash attention (#22732)
fd89556 [SYCL] Add BF16 support to GET_ROWS operation (#21391)
6048993 sycl: Q5_K reorder MMVQ/dequant + Q8_0 reorder MMVQ path (#22152)
4a4f819 sycl: Battlemage AOT build via spir64_gen + MMQ subgroup annotations (#22147)
046e284 Add flash attention MMA / Tiles to support MiMo-V2.5 (#22812)

Please review and merge if all checks pass.

github-actions Bot added 2 commits May 11, 2026 03:42

chore: update llama.cpp to b9101 (submodule ref)

8897134

chore(sync): update cpp/ directory after llama.cpp b9101 bootstrap

1cb94f4

github-actions Bot force-pushed the auto/sync-llama.cpp branch from ae507fb to 1cb94f4 Compare May 11, 2026 03:59

github-actions Bot changed the title ~~feat: sync llama.cpp to b9093~~ feat: sync llama.cpp to b9101 May 11, 2026

jhen0409 closed this May 18, 2026

jhen0409 deleted the auto/sync-llama.cpp branch May 18, 2026 04:50

Provide feedback