Skip to content

feat: sync llama.cpp to b9101#339

Closed
github-actions[bot] wants to merge 2 commits into
mainfrom
auto/sync-llama.cpp
Closed

feat: sync llama.cpp to b9101#339
github-actions[bot] wants to merge 2 commits into
mainfrom
auto/sync-llama.cpp

Conversation

@github-actions
Copy link
Copy Markdown
Contributor

@github-actions github-actions Bot commented May 10, 2026

🤖 Automated llama.cpp sync

This PR was automatically created/updated by the daily sync workflow.

Changes:

  • Updated llama.cpp submodule from b9084 to b9101
  • Regenerated bindings and build files

Verification:

  • ✅ Bootstrap script completed successfully (including iOS Metal compilation)
  • ✅ iOS frameworks build completed successfully (macOS job)
  • ✅ Android libraries build completed successfully (Ubuntu job)
  • ✅ C++ unit tests passed
  • ✅ TypeScript build completed successfully
📋 llama.cpp changes (b9084 → b9101)
  • 389ff61 server : print warning when HTTP timeout exceeded (#22907)
  • 2e97c5f backend sampling: support returning post-sampling probs (#22622)
  • 5d5d2e1 vendor : update cpp-httplib to 0.43.4 (#22888)
  • 2b2babd ggml-virtgpu : include missing mutex header (#22810)
  • 0b04728 sync : ggml
  • efbada9 ggml : bump version to 0.11.1 (ggml/1484)
  • f3c3e0e internal AllReduce kernel for CUDA provider (#22299)
  • 5755a10 model : fix model type check for granite/llama3 and deepseek2/glm4.7 lite (#22870)
  • 1e5ad35 model : add sarvam_moe architecture support (#20275)
  • 65d7a8b devops : updated Nix systems (#22869)
  • 00d56b1 docker : upgraded the default intel compute-runtime version (#22567)
  • 5757c4d cmake : update BoringSSL to 0.20260508.0 (#22839)
  • e20b839 SYCL: reduce allocation overhead during flash attention (#22732)
  • fd89556 [SYCL] Add BF16 support to GET_ROWS operation (#21391)
  • 6048993 sycl: Q5_K reorder MMVQ/dequant + Q8_0 reorder MMVQ path (#22152)
  • 4a4f819 sycl: Battlemage AOT build via spir64_gen + MMQ subgroup annotations (#22147)
  • 046e284 Add flash attention MMA / Tiles to support MiMo-V2.5 (#22812)

Please review and merge if all checks pass.

@github-actions github-actions Bot force-pushed the auto/sync-llama.cpp branch from ae507fb to 1cb94f4 Compare May 11, 2026 03:59
@github-actions github-actions Bot changed the title feat: sync llama.cpp to b9093 feat: sync llama.cpp to b9101 May 11, 2026
@jhen0409 jhen0409 closed this May 18, 2026
@jhen0409 jhen0409 deleted the auto/sync-llama.cpp branch May 18, 2026 04:50
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant