feat(serve): upgrade daily driver to qwen3.6-35b-a3b q6_k_xl
Switch `make serve` default to Qwen3.6 UD Q6_K_XL (32 GB, hybrid DeltaNet, near-lossless) and register it in the model catalog. Add --jinja to the llama-server launcher so tool/function calling works — without it clients silently ignore tool definitions advertised by the server.
This commit is contained in:
4
Makefile
4
Makefile
@@ -39,8 +39,8 @@ benchmark-compare: ## Compare two benchmark runs (usage: make benchmark-compare
|
||||
@bash bin/benchmark compare $(BEFORE) $(AFTER)
|
||||
|
||||
# --- Serve ---
|
||||
serve: ## Launch APEX I-Compact daily driver (2 slots, 256K ctx)
|
||||
@bash bin/serve -m Qwen3.5-35B-A3B-Claude-Distilled-APEX-I-Compact.gguf --parallel 2 --ctx 262144 $(ARGS)
|
||||
serve: ## Launch Qwen3.6-35B-A3B UD-Q6_K_XL daily driver (2 slots, 256K ctx)
|
||||
@bash bin/serve -m Qwen3.6-35B-A3B-UD-Q6_K_XL.gguf --parallel 2 --ctx 262144 $(ARGS)
|
||||
|
||||
serve-custom: ## Launch llama-server with custom model (ARGS="-m MODEL.gguf")
|
||||
@bash bin/serve $(ARGS)
|
||||
|
||||
Reference in New Issue
Block a user