Skip to content

Brassai Kao’s Tech Weave

Blog
- AboutCertification
Apps
Tech
Tips
Contact

搜尋

Run 35B LLM Model by GTX 3050 8GB

地端模型真的越來越猛了…

用 GTX 3050 8GB 跑 35B (IQ4_NL) 模型，

還能有 30+ t/s 的速度

>llama-server.exe -m models\Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-IQ4_NL.gguf –mmproj models\mmproj-Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive-f16.gguf -c 131072 -np 1 -t 6 –flash-attn on –image-min-tokens 1024 –no-mmap

Qwen 3.6 讓 8GB 顯卡實現 35B 模型的 AI Agent.pdf Download

Share this:

X
Facebook
Email
LinkedIn
Tumblr
Telegram
WhatsApp
Print
Reddit
Pinterest
Mastodon
Nextdoor
X

Like Loading…

←2026全球汽車產業趨勢分析

Docker Compose to MicroK8S→

留言

Leave a comment Cancel reply

Δ

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Exploring technology with a human touch—innovation, accessibility, and practical insights to enhance everyday life.

Facebook
YouTube

Privacy & Cookies: This site uses cookies. By continuing to use this website, you agree to their use.
To find out more, including how to control cookies, see here: Cookie Policy

Loading Comments...

Write a Comment...

Email (Required)

Name (Required)

Website

Comment
Reblog
Subscribe Subscribed
- Brassai Kao’s Tech Weave
- Already have a WordPress.com account? Log in now.

%d