本地化运行大模型测试

用笔记本试着跑了一下qwen3 8B模型,速度感人;
deepseek-r1:8b-0528-qwen3-q8_0
速度大概 4.0 tokens/s左右,不插电掉到1.6 tokens/s左右
基本不可用,一个字一个字的蹦…

硬件

华为笔记本matebook 13s
处理器:Intel Core i5 11300H @ 3.10GHz 睿频4.0GHz
内存:LPDDR4 16G 3733 MH

软件

操作系统:Microsoft Windows 11 Home , Version 23H2,
软件:Ollama 0.9.2
运行 deepseek-r1:8b-0528-qwen3-q8_0

模型deepseek-r1:8b

速度大概 4.0 tokens/s左右,基本不可用,一个字一个字的蹦…


total duration: 42.9621588s
load duration: 18.0573ms
prompt eval count: 15 token(s)
prompt eval duration: 1.1304521s
prompt eval rate: 13.27 tokens/s
eval count: 173 token(s)
eval duration: 41.8129173s
eval rate: 4.14 tokens/s


插电(正常)
3.7-3.9GHz,60%-70%占用

total duration: 4m4.9069496s
load duration: 19.5485ms
prompt eval count: 52 token(s)
prompt eval duration: 2.653565s
prompt eval rate: 19.60 tokens/s
eval count: 983 token(s)
eval duration: 4m2.2065187s
eval rate: 4.06 tokens/s


低优先级,插电
3.7-3.9GHz,60%-70%占用

total duration: 2m34.7337782s
load duration: 21.9906ms
prompt eval count: 1430 token(s)
prompt eval duration: 50.8199411s
prompt eval rate: 28.14 tokens/s
eval count: 415 token(s)
eval duration: 1m43.8479937s
eval rate: 4.00 tokens/s


未插电,节能模式
1.3GHz,20%占用

total duration: 13m43.4436181s
load duration: 74.8943ms
prompt eval count: 685 token(s)
prompt eval duration: 2m1.6321035s
prompt eval rate: 5.63 tokens/s
eval count: 1149 token(s)
eval duration: 11m41.6676697s
eval rate: 1.64 tokens/s


模型 qwen3:4b-fp16

速度略快一点,大概4.6 tokens/s,基本不可用,一个字一个字的蹦…


插电,低优先级
3.7-3.9GHz,60%-70%占用

total duration: 4m24.0887835s
load duration: 19.7498ms
prompt eval count: 14 token(s)
prompt eval duration: 707.2566ms
prompt eval rate: 19.79 tokens/s
eval count: 1223 token(s)
eval duration: 4m23.3607047s
eval rate: 4.64 tokens/s


total duration: 3m52.060574s
load duration: 9.0139398s
prompt eval count: 784 token(s)
prompt eval duration: 28.3682621s
prompt eval rate: 27.64 tokens/s
eval count: 870 token(s)
eval duration: 3m14.6603767s
eval rate: 4.47 tokens/s


DMI Processor
manufacturer Intel(R) Corporation
model 11th Gen Intel(R) Core(TM) i5-11300H @ 3.10GHz
clock speed 3100.0 MHz
FSB speed 100.0 MHz
multiplier 31.0x
max clock speed 4400.0 MHz


Windows Version Microsoft Windows 11 Home China (x64), Version 23H2, Build 22631.5472
Windows Installation Date 9/4/2022
DirectX Version 12.0


Number of cores		4 (max 4)
Number of threads	8 (max 8)
Manufacturer		GenuineIntel
Name			Intel Core i5 11300H
Codename		Tiger Lake-U
Specification		11th Gen Intel(R) Core(TM) i5-11300H @ 3.10GHz
Package (platform ID)	Socket 1449 FCBGA (0x7)
CPUID			6.C.1
Extended CPUID		6.8C
Core Stepping		B1
Technology		10 nm
TDP Limit		35.0 Watts
Tjmax			100.0 癈
Core Speed		1397.8 MHz
Multiplier x Bus Speed	14.0 x 99.8 MHz
Base frequency (cores)	99.8 MHz
Stock frequency		3100 MHz
Max frequency		4400 MHz
Instructions sets	MMX, SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, EM64T, VT-x, AES, AVX, AVX2, AVX512 (DQ, BW, VL, CD, IFMA, VBMI, VBMI2, VNNI, BITALG, VPOPCNTDQ, VP2INTERSECT), FMA3, SHA
Microcode Revision	0x8A
L1 Data cache		4 x 48 KB (12-way, 64-byte line)
L1 Instruction cache	4 x 32 KB (8-way, 64-byte line)
L2 cache		4 x 1.25 MB (20-way, 64-byte line)
L3 cache		8 MB (8-way, 64-byte line)
Max CPUID level		0000001Bh
Max CPUID ext. level	80000008h
FID/VID Control		yes


Turbo Mode		supported, enabled
Max non-turbo ratio	31x
Max turbo ratio		44x
Max efficiency ratio	4x
Min operating ratio	4x
Speedshift		Autonomous
O/C bins		none
Power Max (PL1)		45.00 W
PL1 Time Window		28.00 s
Short Power Max (PL2)	64.00 W
Max Peak Power (PL4)	121.00 W
Ratio 1 core		44x
Ratio 2 cores		44x
Ratio 3 cores		40x
Ratio 4 cores		40x
Ratio 5 cores		40x
Ratio 6 cores		40x
Ratio 7 cores		40x
Ratio 8 cores		40x
TDP Level		35.0 W @ 31x
TDP Level		28.0 W @ 26x

DMI Physical Memory Array
location Motherboard
usage System Memory
correction None
max capacity 16 GB
max# of devices 8

DMI Memory Device
designation ChannelA-DIMM0
format Row of chips
type LPDDR4
total width 64 bits
data width 64 bits
size 2 GB
speed 3733 MHz
manufacturer Micron Technology
part number 53E1G32D2NP-046
serial number 00000000
voltage 0.600000
manufacturer id 0x2C00
product id 0x0