Skip to content

Commit

Permalink
Edit mixtral quantization config file (#114) (#1739)
Browse files Browse the repository at this point in the history
  • Loading branch information
dudilester authored Feb 5, 2025
1 parent 183a80d commit df43900
Showing 1 changed file with 2 additions and 5 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -3,10 +3,7 @@
"mode": "QUANTIZE",
"observer": "maxabs",
"scale_method": "maxabs_hw",
"allowlist": {"types": [], "names": ["gate","w1","w3","w2"]},
"blocklist": {"types": [], "names": [
"model.layers.1.block_sparse_moe.experts.(3|4).w2",
"model.layers.[29-31].block_sparse_moe.experts.[0-7].w2"
]},
"allowlist": {"types": [], "names": []},
"blocklist": {"types": [], "names": ["self_attn"]},
"dump_stats_path": "./hqt_output/measure"
}

0 comments on commit df43900

Please sign in to comment.