Mengzhao Chen

ChenMnZ

https://chenmnz.github.io/

AI & ML interests

model compression

Organizations

None yet

ChenMnZ's activity

commented a paper about 1 month ago

PrefixQuant: Static Quantization Beats Dynamic through Prefixed Outliers in LLMs

Paper • 2410.05265 • Published Oct 7 • 29

New activity in ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ 3 months ago

Where GGUF?

#1 opened 3 months ago by

rdtfddgrffdgfdghfghdfujgdhgsf

commented a paper 4 months ago

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Paper • 2407.11062 • Published Jul 10 • 8

New activity in ChenMnZ/Llama-2-13b-chat-omniquant-w3a16g128asym about 1 year ago

Setting vocab_size to solve the issue of mlc-chat module not initializing

#1 opened about 1 year ago by

New activity in ChenMnZ/Llama-2-7b-chat-omniquant-w3a16g128asym about 1 year ago

Updated config file to remove special characters

#1 opened about 1 year ago by

New activity in ChenMnZ/OmniQuant about 1 year ago

Models about w4a4 for Llama-2 family

#2 opened about 1 year ago by

more models?

#1 opened about 1 year ago by