riczhou/Llama-3-70B-Instruct-awq-int8-kv-cache-trt-llm-compiled

1 contributor

History: 2 commits

Upload folder using huggingface_hub

f4d67d5 verified 6 months ago