ENOT-AutoDL
/
gpt-j-6B-tensorrt-int8
Text Generation
Transformers
ONNX
lambada
English
text-generation-inference
causal-lm
int8
tensorrt
ENOT-AutoDL
Inference Endpoints
License: apache-2.0
Commit History
added onnx model (fake quant) compatible with trt
554833e
igor
committed on
Jun 7, 2023
updated README.md (added latency table)
2db91ae
ivkalgin
committed on
Apr 5, 2023
fixed typo
292111d
ivkalgin
committed on
Mar 31, 2023
added direct link to github repo, fixed text
4f0d13d
ivkalgin
committed on
Mar 31, 2023
fixed link to example
2643a65
ivkalgin
committed on
Mar 30, 2023
fixed accuracy in validation table
23e6d15
ivkalgin
committed on
Mar 30, 2023
Update README.md (#2)
a6ac5eb
ivkalgin
agoncharenko1992
committed on
Mar 30, 2023
Create README.md (#1)
23da18b
ivkalgin
committed on
Mar 30, 2023
added 2080ti engine
d78be47
ivkalgin
committed on
Mar 29, 2023
added 4090 engine
74702df
ivkalgin
committed on
Mar 28, 2023
normalized engine name
bf4729c
ivkalgin
committed on
Mar 28, 2023
added engine for rtx3080ti
ca37425
ivkalgin
committed on
Mar 28, 2023
initial commit
8912b4c
ivkalgin
committed on
Mar 28, 2023