Llama-2-70b-chat-hf-trt-fp8 / postprocessing

Commit History

update models for newer trt 0.6.1 version
dd20dba

yessenzhar commited on

try bugfix
151b0f6

yessenzhar commited on

remove hardcoded weights and tokenizer dirs, replace with template
5de4d5c

yessenzhar commited on

remove print
d5c7a29

yessenzhar commited on

add smaller files
a83b588

yessenzhar commited on