LLaMa/RWKV onnx models, quantization and testcase
Why do you think that https://github.com/Ki6an/fastT5 is a good alternative to llama.onnx
LLaMa/RWKV onnx models, quantization and testcase
Why do you think that https://github.com/Ki6an/fastT5 is a good alternative to llama.onnx