An endpoint server for efficiently serving quantized open-source LLMs for code.
Why do you think https://github.com/microsoft/unilm is a good alternative to llm-vscode-inference-server?