Zhihu
知乎 GitHub 官方帐号 ,欢迎关注我们的技术专栏 https://zhuanlan.zhihu.com/hackers
- 318 followers
- Beijing, China
- https://zhuanlan.zhihu.com/hackers
- jobs+github@zhihu.com
Popular repositories Loading
Repositories
    Showing 10 of 35 repositories
    
  
  
    
      -           fust Publiczhihu/fust’s past year of commit activity 
-           TLLM_QMM PublicTLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pytorch module. We modified the dequantation and weight preprocessing to align with popular quantization alogirthms such as AWQ and GPTQ, and combine them with new FP8 quantization. zhihu/TLLM_QMM’s past year of commit activity 
Top languages
Loading…
Most used topics
Loading…