Download Lagu MLSys'24 Best Paper - AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration MP3 & MP4


1 year ago
MIT HAN Lab
18:57 Menit