Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference
DOI:
10.23919/date64628.2025.10992912
Publication Date:
2025-05-21T17:36:35Z
AUTHORS (5)
ABSTRACT
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (39)
CITATIONS (0)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....