Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference

DOI: 10.23919/date64628.2025.10992912 Publication Date: 2025-05-21T17:36:35Z
ABSTRACT
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (39)
CITATIONS (0)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....