PDP: Parameter-free Differentiable Pruning is All You Need
Artofficial
Lecture 4 wilde
Quantum compression with classically simulatable circuits
Retentive Network: A Successor to Transformer for Large Language Models
Neural Haircut: Prior-Guided Strand-Based Hair Reconstruction