代码库
Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
Python
adanartificial-intelligencebert-modelconvnextcuda-programmingdeep-learningdiffusiondreamfusionfairseqgpt2llm-trainingllmsmaemoeoptimizerpytorchresnettimmtransformer-xlvit
CVPR'24, Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation".
Python
associationhumor-generationlarge-language-modelsleap-of-thoughtmultimodal-deep-learning
[ACL 2026 (Main)] LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
Python