自适应语音到脉冲编码用于脉冲神经网络

精选理由

这篇论文给脉冲神经网络设计了个自适应语音编码器，参数少还能在GSC-v2上跑到94.97%，比很多大模型都强，还比较了两种训练方式。

AI 摘要

提出可学习的残差语音到脉冲编码器，与Recurrent Leaky Integrate-and-Fire (R-LIF)骨干联合训练。在Google Speech Commands v2 (GSC-v2)基准上达到94.97%准确率。35k参数的紧凑变体达到89.8%，匹配或超越参数多一个数量级的基线。编码器学习任务对齐的脉冲表示，提升类别可分性。Direct Feedback Alignment (DFA)在相同设置下达到91.5%，量化了生物启发学习规则的性能权衡。

AI 翻译 · 中文

arXiv cs.LGThe mismatch between continuous acoustic signals and discrete event-driven processing remains a fundamental bottleneck for neuromorphic speech processing. Current systems typically rely on fixed spike encoders, forcing d…

阅读原文