TY - GEN
T1 - Analog Softmax with Wide Input Current Range for In-Memory Computing
AU - Dube, Aradhana
AU - Manea, Paul
AU - Gibertini, Paolo
AU - Covi, Erika
AU - Strachan, John Paul
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025/6/27
Y1 - 2025/6/27
N2 - The Softmax activation function plays a pivotal role in both the attention mechanism of Transformers and in the final layer of neural networks performing classification. The Softmax function outputs probabilities by normalizing the input values, emphasizing differences among them to highlight the largest values. In digital implementations, the complexity of softmax grows linearly with the number of inputs. In contrast, analog implementations enable parallel computations with lower latency. In this work, we demonstrate that this approach achieves a more efficient linear scaling of latency as vector size increases logarithmically. This analog softmax circuits are implemented in TSMC 28 nm PDK technology, capable of driving up to 128 inputs and producing an analog current output spanning three orders of magnitude. The study examines the circuit's power consumption, latency, and error, emphasizing its efficiency compared to the alternative approach of converting outputs to digital signals via ADCs and performing the softmax calculation digitally. By reducing reliance on these power-intensive operations, this work aims to significantly enhance energy efficiency in in-memory computing systems.
AB - The Softmax activation function plays a pivotal role in both the attention mechanism of Transformers and in the final layer of neural networks performing classification. The Softmax function outputs probabilities by normalizing the input values, emphasizing differences among them to highlight the largest values. In digital implementations, the complexity of softmax grows linearly with the number of inputs. In contrast, analog implementations enable parallel computations with lower latency. In this work, we demonstrate that this approach achieves a more efficient linear scaling of latency as vector size increases logarithmically. This analog softmax circuits are implemented in TSMC 28 nm PDK technology, capable of driving up to 128 inputs and producing an analog current output spanning three orders of magnitude. The study examines the circuit's power consumption, latency, and error, emphasizing its efficiency compared to the alternative approach of converting outputs to digital signals via ADCs and performing the softmax calculation digitally. By reducing reliance on these power-intensive operations, this work aims to significantly enhance energy efficiency in in-memory computing systems.
KW - Analog computing
KW - Classification
KW - In-memory computing
KW - Softmax
UR - https://www.scopus.com/pages/publications/105010649832
U2 - 10.1109/ISCAS56072.2025.11043251
DO - 10.1109/ISCAS56072.2025.11043251
M3 - Conference contribution
AN - SCOPUS:105010649832
SN - 979-8-3503-5684-7
T3 - Proceedings - IEEE International Symposium on Circuits and Systems
BT - ISCAS 2025 - IEEE International Symposium on Circuits and Systems, Proceedings
PB - IEEE
T2 - 2025 IEEE International Symposium on Circuits and Systems, ISCAS 2025
Y2 - 25 May 2025 through 28 May 2025
ER -