TY - JOUR
T1 - Distributed actor-critic method for multi-objective resource-aware neural architecture search
AU - Tabrizchi, Hamed
AU - Razmara, Jafar
AU - Javadi, Bahman
PY - 2025/12
Y1 - 2025/12
N2 - In the era of diverse hardware platforms, a critical element of neural network architecture optimization is ensuring that the architecture is optimized for a specific deployment environment. This paper aims to present an effective distributed method for optimizing neural architectures to address the challenge of achieving multiple objectives simultaneously. Using the power of distributed computing, our method employs an Actor-Critic framework to explore the architecture search space while considering multiple resource-related objectives such as memory footprint for deployments in memory-constrained systems, and the amount of computation required given a number of Floating-point Operations Per Second (FLOPs) for applications deployed on energy-constrained devices. An important innovation that is proposed in this research is the integration of hardware awareness, multi-objective optimization, and multi-modality in data. It enables neural architectures to meet strict hardware constraints, including memory efficiency and inference speed, while handling a wide range of data modalities. The proposed method was evaluated in four scenarios, incorporating 3 image, 4 text, 8 graph, and 8 time series benchmarks. Despite the diversity of data modalities, the method demonstrates consistent performance and adaptability.
AB - In the era of diverse hardware platforms, a critical element of neural network architecture optimization is ensuring that the architecture is optimized for a specific deployment environment. This paper aims to present an effective distributed method for optimizing neural architectures to address the challenge of achieving multiple objectives simultaneously. Using the power of distributed computing, our method employs an Actor-Critic framework to explore the architecture search space while considering multiple resource-related objectives such as memory footprint for deployments in memory-constrained systems, and the amount of computation required given a number of Floating-point Operations Per Second (FLOPs) for applications deployed on energy-constrained devices. An important innovation that is proposed in this research is the integration of hardware awareness, multi-objective optimization, and multi-modality in data. It enables neural architectures to meet strict hardware constraints, including memory efficiency and inference speed, while handling a wide range of data modalities. The proposed method was evaluated in four scenarios, incorporating 3 image, 4 text, 8 graph, and 8 time series benchmarks. Despite the diversity of data modalities, the method demonstrates consistent performance and adaptability.
KW - Deep learning
KW - Distributed Actor-Critic Method
KW - Multi-Objective Optimization
KW - Neural Architecture Search
UR - http://www.scopus.com/inward/record.url?scp=105018573794&partnerID=8YFLogxK
UR - https://go.openathens.net/redirector/westernsydney.edu.au?url=https://doi.org/10.1016/j.asoc.2025.114021
U2 - 10.1016/j.asoc.2025.114021
DO - 10.1016/j.asoc.2025.114021
M3 - Article
AN - SCOPUS:105018573794
SN - 1568-4946
VL - 185
JO - Applied Soft Computing
JF - Applied Soft Computing
M1 - 114021
ER -