TY - GEN
T1 - Reinforcement learning based anti-jamming schedule in cyber-physical systems
AU - Gan, Ruimeng
AU - Xiao, Yue
AU - Shao, Jinliang
AU - Zhang, Heng
AU - Zheng, Wei Xing
PY - 2020
Y1 - 2020
N2 - ![CDATA[In this paper, the security issue of cyber-physical systems is investigated, where the observation data is transmitted from a sensor to an estimator through wireless channels disturbed by an attacker. The failure of this data transmission occurs, when the sensor accesses the channel that happens to be attacked by the jammer. Since the system performance measured by the estimation error depends on whether the data transmission is a success, the problem of selecting the channel to alleviate the attack effect is studied. Moreover, the state of each channel is time-variant due to various factors, such as path loss and shadowing. Motivated by energy conservation, the problem of selecting the channel with the best state is also considered. With the help of cognitive radio technique, the sensor has the ability of selecting a sequence of channels dynamically. Based on this, the problem of selecting the channel is resolved by means of reinforcement learning to jointly avoid the attack and enjoy the channel with the best state. A corresponding algorithm is presented to obtain the sequence of channels for the sensor, and its effectiveness is proved analytically. Numerical simulations further verify the derived results.]]
AB - ![CDATA[In this paper, the security issue of cyber-physical systems is investigated, where the observation data is transmitted from a sensor to an estimator through wireless channels disturbed by an attacker. The failure of this data transmission occurs, when the sensor accesses the channel that happens to be attacked by the jammer. Since the system performance measured by the estimation error depends on whether the data transmission is a success, the problem of selecting the channel to alleviate the attack effect is studied. Moreover, the state of each channel is time-variant due to various factors, such as path loss and shadowing. Motivated by energy conservation, the problem of selecting the channel with the best state is also considered. With the help of cognitive radio technique, the sensor has the ability of selecting a sequence of channels dynamically. Based on this, the problem of selecting the channel is resolved by means of reinforcement learning to jointly avoid the attack and enjoy the channel with the best state. A corresponding algorithm is presented to obtain the sequence of channels for the sensor, and its effectiveness is proved analytically. Numerical simulations further verify the derived results.]]
UR - https://hdl.handle.net/1959.7/uws:60889
U2 - 10.1016/j.ifacol.2020.12.221
DO - 10.1016/j.ifacol.2020.12.221
M3 - Conference Paper
SP - 2501
EP - 2506
BT - Proceedings of the 21th IFAC World Congress, Berlin, Germany, 12-17 July 2020
PB - International Federation of Automatic Control
T2 - International Federation of Automatic Control. World Congress
Y2 - 12 July 2020
ER -