CryptoDB

Ranking Loss: Maximizing the Success Rate in Deep Learning Side-Channel Analysis

Authors:
Gabriel Zaid, Univ Lyon, UJM-Saint-Etienne, CNRS, Laboratoire Hubert Curien UMR 5516, F-42023 Saint-Etienne, France; Thales ITSEF, Toulouse, France
Lilian Bossuet, Univ Lyon, UJM-Saint-Etienne, CNRS, Laboratoire Hubert Curien UMR 5516, F-42023 Saint-Etienne, France
François Dassance, Thales ITSEF, Toulouse, France
Amaury Habrard, Univ Lyon, UJM-Saint-Etienne, CNRS, Laboratoire Hubert Curien UMR 5516, F-42023 Saint-Etienne, France
Alexandre Venelli, Thales ITSEF, Toulouse, France
Download:
DOI: 10.46586/tches.v2021.i1.25-55
URL: https://tches.iacr.org/index.php/TCHES/article/view/8726
Abstract: The side-channel community recently investigated a new approach, based on deep learning, to significantly improve profiled attacks against embedded systems. Compared to template attacks, deep learning techniques can deal with protected implementations, such as masking or desynchronization, without substantial preprocessing. However, important issues are still open. One challenging problem is to adapt the methods classically used in the machine learning field (e.g. loss functions, performance metrics) to the specific side-channel context in order to obtain optimal results. We propose a new loss function derived from the learning-to-rank approach that helps prevent the approximation and estimation errors induced by the classical cross-entropy loss. We theoretically demonstrate that this new function, called Ranking Loss (RkL), maximizes the success rate by minimizing the ranking error of the secret key with respect to all other hypotheses. The resulting model converges towards the optimal distinguisher when considering the mutual information between the secret and the leakage. Consequently, the approximation error is prevented. Furthermore, the estimation error induced by the cross-entropy is reduced by up to 23%. When the ranking loss is used, the convergence towards the best solution is up to 23% faster than with a model using the cross-entropy loss function. We validate our theoretical propositions on public datasets.
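
The core idea of the abstract, scoring every key hypothesis and penalizing hypotheses that outrank the secret key, can be sketched as a pairwise ranking-style loss. The snippet below is a minimal illustration in PyTorch, not the authors' exact RkL formulation from the paper; the one-score-per-hypothesis output layout, the alpha scaling factor, and the averaging over traces are assumptions made for the example.

```python
import torch
import torch.nn.functional as F

def ranking_loss_sketch(scores: torch.Tensor,
                        true_key: torch.Tensor,
                        alpha: float = 1.0) -> torch.Tensor:
    """Pairwise ranking-style loss (illustrative sketch, not the paper's exact RkL).

    scores:   (batch, n_hypotheses) model outputs, one score per key hypothesis.
    true_key: (batch,) index of the secret-key hypothesis for each trace.
    alpha:    scaling factor controlling how sharply rank violations are penalized.
    """
    batch = torch.arange(scores.size(0), device=scores.device)
    s_true = scores[batch, true_key].unsqueeze(1)   # secret-key score, shape (batch, 1)
    margins = s_true - scores                       # s(k*) - s(k) for every hypothesis k
    # softplus(-alpha * margin) = log(1 + exp(-alpha * (s(k*) - s(k)))):
    # large when a wrong hypothesis scores above the secret key, small otherwise.
    pairwise = F.softplus(-alpha * margins)
    mask = torch.ones_like(pairwise)
    mask[batch, true_key] = 0.0                     # drop the k = k* term
    return (pairwise * mask).sum(dim=1).mean()      # sum over wrong hypotheses, mean over traces

# Hypothetical usage: `model` maps traces to one score per key-byte hypothesis (e.g. 256 for AES).
# scores = model(traces)
# loss = ranking_loss_sketch(scores, correct_key_indices)
# loss.backward()
```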
Video from TCHES 2020
BibTeX
@article{tches-2020-30759,
  title={Ranking Loss: Maximizing the Success Rate in Deep Learning Side-Channel Analysis},
  journal={IACR Transactions on Cryptographic Hardware and Embedded Systems},
  publisher={Ruhr-Universität Bochum},
  volume={2021, Issue 1},
  pages={25-55},
  url={https://tches.iacr.org/index.php/TCHES/article/view/8726},
  doi={10.46586/tches.v2021.i1.25-55},
  author={Gabriel Zaid and Lilian Bossuet and François Dassance and Amaury Habrard and Alexandre Venelli},
  year=2020
}