研究目的
To develop a framework for the thermal management of complex many-core architectures that can precisely recover the thermal distribution from a minimal number of sensors, reducing the impact of noisy measurements on the reconstructed thermal distribution.
研究成果
The proposed framework significantly improves thermal monitoring performance over the state of the art, achieving lower reconstruction errors with fewer sensors. It demonstrates that realistic workloads are not necessary for learning the model and efficiently placing sensors, as random power traces can also yield reasonably good models.
研究不足
The study does not consider intra and inter-die variations due to process variations, and the practical implementation details of the reconstruction algorithm are not deeply explored.
1:Experimental Design and Method Selection:
The study employs a linear low-dimensional subspace to represent thermal distributions, utilizing a PCA-based model for thermal distribution reconstruction and FrameSense algorithm for sensor placement optimization.
2:Sample Selection and Data Sources:
Thermal distributions are generated using 3D-ICE, a fast compact transient thermal model, based on power traces from benchmarks run on an instruction-level architectural simulator.
3:List of Experimental Equipment and Materials:
The study uses a 64-core architecture designed by STM with 28 nm CMOS technology, and ANSYS CFX for computational fluid dynamics simulations.
4:Experimental Procedures and Operational Workflow:
The framework is divided into design-time and run-time phases, including thermal simulation, model learning, sensor placement optimization, and thermal distribution estimation.
5:Data Analysis Methods:
The performance is evaluated based on approximation and reconstruction errors, comparing PCA and DCT models, and different sensor placement algorithms.
独家科研数据包,助您复现前沿成果,加速创新突破
获取完整内容