Исследование интегральных характеристик суперкомпьютерных приложений для всего потока задач больших вычислительных систем
Аннотация
Ключевые слова
Полный текст:
PDF (English)Литература
Top50 Supercomputers of Russia and CIS: [http://top50.supercomputers.ru/]. 15.02.2016.
Top500 Supercomputer sites: [http://top500.org/].15.02.2016.
Antonov A., Zhumatiy S., Nikitenko D., Stefanov K., Teplov A., Shvets P. Analysis of dynamic characteristics of job stream on supercomputer system Numerical Methods and Programming, 2013. Vol. 14, No. 2., P. 104-108.
Safonov A., Kostenetskiy P., Borodulin K., Melekhin F. A monitoring system for supercomputers of SUSU // Russian Supercomputing Days International Conference, Moscow, Russian Federation, 28-29 September, 2015, Proceedings. CEUR Workshop Proceedings, 2015. Vol. 1482, P. 662-666
Stefanov K. et al. Dynamically Reconfigurable Distributed Modular Monitoring System for Supercomputers (DiMMon) // Procedia Computer Science / Elsevier B.V., 2015. Vol. 66, P. 625–634.
Nikitenko D. Complex approach to performance analysis of supercomputer systems based on system monitoring data. Numerical Methods and Programming, 2014, Vol. 15, P. 85–97.
Voevodin V., Zhumatiy S., Nikitenko D. Octoshell: Large Supercomputer Complex Administration System // Russian Supercomputing Days International Conference, Moscow, Russian Federation, 28-29 September, 2015, Proceedings. CEUR Workshop Proceedings, 2015. Vol. 1482, P. 69-83.
Voevodin Vl., Antonov A., Bryzgalov P., Nikitenko D., Zhumatiy S., Sobolev S., Stefanov K., Voevodin Vad. Practice of "Lomonosov" Supercomputer // Open systems, 2012. No. 7, P. 36–39.
Zhumatiy S., Nikitenko D. Approach to flexible supercomputers management // International supercomputing conference Scientific Services & Internet: all parallelism edges, Novorossiysk, Russian Federation, 23-28 September, 2013, Proceedings. MSU, 2013. P. 296–300.
Voevodin Vl. Supercomputer situational screen // Open systems, 2014. No. 3, P. 36–39.
Shvets P. , Antonov A., Nikitenko D., Sobolev S., Stefanov K., Voevodin Vad., Voevodin V., Zhumatiy S. An Approach for Ensuring Reliable Functioning of a Supercomputer Based on a Formal Model. 11th Int. Conference on Parallel Processing and Applied Mathematics, Krakow, Poland, 6-9 September, 2015. Proceedings.
Voevodin V., Antonov A., Dongarra J. AlgoWiki: an Open Encyclopedia of Parallel Algorithmic Features // Supercomputing Frontiers and Innovations, 2015. Vol.2. c1. P. 4-18.
SLURM workload manager: [http://slurm.schedmd.com/]. 15.02.2016.
Cleo cluster batch system: [http://sourceforge.net/projects/cleo-bs/]. 15.02.2016.
Ganglia Monitoring System: [http://ganglia.sourceforge.net/]. 15.02.2016.
Collectd – The system statistics collection daemon: [https://collectd.org/]. 15.02.2016.
Clustrx: [http://www.t-platforms.ru/products/software/clustrxproductfamily/clustrxwatch.html]. 15.02.2016.
jQuery & jQuery UI: [http://jqueryui.com/]. 15.02.2016.
TagIt: [http://aehlke.github.io/tag-it/].15.02.2016.
DOI: http://dx.doi.org/10.14529/cmse160403