Infrastructure Monitoring for GridKa and beyond

Buttitta, Evelina 1
1 Scientific Computing Center (SCC), Karlsruher Institut für Technologie (KIT)

Abstract (English):

Infrastructure monitoring helps to control and monitor, in real time, the servers and applications involved in operating the WLCG Tier-1 center GridKa, including the online and tape storage, the batch system, and the GridKa network.
Monitoring data such as server metrics (CPU, memory, disk, network) and storage operations (I/O statistics), together with real-time sensor data such as temperature, humidity, and power consumption in the server rooms, are essential for a complete picture of the availability, performance, and resource efficiency of the entire data center.

By integrating widely used open-source technologies, we have built a scalable solution that collects, stores, and visualizes infrastructure data across the data center. In this presentation we describe the main components of our monitoring architecture and the technologies we use: Telegraf as the agent that collects metrics, InfluxDB as the time-series database that stores them, and Grafana as the visualization tool used to query and display the data. In addition, we operate a five-node cluster based on the OpenSearch search engine to collect logs from many sources.
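
As an illustration of the collection step, the following minimal sketch formats one metric point as InfluxDB line protocol, the text format Telegraf uses by default when writing to InfluxDB. It is not taken from the GridKa setup: the function names, the measurement `cpu`, the tag `host=node01`, and the field `usage_idle` are hypothetical, and the escaping covers only the common cases.

```python
def escape_measurement(text: str) -> str:
    """Escape commas and spaces in a measurement name."""
    for ch in (",", " "):
        text = text.replace(ch, "\\" + ch)
    return text


def escape_key(text: str) -> str:
    """Escape commas, equals signs and spaces in tag keys, tag values and field keys."""
    for ch in (",", "=", " "):
        text = text.replace(ch, "\\" + ch)
    return text


def format_field(value) -> str:
    """Render a field value: booleans, integers (suffix i), floats, or quoted strings."""
    if isinstance(value, bool):  # check bool first, because bool is a subclass of int
        return "true" if value else "false"
    if isinstance(value, int):
        return f"{value}i"
    if isinstance(value, float):
        return repr(value)
    escaped = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_line_protocol(measurement: str, tags: dict, fields: dict, timestamp_ns: int) -> str:
    """Build one line: measurement,tags fields timestamp (nanoseconds)."""
    if not fields:
        raise ValueError("line protocol requires at least one field")
    # Tags are sorted by key here; this is optional but keeps the output deterministic.
    tag_part = "".join(
        f",{escape_key(k)}={escape_key(str(v))}" for k, v in sorted(tags.items())
    )
    field_part = ",".join(
        f"{escape_key(k)}={format_field(v)}" for k, v in fields.items()
    )
    return f"{escape_measurement(measurement)}{tag_part} {field_part} {timestamp_ns}"


# Hypothetical example point: idle CPU percentage reported by one server.
print(to_line_protocol("cpu", {"host": "node01"}, {"usage_idle": 97.5}, 1743465600000000000))
# prints: cpu,host=node01 usage_idle=97.5 1743465600000000000
```

In a deployment like the one described, the agent produces such lines itself; the sketch only shows what a single stored data point looks like before Grafana queries it.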

Affiliated institution(s) at KIT: Scientific Computing Center (SCC)
Publication type: Presentation
Publication date: 1 April 2025
Language: English
Identifier: KITopen-ID 1000180969
HGF program: 53.52.02 (POF IV, LK 02) GridKa
Event: HEPiX Spring Workshop (2025), Lugano, Switzerland, 31 March to 4 April 2025
Keywords: monitoring, GridKa

Full text
DOI: 10.5445/IR/1000180969
Published on 10 April 2025