Kuehn, E.; Fischer, M.; Giffels, M.; Jung, C.; Petzold, A.
Abstract:
With the introduction of federated data access to the workows of WLCG, it is becoming increasingly important for data centers to understand specific data ows regarding storage element accesses, firewall configurations, as well as the scheduling of batch jobs themselves. As existing batch system monitoring and related system monitoring tools do not support measurements at batch job level, a new tool has been developed and put into operation at the GridKa Tier 1 center for monitoring continuous data streams and characteristics of WLCG jobs and pilots. Long term measurements and data collection are in progress. These measurements already have been proven to be useful analyzing misbehaviors and various issues. Therefore we aim for an automated, realtime approach for anomaly detection. As a requirement, prototypes for standard workows have to be examined. Based on measurements of several months, different features of HEP jobs are evaluated regarding their effectiveness for data mining approaches to identify these common workows. The paper will introduce the actual measurement approach and statistics as well as the general concept and fir ... mehrst results classifying different HEP job workows derived from the measurements at GridKa.