KIT | KIT-Bibliothek | Impressum | Datenschutz

Practical Data Science Using the Shell

Schmidt, Andreas ORCID iD icon 1; Koubaa, Mohamad Anis ORCID iD icon 1
1 Institut für Automation und angewandte Informatik (IAI), Karlsruher Institut für Technologie (KIT)

Abstract:

For data analysis tasks, typically we load the data into a dedicated tool, i.e. the statistic program R,
mathematica, a relational database, or some other specialized tools to perform our analysis. However, there is often another option that can be performed on almost any computer. The GNU core utils are available by default on many computers and provide a set of powerful tools for manipulating and transforming data and also for performing analyses such as aggregation, etc. In addition to being freely available, these tools have the advantage that they can be used immediately, without the data having to be transformed and loaded into the target system beforehand. The stream-based approach throughout means that even very large amounts of data can be processed without running out of main memory.


Zugehörige Institution(en) am KIT Institut für Automation und angewandte Informatik (IAI)
Publikationstyp Vortrag
Publikationsdatum 21.10.2023
Sprache Englisch
Identifikator KITopen-ID: 1000165322
HGF-Programm 37.12.02 (POF IV, LK 01) Design,Operation & Digitalization of the Future Energy Grids
Veranstaltung 20th International Conference Applied Computing (AC 2023), Funchal, Portugal, 21.10.2023 – 23.10.2023
Schlagwörter Filter and Pipes, Shell Programming, Data Analysis
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page