A Python library for summarizing event data into multivariate categorical data
AlphaTwirl is a Python library that summarizes event data into multivariate categorical data as data frames. Event data, input to AlphaTwirl, are data with one entry (or row) for one event: for example, data in ROOT TTrees with one entry per collision event of an LHC experiment at CERN. Event data are often large—too large to be loaded in memory—because they have as many entries as events. Multivariate categorical data, the output of AlphaTwirl, have one row for one category. They are usually small—small enough to be loaded in memory—because they only have as many rows as categories. Users can, for example, import them as data frames into R and pandas, which usually load all data in memory, and can perform categorical data analyses with a rich set of data operations available in R and pandas.
- Jupyter Notebook: Quick start of AlphaTwirl
- Tai Sakuma, "AlphaTwirl: A Python library for summarizing event data into multivariate categorical data", EPJ Web of Conferences 214, 02001 (2019), doi:10.1051/epjconf/201921402001, 1905.06609
- AlphaTwirl is licensed under the BSD license.
- Tai Sakuma - [email protected]