Start-Up Exercises

This page assumes you (the reader) already has some experience with git and the command line. If you are not as experienced with either of these, then you may need to use more resources than this guide to gain an understanding of ldmx-sw.

Git Started using git
Windows Subsystem for Linux for Windoze users
Getting Started with the Terminal

0. Installation and Test Run

Follow the instructions on how to install ldmx-sw to build an installation on your local computer. The most up-to-date instructions can be found on the ldmx-sw README. The basic flow of building and installing ldmx-sw is below. The big steps break down into a series of terminal commands that you will use often.

Get the source code (either by downloading or writing) - The --recurisve flag is very important because of the way we package our code in git

git clone --recursive https://github.com/LDMX-Software/ldmx-sw.git

Configure the build to your system and your source - The source scripts/ldmx-env.sh is called "sourcing the environment script" and can be done earlier if the code is already on your computer.

cd ldmx-sw
source scripts/ldmx-env.sh
mkdir build
cd build
ldmx cmake ..

Compile and Install the code - This also needs to be inside the build directory.

ldmx make install

Make sure the installation was built and linked correctly by running the test application. This should return a message describing saying that several tests passed, for example:

$ ldmx ctest
Test project /home/tom/ldmx/ldmx-sw/build
    Start 1: Framework
1/6 Test #1: Framework ........................   Passed    2.63 sec
    Start 2: DetDescr
2/6 Test #2: DetDescr .........................   Passed    0.17 sec
    Start 3: Conditions
3/6 Test #3: Conditions .......................   Passed    0.30 sec
    Start 4: Ecal
4/6 Test #4: Ecal .............................   Passed    2.41 sec
    Start 5: Hcal
5/6 Test #5: Hcal .............................   Passed   32.68 sec
    Start 6: SimCore
6/6 Test #6: SimCore ..........................   Passed   13.13 sec

100% tests passed, 0 tests failed out of 6

Total Test time (real) =  51.35 sec

1. Practice using `fire`

This is another test on your installation, but it gives you more freedom to explore ldmx-sw.

fire is run by using a "python configuration file." This config file is actually executed as a python script before run parameters are extracted, so you can use python to make your job easier.

Write your first config file

For the purposes of the rest of these exercises, you should have a single file of ~100 events so that it can be analyzed fairly quickly. If you want to skip to writing your own analyzers and you already have an event file, then you can skip to the next exercise.

Read through Running and Configuring fire to learn more about the python configuration scripts. The configuration script provided there gives an example of a script you could use to generate the small event file to analyze. Mess around with the parameters and look at the processors used in that script to make your sample your own and learn more!

There are already several processors in ldmx-sw that have a template python module that you can import into your config. Most processors that have this sort of template put into their module's python directory.

2. Analyze a Sample using `python3`

One of the quickest ways to open up a file with events in it and start making plots is by writing a python-based analysis. The benefits of using a python-based analysis are that the analysis script is easier to develop; however, you do not have access to all of the tools that are coded into the C++ implementation of ldmx-sw so these analysese are limited.

A very basic pythnon analysis is given below. The EventTree module is a recent addition to ldmx-sw and is not available in older (pre v3.0.0) versions.

"""A short and basic python analysis in ldmx-sw"""

from LDMX.Framework import EventTree
import sys

# use the first command line argument as the input file
tree = EventTree.EventTree(sys.argv[1])

for event in tree:
    event.EventHeader.Print()

which you would run on the command line using the ldmx command defined after sourcing the environment script.

$ ldmx python3 my-analysis.py file-to-analyze.root

All this script will do is loop through the events in the file and print-out the event header for each one. The event header only has the basic information of the event's number, weight, timestamp, and run number. Obviously, this "analysis" is not very interesting. How can we improve upon it?

Exercise: Fill a histogram of simulated energy deposits in the ECal.

Now that you have a basic template for looping through an event tree, try to modify the above python script such that you can print-out a histogram of the energy deposits in the ECal.

Hints:

The list of simulated hits in the ECal are called EcalSimHits, so you can get them with event.EcalSimHits.
- Other objects in the event exist - use ldmx rootbrowse file-to-analyze.root to browse the file!
ROOT's documentation isn't great, but this "simple" histogram-filling file is a place you can look for code examples

3. Analyze a sample using `fire`

General Description of Processing in `fire`

fire iterates through all of the events stored in a file. Each of these events has an event bus that contains all of the objects that store data about the event. A good introductory way to think of this analysis is implied by the use of the phrase "event bus". Each file contains a line of buses and each bus stops at each processor in the sequence. Each processor can go onto the bus making additions or just getting information before the bus proceeds to the next processor in the sequence. A processor that is allowed to add objects to the bus are called producers while a processor that is not allowed to add objects is called an analyzer. The event bus can contain singular objects or Standard Template Library (STL) collections of objects.

Exercise

You can write your own processors in a branch of the analysis repository. This repo requires you to provide it a path to your ldmx-sw installation, but after that, you don't need to compile ldmx-sw anymore!

Write your own Analyzer that makes some sort of histogram. A good method of starting a new Analyzer is to simply copy an existing analyzer and change its file names and class name.

Goal: Fill a Histogram with the Transverse Momenta of All Particles

Copy an existing analyzer's header and source files to your own files and clean out all the old code (renaming where necessary).
- "renaming where necessary" is deceptively simple, reach out for some help on slack if you are unfamiliar with C++!
Make sure your new processor can be added to the processing sequence by having it print "Hello World" to the screen for each event. You will need to add your new processor to a python configuration file (like you used above) and have the processor in the sequence provided to the ldmx process.
Determine how to access the SimParticles from the event. (Browsing the code-base for other processors which do this is a common method!)
Now that you have the collection of simulated particles in your analyzer, have your analyzer fill a histogram of the transverse momentum of each of the simulated particles.

Now you've gotten a basic analyzer running and filling histograms! You can go forward and start making more interesting histograms from the objects that other processors put into the event.

The rest of these exercises are much more code-heavy. If you are not developing for ldmx-sw, you can take a break from these exercises now and start writing your own physics analyses using what you have learned.

4. Construct a Producer using a pre-defined event object

All of the event objects that can be added to the event bus are in the Event module. You've probably already used a few of them while practicing making an analyzer in the previous exercise. Now, you will add your own version of one of these classes to the event bus. Most of the event objects are highly specialized, but a simpler one to use to practice is HcalVetoResult.

Goal: Write your own HCal Veto

There is already an implementation of an HCal Veto: Hcal::HcalVetoProcessor. Look at it to gain some inspiration.
Look through the header and source files for HcalVetoResult to make sure you understand the aspects of this class.
Generate a new producer by copying an existing one and make sure that it can be added to the processing sequence by having it print the number of HcalHits in each event.
Design your veto. Look at the HcalVetoProcessor to find out how to add your object to the event bus.
Make sure your new object is being stored in the output file by opening it with ldmx rootbrowse.

Bonus:

There is a way to tell fire to skip an event (i.e. not save it in the output file) from within a processor. Can you make your veto actually skip events that would have failed to pass it? Hint: You have to put new code in both your producer's source file and your python configuration file. Hint: We usually call this process "skimming".

4. Construct a Producer and your own event object

Note: Currently, in order to write your own event object, you need to make additions to ldmx-sw and re-build and re-install ldmx-sw after these additions.

You've already written a producer in the previous exercise, but you were constrained to only using objects that have already been written for a different purpose. This is not how most new producers work. Most new producers have a corresponding new event object developed with it.

Goal: Determine full path information of primary electron.

Before you get started there are some things you need to know about the simulation in order to do this task effectively and accurately.

Geant4 (G4) is the simulation toolkit that we use, and a lot of the terminology in SimParticle is borrowed from G4.
G4 uses the TrackID as a unique identifier for each particle, but it is not perfect. In some interactions, the track ID changes even though we would consider one of the products to be the same as one of the incoming particles.
The kinematic information stored in SimParticle is what the particle had at creation. This is especially important to consider when looking at the primary electron.
The end point stored in SimParticle is when the given track ID is no longer used. This may or may not be the point where the particle actually is stopped or destroyed because there are interactions that change the track ID in G4 without destroying the particle.
Not all of the particles generated by G4 are stored in the simulation file. This is because thousands of particles are generated in the particle shower in the ECal, and storing them all would take too much space. This is important to remember when using the references to other particles. For example, the particle stored as the "parent" may or may not be stored as well.

Now we are ready to track the path of the primary electron in the event.

We want to know the full kinematic information for the primary electron along its journey. Start by thinking of what we need to find and how we could calculate it if it isn't directly available in the data file. This isn't as easy as taking the starting and ending points stored in SimParticle. These points don't take into account any scattering that occurs along the particle's real path in the simulation which would lead to a curved path instead of a straight line.
Make a producer template and use it to try to track the primary electron through the first few events in your file. What collections do you need to get? How do you determine the electrons energy or momentum at a specific time/place? Print this information to your screen to see if it makes sense. This processor can be in your ldmx-analysis repository. Only the new event bus object needs to be inside of ldmx-sw.
Write a C++ class that will store this path information for the primary electron and follow the instructions on how to add it to the LDMX event bus. This step is complicated and will take some patience.
Add your object to the event bus and check that it is being added. How would you check that the path is being determined correctly? How would you check that the momentum/energy at each point along the path is being determined correctly?
Write an analyzer to make a histogram of the real path length of the primary electron (not the straight line distance between the starting and ending points). You could make this histogram inside of your producer, but you should make a separate analyzer to double check that the object you added to the event bus can be accessed by other processors.

LDMX Software