1 Introduction

The top quark is an important component of the standard model (SM), especially because of its large mass, and its properties are critical for the overall understanding of the theory. Measurements of the top quark–antiquark pair (\({\mathrm{t}}\overline{{\mathrm{t}}}\)) production cross section test the predictions of quantum chromodynamics (QCD), constrain QCD parameters, and are sensitive to physics beyond the SM. The \({\mathrm{t}}\overline{{\mathrm{t}}}\) process is also the dominant SM background to many searches for new physical phenomena, and its precise measurement is essential for claiming new discoveries.

The copious top quark data samples produced at the CERN LHC enable measurements of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) production rate in extended parts of the phase space, and differentially as a function of the kinematic properties of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) system. Inclusive and differential cross section measurements from proton-proton (pp) collisions at centre-of-mass energies of 7 and 8 \(\,\text {TeV}\) have been reported by the ATLAS [111] and CMS collaborations [1224]. These are significantly more precise than the measurements of \({\mathrm{t}}\overline{{\mathrm{t}}}\) production in proton-antiproton collisions performed at the Tevatron [25]. In this paper, we report new results from pp collision data at \(\sqrt{s}=8~\,\text {TeV} \), collected with the CMS detector. Measurements of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) inclusive cross section and the normalized differential cross sections are presented for the first time in the all-jets final state at this collision energy. The results are compared to QCD predictions, and are in agreement with other measurements in different decay channels.

Top quarks decay almost exclusively into a W boson and a b quark. Events in which both W bosons from the \({\mathrm{t}}\overline{{\mathrm{t}}}\) decay produce a pair of light quarks constitute the so-called all-jets channel. As a result, the final state consists of at least six partons (more are possible from initial- and final-state radiation), two of which are b quarks. Despite the large number of combinatorial possibilities, it is possible to fully reconstruct the kinematical properties of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) decay products, unlike in the leptonic channels where the presence of one or two neutrinos makes the full event interpretation ambiguous. However, the presence of a large background from multijet production, and the larger number of jets in the final state make the measurement of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) cross section in the all-jets final state more uncertain compared to the leptonic channels. Nevertheless, a high-purity signal sample can be selected, which increases significantly the signal-over-background ratio compared to previous measurements in this decay channel [21, 26, 27].

2 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6 \(\text {\,m}\) internal diameter, providing a magnetic field of 3.8 \(\text {\,T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter. Extensive forward calorimetry (pseudorapidity \(|\eta |>3.0\)) complements the coverage provided by the barrel (\(|\eta |<1.3\)) and endcap (\(1.3<|\eta |<3.0\)) detectors. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. The first level of the CMS trigger system, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select the most interesting events in a fixed time interval of less than 4 \(\upmu \)s. The high-level trigger (HLT) processor farm further decreases the event rate from around 100 \(\text {\,kHz}\) to around 300 \(\text {\,Hz}\), before data storage. A detailed description of the CMS apparatus, together with the definition of the coordinate system used and the relevant kinematic variables, can be found in Ref. [28].

3 Event simulation

The \({\mathrm{t}}\overline{{\mathrm{t}}}\) events are simulated using the leading-order (LO) MadGraph (v. 5.1.5.11) event generator [29], which incorporates spin correlations through the madspin [30] package and the simulation of up to three additional partons. The value of the top quark mass is set to \(m_{{\mathrm{t}}}=172.5~\,\text {GeV} \) and the proton structure is described by the parton distribution functions (PDFs) from CTEQ6L1 [31]. The generated events are subsequently processed with pythia (v. 6.426) [32] which utilizes tune Z2* for parton showering and hadronization, and the MLM prescription [33] is used for matching of matrix element jets to those from parton shower. The pythia Z2* tune is derived from the Z1* tune [34], which uses the CTEQ5L PDF [31], whereas Z2* adopts CTEQ6L [31]. The CMS detector response is simulated using Geant4 (v. 9.4) [35].

In addition to the MadGraph simulation, predictions obtained with the next-to-leading-order (NLO) generators mc@nlo (v. 3.41) [36] and powheg (v. 1.0 r1380) [37] are also compared to the measurements. While powheg and mc@nlo are formally equivalent up to NLO accuracy, they differ in the techniques used to avoid double counting of the radiative corrections when interfacing with the parton shower generators. Two different powheg samples are used: one uses pythia and the other herwig (v. 6.520) [38] for parton showering and hadronization. The events generated with mc@nlo are interfaced with herwig. The herwig AUET2 tune [39] is used to model the underlying event in the powheg \(+\) herwig sample, while the default tune is used in the mc@nlo \(+\) herwig sample. The proton structure is described by the PDF sets CT10 [40] and CTEQ6M [31] for powheg and mc@nlo, respectively. The QCD multijet events are simulated using MadGraph (v. 5.1.3.2) interfaced with pythia (v. 6.424).

4 Event reconstruction and selection

4.1 Jet reconstruction

Jets are reconstructed with the anti-\(k_{\mathrm {T}}\) clustering algorithm [41, 42] with a distance parameter of 0.5. The input to the jet clustering algorithm is the collection of particle candidates that are reconstructed with the particle-flow (PF) algorithm [43, 44]. In the PF event reconstruction all stable particles in the event, i.e. electrons, muons, photons, and charged and neutral hadrons, are reconstructed as PF candidates using a combination of all of the subdetector information to obtain an optimal determination of their directions, energies, and types. All the reconstructed vertices in the event are ordered according to the sum of squared transverse momenta (\(p_{\mathrm {T}}\)) of tracks used to reconstruct it and the vertex with the largest sum is considered the primary one, while all the rest are considered as pileup vertices. In order to mitigate the effect of multiple interactions in the same bunch crossing (pileup), charged PF candidates that are unambiguously associated with pileup vertices are removed prior to the jet clustering. This procedure is called charged-hadron subtraction (CHS) [45]. An offset correction is applied for the additional energy inside of the jet due to neutral hadrons or photons from pileup. The resulting jets require a small residual energy correction, mostly due to the thresholds for reconstructed tracks and clusters in the PF algorithm and reconstruction inefficiencies [45].

The identification of jets that likely originate from the hadronization of b quarks is done with the “combined secondary vertex” (CSV) b tagger [46]. The CSV algorithm combines the information from track impact parameters and identified secondary vertices within a given jet, and provides a continuous discriminator output.

4.2 Trigger

The data used for this measurement were collected with a multijet trigger event selection (path) which, from the HLT, required at least four jets reconstructed from calorimetric information with a \(p_{\mathrm {T}}\) threshold of \(50~\,\text {GeV} \) and \(|\eta |<3.0\). The hardware trigger required the presence of two central (\(|\eta |<3.0\)) jets above various \(p_{\mathrm {T}}\) thresholds (52–64 \(\,\text {GeV}\)), or the presence of four central jets with lower \(p_{\mathrm {T}}\) thresholds (32–40 \(\,\text {GeV}\)), or the scalar sum of all jets \(p_{\mathrm {T}}\) to be greater than 125 or 175 \(\,\text {GeV}\). The various thresholds were adjusted within the quoted ranges according to the instantaneous luminosity. The trigger paths employed were unprescaled for a larger part of the run, yielding a data sample corresponding to an integrated luminosity of 18.4 \(\,\text {fb}^{-1}\).

Fig. 1
figure 1

Distribution of the reconstructed top quark mass after the kinematic fit. The normalizations of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal and the QCD multijet background are taken from the template fit to the data. The bottom panel shows the fractional difference between the data and the sum of signal and background predictions, with the shaded band representing the MC statistical uncertainty

Fig. 2
figure 2

Distribution of the kinematic fit probability (top). Distribution of the distance between the reconstructed b partons in the \(\eta \)\(\phi \) plane (bottom). The normalizations of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal and the QCD multijet background are taken from the template fit to the data. The bottom panels show the fractional difference between the data and the sum of signal and background predictions, with the shaded band representing the MC statistical uncertainty

4.3 Selection and kinematic top quark pair reconstruction

Selected events are required to contain at least six reconstructed jets with \(p_{\mathrm {T}} >40~\,\text {GeV} \) and \(|\eta |<2.4\) (jets are required to be within the tracker acceptance in order to apply the CHS), with at least four of the jets having \(p_{\mathrm {T}} >60~\,\text {GeV} \) (so that the trigger efficiency is greater than 80 % and the data-to-simulation correction factor smaller than 10 %). Among the six jets with the highest \(p_{\mathrm {T}}\) (leading jets), at least two must be identified as coming from b hadronization by the CSV algorithm at the medium working point (CSVM), with a typical b quark identification efficiency of 70 % and misidentification probability for light quarks of 1.4 %, and these are considered the most probable b jet candidates. If there are more than two such jets, which happens in approximately 2 % of the events, then the two with the highest \(p_{\mathrm {T}}\) are chosen. To select events compatible with the \({\mathrm{t}}\overline{{\mathrm{t}}}\) hypothesis, and to improve the resolution of the reconstructed quantities, a kinematic fit is performed that utilizes the constraints of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) decay. A \(\chi ^2\) fit is performed, starting with the reconstructed jet four-momenta, which are varied within their experimental \(p_{\mathrm {T}}\) and angular resolutions, imposing a W boson mass constraint (80.4 \(\,\text {GeV}\)  [47]) on the light-quark pairs, and requiring that the top quark and antiquark have equal mass. Out of all the possible combinations from the six input jets, the algorithm returns the one with the smallest \(\chi ^2\) and the resulting parton four momenta, which are used to compute the reconstructed top quark mass (\(m_{{\mathrm{t}}}^\text {rec}\)). The probability of the converged kinematic fit is required to be greater than 0.15. Overall, the kinematic fit requirements select approximately 5 % (2 %) of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) (background) events. The distance in the \(\eta \)\(\phi \) space between the two b quark candidates must be \(\Delta R_{{\mathrm{b}}{\mathrm{b}}}=\sqrt{{(\Delta \eta _{{\mathrm{b}}{\mathrm{b}}})^2+(\Delta \phi _{{\mathrm{b}}{\mathrm{b}}})^2}}>2.0\), which has an efficiency of roughly 75 % (50 %) on \({\mathrm{t}}\overline{{\mathrm{t}}}\) (background) events. The last two requirements are applied to select events with unambiguous top quark pair interpretation and to suppress the QCD background that originates from gluon splitting into collinear b quarks [48].

Fig. 3
figure 3

Distribution of the \(p_{\mathrm {T}}\) of the six leading jets. The normalization of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal and the QCD multijet background are taken from the template fit to the data. The bottom panels show the fractional difference between the data and the sum of signal and background predictions, with the shaded band representing the MC statistical uncertainty

Fig. 4
figure 4

Distribution of the leading (top) and subleading (bottom) reconstructed top quark \(p_{\mathrm {T}}\). The normalizations of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal and the QCD multijet background are taken from the template fit to the data. The bottom panels show the fractional difference between the data and the sum of signal and background predictions, with the shaded band representing the MC statistical uncertainty

Fig. 5
figure 5

Distribution of the \(p_{\mathrm {T}}\) (top) and the rapidity (bottom) of the reconstructed top quark pair. The normalizations of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal and the QCD multijet background are taken from the template fit to the data. The bottom panels show the fractional difference between the data and the sum of signal and background predictions, with the shaded band representing the MC statistical uncertainty

5 Signal extraction

The background to the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal is dominated by the QCD multijet production process, while the other backgrounds, such as the associated production of vector bosons with jets, are negligible. Due to the limited size of the Monte Carlo (MC) simulated samples, the background is determined directly from the data. A QCD-dominated event sample is selected with the trigger and offline requirements described in Sect. 4.3 and requiring zero CSVM b tagged jets. In these events the most probable b quark candidates are determined by the kinematic fit. The resulting sample contains a negligible fraction of \({\mathrm{t}}\overline{{\mathrm{t}}}\) events (\(<\)1 %) and is treated exactly like the signal sample. After applying the \(\Delta R_{{\mathrm{b}}{\mathrm{b}}}>2.0\) and the fit probability requirements, the reconstructed top-like kinematic properties of events with no b jet are very similar to those with two b jets (confirmed using simulated QCD events). We use this QCD-dominated control sample to extract the shape (templates) of the various kinematic observables. The number of \({\mathrm{t}}\overline{{\mathrm{t}}}\) events (signal yield) is extracted from a template fit of \(m_{{\mathrm{t}}}^\text {rec}\) to the data using parametrized shapes for signal and background distributions, where the signal shape is taken from the \({\mathrm{t}}\overline{{\mathrm{t}}}\) simulation and the QCD shape is taken from the control data sample described above. The background and signal yields are determined via a maximum likelihood fit to the \(m_{{\mathrm{t}}}^\text {rec}\) distribution and are used to normalize the corresponding samples. Figures 1 and 2 show the fitted mass and the kinematic fit probability and \(\Delta R_{{\mathrm{b}}{\mathrm{b}}}\) distributions. The \(p_{\mathrm {T}}\) distributions of the six leading jets is shown in Fig. 3. From the output of the kinematic fit one can reconstruct the two top quark candidates, whose \(p_{\mathrm {T}}\) are shown in Fig. 4, and the properties of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) system (\(p_{\mathrm {T}}\), rapidity y) are shown in Fig. 5. Overall, the data sample is dominated by signal events, and the data are in agreement with the fit results. The jet \(p_{\mathrm {T}}\) spectra in data appear to be systematically softer than in the simulation, in agreement with the observations in Ref. [24], related to a softer measured top quark \(p_{\mathrm {T}}\) spectrum.

Fig. 6
figure 6

Distribution of the reconstructed top quark mass after the kinematic fit in bins of the leading reconstructed top quark \(p_{\mathrm {T}}\). The normalizations of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) signal and the QCD multijet background are taken from the template fit to the data. The bottom panels show the fractional difference between the data and the sum of signal and background predictions, with the shaded band representing the MC statistical uncertainty

6 Systematic uncertainties

The measurement of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) cross section is affected by several sources of systematic uncertainty, both experimental and theoretical, which are described below and summarized in Table 1. The quoted values refer to the inclusive measurement, with small variations observed in the bins of the differential measurement presented in Sect. 7.2.

Table 1 Fractional uncertainties in the inclusive \({\mathrm{t}}\overline{{\mathrm{t}}}\) production cross section
Fig. 7
figure 7

Normalized fiducial differential cross section of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) production as a function of the leading (top) and subleading (bottom) reconstructed top quark \(p_{\mathrm {T}}\) (detector level). The bottom panels show the fractional difference between various MC predictions and the data. Statistical uncertainties are shown with error bars, and systematic uncertainties with the shaded band

Table 2 Normalized differential \({\mathrm{t}}\overline{{\mathrm{t}}}\) cross section as a function of the \(p_{\mathrm {T}}\) of the leading (\(p_{\mathrm {T}} ^{(1)}\)) and subleading (\(p_{\mathrm {T}} ^{(2)}\)) top quarks or antiquarks. The results are presented at detector level in the visible phase space
Fig. 8
figure 8

Normalized differential cross section of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) production at parton level as a function of the leading (top) and subleading (bottom) top quark \(p_{\mathrm {T}}\). The bottom panels show the fractional difference between various MC predictions and the data. Statistical uncertainties are shown with error bars, while theoretical (theo.) and experimental (exp.) systematic uncertainties with the shaded bands

  • Background modeling: the QCD \(m_{{\mathrm{t}}}^\text {rec}\) template shape derived from the data control sample is varied according to the uncertainty of the method evaluated with simulated events, which impacts the extracted signal yield moderately (4.9 %).

  • Trigger efficiency: the efficiency of the trigger path is taken from the simulation and corrected with an event-by-event scale factor (\(\mathrm {SF}_\text {trig}\)), calculated from data independent samples, that depends on the fourth jet \(p_{\mathrm {T}}\). In the phase space of the measurement, the \(\mathrm {SF}_\text {trig}\) is greater than 0.83 and on average 0.96. The associated uncertainty is conservatively defined as \((1-\mathrm {SF}_\text {trig})/2\) and has a small impact (2.0 %) on the cross section.

  • Jet energy scale and resolution: the jet energy scale (JES) and jet resolution (JER) uncertainties have significant impacts on the measured cross section due to the relatively high \(p_{\mathrm {T}}\) requirements on the fourth and sixth of the leading jets. In the simulated events, jets are shifted (smeared) according to the \(p_{\mathrm {T}}\)- and \(\eta \)-dependent JES (JER) uncertainty, prior to the kinematic fit, and the full event interpretation is repeated. The JES (JER) has a dominant (small) effect on the cross section measurement of 7.0 % (3.5 %). In addition, the JES/JER uncertainties affect the signal template, with a negligible impact (\(\approx \)1 %) on the cross section measurement.

  • b tagging: the performance of the b tagger has a dominant effect on the signal acceptance because the selected events are required to have at least two jets satisfying the CSVM requirement. An event-by-event scale factor (\(\mathrm {SF}_\text {btag}\)) is applied to the simulation, which accounts for the discrepancies between data and simulation in the efficiency of tagging true b jets and in the misidentification rate [46]. The average value of \(\mathrm {SF}_\text {btag}\) is 0.99. The uncertainty in the \(\mathrm {SF}_\text {btag}\) is taken into account by weighting each event with the shifted value of \(\mathrm {SF}_\text {btag}\) which results in a cross section uncertainty of 7.3 %. This is the leading systematic uncertainty.

  • Integrated luminosity: the uncertainty on the integrated luminosity is estimated to be 2.6 % [49].

  • Matching partons to showers: the impact of the choice of the scale that separates the description of jet production via matrix elements or parton shower in MadGraph is studied by changing its reference value of 20 to 40 and 10 \(\,\text {GeV}\), resulting in an asymmetric effect of \(-4.2,\,+2.4\) % on the cross section.

  • Renormalization and factorization scales: the uncertainty in modelling of the hard-production process is assessed through changes in the renormalization and factorization scales in the MadGraph sample by factors of two and half, relative to their common nominal value, which is set to the Q of the hard process. In MadGraph, Q is defined by \(Q^2 = m^2_{{\mathrm{t}}} + \Sigma p^2_{\mathrm {T}}\), where the sum is over all additional final state partons in the matrix element calculations. The effect on the measured cross section is moderate and asymmetric (\(-0.5,\,+3.8~\%\)).

  • Parton distribution functions: following the PDF4LHC prescription [50, 51], the uncertainty on the cross section is estimated to be 1.5 %, taking the largest deviation on the signal acceptance from all the considered PDF eigenvectors.

    Table 3 Normalized differential \({\mathrm{t}}\overline{{\mathrm{t}}}\) cross section as a function of the \(p_{\mathrm {T}}\) of the leading (\(p_{\mathrm {T}} ^{(1)}\)) and subleading (\(p_{\mathrm {T}} ^{(2)}\)) top quarks or antiquarks. The results are presented at parton level in the full phase space
  • Non-perturbative QCD: the impact of non-perturbative QCD effects is estimated by studying various tunes of the pythia shower model that predict different underlying event (UE) activity and strength of the color reconnection (CR), namely, the Perugia 2011, Perugia 2011 mpiHi, and Perugia 2011 Tevatron tunes, described in Ref. [52], were used. The effect on the measured cross section is moderate: 4.4 % for the UE and 1.4 % for the CR.

  • Hadronization model: the effect of the hadronization model on the signal efficiency is estimated by comparing the predictions from the mc@nlo \(+\) herwig and powheg \(+\) pythia simulations, and it amounts to 2 %.

Fig. 9
figure 9

Normalized differential cross section of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) production at particle level as a function of the leading (top) and subleading (bottom) top quark \(p_{\mathrm {T}}\). The bottom panels show the fractional difference between various MC predictions and the data. Statistical uncertainties are shown with error bars, while theoretical (theo.) and experimental (exp.) systematic uncertainties with the shaded bands

7 Results

7.1 Inclusive cross section

The signal yield (\(N_{{\mathrm{t}}\overline{{\mathrm{t}}}}\)), extracted as described in Sect. 5, is used to compute the inclusive \({\mathrm{t}}\overline{{\mathrm{t}}}\) production cross section, according to the formula

$$\begin{aligned} \sigma _{{\mathrm{t}}\overline{{\mathrm{t}}}}=\frac{N_{{\mathrm{t}}\overline{{\mathrm{t}}}}}{(\mathcal {A}\epsilon )\,\mathcal {L}}, \end{aligned}$$
(1)

where \((\mathcal {A}\,\epsilon )\) is the simulated signal acceptance times efficiency in the measurement phase space (\({\approx }7\times 10^{-4}\)) corrected event-by-event with the trigger and b tagging efficiency scale factors and \(\mathcal {L}\) is the integrated luminosity. The fitted signal amounts to \(3416\pm 79\) events. Taking into account the systematic uncertainties discussed in Sect. 6, the measured cross section is

$$\begin{aligned} \sigma _{{\mathrm{t}}\overline{{\mathrm{t}}}}=275.6\pm 6.1\,\text {(stat)} \pm 37.8\,\text {(syst)} \pm 7.2\,\text {(lumi)} ~\text {\,pb}. \end{aligned}$$
(2)
Table 4 Normalized differential \({\mathrm{t}}\overline{{\mathrm{t}}}\) cross section as a function of the \(p_{\mathrm {T}}\) of the leading (\(p_{\mathrm {T}} ^{(1)}\)) and subleading (\(p_{\mathrm {T}} ^{(2)}\)) top quarks or antiquarks. The results are presented at particle level

The precision of the measured inclusive cross section is dominated by the systematic uncertainties, and in particular by those related to JES and b tagging.

In order to parametrize the dependence of the result on the top quark mass assumption, the measurement was repeated using signal simulated samples with different generated top quark masses (167.5 and 175.5 \(\,\text {GeV}\)). The choice of the generated mass affects both the extracted signal yield and the signal efficiency. The quadratic interpolation of the measurements with the three different top quark masses is

$$\begin{aligned} \frac{\sigma _{{\mathrm{t}}\overline{{\mathrm{t}}}}(m_{\mathrm{t}})}{\sigma _{{\mathrm{t}}\overline{{\mathrm{t}}}}(m_{\mathrm{t}}=172.5)}= & {} 1.0-2.4\times 10^{-2}(m_{\mathrm{t}}-172.5)\nonumber \\&+\,8.3\times 10^{-4}(m_{\mathrm{t}}-172.5)^2. \end{aligned}$$
(3)

7.2 Differential cross sections

The size of the signal sample allows the differential measurement of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) production cross section to be performed as a function of various observables. In order to confront the theoretical predictions, the differential cross sections are reported normalized to the inclusive cross section, resulting in a significant cancellation of systematic uncertainties.

The process of measuring the differential cross sections is identical to the inclusive case: in each bin of the observable used to divide the phase space, the signal is extracted from a template fit to the reconstructed top quark mass. Besides the physics interest, the choice of the observables used is mainly motivated by their correlation to \(m_{{\mathrm{t}}}^\text {rec}\), and the ability to extract smooth signal and background templates. The variables chosen are the \(p_{\mathrm {T}}\) of the two reconstructed top quarks. Figure 6 shows the fitted \(m_{{\mathrm{t}}}^\text {rec}\) distributions in bins of the \(p_{\mathrm {T}}\) of the leading top quark.

The differential measurements are first reported for the visible fiducial volume, as a function of the reconstructed top \(p_{\mathrm {T}}\) (detector level), and then extrapolated to the parton and particle levels. The detector-level result is shown in Fig. 7 and is free of most of the systematic uncertainties affecting the inclusive measurement. The corresponding numerical values are reported in Table 2.

The parton-level results shown in Fig. 8 are obtained from the detector-level measurement, after correcting for bin migration effects and extrapolating to the full phase space using a bin-by-bin acceptance correction. The unfolding of the bin-migration effect is performed with the D’Agostini method [53], implemented in the RooUnfold package [54], using the migration matrix derived from the simulation. The uncertainty due to the modeling of the migration matrix and the phase-space extrapolation is estimated by repeating the unfolding and acceptance-correction procedures by varying the systematic sources described in Sect. 6. The numerical values of the normalized differential cross sections at parton level are reported in Table 3. It should be noted that there is a large extrapolation factor involved from the detector-level jets (7E–4 of the signal) to the full parton level, which results in large theoretical uncertainties.

In addition to the parton level, results are reported at particle level, in Fig. 9, in a phase space similar to the detector level by construction. This is defined as follows: first, particle jets are built in simulation from all stable particles (including neutrinos) with the same jet clustering algorithm as the detector jets. Then, starting from the six leading jets, the jets associated with B hadrons via matching in \(\eta \)\(\phi \) (\(\Delta R < 0.25\)) are identified as the b jet candidates. Events are further selected if \(p_{\mathrm {T}} ^\text {4th jet} > 60~\,\text {GeV} \) and \(p_{\mathrm {T}} ^\text {6th jet} > 40~\,\text {GeV} \) and if there are at least two b jets with \(\Delta R_{{\mathrm{b}}{\mathrm{b}}} > 2.0\). For the selected events, a “pseudo top quark” is reconstructed from one b jet and the two closest non-b-tagged jets. The particle-level results are obtained in a similar way to the parton level, via unfolding and acceptance correction. The numerical values of the normalized differential cross sections at particle level are reported in Table 4.

The comparison of the measured and predicted differential top quark \(p_{\mathrm {T}}\) shapes reveals that the models predict a harder spectrum, both in the leading and in the subleading top quark \(p_{\mathrm {T}}\), in the phase space of the measurement. This effect is also reflected on the jet \(p_{\mathrm {T}}\) distributions shown in Fig. 3. The powheg \(+\) herwig prediction is the closest to the data, but still shows a significant discrepancy. The parton-level results are accompanied by sizeable systematic uncertainties, dominated by the theoretical uncertainties due to the extrapolation to the full phase space. In contrast, the particle-level phase space is much closer to the visible one, and as a result the extrapolation uncertainties are smaller.

8 Summary

A measurement of the \({\mathrm{t}}\overline{{\mathrm{t}}}\) production cross section has been performed in the all-jets final state, using pp collision data at \(\sqrt{s}=8~\,\text {TeV} \) corresponding to an integrated luminosity of \(18.4~{\,\text {fb}^{-1}} \). The measured inclusive cross section is \(275.6\,\pm \, 6.1\,\text {(stat)} \pm 37.8\,\text {(syst)} \pm 7.2\,\text {(lumi)} \) \(\text {\,pb}\) for a top quark mass of \(172.5~\,\text {GeV} \), in agreement with the standard model prediction of \(252.9^{+6.4}_{-8.6}\,(\text {scale})\pm 11.7\,(\text {PDF}+\alpha _S)\) \(\text {\,pb}\) as calculated with the Top++ (v. 2.0) program [55] at next-to-next-to-leading order in perturbative QCD, including soft-gluon resummation at next-to-next-to-leading-log order [56], and assuming a top-quark mass \(m_{{\mathrm{t}}}=172.5~\,\text {GeV} \). Also reported are the fiducial normalized differential cross sections as a function of the leading and subleading top quark \(p_{\mathrm {T}}\). Compared to QCD predictions, the measurement shows a significantly softer top quark \(p_{\mathrm {T}}\) spectrum. The differential cross sections are also extrapolated to the full partonic phase space, as well as to particle level, and can be used to tune Monte Carlo models.