1 Introduction

Searches for new physics in final states with two leptons that have same-sign (SS) charges provide a powerful probe for searches of new physics, both because standard model (SM) processes with this signature are few and have low cross sections, and because this signature is produced in a large number of important new-physics scenarios. Examples of the latter include the production of supersymmetric (SUSY) particles [1, 2], Majorana neutrinos [3], vector-like quarks [4], and SS top quark pairs [5, 6]. In the SUSY framework [715], the SS signature can arise through gluino pair production. For example, the Majorana nature of the gluino allows gluino pairs to decay via SS charginos, yielding two SS W bosons. Gluino pair production can also yield four W bosons, e.g., from the decay of four top quarks, which may result in the SS dilepton final state. Alternatively, cascade decays of pair-produced squarks can lead to the SS dilepton signature. Searches for new physics in the SS channel have been previously performed at the CERN LHC by the ATLAS [1618] and CMS [1923] Collaborations.

This paper describes a search for new physics in the final state with two or more leptons and including a SS pair (\(\mu ^\pm \mu ^\pm \), \(\mu ^\pm \mathrm{e}^\pm \), or \(\mathrm{e}^\pm \mathrm{e}^\pm \), where \(\mu \) is a muon and \(\mathrm {e}\) an electron). The analysis is based on proton–proton (pp) collision data at \(\sqrt{s} = 13\) \(\,\text {TeV}\), corresponding to an integrated luminosity of 2.3\(\,\text {fb}^{-1}\) collected with the CMS detector in 2015. The search strategy resembles that used in our analysis of 19.5 \(\mathrm{fb}^{1}\) of data collected at \(\sqrt{s}=8\,\text {TeV} \) [23], which excluded gluino masses in the four top quark signature up to about 1050\(\,\text {GeV}\). We design an inclusive analysis sensitive to a wide range of new-physics processes produced via strong interactions and yielding undetected particles in the final state. The interpretations of the results consider R-parity conserving SUSY models [24], as well as cross section limits on the production of two top quark-antiquark (\(\mathrm{t}\overline{\mathrm{t}}\)) pairs and of two SS top quarks. We also provide model independent limits to allow further interpretations of the results. With respect to Ref. [23], the kinematic regions are redefined and improvements in the event selection are implemented, both of which increase the sensitivity to new-physics scenarios at \(\sqrt{s}=13\,\text {TeV} \).

2 The CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the field volume are several particle detection systems. Charged-particle trajectories are measured with silicon pixel and strip trackers, covering \(0 \le \phi < 2\pi \) in azimuth and \(|\eta | < 2.5\) in pseudorapidity, where \(\eta \equiv -\ln [\tan (\theta /2)]\) and \(\theta \) is the polar angle of the trajectory of the particle with respect to the counterclockwise beam direction. The transverse momentum, namely the component of the momentum p in the plane orthogonal to the beam, is defined as \(p_{\mathrm {T}} = p \sin \theta \). Surrounding the silicon trackers, a lead tungstate crystal electromagnetic calorimeter and a brass and scintillator hadron calorimeter provide energy measurements of electrons, photons, and hadronic jets in the range \(|\eta | < 3.0\). Muons are identified and measured within \(|\eta | < 2.4\) by gas-ionization detectors embedded in the steel flux-return yoke of the solenoid. Forward calorimeters on each side of the interaction point encompass \(3.0< |\eta | < 5.0\). The CMS trigger consists of a two-stage system. The first level of the CMS trigger system, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events in a fixed time interval of less than 4\(\,\upmu \text {s}\). The high-level trigger (HLT) processor farm further decreases the event rate from around 100 kHz to less than 1 kHz, before data storage. A more detailed description of the CMS detector can be found in Ref. [25].

3 Event selection and Monte Carlo simulation

Events are selected with two sets of HLT algorithms. The first requires two very loosely isolated leptons, one satisfying \(p_{\mathrm {T}} > 17\,\text {GeV} \) and the other satisfying \(p_{\mathrm {T}} > 8\,\text {GeV} \) for a muon and 12\(\,\text {GeV}\) for an electron. The isolation is evaluated with respect to nearby tracks for a muon and to both tracks and calorimetric objects for an electron. The second set of triggers selects events with lowered \(p_{\mathrm {T}}\) thresholds of 8\(\,\text {GeV}\) and without a restriction on the isolation, but requiring a hadronic activity \(H_{\mathrm {T}} ^\text {HLT}>300\,\text {GeV} \), where \(H_{\mathrm {T}} ^\text {HLT}\) is the scalar \(p_{\mathrm {T}} \) sum of all jets with \(p_{\mathrm {T}} > 40\,\text {GeV} \) and \(|\eta | < 3.0\) identified by the HLT. Typical trigger efficiencies for leptons satisfying the selection criteria described below are 94 % (98 %) per muon (electron), with 100 % efficiency for the \(H_{\mathrm {T}} ^\text {HLT}\) requirement.

In the subsequent analysis, muon candidates are reconstructed by combining information from the silicon tracker and the muon spectrometer in a global fit [26]. A selection is performed using the quality of the geometrical matching between the tracker and muon system measurements. We select muons with well-determined charge by imposing an additional criterion: \(\delta p_{\mathrm {T}} (\mu )/p_{\mathrm {T}} (\mu ) < 0.2\), where \(\delta p_{\mathrm {T}} (\mu )\) is the uncertainty in the measurement of the muon \(p_{\mathrm {T}}\) from the global fit.

Electron candidates are reconstructed by combining clusters of energy in the electromagnetic calorimeter with tracks in the silicon tracker [27]. The identification is performed using a Boosted Decision Tree multivariate discriminant [28] based on shower shape and track quality variables. The nominal selection criteria are designed to provide a maximum rejection of electron candidates from multijet production while maintaining approximately 90 % efficiency for electrons from the decay of \(\mathrm {W}\) or \(\mathrm{Z} \) bosons. A relaxed selection on the multivariate discriminant is used to define “loose” criteria for electron identification. To improve the accuracy of the electron charge reconstruction, we require the position of the calorimeter deposit, relative to the linear projection of the deposits in the pixel detector to the inner calorimeter surface, to be consistent with the charge determination from the full track fit. Electrons originating from photon conversions are suppressed by rejecting candidates that are either without energy deposits in the innermost layers of the tracking system, or that are associated with a displaced vertex compatible with a photon conversion.

Lepton candidates are required to be consistent with originating from the collision vertex for which the summed \(p_{\mathrm {T}} ^2\) of the associated physics objects is the largest. The transverse (longitudinal) impact parameter of the leptons must not exceed 0.5 (1.0) mm with respect to this vertex, and they must fulfill the requirement \(|d_{\mathrm {3D}}|/\sigma (d_{\mathrm {3D}}) < 4\), where \(d_{\mathrm {3D}}\) is the three-dimensional impact parameter with respect to the vertex, and \(\sigma (d_{\mathrm {3D}})\) is its uncertainty from the track fit.

The charged leptons produced in decays of heavy particles, such as \(\mathrm {W}\) and \(\mathrm{Z} \) bosons or SUSY particles (“prompt” leptons), are typically spatially isolated from the hadronic activity in the event, while leptons produced in hadron decays or in photon conversions, as well as hadrons misidentified as leptons, are usually embedded in jets (“nonprompt” leptons). This distinction becomes less evident for systems with a high Lorentz boost, where decay products tend to overlap and jets may contribute to the energy deposition around prompt leptons. This problem is mitigated with an isolation definition constructed using the following three variables:

  • the mini-isolation variable (\(I_\text {mini}\)) [29], computed as the ratio of the scalar \(p_{\mathrm {T}}\) sum of charged hadrons, neutral hadrons, and photons within a cone of radius \(\Delta R \equiv \sqrt{{(\Delta \eta )^2+(\Delta \phi )^2}}\) around the lepton candidate direction at the vertex, to the transverse momentum of the lepton candidate (\(p_{\mathrm {T}} (\ell )\)). The cone radius \(\Delta R\) depends on \(p_{\mathrm {T}} (\ell )\) as:

    $$\begin{aligned} \Delta R\left( p_{\mathrm {T}} (\ell )\right) = \dfrac{10\,\text {GeV}}{\min \left[ \max (p_{\mathrm {T}} (\ell ), 50\,\text {GeV}), 200\,\text {GeV} \right] }.\nonumber \\ \end{aligned}$$
    (1)

    The varying isolation cone definition takes into account the increased collimation of the decay products of a hadron as its \(p_{\mathrm {T}} \) increases, and it reduces the inefficiency from accidental overlap between the lepton and jets in a busy event environment. The momentum estimate of each particle is performed by the particle-flow (PF) algorithm [30, 31], which identifies individual particles through a combination of information from different detector components.

  • the ratio of the \(p_{\mathrm {T}} \) of the lepton to that of the closest jet within a distance \(\Delta R = 0.4\):

    $$\begin{aligned} p_{\mathrm {T}} ^\text {ratio} = \dfrac{p_{\mathrm {T}} (\ell )}{p_{\mathrm {T}} (\text {jet})}, \end{aligned}$$
    (2)

    where the definition of a jet is given below. In case of no jet within this distance, the value of \(p_{\mathrm {T}} ^\text {ratio}\) is set to 1. The \(p_{\mathrm {T}} ^\text {ratio}\) variable is a measure of the isolation in a larger cone and improves the performance of the isolation definition, especially for low-\(p_{\mathrm {T}} \) nonprompt leptons, which are more likely than high-\(p_{\mathrm {T}} \) leptons to appear in a jet that is wider than the \(I_\text {mini}\) cone.

  • the \(p_{\mathrm {T}} ^\text {rel}\) variable [32], defined as the transverse momentum of the lepton relative to the residual momentum of the closest jet after lepton momentum subtraction:

    $$\begin{aligned} p_{\mathrm {T}} ^\text {rel} =\frac{\left| \left( {\vec {p}}(\text {jet})-{\vec {p}}(\ell )\right) \times {\vec {p}}(\ell )\right| }{|{\vec {p}}(\text {jet})-{\vec {p}}(\ell )|}. \end{aligned}$$
    (3)

    This variable allows the identification of leptons that accidentally overlap with jets.

A lepton is considered to be isolated if the following condition is satisfied:

$$\begin{aligned} I_\text {mini} < I_1 \quad \text {AND}\quad (p_{\mathrm {T}} ^\text {ratio}> I_2 \ \text {OR} \ p_{\mathrm {T}} ^\text {rel} > I_3 ). \end{aligned}$$
(4)

The values of \(I_{i}\), with \(i = 1,2,3,\) depend on the lepton flavor: because the probability to misidentify a lepton is higher for electrons, tighter isolation values are used in this case (see Table 1). In addition, a “loose” isolation criterion is defined as \(I_\text {mini} < 0.4\).

Table 1 Values of the isolation parameters used in Eq. (4)

Muons (electrons) are required to have \(p_{\mathrm {T}} > 10\) (15) \(\,\text {GeV} \) and \(|\eta | < 2.4\) (2.5); at least one SS lepton pair with an invariant mass above \(8\,\text {GeV} \) must be present in the event. In order to reduce backgrounds from inclusive production of the Z boson and from low-mass resonances decaying into lepton pairs, the SS pair is rejected if there is an additional lepton in the event that satisfies loose requirements and that forms an opposite-sign, same-flavor pair with an invariant mass less than 12\(\,\text {GeV}\) or between 76 and 106\(\,\text {GeV}\) with one of the two SS leptons.

Jets and missing transverse momentum (\(E_{\mathrm {T}}^{\text {miss}} \)) are reconstructed with the PF algorithm. We define \(E_{\mathrm {T}}^{\text {miss}}\) as the magnitude of the vector sum of all PF candidate transverse momenta [33]. For jet clustering, the anti-\(k_t\) algorithm [34] with a distance parameter of 0.4 is utilized. Jets are required to satisfy quality requirements [35] to remove those consistent with anomalous energy deposits. After the estimated contribution from additional \(\mathrm {p}\mathrm {p}\) interactions in the same or adjacent bunch crossings (pileup) is subtracted, jet energies are corrected for residual nonuniformity and nonlinearity of the detector response using simulation and data. Jets are required to have \(p_{\mathrm {T}} >40\,\text {GeV} \) and to lie within the tracker acceptance \(|\eta |<2.4\). Jets must be separated from loosely identified leptons by \(\Delta R > 0.4\), so that jets already employed for the calculation of lepton isolation variables are not considered further in the analysis. We require \(N_\text {jet} \ge 2\), where \(N_\text {jet}\) denotes the number of selected jets in the event. The hadronic activity in the event (\(H_{\mathrm {T}} \)) is defined as the scalar \(p_{\mathrm {T}} \) sum of the selected jets.

To identify jets originating from b quarks, the combined secondary vertex algorithm CSVv2 [36] is used. Jets with \(p_{\mathrm {T}} > 25\,\text {GeV} \) and \(|\eta |<2.4\) are considered as b-tagged if they satisfy the requirements of the medium working point of the algorithm. These requirements result in approximately a 70 % efficiency for tagging a b quark jet, and a less than 1 % mistagging rate for light-quark and gluon jets in \(\mathrm{t}\overline{\mathrm{t}}\) events. The number of b-tagged jets in the event is denoted as \(N_{\mathrm{b}}\).

Monte Carlo (MC) simulation, which includes the contribution of pileup, is used to estimate the background from SM processes with prompt SS leptons (see Sect. 5) and to calculate the efficiency for various new-physics scenarios. The SM background samples are produced with the MadGraph5_aMC@NLO  2.2.2 generator [37] at leading order (LO) or next-to-leading order (NLO) accuracy in perturbative quantum chromodynamics, with the exception of diboson samples, which are produced with the powheg  v2 [38, 39] generator. The NNPDF3.0LO [40] parton distribution functions (PDFs) are used for the simulated samples generated at LO, and the NNPDF3.0NLO [40] PDFs for the samples generated at NLO. Parton showering and hadronization are described using the pythia  8.205 generator [41] with the CUETP8M1 tune [42, 43]. The CMS detector response for the background samples is modeled with the Geant4 package [44]. The signal samples are generated with MadGraph5_aMC@NLO at LO precision, including up to two additional partons in the matrix element calculations; parton showering and hadronization, as well as decays of SUSY particles, are simulated with pythia, while the detector simulation is performed with the CMS fast simulation package [45].

Fig. 1
figure 1

Diagrams illustrating the simplified SUSY models used in this analysis

4 Search strategy

This analysis is designed as an inclusive search, sensitive to models matching two assumptions: a strong-interaction production mechanism, leading to relatively large hadronic activity, and the presence of undetected particles in the final state, yielding sizable \(E_{\mathrm {T}}^{\text {miss}}\). In the process of defining the search strategy, R-parity conserving SUSY is taken as a guideline because of its rich variety of signatures. In this context, signal models that can lead to the experimental signature of SS lepton pairs differentiate themselves in the numbers of \(\mathrm {W}\) bosons, b jets, and light-flavor jets produced in the decays of SUSY particles. In addition, the mass differences between the SUSY particles involved in the decay chains affect the energy spectra of the decay products, resulting in differences between the models in the distributions of kinematic quantities such as the \(p_{\mathrm {T}}\) of the leptons, \(H_{\mathrm {T}}\), and \(E_{\mathrm {T}}^{\text {miss}}\).

We consider SUSY scenarios in the context of simplified models of new-particle production [46, 47]. Models with four \(\mathrm {W}\) bosons and four b jets involve gluino pair production, followed by the decay of each gluino through a chain containing third-generation squarks. If the gluino is lighter than all squarks, and the top squark is the lightest squark, the gluino undergoes a three-body decay mediated by an off-shell top squark. If the dominant top squark decay is \({\widetilde{\mathrm{t}} _1} \rightarrow \mathrm{t}\widetilde{\chi }^{0}_{1} \), where \(\widetilde{\chi }^{0}_{1}\) is the lightest neutralino, taken to be the stable, undetected, lightest SUSY particle (LSP), then the gluino three-body decay is \(\widetilde{\mathrm{g}} \rightarrow \mathrm{t}\overline{\mathrm{t}} \widetilde{\chi }^{0}_{1} \) (\(\mathrm {T}1\mathrm {tttt}\) model in Fig. 1, upper left). If instead the dominant top squark decay is \({\widetilde{\mathrm{t}} _1} \rightarrow \mathrm{b} \widetilde{\chi }^{+}_{1} \), the gluino three-body decay is \(\widetilde{\mathrm{g}} \rightarrow \overline{\mathrm{t}}\mathrm{b} \widetilde{\chi }^{+}_{1} \) (\(\mathrm {T}5\mathrm {ttbb}\mathrm {W}\mathrm {W}\) model in Fig. 1, upper middle); the latter signature can also arise if the bottom squark is the lightest squark and decays as \({\widetilde{\mathrm{b}} _1} \rightarrow \mathrm{t}\widetilde{\chi }^{-}_{1} \). If the top squark is light enough to be on-shell and decays predominantly to a top quark and the LSP, gluinos decay through the chain \(\widetilde{\mathrm{g}} \rightarrow {\widetilde{\mathrm{t}} _1} \overline{\mathrm{t}}\rightarrow \mathrm{t}\overline{\mathrm{t}} \widetilde{\chi }^{0}_{1} \) (\(\mathrm {T}5\mathrm {tttt}\) model in Fig. 1, upper right). If instead the top squark mainly decays to the charm quark and the LSP, gluinos decay as in the \(\mathrm {T}5\mathrm {ttcc}\) model (Fig. 1, lower left); in this case only two \(\mathrm {W}\) bosons and two b jets are produced.

Events with four \(\mathrm {W}\) bosons and two b jets can arise from bottom squark pair production, where each bottom squark decays to a top quark and a chargino, and the chargino decays to an LSP and a (possibly off-shell) \(\mathrm {W}\) boson (\(\mathrm {T}6\mathrm {tt}\mathrm {W}\mathrm {W}\) model in Fig. 1, lower middle).

Finally, SS lepton pairs can be produced in association with large values of \(H_{\mathrm {T}}\), \(E_{\mathrm {T}}^{\text {miss}}\), and \(N_\text {jet}\), but without b jets. In particular, events with two \(\mathrm {W}\) bosons and four light-flavor quark jets can arise from gluino pair production if each gluino decays to two light quarks and a chargino. The two charginos can have the same charge and each decay to a \(\mathrm {W}\) boson and the LSP (\(\mathrm {T}5\mathrm {qqqq}\mathrm {W}\mathrm {W}\) model in Fig. 1, lower right). In the case that the difference in mass between the chargino and the LSP is small, the \(\mathrm {W}\) bosons are off-shell and produce soft leptons.

To increase the sensitivity to new-physics scenarios, we categorize events based on their kinematic properties as follows. First, three exclusive lepton selections are defined:

  • high–high (HH) selection: two SS leptons, each with \(p_{\mathrm {T}} \ge 25\,\text {GeV} \);

  • high–low (HL) selection: two SS leptons, one with \(p_{\mathrm {T}} \ge 25\,\text {GeV} \) and the other with \(10 \le p_{\mathrm {T}} < 25\,\text {GeV} \);

  • low–low (LL) selection: two SS leptons, each with \(10 \le p_{\mathrm {T}} < 25\,\text {GeV} \).

The high lepton \(p_{\mathrm {T}}\) threshold suppresses the contribution from nonprompt leptons; hence the SM background in the HH region arises primarily from events with prompt SS leptons. The nonprompt lepton background is largely contained in the HL region, where the high-\(p_{\mathrm {T}}\) lepton is typically prompt and the low-\(p_{\mathrm {T}}\) lepton nonprompt. The LL region is characterized by a very small background since all processes where at least one lepton originates from an on-shell vector boson are suppressed by the low-\(p_{\mathrm {T}}\) requirements, while events with two nonprompt leptons are suppressed by the kinematic requirements described below; the main residual contribution in this region is from nonprompt leptons.

Second, search regions (SR) are introduced so that the analysis is sensitive to a variety of new-physics scenarios. SRs are defined separately for the HH, HL, and LL selections using the \(H_{\mathrm {T}}\), \(E_{\mathrm {T}}^{\text {miss}}\), \(N_\text {jet}\), and \(N_{\mathrm{b}}\) variables: \(N_\text {jet}\) and \(N_{\mathrm{b}}\) separate signal from background for scenarios with a large production of jets and/or b jets, while \(H_{\mathrm {T}}\) and \(E_{\mathrm {T}}^{\text {miss}}\) increase sensitivity to models with different masses of SUSY particles. In addition, we make use of the \(M_\mathrm {T}^{\text {min}}\) variable, defined as:

$$\begin{aligned} M_\mathrm {T}^{\text {min}} = \min \left[ M_\mathrm {T} (\ell _1,E_{\mathrm {T}}^{\text {miss}}),M_\mathrm {T} (\ell _2,E_{\mathrm {T}}^{\text {miss}})\right] , \end{aligned}$$
(5)

where \(M_\mathrm {T} (\ell ,E_{\mathrm {T}}^{\text {miss}}) = \sqrt{{2p_{\mathrm {T}} (\ell )E_{\mathrm {T}}^{\text {miss}} (1-\cos \phi _{\ell ,E_{\mathrm {T}}^{\text {miss}}})}}\) is the transverse mass and \(\phi _{\ell ,E_{\mathrm {T}}^{\text {miss}}}\) is the azimuthal angle difference between the directions of the lepton and of the missing transverse momentum [48]. In the case of an SS lepton pair from \(\mathrm{t}\overline{\mathrm{t}}\) or \(\mathrm {W}\)+jets processes, where one lepton is prompt and the other nonprompt, this variable has a cutoff near the \(\mathrm {W}\) boson mass; consequently, the nonprompt lepton background is suppressed for SRs requiring \(M_\mathrm {T}^{\text {min}} > 120\,\text {GeV} \) and is large for \(M_\mathrm {T}^{\text {min}} < 120\,\text {GeV} \). In order to better characterize the background we use a fine SR binning in kinematic regions where SM processes are abundant (e.g., low \(M_\mathrm {T}^{\text {min}}\) and low \(E_{\mathrm {T}}^{\text {miss}}\)), while, due to the low background, we use a coarser binning in regions with tight selections.

Finally, inclusive search regions in the HH and HL categories are defined in the tails of the \(E_{\mathrm {T}}^{\text {miss}}\) and \(H_{\mathrm {T}}\) variables; the boundaries \(E_{\mathrm {T}}^{\text {miss}} >300\,\text {GeV} \) and \(H_{\mathrm {T}} >1125\,\text {GeV} \) (for \(E_{\mathrm {T}}^{\text {miss}} \le 300\,\text {GeV} \)) are chosen so that each of these regions typically contains 1 background event.

A summary of the selection criteria is presented in Tables 2, 3 and 4. All SRs are non-overlapping. They are combined statistically to obtain the final results (Sect. 7).

Table 2 SR definitions for the HH selection. The notation \(^\mathrm {(*)}\) indicates that, in order to avoid overlaps with SR31, an upper bound \(E_{\mathrm {T}}^{\text {miss}} < 300\,\text {GeV} \) is used for regions with \(H_{\mathrm {T}} > 300\,\text {GeV} \). All unlabeled region are included in the SR above them, for example the unlabeled regions between SR3 and SR9 are included in SR3, with the exception of the region to the right of SR31, which is included in SR31
Table 3 SR definitions for the HL selection. The notation \(^\mathrm {(*)}\) indicates that, in order to avoid overlaps with SR25, an upper bound \(E_{\mathrm {T}}^{\text {miss}} < 300\,\text {GeV} \) is used for regions with \(H_{\mathrm {T}} > 300\,\text {GeV} \). All unlabeled region are included in the SR above them, for example the unlabeled regions between SR3 and SR7 are included in SR3, with the exception of the region to the right of SR25, which is included in SR25
Table 4 SR definitions for the LL selection. All SRs in this category require \(N_{\mathrm{b}} \ge 2\). Unlabeled region to the right of SR7 and SR8 are included in SR7 and SR8, respectively

5 Backgrounds

Backgrounds in the SS dilepton final state can be divided into three categories:

  • Nonprompt leptons: Nonprompt leptons are leptons from heavy-flavor decays, hadrons misidentified as leptons, muons from light-meson decays in flight, or electrons from unidentified conversions of photons in jets. Depending on the signal region, this background is dominated by \(\mathrm{t}\overline{\mathrm{t}}\) and \(\mathrm {W}\)+jets processes; it represents the largest background for regions with low \(M_\mathrm {T}^{\text {min}}\) and low \(H_{\mathrm {T}}\).

  • SM processes with SS dileptons: Standard model processes that yield an SS lepton pair include multi-boson production (considering \(\mathrm {W}\), \(\mathrm{Z}\), \(\mathrm{H}\), and prompt \(\gamma \)), single boson production in association with a \(\mathrm{t}\overline{\mathrm{t}}\) pair, and double-parton scattering. The dominant sources are \(\mathrm {W}\mathrm{Z} \) and \(\mathrm{t}\overline{\mathrm{t}} \mathrm {W}\) production, which contribute primarily to SRs with zero and \({\ge }1\) b jets, respectively. \(\mathrm {W}\mathrm{Z} \) events contribute to the background when the \(\mathrm{Z} \) boson decays leptonically and is off-shell, when one of the \(\mathrm{Z} \)-boson decay leptons is not identified, or when the \(\mathrm{Z} \) boson decays to \(\tau \) leptons that result in a semileptonic final state. SM processes with SS dileptons are the largest background in the signal regions defined by tight kinematic selections.

  • Charge misidentification: Charge misidentification arises from events with opposite-sign isolated leptons in which the charge of an electron is misidentified, mostly due to severe bremsstrahlung in the tracker material. Overall, this is a small background.

The nonprompt lepton background is estimated from data using the “tight-to-loose” ratio method, which was employed in previous versions of the analysis [1923] but has been improved for the current study. It is based on a control sample of events (application region) where one lepton fails the nominal (tight) selection but passes the loose requirements, defined by relaxing the isolation selection for muons, and both the isolation and identification requirements for electrons. Events in this control region are reweighted by the factor \({\epsilon _{\mathrm {TL}}/(1-\epsilon _{\mathrm {TL}})}\), where \(\epsilon _{\mathrm {TL}}\) is the probability for a nonprompt lepton that satisfies the loose selection to also satisfy the tight selection [19]. Its value is measured in a multijet-enriched data set (measurement region), using events from single-lepton triggers after applying a selection designed to suppress electroweak processes (Drell–Yan and \(\mathrm {W}\)+jets) and after subtracting their residual contribution; this selection requires only one lepton in the event, \(E_{\mathrm {T}}^{\text {miss}} <20\,\text {GeV} \), and \(M_\mathrm {T} <20\,\text {GeV} \). The measurement is made as a function of the lepton \(p_{\mathrm {T}}\) and \(\eta \), separately for each lepton flavor (\(\mu \) or \(\mathrm {e}\)) and trigger (with or without isolation).

The method assumes that \(\epsilon _{\mathrm {TL}}\) has the same value in the measurement and application regions. The main sources of discrepancy are identified as differences in the momentum spectrum and the flavor of the parton producing the nonprompt lepton. These two effects are mitigated in the following way. First, \(\epsilon _{\mathrm {TL}}\) is parameterized as a function of \(p_{\mathrm {T}} ^\text {corr}\), defined as the lepton \(p_{\mathrm {T}}\) plus the energy in the isolation cone exceeding the isolation threshold value – this quantity is highly correlated with the mother parton \(p_{\mathrm {T}}\), and thus the parameterization is robust against mother parton \(p_{\mathrm {T}}\) variations. The second effect, i.e., flavor dependence, is relevant for electrons only: while nonprompt muons originate predominantly from heavy-flavor decays, nonprompt electrons receive sizable contributions from misidentified hadrons and conversions. The effect of variations in the flavor composition is suppressed by adjusting the loose electron identification criteria so that the numerical value of \(\epsilon _{\mathrm {TL}}\) for electrons from light flavors matches that for electrons from heavy flavors. The loose lepton selection is defined based on MC studies, but we verify that \(\epsilon _{\mathrm {TL}}\) is not significantly different in data events with and without b jets.

As a cross-check of the prediction, an alternative \(\epsilon _{\mathrm {TL}}\) measurement, similar to that described in Ref. [49], is performed in the dilepton control region where one of the leptons fails the impact parameter requirement. The predictions from the two methods are found to be consistent, both in MC samples and in data.

The background from SM processes with a prompt SS lepton pair is evaluated from simulation, accounting for both theoretical and experimental uncertainties. The \(\mathrm {W}\mathrm{Z} \) background is normalized to data in a control region requiring at least two jets, no b jets, \(E_{\mathrm {T}}^{\text {miss}} >30\,\text {GeV} \), and three leptons, where two of the leptons form a same-flavor, opposite-sign pair with an invariant mass within 15\(\,\text {GeV}\) of the Z boson mass; the measured normalization factor is found to be compatible with unity within about one standard deviation. The MC simulation of \(\mathrm {W}\mathrm{Z} \) production is used to relate the number of expected WZ events in the signal regions to the WZ event yield in the control region.

Finally, the charge misidentification background is estimated by reweighting events with opposite-sign lepton pairs by the charge misidentification probability. For electrons this probability is obtained from simulated \(\mathrm{t}\overline{\mathrm{t}}\) events and from \({\mathrm {e}^\pm \mathrm {e}^\pm }\) data in the Z mass window, and it lies in the range 10\(^{-5}\)–10\(^{-3}\) depending on the electron \(p_{\mathrm {T}}\) and \(\eta \). Studies of simulated events indicate that the muon charge misidentification probability is negligible.

6 Systematic uncertainties

Systematic uncertainties can affect both the overall normalization and the relative population of signal and background processes. A summary of their effects on the SR yields is given in Table 5.

Table 5 Summary of systematic uncertainties in the event yields in the SRs. The first six uncertainties are related to experimental factors for all processes whose yield is estimated from simulation; the next five are uncertainties in these yields related to the event simulation process itself. The last three uncertainties are related to background processes whose yield is estimated from data
Fig. 2
figure 2

Distributions of the main analysis variables: \(H_{\mathrm {T}}\)  (top), \(E_{\mathrm {T}}^{\text {miss}}\)  (middle left), \(M_\mathrm {T}^{\text {min}}\)  (middle right), \(N_\text {jet}\)  (bottom left), and \(N_{\mathrm{b}}\)  (bottom right), after a baseline selection requiring a pair of SS leptons, two jets, and either \(E_{\mathrm {T}}^{\text {miss}} > 30\,\text {GeV} \) or \(H_{\mathrm {T}} > 500\,\text {GeV} \). The last bin includes the overflow. The notation X+\(\gamma \) refers to processes with a prompt photon in the final state. The hatched area represents the total uncertainty in the background prediction. The upper panels show the ratio of the observed event yield to the background prediction

Fig. 3
figure 3

Event yields in the HH (top left), HL (top right), and LL (bottom) SRs. The notation X+\(\gamma \) refers to processes with a prompt photon in the final state. The hatched area represents the total uncertainty in the background prediction

Table 6 Expected number of background and observed events for the different SRs considered in this analysis

Experimental systematic uncertainties are mostly the consequence of differing event selection efficiencies in data and simulation. Lepton identification and trigger efficiencies are computed with the “tag-and-probe” technique [26, 27] with an uncertainty of 2 and 4 %, respectively. For signal samples, additional uncertainties of 4–10 % are included to account for differences in the lepton efficiency between the fast and Geant4-based simulations. The jet energy scale uncertainty varies between 2 and 8 %, depending on the jet \(p_{\mathrm {T}} \) and \(\eta \). Its impact is assessed by shifting the energy of each jet and propagating the variation to all dependent kinematic quantities (\(H_{\mathrm {T}}\), \(E_{\mathrm {T}}^{\text {miss}}\), \(N_\text {jet}\), \(N_{\mathrm{b}}\), and \(M_\mathrm {T}^{\text {min}}\)); correlation effects due to the migration of events from one SR to another are taken into account. These variations yield estimated uncertainties of 2–10 %. A similar approach is used to estimate the uncertainties associated with the \(\mathrm{b} \) tagging efficiencies for light-flavor and b quark jets [36], which are parameterized as a function of \(p_{\mathrm {T}} \) and \(\eta \) and are found to be of order 5 % for the highly populated SRs. The uncertainty in the modeling of pileup is 1–5 % depending on the SR. The uncertainty in the integrated luminosity is 2.7 % [50].

Fig. 4
figure 4

Exclusion regions at the 95 % CL in the \(m_{\widetilde{\chi }^{0}_{1}}\) versus \(m_{\widetilde{\mathrm{g}}}\) plane for the \(\mathrm {T}1\mathrm {tttt}\)  (left) and \(\mathrm {T}5\mathrm {ttbb}\mathrm {W}\mathrm {W}\)  (right) models, where for the \(\mathrm {T}5\mathrm {ttbb}\mathrm {W}\mathrm {W}\) model \(m_{\widetilde{\chi }^\pm _{1}} = m_{\widetilde{\chi }^{0}_{1}} + 5\,\text {GeV} \). The right-hand side color scale indicates the excluded cross section values for a given point in the SUSY particle mass plane. The solid, black curves represent the observed exclusion limits assuming the NLO+NLL cross sections (thick line), or their variations of ±1 standard deviation (thin lines). The dashed, red curves show the expected limits with the corresponding ±1 standard deviation experimental uncertainties. Excluded regions are to the left and below the limit curves

The background sources estimated from simulation are subject to theoretical uncertainties related to unknown higher-order effects and to uncertainties in the knowledge of the PDFs. The former are estimated by simultaneously varying the renormalization and factorization scales up and down by a factor of two. The effect on the overall cross section is found to be 13 % for \(\mathrm{t}\overline{\mathrm{t}} \mathrm {W}\) events and 11 % for \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \) events, while the effect on the acceptance for the various SRs amounts to 3–8 % depending on \(H_{\mathrm {T}}\). The magnitude of the uncertainty related to the PDFs is obtained using variations of the NNPDF3.0 set [40]. The overall uncertainty is \({\sim } 4~\%\) for the \(\mathrm{t}\overline{\mathrm{t}} \mathrm {W}\) and \(\mathrm{t}\overline{\mathrm{t}} \mathrm{Z} \) samples. Theoretical uncertainties are also considered for the remaining minor backgrounds estimated from simulation: a similar procedure is used for the \(\mathrm {W}^{\pm }\mathrm {W}^{\pm }\) process, leading to an overall uncertainty of 30 %, while a 50 % uncertainty is assigned to processes with a prompt \(\gamma \) and to the sum of the other rare processes. For all backgrounds estimated from simulation we account for the statistical uncertainty of the MC samples.

The remaining sources of uncertainty are those related to the methods that are used to estimate the nonprompt lepton, charge misidentification, and \(\mathrm {W}\mathrm{Z} \) backgrounds. An overall normalization uncertainty of 30 % is assigned to the nonprompt lepton background prediction. This uncertainty accounts for the performance of the method on simulated data and for the differences in the prediction from the two alternative procedures described in Sect. 5. An additional uncertainty is associated with the subtraction procedure to remove Drell–Yan and \(\mathrm {W}\)+jets events from the measurement region; the overall effect on the nonprompt lepton background yield is 1–20 %, depending on the SR considered, and is larger for high-\(p_{\mathrm {T}}\) leptons. Finally, we account for the statistical uncertainty in the number of events observed in the application region.

The background from charge misidentification is assigned a systematic uncertainty of 26 %, which corresponds to the difference between the \({\mathrm {e}^\pm \mathrm {e}^\pm }\) event yield in the Z mass window in data and simulation.

The uncertainty in the \(\mathrm {W}\mathrm{Z} \) background is measured to be 30 % in the control region. It includes statistical uncertainties and systematic uncertainties due to non-\(\mathrm {W}\mathrm{Z} \) background subtraction. Using the same procedure as described above, uncertainties in the extrapolation from the control to the signal regions are assessed from the propagation of the uncertainty in the jet energy scale and in the \(\mathrm{b} \) tagging efficiencies.

Fig. 5
figure 5

Exclusion regions at the 95 % CL in the plane of \(m_{\widetilde{\chi }^{0}_{1}}\) versus \(m_{\widetilde{\mathrm{g}}}\) for models with the gluino decaying to an on-shell top squark: \(\mathrm {T}5\mathrm {tttt}\) with \(m_{{\widetilde{\mathrm{t}} _1}} = m_{\widetilde{\chi }^{0}_{1}} + m_{\mathrm{t}}\) (top), \(\mathrm {T}5\mathrm {tttt}\) with \(m_{{\widetilde{\mathrm{t}} _1}} = m_{\widetilde{\chi }^{0}_{1}} + 20\,\text {GeV} \) (bottom left), and \(\mathrm {T}5\mathrm {ttcc}\) with \(m_{{\widetilde{\mathrm{t}} _1}} = m_{\widetilde{\chi }^{0}_{1}} + 20\,\text {GeV} \) (bottom right). For a description of the notation, see Fig. 4

7 Results

Distributions of the five kinematic variables used to define the SRs, \(H_{\mathrm {T}}\), \(E_{\mathrm {T}}^{\text {miss}}\), \(M_\mathrm {T}^{\text {min}}\), \(N_\text {jet}\), and \(N_{\mathrm{b}}\), are shown in Fig. 2 after a baseline selection requiring a pair of SS leptons, two jets, and either \(E_{\mathrm {T}}^{\text {miss}} > 30\,\text {GeV} \) or \(H_{\mathrm {T}} > 500\,\text {GeV} \). The results are shown in comparison to the background prediction. The event yields in the SRs after the full selection are presented in Fig. 3 and in Table 6; no significant deviation from the SM background prediction is observed. The largest local significances are 2.2 and 1.8 standard deviations in HL SR8 and in HH SR10, respectively.

The results of the search are used to constrain the benchmark SUSY models presented in Sect. 4. For each mass point in the SUSY particle mass spectrum, results from all SRs are combined to extract cross section exclusion limits at the 95 % confidence level (CL), using the asymptotic formulation of the modified frequentist CL\(_\mathrm {s}\) criterion [5154]. Signal and background uncertainties are included as log-normal nuisance parameters and, when relevant, take into account correlation effects among different SRs and/or different processes. Exclusion contours make use of the cross section values calculated at NLO plus next-to-leading logarithmic (NLL) accuracy, assuming that all SUSY particles other than those included in the respective diagram are too heavy to participate in the interaction [5560]. In general, the SR with the largest sensitivity is HH SR31, which requires \(E_{\mathrm {T}}^{\text {miss}} >300\,\text {GeV} \) and is inclusive in the other variables. However, depending on the model and the region of parameter space, other SRs contribute significantly to the total sensitivity: for instance, a considerable contribution comes from HL SR25 in case of signal models with a soft lepton, from HH SR32 and HL SR26 in case of high \(H_{\mathrm {T}}\), from HH SR3 and SR8 in case of no b jets, and from HH SR24 and SR26 in case of 2 or more b jets.

Results for models with gluinos decaying to virtual third generation squarks are shown in Fig. 4 as a function of the gluino and LSP masses. For the \(\mathrm {T}1\mathrm {tttt}\) model (Fig. 4-left), in the regions of the SUSY parameter space with a large mass difference between the gluino and the LSP, the results are rather stable with respect to LSP mass variations, and gluino masses up to 1300\(\,\text {GeV}\) are excluded. Near the kinematic threshold \(m_{\widetilde{\mathrm{g}}}-m_{\widetilde{\chi }^{0}_{1}} = 2(m_{\mathrm {W}}+m_{\mathrm{b}})\), the gluino mass limit becomes weaker and is reduced to 1050\(\,\text {GeV}\) for an LSP mass of 800\(\,\text {GeV}\). Results for the \(\mathrm {T}5\mathrm {ttbb}\mathrm {W}\mathrm {W}\) model with nearly degenerate \(\widetilde{\chi }^\pm _{1}\) and \(\widetilde{\chi }^{0}_{1}\) masses are shown in Fig. 4-right; the limit on the gluino mass lies in the range 950–1100\(\,\text {GeV}\) except for very small \(\widetilde{\chi }^\pm _{1}\) and \(\widetilde{\chi }^{0}_{1}\) masses, where the sensitivity increases because of the large Lorentz boost of the leptons from the \(\widetilde{\chi }^\pm _{1}\) decay.

Fig. 6
figure 6

Exclusion regions at the 95 % CL in the plane of \(m_{\widetilde{\chi }^\pm _{1}}\) versus \(m_{{\widetilde{\mathrm{b}} _1}}\) for the \(\mathrm {T}6\mathrm {tt}\mathrm {W}\mathrm {W}\) model with \(m_{\widetilde{\chi }^{0}_{1}}=50\,\text {GeV} \). For a description of the notation, see Fig. 4

Results for models with a gluino decaying to an on-shell top squark are shown in Fig. 5 as a function of the gluino and LSP masses. For the \(\mathrm {T}5\mathrm {tttt}\) model (Fig. 5-top), for which we take \(m_{{\widetilde{\mathrm{t}} _1}} = m_{\widetilde{\chi }^{0}_{1}} + m_{\mathrm{t}}\), similar exclusion curves are obtained as for the T1tttt model in Fig. 4-left because the production cross section and the final-state particles are the same. The limit becomes weaker when there is a small mass difference between the top squark and the LSP: for \(m_{{\widetilde{\mathrm{t}} _1}} - m_{\widetilde{\chi }^{0}_{1}} = 20\,\text {GeV} \), the limit on the gluino mass is 1140\(\,\text {GeV}\) for small LSP masses and about 850\(\,\text {GeV}\) for \(m_{\widetilde{\chi }^{0}_{1}} = 700\,\text {GeV} \) (Fig. 5-bottom left). In the case of the \(\mathrm {T}5\mathrm {ttcc}\) model with the same SUSY particle mass values, the sensitivity is slightly reduced because of the smaller number of leptons and b jets in the final state (Fig. 5-bottom right).

Figure 6 shows the results for b squark production in the \(\mathrm {T}6\mathrm {tt}\mathrm {W}\mathrm {W}\) model in the chargino (\(\widetilde{\chi }^\pm _{1}\)) versus b squark mass plane, where the LSP mass is assumed to be \(m_{\widetilde{\chi }^{0}_{1}}=50\,\text {GeV} \). For chargino masses up to 550\(\,\text {GeV}\), b squark masses below 680\(\,\text {GeV}\) are excluded. The limit on the b squark mass is reduced to 500\(\,\text {GeV}\) in regions where \(m_{\widetilde{\chi }^\pm _{1}}\) is within 100\(\,\text {GeV}\) of \(m_{{\widetilde{\mathrm{b}} _1}}\), while a milder reduction is observed in regions where the difference between \(m_{\widetilde{\chi }^\pm _{1}}\) and \(m_{\widetilde{\chi }^{0}_{1}}\) is less than 150\(\,\text {GeV}\).

Results for the \(\mathrm {T}5\mathrm {qqqq}\mathrm {W}\mathrm {W}\) model are shown in Fig. 7 as a function of the gluino and LSP masses, with two different assumptions for the chargino mass: it is either assumed to be the average of \(m_{\widetilde{\mathrm{g}}}\) and \(m_{\widetilde{\chi }^{0}_{1}}\), or it is set to \(m_{\widetilde{\chi }^{0}_{1}} + 20\,\text {GeV} \). In the first case (Fig. 7-left), the exclusion limit on gluino masses exceeds 1100\(\,\text {GeV}\) for LSP masses up to 400\(\,\text {GeV}\); for larger LSP masses the limit is reduced to 830\(\,\text {GeV}\) at \(m_{\widetilde{\chi }^{0}_{1}} = 700\,\text {GeV} \). In the second case (Fig. 7-right), due to the smaller mass difference, leptons in the final state are soft and thus the sensitivity is reduced.

Fig. 7
figure 7

Exclusion regions at the 95 % CL in the plane of \(m_{\widetilde{\chi }^{0}_{1}}\) versus \(m_{\widetilde{\mathrm{g}}}\) for the \(\mathrm {T}5\mathrm {qqqq}\mathrm {W}\mathrm {W}\) model with \(m_{\widetilde{\chi }^\pm _{1}}=0.5(m_{\widetilde{\mathrm{g}}} + m_{\widetilde{\chi }^{0}_{1}})\) (left) and with \(m_{\widetilde{\chi }^\pm _{1}} = m_{\widetilde{\chi }^{0}_{1}} + 20\,\text {GeV} \) (right). For a description of the notation, see Fig. 4

The results of the search are also used to set 95 % CL upper limits on the double \(\mathrm{t}\overline{\mathrm{t}}\) production cross section, whose SM value computed at NLO precision [37] is 9.1\(\,\text {fb}\). The upper limit on \(\sigma ({\mathrm {p}\mathrm {p}} \rightarrow {\mathrm{t}\overline{\mathrm{t}} \mathrm{t}\overline{\mathrm{t}}})\) is found to be 119\(\,\text {fb}\), with an expected result of \(102^{+57}_{-35}\,\text {fb} \). With the current integrated luminosity, the sensitivity to this signature is limited by the statistical precision.

Limits at the 95 % CL on the SS top quark pair production cross section are determined using events that satisfy the baseline selection categorized according to number of b jets (Fig. 2-bottom right); apart from the charge requirement, the detector acceptance and the selection efficiency for the signal are assumed to match those of SM \(\mathrm{t}\overline{\mathrm{t}}\) events. The observed (expected) upper limit on \(\sigma (\mathrm {p}\mathrm {p}\rightarrow {\mathrm{t} \mathrm{t}}) + \sigma ({\mathrm {p}\mathrm {p}} \rightarrow {\overline{\mathrm{t}} \overline{\mathrm{t}}})\) is 1.7\(\,\text {pb}\)  (\(1.5^{+0.7}_{-0.4}\,\text {pb} \)).

Finally, we report model independent limits on the product of cross section, detector acceptance, and selection efficiency, \(\sigma \! \mathcal {A} \epsilon \), for the production of an SS dilepton pair in the two inclusive HH regions, SR31 and SR32, using the CL\(_\mathrm {s}\) criterion without the asymptotic approximation. In SR31 the limit is computed as a function of the minimum threshold on \(E_{\mathrm {T}}^{\text {miss}}\) for \(H_{\mathrm {T}} >300\,\text {GeV} \), while in SR32 it is computed as a function of the \(H_{\mathrm {T}}\) threshold for \(50<E_{\mathrm {T}}^{\text {miss}} <300\,\text {GeV} \). The results are shown in Fig. 8, where, in regions with no observed events, the minimum limit value of 1.3\(\,\text {fb}\) is obtained. These limits can be used to test additional BSM models, after accounting for the event selection efficiency. The lepton efficiency ranges between 70–85 % (45–70 %) for generated muons (electrons) with \(|\eta |<2.4\) and \(p_{\mathrm {T}} >25\,\text {GeV} \), increasing as a function of \(p_{\mathrm {T}}\) and converging to the maximum value for \(p_{\mathrm {T}} >60\,\text {GeV} \); the efficiencies of the \(H_{\mathrm {T}}\) and \(E_{\mathrm {T}}^{\text {miss}}\) requirements are mostly determined by the jet energy and \(E_{\mathrm {T}}^{\text {miss}}\) resolutions, which are discussed in Refs. [33, 35].

Fig. 8
figure 8

Limits on the product of cross section, detector acceptance, and selection efficiency, \(\sigma \! \mathcal {A} \epsilon \), for the production of an SS dilepton pair as a function of \(E_{\mathrm {T}}^{\text {miss}}\) in HH SR31 (left) and of \(H_{\mathrm {T}}\) in HH SR32 (right)

8 Summary

The results of a search for new physics in same-sign dilepton events using the CMS detector at the LHC and based on a data sample of pp collisions at \(\sqrt{s} = 13\,\text {TeV} \), corresponding to an integrated luminosity of 2.3\(\,\text {fb}^{-1}\), are presented. The data are analyzed in nonoverlapping signal regions defined with different selections on lepton and event kinematic variables, as well as jet and b quark jet multiplicities.

No significant deviation from the standard model expectations is observed. The results are used to set limits on the production of supersymmetric particles in various simplified models. Gluino and bottom squark masses are excluded at the 95 % confidence level up to 1300 and 680\(\,\text {GeV}\), respectively. These results extend the limits obtained in the previous version of the analysis [23] by about 250\(\,\text {GeV}\) on the gluino mass, and 150\(\,\text {GeV}\) on the bottom squark mass. In addition, 95 % confidence level upper limits of 119\(\,\text {fb}\) and 1.7\(\,\text {pb}\) are set on the cross sections for the production of two top quark-antiquark pairs and for the production of two SS top quarks, respectively. Model independent limits and selection efficiencies are provided to allow further interpretations of the results, using alternative models to those examined here.