## Abstract

We discuss the most important Physics thus far extracted from studies of *B* meson decays. Measurements of the four CP violating angles accessible in *B* decay are reviewed as well as direct CP violation. A detailed discussion of the measurements of the CKM elements *V*_{
cb
}and *V*_{
ub
}from semileptonic decays is given, and the differences between resulting values using inclusive decays versus exclusive decays is discussed. Measurements of "rare" decays are also reviewed. We point out where CP violating and rare decays could lead to observations of physics beyond that of the Standard Model in future experiments. If such physics is found by directly observation of new particles, e.g. in LHC experiments, *B* decays can play a decisive role in interpreting the nature of these particles.

**PACS Codes:** 13.25.Hw, 14.40.Nd, 14.65.Fy

## 1 Introduction

The forces of nature generally reveal their properties by how they act on matter. On the most fundamental material scale that we are aware of matter is formed from fermions. These take two forms, leptonic matter and quark matter. The former do not have any strong interactions, that is they lack a property called "color charge", which allow quarks to bind together either mesonically or baryonically. New and therefore as yet unknown forces could effect both leptons and quarks. Here, we concentrate on how such forces effect quarks, especially the *b* quark.

Light matter consists mostly of *u*, *d* and *s* quarks. In 1963 Cabibbo showed that weak interactions of mesons and baryons containing *s* quarks were suppressed with respect to those without *s* quarks by an amount tan *θ*_{
C
}, where the "Cabibbo" angle *θ*_{
C
}must be determined experimentally [1]. The *s* was further shown to have an important and at that time a mystifying role, by the discovery of CP violation in ${K}_{L}^{0}$ decays in 1964 [2, 3]. (In *CP*, *C* stands for charge conservation invariance and *P* for parity invariance. The operation of *C* takes a particle to anti-particle state and the operation of *P* changes left to right. [4–6].) When the *c* quark was discovered in Nov. 1974 [7, 8] (though its existence was speculated earlier [9–11]), it became clear that *θ*_{
C
}was the mixing angle between two quark doublets, (*c*, *s*) and (*u*, *d*). However, it is not possible to generate a CP violating phase with only two quark doublets.

This was recognized even before the discovery of the charm quark by Kobayashi and Maskawa, who postulated the existence of yet another quark doublet (*b*, *t*) [12], in work for which they were awarded the Nobel Prize in 2008. While the *t* quark is the heaviest, having a mass of 173 GeV, they are difficult to produce and decay before they can form a hadron, thus excluding many otherwise possible studies. Much interesting work has been done with the *s* and *c* quarks, but in this article we discuss the physics of the *b* quark, which turns out to be the most interesting of the six quarks to study.

### 1.1 How *B*'s Fit Into the Standard Model

First we will discuss how particles formed with *b*-quarks fit into current paradigm of particle physics, the "Standard Model" (SM) [13–15]. The SM has at its basis the gauge group SU(3)xSU(2)xU(1). The first term corresponds to the strong interaction and SU(3) describes the octet of colored gluons which are the strong force carriers of quantum chromodynamics. SU(2)xU(1) describes the weak interaction and is the product of weak isospin and hypercharge. We speak of the fundamental objects being spin-1/2 quarks and leptons and the force carriers generally spin-1 objects. The spin-0 Higgs boson, yet to be discovered, is necessary for generating mass (to view a cartoon of the mechanism of Higgs mass generation, see [16]).

Particles containing *b* quarks can be *B*^{0}, *B*^{-}, *B*_{
s
}, or *B*_{
c
}mesons, depending on whether the light anti-quark that it pairs with is $\overline{d}$, $\overline{u}$, $\overline{s}$, or $\overline{c}$ or a baryon containing two other quarks. Mesons containing *b* and $\overline{b}$ quarks are also interesting especially for studies of quantum chromodynamics, but will not be discussed further in this article, as we will concentrate on weak decays and discuss strong interactions as an important and necessary complication that must be understood in many cases to extract information on fundamental *b* quark couplings.

The quarks come in three repetitions called generations, as do the leptons. The first generation is *d* *u*, the second *s* *c* and the third *b* *t*. In the second and third generations the charge +2/3 quark is heavier than the charge -1/3; the first generation has two very light quarks on the order of a few MeV with the *d* thought to be a bit heavier. (Isospin invariance is related to the equality of *u* and *d* quark masses. The PDG [17] gives the *u* quark mass between 1.5–3.3 MeV and the *d* mass between 3.5–6.0 MeV, where the large range indicates the considerable uncertainties.) Decays usually proceed within generations, so the *c* decays predominantly to the *s* quark via the quark level process *c* → *W*^{+} *s*, though some decays do go to the first generation as *c* → *W*^{+} *d*. The ratio of these amplitudes approximate the Cabibbo angle discussed earlier.

The mixing matrix proposed by Kobayshi and Maskawa [12] parameterizes the mixing between the mass eigenstates and weak eigenstates as couplings between the charge +2/3 and -1/3 quarks. We use here the Wolfenstein approximation [18] good to order *λ*^{4}:

In the Standard Model *A*, *λ*, *ρ*, and *η* are fundamental constants of nature like *G*, or *α*_{
EM
}; *η* multiplies the imaginary *i* and is responsible for all Standard Model CP violation. We know *λ* = 0.226, *A* ~0.8 and we have constraints on *ρ* and *η*. Often the variables $\overline{\rho}$ and $\overline{\eta}$ are used where

where the definition in terms of the CKM matrix elements is correct to all orders in *λ* [19] (see also [20]).

Applying unitarity constraints allows us to construct the six independent triangles shown in Figure 1. Another basis for the CKM matrix are four angles labeled as *χ*, *χ'* and any two of *α*, *β* and *γ* since *α* + *β* + *γ* = *π* [21, 22]. (These angles are also shown in Figure 1.) CP violation measurements make use of these angles. (The Belle collaboration defines *ϕ*_{2} ≡ *α*, *ϕ*_{1} ≡ *β*, and *ϕ*_{3} ≡ *γ*.)

*B* meson decays can occur through various processes. Some decay diagrams with intermediate charged vector bosons are shown in Figure 2. The simple spectator diagram shown Figure 2(a) has by far the largest rate. Semileptonic decays proceed through this diagram, and allow us to measure the CKM couplings *V*_{
cb
}and *V*_{
ub
}by considering only the hadronic uncertainties due to the spectator quark. The color suppressed diagram Figure 2(b) exists only for hadronic decays. It can occur only when the colors of the quarks from the virtual *W*^{-} decay match those of the initial *B* meson. Since this happens only 1/3 of the time in amplitude, the rate is down by almost an order of magnitude from the spectator decays. The annihilation Figure 2(c) describes the important decay *B*^{-} → *τ*^{-} $\overline{\nu}$ and will be discussed in detail later. The *W* exchange Figure 2(d) diagram is small. The box diagram Figure 2(e) is the source of mixing in the *B* system. An analogous diagram exists also for the *B*_{
s
}meson. Finally the Penguin diagram Figure 2(f) is an example of one of several loop diagrams leading to "rare decays" that will be discussed in detail in a subsequent section.

#### 1.1.1 Dark Matter

"Dark Matter" was first shown to exist by Zwicky studying rotation curves of galaxies [23]. The motion could only be explained if there was massive cloud of matter that was not luminous. We still do not know what composes this dark matter, though hopes are it will be discovered at the LHC. An even more mysterious phenomena called "Dark Energy" may also have a connection to particle physics experiments [24], perhaps via "Extra Dimensions" [25].

#### 1.1.2 Baryogenesis

When the Universe began with the Big Bang, there was an equal amount of matter and antimatter. Now we have mostly matter. How did it happen? Sakharov gave three necessary conditions: Baryon ($\mathcal{B}$) number violation, departure from thermal equilibrium, and C and CP violation [26]. (The operation of Charge Conjugation (C) takes particle to anti-particle and Parity (P) takes a vector $\overrightarrow{r}$ to - $\overrightarrow{r}$.)

These criteria are all satisfied by the Standard Model. $\mathcal{B}$ is violated in Electroweak theory at high temperature, though baryon minus lepton number is conserved; in addition we need quantum tunneling, which is powerfully suppressed at the low temperatures that we now have. Non-thermal equilibrium is provided by the electroweak phase transition. C and CP are violated by weak interactions. However the violation is too small. The ratio of the number of baryons to the entropy in the observed part of the Universe needs to be ~5 × 10^{-11}, while the SM provides many orders of magnitude less. Therefore, there must be new physics [27].

#### 1.1.3 The Hierarchy Problem

Our worry is why the Planck scale at ~10^{19} GeV is so much higher than the scale at which we expect to find the Higgs Boson, ~100 GeV. As Lisa Randall said [28] "The gist of it is that the universe seems to have two entirely different mass scales, and we don't understand why they are so different. There's what's called the Planck scale, which is associated with gravitational interactions. It's a huge mass scale, but because gravitational forces are proportional to one over the mass squared, that means gravity is a very weak interaction. In units of GeV, which is how we measure masses, the Planck scale is 10^{19} GeV. Then there's the electroweak scale, which sets the masses for the W and Z bosons. These are particles that are similar to the photons of electromagnetism and which we have observed and studied well. They have a mass of about 100 GeV. So the hierarchy problem, in its simplest manifestation, is how can you have these particles be so light when the other scale is so big." We expect the explanation lies in physics beyond the Standard Model [29].

### 1.2 *B*Decays as Probes for New Physics

When we make measurements on *B* decays we observe the contributions of SM processes as well as any other processes that may be due to physics beyond the SM or New Physics (NP). Other diagrams would appear adding to those in Figure 2 with new intermediate particles. Thus, when it is declared by those who say that there isn't any evidence of NP in *B* decays, we have to be very careful that we have not absorbed such new evidence into what we declare to be SM physics. There are several approaches that can be followed.

One approach is to simply predict the decay rate of a single process in the SM with known couplings and compare to the measurements. The classical case here is *b* → *s* *γ* and we will discuss this and other specific examples later. Another approach is make different measurements of the CKM parameters in different ways and see if they agree. This is normally done by measuring both angles and sides of the CKM triangle, but other quantities can also be used. This is the approach used by the CKM fitter [19] (see also [20]) and UT fit groups [30]. In yet a third approach, the exact same quantity can be measured in several ways, even if cannot be predicted in the SM. An example here is measuring the CP violating angle *β* using *B*^{0} → *J*/*ψ K*_{
S
}decays that proceed through the diagram in Figure 2(b), at least in the SM, and another process that uses the "Penguin" diagram in Figure 2(f), e.g. *B*^{0} → *ϕ K*_{
S
}.

The punch line is that if new, more massive particles exist in a mass range accessible to the LHC then they MUST contribute to rare and CP violating *B* decays! Even if measurements are precise enough only to limit the size of these effects, the properties of these new particles will be much better understood. This is the raison d'être for the further study of *B* decays.

## 2 Measurements of mixing and CP violation

### 2.1 Neutral *B*Meson Mixing

Neutral *B* mesons can transform into their anti-particles before they decay. The diagrams for this process are shown in Figure 3 for the *B*_{
d
}. There is a similar diagram for the *B*_{
s
}. Although *u*, *c* and *t* quark exchanges are all shown, the *t* quark plays a dominant role mainly due to its mass, as the amplitude of this process is proportional to the mass of the exchanged fermion.

Under the weak interactions the eigenstates of flavor degenerate in pure QCD can mix. Let the quantum mechanical basis vectors be {|1⟩, |2⟩} ≡ {|*B*^{0}⟩, |${\overline{B}}^{0}$⟩}. The Hamiltonian is then

The Schrödinger equation is

Diagonalizing we have

ΔΓ = Γ_{
L
}- Γ_{
H
}= 2|Γ_{12}|cos *ϕ*

where *H* refers to the heavier and *L* the lighter of the two weak eigenstates, and *ϕ* = arg(-*M*_{12}/Γ_{12}). We expect that ΔΓ is very small for *B*^{0} mesons but should be significant for *B*_{
s
}mesons.

*B*_{
d
}mixing was first discovered by the ARGUS experiment [31] (There was a previous measurement by UA1 indicating mixing for a mixture of ${B}_{d}^{0}$ and ${B}_{s}^{0}$[32]). At the time it was quite a surprise, since the top-quark mass, *m*_{
t
}, was thought to be in the 30 GeV range. Since *b*-flavored hadrons are produced in pairs, it is possible to observe a mixed event by having both *B*'s decay semileptonically. Thus the number of events where both *B*s decay into leptons of the same sign is indicative of mixing. We can define *R* as the ratio of events with same-sign leptons to the sum of same-sign plus opposite-sign dilepton events. This is related to the mixing probability. The OPAL data for *R* are shown in Figure 4[33].

Data from many experiments has been combined by "The Heavy Flavor Averaging Group," (HFAG) to obtain an average value Δ*m*_{
d
}= (0.507 ± 0.004) × 10^{12} ps^{-1} [34].

The probability of mixing is related to the CKM matrix elements as [3, 35]

where *B*_{
B
}is a parameter related to the probability of the *d* and $\overline{b}$ quarks forming a hadron and must be estimated theoretically, *F* is a known function which increases approximately as ${m}_{t}^{2}$, and *η*_{
QCD
}is a QCD correction, with value about 0.8. By far the largest uncertainty arises from the decay constant, *f*_{
B
}.

In principle *f*_{
B
}can be measured. The decay rate of the annihilation process *B*^{-} → ℓ^{-} $\overline{\nu}$ is proportional to the product of ${f}_{B}^{2}|{V}_{ub}{|}^{2}$. Of course there is a substantial uncertainty associated with |*V*_{
ub
}|. The experimental evidence for *B*^{-} → *τ*^{-} $\overline{\nu}$ is discussed in Section 5.1, and substantial uncertainty exists in the branching ratio measurement. Thus, we need to rely on theory for a value of *f*_{
B
}.

The ratio of *B*_{
s
}to *B*_{
d
}mixing frequency, however, provides a better situation in terms of reducing model dependent errors. Using Eq. 7 for *B*_{
d
}mixing and an analogous relation for *B*_{
s
}mixing and then dividing them results in

The CKM terms are

ignoring the higher order term in *V*_{
ts
}. Solving the ratio of the two above equations for $\left(\frac{1}{\lambda}\right)\frac{\left|{V}_{td}\right|}{\left|{V}_{ts}\right|}$ gives a circle centered at (1,0) in the *ρ* - *η* plane whose radius depends on *x*_{
d
}/*x*_{
s
}. The theoretical errors are now due to the difference between having a light quark versus a strange quark in the *B* meson, called SU(3) splitting.

For many years experiments at the *Z*^{0} using *e*^{+} *e*^{-} colliders at both LEP and the SLC had set lower limits on ${B}_{s}^{0}$ mixing [17]. In 2006 *B*_{
s
}mixing was measured by the CDF collaboration [36, 37]. The D0 collaboration had previously presented both a lower and an upper limit [38].

We now discuss the CDF measurement. The probability, $\mathcal{P}$(*t*) for a *B*_{
s
}to oscillate into a ${\overline{B}}_{s}$ is given as

where *t* is the proper time.

An amplitude *A* for each test frequency *ω*, is defined as [39]

For each frequency the expected result is either zero for no mixing, or one for mixing. No other value is physical, although measurement errors admit other values. Figure 5 shows the CDF results.

At Δ*m*_{
s
}= 17.75 ps^{-1}, the observed amplitude *A* = 1.21 ± 0.20 (stat.) is consistent with unity, indicating that the data are compatible with *B*_{
s
}${\overline{B}}_{s}$ oscillations with that frequency, while the amplitude is inconsistent with zero: *A*/*σ*_{
A
}= 6.05, where *σ*_{
A
}is the statistical uncertainty on *A*, the ratio has negligible systematic uncertainties. (This then is called a "6*σ* effect.) The measured value is

where the first error is statistical and the second systematic.

In order to translate the mixing measurements to constraints on the CKM parameters *ρ* and *η*, we need to use theoretical values for the ratios $\frac{{B}_{B}}{{B}_{{B}_{s}}}$ and $\frac{{f}_{B}}{{f}_{{B}_{s}}}$ in Eq. 8. It is interesting that in practice it is not possible to measure either of these numbers directly. It is usually assumed, however, that ${f}_{{B}^{+}}$, which could in principle be measured via the process *B*^{-} → *τ*^{-} *ν* is the same as *f*_{
B
}. There is no way to measure ${f}_{{B}_{s}}$. The charm system, on the other hand, provides us with both *D*^{+} → ℓ^{+} *ν* and ${D}_{s}^{+}$ → ℓ^{+} *ν*, and both rates have been measured. They also have been calculated in unquenched lattice quantum electrodynamics (QCD).

The combined efforts of the HPQCD and UKQCD collaborations predict ${f}_{{D}^{+}}$ = (207 ± 4) MeV [40], while the CLEO measurement is in astonishing agreement: (205.8 ± 8.5 ± 2.5) MeV [41]. Furthermore, the measurements of ${f}_{{D}_{s}^{+}}$ are not in such good agreement with the Follana *et al* calculation of (241 ± 3) MeV. The average of CLEO and Belle results as determined by Rosner and Stone is (273 ± 10) MeV [42]. (For updated results see [43, 44].) The discrepancy is at the 3 standard deviation level and not yet statistically significant, though it bears watching.

Unfortunately, the group that has calculated ${f}_{{D}^{+}}$ is not the same group that has determined ${f}_{{B}^{+}}$. The theoretical calculations used for the *B*_{
B
}terms and the *f*_{
B
}terms are summarized by Tantalo [45]. He suggests using values of ${f}_{{B}_{s}}$ = (268 ± 17 ± 20) MeV, ${f}_{{B}_{s}}$/*f*_{
B
}= 1.20 ± 0.02 ± 0.05, ${B}_{{B}_{s}}$ = 0.84 ± 0.03 ± 0.055 and *B*_{
B
}= 0.83 ± 0.01 ± 0.06. These numbers allow us to measure the length of one side of the CKM triangle using Eq. 8. Use of this measurement will be discussed in more detail in Section 6.

### 2.2 CP Violation in the *B*System

We have two quantum mechanical operators: Charge Conjugation, *C*, and Parity, *P*. When applied to a particle wavefunction *C* changes particle to antiparticle and vice-versa. Applying *P* to a wavefunction *ψ* (**r**) we have *P ψ* (**r**) = *ψ* (-**r**). The *P* operator can be thought of changing the natural coordinate system from right-handed to left-handed. If nature was blind to handedness, then *P* would be always conserved. By applying the *P* operator twice we end up with *P*^{2}*ψ* (**r**) = *ψ* (**r**), so the eigenvalues of *P* are ± 1. Therefore wave-functions, or particles represented by such wave-functions, have either intrinsic positive parity +1 (right-handed) or -1 (left-handed).

Weak interactions, characterized by a combination of vector minus axial-vector currents, are known to be left-handed. Therefore, handedness matters and its well known that Parity is maximally violated in weak decays [3]. Since *C* changes left-handed particles to right-handed anti-particles, the product *CP* symmetry could have been preserved, but nature decided otherwise. Different particle transitions involve the different *CP* violating angles shown in Figure 1. Measuring these independently allows comparisons with measurements of the sides of the triangle and any differences in constraints in *ρ* and *η* can be due to the presence of new physics.

Consider the case of a process *B* → *f* that goes via two amplitudes, $\mathcal{A}$ and $\mathcal{B}$ each of which has a strong part e. g. ${s}_{\mathcal{A}}$ and a weak part ${w}_{\mathcal{A}}$. Then we have

Any two amplitudes will do, though its better that they be of approximately equal size. Thus charged *B* decays can exhibit CP violation as well as neutral *B* decays. In some cases, we will see that it is possible to guarantee that $|\mathrm{sin}({s}_{\mathcal{A}}-{s}_{\mathcal{B}})|$ is unity, so we can get information on the weak phases. In the case of neutral *B* decays, mixing serves as the second amplitude.

#### 2.2.1 Formalism of CP Violation in Neutral *B*Decays

Consider the operations of Charge Conjugation, *C*, and Parity, *P*:

For neutral mesons we can construct the *CP* eigenstates

where

Since *B*^{0} and ${\overline{B}}^{0}$ can mix, the mass eigenstates are a superposition of *a*|*B*^{0}⟩ + *b*|${\overline{B}}^{0}$⟩ which obey the Schrodinger equation

If *CP* is not conserved then the eigenvectors, the mass eigenstates |*B*_{
L
}⟩ and |*B*_{
H
}⟩, are not the *CP* eigenstates but are

where

*CP* is violated if ϵ_{
B
}≠ 0, which occurs if |*q*/*p*| ≠ 1.

The time dependence of the mass eigenstates is

leading to the time evolution of the flavor eigenstates as

where *m* = (*m*_{
L
}+ *m*_{
H
})/2, Δ*m* = *m*_{
H
}- *m*_{
L
}and Γ = Γ_{
L
}≈ Γ_{
H
}, and *t* is the decay time in the *B*^{0} rest frame, the so-called "proper time". Note that the probability of a *B*^{0} decay as a function of *t* is given by ⟨*B*^{0} (*t*)|*B*^{0}(*t*)⟩*, and is a pure exponential, *e*^{-Γt/2}, in the absence of *CP* violation.

#### 2.2.2 *CP* Violation for *B*Via the Interference of Mixing and Decay

Here we choose a final state *f* which is accessible to both *B*^{0} and ${\overline{B}}^{0}$ decays [3]. The second amplitude necessary for interference is provided by mixing. Figure 6 shows the decay into *f* either directly or indirectly via mixing. It is necessary only that *f* be accessible directly from either state; however if *f* is a *CP* eigenstate the situation is far simpler. For *CP* eigenstates

*CP*|*f*_{
CP
}⟩ = ± |*f*_{
CP
}⟩.

It is useful to define the amplitudes

If $\left|\frac{\overline{A}}{A}\right|$ ≠ 1, then we have "direct" *CP* violation in the decay amplitude, which we will discuss in detail later. Here *CP* can be violated by having

which requires only that *λ* acquire a non-zero phase, i.e. |*λ*| could be unity and *CP* violation can occur.

Other useful variables, that are independent of any phase convention are

The first quantity can be related to CKM angles, while the second can be measured by the "semileptonic asymmetry," or for that matter in any flavor specific decay [46]:

for either *B*^{0} or for *B*_{
s
}mesons, separately.

A comment on neutral *B* production at *e*^{+} *e*^{-} colliders is in order. At the ϒ(4*S*) resonance there is coherent production of *B*^{0} ${\overline{B}}^{0}$ pairs. This puts the *B*'s in a *C* = -1 state. In hadron colliders, or at *e*^{+} *e*^{-} machines operating at the *Z*^{0}, the *B*'s are produced incoherently. For the rest of this article we will assume incoherent production except where explicitly noted.

The asymmetry, in the case of *CP* eigenstates, is defined as

which for |*q*/*p*| = 1 gives

For the cases where there is only one decay amplitude *A*, |*λ*| equals 1, and we have

Only the factor -Im*λ* contains information about the level of CP violation, the sine term is determined by *B* mixing. In fact, the time integrated asymmetry is given by

This is quite lucky for the study of *B*_{
d
}mesons, as the maximum size of the coefficient for any *x* is -0.5, close to the measured value of *x*_{
d
}.

Let us now find out how Im*λ* relates to the CKM parameters. Recall $\lambda =\frac{q}{p}\cdot \frac{\overline{A}}{A}$. The first term is the part that comes from mixing:

#### 2.2.3 Measurements of sin(2*β*)

To evaluate the decay part we need to consider specific final states. For example, consider the final state *J*/*ψ K*_{
s
}. The decay diagram is shown in Figure 7. In this case we do not get a phase from the decay part because

is real to order 1/*λ*^{4}.

In this case the final state is a state of negative *CP*, i.e. *CP*|*J*/*ψ K*_{
s
}⟩ = -|*J*/*ψ K*_{
s
}⟩. This introduces an additional minus sign in the result for Im*λ*. Before finishing discussion of this final state we need to consider in more detail the presence of the *K*_{
s
}in the final state. Since neutral kaons can mix, we pick up another mixing phase (similar diagrams as for *B*^{0}, see Figure 3). This term creates a phase given by

which is real to order *λ*^{4}. It necessary to include this term, however, since there are other formulations of the CKM matrix than Wolfenstein, which have the phase in a different location. It is important that the physics predictions not depend on the CKM convention. (Here we do not include *CP* violation in the neutral kaon since it is much smaller than what is expected in the *B* decay.)

In summary, for the case of *f* = *J*/*ψ K*_{
s
}, Im*λ* = - sin(2*β*). The angle *β* is the best measured *CP* violating angle measured in *B* meson decays. The process used is *B*^{0} → *J*/*ψ K*_{
S
}, although there is some data used with the *ψ* (2*S*) or with *K*_{
L
}(where the phase is + sin(2*β*)).

Although it is normally thought that only one decay amplitude contributes here, in fact one can look for the presence of another source of *CP* violation, presumably in the decay amplitude, by not assuming |*λ*| equals one in Eq. 36. Then the time dependence of the decay rate is given by

Thus ${a}_{{f}_{CP}}$ (*t*) has both sin(Δ*mt*) and cos(Δ*mt*) terms, and the coefficients of these terms can be measured. Let us assign the labels

(Note that the sign of the $\mathcal{S}$ term changes depending on the *CP* of the final state, but not the $\mathcal{C}$ term.)

The most precise measurements of sin(2*β*) have been made by the BaBar and Belle experiments [47, 48]. These measurements are made at the ϒ(4*S*) resonance using *e*^{+} *e*^{-} → ϒ(4*S*) → *B*^{0} ${\overline{B}}^{0}$, and with one of the neutral *B*'s decaying into *J*/*ψ K*^{0}. Because the ϒ(4*S*) is a definite quantum state with *C* = -1, the formulae given above have to modified somewhat. One important change is the definition of *t*. The time used here is the difference between the decay time of the *J*/*ψ K*^{0} and the other *B*^{0}, also called the "tagging" *B* because we need to determine its flavor, whether *B*^{0} or ${\overline{B}}^{0}$, in order to make the asymmetry measurement.

While we will not discuss flavor tagging in general, it is an important part of *CP* violation measurements. At the ϒ(4*S*) once such very useful tag is that of high momentum lepton as a *b*-quark, part of a ${\overline{B}}^{0}$ meson decays semileptonically into an ℓ^{-}, while a $\overline{b}$-quark decays into an *e*^{+}. Other flavor tagging observables include the charge of kaons and fast pions. Modern experiments combine these in a "neural network," to maximize their tagging efficiencies [49] (see also [50]). The BaBar data are shown in Figure 8, Belle has similar results.

The average value as determined by the Heavy Flavor Averaging Group is sin(2β) = 0.671 ± 0.024, where the dominant part of the error is statistical. No evidence is found for a non-zero $\mathcal{C}$ term with the measured average given as 0.005 ± 0.020.

Determining the sine of any angle gives a four-fold ambiguity in the angle. The decay mode *B*^{0} → *J*/*ψ K**^{0}, *K**^{0} → *K*_{
s
}*π*^{0} offers a way of measuring cos 2*β* and resolving the ambiguities.

This is a subtle analysis that we will not go into detail on [51, 52]. The result is that $\beta ={42.14}_{-1.83}^{+1.88}$ degrees.

#### 2.2.4 Measurements of *α*

The next state to discuss is the *CP* + eigenstate *f* ≡ *π*^{+} *π*^{-}. The simple spectator decay diagram is shown in Figure 9(a). For the moment we will assume that this is the only diagram, though the Penguin diagram shown in Figure 9(b) could also contribute; its presence can be inferred because it would induce a non-zero value for $\mathcal{C}$, the coefficient of the cosine term in Eq. 43. For this *b* → *u*$\overline{u}$*d* process we have

and

Im (*λ*) = Im (*e*^{-2i β}*e*^{-2i γ}) = Im (*e*^{2i α}) = - sin (2*α*

Time dependent *CP* violation measurements have been made by both the BaBar and Belle collaborations [53, 54]. Both groups find a non-zero value of both $\mathcal{C}$ and $\mathcal{S}$, though their values are not in particularly good agreement. The HFAG average is shown in Figure 10 along with the experimental results. The value of $\mathcal{C}$ clearly is not zero, thus demonstrating direct *CP* violation. (Historically, this was an important observation because it showed that *CP* violation could not be due to some kind of "superweak" model ala Wolfenstein [55].)

The non-zero value of $\mathcal{C}$ shows the presence of at least two amplitudes, presumably tree and penguin, in addition to the mixing amplitude, making extraction of *α* difficult. All is not lost, however. Gronau and London [56] showed that *α* could be extracted without theoretical error by doing a full isotopic spin analysis of the *π π* final state. The required measurements include: $\mathcal{C}$, $\mathcal{S}$, $\mathcal{B}$(*B*^{+} → *π*^{+}*π*^{0}), $\mathcal{B}$ (*B*^{0} → *π*^{0}*π*^{0}), and $\mathcal{B}$ (${\overline{B}}^{0}$ → *π*^{0}*π*^{0}). The last two items require a flavored tagged analysis that has not yet been done and whose prospects are bleak. Grossman and Quinn have showed, however, that an upper limit on Γ (*B*^{0} → *π*^{0}*π*^{0})/Γ(*B*^{+} → *π*^{+}*π*^{0}) can be used to limit the penguin shift to *α* [57–59]. Unfortunately, current data show a relative large $\mathcal{B}$(*B*^{0} → *π*^{0}*π*^{0}) = (1.55 ± 0.19) × 10^{-6} rate, compared with $\mathcal{B}$(*B*^{0} → *π*^{+} *π*^{-}) = (5.16 ± 0.22) × 10^{-6}, implying a large penguin contribution [17, 34], and the limit is very weak.

Use of the *ρ*^{+} *ρ*^{-} final state is in principle quite similar to *π*^{+} *π*^{-}, with some important caveats. First of all, it is a vector-vector final state and therefore could have both *CP* + and *CP*- components. It is however possible, doing a full angular analysis to measure the *CP* violating phase separately in each of these two amplitudes. The best method for this is in the "transversity" basis and will be discussed later [60] in Section 2.2.6. It is possible, however, for one polarization component to be dominant and then the angular analysis might not be necessary. In fact the longitudinal polarization is dominant in this case. BaBar measures the fraction as $0.992\pm {0.024}_{-0.013}^{+0.026}$[61], and Belle measures it as ${0.941}_{-0.040}^{+0.034}\pm 0.030$[62]. Thus we can treat this final state without worrying about the angular analysis and just apply a correction for the amount of transverse amplitude.

In addition, *ρ* mesons are wide, so non-*B* backgrounds could be a problem and even if the proper *B* is reconstructed, there are non-resonant and possible *a*_{1} *π* contributions.

Furthermore, it has been pointed out that the large *ρ* width could lead to the violation of the isospin constraints and this effect should be investigated [63]. The relevant branching ratios are given in Table 1.

Nevertheless, the small branching ratio for *ρ*^{0} *ρ*^{0}, if indeed it has been observed at all, shows that the effects of the Penguin diagram on the extracted value of sin(2*α*) are small, and this may indeed be a good way to extract a value of *α*. The time dependent decay rates separately for *B*^{0} → *ρ*^{+} *ρ*^{-} and ${\overline{B}}^{0}$ → *ρ*^{+} *ρ*^{-} and their difference are shown in Figure 11 from BaBar. In is interesting to compare these results with those in Figure 8. We see that the measured asymmetry in the *ρ*^{+} *ρ*^{-} decay more or less cancels that from *B*^{0} - ${\overline{B}}^{0}$ mixing inferring that sin(2*α*) is close to zero.

Results from the time dependent *CP* violation analysis from both Belle and BaBar are shown in Figure 12.

The data can be averaged and *α* determined by using the isospin analysis and the rates listed in Table 1. Unfortunately the precision of the data leads only to constraints. These have been determined by both the CKM fitter group [64] and the UT fit group [30]. These groups disagree in some cases. The CKM fitter group use a frequentist statistical approach, while UT fit uses a Bayseian approach. The basic difference is the that the Bayesian approach the theoretical errors are taken as having a Gaussian distribution. Here we show in Figure 13 the results from CKM fitter.

The final state *π*^{+} *π*^{-} *π*^{0} can also be used to extract *α*. Snyder and Quinn proposed that a Dalitz plot analysis of *B* → *ρ π* → *π*^{+} *π*^{-} *π*^{0} can be used to unravel both *α* and the relative penguin-tree phases [65]. The Dalitz plot for simulated events unaffected by detector acceptance is shown in Figure 14.

The task at hand is to do a time dependent analysis of the magnitude of the decay amplitudes and phases. The analyses of both collaborations allow for *ρ*(1450) and *ρ*(1700) contributions in addition to the *ρ*(770). There are a total of 26 free parameters in the fit. The statistics for such an analysis are not overwhelming: BaBar has about 2100 signal events [66], while Belle has about 1000 [67].

The results for the confidence levels of *α* found by both collaborations are shown in Figure 15. Note that there is also a mirror solution at *α* + 180°. The Belle collaboration uses their measured decay rates for *B*^{+} → *ρ*^{+} *π*^{0} and *B*^{+} → *ρ*^{0} *π*^{+} coupled with isospin relations [68, 69] to help constrain *α*.

The LHCb experiment expects to be able to significantly improve on the determination of *α* using the *ρ π* mode. They expect 14,000 events in for an integrated luminosity of 2 fb^{-1}, with a signal to background ratio greater than 1 [70]. It is expected that this amount of data can be accumulated in one to two years of normal LHC operation.

Combining all the data, both the CKM fitter and UT fit groups derive a the confidence level plot for *α* shown in Figure 16. There is a clear disagreement between the two groups, UT fit preferring a solution in the vicinity of 160°, and CKM fitter a value closer to 90°. The CKM fitter group believes that this is due to the UT fit group's use of Bayesian statistics which they criticize [71].

#### 2.2.5 The Angle *γ*

The angle $\gamma =\mathrm{arg}\left[-\frac{{V}_{ud}{V}_{ub}^{*}}{{V}_{cd}{V}_{cb}^{*}}\right]$. It can be viewed as the phase of the |*V*_{
ub
}| matrix element, with respect to |*V*_{
cb
}|. Interference measurements are required to determine phases. We can use neutral or even charged *B* decays to measure *γ*, and there are several ways to do this without using any theoretical assumptions, as is the case for *α* and *β*. The first relies on the color suppressed tree diagrams shown in Figure 17, and the second on using the interference in *B*_{
s
}mixing. At first glance, the diagrams in Figure 17 don't appear to have any interfering components. However, if we insist that the final state is common to *D*^{0} and ${\overline{D}}^{0}$, then we have two diagrams which differ in that one has a *b* → *c* amplitude and the other a *b* → *u* amplitude, the relative weak phase is then *γ*. Explicitly the amplitudes are defined as

where *r*_{
B
}reflects the amplitude suppression for the *b* → *u* mode and *δ*_{
B
}is the relative phase. We have not yet used identical final states for *D*^{0} and ${\overline{D}}^{0}$ decays, yet we can see that there will be an ambiguity in our result between *δ*_{
B
}and *γ* that could be resolved by using different *D*^{0} decay modes.

There are several suggestions as to different *D*^{0} decay modes to use. In the original paper on this topic Gronau and Wyler [72, 73] propose using *CP* eigenstates for the *D*^{0} decay, such as *K*^{+} *K*^{-}, *π*^{+} *π*^{-} etc.., combining with charge specific decays and comparing *B*^{-} with *B*^{+} decays. In the latter, the sign of the strong phase is flipped with respect to the weak phase. In fact modes such as *D**^{0} or *K**^{-} can also be used. (When using *D**^{0} there is a difference in *δ*_{
B
}of *π* between *γ D*^{0} and *π*^{0} *D*^{0} decay modes [74].)

It is convenient to define the follow variables:

These are related to the variables defined in Eq. 47 as

Measurements have been made by the BaBar, Belle and CDF collaborations [75–79]. These data, however, are not statistically powerful enough by themselves to give a useful measurement of *γ*. Atwood, Dunietz and Soni suggested using double-Cabibbo suppressed decays as an alternative means of generating the interference between decay amplitudes [80]. For example the final state *B*^{-} → *D*^{0} *K*^{-} can proceed via the tree level diagram in Figure 2(a) when the *W*^{-} → $\overline{u}$*s*. Then if we use the doubly-Cabibbo suppressed decay *D*^{0} → *K*^{+} *π*^{-}, we get a final state that interferes with the ${\overline{D}}^{0}$*K*^{-} final state with the ${\overline{D}}^{0}$ → *K*^{+} *π*^{-}, which is Cabibbo allowed. The advantage of this method, besides extending the number of useful final states, is that both amplitudes are closer to being equal than in the above method, which can generate larger interferences. Any doubly-Cabibbo suppressed decay can be used. Similar equations exist relating the measured parameters to *γ* and the strong phase shift.

Measurements have been made mostly using the *K*^{±} *π*^{∓} final state [81, 82]. Again, these attempts do not yet produce accurate results.

Thus far, the best method for measuring *γ* uses the three-body final state *K*_{
S
}*π*^{+} *π*^{-}. Since the final state is accessible by both *D*^{0} and ${\overline{D}}^{0}$ decays interference results. By its very nature a three-body state is complicated, and consists of several resonant and non-resonant parts. It is typically analyzed by examining the Dalitz plot where two of the possible three sub-masses (squared) are plotted versus one another [83]. In this way, the phase space is described as a uniform density and thus any resonant structure is clearly visible. For our particular case a practical method was suggested by Giri *et al* [84], where the decay is analyzed in separate regions of the Dalitz plot. Results from this analysis have been reported by BaBar [85] (they also include the mode *D*^{0} → *K*_{
S
}*K*^{+} *K*^{-}) and Belle [86]. BaBar finds *γ* = (76 ± 22 ± 5 ± 5)°, and Belle finds $\gamma ={({76}_{-13}^{+12}\pm 4\pm 5)}^{\circ}$. In both cases the first error is statistical, the second systematic, and the third refers to uncertainties caused by the Dalitz plot model. (There is also a mirror solution at ± 180°.)

It has been shown [87, 88] that measurement of the amplitude magnitudes and phases found in the decays of *ψ* (3770) → (CP ± Tag)(*K*_{
S
}*π*^{+} *π*^{-})_{
D
}provide useful information that help the narrow model error. The CLEO collaboration is working on such an analysis, and preliminary results have been reported [89]. (CLEO also is working on incorporating other *D*^{0} decay modes.)

Both the CKM fitter and UT fit groups have formed liklihoods for *γ* based on the measured results, as shown in Figure 18. In this case CKM fitter and UT fit agree on the general shape of the liklihood curve. The CKM fitter plot shows a small disagreement between the Daltiz method and a combination of the other two methods.

#### 2.2.6 The Angle *χ*

The angle *χ* shown in Figure 1 is the phase that appears in the box diagram for *B*_{
s
}mixing, similar to the diagram for *B*^{0} mixing shown in Figure 3, but with the *d* quark replaced by an *s* quark. The analogous mode to *B*^{0} → *J*/*ψ K*_{
s
}in the *B*_{
s
}system is *B*_{
s
}→ *J*/*ψ η*. The Feynman diagrams are shown in Figure 19. This is very similar to measuring *β* so *χ* is often called *β*_{
s
}. We also have a relation between *ϕ*_{
s
}defined in Eq. 33, *ϕ*_{
s
}= -2*χ*.

Since there are usually two photons present in the *η* decay, experiments at hadron colliders, which can perform time-dependent studies of *B*_{
s
}mesons, preferentially use the *J*/*ψ ϕ* final state. This, unfortunately, introduces another complexity into the problem; as the *B*_{
s
}is spinless the total angular momentum of the final state particles must be zero. For a decay into a vector and scalar, such as *J*/*ψ η*, this forces the vector *J*/*ψ* to be fully longitudinally polarized with respect to the decay axis. For a vector-vector final state both angular momentum state vectors are either longitudinal (L), both are transverse with linear polarization vectors parallel (||) or they are perpendicular (⊥) to one another [90]. Another way of viewing this is that a spin-0 *B* decay into two massive vector mesons can form *CP* even states with L = 0 or 2, and a *CP* odd state with L = 1. The relative populations in the two *CP* states are determined by strong interactions dynamics, but to study the weak phase here we are not particularly interested in the actual amount, unless of course one state dominated. We do not expect this to be the case however, since the SU(3) related decay *B*^{0} → *J*/*ψ K**^{0}, *K**^{0} → *K*^{+} *π*^{-} has a substantial components of both *CP* states; the PDG quotes gives the longitudinal fraction as (80 ± 8 ± 5)% [17].

The even and odd *CP* components can be disentangled by measuring the appropriate angular quantities of each event. Following Dighe *et al* [91], we can decompose the decay amplitude for a *B*_{
s
}as

where **ϵ**_{J/ψ}and **ϵ**_{
ϕ
}are polarization 3-vectors in the *J*/*ψ* rest frame, $\widehat{p}$ is a unit vector giving the direction of the *ϕ* momentum in the *J*/*ψ* rest frame, and *E*_{
ϕ
}is the energy of the *ϕ* in the *J*/*ψ* rest frame. We note that the corresponding amplitude for the ${\overline{B}}_{s}$ decay are ${\overline{A}}_{0}$ = *A*_{0}, ${\overline{A}}_{\left|\right|}$ = *A*_{||}, and ${\overline{A}}_{\perp}$ = -*A*_{⊥}. The amplitudes are normalized so that

*d*Γ(*B*_{
s
}→ *J*/*ψ ϕ*)/*dt* = |*A*_{0}|^{2} + |*A*_{||}|^{2} + |*A*_{⊥}|^{2}.

The *ϕ* meson direction in the *J*/*ψ* rest frame defines the $\widehat{x}$ direction. The $\widehat{z}$ direction is perpendicular to the decay plane of the *K*^{+} *K*^{-} system, where *p*_{
y
}(*K*^{+}) ≥ 0. The decay direction of the ℓ^{+} in the *J*/*ψ* rest frame is described by the angles (*θ*, *ϕ*). The angle *ψ* is that formed by the *K*^{+} direction with the $\widehat{x}$-axis in the *ϕ* rest frame. Figure 20 shows the angles.

The decay width can be written as

The decay rate for ${\overline{B}}_{s}$ can be found by replacing *A*_{⊥} in the above expression with -*A*_{⊥}.

Another complexity arises from the expectation that the width difference ΔΓ_{
s
}/Γ_{
s
}≈ 15%. This complicates the time dependent rate equations. For convenience, setting $\overrightarrow{\rho}$ ≡ (cos *θ*, *ϕ*, cos*ψ*), we have for the decay width for *B*_{
s
}:

where

The quantities *δ*_{⊥} and *δ*_{||} are the strong phases of *A*_{⊥} and *A*_{||} relative to *A*_{0}, respectively [92]. The expression for ${\overline{B}}_{s}$ mesons can be found by substituting ${\mathcal{U}}_{+}$ → ${\mathcal{U}}_{-}$ and ${\mathcal{V}}_{+}$ → ${\mathcal{V}}_{-}$.

The most interesting quantities to be extracted from the data are *χ* and ΔΓ. There are many experimental challenges: the angular and lifetime distributions must be corrected for experimental acceptances; flavor tagging efficiencies and dilutions must be evaluated; backgrounds must be measured. Both the CDF [93] and D0 [94] experiments have done this complicated analysis. Updated results as of this writing are summarized by the CKM fitter derived limits shown in Figure 21.

The Standard Model allowed region is a very thin vertical band centered near zero at *ϕ*_{
s
}of -0.036 ± 0.002 (shown in red). (Recall *ϕ*_{
s
}≡ -2*χ* ≡ -2*β*_{
s
}.) The region labeled "all" (green) shows the allowed region at 68% confidence level. Although the fit uses several input components besides the *CP* asymmetry measurements in *B*_{
s
}→ *J*/*ψ ϕ*, including use of measured total widths introduced via the constraint equation $\Delta {\Gamma}_{s}=\mathrm{cos}{\varphi}_{s}\Delta {\Gamma}_{s}^{SM}$, it is the measurement of *ϕ*_{
s
}that dominates the fit result. We see that there is a discrepancy that may be as large as 2.7 standard deviations [95, 96]. While this is as not yet significant, it is very tantalizing. The LHCb experiment plans to vastly improve this measurement [97].

It has been pointed out, however, that there is likely an S-wave *K*^{+} *K*^{-} contribution in the region of the *ϕ* that contributes 5–10% of the event rate as estimated using ${D}_{s}^{+}$ decays [98]. For example, analysis of the analogous channel *B*^{0} → *J*/*ψ K**^{0} reveals about 8% S-wave in the *K*_{
π
}system under the *K**^{0}, and BaBar has used this to extract a value for cos 2*β* thus removing an ambiguity in *β* [99]. The S-wave amplitude and phase needs to be added to Eq. 53. Note that the errors will increase due to the addition of another amplitude and phase. The S-wave can manifest itself as a *π*^{+} *π*^{-}, so it is suggested that the decay *B*_{
s
}→ *J*/*ψ f*_{0} (980) be used to measure *χ*; here the *f*_{0} → *π*^{+} *π*^{-} [98]; the estimate is that the useful *f*_{0} rate would be about 20% of the *ϕ* rate, but the *J*/*ψ f*_{0} is a *CP* eigenstate, so an an angular analysis is unnecessary, and these events may provide a determination of *χ* with an error comparable to that using *J*/*ψ ϕ*.

#### 2.2.7 Measurements of Direct CP Violation in *B* → *K π*Decays

Time integrated asymmetries of *B* mesons produced at the ϒ(4S) resonance can only be due to direct *CP* asymmetry, as the mixing generated asymmetry must integrate to zero due to the fact that the initial state has *J*^{PC}= 1^{--}. The first evidence for such direct *CP* violation at the greater than four standard deviation level in the *K*^{∓} *π*^{±} final state was given by BaBar [100]. The latest BaBar result is [101]

showing a large statistical significance.

This result was confirmed by the Belle collaboration, but Belle also measured the isospin conjugate mode. Consider the two-body decays of *B* mesons into a kaon and a pion, shown in Figure 22. For netural and charged decays, it can proceed via a tree level diagram (a) or a Penguin diagram (b). There are two additional decay diagrams allowed for the *B*^{-}, the color-suppressed tree level diagram (c) and the elusive "Electroweak" Penguin diagram in (d). (So named because of the intermediate *γ* or *Z* boson.) Since it is expected that diagrams (c) and (d) are small, the direct *CP* violating asymmetries in both charged and neutral modes should be the same. Yet Belle observed [102]

The Belle data are shown in Figure 23.

The difference between ${\mathcal{A}}_{{K}^{-}{\pi}^{+}}$ and ${\mathcal{A}}_{{K}^{\mp}{\pi}^{0}}$ is not naively expected in the Standard Model and Belle suggests that this may be a sign of New Physics. Peskin commented on this possibility [103].

### 2.3 Conclusions from *CP*Violation Measurements

All *CP* violation in the quark sector is proportional to the parameter *η* in Eq. 1. In fact all *CP* asymmetries are proportional to the "Jarlskog Invariant", *J* = *A*^{2} *λ*^{6} *η*, which represents the equal area of all the CKM triangles [104]. Since we know the value of these three numbers, we do know the amount of *CP* violation we can expect, even without making the measurements. We also can estimate the amount of *CP* violation necessary using cosmology. To reproduce the observed baryon to entropy ratio requires many orders of magnitude more *CP* violation than thus far found in heavy quark decays [105]. Thus we believe there are new sources of *CP* violation that have not yet been found.

## 3 The CKM parameter |*V*_{
cb
}|

There are two experimental methods to determine |*V*_{
cb
}|: the **exclusive** method, where |*V*_{
cb
}| is extracted by studying the exclusive $\overline{B}$ → *D*^{(*)} ℓ^{-}$\overline{\nu}$ decay process; and the **inclusive** method, which uses the semileptonic decay width of *b*-hadron decays. In both methods, the extraction of |*V*_{
cb
}| is systematics limited and the dominant errors are from theory. The inclusive and exclusive determinations of |*V*_{
cb
}| rely on different theoretical calculations of the hadronic matrix element needed to extract it from measured quantities, and make use of different techniques which, to a large extent, have uncorrelated experimental uncertainties. Thus, the comparison between inclusive and exclusive decays allows us to test our understanding of hadronic effects in semileptonic decays. The latest determinations differ by more than 2*σ*, with the inclusive method having a stated error half of the size of the exclusive one.

### 3.0.1 Beauty Quark Mass Definitions

Due to confinement and the non-perturbative aspect of the strong interaction, the concept of the quark masses cannot be tied to an intuitive picture of the rest mass of a particle, as for leptons. Rather, quark masses must be considered as couplings of the SM Lagrangian that have to be determined from processes that depend on them. As such the *b*-quark mass (*m*_{
b
}) is a scheme-dependent, renormalised quantity.

In principle, any renormalisation scheme or definition of quark masses is possible. In the framework of QCD perturbation theory the difference between two mass schemes can be determined as a series in powers of *α*_{
s
}. Therefore, higher-order terms in the perturbative expansion of a quantity that depends on quark masses are affected by the particular scheme employed. There are schemes that are more appropriate and more convenient for some purposes than others. Here we examine the main quark mass definitions commonly used in the description of *B* decays.

• **Pole mass:** The pole mass definition is gauge-invariant and infrared-safe [106–108] to all orders in perturbation theory and has been used as the standard mass definition of many perturbative computations in the past. By construction, it is directly related to the concept of the mass of a free quark. The presence of a renormalon ambiguity [109, 110] makes the numerical value of the pole mass an order-dependent quantity, leading to large perturbative corrections for Heavy Quark Effective Theory (HQET) parameters (see below for a discussion of HQET). These shortcomings are avoided by using quark mass definitions that reduce the infrared sensitivity by removing the Λ_{QCD} renormalon of the pole mass. Such quark mass definitions are generically called "short-distance" masses.

• $\overline{MS}$**mass:** The most common short-distance mass definition is the $\overline{MS}$ mass ${\overline{m}}_{b}$ (*μ*) [111–113], where the scale *μ* is typically chosen to be the order of the characteristic energy scale of the process. In the $\overline{MS}$ scheme the subtracted divergencies do not contain any infrared sensitive terms and the $\overline{MS}$ mass is only sensitive to scales larger than *m*_{
b
}. The $\overline{MS}$ mass arises naturally in processes where the *b*-quark is far off-shell, but it is less adequate when the *b*-quark has non-relativistic energies.

• **Kinetic mass:** The shortcomings of the pole and the $\overline{MS}$ mass in describing non-relativistic *b*-quarks can be resolved by so-called threshold masses [114], that are free of an ambiguity of order Λ_{QCD} and are defined through subtractions that contain universal contributions for the dynamics of non-relativistic quarks. Since the subtractions are not unique, an arbitrary number of threshold masses can be constructed. The kinetic mass is defined as [115, 116]:

where *μ*_{kin} is the nominal kinetic mass renormalisation scale. For *μ*_{kin} → 0 the kinetic mass reduces to the pole mass.

• 1*S* **mass:** The kinetic mass depends on an explicit subtraction scale to remove the universal infrared sensitive contributions associated with the non-relativistic *b*-quark dynamics. The 1*S* mass [117, 118] achieves the same task without a factorisation scale, since it is directly related to a physical quantity. The *b*-quark 1*S* mass is defined as half of the perturbative contribution to the mass of the ϒ(*S*_{1}) in the limit *m*_{
b
}≫ *m*_{
b
}*v* ≫ *m*_{
b
}*v*^{2} ≫ Λ_{QCD}.

A list of *b*-quark mass determinations, converted into the $\overline{MS}$ mass scheme is shown in Table 2.

### 3.1 Determination Based on Exclusive Semileptonic *B*Decays

The exclusive |*V*_{
cb
}| determination is obtained by studying the decays *B* → *D** ℓ*ν* and *B* → *D* ℓ*ν*, where ℓ denotes either an electron or a muon. The exclusive measurements of a single hadronic final state, e.g. the ground state *D* or *D**, restrict the dynamics of the process. The remaining degrees of freedom, usually connected to different helicity states of the charmed hadron, can be expressed in terms of form factors, depending on the invariant mass of the lepton-*ν* pair, *q*^{2}. The shapes of those form factors are unknown but can be measured. However, the overall normalization of these functions needs to be determined from theoretical calculations.

Isgur and Wise formulated a theoretical breakthrough in the late 1980's. They found that in the limit of an infinitely heavy quark masses QCD possess additional flavor and spin symmetries. They showed that since the heavier *b* and *c* quarks have masses much heavier than the scale of the QCD coupling constant, they are heavy enough to posses this symmetry but that corrections for the fact that the quark mass was not infinite had to be made. They showed there was a systematic method of making these corrections by expanding a series in terms of the inverse quark mass [119]. This theory is known as Heavy Quark Effective Theory (HQET).

When studying *b* quark decays into *c* quarks, it is convenient to view the process in four-velocity transfer (*w*) space as opposed to four-momentum transfer space, because at maximum four-velocity transfer, where *w* equals one, the form-factor in lowest order is unity, i.e. the *b* transforms directly into a *c* quark without any velocity change. The value of |*V*_{
cb
}| can be extracted by studying the decay rate for the process ${\overline{B}}^{0}$ → *D** + ℓ^{-} $\overline{\nu}$_{ℓ} as a function of the recoil kinematics of the *D**^{+} meson. Specifically, HQET predicts that:

where *w* is the product of the four-velocities of the *D**^{+} and the ${\overline{B}}^{0}$ mesons, related to *q*^{2}. $\mathcal{K}$(*w*) is a known phase space factor and the form factor $\mathcal{F}$(*w*) is generally expressed as the product of a normalization factor and a shape function described by three form factors, constrained by dispersion relations [120]:

where $r={m}_{{D}^{\ast}}/{m}_{B},z=\frac{\sqrt{w+1}-\sqrt{2}}{\sqrt{w+1}+\sqrt{2}}$; The linear slope of the form-factor is given by the parameter *ρ*^{2}, and must be determined from the data. In the infinite mass limit, $\mathcal{F}$(*w* = 1) = ${h}_{{A}_{1}}$ (*w* = 1) = 1; for finite quark masses, non-perturbative effects can be expressed in powers of 1/*m*_{
Q
}. There are several different corrections to the infinite mass value $\mathcal{F}$(1) = 1:

Note that the first term in the non-perturbative expansion in powers of 1/*m*_{
Q
}vanishes [121]. QED corrections up to leading logarithmic order give *η*_{
A
}= 0.960 ± 0.007. Different estimates of the 1/${m}_{Q}^{2}$ corrections, involving terms proportional to 1/${m}_{b}^{2}$, 1/${m}_{c}^{2}$ and 1/(*m*_{
b
}*m*_{
c
}), have been performed in a quark model with QCD sum rules, and, more recently, with an HQET based lattice gauge calculation. The best estimate comes from lattice QCD, ${h}_{{A}_{1}}$ (1) = $\mathcal{F}$ (1) = 0.921 ± 0.013 ± 0.020 [122]. This result does not include a 0.7% QED correction.

Since the phase-space factor $\mathcal{K}$(*w*) tends to zero as *w* → 1, the decay rate vanishes and the accuracy of the |*V*_{
cb
}| value extracted with this method depends upon experimental and theoretical uncertainties in the extrapolation. Experiments determine the product |$\mathcal{F}$(1)·*V*_{
cb
}|^{2} by fitting the measured *d*Γ/*dw* distribution.

This decay has been analyzed by CLEO, BaBar and Belle using *B* mesons from the ϒ(4S) decay, and by ALEPH, DELPHI, and OPAL at the Z^{0} center of mass energy. Experiments that exploit the ϒ(4S) have the advantage that *w* resolution is quite good. However, they suffer from lower statistics near *w* = 1 in the decay *B* → *D**^{+} ℓ*ν* due to the lower reconstruction efficiency of the slow *π*^{±}. On the other hand, the decay *B* → *D**^{0} ℓ*ν* is not affected by this problem [123]. In addition, kinematic constraints enable these experiments to identify the final state including *D** without large contamination from the poorly known portion of semileptonic decays with a hadronic system recoiling against the lepton-*ν* pair with masses higher than the *D* and *D**, commonly identified as '*D***'. *B*-factories and CLEO fit for the signal and background components in the distribution of the cosine of the angle between the direction of the *B* and the direction of the *D**ℓ system. At LEP, *B* mesons are produced with a large variable momentum (about 30 GeV on average), giving a relatively poor *w* resolution and limited physics background rejection capabilities. By contrast, LEP experiments benefit from an efficiency that is only mildly dependent upon *w*.

LEP experiments extracted |*V*_{
cb
}| by performing a two-parameter fit, for $\mathcal{F}$(1)|*V*_{
cb
}| and the slope *ρ*^{2}. The first measurements of both ratio *R*_{1} and *R*_{2}, and *ρ*^{2}, were made by the CLEO collaboration [124]. Belle [125] and BaBar [126] improved upon these measurements using the ${\overline{B}}^{0}$ → *D**^{+} *e*^{-} $\overline{\nu}$_{
e
}decay. They determined *R*_{1}, *R*_{2}, and *ρ*^{2} using an unbinned maximum likelihood fit to the full decay distribution. BaBar and CLEO results are combined to give: *R*_{1} = 1.396 ± 0.060 ± 0.035 ± 0.027, *R*_{2} = 0.885 ± 0.040 ± 0.022 ± 0.013, and *ρ*^{2} = 1.145 ± 0.059 ± 0.030 ± 0.035. The stated uncertainties are the statistical from the data and systematic uncertainty, respectively.

Table 3 summarizes the available data. Values of $\mathcal{F}$(1)|*V*_{
cb
}| from different experiments can be combined if they are extracted using the same $\mathcal{F}$(*w*) parametrization. All measurements included in the exclusive |*V*_{
cb
}| world average relying on the above form factor ratios *R*_{1} and *R*_{2}. Using the $\mathcal{F}$(1)|*V*_{
cb
}| world average in Table 3 and ${h}_{{A}_{1}}$ (1) = 0.921 ± 0.013 ± 0.020 [122], we find

|*V*_{
cb
}| = (38.2 ± 0.5_{
exp
}± 1.0_{
theo
}) × 10^{-3},

where the dominant error is theoretical, and it will be difficult to improve upon.

The study of the decay *B* → *D* ℓ*ν* poses new challenges from the experimental point of view. The differential decay rate for *B* → *D* ℓ*ν* can be expressed as [127, 128]

where ${\mathcal{K}}_{\mathcal{D}}$ (*w*) is the phase space factor and the form factor $\mathcal{G}$ (*w*) is generally expressed as the product of a normalization factor $\mathcal{G}$ (1) and a function, *g*_{
D
}(*w*), constrained by dispersion relations [120]. The strategy to extract $\mathcal{G}$ (1)|V_{cb}| is identical to that used for the *B* → *D*⋆ℓ*ν* decay. However, *d*Γ_{
D
}/*dw* is more heavily suppressed near *w* = 1 than $d{\Gamma}_{D*}/dw$ due to the helicity mismatch between initial and final states. Moreover, this channel is much more challenging to isolate from the dominant background *B* → *D*⋆ℓ*ν* as well as from fake *D*-ℓ combinations. Table 4 shows the results of two-dimensional fits to |*V*_{
cb
}|$\mathcal{G}$(1) and *ρ*^{2} for different experiments and the world average.

In the limit of infinite quark masses, $\mathcal{G}$(*w* = 1) coincides with the Isgur-Wise function [129]. In this case there is no suppression of 1/*m*_{
Q
}corrections and QCD effects on $\mathcal{G}$(1) are calculated with less accuracy than $\mathcal{F}$(1). Corrections to this prediction have recently been calculated with improved precision, based on unquenched lattice QCD [130], specifically $\mathcal{G}$(1) = 1.074 ± 0.018 ± 0.016. Using this result we get

|*V*_{
cb
}| = (39.5 ± 1.4_{
exp
}± 0.9_{
theo
}) × 10^{-3}

consistent with the value extracted from *B* → *D*⋆ℓ*ν* decay, but with an experimental uncertainty about twice as large.

BaBar recently has also studied the differential decay widths for the decays *B*^{-} → *D*^{0} ℓ*ν* and *B*^{-} → *D*⋆^{0}ℓ*ν* to extract the ratio $\mathcal{G}$(1)/$\mathcal{F}$(1) = 1.23 ± 0.09 [131], compatible with the lattice theory prediction of 1.16 ± 0.04.

### 3.2 *B* → *D***ℓ*ν*Decays

It is important to understand the composition of the inclusive *B* semileptonic decay rate in terms of exclusive final states for use in semileptonic *B* decay analyses. The *B* → *D*^{(*)}ℓ*ν* decays are well measured, but a sizeable fraction of semileptonic *B* decay are to *D***ℓ*ν*. The *D*** resonances have larger masses than the *D** and not well studied. (*D*** refers to the resonant states, ${D}_{0}^{\ast}$, ${D}_{1}^{\ast}$, ${D}_{2}^{\ast}$ and *D*_{1}.) The measurements of these branching fractions require a good understanding of the different *D*^{(*)}*nπ* systems, which can be either resonant or non-resonant, each with characteristic decay properties.

There are four orbitally excited states with L = 1. They can be grouped in two pairs according to the value of the spin on the light system, *j* = L ± 1/2 (L = 1). States with *j* = 3/2 can have J^{P} = 1^{+} and 2^{+}. The 1^{+} state decays only through *D***π*, and the 2^{+} through *D π* or *D***π*. Parity and angular momentum conservation imply that in the 2^{+} the *D** and *π* are in a D wave but allow both S and D waves in the 1^{+} state. However, if the heavy quark spin is assumed to decouple, conservation of *j* = 3/2 forbids S waves even in the 1^{+} state. A large D-wave component and the fact that the masses of these states are not far from threshold imply that the *j* = 3/2 states are narrow. These states have been observed with a typical width of 20 MeV/*c*^{2}. On the contrary, *j* = 1/2 states can have J^{P} = 0^{+}and 1^{+}, so they are expected to decay mainly through an S wave and manifest as broad resonances, with typical widths of several hundred MeV/*c*^{2}.

The ALEPH [132], CLEO [133], DELPHI [132], and D0 [134] experiments have reported evidence of the narrow resonant states (*D*_{1} and ${D}_{2}^{\ast}$) in semileptonic decays, whereas more recent measurements by the BaBar [135] and Belle [136] experiments provide semi-inclusive measurements to *D*^{(*)}*π* ℓ*ν* final states [137, 138].

The differences between the measured inclusive semileptonic branching fraction and the sum of all exclusive *B* semileptonic measurements for Belle, BaBar and World averages are given in Table 5 for *B*^{0} and *B*^{-} decays. In the case where multiple measurements exist, only the most precise measurements have been used in the BaBar and Belle columns, i.e. no attempt at an average is made. In all cases the sum of the exclusive components does not saturate the *B* semileptonic rate.

All measured rates for the *D*** narrow states are in good agreement. Experimental results seem to point towards a larger rate for broader states. If it is due mainly to ${{D}^{\prime}}_{1}$ and ${D}_{0}^{\ast}$ decay channels, these results disagree with the prediction of QCD sum rules. However, Belle set an upper limit for the ${{D}^{\prime}}_{1}$ channel below the rate measured by BaBar and DELPHI. More measurements need to be performed to elucidate this puzzle.

### 3.3 Determination Based on Inclusive Semileptonic *B*Decays

Inclusive determinations of |*V*_{
cb
}| are obtained using combined fits to inclusive *B* decay distributions [139, 140]. These determinations are based on calculations of the semileptonic decay rate in the frameworks of the Operator Product Expansion (OPE) [141] and HQET [139, 142]. They predict the semileptonic decay rate in terms of |*V*_{
cb
}|, the *b*-quark mass *m*_{
b
}, and non-perturbative matrix elements. The spectator model decay rate is the leading term in a well-defined expansion controlled by the parameter Λ_{QCD}/*m*_{
b
}[142–146] with non-perturbative corrections arising to order 1/${m}_{b}^{2}$. The key issue in this approach is the ability to separate perturbative and non-perturbative corrections (expressed in powers of *α*_{
s
}). Thus the accuracy of an inclusive determination of |*V*_{
cb
}| is limited by our knowledge of the heavy quark nonperturbative matrix elements and the *b* quark mass. It also relies upon the assumption of local quark hadron duality. This is the statement that a hadronic matrix element can be related pointwise to a theoretical expression formulated in terms of quark and gluon variables.

Perturbative and non-perturbative corrections depend on the *m*_{
b
}definition, i.e. the non-perturbative expansion scheme, as well as the non-perturbative matrix elements that enter the expansion. In order to determine these parameters, Heavy Quark Expansions (HQE) [139, 146, 147] express the semileptonic decay width Γ_{SL}, moments of the lepton energy and hadron mass spectra in *B* → *X*_{
c
}ℓ*ν* decays in terms of the running kinetic quark masses ${m}_{b}^{\text{kin}}$ and ${m}_{c}^{\text{kin}}$ as well as the *b*-quark mass ${m}_{b}^{1\text{S}}$ in the 1S expansion scheme. These schemes should ultimately yield consistent results for |*V*_{
cb
}|. The precision of the *b*-quark mass is also important for |*V*_{
ub
}|.

The shape of the lepton spectrum and of the hadronic mass spectrum provide constraints on the heavy quark expansion, which allows for the calculation of the properties of *B* → *X*_{
c
}ℓ*ν* transitions. So far, measurements of the hadronic mass distribution and the leptonic spectrum have been made by BaBar [148], Belle [149], CLEO [150, 151], DELPHI [152]. CDF [153] provides only the measurement of the hadronic mass spectrum with a lepton momentum cut of 0.6 GeV in the *B* rest frame.

The inclusive semileptonic width can be expressed as

where $\rho ={m}_{c}^{2}/{m}_{b}^{2}$, and ${\mu}_{\pi}^{2}$, ${\mu}_{G}^{2}$, *ρ*_{
D
}and *ρ*_{
LS
}are non-perturbative matrix elements of local operators in HQET. To make use of equations such as Eq. 61, the values of the non-perturbative expansion parameters must be determined experimentally. Although some of these parameters are related to the mass splitting of pseudoscalar and vector meson states, most non-perturbative parameters are not so easily obtained. Measurements of the moments of different kinematic distributions in inclusive *B*-decays are used to gain access to these parameters. The first moment of a distribution is given by:

corresponding to the mean. Subsequent (central) moments are calculated around the first moment,

corresponding to a distribution's width, kurtosis and so on.

A $\overline{B}$ → *X*_{
c
}ℓ$\overline{\nu}$ decay observable calculated with the OPE is a double expansion in terms of the strong coupling *α*_{
s
}(*m*_{
b
}) and the ratio Λ_{QCD}/*m*_{
b
}. Observables are typically calculated with the cuts used in the experimental determination for background suppression, and enhanced sensitivity such as the lepton energy cut. The spectral moments are defined as:

where

The measurement of |*V*_{
cb
}| from inclusive decays requires that these decays be adequately described by the OPE formalism. The motivation of the moment approach is to exploit the degree of experimental and theoretical understanding of each moment and for different ${\widehat{E}}_{\ell}^{\mathrm{min}}$, directly examining the dependence of the various coefficient functions *f*_{
i
}on these terms. The possibility of deviations from the OPE predictions due to quark hadron duality violations have been raised [154]. To compare the OPE predictions with data, one also has to define how uncertainties from 1/${m}_{b}^{3}$ corrections are estimated. These uncertainties are hard to quantify reliably, as the only information on the size of the matrix elements of the dimension six operators comes from dimensional analysis. The only method to check the reliability of |*V*_{
cb
}| extraction from inclusive semileptonic decays is how well the OPE fits the data.

To compare with theoretical predictions, the moments are measured with a well defined cut on the lepton momentum in the *B* rest frame. The measured hadronic mass distribution and lepton energy spectrum are affected by a variety of experimental factors such as detector resolution, accessible phase space, radiative effects. It is particularly important to measure the largest fraction of the accessible phase space in order to reduce both theoretical and experimental uncertainties. Each experiment has focused on lowering the lepton energy cut.

The hadronic mass spectrum in *B* → *X*_{
c
}ℓ*ν* decays can be split into three contributions corresponding to *D*, *D**, and *D***, where *D*** here stands for any neutral charmed state, resonant or not, other than *D* and *D**. Belle [149], BaBar [148] and CLEO [150] explored the moments of the hadronic mass spectrum ${M}_{X}^{2}$ as a function of the lepton momentum cuts. CLEO performs a fit for the contributions of signal and background to the full three-dimensional differential decay rate distribution as a function of the reconstructed quantities *q*^{2}, ${M}_{X}^{2}$, cos *θ*_{ℓ}. Belle [155, 156] and BaBar [148] use a sample where one of the *B* mesons, produced in pairs from ϒ(4*S*) decays, is fully reconstructed and the signal side is tagged by a well identified lepton. The 4-momentum *p*_{
X
}of the hadronic system *X*, recoiling against the lepton and neutrino, is determined by summing the 4-momenta of the remaining charged tracks and unmatched clusters. Belle reconstructs the full hadronic mass spectrum and measures the first, second central and second non-central moments of the unfolded ${M}_{X}^{2}$ spectrum in *B* → *X*_{
c
}ℓ*ν*, for lepton energy thresholds, *E*_{min}, varying from 0.7 to 1.9 GeV [149]. BaBar extracts the moments from the measured distributions using a calibration curve derived from Monte Carlo data, with a minimum momentum for the electron in the *B* meson rest frame of 0.9 GeV. The latest BaBar analysis [149] measures the first, second central and second non-central moments of the ${M}_{X}^{2}$ spectrum for *E*_{min} from 0.9 to 1.9 GeV. The main systematic errors originate from background estimation, unfolding and signal model dependence.

DELPHI follows a different approach in extracting the moments, measuring the invariant mass distribution of the *D*** component only and fixing the *D* and *D** components. DELPHI measures the first moment with respect to the spin averaged mass of *D* and *D**. At LEP *b*-quarks were created with an energy of approximately 30 GeV allowing the measurement of the hadronic mass moments without a cut on the lepton energy [152].

The shape of the lepton spectrum provides further constraints on the OPE. These measurements are sensitive to higher order OPE parameters and are considerably more precise experimentally. Moments of the lepton momentum with a cut *p*_{ℓ} ≥ 1.0 GeV/c have been measured by the CLEO collaboration [157]. BaBar [148] extract up to the third moment of this distribution, using a low momentum cut of *p*_{ℓ} ≥ 0.6 GeV/c. Both BaBar and CLEO use dilepton samples. The most recent measurement of the electron energy spectrum is from Belle [155, 156]. Events are selected by fully reconstructing one of the *B* mesons, produced in pairs from ϒ(4*S*) decays and it determines the true electron energy spectrum by unfolding [158] the measured spectrum in the *B* meson rest frame. Belle measures *B*^{0} and *B*^{+} weighted average partial branching fractions $\mathcal{B}{(B\to {X}_{c}\ell \nu )}_{{E}_{\ell}>{E}_{\mathrm{min}}}$ and the first four moments of the electron energy spectrum in *B* → *X*_{
c
}*e ν*, for *E*_{min} from 0.4 to 2.0 GeV [159]. All the lepton moment measurements are consistent with theory and with the moment of the hadronic and *b* → *s γ* photon energy spectrum. Hadronic and lepton energy measurements are consistent within their errors. When compared with theory there is no sign of inconsistencies. The B-factories provide the most precise HQE parameter estimates.

#### 3.3.1 HQE Parameters

Using the moment measurements described above, it is possible to determine the CKM matrix element |*V*_{
cb
}| and HQE parameters by performing global fits in the kinetic and 1S *b*-quark mass schemes [160]. The photon energy moments in *B* → *X*_{
s
}*γ* decays [161] are also included in order to constrain the *b*-quark mass more precisely. Measurements that are not matched by theoretical predictions and those with high cutoff energies are excluded (i.e. semileptonic moments with *E*_{min} > 1.5 GeV and photon energy moments with *E*_{min} > 2 GeV). The results are preliminary.

The inclusive spectral moments of *B* → *X*_{
c
}ℓ*ν* decays have been derived in the 1S scheme up to $\mathcal{B}{(B\to {X}_{c}\ell \nu )}_{{E}_{\ell}>{E}_{\mathrm{min}}}$(1/${m}_{b}^{3}$) [139]. The theoretical expressions for the truncated moments are given in terms of HQE parameters with coefficients determined by theory, as functions of *E*_{min}. The non-perturbative corrections are parametrized in terms of the following non-perturbative parameters: Λ at $\mathcal{O}$(*m*_{
b
}), *λ*_{1} and *λ*_{2} at $\mathcal{O}$(1/${m}_{b}^{2}$), and *τ*_{1}, *τ*_{2}, *τ*_{3}, *τ*_{4}, *ρ*_{1} and *ρ*_{2} at $\mathcal{O}$(1/${m}_{b}^{3}$). In Table 6 one finds the following results for the fit parameters [160]. The first error is from the fit including experimental and theory errors, and the second error (on |*V*_{
cb
}| only) is due to the uncertainty on the average *B* lifetime. If the same fit is performed to all measured moments of inclusive distributions in *B* → *X*_{
c
}ℓ*ν* and *B* → *s γ* decays, the |*V*_{
cb
}| and *m*_{
b
}values obtained are in Table 6. The fit results for |*V*_{
cb
}| and ${m}_{b}^{1\text{S}}$ to *B* → *X*_{
c
}ℓ*ν* data only and *B* → *X*_{
c
}ℓ*ν* and *B* → *X*_{
s
}*γ* data combined are displayed in Fig. 24.

Spectral moments of *B* → *X*_{
c
}ℓ*ν* decays have been derived up to $\mathcal{O}$(1/${m}_{b}^{3}$) in the kinetic scheme [146]. The theoretical expressions used in the fit contain improved calculations of the perturbative corrections to the lepton energy moments [116] and account for the *E*_{min} dependence of the perturbative corrections to the hadronic mass moments [162]. For the *B* → *X*_{
s
}*γ* moments, the (biased) OPE prediction and the bias correction have been calculated [147]. All these expressions depend on the *b*- and *c*-quark masses *m*_{
b
}(*μ*) and *m*_{
c
}(*μ*), the non-perturbative parameters, defined at the scale *μ* = 1 GeV: ${\mu}_{\pi}^{2}$ (*μ*) and ${\mu}_{G}^{2}$ (*μ*) ($\mathcal{O}$(1/${m}_{b}^{2}$)), ${\tilde{\rho}}_{D}^{3}$(*μ*) and ${\rho}_{LS}^{3}$(*μ*) ($\mathcal{O}$(1/${m}_{b}^{3}$)), and *α*_{
s
}. The CKM element |*V*_{
cb
}| is a free parameter in the fit, related to the semileptonic width Γ (*B* → *X*_{
c
}ℓ*ν*) [142]. The fit results for various inputs and correlations are shown in Table 7.

All the measured moments of inclusive distributions in *B* → *X*_{
c
}ℓ*ν* and *B* → *s γ* decays are used in a fit to extract |*V*_{
cb
}| and the *b* and *c* quark masses. The |*V*_{
cb
}| and *m*_{
b
}values obtained are listed in the Table 7. The default fit also gives *m*_{
c
}= 1.16 ± 0.05 GeV, and ${\mu}_{G}^{2}$ = 0.27 ± 0.04 GeV^{2}. The errors are experimental and theoretical (HQE and Γ_{SL}) respectively. In this fit the following variations were considered when in the extraction of the HQ parameters, ± 20 MeV for the *b* and *c* quark masses, ± 20% for ${\mu}_{p}^{2}i$ and ${\mu}_{G}^{2}$, ± 30% for the 3^{rd}order non perturbative terms and *α*_{2} = 0.22 ± 0.04 for the perturbative corrections. The bias corrections uncertainties for *B* → *s γ* were varied by the full amount of their value.

There are open issues relating to the global fits. First of all, the *χ*^{2}/n.d.f. are very small, pointing to an underestimate in the theoretical correlations. In a recent study [163], the theoretical correlations used in the fit were scrutinized, and new correlation coefficients were derived from the theory expressions using a "toy Monte Carlo" approach, showing that the theoretical correlations were largely underestimated. The result of this new fit is shown in Table 7. The second issue is related the size of the theoretical error. Recently the NNLO full two-loop calculations become available [164–166]. In the Kinetic scheme NNLO calculations include an estimate of the non-BLM terms and lead to a roughly 0.6 % reduction of the |*V*_{
cb
}| value -0.25 × 10^{-3}. In the 1S scheme the shift on |*V*_{
cb
}| is of about -0.14 × 10^{-3} [167]. From the new power corrections at NLO we expect the chromo-magnetic corrections to be more important as the tree level corrections are more important, and a change of about 20–30% in the extracted value of ${\mu}_{\pi}^{2}$ in the pole expansion (may be less with the other schemes) [168].

HQE has been carried out up to 1/${m}_{b}^{4}$ and the effects are expected to be of the order *δ*^{(4)}Γ/Γ ≈ 0.25% [169]. All these newly calculated results can be used to scrutinize the earlier error estimates. In the kinetic scheme, the full NNLO value for *A*_{
pert
}= 0.919, which is in good agreement with *A*_{
pert
}= Γ (*B* → *X*_{
c
}ℓ*ν*)/Γ (*B* → *X*_{
c
}ℓ*ν*)_{
tree
}= 0.908 ± 0.009. In the 1S scheme the estimated uncertainty from the non-BLM two-loop is half of the BLM part, equivalent to 1.5% of the tree-level rate, more than 3 times the actual correction. The new results are not implemented in the global fits as they are available mainly in the pole-mass scheme.

### 3.4 Outlook

The error on the inclusive and exclusive determination of *V*_{
cb
}is limited by theory. The two method have a 2*σ* disagreement, even when results in the same experiment are compared.

For exclusive determinations, the experimental determinations in *B* → *D**ℓ*ν* from different experiments are not in agreement. Improving the statistical error for the determination from *B* → *D*ℓ*ν* will help in elucidating the origin of this discrepancy.

In the inclusive method, errors of less than 2% are quoted. However, the latest theoretical results and the introduction of better correlation between theoretical error show shifts in the central value larger than the quoted fit error. The situation needs to be reevaluated when all the new calculations and corrections are implemented in the fit. Another puzzling result of the global fit to *B* → *X*_{
c
}ℓ*ν* moments only is the value of ${m}_{b}^{\text{kin}}$, which seems to be in disagreement with other determinations, such as the ones in Table 2. (For a detailed comparison between ${m}_{b}^{\text{kin}}$ and different mass determinations see A. Hoang, talk at Joint Workshop on |*V*_{
ub
}| and |*V*_{
cb
}| at the B-Factories Heidelberg, December 14–16, 2007.)

## 4 The CKM parameter |*V*_{
ub
}|

The parameter |*V*_{
ub
}| determines one of the sides of the unitarity triangle, and thus affects one of the crucial tests of the Yukawa sector of the Standard Model. Also in this case, there are two general methods to determine this parameter, using *B* meson semileptonic decays. The first approach relies on the determination of branching fractions and form factor determinations of exclusive semileptonic decays, such as *B* → *π*ℓ$\overline{\nu}$_{ℓ}. The relationship between experimental measurements and |*V*_{
ub
}| requires a theoretical prediction of the hadronic form factors governing these decays. The complementary approach relies on measurements of inclusive properties of *B* meson semileptonic decays. In this case, the hadronic matrix element is evaluated in the context of the OPE.

Both methods pose challenges to both experimenters and theorists. The branching fractions are small, and a substantial background induced by the dominant *b* → *c*ℓ$\overline{\nu}$_{ℓ} needs to be suppressed or accounted for. The large data samples accumulated at the two b-factories, Belle and BaBar, have made possible the development of new experimental techniques that have reduced the experimental errors substantially. In parallel, theorists devoted considerable efforts to devise measurable quantities for which reliable predictions could be produced, and to improve the accuracy of the predictions through refinements in the calculation. Although the precision of the stated errors improved, so did the difference in central values between the inclusive and exclusive estimates of |*V*_{
ub
}|, at least in most determinations. Possible interpretations of this discrepancy will be discussed.

### 4.1 Determinations Based on Exclusive *B*Semileptonic Decays

The decay *B* → *π*ℓ$\overline{\nu}$_{ℓ} is the simplest to interpret, as it is affected by a single form factor. The differential decay width is given by

where *G*_{
F
}is the Fermi constant, $\lambda ({q}^{2})={({q}^{2}+{m}_{B}^{2}-{m}_{\pi}^{2})}^{2}-4{m}_{B}^{2}{m}_{\pi}^{2}$, and *f*_{+} (*q*^{2}) is the relevant form factor.

The first experimental challenge is to reconstruct this exclusive channel without significant background from the dominant charm semileptonic decays, and the additional background component from other *b* → *u*ℓ$\overline{\nu}$ transitions. The advantage of *e*^{+} *e*^{-} B-factories is that the *B* decaying semileptonically originates from the decay *e*^{+} *e*^{-} *B* $\overline{B}$. Thus, if the companion *B* is fully reconstructed, the $\overline{\nu}$_{ℓ} 4-momentum can be inferred from the rest of the event.

CLEO pioneered this approach by reconstructing the *ν* from the missing energy (*E*_{
miss
}≡ 2*E*_{
B
}- Σ_{
i
}*E*_{
i
}), and momentum $\left({\overrightarrow{p}}_{miss}\equiv {\displaystyle {\sum}_{i}{\overrightarrow{p}}_{i}}\right)$ in the event; in these definitions the index *i* runs over the well reconstructed tracks and photons, and cuts are applied to enhance the probability that the only missing particle in the event is the $\overline{\nu}$. This approach allows the application of a low momentum cut of 1.5 GeV on the lepton for *B* → *π*ℓ$\overline{\nu}$_{ℓ} and 2 GeV for *B* →*ρ*ℓ$\overline{\nu}$_{ℓ}. Averaging the results from these two exclusive channels, they obtain $\left|{V}_{ub}\right|=\left(3.3\pm {0.2}_{-0.4}^{+0.3}\pm 0.7\right)\times {10}^{-3}$. Using their relatively modest full data set (16 fb^{-1}) at the center-of-mass energy of the ϒ(4*S*) and considering only the *B* → *π*ℓ$\overline{\nu}$_{ℓ} channel, they get the branching fraction $\mathcal{B}$ (*B* → *π*^{+} ℓ$\overline{\nu}$_{ℓ}) = (1.37 ± 0.15 ± 0.11) × 10^{-4}, and $\left|{V}_{ub}\right|=\left(3.6\pm 0.4\pm 0.2{\pm}_{-0.4}^{+0.6}\right)\times {10}^{-3}$[170].

BaBar [171] uses a sample of 206 fb^{-1} to obtain $\mathcal{B}$(*B* → *π*^{+} ℓ$\overline{\nu}$_{ℓ}) = (1.46 ± 0.07 ± 0.08) × 10^{-4}. Recently, the availability of very large data sets at Belle and BaBar have made possible tagged analyses, where semileptonic decays are studied in samples where the other *B* is fully reconstructed, thus defining the event kinematics even more precisely. The first implementation relies on the the partial reconstruction of exclusive *B* → *D*^{(⋆)} ℓ^{+} *ν*_{ℓ} decay to tag the presence of a *B* $\overline{B}$ event [172, 173].

Belle uses fully reconstructed hadronic tags, achieving the best kinematic constraints and thus the highest background suppression capabilities and the most precise determination of *q*^{2}. Branching fractions obtained with this technique have a bigger statistical error because of the penalty introduced by the tag requirement, but the overall error is already comparable with the other methods. Table 8 summarizes the present status of the experimental information on this decay. Both the total branching fraction for *B* → *π* ℓ$\overline{\nu}$_{ℓ} and the partial branching fraction for *q*^{2} ≥ 16 GeV^{2} are shown. The latter partial branching fraction is useful to extract |*V*_{
ub
}| using unquenched lattice calculations, as this is the only *q*^{2} interval where their calculation is reliable.

In order to interpret these results, we need theoretical predictions for the form factor *f*_{+} (*q*^{2}). This problem can be split into two parts: the determination of the form factor normalization, *f*_{+} (0), and the functional form of the *q*^{2} dependence. Form factor predictions have been produced with quark models [174] and QCD sum rule calculations [175]. Lattice calculations provide evaluations of *f*_{+} (*q*^{2}) at specific values of *q*^{2} or, equivalently, pion momenta (*p*_{
π
}).

Authors then fit these data points with a variety of shapes. Typically a dominant pole shape has been used in the literature. Nowadays more complex functional forms are preferred. Becirevic and Kaidalov (BK) [176] suggest using

where *c*_{
B
}${M}_{{B}^{\star}}^{2}$ is the residue of the form factor at *q*^{2} = ${M}_{{B}^{\star}}^{2}$, and ${M}_{{B}^{\star}}^{2}$/*α* is the squared mass of an effective 1^{-} *B⋆'* excited state. Ball and Zwicky [175] propose

where the parameters *r*_{1}, *r*_{2}, and *α* are fitted from available data. Lastly, parameterizations that allow the application of constraints derived from soft-collinear effective theory (SCET), and dispersion relations have been proposed by Boyd, Grinstein, and Lebedev [120], and later pursued also by Hill [177]. They define

where *t* is (*p*_{
B
}- *p*_{
π
})^{2}, defined beyond the physical region, *t*_{±} = (*m*_{
B
}± *m*_{
π
})^{2}, and *t*_{0} is an expansion point. BGD use *t*_{0} = 0.65*t*_{-}. The parameter *α*_{
K
}allows the modeling of different functional forms, and the variable

maps *t*_{+} <*t* < ∞ onto |*z*| = 1 and -∞ <*t* < t_{+} onto the *z* interval [-1,1].

All are refinements of the old ansatz of a simple pole shape, now rarely used. The first lattice calculations were carried out with the quenched approximation that ignores vacuum polarization effects [178, 179]. In 2004, preliminary unquenched results were presented by the Fermilab/MILC [180] and HPQCD [181] collaborations. These calculations employed the MILC collaboration *N*_{
f
}= 2 + 1 unquenched configurations, which attain the most realistic values of the quark masses so far. Using these calculations and the most recent value of the partial branching fraction $\mathcal{B}$(*B* → *π* ℓ*ν*) for *q*^{2} > 16 GeV^{2}, shown in Table 8, we obtain |*V*_{
ub
}| = (3.51 ± 0.096_{
exp
}± 0.49_{
th
}) × 10^{-3} with the HPQCD normalization, and |*V*_{
ub
}| = (3.70 ± 0.10_{
exp
}± 0.37_{
th
}) × 10^{-3} with the Fermilab/MILC normalization. Fits to experimental data, combining lattice predictions, and dispersion relations reduce the theoretical errors. For example, [182] obtains |*V*_{
ub
}| = (3.5 ± 0.17 ± 0.44) × 10^{-3}, and, more recently, [183] obtains |*V*_{
ub
}| = (3.47 ± 0.29) × 10^{-3}. The most recent lattice calculation [184] performs a simultaneous fit of improved lattice numerical Monte Carlo results and the 12-bin BaBar experimental data on |*V*_{
ub
}|*f*_{+} (*q*^{2}) [185] and derives |*V*_{
ub
}| = (3.38 ± 0.35) × 10^{-3}. The ~10% error includes theoretical and experimental errors, not easily separable.

### 4.2 Determinations Based on Inclusive *B*Semileptonic Decays

Inclusive determinations of |*V*_{
ub
}| rely on the heavy quark expansion (HQE), which combines perturbative QCD with an expansion in terms of 1/*m*_{
b
}, which accounts for non-perturbative effects. Although the possible breaking of the assumption of local quark hadron duality may produce unquantified errors, other non-perturbative uncertainties can be evaluated with systematic improvements, and their uncertainties are easier to assess than the ones of unquenched lattice QCD or QCD sum-rules. This statement applies to the total charmless semileptonic width ${\Gamma}_{u}^{SL}$. In the OPE approach, discussed in the |*V*_{
cb
}| section, the observation by Chay, Georgi and Grinstein that there are no non-perturbative corrections of order Λ_{QCD}/*m*_{
b
}[121] has inspired the hope that this approach would lead to a more precise determination of this important parameter. Before discussing the methods used to relate data with theory, it is useful to observe one of the possible expressions available in the literature for the total semileptonic width [186], including an exact two-loop expression for the perturbative expansion and second-order power corrections. They obtain

where *μ* ~ *m*_{
b
}is the scale at which *α*_{
S
}needs to be evaluated, while the scale *μ*_{⋆} applies to the non-perturbative expansion parameters, namely the *b* quark mass *m*_{
b
}, the chromo-magnetic operator, *λ*_{2}, and the kinetic operator, *μ*_{
π
}. It is clear that, even restricting our attention to the total width, a precise knowledge of the *b* quark mass is critical, and considerable theoretical effort has been devoted to a reliable extraction of this parameter from experimental observables. Other uncertainties, such as the effects of weak annihilation or violations of quark-hadron duality, will be discussed later.

Charmless *B* meson semileptonic decays constitute only about 1% of the total semileptonic width. Thus the big challenge for experimentalists is to identify techniques to suppress this large background. For example, the first evidence for *B* meson charmless semileptonic decays came from the study of the end point of the lepton spectrum, where leptons from *b* → *c* ℓ*ν* processes are forbidden due to the larger mass of the hadronic system formed by the *c* quark [187]. While this was very important first evidence that |*V*_{
ub
}| ≠ 0, very quickly several authors pointed out that this region of phase space is ill suited to a precise determination of |*V*_{
ub
}| because near the end point the OPE does not converge and an infinite series of contributions needs to be taken into account [188, 189]. Thus, a large effort has gone into developing experimental techniques that would feature a low lepton energy *E*_{ℓ} cut and an acceptable signal to background ratio. Table 9 summarizes the present status of the |*V*_{
ub
}| determination with this approach.

Next a whole host of papers proposed alternative "model independent" approaches to measure |*V*_{
ub
}| from inclusive decays [190–192], with the common goal of identifying a region of phase space where experimentalists can suppress the *b* → *c* background, and where the OPE works. The first proposal by Bigi, Dikeman, and Uraltsev proposed considering semileptonic events where *M*_{
X
}≤ 1.5 GeV [190]. However, Bauer, Ligeti, and Luke pointed out [193] that the kinematic limit ${m}_{X}^{2}~{m}_{D}^{2}$ has the same properties of the lepton end point, and spoils the convergence of the OPE; the same authors proposed using *d*Γ/*dq*^{2} up to *q*^{2} = ${m}_{D}^{2}$ and argue that this distribution is better behaved in the kinematic region of interest. This is the theoretical foundation of the so called "improved end point" method, which encompasses the simultaneous study of *E*_{ℓ} and <