Identifiability determines whether all parameters in a model can be uniquely determined from available data. Imagine a detective trying to identify suspects from available clues. If the clues are sufficient and not contradictory, identification can be done with certainty.
For matrix A∈Rm×n, vector b∈Rm, with , the least squares problem
m≥n
xmin∥Ax−b∥22
aims to estimate parameter x∈Rn through the corresponding normal equation system
ATx=ATb
When this system has a unique solution, all parameters can be identified.
All parameters can be identified precisely when matrix A has full rank n.
Mathematically, this condition can be written as
Rank(A)=min(m,n)=n
The full rank condition is like ensuring that each parameter provides truly new and non-overlapping information. Similar to a detective case where there are sufficient independent clues to identify each suspect without confusion. Each parameter provides information that cannot be obtained from other parameters, making the estimation unique and stable.
The rank of matrix A∈Rm×n can be obtained during the computational process of QR decomposition or LU decomposition of matrix A. However, a more computationally expensive but numerically more stable approach is to determine the rank using singular value decomposition of A.
The difference between these two approaches is like comparing measurement with a regular ruler versus measurement with a high-precision instrument. Singular value decomposition provides more detailed and stable information about the numerical structure of matrices, especially for cases approaching singularity.
For matrix A∈Rm×n with Rank(A)=r, there exist orthogonal matrices U∈Rm×m and V∈Rn×n as well as matrix S=(sij)i=1,…,m with sij=0 for all i=j and non-negative diagonal entries s11≥s22≥⋯≥0, such that
A=USVT
This representation is called the singular value decomposition of A. The values σi=sii are called singular values of A. Matrices U and V are not uniquely determined.
This decomposition is like dismantling a complex machine into its basic components. We can see how the matrix transforms vector space, including the main transformation directions and how much scaling occurs in each direction.
The number of non-zero singular values of matrix A equals Rank(A).
Mathematically, this means
Rank(A)=#{σi:σi>0}
where # denotes the number of elements in the set.
This fundamental property provides a numerically stable way to determine matrix rank. Very small singular values are like weak radio signals, still present but barely detectable.
The term "rank-deficient" refers to the condition when a matrix does not have full rank. That is, Rank(A)<min(m,n). In this context, some rows or columns of the matrix are linearly dependent.
When a matrix is rank-deficient, some singular values become zero or very close to zero
σr>σr+1=σr+2=⋯=σmin(m,n)=0
This condition indicates that the equation system has more than one solution or may not even have a unique solution. In numerical practice, we often use a threshold ϵ to determine whether a singular value is considered zero
σi≤ϵ⋅σ1
where ϵ typically ranges between 10−12 to 10−16 depending on computational precision.
Singular value decomposition can be computed using eigenvalues and eigenvectors of ATA. The mathematical relationship is
ATA=VΣ2VT
where Σ2=diag(σ12,σ22,…,σn2) is a diagonal matrix that has values σi2 on the main diagonal and zeros elsewhere
Σ2=σ120⋮00σ22⋮0⋯⋯⋱⋯00⋮σn2
In numerical libraries, special functions are available for this computation called SVD (singular value decomposition). SVD implementations in modern numerical libraries use highly efficient and stable algorithms, making them reliable tools for various applications in matrix analysis and scientific computing.