Research Article | | Peer-Reviewed

On Inference of Weitzman Overlapping Coefficient ∆(X,Y) in the Case of Two Normal Distributions

Received: 9 July 2024     Accepted: 5 August 2024     Published: 20 August 2024
Views:       Downloads:
Abstract

The Weitzman overlapping coefficient ∆(X,Y) is the most important and widely used overlapping coefficient, which represents the intersection area between two probability distributions. This research proposes a new general technique to estimate ∆(X,Y) assuming the existence of two independent random samples following normal distributions. In contrast to some studies in this scope that place some restrictions on the parameters of the two populations such as the equality of their means or the equality of their variances, this study did not assume any restrictions on the parameters of normal distributions. Two new estimators for ∆(X,Y) were derived based on the proposed new technique, and then the properties of the estimator resulting from taking their arithmetic mean was studied and compared with some corresponding estimators available in the literature based on the simulation method. An extensive simulation study was performed by assuming two normal distributions with different parameter values to cover most possible cases in practice. The parameter values were chosen taking into account the exact value of ∆(X,Y), which taken to be small (close to zero), medium (close to 0.5) and large (close to 1). The simulation results showed the effectiveness of the proposed technique in estimating ∆(X,Y). By comparing the proposed estimator of ∆(X,Y) with some existing corresponding estimators, its performance was better than the performances of the other estimators in almost all considered cases.

Published in International Journal of Theoretical and Applied Mathematics (Volume 10, Issue 2)
DOI 10.11648/j.ijtam.20241002.11
Page(s) 14-22
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2024. Published by Science Publishing Group

Keywords

Overlapping Weitzman Coefficient, Maximum Likelihood Method, Parametric Method, Normal Distribution, Expected Value, Relative Bias, Relative Mean Square Error

1. Introduction
The well-known Weitzman overlapping (OVL) coefficient Δ(X, Y) is a measure of similarity between two probability distributions. Although the Weitzman coefficient is the most important interference coefficient, there are some other overlapping coefficients that have been discussed and studied in the literature, see . Δ(X, Y) represents the common area under two probability density functions (pdfs). The OVL coefficients are widely used in the literature in many applications such as; comparison of income distributions ; distinctness clusters , reliability analysis and goodness of fit test .
Let X and Y be two independent continuous random variables follow f1x and f2x respectively. The Weitzman OVL coefficient between X and Y is defined by,
Δ(X, Y)=minf1x,f2xdx.
There are two main methods used to estimate the OVL coefficients; the parametric method and the non-parametric method. The parametric method assumes that the formulas of the probability density functions are known but depend on unknown parameter(s). To estimate the OVL coefficient, these parameters can be estimated by using one of the well-known statistical methods, such as the method of moments or the maximum likelihood method (see, ). If it is not possible to determine the probability density function models for the data or if there are some doubts about the validity of the assumption of a particular data model, the alternative is to use the non-parametric method instead of the parametric method. The non-parametric method does not require any assumptions about the formulas of probability density functions, as this method is used to estimate the formulas of the probability density functions themselves (see, ).
If X and Y follow N(μ1,σ12) and N(μ2,σ22) respectively, where N(μ,σ2) is the pdf of a normal distribution with mean μ and variance σ2 then
ΔX, Y=-minN(μ1,σ12),N(μ2,σ22)dx.
Under the assumption, σ12= σ22=σ2, Inman and Bradley derived the value of ΔX, Y, which is given by,
ΔX, Y=2Φ -μ1-μ22σ
=2Φ -δ/2
where δ=μ1-μ2/σ and Φ(t) is the standard normal cumulative distribution function at a point t. Let X1, X2,,Xn1 and Y1, Y2,,Yn2 be two independent random samples drawn from the two normal densities N(μ1,σ2) and N(μ2,σ2) respectively. The maximum-likelihood estimator of ΔX, Y is ,
̂INX, Y=2Φ-X̅-Y̅2S,
where X̅ and Y̅ are the maximum likelihood (ML) estimators of μ1 and μ2 respectively and S2 is the pooled ML estimator of σ2, which is given by,
S2=i=1n1(Xi-X̅)2+i=1n2(Yi-Y̅)2n1+n2.
Mulekar and Mishra derived the formula of ΔX, Y under the assumption that the two means are equal, i.e. μ1=μ2=μ. Now, define C=σ1/σ2, then they gave the following formula for ΔX, Y,
ΔX, Y= 1-2Φb+2ΦCb, if 0<C<11+2Φb-2ΦCb, if C1,
where = -ln(C2)/1-C2. The corresponding estimator of ΔX, Y that suggested by is,
̂MMX, Y=1-2Φb̂+2ΦĈb̂, if 0<σ̂1<σ̂21+2Φb̂-2ΦĈb̂, if σ̂1σ̂2
where, b̂=-ln(Ĉ2)/1-Ĉ2, Ĉ=σ̂1/σ̂2, σ̂12= i=1n1(Xi-μ̂)2n1, σ̂22= i=1n2(Yi-μ̂)2n2 and μ̂=i=1n1Xi+i=1n2Yin1+n2.
It is important to note that the value of b is undefined if σ1=σ2 and the corresponding estimator b̂ is also undefined if σ̂1=σ̂2. In this case, the value of the parameter ΔX, Y=1 (because the two densities are identical), and then the value of the corresponding estimator ̂MMX, Y must be 1.
The previous two studies placed some restrictions on the parameters of the distributions. The first assumed that the two variances were equal, while the second assumed that the two means were equal. To overcome this problem, Eidous and Al-Shourman estimated ΔX, Y under a pair of normal distributions without using any assumptions on their parameters. Their proposal was based on approximating the integral of ΔX, Y and then the resulting approximation was estimated instead of estimating the exact value of ΔX, Y.
The aim and main idea of this paper parallel the work of . Without using any assumptions about the parameters of normal distribution, this paper proposed a new technique to deal with the integral of ΔX, Y by writing it as an expected value for a function or some functions and then estimating the resulting expected value instead of estimating ΔX, Y itself.
2. Main Results
Let X and Y be two random variables from f1(x;μ1,σ12)=N(μ1,σ12) and f2(y;μ2,σ22)=N(μ2,σ22) respectively. In this section, a new technique for estimating overlapping Weitzman coefficient is suggested,
ΔX, Y=-minf1(x;μ1,σ12), f2(x;μ2,σ22)dx.
This proposed technique consists of two stages. In the first stage, the coefficient was written as an expected value of some function. In the second stage, a new estimator was proposed for the resulting expectation. Accordingly, the estimator was derived as follows.
Consider minf1(X; μ1,σ12), f2(X;μ2,σ22)/f1(X; μ1,σ12) as a function of X and minf1(Y; μ1,σ12), f2(Y;μ2,σ22)/f2(Y;μ2,σ22) as a function of Y. Then,
Eminf1(X;μ1,σ12), f2(X;μ2,σ22)f1(X;μ1,σ12) =0minf1(x; μ1,σ12), f2(x;μ2,σ22)f1(x;μ1,σ12) f1(x;μ1,σ12) dx
=-minf1(x;μ1,σ12), f2(x;μ2,σ22)dx
=ΔX, Y
and
Eminf1(Y;μ1,σ12), f2(Y;μ2,σ22)f2(Y;μ2,σ22) =0minf1(y;μ1,σ12), f2(y;μ2,σ22)f2(y;μ2,σ22) f2(y;μ2,σ22) dy
=-minf1(y;μ1,σ12), f2(y;μ2,σ22)dy
=-minf1(x;μ1,σ12), f2(x;μ2,σ22)dx
=ΔX, Y.
From the last two formulas, ΔX, Y can also be expressed as the average of the above two formulas as follows,
ΔX, Y=12Eminf1(X;μ1,σ12), f2(X;μ2,σ22)f1(X;μ1,σ12) +Eminf1(Y;μ1,σ12), f2(Y;μ2,σ22)f2(Y;μ2,σ22) .
Based on the last three formulas for ΔX, Y and based on the two independent random samples X1, X2, , Xn1 and Y1,Y2, , Yn2, the ML estimators of μ1, μ2,  σ1 2and σ22 are μ̂1=X̅, μ̂2=Y̅, σ̂12= S12 and σ̂22= S22 respectively. Therefore, the ML estimators for f1(x;μ1, σ12) and f2(y;μ2, σ22) are f1(x;μ̂1,σ̂12) and f2(y; μ̂2,σ̂22) respectively. Now, ΔX, Y can be estimated using any of the following two estimators that are consistent with the first two formulas of ΔX, Y,
Δ̂X, Y=1n1i=1n1minf1(Xi; μ̂1,σ̂12), f2(Xi;μ̂2,σ̂22)f1(Xi; μ̂1,σ̂12)
or,
Δ̂X, Y=1n2i=1n2minf1(Yi; μ̂1,σ̂12), f2(Yi;μ̂2,σ̂22)f2(Yi;μ̂2,σ̂22).
After conducting a preliminary simulation study, this study shows that the average of the last two estimators for ΔX, Y (corresponding to the last formula of ΔX, Y) is more stable than each of them individually. Therefore, the finite properties of the following proposed estimator is investigated in our simulation study in the next section,
Δ̂PropX, Y=121n1i=1n1minf1(Xi; μ̂1,σ̂12), f2(Xi;μ̂2,σ̂22)f1(Xi; μ̂1,σ̂12)+1n2i=1n2minf1(Yi; μ̂1,σ̂12), f2(Yi;μ̂2,σ̂22)f2(Yi;μ̂2,σ̂22).
3. Simulation Study
In this section, a simulation study was conducted to investigate the performance of the proposed estimator of ΔX, Y compared to some estimators found in the literature under pair of normal distributions. One of the general estimators that was taken into consideration in this study is the non-parametric kernel estimator developed by , which we will denote it by Δ̂kX, Y (see also for the selection of bandwidth).
We considered the following three cases at which four pairs of distributions were chosen to simulate the data for each case. The basis of the selection process for these distributions is to ensure small, medium (less than 0.5), medium (greater than 0.5) and large values for the true values of ΔX, Y. The 3×4=12 pairs are given in Table 1.
Four pairs of normal distributions with equal variances are selected (See Table 1). In this case, the estimators Δ̂kX, Y, ̂INX, Y and ̂PropX, Y are considered and their performances were compared.
Four pairs of normal distributions with equal means are selected (See Table 1). The estimators Δ̂kX, Y, ̂MMX, Y and ̂PropX, Y are considered in this case.
Four pairs of normal distributions with different variances and different means are selected (See Table 1). In this case, only the two estimators Δ̂kX, Y and ̂PropX, Y are investigated.
It should be noted here that, on the first hand, the estimator ̂INX, Y was developed assuming that the variances are equal, while ̂MMX, Y was derived assuming that the means are equal. On the other hand, the estimator Δ̂kX, Y was developed without using any assumptions on the parameters of the distributions or even on the shape of the distributions themselves. Finally, the proposed estimator ̂PropX, Y was derived assuming that the two distributions are normal but without using any assumptions on their parameters. Therefore, Δ̂kX, Y and ̂PropX, Y can be used for all three cases mentioned above, while the estimator ̂INX, Y can be used in the first case only, and the estimator ̂INX, Y can only be used in the second case.
Let x1, x2,, xn1 and y1, y2,, yn2 are two independent simulated samples from f1x and f2y respectively, then to study the behavior of the various estimators for different sample sizes, (n1,n2)=10, 10, 50, 50, (100, 200) are chosen. For each sample size, R=1000 replications are used. The Relative Bias (RB), Relative Mean Square Error (RMSE) and Efficiency (EFF) were computed for each estimator under study. These measures were computed according to the following rules.
If η̂ is an estimator of η then,
RB=Êη̂-ηη,
RMSE=MSÊ(η̂)η,
Where
Êη̂=j=1Rη̂(j)R
and
MSÊη̂=j=1Rη̂j-Êη̂2/R.
The efficiency of each considered estimator is computed with respect to the nonparametric estimator, Δ̂kX, Y. The efficiency of η̂ with respect to Δ̂kX, Y is,
EFF=MSÊ(Δ̂kX, Y)MSÊ (η̂).
The simulation results are reported in Tables 2 to 4.
Table 1. The 12 simulated pair normal distributions f1(x) and f2(y) together with the corresponding exact values of the overlapping coefficient ΔX, Y.

Normal distributions

f1x

f2y

ΔX, Y

Case 1: Equal variances

A

N0,1

N-0.5,1

0.8025

B

N0,1

N1,1

0.671

C

N0,1

N1.5,1

0.4532

D

N0,1

N3,1

0.1336

Case 2: Equal means

A

N0,1

N0,1.5

0.8064

B

N0,1

N0,2.5

0.585

C

N0,1

N0,5

0.3528

D

N0,1

N0,10

0.2017

Case 3: Different means and different variances

A

N0,1

N-0.2,1.1

0.9151

B

N0,1

N1,2

0.6099

C

N0,1

N2.5,4

0.3577

D

N0,1

N5,2

0.0891

4. Simulation Results
The RB, RMSE and EFF for each estimator mentioned in the previous sections are displayed in Table 2, Table 3 and Table 4. By examining these results, we conclude the following:
As a general conclusion, it is clear that the RMSE values of the different estimators decrease with increasing sample sizes. This a good sign for concluding that the different estimators are consistent estimators for ΔX, Y.
By examining the results of Table 2, which concern the case of two normal distributions with equal variances, it is clear that the two estimators ̂INX, Y and ̂propX, Y perform similar to each other with some preferring for ̂INX, Y over ̂propX, Y. These two estimators perform better than the general kernel estimator ̂kX, Y.
From Table 3, the performance of the estimator ̂MMX, Y is better than the other two counterpart estimators ̂kX, Y and ̂propX, Y when the exact values of overlapping coefficient are large. The opposite is true for small values of ΔX, Y. This may be due to the estimation of the common mean by using the pooled mean as suggested by .
In all cases, the performance of the proposed estimator ̂propX, Y is better than that of ̂kX, Y. It is worthwhile to mention here that the estimator of can only be used when the mean of the two normal distributions is assumed to be equal.
Based on the results of Table 4, the proposed estimator ̂propX, Y achieves good performance in general. Its performance is better than that of the kernel estimator ̂kX, Y in all cases studies. This is very evident when examining the values RMSE and EFF that associated with the proposed and kernel estimators. It is important to note that this result is expected because the kernel estimator can be used more generally without looking to the distributions at which the data come from. It should also be noted here that the proposed estimator was derived without using the assumption of equal means or equal variances for the two normal distributions.
Finally, it should be noted that the Inman and Bradley estimator ̂INX, Y (or Mulekar and Mishra estimator ̂MMX, Y) can be used only if the two variances (or the two means) of the normal distributions are equal. Because of this drawback of each of them, we recommend using the proposed estimator ̂propX, Y as a general estimator for ΔX, Y under a pair of normal distributions.
Table 2. The RB, RMSE and EFF of the estimators Δ̂kX, Y, ̂INX, Y and ̂PropX, Ywhen the data are simulated from pair normal distributions with equal variances (σ12=σ22=1) (Case, 1 of Table 1).

ΔX, Y

(n1,n2)

̂kX, Y

̂INX, Y

Δ̂PropX, Y

0.8025

(10,10)

RB

0.0364

-0.033

-0.0995

RMSE

0.246

0.1951

0.2037

EFF

1

1.5899

1.459

(50,50)

RB

-0.004

-0.0038

-0.0164

RMSE

0.0914

0.0973

0.095

EFF

1

0.8821

0.9255

(100,200)

RB

0.0018

0.002

-0.0022

RMSE

0.0615

0.0606

0.0601

EFF

1

1.031

1.049

0.617

(10,10)

RB

-0.0024

-0.0008

-0.0484

RMSE

0.2803

0.2784

0.268

EFF

1

1.0141

1.094

(50,50)

RB

0.002

0.0029

-0.0051

RMSE

0.1342

0.1297

0.1286

EFF

1

1.07

1.0894

(100,200)

RB

-0.0012

-0.0004

-0.0031

RMSE

0.0812

0.0726

0.0726

EFF

1

1.2507

1.2501

0.4532

(10,10)

RB

0.0009

-0.0036

-0.041

RMSE

0.3567

0.3461

0.3368

EFF

1

1.0619

1.1214

(50,50)

RB

0.0029

0.0011

-0.0049

RMSE

0.1698

0.1565

0.1556

EFF

1

1.1759

1.1899

(100,200)

RB

-0.0004

-0.001

-0.0032

RMSE

0.1061

0.089

0.0902

EFF

1

1.4206

1.3833

0.1336

(10,10)

RB

0.0607

0.0166

-0.0229

RMSE

0.7837

0.6801

0.6807

EFF

1

1.328

1.3257

(50,50)

RB

0.0295

0.0213

0.0158

RMSE

0.3642

0.3073

0.3062

EFF

1

1.4052

1.4148

(100,200)

RB

0.0009

-0.0043

-0.0057

RMSE

0.2093

0.1738

0.1768

EFF

1

1.4494

1.4006

Table 3. The RB, RMSE and EFF of the estimators Δ̂kX, Y, ̂MMX, Y and ̂PropX, Y when the data are simulated from pair normal distributions with equal means (μ1=μ2=0) (Case, 2 of Table 1).

ΔX, Y

(n1,n2)

̂kX, Y

̂MMX, Y

Δ̂PropX, Y

0.8064

(10,10)

RB

-0.1686

0.0014

-0.0093

RMSE

-0.2447

0.1481

0.1606

EFF

1

2.7332

2.3214

(50,50)

RB

-0.042

0.0014

-0.0007

RMSE

0.0954

0.0825

0.0844

EFF

1

1.3367

1.2793

(100,200)

RB

-0.01

0.0001

0.0008

RMSE

0.0537

0.0463

0.047

EFF

1

1.3505

1.3077

0.5850

(10,10)

RB

-0.1213

0.0688

0.0152

RMSE

0.297

0.2353

0.2523

EFF

1

1.5925

1.386

(50,50)

RB

-0.0274

0.0083

-0.004

RMSE

0.1266

0.0985

0.1074

EFF

1

1.653

1.3906

(100,200)

RB

-0.0029

0.0053

0.0029

RMSE

0.0723

0.0593

0.0609

EFF

1

1.4848

1.4057

0.3528

(10,10)

RB

-0.1349

0.2267

0.0105

RMSE

0.3798

0.4077

0.3296

EFF

1

0.8684

1.3284

(50,50)

RB

-0.03

0.0536

0.0032

RMSE

0.1613

0.1278

0.1323

EFF

1

1.5931

1.4858

(100,200)

RB

-0.013

0.0187

0.0014

RMSE

0.0949

0.0754

0.0781

EFF

1

1.5846

1.4756

0.2017

(10,10)

RB

-0.187

0.5953

-0.0112

RMSE

0.4733

0.8444

0.4167

EFF

1

0.3141

1.2901

(50,50)

RB

-0.0524

0.1694

0.0012

RMSE

0.2116

0.278

0.1732

EFF

1

0.5795

1.4938

(100,200)

RB

-0.0325

0.0833

0.0028

RMSE

0.1226

0.1533

0.0983

EFF

1

0.6402

1.5566

Table 4. The RB, RMSE and EFF of the estimators Δ̂kX, Y and ̂PropX, Y when the data are simulated from pair normal distributions with different parameters (Case, 3 of Table 1).

ΔX, Y

(n1,n2)

̂kX, Y

Δ̂PropX, Y

0.9151

(10,10)

RB

-0.2384

-0.1537

RMSE

0.2801

0.2029

EFF

1.0000

1.9056

(50,50)

RB

-0.0793

-0.0404

RMSE

0.1020

0.0777

EFF

1.0000

1.722

(100,200)

RB

-0.0359

-0.0132

RMSE

0.0551

0.048

EFF

1.0000

1.3183

0.6099

(10,10)

RB

-0.1117

-0.0682

RMSE

0.2813

0.2452

EFF

1.0000

1.316

(50,50)

RB

-0.0281

-0.0192

RMSE

0.1209

0.1037

EFF

1.0000

1.3596

(100,200)

RB

-0.0041

-0.0063

RMSE

0.069

0.0598

EFF

1.0000

1.3324

0.3577

(10,10)

RB

-0.1141

-0.0594

RMSE

0.3804

0.3234

EFF

1.0000

1.3833

(50,50)

RB

-0.0353

-0.0170

RMSE

0.1756

0.1438

EFF

1.0000

1.4914

(100,200)

RB

-0.0064

-0.0023

RMSE

0.0944

0.0791

EFF

1.0000

1.4263

0.0891

(10,10)

RB

-0.0098

-0.0464

RMSE

0.8987

0.8097

EFF

1.0000

1.2317

(50,50)

RB

0.0671

-0.0014

RMSE

0.4347

0.3660

EFF

1.0000

1.4105

(100,200)

RB

0.0578

0.0118

RMSE

0.2547

0.2103

EFF

1.0000

1.4662

5. Conclusion
This study presented a new technique for estimating (X,Y) under a pair of normal distributions by writing it as an expected value for some functions. One of the most important benefits of this technique is to estimate (X,Y) without placing any conditions on the parameters of normal distributions. Based on the results of numerical simulations, these results demonstrated the effectiveness of the new technique and that the performance of the estimator resulting from the use of this technique is better than the performance of the nonparametric kernel estimator of (X,Y) that developed by Eidous and AL-Talafha . Accordingly, this technique can be used to estimate other OVL coefficients mentioned in the literature, such as the Matusita coefficient (see, Eidous and Ananbeh ) and Pianka and Kullback-Leibler coefficients (see, Eidous and Abu Al-Hayja`a ).
Abbreviations

OVL

Overlapping

pdf

Probability Density Function

ML

Maximum Likelihood

RB

Relative Bias

MSE

Mean Square Error

RMSE

Relative Mean Square Error

EFF

Efficiency

Conflicts of Interest
The authors declare no conflicts of interest.
References
[1] Alodat, M. T., Al Fayez, M. and Eidous, O. (2021). On the asymptotic distribution of Matusita's overlapping measure. Communications in Statistics - Theory and Methods. 51(20), 6963-6977.
[2] Dhaker, H., Deme, E. and El-Adlouni, S. (2021). On inference of overlapping coefficients in two inverse Lomax populations. Statistical Theory and Applications, 20 (1), 61-75.
[3] Eidous, O., and Abu Al-Hayja’s, M. (2023a). Numerical integration approximations to estimate the Weitzman overlapping measure: Weibull distributions. Yugoslav Journal of Operations Research, 33 (4), 699-712.
[4] Eidous, O. M. and Abu Al-Hayja,a, M. (2023b). Estimation of overlapping measures using numerical approximations: Weibull distributions. Jordan Journal of Mathematics and Statistics (JJMS), 16(4), 741–761.
[5] Eidous, O. and Abu Al-Hayja’a, M. (2023c). Weibull distributions for estimation of Pianka and Kullback-Leibler overlapping measures. Journal of Mathematics and Statistics Research, 5 (1), 165.
[6] Eidous, O., and Al-Daradkeh, S. (2022). Estimation of Matusita overlapping coefficient for pair normal distributions. Jordan Journal of Mathematics and Statistics (JJMS), 15(4B), 1137-1151.‏
[7] Eidous, O. M., Al-Shourman, A. (2022). Numerical Integral Approximation to Estimate Matusita Overlapping Coefficient for Normal Distributions. Journal of Mathematical Techniques and Computational Mathematics, 1 (3), 264-270.
[8] Eidous, O. M. and Al-Shourman, A. (2023). Estimating the Weitzman Overlapping Coefficient Using Integral Approximation Method in the Case of Normal Distributions. Applied Mathematics, Modeling and Computer Simulation, 42, 1011-1020.
[9] Eidous, O. M., and AL-Talafha, S. A. (2020). Kernel method for overlapping coefficients estimation. Communications in Statistics: Simulation and Computation, 51(9), 5139–5156.
[10] Eidous, O. M., and Ananbeh, E. A. (2024a). Kernel method for estimating overlapping coefficient using numerical integration methods. Applied Mathematics and Computation, 462, 128339.‏
[11] Eidous, O. M., and Ananbeh, E. A. (2024b). Kernel method for estimating Matusita overlapping coefficient using numerical approximations. Annals of Data Science.
[12] Eidous, O. M, Marie, M. and Al-Haj Ibrahim, M. (2010). A comparative study for bandwidth selection in kernel density estimation. Journal of Modern Applied Statistical Methods, 9 (1), 263-273.
[13] Ichikawa, M. (1993). A meaning of the overlap pedarea under probability density curves of stress and strength. Reliability Engineering and System Safety, 41(2), 203-204.
[14] Inácio V. and Garrido Guillén J. E. (2022). Bayesian nonparametric inference for the overlap coefficient: With an application to disease diagnosis. Statistics in Medicine, 41(20), 3879-3898.
[15] Inman, H. F. and Bradley, E. L. (1989). The overlapping coefficient as a measure of agreement between probability distributions and point estimation of the overlap of two normal densities. Comm. Statist.- Theory Methods, 18, 3851-3874.
[16] Mulekar, M. S. and Mishra, S. N. (1994). Overlap Coefficient of two normal densities: equal means case. J. Japan Statist. Soc., 24, 169-180.
[17] Samawi, H. M. and Al-Saleh, M. F. (2008). Inference of overlapping coefficients in two exponential populations using ranked set sample. Communication of Korean of Statistical Society, 15 (2): 147-159.
[18] Sneath, P. H. A. (1977). A method for testing the distinctness of clusters: a test of the disjunction of two clusters in Euclidean space as measured by their overlap. Journal of the International Association for Mathematical Geology, 9, 123-143.
[19] Weitzman, M. S. (1970). Measures of overlap of income distributions of white and Negro families in the United States. Washington: U.S. Bureau of the Census.
Cite This Article
  • APA Style

    Eidous, O. M., Daradkeh, S. K. (2024). On Inference of Weitzman Overlapping Coefficient ∆(X,Y) in the Case of Two Normal Distributions. International Journal of Theoretical and Applied Mathematics, 10(2), 14-22. https://doi.org/10.11648/j.ijtam.20241002.11

    Copy | Download

    ACS Style

    Eidous, O. M.; Daradkeh, S. K. On Inference of Weitzman Overlapping Coefficient ∆(X,Y) in the Case of Two Normal Distributions. Int. J. Theor. Appl. Math. 2024, 10(2), 14-22. doi: 10.11648/j.ijtam.20241002.11

    Copy | Download

    AMA Style

    Eidous OM, Daradkeh SK. On Inference of Weitzman Overlapping Coefficient ∆(X,Y) in the Case of Two Normal Distributions. Int J Theor Appl Math. 2024;10(2):14-22. doi: 10.11648/j.ijtam.20241002.11

    Copy | Download

  • @article{10.11648/j.ijtam.20241002.11,
      author = {Omar Mohammad Eidous and Salam Khaled Daradkeh},
      title = {On Inference of Weitzman Overlapping Coefficient ∆(X,Y) in the Case of Two Normal Distributions
    },
      journal = {International Journal of Theoretical and Applied Mathematics},
      volume = {10},
      number = {2},
      pages = {14-22},
      doi = {10.11648/j.ijtam.20241002.11},
      url = {https://doi.org/10.11648/j.ijtam.20241002.11},
      eprint = {https://article.sciencepublishinggroup.com/pdf/10.11648.j.ijtam.20241002.11},
      abstract = {The Weitzman overlapping coefficient ∆(X,Y) is the most important and widely used overlapping coefficient, which represents the intersection area between two probability distributions. This research proposes a new general technique to estimate ∆(X,Y) assuming the existence of two independent random samples following normal distributions. In contrast to some studies in this scope that place some restrictions on the parameters of the two populations such as the equality of their means or the equality of their variances, this study did not assume any restrictions on the parameters of normal distributions. Two new estimators for ∆(X,Y) were derived based on the proposed new technique, and then the properties of the estimator resulting from taking their arithmetic mean was studied and compared with some corresponding estimators available in the literature based on the simulation method. An extensive simulation study was performed by assuming two normal distributions with different parameter values to cover most possible cases in practice. The parameter values were chosen taking into account the exact value of ∆(X,Y), which taken to be small (close to zero), medium (close to 0.5) and large (close to 1). The simulation results showed the effectiveness of the proposed technique in estimating ∆(X,Y). By comparing the proposed estimator of ∆(X,Y) with some existing corresponding estimators, its performance was better than the performances of the other estimators in almost all considered cases.
    },
     year = {2024}
    }
    

    Copy | Download

  • TY  - JOUR
    T1  - On Inference of Weitzman Overlapping Coefficient ∆(X,Y) in the Case of Two Normal Distributions
    
    AU  - Omar Mohammad Eidous
    AU  - Salam Khaled Daradkeh
    Y1  - 2024/08/20
    PY  - 2024
    N1  - https://doi.org/10.11648/j.ijtam.20241002.11
    DO  - 10.11648/j.ijtam.20241002.11
    T2  - International Journal of Theoretical and Applied Mathematics
    JF  - International Journal of Theoretical and Applied Mathematics
    JO  - International Journal of Theoretical and Applied Mathematics
    SP  - 14
    EP  - 22
    PB  - Science Publishing Group
    SN  - 2575-5080
    UR  - https://doi.org/10.11648/j.ijtam.20241002.11
    AB  - The Weitzman overlapping coefficient ∆(X,Y) is the most important and widely used overlapping coefficient, which represents the intersection area between two probability distributions. This research proposes a new general technique to estimate ∆(X,Y) assuming the existence of two independent random samples following normal distributions. In contrast to some studies in this scope that place some restrictions on the parameters of the two populations such as the equality of their means or the equality of their variances, this study did not assume any restrictions on the parameters of normal distributions. Two new estimators for ∆(X,Y) were derived based on the proposed new technique, and then the properties of the estimator resulting from taking their arithmetic mean was studied and compared with some corresponding estimators available in the literature based on the simulation method. An extensive simulation study was performed by assuming two normal distributions with different parameter values to cover most possible cases in practice. The parameter values were chosen taking into account the exact value of ∆(X,Y), which taken to be small (close to zero), medium (close to 0.5) and large (close to 1). The simulation results showed the effectiveness of the proposed technique in estimating ∆(X,Y). By comparing the proposed estimator of ∆(X,Y) with some existing corresponding estimators, its performance was better than the performances of the other estimators in almost all considered cases.
    
    VL  - 10
    IS  - 2
    ER  - 

    Copy | Download

Author Information