Chi-square test under indeterminacy: an application using pulse count data

Aslam, Muhammad

doi:10.1186/s12874-021-01400-z

Research
Open access
Published: 30 September 2021

Chi-square test under indeterminacy: an application using pulse count data

Muhammad Aslam ORCID: orcid.org/0000-0003-0644-1950¹

BMC Medical Research Methodology volume 21, Article number: 201 (2021) Cite this article

2540 Accesses
9 Citations
Metrics details

Abstract

Background

The data obtained from the counting process is known as the count data. In practice, the counting can be done at the same time or the time of the count is not the same. To test either the K counts are differed significantly or not, the Chi-square test for K counts is applied.

Results

The paper presents the Chi-square tests for K counts under neutrosophic statistics. The test statistic of the proposed test when K counts are recorded at the same time and different time are proposed. The testing procedure of the proposed test is explained with the help of pulse count data.

Conclusions

From the analysis of pulse count data, it can be concluded that the proposed test suggests the cardiologists use different treatment methods on patients. In addition, the proposed test gives more information than the traditional test under uncertainty.

Peer Review reports

Background

The data obtained from the counting process is known as the count data. In practice, the counting can be done at the same time or the time of the count is not the same. To test either the K counts are differed significantly or not, the Chi-square test for K counts is applied. This test is applied to test the null hypothesis either the same training or methods should be applied on K counts against the alternative hypothesis that different training or methods should be applied on K counts. The Chi-square test for K counts under classical statistics is applied under the assumption that K counts are obtained under comparable conditions, see [1, 2]. Worked on the test for testing two means of Poisson distribution [3,4,5,6,7,8,9,10] presented applications of test for count data in a variety of fields.

According to [11], “statistical data are frequently not precise numbers but more or less non-precise also called fuzzy. Measurements of continuous variables are always fuzzy to a certain degree”. Similarly, the counting data is not always exact but may be in intervals or unclear. For example, the weather record data and pulse count data are expressed in intervals than the exact values. In these situations, the existing Chi-square test for K counts under classical statistics may mislead the decision-makers. The fuzzy-based tests may be an alternative to being applied when the count data is intervals. The applications of statistical tests under fuzzy logic can be seen in [12,13,14,15,16,17,18,19].

The fuzzy logic is a special case of neutrosophic logic [20, 21] showed the efficiency of the neutrosophic logic over the fuzzy logic and analysis based on the interval approach. For the application of neutrosophic logic, the reader may refer to [22,23,24,25,26]. The neutrosophic statistics was proposed using the idea of neutrosophic logic by [27]. This is a branch of mathematical statistics that provides the presentation, analysis, and inference of neutrosophic, fuzzy, interval, and indeterminate data. Classical statistics was considered a special case of neutrosophic statistics [28,29,30,31,32,33] discussed various applications of neutrosophic statistics.

The existing Chi-square test for K counts under classical statistics cannot be applied when the counts are in intervals. In this paper, the design of the Chi-square test for K counts under neutrosophic statistics will be given. We will extend the statistic for counts at the same time and at different times under neutrosophic statistics. The testing of the hypothesis will be given. The application of the proposed test will be given using the pulse counts data. By proposing the test, it is expected that the proposed test will be effective, flexible, informative, and adequate to be applied under uncertainty.

Results

The pulse rate is theoretically is considered a discrete variable. The application of the proposed test is given using the counts of the pulse rate of 11 patients. The first 11 values of data are obtained from [34] and the next values are generated by simulation. The data is shown in Table 1. A cardiologist is interested to see either the same treatment should be applied to all patients or not. Therefore, the null hypothesis for this case is that the same treatment method should be applied vs. the alternative hypothesis that different methods of treatment should be applied to all patients. As the pulse counts are noted in the same period of time, therefore, the statistic ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ is given in Eq. (1) a suitable statistic to apply for testing the given hypothesis. The neutrosophic form of ${\overline{N}}_{iN}\epsilon \left[{\overline{N}}_{iL},{\overline{N}}_{iU}\right]$ using the given data is given as ${\overline{N}}_{iN}=73.54+91.8{I}_{i\overline{N}};{I}_{i\overline{N}}\epsilon \left[\mathrm{0,0.1989}\right]$.The test statistic ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ for the given data is computed as ${\chi}_{1N}^2=\sum_{i=1}^K\frac{{\left({N}_{iN}-{\overline{N}}_{iN}\right)}^2}{{\overline{N}}_{iN}}=\left[\mathrm{146.97,121.78}\right]$.The neutrosophic form of the statistic ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ using the data is given as ${\chi}_{1N}^2=146.97-121.78{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[\mathrm{0,0.2068}\right]$. The proposed test will be implemented as follows

Step-1: State the null H₀: all patients should be treated by the same method and alternative hypothesis H₁: patients should be treated with different methods.
Step-2: Let α = 5% and critical values are 34.76 and 67.5.
Step-3: Reject H₀ as the values of ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ fall in the rejection region.
From the study, it can be concluded that cardiologists should use different methods of treatment for the patients.

Table 1 The pulse counts data

Full size table

Discussion

As mentioned earlier, the neutrosophic form of the proposed test consists of two parts. The first and the second parts presented classical statistics and indeterminate, respectively. The neutrosophic form of ${\chi}_{1N}^2={\chi}_{1L}^2+{\chi}_{1U}^2{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[{I}_{\chi_{1L}^2},{I}_{\chi_{1U}^2}\right]$ reduces to statistic under classical statistics when ${I}_{\chi_{1L}^2}=0$. The efficiency of the proposed test will be compared with the existing test in terms of uncertainty measures. The neutrosophic form of ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ of pulse count data is ${\chi}_{1N}^2=146.97-121.78{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[\mathrm{0,0.2068}\right]$. When ${I}_{\chi_{1L}^2}=0$, the value 146.97 presents the existing test statistic. The part $121.78{I}_{\chi_{1N}^2}$ is an indeterminate part and ${I}_{\chi_{1U}^2}$ = 0.2068 is the uncertainty measure associated with statistic ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$. From the neutrosophic form, it can be seen that the proposed test statistic can be expressed in interval rather than the exact value. Under uncertainty, the value of the test statistic is from 146.97 to 121.78. From this study, it can be seen that the proposed test gives the results in the indeterminate interval that is expecting under uncertainty. On the other, the proposed statistic gives information about indeterminacy. Under an indeterminate environment, the proposed test has the interpretation like: when α = 5%, the probability of committing a type-I error is 0.05, the probability of accepting H₀ is 0.95 and the chance of indeterminacy about the accepting or rejecting H₀ is 0.2068. Let β is the probability of rejecting H₀ when it is true. To study the power of test (1 − β) for the proposed test and the existing test, various values of the level of significance α are considered. The neutrosophic data is generated from 45 to 55 and the values of ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ are computed and compared with the critical values at various levels of α. The probability of rejecting H₀ when it is true (β) is calculated and used to calculate the power of the test (1 − β). The values of (1 − β) at various values of α are shown in Table 2 and plotted in Fig. 1. From Table 1, it can be seen that as the value of α increases from 0.1 to 0.99, the power of the test also increases. Figure 1 clearly indicates that the power curve of the proposed test is higher than the power curve of the existing test. The comparative study shows that the proposed test is efficient, revealing, and stretchy than the existing test. In addition, the proposed test provides higher values of power of the test as compared to the existing test.

Table 2 The values of the power of the test for the proposed test

Full size table

Methods

The existing test for K counts under classical statistics is applied under the assumption that the data of counts must be noted under comparable conditions. The existing test to investigate the existing test between K counts can be applied only when the counts are exact, precise, and clear. In this section, the proposed test for K counts will be introduced when the count data is in indeterminate intervals, unclear and vagueness. The proposed test will be designed when the time of counts is equal and not equal. The method of the proposed test when times of counts are equal is designed first. Let N_iN = N_iL + N_iUI_iN; I_iNϵ[I_iL, I_iU] be neutrosophic counts at ith time, where N_iL presents exact counts, N_iUI_iN presents inexact or indeterminate counts and I_iNϵ[I_iL, I_iU] be a measure of indeterminacy associated with the counts. Suppose that ${\overline{N}}_{iN}={\overline{N}}_{iL}+{\overline{N}}_{iU}{I}_{i\overline{N}};{I}_{i\overline{N}}\epsilon \left[{I}_{i\overline{L}},{I}_{i\overline{U}}\right]$ be a neutrosophic average of K counts, where ${\overline{N}}_{iL}$ and ${\overline{N}}_{iU}{I}_{i\overline{N}}$ are the determined and indeterminate part of neutrosophic average and ${I}_{i\overline{N}}\epsilon \left[{I}_{i\overline{L}},{I}_{i\overline{U}}\right]$ be the measure of indeterminacy associated with the neutrosophic average. The proposed test will be applied for testing the null hypothesis H₀ : N_iN= constant, when i = 1, 2, 3, …, K. The test statistic under neutrosophic statistics say ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ can be written as follows.

$${\chi}_{1N}^2=\sum_{i=1}^K\frac{{\left({N}_{iN}-{\overline{N}}_{iN}\right)}^2}{{\overline{N}}_{iN}};{\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right],{\overline{N}}_{iN}\epsilon \left[{\overline{N}}_{iL},{\overline{N}}_{iU}\right]$$

(1)

The neutrosophic form of the proposed test ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ can be written as.

$${\chi}_{1N}^2={\chi}_{1L}^2+{\chi}_{1U}^2{I}_{\chi_{1N}^2};{I}_{\chi_{1N}^2}\epsilon \left[{I}_{\chi_{1L}^2},{I}_{\chi_{1U}^2}\right]$$

(2)

Note that the proposed statistic ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ is the extension of the test statistic under classical statistics. The proposed test statistic ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ reduces to classical statistic ${\chi}_{1L}^2$ when ${I}_{\chi_{1L}^2}=0$. The second part ${\chi}_U^2{I}_{\chi_{1N}^2}$ presents the indeterminate part and ${I}_{\chi_{1N}^2}\epsilon \left[{I}_{\chi_{1L}^2},{I}_{\chi_{1U}^2}\right]$ is the measure of uncertainty.

Suppose now that the time to record ith neutrosophic count is t_i. The test statistic, say ${\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]$ under neutrosophic statistics for this case is given by.

$${\chi}_{2N}^2=\sum_{i=1}^K\frac{{\left({N}_{iN}-{t}_i{\overline{R}}_N\right)}^2}{t_i{\overline{R}}_N};\kern0.5em {\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]$$

(3)

where ${\overline{R}}_N=\sum {N}_{iL}/\sum {t}_{iL}+\sum {N}_{iU}/\sum {t}_{iU}{I}_{{\overline{R}}_N};{I}_{{\overline{R}}_N}\epsilon \left[{I}_{{\overline{R}}_L},{I}_{{\overline{R}}_U}\right]$ and ${I}_{{\overline{R}}_N}\epsilon \left[{I}_{{\overline{R}}_L},{I}_{{\overline{R}}_U}\right]$ is a measure of indeterminacy. The neutrosophic form of test statistic ${\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]$ is expressed as follows.

$${\chi}_{2N}^2={\chi}_{2L}^2+{\chi}_{2U}^2{I}_{\chi_{2N}^2};{I}_{\chi_{2N}^2}\epsilon \left[{I}_{\chi_{2L}^2},{I}_{\chi_{2U}^2}\right]$$

(4)

Note that the proposed statistic ${\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]$ is the extension of the test statistic under classical statistics. The proposed test statistic ${\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{1U}^2\right]$ reduces to classical statistic ${\chi}_{2L}^2$ when ${I}_{\chi_{2L}^2}=0$. The second part ${\chi}_U^2{I}_{\chi_{2N}^2}$ presents the indeterminate part and ${I}_{\chi_{2N}^2}\epsilon \left[{I}_{\chi_{2L}^2},{I}_{\chi_{2U}^2}\right]$ is the measure of uncertainty. The proposed test will be implemented in the following steps

Step-1: State the null H₀ and alternative hypothesis H₁.
Step-2: State the level of significance α and decide about the critical region using the Chi-square table.
Step-3: Reject H₀ if ${\chi}_{1N}^2\epsilon \left[{\chi}_{1L}^2,{\chi}_{1U}^2\right]$ or ${\chi}_{2N}^2\epsilon \left[{\chi}_{2L}^2,{\chi}_{2U}^2\right]$ falls in the rejection area, otherwise accept H₁.

Conclusions

In this paper the Chi-square tests for K counts under neutrosophic statistics was presented. The test statistic of the proposed test when K counts were recorded at the same time and different times was proposed. The proposed test was the modified version of the existing test for K counts. The testing of the hypothesis procedure was explained with the help of a real example. From the pulse count data, it is concluded that the proposed test is effective to apply in uncertainty. In addition, the proposed test provides higher values of the power of the test. The proposed test guides the cardiologists to apply different treatment methods for patients. The proposed test using big data can be extended us future research.

Availability of data and materials

All data generated or analysed during this study are included in this published article.

References

Kanji GK. 100 statistical tests. London: Sage Publications; 2006. https://0-doi-org.brum.beds.ac.uk/10.4135/9781849208499.
Krishnamoorthy K, Thomson J. A more powerful test for comparing two Poisson means. J Stat Plan Inference. 2004;119:23–35.
Article Google Scholar
Hilbe JM. The statistical analysis of count data/El análisis estadístico de los datos de recuento. Cult Educ. 2017;29:409–60.
Article Google Scholar
Puig P, Weiß CH. Some goodness-of-fit tests for the Poisson distribution with applications in Biodosimetry. Comput Stat Data Anal. 2020;144:106878.
Article Google Scholar
White GC, Bennetts RE. Analysis of frequency count data using the negative binomial distribution. Ecology. 1996;77:2549–57.
Article Google Scholar
Coxe S, West SG, Aiken LS. The analysis of count data: a gentle introduction to Poisson regression and its alternatives. J Pers Assess. 2009;91:121–36.
Article Google Scholar
Salinas-Rodriguez A, Manrique-Espinoza B, Sosa-Rubi SG. Statistical analysis for count data: use of healthcare services applications. Salud Publica de Mexico. 2009;51:397–406.
Article Google Scholar
Pham TV, Jimenez CR. An accurate paired sample test for count data. Bioinformatics. 2012;28:i596–602.
Article CAS Google Scholar
Hawinkel S, Rayner J, Bijnens L, Thas O. Sequence count data are poorly fit by the negative binomial distribution. PLoS One. 2020;15:e0224909.
Article CAS Google Scholar
Böhning, D. & Sangnawakij, P. Count outcome meta-analysis for comparing treatments by fusing mixed data sources: comparing interventions using across report information. AStA Adv Stat Anal. 2020;1–11.
Viertl R. Univariate statistical analysis with fuzzy data. Comput Stat Data Anal. 2006;51:133–47.
Article Google Scholar
Filzmoser P, Viertl R. Testing hypotheses with fuzzy data: the fuzzy p-value. Metrika. 2004;59:21–9.
Article Google Scholar
Tsai C-C, Chen C-C. Tests of quality characteristics of two populations using paired fuzzy sample differences. Int J Adv Manuf Technol. 2006;27:574–9.
Article Google Scholar
Taheri SM, Arefi M. Testing fuzzy hypotheses based on fuzzy test statistic. Soft Comput. 2009;13:617–25.
Article Google Scholar
Jamkhaneh, E. B. & Ghara, A. N. in 2010 International Conference on Intelligent Computing and Cognitive Informatics. 86–89 (IEEE).
Chachi, J., Taheri, S. M. & Viertl, R. Testing statistical hypotheses based on fuzzy confidence intervals. Aust J Stat. 2012;41, 267–286–267–286.
Kalpanapriya D, Pandian P. Statistical hypotheses testing with imprecise data. Appl Math Sci. 2012;6:5285–92.
Google Scholar
Montenegro, M., Casals, M. A. R., Lubiano, M. a. A. & Gil, M. a. A. Two-sample hypothesis tests of means of a fuzzy random variable. Inf Sci. 2001; 133:89–100.
Park S, Lee S-J, Jun S. Patent big data analysis using fuzzy learning. Int J Fuzzy Syst. 2017;19:1158–67.
Article Google Scholar
Smarandache F. Neutrosophy. Neutrosophic probability, set, and logic, ProQuest Information & Learning. Ann Arbor. 1998;105:118–23.
Google Scholar
Smarandache, F. Introduction to neutrosophic measure, neutrosophic integral, and neutrosophic probability. Infinite Study, 2013.
Broumi, S. & Smarandache, F. in Applied Mechanics and Materials. Trans Tech Publ 511–517.
Guo Y, Sengur A. NCM: Neutrosophic c-means clustering algorithm. Pattern Recogn. 2015;48:2710–24.
Article Google Scholar
Broumi, S., Bakali, A., Talea, M. & Smarandache, F. Bipolar neutrosophic minimum spanning tree. Infinite Study, 2018.
Abdel-Baset M, Chang V, Gamal A. Evaluation of the green supply chain management practices: a novel neutrosophic approach. Comput Ind. 2019;108:210–20.
Article Google Scholar
Abdel-Basset M, Mohamed M, Elhoseny M, Chiclana F, Zaied AE-NH. Cosine similarity measures of bipolar neutrosophic set for diagnosis of bipolar disorder diseases. Artif Intell Med. 2019;101:101735.
Article Google Scholar
Smarandache F. Introduction to neutrosophic statistics. Infinite Study, 2014. https://0-doi-org.brum.beds.ac.uk/10.13140/2.1.2780.1289.
Chen J, Ye J, Du S. Scale effect and anisotropy analyzed for neutrosophic numbers of rock joint roughness coefficient based on neutrosophic statistics. Symmetry. 2017;9:208.
Article Google Scholar
Chen J, Ye J, Du S, Yong R. Expressions of rock joint roughness coefficient using neutrosophic interval statistical numbers. Symmetry. 2017;9:123.
Article Google Scholar
Aslam, M. Neutrosophic analysis of variance: application to university students. Complex Intelligent Systems. 2019;1–5.
Aslam M. Neutrosophic analysis of variance: application to university students. Complex Intelligent Syst. 2019;5:403–7.
Article Google Scholar
Aslam M, Albassam M. Application of Neutrosophic logic to evaluate correlation between prostate Cancer mortality and dietary fat assumption. Symmetry. 2019;11:330.
Article Google Scholar
Aslam M. A new method to analyze rock joint roughness coefficient based on neutrosophic statistics. Measurement. 2019;146:65–71.
Article Google Scholar
Gioia F, Lauro CN. Basic statistical methods for interval data. Statistica Applicata. 2005;17:75–104.
Google Scholar

Download references

Acknowledgements

We are thankful to the editor and reviewers for their valuable suggestions to improve the quality of the paper.

Funding

none.

Author information

Authors and Affiliations

Department of Statistics, Faculty of Science, King Abdulaziz University, Jeddah, 21551, Saudi Arabia
Muhammad Aslam

Authors

Muhammad Aslam
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

MA wrote the paper. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Muhammad Aslam.

Ethics declarations

Ethics approval and consent to participate

N/A

Consent for publication

N/A

Competing interests

none.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Aslam, M. Chi-square test under indeterminacy: an application using pulse count data. BMC Med Res Methodol 21, 201 (2021). https://0-doi-org.brum.beds.ac.uk/10.1186/s12874-021-01400-z

Download citation

Received: 02 August 2021
Accepted: 15 September 2021
Published: 30 September 2021
DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12874-021-01400-z

Chi-square test under indeterminacy: an application using pulse count data

Abstract

Background

Results

Conclusions

Background

Results

Discussion

Methods

Conclusions

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Keywords

BMC Medical Research Methodology

Contact us

Chi-square test under indeterminacy: an application using pulse count data

Abstract

Background

Results

Conclusions

Background

Results

Discussion

Methods

Conclusions

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Research Methodology

Contact us