- Research
- Open Access
- Published:

# Identification of ancient glass categories based on distance discriminant analysis

*Heritage Science*
**volume 11**, Article number: 160 (2023)

## Abstract

It is crucial for archaeological investigations to identify the category of cultural relics by analyzing their chemical composition. This study analyzed the chemical composition distribution of glass cultural relics and applied distance discriminant analysis methods to classify them into two categories. Through stepwise regression, four key feature factors (\({SiO}_{2}, {K}_{2}O, PbO\), and the presence of weathering on the artifact's surface) were selected from a total of 15 features, including surface weathering. Aside from using columnar table analysis to determine weathering on the surface of the artifact and correlations between categories, and using Spearman correlation coefficients to select key feature factors such as \({SiO}_{2}, {K}_{2}O, PbO, BaO, and \,SrO\) from 14 total feature factors (excluding weathering on the surface), we established a Mahalanobis distance discriminant model to differentiate unknown glass artifacts. Results indicate that Spearman-Mahalanobis distance discrimination outperformed stepwise regression-Mahalanobis distance discrimination, with an overall accuracy of 99.10% for the former and 98.69% for the latter in identifying high-potassium glass or lead-barium glass.

## Introduction

When glass is buried in soil or immersed in water, it is weathered due to the external environment and internal chemical composition [1, 2]. Ancient glass provides valuable physical evidence of early trade on the Silk Road [3], but it is easily weathered by the influence of the buried environment [4]. A large amount of exchange between its internal substances and environmental substances has led to a significant change in the ratio of its internal chemical composition, which in turn has led to a qualitative change in its chemical composition [5], seriously affecting the correct judgment of modern restorers on the category to which ancient glass belongs. Therefore, the timely detection of the internal chemical compositions of glass products and the accurate determination of the state of the glass belonging to, the conservation and restoration of ancient glass relics has positive significance.

Various techniques have been developed to detect the chemical composition of artifacts, including laser ablation inductively coupled plasma mass spectrometry (LA-ICPMS)[6], proton-excited X-ray fluorescence (PIXE) techniques [7,8,9], particle-induced gamma-ray emission (PIGE) techniques [8, 9], a combination of electron microprobe analysis (EPMA) and thermal ion mass spectrometry (TIMS) [10]. Among these, the PIXE technique is a non-destructive, multi-element quantitative nuclear technique [11] that has proven useful in scientific and technological archaeology [8, 9]. It is commonly applied to identify ancient glass by measuring the content of its internal chemical components. However, the chemical composition of glass is complex, and there are cases in which the sum of the component contents is lower or higher than non-percentage values, making it difficult to determine the category to which ancient glass belongs. For this reason, this paper proposes a method to establish a distance discrimination analysis model among chemical components to identify the category of ancient glass.

Discriminant analysis is a multivariate statistical method to identify the class to which a newly acquired sample belongs based on several quantifiable characteristics of the observed sample [12]. Distance discriminant analysis is a kind of discriminant analysis [13], which identifies a newly obtained sample's category by calculating the distance between samples. Distance discriminant analysis has been applied in slope stability [14], and the classification of balanced common spherical meteorites [15]; however, it has been less frequently used in identifying categories of cultural heritage. In this paper, we apply stepwise regression analysis and Spearman's correlation coefficient to select key feature factors based on the determined chemical composition of ancient glass [16, 17]. To obtain efficient category identification of ancient glass, we need to select the appropriate combination of feature factors and establish a distance discrimination model. This will ensure that the category to which the glass belongs can be accurately mapped.

## Models and algorithms

### Data sources

Chemical composition data of glass cultural relics, whether the surface of cultural relics is weathered or not, and the glass category data used in this paper are all from the 2022 Mathematical Modeling Competition for Chinese College Students (http://www.mcm.edu.cn/). There are 77 glass cultural relics samples (including 69 known category samples and 8 unknown category samples). In order to produce glass, quartz sand—with a high melting point—is used as a raw material. However, to ensure efficient production, various fluxes such as lead ore and grass ash are added to reduce the melting point. Glass samples can be classified into two types based on the cosolvents added during the refining process high-potassium glass and lead-barium glass. These types are quantified with values of 0 and 1, respectively. Each glass sample provided data on 15 characteristic factors, including 14 factors that represented the chemical composition percentages of glass (e.g.\({SiO}_{2}, {Na}_{2}O, {K}_{2}O, and CaO\)) and 1 factor that represented whether the surface of a cultural artifact was weathered or not (with the values of 1 and 0 representing weathered and unweathered surfaces, respectively). During data detection, errors may occur due to improper manual operation, testing equipment, or environmental temperature. In order to achieve precise calculations, we exclude abnormal data and set the effective data set to be between 85 and 105% of the cumulative percentage of chemical components. Finally, two sets of anomalous data are excluded, and the remaining 75 glass sample data are used for the establishment and inspection of the category identification model, and 8 unknown glass sample data are used for the prediction of glass cultural relic samples. The processing results of some data are shown in Table 1. Complete data can be found in Additional file 1.

### Stepwise regression analyses

Stepwise regression [18] is a method of factor screening, which progressively eliminates independent variables in the model that does not have significant predictors, which better avoids multicollinearity [19] among factors and throws off the heavy factor screening work (Additional file 1).

The full-factor model of the regression equation is:

where, the predictor \({X}_{i}(\mathrm{i}=\mathrm{1,2},\dots ,15)\) is the percentage of composition of the glass artifacts, as shown in Table 2, the response variable Y is the category to which the glass artifact belongs, \({\beta }_{0}\) is a constant term, \({\beta }_{i}(\mathrm{i}=\mathrm{1,2},\dots ,15)\) is the partial regression coefficient of the response variable \(Y\) on the predictor \({X}_{i}\), and ε is the regression residual.

According to the effect size value of each predictor on the response variable, the predictors with significant effects were included in the regression equation one by one from the largest to the smallest, and an F-test [20] was conducted to test the significance size, while the predictors with small significant effects in the current regression equation were excluded. The test statistics in the F-test are:

where, \(n\) is the glass artifact sample data; *k* is the number of predictors; \({ESSP}_{k}\) is the partial regression sum of squares of \({X}_{k}\), and \(RSS\) is the residual sum of squares.

### Contingency table analysis

To investigate whether there is a correlation between whether the surface of glass artifacts is weathered and the type of glass artifacts, a \(2\times 2\) contingency table [21] analysis was implemented for glass type \(Y\) and whether the surface of artifacts is weathered \({X}_{15}\), as shown in Table 3.

The computing method of the test statistic is shown in the Eq. (3):

where, \({\widehat{p}}_{ij}=\frac{{n}_{i,}}{n}\times \frac{{n}_{,j}}{n}\), \(n\) is the overall capacity of the sample, \({n}_{i,}\) denotes the number of each glass category in the sample, \({n}_{ij}\) denotes the number of each weathering condition in the sample, and \({n,}_{j}\) denotes the number of each weathering condition in each glass sample, such as \({n}_{11}\) represents the number of unweathered glass in the high-potassium glass.

When the original hypothesis holds, Eq. (3) approximately obeys a chi-square distribution with degrees of freedom of \(\left(2-1\right)\times \left(2-1\right)=1\). For a given significance level α, a judgment can be made to accept or reject the original hypothesis that the two are independent based on the corresponding test statistic and the calculated test statistic.

### Correlation analysis of Spearman coefficient

For correlation analysis between variable data, Pearson's correlation coefficient [22] or Spearman’s correlation coefficient [23] is generally used. After testing, it was found that the data in this paper did not conform to a normal distribution, so Spearman’s correlation coefficient was chosen for the correlation analysis of the glass data.

Assuming that X and Y denote two aggregates with the number of elements N, then the Spearman correlation coefficient ρ between X and Y is calculated by Eq. (4).

where, \({x}_{i}\) is the ratio of the ith chemical composition, \({y}_{i}\) is the type of the artifact glass, \(\overline{x }\) and \(\overline{y }\) is the mean value of different chemical compositions and the mean value of the type, respectively.

The value of Spearman's correlation coefficient ranges from \([-1, 1]\), and the larger the absolute value of \(\rho\), the stronger the correlation. When \(\rho >0\), the two groups of variables under discussion are positively correlated; when \(\rho <0\), the two groups of variables under discussion are negatively correlated.

In general, the significance test for the Spearman correlation coefficient uses a P value test when the sample size is greater than 30. Since the sample size in this study is much larger than 30, we perform the significance test of Spearman's correlation coefficient based on the size of the P value at a 0.05 significance level. We consider a correlation between variables to be significant when the P value is less than 0.05 [24].

### Distance discriminant model predicts unknown sample categories

This paper uses the Mahalanobis distance [25] to calculate the shortest distance between each heritage glass sample point and the overall glass of the two categories. Assuming that high-potassium glass is set as the overall \(A\) and lead-barium glass is set as the overall \(B\), the Mahalanobis distance from the unknown category as the sample point \(x\) of the heritage glass to each overall \(M(A \mathrm{or }B)\) is:

where, \(\mu\) is the sample mean and \(\sum\) is the covariance matrix of multidimensional random variables. The discriminant function used in this paper is:

In summary, the discriminant criterion for the sample \(x\) to be tested belongs to:

## Results and discussion

### Descriptive analyses

To better investigate the quantitative characteristics of the chemical composition content factors of glass artifacts, we investigate the concentration trends and dispersion of the chemical composition content factor data for a sample of 67 known categories of glass artifacts.

The concentration trend is assessed by two measures: the sample mean and the sample median. The median is not affected by extreme values, making it useful in joint analysis with the sample mean to determine the importance of chemical component content factors. Dispersion is measured by four indicators: sample standard deviation, coefficient of variation [26], skewness, and kurtosis. The coefficient of variation effectively eliminates the impact of data error caused by dimensionality. Therefore, a comprehensive analysis of all four indicators can better identify the degree of fluctuation in each chemical component content factor. Table 4 also shows the correlation analysis of the four main components.

Figures 1 and 2 are the index charts of the concentration trend and dispersion trend of high-potassium glass respectively. The graphs show that the concentration degree is relatively high in high-potassium glass, so it can be explained that this chemical composition has an important influence on the composition of high-potassium glass; In the weathered state, the contents of \({SiO}_{2}, PbO, BaO, SrO\) and is close to 0, which shows that the content of these four components will change with the weathered degree of cultural relics.

The dispersion of chemical composition content in non-weathered samples is higher than in weathered samples. This suggests that the chemical substances in weathered high-potassium glass are more stable and uniform. However, for this type of glass, the fluctuations in the contents of \(PbO,BaO, {SiO}_{2}\) differ significantly between weathered and non-weathered samples, reflecting the correlation between the weathering degree of high-potassium glass and its internal chemical makeup, particularly with respect to \(PbO, BaO, {SiO}_{2}\).

Figures 3 and 4 display the concentration and dispersion trends of lead-barium glass, respectively. The data reveals that the concentration of \(PbO\) and \({SiO}_{2}\) in lead-barium glass is high, with mean and median values exceeding 40%. These components play a crucial role in the composition of lead-barium glass. Conversely, the indicators for \(SrO and {K}_{2}O\), which reflect the dispersion degree, are relatively low, indicating that these chemical components are stable and exhibit minimal fluctuations.

Moreover, the overall dispersion of \(PbO, BaO, and {SiO}_{2}\) is large, suggesting that the content of these components varies significantly and has a considerable impact on whether the sample is in a weathered state.

### Stepwise regression analysis-Mahalanobis distance discriminant model

#### Stepwise regression analyses

Stepwise regression was carried out on 15 predictors, and the regression results were as shown in Table 5:

From the models in Tables 5, 6, and 7, it can be seen that the regression equation gradually introduces new predictors from \(PbO\) predictors for stepwise regression testing.

The adjusted \({R}^{2}\) of the model changed from 0.566 to 0.867, so the final stepwise regression obtained the linear regression equation for the glass category \(Y\) as:

The \(PbO, \,{K}_{2}O,\, { SiO}_{2}\) and the weathering condition of the glass surface in the equation will be used as the key characteristic factors of the distance discrimination model for the next step of prediction.

#### Distance discriminant analysis

We randomly select 50 data from the sample data as training sets for training, and test the overall sample, repeated 50 times. The model's overall average rate of correctness is 98.69%, with an estimated misjudgment rate of 2.46%. Based on this, the simulation accurately identifies seven out of eight unknown glass cultural relics, resulting in an 87.5% success rate. Table 8 also shows the predicted and actual classes of these eight unknown categories of glass using a Stepwise regression analysis-Mahalanobis distance discriminant model.

#### Sensitivity analysis

The analysis in this paper involves numerical data that was subjected to a certain degree of disturbance. For category data, any non-weathered artifacts were directly classified as “weathered.” This allowed us to study the sensitivity of glass cultural relics to changes in weathering on their surface.

Table 9 illustrates that with each 5% increase in a numerical characteristic factor, the model's prediction results vary, and the degree of change becomes more significant with an increase in error. Among the numerical data, \({SiO}_{2}\) content is crucial for the model, with its impact exceeding that of the other two numerical factors. The result is affected only when the value of each numerical feature is reduced by 20%.

Furthermore, exchanging categories for surface weathering in category-type data cultural relics did not impact the model's results. To summarize, our analysis suggests that \({SiO}_{2}\) content is highly sensitive, whereas \({K}_{2}O \,and \,PbO\) content exhibit weaker sensitivity compared to \({SiO}_{2}\) content. The factor of surface weathering of cultural relics was found to have weak sensitivity.

### Spearman-Mahalanobis distance discriminant model

#### Correlation analysis

To simplify the model calculation, this study will reduce the dimensionality of the sample data. Since the eigenvector data comprises two distinct types of data, the Spearman correlation coefficient is used to select eigenvectors for numerical data. For categorical data, the contingency table analysis method is used to determine whether the surface of glass cultural relics is weathered and whether the type of glass cultural relics is related.

Table 10 indicates the correlation coefficient, the P value, and correlation coefficient strength of each compound composition of glass cultural relics. Based on this analysis, we selected five characteristic factors for the numerical data: \({SiO}_{2}, \,{K}_{2}O, \, PaO, \,BaO \, and \, SrO\).

The correlation coefficient suggests that the content of \({SiO}_{2} \,and \, {K}_{2}O\) is negatively correlated with the glass type, whereas the content of \(PbO, \, \, BaO, and \, SrO\) is positively correlated with the glass type. In other words, higher \({SiO}_{2} \,and \,{K}_{2}O\) content increases the likelihood of the glass type being high-potassium, whereas higher \(PbO, BaO, \,and \,SrO\) content increases the probability of the glass type being lead-barium.

Table 11 displays a chi-square value of 3.526 and a companion probability of 0.097, which is higher than the significance level of 0.05. Hence, the original assumption that the surface weathering of cultural relics and the type of glass are independent of each other is acceptable. Therefore, no correlation exists between them.

#### Distance discriminant analysis

The Mahalanobis distance discriminant method was utilized for discriminant analysis based on the selected characteristic factors. For this study, 50 data were randomly selected from the sample data as the training set, and the entire sample was tested 50 times. The test was repeated 50 times, resulting in an overall average correct rate of 99.10% and an estimated misjudgment rate of 1.31%.

The simulation results of the 8 cultural relics are all judged correctly, with a correct rate of 100%. Table 12 also shows the predicted and actual classes of these eight unknown categories of glass using a Spearman-Mahalanobis distance discriminant model.

#### Sensitivity analysis

Table 13 illustrates that the number of \({SiO}_{2}\) changes slower than other factors as errors increase (ranging from 0.01 to 0.10). When the error is 0.02, the change in the number of \({SiO}_{2}\) is 0. Even when the error is 0.6, the number of cultural relics changes to 2 due to other factors, but the number of \({SiO}_{2}\) changes is still only 1. Therefore, \({SiO}_{2}\) has the lowest sensitivity. The correlation coefficients among the five elements are \(-0.6703, -0.5832, 0.7695, 0.7145, and\, 0.5881\), respectively. The sensitivity of \({SiO}_{2} \,and \,{K}_{2}O\), with negative correlation coefficients, is low, and the smaller the absolute value, the less sensitive they are. Similarly, we found that the sensitivity increases with the absolute value of the correlation coefficient. It can be seen that the sensitivity of \(PbO, \,BaO, \,and \,SrO\), which are positively correlated, is ranked from weak to strong as \(SrO, \,BaO, \,and \,PbO\). In summary, the sensitivity from weak to strong is \(SrO, \,BaO, \,PbO, \,{SiO}_{2}, \,and \,{K}_{2}O\).

### Model comparison

The stepwise regression analysis-Mahalanobis distance discriminant model includes \(PbO, {SiO}_{2}, {K}_{2}O\), and glass surface weathering as characteristic factors, with an average correct rate of 98.69%. The Spearman-Mahalanobis distance discriminant model uses \(PbO, {SiO}_{2}, {K}_{2}O, BaO, \,and \,SrO\) as characteristic factors, with an average correct rate of 99.10%. The latter model performs slightly better than the former, but both are affected by measurement errors in the compound content, which can lead to prediction inaccuracies when the error content exceeds 1.1%. However, the former model is slightly more sensitive than the latter. Despite this, both models demonstrate high accuracy in class prediction.

## Conclusions

Both the stepwise regression analysis-Mahalanobis distance discriminant model and the Spearman-Mahalanobis distance discriminant model perform well in identifying glass cultural relics. However, the latter achieves higher accuracy.

The model proposed in this paper offers a more scientifically rigorous analysis and identification of the composition of ancient glass products. It allows for timely identification and prediction of the internal components of glass products in various environments, which can effectively prevent the corrosion and weathering of such cultural relics. With further improvements, this model could provide more effective methods and technologies for the protection and restoration of ancient cultural relics.

## Availability of data and materials

Experimental data from the website [Chinese college students Mathematical Contest in Modeling(mcm.edu.cn)].

## References

Saijo Y, Suzuki Y, Akiyama R, Shimizu M, Shimotsuma Y, Miura K. Speciation analysis of tin at the tin side of float glass by solvent extraction combined with a stepwise etching technique. J Non-Cryst Solids. 2022;592:121752. https://doi.org/10.1016/j.jnoncrysol.2022.121752.

Koob SP, van Giffen NAR, Kunicki-Goldfinger JJ, Brill RH. Caring for glass collections: the importance of maintaining environmental controls. Stud Conserv. 2018;63(S1):146–50. https://doi.org/10.1080/00393630.2018.1492252.

Schibille N, Lankton JW, Gratuz B. Compositions of early Islamic glass along the Iranian silk road. Geochemistry. 2022;82(4):125903. https://doi.org/10.1016/j.chemer.2022.125903.

Xu H. A study on the composition analysis and identification of ancient glass products based on SVM model. Acad J Comput Inf Sci. 2022;5(13):89–95. https://doi.org/10.25236/JCIS.2022.051314.

Tang L, Tang M, Zhang L. Component analysis and identification of glass products based on hierarchical clustering and naive Bayes. Highlights Sci Eng Technol. 2023;41:279–86. https://doi.org/10.54097/hset.v41i.6833.

Swan CM, Rehren T, Lankton J, Gratuze B, Brill RH. Compositional observations for Islamic glass from Sīrāf, Iran, in the corning museum of glass collection. J Archaeol Sci Rep. 2017;16:102–16. https://doi.org/10.1016/j.jasrep.2017.08.020.

Kaspi O, Girshevitz O, Senderowitz H. Pixe based, machine-learning (pixel) supported workflow for glass fragments classification. Talanta. 2021;234:122608. https://doi.org/10.1016/j.talanta.2021.122608.

Bugoi R, Panaite A, Alexandrescu C. Chemical analyses on roman and late antique glass finds from the lower Danube: the case of Tropaeum Traiani. Archaeol Anthropol Sci. 2021;13(9):148. https://doi.org/10.1007/s12520-021-01310-7.

Balvanović R, Šmit Ž. Sixth-century ad glassware from Jelica, Serbia—an increasingly complex picture of late antiquity glass composition. Archaeol Anthropol Sci. 2020;12(4):94. https://doi.org/10.1007/s12520-020-01031-3.

Henderson J, Ma H, Evans J. Glass production for the silk road? Provenance and trade of Islamic glasses using isotopic and chemical analyses in a geological context. J Archaeol Sci. 2020;119:105164. https://doi.org/10.1016/j.jas.2020.105164.

Sottili L, Giuntini L, Mazzinghi A, Massi M, Carraresi L, Castelli L, Czelusniak C, Giambi F, Mandò PA, Manetti M, Ruberto C, Guidorzi L, Re A, Lo Giudice A, Torres R, Arneodo F, Mangani SM, Calusi S, Taccetti F. The role of PIXE and XRF in heritage science: the INFN-CHNet LABEC experience. Appl Sci. 2022. https://doi.org/10.3390/app1213658.

He H, An L, Liu W, Zhang J. Prediction model of collapse risk based on information entropy and distance discriminant analysis method. Math Probl Eng. 2017;2017:1–08. https://doi.org/10.1155/2017/8793632.

Santos AEM, Lana MS, Cabral IE, Pereira TM, Zare Naghadehi M, de Da Silva DFM, Dos Santos TB. Evaluation of rock slope stability conditions through discriminant analysis. Geotechn Geol Eng. 2019;37(2):775–802. https://doi.org/10.1007/s10706-018-0649-x.

Guo J, Wang J, Liu S, Lefik M. Application of an improved cloud model and distance discrimination to evaluate slope stability. Math Probl Eng. 2019;2019:8315894. https://doi.org/10.1155/2019/8315894.

Woźniak M, Galazka-Friedman J, Duda P, Jakubowska M, Rzepecka P, Karwowski L. Application of mössbauer spectroscopy, multidimensional discriminant analysis, and mahalanobis distance for classification of equilibrated ordinary chondrites. Meteorit Planet Sci. 2019;54(8):1828–39. https://doi.org/10.1111/maps.13314.

Abas MA, Wee ST. Exploring policy governance factors using stepwise multiple regression analysis: a case study of solid waste management policy in Malaysia. Int J Public Sector Perform Manag. 2020;6(6):876–92. https://doi.org/10.1504/IJPSPM.2020.110990.

Li H, Cao Y, Su L. Pythagorean fuzzy multi-criteria decision-making approach based on spearman rank correlation coefficient. Soft Comput. 2022;26(6):3001–12. https://doi.org/10.1007/s00500-021-06615-2.

Xiao J, Kong J, Leng S. Component analysis and identification of glass artifacts based on logistic regression and k-means clustering. Highlights Sci Eng Technol. 2022;22:333–9. https://doi.org/10.54097/hset.v22i.3399.

Abubakar A, Abbas UF, Lasisi KE. Remedying multicollinearity in quantitative analysis: a simulation studies. ATBU J Sci Technol Educ. 2023;10(04):108–16.

Sureiman O, Mangera C. F-test of overall significance in regression analysis simplified. J Pract Cardiovasc Sci. 2020;6(2):116. https://doi.org/10.4103/jpcs.jpcs_18_20.

Sulewski P. Some contributions to practice of 2 × 2 contingency tables. J Appl Stat. 2019;46(8):1438–55. https://doi.org/10.1080/02664763.2018.1552665.

Cai Z, Zheng Z, Jiang X. Composition analysis and identification of glass products based on Pearson correlation analysis. Highlights Sci Eng Technol. 2022;22:174–86. https://doi.org/10.54097/hset.v22i.3308.

Zheng X, Feng Y, Chen H. Analysis of each components of glass samples based on the spearman correlation coefficient model. Highlights Sci Eng Technol. 2022;22:241–5. https://doi.org/10.54097/hset.v22i.3368.

Qu Q, Wu W, Guo Y. Study on the classification of glass relics based on spearman correlation coefficient. Acad J Sci Technol. 2023;5(1):147–54. https://doi.org/10.54097/ajst.v5i1.5539.

Brereton RG, Lloyd GR. Re-evaluating the role of the mahalanobis distance measure. J Chemometrics. 2016;30(4):134–43. https://doi.org/10.1002/cem.2779.

Abdi H. Coefficient of variation. Encycl Res Des. 2010;1(5):170–2. https://doi.org/10.1007/0-387-26336-5_379.

## Acknowledgements

Not applicable.

## Funding

This work is supported by the National Undergraduate Training Program for Innovation and Entrepreneurship.

## Author information

### Authors and Affiliations

### Contributions

SW: conceptualization, methodology design, software implementation, results’ visualization and interpretation, writing of the main manuscript, project administration; JZ: revision of the final manuscript, writing of the main manuscript, data curation, supervision, investigation, data analysis; XK: methodology design, revision of the final manuscript, conceptualization; HY: supervision. All authors have read and agreed to the published version of the manuscript.

### Corresponding author

## Ethics declarations

### Ethics approval and consent to participate

Not applicable.

### Consent for publication

Not applicable.

### Competing interests

The authors declare that they have no conflicts of interest in this work. We declare that we do not have any commercial or associative interest that represents a competing interest in connection with the work submitted.

## Additional information

### Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Supplementary Information

**Additional file 1.**

Chemical composition ratio data set of different glasses.

## Rights and permissions

**Open Access** This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

## About this article

### Cite this article

Wu, S., Zhong, J., Ye, H. *et al.* Identification of ancient glass categories based on distance discriminant analysis.
*Herit Sci* **11**, 160 (2023). https://doi.org/10.1186/s40494-023-00999-0

Received:

Accepted:

Published:

DOI: https://doi.org/10.1186/s40494-023-00999-0

### Keywords

- Mahalanobis distance discriminant
- Stepwise regression analysis
- Glass category identification
- Spearman correlation coefficient