Classification of Hungarian medieval silver coins using x-ray fluorescent spectroscopy and multivariate data analysis
© Rácz et al.; licensee Chemistry Central Ltd. 2013
Received: 12 October 2012
Accepted: 23 January 2013
Published: 3 April 2013
A set of silver coins from the collection of Déri Museum Debrecen (Hungary) was examined by X-ray fluorescent elemental analysis with the aim to assign the coins to different groups with the best possible precision based on the acquired chemical information and to build models, which arrange the coins according to their historical periods.
Principal component analysis, linear discriminant analysis, partial least squares discriminant analysis, classification and regression trees and multivariate curve resolution with alternating least squares were applied to reveal dominant pattern in the data and classify the coins into several groups. We also identified those chemical components, which are present in small percentages, but are useful for the classification of the coins. With the coins divided into two groups according to adequate historical periods, we have obtained a correct classification (76-78%) based on the chemical compositions.
X-ray fluorescent elemental analysis together with multivariate data analysis methods is suitable to group medieval coins according to historical periods.
KeywordsX-ray fluorescence spectroscopy Multivariate techniques Coin Silver Middle ages
Elemental analysis is used for the examination of coins and other metal objects in the field of archeology. By determining the elemental composition of an object, one can find out about the used ore and its origin, as well as the age of the artifact. One can also come to conclusions about economic history, based on the changes of concentrations of various elements over time in the coins. In the medieval ages, minting workshops were usually built close to mines, so the identification of the precious metal mines could also mean the determination of the place of coinage.
The aim of research was to assign the coins to different groups with the best possible precision based on the acquired chemical information and to build models, which arrange the coins according to their historical periods.
Discovering relationship between elemental composition of coins and their origin has begun in the past 10-15 years with energy dispersive X-ray fluorescence spectroscopy devices (XRF). Publications, which connect elemental compositions to historical periods can be found scarcely in the literature, moreover only principal component analysis is applied for the evaluation of the data.
In another paper two types of medieval coins were examined with XRF, proton-induced X-ray emission analysis (PIXE) and scanning electron microscopy (SEM/EDX) . Data were evaluated with principal component analysis and the aim was to classify the coins by their places of origin . It was concluded, that PIXE was less appropriate for the measurement of corroded coins due to its lower depth of penetration. The classification of coins into two groups was successful and so was the recognition of unknown samples.
Similar measurements were carried out on coins from the eras of the Spanish War of Independence and of Ancient Greece [4, 5]. They were partially successful in classifying the coins according to their places and time of coinage [4, 5].
Besides the determination of places of origin, X-ray spectra and elemental compositions can also be applied to rule out counterfeit coins. Minemasa Hida et al compared counterfeit and valid 500¥ coins by elemental composition . The coins were successfully differentiated by PCA, as well as by cluster analysis. Moreover, two separate clusters were identified within the group of counterfeit coins .
Another interesting application of manual XRF devices is to determine limits of detection through various kinds of packaging: potentially they could be used to rule out post bombs and other explosives . As we can see, measurements carried out with XRF are useful in supporting not only the work of archaeologists, but also the work of the authorities.
X-ray fluorescence spectroscopy
During X-ray fluorescent analysis [8, 9] the surface of a sample is irradiated by X-ray beam. By applying the appropriate energy, a photoelectron is emitted. The vacancy then is filled by an outer electron, while the energy difference is emitted in the form of X-ray fluorescent radiation. The excitation energies correspond to the emission lines of the elements, while the intensity of the emission provides information about their concentration on the sample surface.
The X-ray fluorescent technique provides quick non-destructive analysis. It gives information about the composition of metallic and non-metallic surfaces without the need for any pretreatment. The technique is independent from the chemical state of the elements, but it doesn’t give information about the chemical bonds (oxidation state) of the examined elements. During a measurement with an appropriate excitation source, all of the elements in the sample can be examined simultaneously. This method enables the study of both solid and liquid substances.
We applied an INNOV-X Alpha handheld analyzer for our studies, which can easily measure concentration of elements heavier than sodium with 0.01% precision from very different matrices. According to recent research, handheld devices can produce equally accurate results as benchtop XRF analyzers in the study of coins .
Excitation source: X-ray tube, W anode, 10-40 kV, 10-50 μA.
Detector: Si PiN diode detector, <230 eV FWHM at 5.95 keV Mn K-alpha line.
Standard elements: Pb, Cr, Hg, Cd, Sb, Ti, Mn, Fe, Ni, Cu, Zn, Sn, Ag, As, Se, Ba, Co, Zr, Rb.
Principal component analysis (PCA)
Principal component analysis [11–14] is known by several names in different areas of science, so it can also be found in articles as “eigenvector analysis” or “characteristic vector analysis”. PCA is unsupervised, so we don’t classify the samples before the analysis. The basic idea is that we place our measured data in a data matrix (marked X), in which the rows correspond to the samples (in this case, coins), whilst the columns represent the studied properties (here: metal concentrations). This matrix can be decomposed into the product of two matrices. There are an infinite number of resolutions, but with constraints like orthogonality and normalization the solution becomes definite (aside from central mirroring). During standardization we first shift our original scale by a constant number and then shrink or expand it, so that the arithmetic mean of the property vectors becomes 0 and their deviation 1. The resulting matrices are the score (T) and factor loading (P) matrices.
PCA can be applied to rule out outliers, to reduce our dataset (which can ease our work greatly in cases of big, complex datasets) and to build models that describe the behavior of a physical or chemical system and reveal any pattern in the data. The models can be used for predictions when we introduce new data (new samples measured in the same way).
Linear discriminant analysis (LDA)
Similarly to PCA, linear discriminant analysis  (LDA) is a dimension-reducing method, in which we create background variables (called canonical variables, roots) by a linear combination of the variables of the original data matrix. LDA is a widely-used supervised pattern recognition technique. The main difference between PCA and LDA is that LDA is supervised, thus we need to know the class memberships of samples before the analysis. We can create N-1 canonical variables for N classes.
During LDA, we plot an ellipse (ellipsoid or a hyperellipsoid in the case of more than three variables) around each group of scattered points. The ellipse can be interpreted as a section plane of a Gaussian surface, which includes a given percentage of the points of the corresponding group. The center of the ellipse represents the maximum of the Gaussian surface. The discriminant function is given by the line connecting the intersections of the ellipses.
Classification and regression tree (CART)
CART [15, 16] is a recursive classification method, which creates binary divisions from our dataset. The principle of this method is to ask yes-or-no questions during the classification of the samples (i.e. the creation of a tree). The algorithm aims at identifying the possible variables and their values for the best resolution. The starting group is considered the root of the tree, which is always the group with the most samples. At the start, the other groups of samples are included in this group as well. Then the algorithm splits the samples to achieve the most advantageous separation of groups.
Its expressivity made it very popular in various field, such as data classification in medical diagnostics. Its theoretical basis was devised by Breiman, Friedman, Olshen and Stone in the 1980’s .
Partial least-square regression discriminant analysis (PLS-DA)
PLS-DA is used for identifying outliers, ruling out variables with low variance (thus easing further studies) and mainly examining groupings of samples. [18–20]. PLS-DA is closely related to multivariate linear regression, to PCA, and to principal component regression. A possible implementation of the method is to apply matrix decomposition to the original X and Y data matrices, which are thus expressed as a product of three matrices. In case of (PLS-DA) the data matrix Y contains the independent group variables.
Multivariate curve resolution with alternating least squares (MCR-ALS)
where X is the response matrix, E is the elemental distribution profile matrix of the components, and C is the composition profile matrix for the samples.
where the matrix X* is the reproduced data matrix obtained by principal component analysis for the selected number of components, and + means the pseudoinverse of the original X matrix . Unfortunately, this decomposition is very often not unique because of the rotational and intensity (scaling) ambiguities [28, 29]. The rotational ambiguities can be moderated or even eliminated if convenient constraints can be used [24–26]. Tauler and coworkers developed a Matlab code for MCR-ALS with some constraints .
We have examined 289 silver coins provided by the Déry Museum of Debrecen. 32 coins were omitted from this dataset, because if only a small amount (3 or 4 pieces) was dated back to the time of a particular king, that set cannot be considered representative to that period. Four coins were identified with PCA as outliers in the early phase of research, so they were also omitted. Each measurement (spectrum acquisition and calculation of elemental composition) was carried out three times, with 30 seconds of irradiation. This time-span was found optimally short and precise by a prior investigation of several alloys. We have used the mean of the three measurements in cases, where elemental composition data were needed, and the three results separately, where X-ray spectra were needed. The amount of the following elements has been determined: Ti, Fe, Ni, Cu, Zn, Ag, Sn, Sb, Pb, Bi. The properties of coins were summarized in two tables: one containing the mean values of elemental composition (257 × 10), and the other containing intensity values for all studied wavelengths (257 × 2048). Two data matrices were created accordingly and evaluated by PCA, LDA, CART and PLS modules of the software STATISTICA 6.0; besides, MCR-ALS calculations were completed by PLS Toolbox V6.7.
Results and discussion
First we pre-examined our variables. A correlation matrix (Additional file 1: Table S1) was created with the variables (metal concentrations) in its rows and its columns. Besides the correlation matrix, descriptive statistics were applied for the pre-examination. Both methods found all of the variables acceptable for further studies. The minima, maxima, standard deviations, medians and means of the variables were calculated in the pre-examination step (Additional file 2: Table S2).
Then, we carried out a PCA analysis on our standardized dataset and acquired the tables containing the scores (principal components) and the factor loadings. The number of scores was conveniently set to the number of variables, which in this case was not higher than ten. These ten principal components explained the total variance of the data, and 9 of them were linearly independent.
The scores were plotted in two ways. In the first case, the indicator variable had seven possible values, which represented three groups within the Árpád dynasty, three later dynasties and unknowns.
indicator variable: I2 indicator variable:
Unknown (18 coins) 0: Unknown
István I – Kálmán (997-1116) (30 coins) 2: István I – Imre:
István II – Imre (1116-1204) (60 coins) (997-1204) (90 coins)
András II – András III (1205-1301) (97 coins) 3: András II - Luxembourg
Czech (1301-1305) (5 coins) (1205-1437) (149 coins)
Anjou (1308-1387) (35 coins)
Luxembourg (1387-1437) (12 coins)
The same data matrix was used, but no spectral data were evaluated in this phase. I1 and I2 indicator variables were applied as grouping variables. Since LDA is a supervised method, grouping variables are necessary for the early classification of the samples to maximize the separation of the groups.
A calculation with the I1 indicator variable returned five concentrations as useful variables for classification, those are: iron, silver, bismuth, lead and tin. Based on maximizing the variance between the groups and minimizing it within the six groups in the classification matrix we could classify the 257 coins into three groups. By doing so, we have shown that the construction of the I2 indicator variable is valid.
In the case of the I2 grouping variable (two groups with the reign of András II as a borderline) iron, copper, bismuth, tin and nickel concentrations were identified by LDA as the best classifying variables.
Using the classification matrix for I2, we have created an I2b grouping variable, where the class membership for the misclassified 61 samples has changed, but the improved classification was not used further on.
The analysis shows, that several samples from group 2 were included by CART in group 3 or into the unknowns. In the case of the modified I2 grouping variable, classification was more successful in terms of correct classification percentages.
Although there is an overlap between the coins from before and after András II, (and there is a number of outliers among the coins from before András II), the two groups are separated to a satisfactory extent. The correct classification rate was 76.6%. The inclusion of unknowns is ambiguous in the cases of five coins, but the rest of the outliers can be included in one of the groups with sufficient probabilities.
With the introduction of several indicator variables, we can observe two well-defined groups in the PCA score plot. The clustering is justified by numismatic and historic theories, and supported by the results of other types of chemometric analysis. The precision of PCA evaluation is not increased by the use of X-ray spectra instead of elemental compositions, because with the increasing amount of data, the amount of noise increases proportionally.
We have successfully classified the coins to their corresponding periods with a correct classification percentage of 76-78% based on X-ray fluorescence data with the use of four statistical analysis methods. This result is considered satisfactory, because the introduction of errors is not limited to the acquisition of spectra and the evaluation, but can also originate from the incorrect archaeological identification of the coins. If the group of unknowns was omitted, CART was the most successful method in classifying the samples to the correct groups. MCR-ALS based LDA could classify the unknown coins into the group 2 (István I-Imre).
Elemental compositions of the silver coins from the Árpád Dynasty and the following dynasties were diverse, so overlaps between the groups are possible. Reasons for these overlaps range from counterfeiting to the bad condition of certain coins including the errors of archaeological identification.
However, determining the chemical composition of the coins and evaluating the data with chemometric methods can provide scientifically valid results to aid the archaeologists’ and numismatists’ work in classifying the coins according to their times of origin.
The authors thank Professor Richard Brereton for his useful advices and steady encouragement.
- Kallithrakas-Kontos N, Katsanos AA, Touratsoglou J: Trace element analysis of Alexander the Great’s silver tetradrachms minted in macedoni. Nuclear Instruments and Methods in Physics Research Section B. 2000, 171: 342-349. 10.1016/S0168-583X(00)00268-8.View Article
- Bugoi R, Constantinescu B, Constantin F, Catana D, Plostinaru D, Sasianu A: Archaeometrical studies of Greek and Roman silver coins. J Radioanal Nucl Chem. 1999, 242 (3): 777-781. 10.1007/BF02347394.View Article
- Linke R, Schreiner M, Demortier G: The application of photon, electron and proton induced X-ray analysis for the identification and characterisation of medieval silver coins. Nuclear Instruments and Methods in Physics Research Section B. 2004, 226: 172-178. 10.1016/j.nimb.2004.03.084.View Article
- Pitarch A, Queralt I, Alvarez-Perez A: Analysis of Catalonian silver coins from the Spanish war of independence period (1808–1814) by energy dispersive X-ray fluorescence. Nuclear Instruments and Methods in Physics Research Section B. 2011, 269: 308-312. 10.1016/j.nimb.2010.11.045.View Article
- Pitarch A, Queralt I: Energy dispersive X-ray fluorescence analysis of ancient coins: the case of Greek silver drachmae from the emporion site in Spain. Nuclear Instruments and Methods in Physics Research Section B. 2010, 268: 1682-1685. 10.1016/j.nimb.2010.02.004.View Article
- Hida M, Sato H, Sugawara H, Mitsui T: Classification of counterfeit coins using multivariate analysis with X-ray diffraction and X-ray fuorescence methods. Forensic Sci Int. 2001, 115: 129-134. 10.1016/S0379-0738(00)00309-1.View Article
- Ida H, Kawai J: Analysis of wrapped or cased object by a hand-held X-ray fluorescence spectrometer. Forensic Sci Int. 2005, 151: 267-272. 10.1016/j.forsciint.2005.02.017.View Article
- Beckhoff B, Kanngießer B, Langhoff N, Wedell R, Wolff H: Handbook of practical X-ray fluorescence analysis. 2006, Berlin: SpringerView Article
- Jenkins R: X-ray fluorescence spectrometry. 1988, New York: John Wiley & Sons
- Pessanha S, Guilherme A, Carvalho ML: Comparison of matrix effects on portable and stationary XRF spectrometers for cultural heritage samples. Applied Physics A. 2009, 97: 497-505. 10.1007/s00339-009-5251-x.View Article
- Wold S, Esbensen K, Geladi P: Principal component analysis. Chemom Intell Lab Syst. 1987, 2: 37-52. 10.1016/0169-7439(87)80084-9.View Article
- Christie OHJ: Some fundamental criteria for multivariate correlation methodologies. Chemom Intell Lab Syst. 1987, 2: 53-59. 10.1016/0169-7439(87)80085-0.View Article
- Joliffe J: Principal component analysis. 1986, Berlin: SpringerView Article
- Malinowsky F, Howery D: Factor analysis in chemistry. 1980, New York: John Wiley and Sons
- Hastie T, Tibshirani R, Friedman JH: The elements of statistical leraning. 2001, Germany: SpringerView Article
- Timofeev R: Classification and regression trees (CART), theory and applications. Master thesis. 2004, Humboldt University, Berlin: CASE – Center of Applied Statistics and Economics
- Breiman L, Freidman JH, Olshen RA, Stone CJ: Classification and regression trees. 1984, Belmont: Wadsworth International
- Geladi P, Kowalski BR: Partial least-squares regression: a tutorial. Anal Chim Acta. 1986, 185: 1-17.View Article
- Lindberg W, Persson J-A, Wold S: Partial least-squares method for spectrofluorimetric analysis of mixtures of humic acid and ligninsulfonate. Anal Chem. 1983, 55: 643-648. 10.1021/ac00255a014.View Article
- Strang G: Linear algebra and its application. 1980, New York: Academic Press
- Tauler R, Casassas E, Izquierdo-Ridorsa A: Self-modeling curve resolution in studies of spectrometric titrations of multi-equilibria systems by factor-analysis. Anal Chim Acta. 1991, 248: 447-458. 10.1016/S0003-2670(00)84662-0.View Article
- Tauler R: Multivariate curve resolution applied to second order data. Chemom Intell Lab Syst. 1995, 30: 133-146. 10.1016/0169-7439(95)00047-X.View Article
- De Juan A, Tauler R: Chemometrics applied to unravel multicomponent processes and mixtures - revisiting latest trends in multivariate resolution. Anal Chim Acta. 2003, 500: 195-210. 10.1016/S0003-2670(03)00724-4.View Article
- De Juan A, Vander Heyden Y, Tauler R, Massart DL: Assessment of new constraints applied to the alternating least squares method. Anal Chim Acta. 1997, 346: 307-318. 10.1016/S0003-2670(97)90069-6.View Article
- Tauler R: Calculation of maximum and minimum band boundaries of feasible solutions for species profiles obtained by multivariate curve resolution. J Chemom. 2001, 15: 627-646. 10.1002/cem.654.View Article
- Van Benthem MH, Keenan MR, Haaland DM: Application of equality constraints on variables during alternating least squares procedures. J Chemom. 2002, 16: 613-622. 10.1002/cem.761.View Article
- Golub GH, Van Loan CF: Matrix computations (2nd Ed.). 1989, Baltimore: The John Hopkins University Press
- Tauler R, Smilde A, Kowalski BR: Selectivity, local rank, 3-way data-analysis and ambiguity in multivariate curve resolution. J Chemom. 1995, 9: 31-58. 10.1002/cem.1180090105.View Article
- Jiang J-H, Liang Y, Ozaki Y: Principles and methodologies in self-modeling curve resolution. Chemom Intell Lab Syst. 2004, 71: 1-12. 10.1016/j.chemolab.2003.07.002.View Article
- Multivariate curve resolution homepage. http://www.mcrals.info/,
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.