Classification of Hungarian medieval silver coins using xray fluorescent spectroscopy and multivariate data analysis
 Anita Rácz^{1},
 Károly Héberger^{2}Email author,
 Róbert Rajkó^{3} and
 János Elek^{4}
DOI: 10.1186/2050744512
© Rácz et al.; licensee Chemistry Central Ltd. 2013
Received: 12 October 2012
Accepted: 23 January 2013
Published: 3 April 2013
Abstract
Background
A set of silver coins from the collection of Déri Museum Debrecen (Hungary) was examined by Xray fluorescent elemental analysis with the aim to assign the coins to different groups with the best possible precision based on the acquired chemical information and to build models, which arrange the coins according to their historical periods.
Results
Principal component analysis, linear discriminant analysis, partial least squares discriminant analysis, classification and regression trees and multivariate curve resolution with alternating least squares were applied to reveal dominant pattern in the data and classify the coins into several groups. We also identified those chemical components, which are present in small percentages, but are useful for the classification of the coins. With the coins divided into two groups according to adequate historical periods, we have obtained a correct classification (7678%) based on the chemical compositions.
Conclusions
Xray fluorescent elemental analysis together with multivariate data analysis methods is suitable to group medieval coins according to historical periods.
Keywords
Xray fluorescence spectroscopy Multivariate techniques Coin Silver Middle agesIntroduction
Elemental analysis is used for the examination of coins and other metal objects in the field of archeology. By determining the elemental composition of an object, one can find out about the used ore and its origin, as well as the age of the artifact. One can also come to conclusions about economic history, based on the changes of concentrations of various elements over time in the coins. In the medieval ages, minting workshops were usually built close to mines, so the identification of the precious metal mines could also mean the determination of the place of coinage.
The aim of research was to assign the coins to different groups with the best possible precision based on the acquired chemical information and to build models, which arrange the coins according to their historical periods.
Discovering relationship between elemental composition of coins and their origin has begun in the past 1015 years with energy dispersive Xray fluorescence spectroscopy devices (XRF). Publications, which connect elemental compositions to historical periods can be found scarcely in the literature, moreover only principal component analysis is applied for the evaluation of the data.
Greek and Romanian researchers used Xray fluorescence spectra to categorize antique coins made between the 4th and 1st centuries B.C., by their places of origin and recovery [1, 2].
In another paper two types of medieval coins were examined with XRF, protoninduced Xray emission analysis (PIXE) and scanning electron microscopy (SEM/EDX) [3]. Data were evaluated with principal component analysis and the aim was to classify the coins by their places of origin [3]. It was concluded, that PIXE was less appropriate for the measurement of corroded coins due to its lower depth of penetration. The classification of coins into two groups was successful and so was the recognition of unknown samples.
Similar measurements were carried out on coins from the eras of the Spanish War of Independence and of Ancient Greece [4, 5]. They were partially successful in classifying the coins according to their places and time of coinage [4, 5].
Besides the determination of places of origin, Xray spectra and elemental compositions can also be applied to rule out counterfeit coins. Minemasa Hida et al compared counterfeit and valid 500¥ coins by elemental composition [6]. The coins were successfully differentiated by PCA, as well as by cluster analysis. Moreover, two separate clusters were identified within the group of counterfeit coins [6].
Another interesting application of manual XRF devices is to determine limits of detection through various kinds of packaging: potentially they could be used to rule out post bombs and other explosives [7]. As we can see, measurements carried out with XRF are useful in supporting not only the work of archaeologists, but also the work of the authorities.
Methods
Xray fluorescence spectroscopy
During Xray fluorescent analysis [8, 9] the surface of a sample is irradiated by Xray beam. By applying the appropriate energy, a photoelectron is emitted. The vacancy then is filled by an outer electron, while the energy difference is emitted in the form of Xray fluorescent radiation. The excitation energies correspond to the emission lines of the elements, while the intensity of the emission provides information about their concentration on the sample surface.
The Xray fluorescent technique provides quick nondestructive analysis. It gives information about the composition of metallic and nonmetallic surfaces without the need for any pretreatment. The technique is independent from the chemical state of the elements, but it doesn’t give information about the chemical bonds (oxidation state) of the examined elements. During a measurement with an appropriate excitation source, all of the elements in the sample can be examined simultaneously. This method enables the study of both solid and liquid substances.
We applied an INNOVX Alpha handheld analyzer for our studies, which can easily measure concentration of elements heavier than sodium with 0.01% precision from very different matrices. According to recent research, handheld devices can produce equally accurate results as benchtop XRF analyzers in the study of coins [10].
Instrument specifications

Excitation source: Xray tube, W anode, 1040 kV, 1050 μA.

Detector: Si PiN diode detector, <230 eV FWHM at 5.95 keV Mn Kalpha line.

Standard elements: Pb, Cr, Hg, Cd, Sb, Ti, Mn, Fe, Ni, Cu, Zn, Sn, Ag, As, Se, Ba, Co, Zr, Rb.
Principal component analysis (PCA)
Principal component analysis [11–14] is known by several names in different areas of science, so it can also be found in articles as “eigenvector analysis” or “characteristic vector analysis”. PCA is unsupervised, so we don’t classify the samples before the analysis. The basic idea is that we place our measured data in a data matrix (marked X), in which the rows correspond to the samples (in this case, coins), whilst the columns represent the studied properties (here: metal concentrations). This matrix can be decomposed into the product of two matrices. There are an infinite number of resolutions, but with constraints like orthogonality and normalization the solution becomes definite (aside from central mirroring). During standardization we first shift our original scale by a constant number and then shrink or expand it, so that the arithmetic mean of the property vectors becomes 0 and their deviation 1. The resulting matrices are the score (T) and factor loading (P) matrices.
PCA can be applied to rule out outliers, to reduce our dataset (which can ease our work greatly in cases of big, complex datasets) and to build models that describe the behavior of a physical or chemical system and reveal any pattern in the data. The models can be used for predictions when we introduce new data (new samples measured in the same way).
Linear discriminant analysis (LDA)
Similarly to PCA, linear discriminant analysis [15] (LDA) is a dimensionreducing method, in which we create background variables (called canonical variables, roots) by a linear combination of the variables of the original data matrix. LDA is a widelyused supervised pattern recognition technique. The main difference between PCA and LDA is that LDA is supervised, thus we need to know the class memberships of samples before the analysis. We can create N1 canonical variables for N classes.
During LDA, we plot an ellipse (ellipsoid or a hyperellipsoid in the case of more than three variables) around each group of scattered points. The ellipse can be interpreted as a section plane of a Gaussian surface, which includes a given percentage of the points of the corresponding group. The center of the ellipse represents the maximum of the Gaussian surface. The discriminant function is given by the line connecting the intersections of the ellipses.
Classification and regression tree (CART)
CART [15, 16] is a recursive classification method, which creates binary divisions from our dataset. The principle of this method is to ask yesorno questions during the classification of the samples (i.e. the creation of a tree). The algorithm aims at identifying the possible variables and their values for the best resolution. The starting group is considered the root of the tree, which is always the group with the most samples. At the start, the other groups of samples are included in this group as well. Then the algorithm splits the samples to achieve the most advantageous separation of groups.
Its expressivity made it very popular in various field, such as data classification in medical diagnostics. Its theoretical basis was devised by Breiman, Friedman, Olshen and Stone in the 1980’s [17].
Partial leastsquare regression discriminant analysis (PLSDA)
PLSDA is used for identifying outliers, ruling out variables with low variance (thus easing further studies) and mainly examining groupings of samples. [18–20]. PLSDA is closely related to multivariate linear regression, to PCA, and to principal component regression. A possible implementation of the method is to apply matrix decomposition to the original X and Y data matrices, which are thus expressed as a product of three matrices. In case of (PLSDA) the data matrix Y contains the independent group variables.
Multivariate curve resolution with alternating least squares (MCRALS)
where X is the response matrix, E is the elemental distribution profile matrix of the components, and C is the composition profile matrix for the samples.
where the matrix X* is the reproduced data matrix obtained by principal component analysis for the selected number of components, and ^{+} means the pseudoinverse of the original X matrix [27]. Unfortunately, this decomposition is very often not unique because of the rotational and intensity (scaling) ambiguities [28, 29]. The rotational ambiguities can be moderated or even eliminated if convenient constraints can be used [24–26]. Tauler and coworkers developed a Matlab code for MCRALS with some constraints [30].
Experimental
We have examined 289 silver coins provided by the Déry Museum of Debrecen. 32 coins were omitted from this dataset, because if only a small amount (3 or 4 pieces) was dated back to the time of a particular king, that set cannot be considered representative to that period. Four coins were identified with PCA as outliers in the early phase of research, so they were also omitted. Each measurement (spectrum acquisition and calculation of elemental composition) was carried out three times, with 30 seconds of irradiation. This timespan was found optimally short and precise by a prior investigation of several alloys. We have used the mean of the three measurements in cases, where elemental composition data were needed, and the three results separately, where Xray spectra were needed. The amount of the following elements has been determined: Ti, Fe, Ni, Cu, Zn, Ag, Sn, Sb, Pb, Bi. The properties of coins were summarized in two tables: one containing the mean values of elemental composition (257 × 10), and the other containing intensity values for all studied wavelengths (257 × 2048). Two data matrices were created accordingly and evaluated by PCA, LDA, CART and PLS modules of the software STATISTICA 6.0; besides, MCRALS calculations were completed by PLS Toolbox V6.7.
Results and discussion
PCA results
First we preexamined our variables. A correlation matrix (Additional file 1: Table S1) was created with the variables (metal concentrations) in its rows and its columns. Besides the correlation matrix, descriptive statistics were applied for the preexamination. Both methods found all of the variables acceptable for further studies. The minima, maxima, standard deviations, medians and means of the variables were calculated in the preexamination step (Additional file 2: Table S2).
Then, we carried out a PCA analysis on our standardized dataset and acquired the tables containing the scores (principal components) and the factor loadings. The number of scores was conveniently set to the number of variables, which in this case was not higher than ten. These ten principal components explained the total variance of the data, and 9 of them were linearly independent.
The scores were plotted in two ways. In the first case, the indicator variable had seven possible values, which represented three groups within the Árpád dynasty, three later dynasties and unknowns.
 I1
indicator variable: I2 indicator variable:
 0:
Unknown (18 coins) 0: Unknown
 1:
István I – Kálmán (9971116) (30 coins) 2: István I – Imre:
 2:
István II – Imre (11161204) (60 coins) (9971204) (90 coins)
 3:
András II – András III (12051301) (97 coins) 3: András II  Luxembourg
 4:
Czech (13011305) (5 coins) (12051437) (149 coins)
 5:
Anjou (13081387) (35 coins)
 6:
Luxembourg (13871437) (12 coins)
LDA results
The same data matrix was used, but no spectral data were evaluated in this phase. I1 and I2 indicator variables were applied as grouping variables. Since LDA is a supervised method, grouping variables are necessary for the early classification of the samples to maximize the separation of the groups.
A calculation with the I1 indicator variable returned five concentrations as useful variables for classification, those are: iron, silver, bismuth, lead and tin. Based on maximizing the variance between the groups and minimizing it within the six groups in the classification matrix we could classify the 257 coins into three groups. By doing so, we have shown that the construction of the I2 indicator variable is valid.
In the case of the I2 grouping variable (two groups with the reign of András II as a borderline) iron, copper, bismuth, tin and nickel concentrations were identified by LDA as the best classifying variables.
Using the classification matrix for I2, we have created an I2b grouping variable, where the class membership for the misclassified 61 samples has changed, but the improved classification was not used further on.
CART results
The analysis shows, that several samples from group 2 were included by CART in group 3 or into the unknowns. In the case of the modified I2 grouping variable, classification was more successful in terms of correct classification percentages.
PLSDA results
Although there is an overlap between the coins from before and after András II, (and there is a number of outliers among the coins from before András II), the two groups are separated to a satisfactory extent. The correct classification rate was 76.6%. The inclusion of unknowns is ambiguous in the cases of five coins, but the rest of the outliers can be included in one of the groups with sufficient probabilities.
MCRALS results
Conclusions
With the introduction of several indicator variables, we can observe two welldefined groups in the PCA score plot. The clustering is justified by numismatic and historic theories, and supported by the results of other types of chemometric analysis. The precision of PCA evaluation is not increased by the use of Xray spectra instead of elemental compositions, because with the increasing amount of data, the amount of noise increases proportionally.
We have successfully classified the coins to their corresponding periods with a correct classification percentage of 7678% based on Xray fluorescence data with the use of four statistical analysis methods. This result is considered satisfactory, because the introduction of errors is not limited to the acquisition of spectra and the evaluation, but can also originate from the incorrect archaeological identification of the coins. If the group of unknowns was omitted, CART was the most successful method in classifying the samples to the correct groups. MCRALS based LDA could classify the unknown coins into the group 2 (István IImre).
Elemental compositions of the silver coins from the Árpád Dynasty and the following dynasties were diverse, so overlaps between the groups are possible. Reasons for these overlaps range from counterfeiting to the bad condition of certain coins including the errors of archaeological identification.
However, determining the chemical composition of the coins and evaluating the data with chemometric methods can provide scientifically valid results to aid the archaeologists’ and numismatists’ work in classifying the coins according to their times of origin.
Declarations
Acknowledgement
The authors thank Professor Richard Brereton for his useful advices and steady encouragement.
Authors’ Affiliations
References
 KallithrakasKontos N, Katsanos AA, Touratsoglou J: Trace element analysis of Alexander the Great’s silver tetradrachms minted in macedoni. Nuclear Instruments and Methods in Physics Research Section B. 2000, 171: 342349. 10.1016/S0168583X(00)002688.View Article
 Bugoi R, Constantinescu B, Constantin F, Catana D, Plostinaru D, Sasianu A: Archaeometrical studies of Greek and Roman silver coins. J Radioanal Nucl Chem. 1999, 242 (3): 777781. 10.1007/BF02347394.View Article
 Linke R, Schreiner M, Demortier G: The application of photon, electron and proton induced Xray analysis for the identification and characterisation of medieval silver coins. Nuclear Instruments and Methods in Physics Research Section B. 2004, 226: 172178. 10.1016/j.nimb.2004.03.084.View Article
 Pitarch A, Queralt I, AlvarezPerez A: Analysis of Catalonian silver coins from the Spanish war of independence period (1808–1814) by energy dispersive Xray fluorescence. Nuclear Instruments and Methods in Physics Research Section B. 2011, 269: 308312. 10.1016/j.nimb.2010.11.045.View Article
 Pitarch A, Queralt I: Energy dispersive Xray fluorescence analysis of ancient coins: the case of Greek silver drachmae from the emporion site in Spain. Nuclear Instruments and Methods in Physics Research Section B. 2010, 268: 16821685. 10.1016/j.nimb.2010.02.004.View Article
 Hida M, Sato H, Sugawara H, Mitsui T: Classification of counterfeit coins using multivariate analysis with Xray diffraction and Xray fuorescence methods. Forensic Sci Int. 2001, 115: 129134. 10.1016/S03790738(00)003091.View Article
 Ida H, Kawai J: Analysis of wrapped or cased object by a handheld Xray fluorescence spectrometer. Forensic Sci Int. 2005, 151: 267272. 10.1016/j.forsciint.2005.02.017.View Article
 Beckhoff B, Kanngießer B, Langhoff N, Wedell R, Wolff H: Handbook of practical Xray fluorescence analysis. 2006, Berlin: SpringerView Article
 Jenkins R: Xray fluorescence spectrometry. 1988, New York: John Wiley & Sons
 Pessanha S, Guilherme A, Carvalho ML: Comparison of matrix effects on portable and stationary XRF spectrometers for cultural heritage samples. Applied Physics A. 2009, 97: 497505. 10.1007/s003390095251x.View Article
 Wold S, Esbensen K, Geladi P: Principal component analysis. Chemom Intell Lab Syst. 1987, 2: 3752. 10.1016/01697439(87)800849.View Article
 Christie OHJ: Some fundamental criteria for multivariate correlation methodologies. Chemom Intell Lab Syst. 1987, 2: 5359. 10.1016/01697439(87)800850.View Article
 Joliffe J: Principal component analysis. 1986, Berlin: SpringerView Article
 Malinowsky F, Howery D: Factor analysis in chemistry. 1980, New York: John Wiley and Sons
 Hastie T, Tibshirani R, Friedman JH: The elements of statistical leraning. 2001, Germany: SpringerView Article
 Timofeev R: Classification and regression trees (CART), theory and applications. Master thesis. 2004, Humboldt University, Berlin: CASE – Center of Applied Statistics and Economics
 Breiman L, Freidman JH, Olshen RA, Stone CJ: Classification and regression trees. 1984, Belmont: Wadsworth International
 Geladi P, Kowalski BR: Partial leastsquares regression: a tutorial. Anal Chim Acta. 1986, 185: 117.View Article
 Lindberg W, Persson JA, Wold S: Partial leastsquares method for spectrofluorimetric analysis of mixtures of humic acid and ligninsulfonate. Anal Chem. 1983, 55: 643648. 10.1021/ac00255a014.View Article
 Strang G: Linear algebra and its application. 1980, New York: Academic Press
 Tauler R, Casassas E, IzquierdoRidorsa A: Selfmodeling curve resolution in studies of spectrometric titrations of multiequilibria systems by factoranalysis. Anal Chim Acta. 1991, 248: 447458. 10.1016/S00032670(00)846620.View Article
 Tauler R: Multivariate curve resolution applied to second order data. Chemom Intell Lab Syst. 1995, 30: 133146. 10.1016/01697439(95)00047X.View Article
 De Juan A, Tauler R: Chemometrics applied to unravel multicomponent processes and mixtures  revisiting latest trends in multivariate resolution. Anal Chim Acta. 2003, 500: 195210. 10.1016/S00032670(03)007244.View Article
 De Juan A, Vander Heyden Y, Tauler R, Massart DL: Assessment of new constraints applied to the alternating least squares method. Anal Chim Acta. 1997, 346: 307318. 10.1016/S00032670(97)900696.View Article
 Tauler R: Calculation of maximum and minimum band boundaries of feasible solutions for species profiles obtained by multivariate curve resolution. J Chemom. 2001, 15: 627646. 10.1002/cem.654.View Article
 Van Benthem MH, Keenan MR, Haaland DM: Application of equality constraints on variables during alternating least squares procedures. J Chemom. 2002, 16: 613622. 10.1002/cem.761.View Article
 Golub GH, Van Loan CF: Matrix computations (2nd Ed.). 1989, Baltimore: The John Hopkins University Press
 Tauler R, Smilde A, Kowalski BR: Selectivity, local rank, 3way dataanalysis and ambiguity in multivariate curve resolution. J Chemom. 1995, 9: 3158. 10.1002/cem.1180090105.View Article
 Jiang JH, Liang Y, Ozaki Y: Principles and methodologies in selfmodeling curve resolution. Chemom Intell Lab Syst. 2004, 71: 112. 10.1016/j.chemolab.2003.07.002.View Article
 Multivariate curve resolution homepage. http://www.mcrals.info/,
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.