Minerals Engineering: T.J. Napier-Munn
Minerals Engineering
journal homepage: www.elsevier.com/locate/mineng
ARTICLE INFO

Article history:
Received 5 February 2012
Accepted 30 March 2012
Available online 27 May 2012

Keywords:
Froth flotation
Modelling
Statistics

ABSTRACT

Grade-recovery curves obtained from kinetic batch flotation testing are, like any other measurement, subject to experimental error. This leads to uncertainty in the true position of each cumulative grade-recovery point, the curve itself, and the kinetics. This uncertainty is rarely if ever taken into account when interpreting such curves, in particular when comparing curves obtained under different conditions. This paper proposes a methodology to deal with this problem.

The standard formula is used to establish true confidence intervals for the grade and recovery at each replicated timed concentrate point, and the 2-sample t-test is used to compare these point values between tests conducted under different conditions. The properties of the grade-recovery curves can be compared by fitting an appropriate model to the two data sets and using a bootstrap to create distributions of differences between the model parameters and the model predictions of recovery at any chosen concentrate grade, reflecting the uncertainty in the original data. It is then easy to construct hypothesis tests on the parameter differences and on the mean difference at the chosen grade(s) between the two curves. The same approach can be used to construct confidence intervals on the fitted curves and to test differences in estimated flotation rates. An extra sum of squares test can be used to compare the fitted grade-recovery curves as a whole. Details of the methods are presented, suitable for spreadsheets.

These methods are relatively easy to apply but require that all batch flotation tests be replicated. The alternative (single tests under each condition) ignores the existence of experimental error and renders the data susceptible to subjective and perhaps erroneous interpretation. Using these methods it is not unusual to find that grade-recovery curves thought to represent truly different flotation performance are not in fact statistically different, especially at the longer flotation times (high recoveries).

© 2012 Elsevier Ltd. All rights reserved.
1. Introduction and problem statement

Batch flotation testing is widely used to assess the flotation response of ores under particular conditions. The test usually comprises some standard procedure for grinding the ore and floating it in a batch cell. Sequential timed concentrates are taken to allow a cumulative concentrate grade-recovery curve to be constructed. This curve contains much useful information including the trade-off between grade and recovery and the kinetic characteristics implied in the shape and location of the curve. Batch flotation testing is described by Runge (2010).

Grade-recovery curves are used for many purposes. For example, in future ores testing on a mine site it is common practice to compare the flotation response of future ores to the currently treated material to determine whether the grind or flotation conditions will need to be changed. In testing new reagents or grinding media it is necessary to compare the flotation response of the new condition with the standard or current alternative to determine whether there has been an improvement. Mineral engineers seek to move the whole curve to a higher grade-recovery position through modifications to conditions. The ultimate theoretical curve is that prescribed by the liberation characteristics of the ore (Nice and Brown, 1995).

Fig. 1 shows two grade-recovery curves in which a copper ore was floated with a standard collector (A) and then with an alternative collector (B), to determine if collector B could give an improved performance. Four sequential timed concentrates were taken in each case. Test B appears to show faster flotation initially, but the curves approach each other as the flotation time increases (from right to left), with final recoveries which are similar but with Test B giving a higher concentrate grade.

It is almost inevitable in such testing that grade-recovery curves will be compared in some way. Comparisons are usually made by eye with no reference to the inevitable experimental uncertainties inherent in the construction of the curve. Curves are implicitly

⇑ Address: Julius Kruttschnitt Mineral Research Centre, University Mine, Isles Road, Indooroopilly, Queensland 4068, Australia. Tel.: +61 7 3365 5888; fax: +61 7 3365 5999.
E-mail address: [email protected]

0892-6875/$ - see front matter © 2012 Elsevier Ltd. All rights reserved.
http://dx.doi.org/10.1016/j.mineng.2012.03.036
T.J. Napier-Munn / Minerals Engineering 34 (2012) 70–77 71
Fig. 3. Grade-recovery curves, including 90% confidence intervals on the mean enrichment ratio and recoveries for each flotation time. (Axes: Cum. Enrichment Ratio vs. Cum. Cu Recovery (%); series: Test A, Test B.)

Fig. 4. Sample size required to achieve a given confidence in the mean. (Axes: ± Bound/Std.dev. vs. Sample Size; curves: 90% confidence, 95% confidence.)

Table 1
Statistics for mean recovery and ER for concentrates 1 and 4, Tests A and B.

as the desired bound decreases and the standard deviation increases.

The question now arises as to whether the mean ER and the mean recovery at a given flotation time are the same for the two collectors. This question can be asked and answered for each flotation time separately, because the replicated cumulative grade and recovery estimates at each time can be considered as an independent sample of the population of cumulative grades and recoveries at that time. The question cannot be asked simultaneously for all the flotation times (i.e. for the whole curve) because the error model for the cumulative curve is not known. This problem is discussed further below.

Some indication can be obtained by inspecting the confidence intervals at the points to be compared (e.g. Fig. 3). This is important anyway to remind us of the experimental uncertainties when interpreting the data. However the rigorous method of comparing mean values under these circumstances is the 2-sample t-test, which is available in Excel either as a tool (Tools > Data Analysis > t-test) or as a function (=TTEST). Table 2 shows the results of t-tests of the difference in mean recovery and ER between Tests A and B for the first and last concentrates. A 1-sided test is appropriate because we are testing the alternative hypothesis that B > A, i.e. we are expecting an improvement in flotation with the new collector. The null hypothesis is that B = A, i.e. there is no difference.

Table 2
t-tests for differences in ER and recovery between A and B for cons. 1 and 4.

Value                   Concentrate 1         Concentrate 4
                        ER       Rec. %       ER       Rec. %
Difference (B–A)        0.49     6.31         2.29     -0.83
t-Value                 0.48     6.31         1.59     0.58
1-sided P(t)            0.328    0.002        0.093    0.295
Confidence level (%)    67.2     99.8         90.7     70.5

The P-values and corresponding confidence levels suggest that at Con.1 there is no difference in ER but the recovery with collector B is significantly higher than that with collector A (about 6%). At the final concentrate, there is some evidence for a higher ER with collector B but there is no difference in recovery. Similar conclusions hold for the intermediate concentrates (not shown), with no difference in recovery but P-values of 0.063 and 0.075 for the difference in ER for concentrates 2 and 3 respectively. These conclusions are not unexpected in the light of the confidence intervals shown in Fig. 3. They suggest a real difference in flotation rate early in the process with the B curve displaced somewhat to higher recoveries at the longer times. This difference disappears at longer times.

The comparison of grade-recovery points at a particular flotation time is valid only for that time. Its conclusion is independent of all other parts of the curve. A problem arises in considering particular points in the context of other points on the curve. To what do we attribute differences between points at a particular time? It may be that any significant difference truly reflects an improvement in flotation conditions with the new collector. However the difference may be due to other factors, and the two points, though apparently different, may in fact be on the same grade-recovery curve. This can only be tested by considering the curves as a whole. Equally we may want to test the differences in rate constants which requires a rate model to be fitted to the kinetic data.

3. Comparing the full grade-recovery curve

3.1. The general approach

A rigorous comparison of two full grade-recovery curves is a different matter to comparing the grade or recovery at the same timed points. There are various ways to do this but some are complicated by the fact that the curves are cumulated, meaning that the results for each time are not independent and the error model is unclear. In addition, the approach will depend on what the user wishes to achieve with the comparison. As examples we shall consider four useful possibilities:

- Comparing the parameters of the first order rate models for the timed data.
- Comparing the parameters of models fitted to the grade-recovery data.
- Comparing predicted recovery at selected values of ER (or concentrate grade). This is equivalent to generalising the comparisons we made at the experimental flotation times, for which we used the t-test.
- Comparison of the two fitted grade-recovery curves as a whole.
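As a reference point before the curve-level methods, the point-wise comparison behind Table 2 can also be sketched outside Excel. The Python fragment below is only an illustration under stated assumptions: the triplicate recoveries are hypothetical (they are not the paper's measurements), the helper name `one_sided_ttest` is invented here, and a pooled-variance test is assumed because the text does not say which variant of =TTEST was used.

```python
import numpy as np
from scipy import stats

# Hypothetical triplicate recoveries (%) at concentrate 1; NOT the paper's data.
rec_a = np.array([62.1, 63.4, 61.8])
rec_b = np.array([68.0, 69.5, 68.9])

def one_sided_ttest(b, a):
    """2-sample t-test of the alternative hypothesis mean(b) > mean(a).

    Assumes equal variances (pooled test); this is an assumption of the sketch.
    """
    t_stat, p_two = stats.ttest_ind(b, a, equal_var=True)
    # Convert the 2-sided P-value to the 1-sided value for H1: B > A.
    p_one = p_two / 2 if t_stat > 0 else 1 - p_two / 2
    return t_stat, p_one

t_stat, p_one = one_sided_ttest(rec_b, rec_a)
confidence_level = 100 * (1 - p_one)  # same convention as the last row of Table 2
```

The confidence level is computed as 100(1 − P), which reproduces the relationship between the last two rows of Table 2 (for example P = 0.002 gives 99.8%).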
We first fit an appropriate model to the data by the usual method of least squares, in this case using Excel’s Solver routine. We then generate a large number of parameter estimates by bootstrapping, that is by fitting the model many times (say 1000 times) to the original model predictions perturbed by random noise reflecting the inherent uncertainty in these predictions. Because the tests were replicated, this uncertainty will reflect the real experimental error in the data. Press et al. (1989) discuss the method and its mathematical justification.

We will then have 1000 values of the quantities we are interested in, such as rate constant or the recovery at ER = 17, for both test conditions, A and B, whose variability reflects the uncertainty in the original fit of the model. We hope that the model form is appropriate because then the uncertainty in the fit will reflect only the experimental error in the original data rather than any shortcoming in the model. We can then make our deductions simply by inspecting the distribution of these quantities of interest, in particular the distribution of the differences in key values between Tests A and B.

This strategy requires calling Solver 1000 times. Each time, normally distributed random numbers with mean zero and standard deviation equal to the standard error of the original model fit are added to the original model predictions. These form the new ‘data’ for Solver to use to fit the model. To perform this task we use the Excel add-in MCSimSolver (Barreto and Howland, 2005) which has three important features for this application:

1. It can repeatedly call Solver to perform the same fit, a defined number of times (e.g. 1000).
2. It can store the resulting 1000 values of quantities that we choose to define, such as the model parameters and predictions.
3. It includes a non-volatile random number generator called RANDOMNV() which is needed because once the set of random numbers is generated for a given fit we need to leave Solver to iterate to a solution; if we used a standard volatile Excel function such as RAND() then the values of recovery would change for each iteration and Solver would fail.

3.2. Comparing first order rate parameters

It is often helpful to compare directly the kinetic parameters of the two separations. The four times at which concentrates were taken in Tests A and B were 1, 3, 6 and 11 min. We will fit the simple first order rate equation to these data, separately for the two tests (a more complex model incorporating fast and slow components could also be used):

$$R_t = R_m \left(1 - e^{-kt}\right) \qquad (3)$$

where R_t = recovery at time t, R_m is the maximum recovery at infinite time, and k is the first order rate constant (min⁻¹). Eq. (3) is non-linear in the parameters. The parameters can be estimated by using Excel’s Solver to minimise the sums of the squares, SS, of the differences between the observed recovery values and those predicted by the fitted model:

$$SS = \sum_{i=1}^{n} \left(R_i - \hat{R}_i\right)^2 \qquad (4)$$

Table 3 shows the estimated parameters and statistics for the fit of Eq. (3) to the two data sets. The standard errors of the parameters and the fit, and the coefficient of multiple determination (R²), were estimated using the Excel macro SolvStat (Billo, 2001).

The difference in the values of Rm is small but the k-values differ by about 0.5 min⁻¹. The question is: are these differences statistically significant in the context of the uncertainties in their estimation?

Table 4 shows an Excel worksheet set up to solve the problem, using the data of Test A.

Column A contains the triply replicated concentrate times, and Column B the corresponding experimental recoveries. Column C contains the recoveries predicted by the original fit of the rate model with the parameters shown in Table 3. Column D generates the normally distributed random numbers with zero mean and standard deviation = 2.004 (the standard error of the original model fit, see Table 3) and adds them to Column C. The first row of this column thus contains the function =C2 + NORMINV(RANDOMNV(),0,$H$6) where H6 contains the standard error of the original fit. Cells H1 and H2 contain the parameters estimated by the current run of Solver, and Column E contains the recoveries predicted by those parameters. The sum of squares to be minimised by Solver is that between Columns D and E and is calculated in cell H4 from the Excel function =SUMXMY2(col.D,col.E).

MCSimSolver can be set up to run any number of simulations (1000 is the default), and to record any cell selected in the worksheet. In this case cells H1 and H2 are selected because we wish to interrogate the distribution of the two model parameters. The output is 1000 values of Rmax and k (and any other quantity that the worksheet may compute and that we may select).

The question we want to answer concerns the differences in model parameters between Tests A and B. We therefore construct 1000 differences between the values of the two parameters in Tests A and B and inspect the properties of these differences, as shown in Table 5.

The lower and upper 95% confidence limits are calculated simply as the 0.025 and 0.975 percentiles of the 1000 values of the differences, using Excel’s =PERCENTILE function. The z hypothesis test is conducted by calculating z = mean/std.dev. and then calculating the 1-sided P-value from the function =1-NORMSDIST(z); the 2-sided value is twice this.

As we are searching for an improvement in the performance of Test B over that of Test A it is a 1-sided test. Clearly there is no significant difference in the values of Rmax: P = 0.26 and the confidence interval includes zero. However there is a highly significant difference in the values of the rate constant, k. P = 1.5 × 10⁻⁵ (>99.99% confidence that the difference is not zero). The magnitude of the difference is 0.47 min⁻¹ and the 95% confidence interval of the difference is 0.26–0.69 min⁻¹.

We can obtain a visual impression of these hypothesis tests by inspecting the distribution of bootstrapped parameter differences as histograms, shown in Fig. 5.

26% of the distribution of Rm differences lie above zero and 74% below. We would normally require 5% or less to be above (or below) zero before we would accept the alternative hypothesis
Table 4
Excel worksheet for bootstrapped fit of rate model (Eq. (3)).
[Fig. 5: histograms of the bootstrapped parameter differences between Tests A and B; panels: Difference in Rm (B-A), Difference in k (B-A).]
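For readers working outside Excel, the bootstrap comparison of first order rate parameters described in Section 3.2 can be sketched in Python. This is a minimal sketch under stated assumptions, not the author's implementation: SciPy's `curve_fit` stands in for Solver and MCSimSolver, the recovery data are synthetic (generated from hypothetical parameters, not the paper's measurements), only 200 repetitions are run to keep the example quick, and the function names (`first_order`, `fit`, `bootstrap`, `compare`) are invented for illustration.

```python
import numpy as np
from scipy.optimize import curve_fit
from scipy.stats import norm

def first_order(t, r_max, k):
    """Eq. (3): recovery at time t for a first order rate process."""
    return r_max * (1.0 - np.exp(-k * t))

def fit(t, rec):
    """Least-squares fit; returns the parameters and the standard error of the fit."""
    popt, _ = curve_fit(first_order, t, rec, p0=[90.0, 0.5])
    resid = rec - first_order(t, *popt)
    se_fit = np.sqrt(np.sum(resid**2) / (len(t) - len(popt)))
    return popt, se_fit

def bootstrap(t, rec, n_boot, rng):
    """Refit n_boot times to the original predictions plus N(0, se_fit) noise."""
    popt, se_fit = fit(t, rec)
    pred = first_order(t, *popt)
    out = np.empty((n_boot, 2))
    for i in range(n_boot):
        noisy = pred + rng.normal(0.0, se_fit, size=pred.size)
        out[i], _ = curve_fit(first_order, t, noisy, p0=popt)
    return out  # columns: Rmax, k

def compare(diff):
    """Percentile 95% limits and 1-sided z-test on bootstrapped differences (B - A)."""
    lo, hi = np.percentile(diff, [2.5, 97.5])
    z = diff.mean() / diff.std(ddof=1)
    return lo, hi, 1.0 - norm.cdf(z)

# Synthetic triplicate data at 1, 3, 6 and 11 min (hypothetical values).
rng = np.random.default_rng(0)
t = np.repeat([1.0, 3.0, 6.0, 11.0], 3)
rec_a = first_order(t, 92.0, 0.60) + rng.normal(0.0, 2.0, t.size)
rec_b = first_order(t, 92.0, 1.00) + rng.normal(0.0, 2.0, t.size)

boot_a = bootstrap(t, rec_a, 200, rng)
boot_b = bootstrap(t, rec_b, 200, rng)
k_lo, k_hi, k_p = compare(boot_b[:, 1] - boot_a[:, 1])        # difference in k
rmax_lo, rmax_hi, rmax_p = compare(boot_b[:, 0] - boot_a[:, 0])  # difference in Rmax
```

The `compare` helper mirrors the spreadsheet steps: the 0.025 and 0.975 percentiles give the confidence limits, and z = mean/std.dev. gives the 1-sided P-value. Because the random numbers are drawn once per repetition before the fit starts, the volatility problem that RANDOMNV() solves in Excel does not arise here.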
for the hypothesis test of 0.077. This reaches significance at 90% but not 95% confidence, an equivocal result. There is no such equivocation in the case of the recoveries at ER = 25. The mean difference is 6.3% and P = 2 × 10⁻⁷. We are very confident that this difference is real, with confidence limits of 3.8–8.6%. Test B has a significantly higher recovery at ER 25 than Test A.

Fig. 7 shows the 95% confidence limits of the fitted grade-recovery curves. They are non-symmetric as we would expect, with the limits widening at each end of the curves where there are no data. They also overlap strongly at each end which is consistent with the non-significance in the differences in the Rmax and ER50 parameters as both these values are well outside the range of the

Table 6
Fitted Bruey model parameters for Tests A and B.

Parameters    Test A                 Test B
              Value    Std. error    Value    Std. error
Rmax          97.28    5.41          91.81    3.56
c             8.60     1.21          10.62    1.86
ER50          32.11    1.38          33.55    1.74
R²            0.951                  0.931
Std. error    2.037                  1.528
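Confidence limits on a fitted curve, such as those described for Fig. 7, can be obtained from the bootstrapped parameter sets by evaluating the model for every set and taking percentiles at each value of ER. The Python sketch below illustrates only that last step. It rests on loud assumptions: the Bruey model equation is not reproduced in this section, so a placeholder linear function (`stand_in_model`) is used in its place, and the parameter samples are generated artificially rather than by repeated fitting. The helper name `percentile_band` is also invented here.

```python
import numpy as np

def percentile_band(model, param_samples, x, level=0.95):
    """Pointwise confidence limits of a fitted curve from bootstrapped parameter sets."""
    # One predicted curve per bootstrap parameter set: shape (n_boot, n_x).
    curves = np.array([model(x, *p) for p in param_samples])
    tail = 100.0 * (1.0 - level) / 2.0
    lower, upper = np.percentile(curves, [tail, 100.0 - tail], axis=0)
    return lower, upper

def stand_in_model(er, a, b):
    """Placeholder curve (NOT the Bruey model): recovery falling linearly with ER."""
    return a - b * er

rng = np.random.default_rng(1)
# Hypothetical bootstrap output: 1000 (a, b) pairs scattered around a fitted value.
samples = np.column_stack([rng.normal(100.0, 1.0, 1000),
                           rng.normal(1.0, 0.05, 1000)])
er_grid = np.linspace(10.0, 30.0, 21)
lower, upper = percentile_band(stand_in_model, samples, er_grid)
```

Because the limits are percentiles of the predicted curves rather than a symmetric multiple of a standard error, they need not be symmetric about the fitted curve, which is consistent with the behaviour described for Fig. 7.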
Fig. 7. 95% confidence limits for model fits for Tests A and B grade-recovery curves. (Axes: Cum. Enrichment Ratio vs. Cum. Cu Recovery (%); series: model fits to experimental data, 95% CLs Test A, 95% CLs Test B.)

Table 7
Statistics for differences in Bruey model parameters between Tests A and B.

- The difference in predicted recovery at an ER of 17 is 1.6%. The significance of this difference is equivocal, reaching significance at 90% confidence but not 95%.
- The difference in predicted recovery at an ER of 25 is 6.3%, which is highly significant.
- A better fit is obtained by fitting the model separately to the two data sets rather than globally to the combined data set. This implies that the two grade-recovery curves are really different overall.

This particular data set illustrates well the need for careful interpretation of the question ‘are they different?’ It depends on what we mean by ‘different’ and the answers must be interpreted accordingly. For some data sets all the answers are clearly ‘no’ and for others ‘yes’. In this case, which is not at all unusual, it depends on how the question is formulated. This does not detract from the utility of the process. It simply reminds us, if we needed reminding, that these curves contain a lot of information, and are subject to the usual experimental uncertainties. Rate constants tell us different things from grade-recovery relationships, and the latter may themselves differ over the range of the data. Asking whether the parameters are different is a different question from asking if the curves as a whole are different. A systematic assessment of the differences between two batch flotation data sets, using the tools suggested in this paper, can give rigorous conclusions of value to the experimenter.

The present grade-recovery example has recovery plotted on the y-axis and enrichment ratio on the x-axis. There is no reason why they should not be plotted in reverse, and this is necessary if differences in grade (or ER) are to be tested at particular values [...] algebra on the fitted form of Eq. (7) to get ER on the left hand side will lead to an incorrect solution.

Likewise there is no absolute need to use enrichment ratio rather than concentrate grade. It has been found however that ER is often more meaningful because it is normalised to feed grade which may vary even in closely controlled laboratory experiments. In plant data ER is often necessary to reproduce the conventional shape of a grade-recovery curve which is not possible using concentrate grade alone because of uncontrolled variations in feed grade.

The example given in this paper uses an unweighted least squares criterion for fitting models. It is also possible to apply weighted least squares, using for example the variances of the actual grade (ER) and recovery measurements at each time as the inverse weights for each time, if these are known.

The proposed bootstrap solution can be used to compare any replicated data sets for which appropriate models are available, including other forms of flotation performance curves from batch tests, pilot tests or production data, separability curves in general, and the comparison of the linear recovery-feed grade relationships in plant trials discussed by Napier-Munn (1998). The advantages of the present method in comparing grade-recovery curves are:

1. It makes no assumptions regarding the linearity of the data.
2. It is distribution-free, that is, it makes no assumptions as to the distribution of the original data. It only assumes that the random errors are normally distributed.
3. It does not require that the two fitted grade-recovery models give parallel curves. When they are not parallel then several comparisons may be necessary at different values of ER.

The bootstrap comparison method does depend on the assumption that the model forms chosen to represent the data, whether kinetic or grade-recovery, are correct in the sense that any lack of fit is attributable only to experimental error and not functional form. The use of 1000 repetitions is a compromise between the desired precision of estimate (10,000 might be a better number) and the time required for the bootstrap. The Bruey 3-parameter model took about 3 min to generate 1000 Solver fits on an average laptop PC.

5. Conclusions and Recommendations

Grade-recovery curves, like any other empirical measurement, are subject to experimental error. In order to ensure that informed
conclusions are drawn from inspection of these curves, this error should be embraced and managed. Standard statistical procedures such as confidence intervals and hypothesis tests can be used to calculate the uncertainty in each mean grade-recovery point and to compare such points. A new bootstrap method has been introduced to compare the kinetic curves in terms of rate parameters, and the full grade-recovery curves in terms of model parameters and recoveries at any selected values of enrichment ratio. This involves repeatedly fitting a model to each data set perturbed by random errors with standard deviation equal to the original model fit standard error, to generate a distribution of model parameters. These are then used to generate many estimates of the quantities of interest (parameters and predictions) and thus many estimates of the difference in those quantities between two data sets. These differences have statistical properties which correctly reflect the uncertainties in the original data. The distribution of differences is used to perform non-parametric hypothesis tests of the significance of the observed mean difference and to calculate confidence intervals on the differences. Confidence intervals on the fitted curves can be computed in a similar way. These procedures are relatively easy to implement in a spreadsheet.

Table 8
Statistics for comparing the grade-recovery curves using extra sums of squares.

No-one wants to be told to do three or four (or more) tests where historically one would have been regarded as the norm. However replicate testing is essential if these methods are to be utilised. The alternative is to ignore the inevitable presence of experimental error and perhaps draw misleading or erroneous conclusions from the data. It is therefore recommended that all batch flotation tests be replicated and that confidence intervals be reported such as those in Figs. 3 and 7, however awkward the consequences. The number of replications required to estimate mean point grades and recoveries to a given uncertainty can be calculated (Eq. (2) and Fig. 4). It is further recommended that the methods described in this paper be used wherever comparisons are to be made between batch flotation results collected under different conditions, and that the confidence intervals on the model parameters and curves also be routinely reported.

The fact that the two grade-recovery curves shown in Figs. 6 and 7 are not statistically different at the longer flotation times is not an unusual result in such cases, not least because at higher recoveries there is less room for a superior performance to express itself. This kind of analysis highlights potential deficiencies in the way kinetic batch flotation data are presented and interpreted as [...] and mass pulls can propagate into the whole curve, rendering interpretation difficult. Similarly there is a lack of important information about water recovery and entrainment. Other ways of formulating grade-recovery information could be considered, and this is likely to be a fruitful area of future research (Neethling and Cilliers, 2008). The modelling of the data in terms of the mass proportions of slow and fast floating species is an established alternative which may carry more useful information (Morrison and Alexander, 1998).

Acknowledgements

The author’s interest in this problem arose out of stimulating discussions with Dr. Neville Plint of Anglo Platinum, Deryck de Vaux of Anglo Research, Prof. Dee Bradshaw of the JKMRC, Dr. Chris Greet of Magotteaux Australia and Dr. Frank Bruey of Cytec. He thanks Dr. Bruey for permission to quote his re-parameterisation of Vera’s function. He also acknowledges helpful discussions with Prof. Bill Whiten of the JKMRC. Dr. Rob Morrison of the JKMRC kindly read an early version of the paper and made useful suggestions. Some of the ideas in the paper were first presented at the SAIMM Minerals Processing 09 conference in Cape Town in August 2009.

References

Barreto, H., Howland, F.M., 2005. Introductory Econometrics: Using Monte Carlo Simulation with Microsoft Excel. Cambridge Uni. Press, p. 798.
Billo, E.J., 2001. Excel for Chemists: A Comprehensive Guide. John Wiley and Sons, Ch.12 – Non-Linear Regression using the Solver.
Bruey, F., 2010. Private Communication (Cytec Inc.).
Gy, P.M., 1979. Sampling of Particulate Materials: Theory and Practice. Elsevier.
Morrison, R.D., Alexander, D.J., 1998. Rapid estimation of floatability components in industrial flotation plants. Miner. Eng. 11 (2), 133–143.
Napier-Munn, T.J., 1998. Analysing plant trials by comparing recovery-grade regression lines. Miner. Eng. 11 (10), 949–958.
Neethling, S.J., Cilliers, J.J., 2008. Predicting and correcting grade-recovery curves: theoretical aspects. Int. J. Miner. Proc. 89, 17–22.
Nice, R.W., Brown, P.J., 1995. The design of a base metals separation process. In: Proc. XIX Int. Miner. Proc. Cong., San Francisco, (SME), 137–143.
Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T., 1989. Numerical Recipes. Cambridge Uni. Press, p. 702.
Runge, K., 2010. Laboratory flotation testing – an essential tool for ore characterisation. In: Greet, C.J. (Ed.), Ch.9 in Flotation Plant Optimisation, AusIMM.
Vera, M.A., Franzidis, J.P., Manlapig, E.V., 2000. The locus of flotation performance. In: Proc. XXI Int. Miner. Proc. Cong. Rome, 23–28 July. Elsevier Science, pp. 74–82.