Non-deterministic load and dump behaviour in mining haul trucks: a case of study

Abstract

Complex load and haul cycles in mining are composed of individual steps, whose times could be better described by a statistical distribution than by the average value. In order to evaluate how loading times and dumping times behave, this paper tested a large dataset of loading and dumping times measured at an open pit limestone mine in Brazil against the distributions most commonly used to model these variables, Log-normal and Normal; as well as Gamma, Logistic, Weibull and Exponential distributions. None of the tested distributions provided statistically significant adherence to the data, but it was possible to identify that for most equipment, Logistic and Normal distributions would produce less error on stochastic modelling then the other tested distributions.

Keywords

Mining stochastic modelling load and haul statistical distribution

Introduction

The matching between haulage equipment and loading equipment is often the most important cost driver during the earthmoving phase of mining operations accounting for approximately 60% of total operating costs in open-pit mines (Upadhyay and Askari-Nasab 2018). Tightly related to the excavation and production planning, proper fleet sizing allows for the optimal usage of trucks, loaders and excavators. An optimized match between loading and hauling equipment means maximum productivity, resulting in lower capital and operating costs for a given planned production.

Traditional fleet sizing techniques rely on deterministic models, meaning variables have fixed values and the models output a fixed outcome, which in turn is corrected by performance factors such as Utilization and Availability.

The productivity of hauling and loading equipment is an important variable, being one of the main considerations in fleet sizing. It is defined as the amount of material handled per unit of time, usually tons per hour.

Therefore, in a deterministic fleet sizing, it is usual to calculate the expected productivity of trucks and excavators or loaders, and from there choose the best match considering investment and running costs, safety and operational restrictions (Hustrulid et al. 2013). This implies that Utilization and Availability should show some degree of correlation with actual equipment productivity and previous works have shown such correlations can be used along with regression techniques, to model past events (Lanke et al. 2016).

To estimate the real productivity, the deterministic methods will generally calculate the maximum theoretical productivity, or nominal productivity and multiply this value by a performance factor, which aims to correct the value to a more realistic scenario in which the many constraints of the actual operation would interfere with the nominal scenario and degrade the performance (Mambo 2017). An accurate prediction of productivity would ideally ensure the planned production will be met and no installed load and haul capacity will be wasted.

Approaches to simulation that use average values of cycle times as their input variables, estimating a static scenario, albeit adjusted, fail to try to approximate real world operational conditions.

Determinist methods try to foresee a future operation, but are based solely on estimations and historical data, making these techniques prone to significant errors, due to the fact that actual future operations might not fit properly on calculated Utilization and Availability parameters. Mining operations are complex systems, subject to human imperfections and randomness, which makes it very difficult to model analytically (Soofastaei et al. 2016).

The main source of error in deterministic models derives from the fact that real world variables are non-fixed and as such, cannot be properly represented by their averages. This is especially relevant when evaluating truck bunching and overall queue formation as well as idleness of excavators and loaders. A truck having a slower cycle may force the next truck to lose time, be it by slowing down during travel time of having to wait for the loading equipment to finish loading the previous truck, negatively impacting this truck's cycle time. Modelling dynamic processed by fixed averages tend to result in calculations that are not representative of reality (Curi 2014).

The stochastic approach is one way to reduce the aforementioned error and its validation when compared with real-world data shows much better predictions than deterministic models (Martins 2013). Yet, very few works have been published about the behaviour of stochastic variables of load-haul-dump cycle times. There is no consensus on if the individual steps within the cycle can be modelled by a known statistical distribution and if so, which statistical distributions better model the data. Some of the few such works suggest empirically that a Normal distribution could be a viable probability density distribution for modelling loading and dumping times (Rodrigues and Pinto 2006; Martins 2013). Other study based on data from an Indonesian coal mine chose Log-normal distribution as the best fit (Chaowasakoo et al. 2017b), in agreement with a previous availability-based simulation framework using data from a Chilean open-pit mine (Mena et al. 2013). More recent studies adopted Log-normal for dumping and Normal for loading (Januario and Souza 2019).

The present paper is a part of the research conducted by this research team, focused on the stochastic simulation of load and haul operations in mining, in which we discuss the behaviour of the loading and dumping times, the most relevant fixed cycle variables. The paper aims to better understand if some of the statistical distributions most commonly used to model these operations are indeed representative of these operations and how they compare to an empirical cumulative distribution of this study case's measured data.

Materials and methods

In order to evaluate how the cycle variables behave from a statistical standpoint, this research gathered data provided by the truck drivers with the aid of a mining management system, or mining dispatch system, following a strict protocol. The measurements took place in a limestone mine located in Minas Gerais, Brazil, between August and October, 2019, with a total of 14,855 full cycles measured with a fleet of nine off highway haul trucks and three on highway dump trucks, both fleets with 30t capacity bins, in an interchangeable load and haul operation (classified as type m-trucks-for-n-shovels dispatching (Chaowasakoo et al. 2017a)) as depicted on Figure 1.

Figure 1

Fleets and existing combinations for the load and haul cycle at the mine. Images are available in colour online.

For loading times, the start of measurement was defined as the moment when the truck was positioned under the loading equipment bucket, or on a pre-defined mark in cases where the loading equipment were unable to help truck spotting. The end of measurement was standardized as the moment immediately before the truck is released by the loading equipment and starts moving. For dumping, the start was defined as the pressing of the button or pulling of the lever that activates the hydraulic system which raises the truck bin, and the end corresponds to the moment immediately before the truck starts moving away from the dumping spot. Figure 2 represents the generic load-haul-dump cycle with the parameters for beginning and ending of load and dump times.

Figure 2

Data gathering parameters for loading and dumping times within the mining cycle. Images are available in colour online.

All truck drivers were trained in this protocol, and in order to reduce errors produced by the intrinsic differences between equipment, we grouped loading time data considering the type of machinery (three excavators and four hydraulic loaders) and dump time was filtered by each truck separately.

Outliers and human error were treated by discarding measurements below 0.2 min in loading and 0.15 min in dumping, as well as value above 5 min in loading and 1 min in dumping. These hard caps were arbitrated based on typical cycle times provided by manufacturers (Caterpillar Inc 2013, 2018) on physical limitations of the equipment and reasonable assumptions of operational factors, in which should those be outside the proposed range it is safe to assume it is either bad data, caused by human error, or an atypical situation caused by external factors.

Data points were then grouped by truck class (off highway and on highway) and by loading equipment type (excavators and loaders) as shown on Figure 1. Finally, the data were analysed using Minitab (Minitab Inc. 2013) statistical software's probability plots, and had their Goodness of Fit calculated for the most common statistical distributions, using Anderson–Darling test (Minitab Inc. 2017) as reference to their adherence to each of the distributions.

For graphical analysis all data were plotted and compared to known cumulative distribution functions in order to provide an empirical and visual verification of how close the distribution is to the real-world data.

Table 1 shows the basic statistics of each sub-set of loading time data. There are clear differences in average loading times for excavators and loaders, and although this is an indirect measurement (measured by truck drivers and not by loading equipment operators), all data has shown very similar overall distribution, peaking around the mean and with high symmetry between both sides.

Table 1

Loading sub-datasets statistics.

Class	Tag	Samples	Mean	Standard Deviation	Min	Q1	Median	Q3	Max
(minutes)
Excavator	ESC-1	1439	2.9729	0.6960	0.22	2.53	2.96	3.38	4.98
ESC-2	1178	3.0495	0.5953	0.23	2.68	3.01	3.40	4.91
ESC-M	2770	2.7862	0.7983	0.23	2.17	2.69	3.28	4.99
Loader	PCR-3	986	2.5336	0.7075	0.22	2.00	2.54	2.97	5.00
PCR-4	20	2.5345	0.3619	1.74	2.32	2.62	2.72	3.41
PCR-5	423	2.5936	0.7568	0.27	2.08	2.49	2.97	4.97
PCR-6	448	2.4362	0.6391	0.20	1.96	2.38	2.78	4.93

Initial data points: 14,855; used data points: 7,264; discarded data points: 7,591.

Dumping statistics are shown in Table 2, with overall similar averages and standard deviations, as well as peaking close to the mean. Since dumping is more dependent on the mechanical and hydraulic set of each truck then on operator skill, it is expected to be more uniform across a given fleet of trucks from the same category then loading times.

Table 2

Dumping sub-datasets statistics.

Class	Tag	Samples	Mean	Standard Deviation	Min	Q1	Median	Q3	Max
(minutes)
Off Highway	CFE-1	875	0.4716	0.1146	0.15	0.41	0.46	0.53	0.92
CFE-2	1126	0.4860	0.1747	0.15	0.33	0.50	0.62	1.00
CFE-3	0	–	–	–	–	–	–	–
CFE-4	0	–	–	–	–	–	–	–
CFE-5	1743	0.5457	0.1186	0.15	0.48	0.52	0.58	1.00
CFE-7	363	0.4390	0.1287	0.15	0.35	0.44	0.51	0.95
CFE-8	1563	0.4350	0.1278	0.15	0.36	0.42	0.49	1.00
CFE-9	0	–	–	–	–	–	–	–
CFE-10	0	–	–	–	–	–	–	–
On Highway	CRB-1	733	0.4658	0.1568	0.15	0.37	0.44	0.54	1.00
CRB-2	0	–	–	–	–	–	–	–
CRB-3	453	0.5044	0.1610	0.15	0.40	0.49	0.59	1.00

Initial data points: 14,855; used data points: 6,856; discarded data points: 7,999.

Since data gathering was based on manual input and self-report, it was expected that a high number of bad data points would occur. This resulted in all data from CFE-3, CFE-4, CFE-9, CFE-10 and CRB-2 being discarded due to not fitting the specified range of acceptable values, as well as some of the data from other trucks, totaling 51.10% of the loading data and 53.85% of the dumping data being discarded. This is indicative of some truck drivers unwillingness to properly follow the guidelines of this study, as well as a large amount of human error.

The trade-off in this case would be using a fully automated data gathering solution, capable of more consistent data gathering, but subject to instrument precision error and less discretion than a human operator. Future studies should explore this alternative, as well as new technologies.

Results

The Anderson–Darling number is a statistical tool capable of evaluating how close a given dataset is from a known probability density function. We tested each sub-dataset for Log-normal, Exponential, Normal, Weibull, Gamma and Logistic distributions. Table 3 presents the calculated Anderson–Darling values for loading times. Smaller Anderson–Darling numbers are indicative of better fit to the tested statistical distribution.

Table 3

Anderson–Darling values for loading times.

Class	Tag	Log-normal	Exponential	Normal	Weibull	Gamma	Logistic
Excavator	ESC-1	37.549	396.964	4.270	9.965	17.281	1.331
ESC-2	21.475	359.578	4.212	12.968	9.629	1.767
ESC-M	5.137	662.886	23.044	27.683	5.483	19.631
Loader	PCR-3	6.027	240.319	3.068	5.564	2.592	3.202
PCR-4	0.668	6.960	0.538	0.619	0.610	0.448
PCR-5	3.381	101.932	4.682	6.243	2.115	2.865
PCR-6	5.212	117.067	4.104	6.963	2.914	2.544

Larger datasets have shown a tendency to result in higher Anderson–Darling values. This effect can be attributed to goodness-of-fit tests discriminatory power, which increases with an increased sample size, resulting in even small differences between two datasets being considered statistically significant (Lazariv and Lehmann 2018).

Across most loading equipment the Logistic distribution has shown to be the best fit, followed by Normal. For loaders, the Gamma and Log-normal distributions also yielded good results.

Excavator ESC-M is discordant from the rest of loading equipment. This can be attributed to the fact that this excavator was operated and maintained by a third-party contractor and was mostly used for pit expansion, loading coarse blasted rock and as ancillary equipment on earthmoving tasks.

For dumping times, we tested for the same distributions and presented the results in Table 4. Analogue to Table 3, smaller Anderson–Darling numbers are indicative of better fit to the tested distribution.

Table 4

Anderson–Darling values for dumping times.

Class	Tag	Log-normal	Exponential	Normal	Weibull	Gamma	Logistic
Off Highway	CFE-1	11.391	244.251	12.710	22.912	9.657	4.555
CFE-2	21.390	205.645	9.103	8.029	15.284	11.522
CFE-5	40.805	525.340	57.043	79.738	42.731	36.875
CFE-7	3.240	85.539	1.194	2.231	1.781	0.865
CFE-8	12.121	387.219	25.121	36.792	12.101	10.969
On Highway	CRB-1	2.918	158.344	9.723	10.909	2.901	5.131
CRB-3	3.145	99.348	2.556	3.353	1.516	1.299

As noted with loading times, for dumping times larger datasets tend to result in higher Anderson–Darling test values. The Logistic distribution has the best fit for most trucks, although Gamma, Log-normal and Normal are also good fit for some of the datasets.

The results were reasonably consistent among all trucks and loading equipment. Also, loading and dumping times, both followed the same overall curve shape with similar Anderson–Darling test values. It is noteworthy however, that the mean and standard deviation of the datasets were different for each truck class and loading equipment type.

To exemplify the results graphically, we chose a single sub-dataset as reference. Figure 3 exemplifies the consolidated results of Goodness of Fit tests for the loading time of hydraulic excavator ESC-1, with Anderson–Darling (AD) values for each distribution. Although some probability density functions were less prone to error than others, all p-values are below significance level for all tests, including the other sub-datasets, which implies that none of the tested data is statistically significant to the distributions. Therefore, modelling using any of the tested probability density functions to model loading and dumping times is subject to some degree of error.

Figure 3

Goodness of Fit plots for hydraulic excavator ESC-1, for a 95% confidence interval. Images are available in colour online.

To verify the Goodness of Fit test results and to test the definition proposed by Chaowasakoo et al. (2017b), Figure 4 shows the cumulative probability distribution of ESC-1 load time, where it is possible to see the gaps between the Log-normal curve and real world data from the mine, as well as how Normal and logistic curves provide a much better fit.

Figure 4

Empirical cumulative probability distribution of load time for ESC-1. Images are available in colour online.

The empirical distributions for all equipment combined, along with corresponding histograms is shown in Figure 5 for both Loading and Dumping times.

Figure 5

Empirical cumulative probability distribution and histograms of load and dump times. Images are available in colour online.

Discussion

Previous to this study, the most comprehensive study to date, done by Chaowasakoo et al. (2017b), analysed data from a coal mine and adopted Log-normal distribution as being more representative for unitary operations, based on graphical analyses and empirical fit of the Log-normal and exponential curves compared to the empirical probability density function. The Log-normal distribution proposed by Chaowasakoo et al. (2017b) has better fit than the exponential distribution, but it still has significant error when compared to the empirical probability density function.

The effect of rare longer times was not dismissed by Chaowasakoo et al. (2017b), which could be useful to represent outside influence factors in this particular case, but makes it impossible to isolate the cycle variables for further analysis. By adopting hard limits to how much time a loading or dumping operation can still be considered typical, in order to reduce data errors and outliers, the empirical function would strongly resemble the bell-shaped curve, often associated with the standard Gaussian distribution, which in turn would make Chaowasakoo et al. (2017b) results consistent with the results presented on this paper.

For excavator ESC-M, since the kind of work performed varied during the time of data gathering, for any meaningful conclusion, it would be necessary to filter the data in order to analyse each specific activity separately, which is beyond the scope of this paper.

The data gathering was based on human reporting, and as such could be flawed due to human error. This makes the analysis of Anderson–Darling statistics, as well as p-value, of very little significance. However, even though bad data are significant enough to result in unreliable statistical tests, when observed in its cumulative probability distribution, these unreliable datum points are diluted within the much larger volume of good data and the overall graphical behaviour accurately shows how the variable behaves in real world conditions. This brings greater relevance to the pure graphical analysis, which shows, qualitatively, between the many probability distribution, which are the ones with less error.

Conclusion

Loading and dumping times have shown low similarity with all tested probability distributions (Log-normal, Exponential, Normal, Weibull, Gamma and Logistic), when tested using the Anderson–Darling test.

The graphical analysis has shown that the divergence is much greater with Log-normal distribution, as suggested by part of the literature (Mena et al. 2013; Chaowasakoo et al. 2017b), when compared to Logistic and Normal distributions.

Although using Log-normal distribution on a stochastic simulation may be a pragmatic choice, the error from this model was very high when this variable was isolated on the tested dataset, when compared to others such as Normal and Logistic distribution, which however having the null hypothesis rejected, are the ones that best fit the data measured in a real mining operation.

Out of the many unit operations comprising the mining load and haul cycle, loading and dumping are both the easiest to model and the ones with more reliable self-reported data. Further work should be done to evaluate other load and haul cycle's times, as well as to consolidate the models for these two more reliable variables. For the other cycle step times, it is expected more external influences and thus worse adherence to known statistical models.

Whenever there is reliable data supporting the use of a proper empirical distribution function for each of the individual processes within the load-haul-dump cycle, it would be advisable to adopt such empirical distribution since this study failed to find enough statistical evidence that any of the known tested distribution is representative of those processes.

Further investigation is needed in different mines, handling different materials and with different equipment. Automated data gathering may also be beneficial as a means of reducing the uncertainty caused by human error.

Footnotes

Disclosure statement

No potential conflict of interest was reported by the author(s).

Data availability statement

No separate dataset will be release along with this paper. All relevant data and analysis are provided within the main text.

References

Caterpillar Inc. 2013. Field guide 2013. Peoria: Caterpillar Inc.

Caterpillar Inc. 2018. Caterpillar performance handbook 48. 3rd ed. Peoria: Caterpillar Inc.

Chaowasakoo

2017a. Digitalization of mine operations: scenarios to benefit in real-time truck dispatching. Int J Min Sci Technol Chin Univ Min Technol. 27 (2): 229 – 236. doi: 10.1016/j.ijmst.2017.01.007.

Chaowasakoo

2017b. Improving fleet management in mines: the benefit of heterogeneous match factor. Eur J Operat Res Elsevier B.V. 261 (3): 1052 – 1065. doi: 10.1016/j.ejor.2017.02.039.

Curi

2014. Minas a Ceu Aberto: Planejamento de Lavra. 1st ed. Edited by Chaves

A. P.

São Paulo: Oficina de Textos.

Hustrulid

, Kuchta

, Martin

2013. Open pit mine planning & design. 3rd ed. Boca Raton: CRC Press.

Januario

LHN

, Souza

JCd.

2019. Uso Da Simulação Computacional a Eventos Discretos Para Determinar a Frota Ótima De Caminhões Em Mineração. Tecnologia em Metalurgia Materiais e Mineração. 16 (1): 51 – 56. doi: 10.4322/2176-1523.20191501.

Lanke

, Ghodarati

, Hoseinie

SH.

2016. Uncertainty analysis of production in open pit mines – operational parameter regression analysis of mining machinery. Min Sci. 23: 147 – 160. doi: 10.5277/msc162312.

Lazariv

, Lehmann

2018. Goodness-of-fit tests for large datasets. Dresden: Technical University Dresden. http://arxiv.org/abs/1810.09753.

10.

Mambo

IF.

2017. Simulação da Operação de Carregamento e Transporte Numa Mina à Céu Aberto de Carvão. Ouro Preto: Universidade Federal de Ouro Preto. http://www.repositorio.ufop.br/bitstream/123456789/7691/1/DISSERTAÇÃO_SimulaçãoOperaçãoCarregamento.pdf.

11.

Martins

AG.

2013. Simulação das Operações de Lavra da Mina de Brucutu Utilizando um Modelo de Programação Linear Para Alocar os Equipamentos de Carga. Ouro Preto: Universidade Federal de Ouro Preto. https://www.repositorio.ufop.br/handle/123456789/3148.

12.

Mena

2013. Availability-based simulation and optimization modeling framework for open-pit mine truck allocation under dynamic constraints. Int J Min Sci Technol Chin Univ Min Technol. 23 (1): 113 – 119. doi: 10.1016/j.ijmst.2013.01.017.

13.

Minitab Inc. 2013. Minitab ^® 17.1.0.

14.

Minitab Inc. 2017. ‘The Anderson-Darling statistic’, Minitab 17 Support.

15.

Rodrigues

, Pinto

LR.

2006. Análise comparativa de metodologias utilizadas no despacho de caminhões em minas a céu aberto. Belo Horizonte: Universidade Federal de Minas Gerais. https://repositorio.ufmg.br/bitstream/1843/NVEA-72CKG8/1/l_sara_fabricia_rodrigues.pdf.

16.

Soofastaei

2016. A discrete-event model to simulate the effect of truck bunching due to payload variance on cycle time, hauled mine materials and fuel consumption. Int J Min Sci Technol. Chin Univ Min Technol. 26 (5): 745 – 752. doi: 10.1016/j.ijmst.2016.05.047.

17.

Upadhyay

, Askari-Nasab

2018. Simulation and optimization approach for uncertainty-based short-term planning in open pit mines. Int J Min Sci Technol Chin Univ Min Technol. 28 (2): 153 – 166. doi: 10.1016/j.ijmst.2017.12.003.