Abstract
We present here a laboratory-based experimental protocol that seeks to establish and characterize the relationship between ground-penetrating radar attributes and the mechanical properties (density, porosity, and compressive strength) of typical industry concrete mixes. The experimental data consist of ground-penetrating radar attributes from 900 MHz radargrams that correspond to simultaneously measured physical properties of Portland cement concrete, alkali-activated concrete, and cement mortar. Appropriate regression models are trained and tested on this data set to predict each physical property from ground-penetrating radar attributes. From a small selection of individual attributes, including total phase and intensity, trained random forest regression models predict porosity (R2 = 0.83 from the instantaneous amplitude), density (R2 = 0.67 from the intensity attribute), and compressive strength (R2 = 0.51 from instantaneous amplitude). These novel relationships between physical properties and ground-penetrating radar attributes indicate that material properties could be predicted from the attributes of ordinary ground-penetrating radar scans of concrete.
Keywords
Introduction
Accurate estimates of in situ material properties are an essential component of effective execution, conservation, and maintenance of civil infrastructure, historic buildings, and geotechnical projects. In order to assess and address maintenance or conditions of a structures, the current conditions—the composition, density, porosity, corrosion, and remaining capacity—must be considered and accounted for.
Outside the laboratory, standard in situ material characterization methods for infrastructure materials include reliable invasive, but resource intensive approaches such as core sampling or neutron probes.1–3 The aim of these approaches is to estimate properties such as moisture content, strength, or porosity of construction materials. Where invasive tests are not possible, alternatives such as ultrasonic pulse velocity or rebound hammer tests must be used; these tests can be difficult to interpret or perform reliably. 4 Nevertheless, calibrated ultrasonic velocity models can predict strength and stiffness with high-correlation coefficients and low root mean square error (RMSE, less than 10 MPa).4,5
Ground-penetrating radar
Another technique, ground-penetrating radar (GPR), is often cited as being sensitive to mechanical properties, but the connection to the electromagnetic properties is not fully characterized or understood.6,7 For common applications, such as detection of rebar, layer thickness, or delamination, GPR data are collected in reflection mode from the surface of the surveyed material. Transects of GPR data are made up of traces, which are the individual “soundings” collected at discrete points along the transect. The trace is a time series containing the amplitude of the reflected signal; the relative amplitude is dependent on the contrast between the electromagnetic properties (i.e. dielectric constant
In civil engineering, GPR is applied to map and locate utilities9–11 and monitor and inspect concrete structures for defects and anomalies in the service of civil infrastructure structural health monitoring (SHM) and life-cycle management.11–14 Specifically, GPR has been used for damage identification in different ages and mixes of concrete, 15 corrosion detection and prevention,13,16 robust estimates of pavement thickness,17–19 verification of construction specifications and details,14,20–22 and identification of voids and other defects in civil structures.12,23
Attribute analysis
While most applications consider only visual interpretation of the amplitudes of the GPR data, some approaches consider other aspects of the signal. Originally inspired by reservoir characterization in the field of reflection seismics, attribute analysis is another way to analyze GPR data and make certain features of the data more apparent and less ambiguous.24–26 Attribute analysis refers to investigating a calculated feature or quantity (i.e. an “attribute”) of the data rather than qualitative interpretation of the amplitudes alone; attributes can be calculated in both time and frequency domains.27–29 GPR attribute analyses have been used to successfully differentiate between plastic, concrete, and metal,29,30 perform transient analysis of contamination, 27 estimate concrete hydration and water content, 31 estimate soil dielectrics and glacier composition,32,33 interpret subtle archeological features,28,34 and differentiate between two cures of the same concrete mix. 14
The attributes commonly used for these investigations in a civil engineering context include amplitude and frequency-based attributes such as energy,29,30,35 frequency spectra;18,29 image processing methods for automated detection or classification of reflections;9,34,36,37 transient or time-lapse attributes;
27
and a selection of complex trace attributes from seismic geophysics and image analysis such as maxima and coherence (Figure 7).34,37 Attribute analysis in these works effectively deals with complicated circumstances or ambiguous data to identify subtle geological or archeological features27,28,34 and differentiate between targets or phases.29,30,33 Amplitude attributes are dependent on a number of factors and tend to be highly variable, while frequency and phase attributes may be more robust and better at characterizing interfaces.27,37,38 Outside of hydrological applications relating soil properties (
Concrete and GPR
As a construction material, concrete is perhaps the most versatile and common; this is reflected in the prevalence of infrastructure surveys using GPR. The form and material properties of concrete can be easily controlled. Concrete has the advantages of excellent compressive strength, low relative construction and fabrication costs, and suitability to be deployed in various challenging environments (i.e. water). The primary component of modern concrete is ordinary Portland cement (OPC), which is both expensive and resource intensive to produce.40,41 Standard industry practices and relevant building codes allow for up to
The role of water in both the electromagnetic properties and the physical properties of concrete has been highlighted as an important factor to consider in the field of nondestructive material characterization using GPR or radio frequencies.10,31,46–48 Indeed, water is the parameter in concrete that dominates the behavior of the signal via controlling the dielectric constant. 49 This mirrors the relationships used in remote sensing and noninvasive estimation of soil moisture by dielectric constant.50–54 These relationships between water and electromagnetic signals are also applied to understand water content and related measures of porosity in construction materials55,56 as well as with parameters of concrete mixes.31,46 Properties, such as wave attenuation and dielectric constant, can be linked to water content and hydration of concrete.31,49,57 These physical properties are, in turn, related to the mechanical or engineering properties of the materials, such as compressive strength or Young’s modulus, by established methods such as ultrasonic pulse velocity testing.4,58,59 These relationships are beginning to prove widely useful for measuring pavement compaction.18,60,61 In pavement and asphalt applications, samples are fabricated and characterized to determine parameters for models that predict properties, such as density and compaction, helping to characterize road surface quality, consistency, and safety.17,18,61–63 Some work with GPR is being done to estimate and track the evolution of volumetric water content and dielectric properties, but mainly in the interest of understanding early age concrete hydration.7,31,49,64–66 Active research in this area focuses on estimating chloride content and volumetric water content based on the fundamental relationship these properties have with measurable GPR attributes. However, GPR reflection survey data are not yet widely used to characterize civil engineering materials or to nondestructively estimate physical properties of interest like porosity or density, for use in inspection and analysis of existing structures.
GPR and supervised learning
In both unsupervised (or “deep”) learning and supervised learning, algorithms are used to determine relationships between input and output data (regression) or make decisions about these data (classification or density estimation). 67 In supervised learning problems, the user selects a mathematical model or algorithm that will learn the appropriate relationship and classifications of a subset of sample data. That trained model is often evaluated on a set of testing data that was not a part of the learning stage. Classification is increasingly being applied to GPR data with image processing and pattern recognition algorithms that perform detection and classification of reflections.
Open access computational resources, such as scikit-learn, 68 PyTorch, and TensorFlow, 69 have enabled researchers in the fields of material science and civil engineering to use machine learning (ML) for a variety of predictions concerning concrete. While classification models help estimate the condition of concrete 70 or perform automatic hazard identification with support vector machines (SVMs) and hidden Markov models (HMMs),71,72 regression models are developed to predict material properties. 73 Scikit-learn is a Python-based module that makes a wide variety of ML algorithms for supervised and unsupervised learning available to users, prioritizing ease of use in fields outside computer science, quality implementation, and user selection of parameters. 68
It has been demonstrated that in many cases ensemble methods and genetic programming can outperform supervised learning models (such as linear regression and SVMs74–76) in predicting the final concrete strength from the design mix. Another approach to predicting compressive strength is to use nondestructive testing data in conjunction with the regression models. This approach does not rely on a priori knowledge about the concrete (e.g. mix design) and can be applied to any existing or new structure. Predictions for concrete compressive-strength-based regression models trained on ultrasonic pulse velocity and rebound hammer test data have been successful.5,59,77 Recently, deep neural network (DNN) regression has been used to predict defects in metal alloys using a relatively small data set of 487 samples by fine tuning the model to the sample. 78 Other models for the density or compaction of asphalt pavements can be calibrated using techniques such as air-launched GPR antennas.61,79,80 These models are usually calibrated with data from samples of the particular asphalt mix, either obtained by core drilling or fabricated quality assurance (QA) specimens reserved during paving operations.61,80 However, no attempt has been made to determine concrete properties using GPR scans or to make those predictions more general.
A direct GPR-based relationship between the material properties of concrete and attributes of the reflected electromagnetic response has not yet been documented. The primary objective of this article is to generate representative data in order to perform a well sampled and controlled survey to establish and validate the relationship between material properties (density, porosity, and compressive strength) and GPR attributes on the experimental data. To understand the relationship between GPR attributes and material properties, laboratory concrete samples are fabricated and tested using both noninvasive GPR scanning and direct/destructive traditional tests. The samples are tested over a period of 8 weeks, with GPR scanning of concrete beams and traditional direct tests of cylinders for porosity, density, and compressive strength. Over the concurrent testing period, a labeled data set of the material properties and GPR attributes is created. The aim is to use this novel data set to test whether correlation exists between the material properties and GPR attributes can be used to make reasonable predictions of the tested material properties. Scikit-learn, a python ML library, is used to develop, train, and test regression models relating material properties to GPR attributes.
Materials and methods
Experiment design
Concrete is selected as the material for this experiment due to the ease of fabricating samples with a range of properties and forms and for the potential transferability of the results to natural construction materials such as stone and brick. In this experiment, the physical properties of concrete samples (density, porosity, and compressive strength) from a variety of mix designs and curing conditions are measured and the relationship these properties have with GPR attributes is determined (Figure 1).

Experimental outline of concrete samples and testing protocol. The direct tests measure the labels: density
Concrete mix design
The mechanical properties of mature Portland cement concrete (PCC) can be manipulated via the ratios of water, binder, aggregates, and admixtures in the concrete mix. Reducing the w/c ratio (ratio of water to cement and other supplementary cementitious materials (SCMs) such as silica fume, flyash, and granulated blast furnace slag) can increase the strength of the mature concrete and reduce the mature porosity, which can increase the concrete’s durability.43–45,81,82 High strength or HPC mixes with low water content often include admixtures which increase the workability of the mix without changing the w/c ratio. When used effectively, these admixtures can increase the strength and durability of the concrete, but in excess can have adverse effects on the mechanical properties. 43 To get a representative sampling of concrete used in industry, an HPC with low w/c and high SCM content, normal weight concrete (NWC) with and without SCMs, and an AA mix were selected. In addition, a cement mortar (CP) is included that shares mix proportions with NWC, but with the omission of the coarse aggregate and admixtures. Variation in the final properties of these mixes is controlled by varying the SCM and varying the curing conditions 45 (Table 1 and Figure 1). The variations in mix design and curing condition for the full range of samples are summarized in Figure 1 and Table 1.
Mix design and water/cement ratios for included concrete and mortar.
AA: alkali activated; HPC: high-performance concrete; NWC: normal weight concrete.
The HPC mix designs that contain flyash are based on the study by Burg and Ost; 83 HPC with slag and HPC without SCMs are based on the NJ DOT HPC specifications; 42 the NWC designs are based on the control mixes in Folliard and Berke. 84 Flyash and slag are added to the normal weight mix in proportions specified in NJDOT 903.03.01. 42 The raw materials, including Portland cement (OPC), silicate solution, and aggregates, were acquired from commercial sources and meet relevant industry quality standards. SCMs were donated from industry (flyash and slag). The coarse aggregate was washed and air dried in preparation; the water content of the air dried aggregate was found by oven drying and the adjusted w/c ratio is reported in Table 1.
Sample fabrication
According to the relevant American Society for Testing and Materials (ASTM) standards, beams and cylinder samples from each concrete mix were fabricated from commercially available materials. Each week, between two and four sets of samples (i.e. mix and cure combinations) were fabricated. The concrete was mixed in batches using an industrial stand mixer with a capacity of about 60 pounds. First, 75% of the water, silica fume (if any), admixtures (according to ASTM C192), and coarse aggregates are mixed together. Then, the remaining binders (flyash, OPC, and so on) are added, followed by the fine aggregates and remaining 25% of the water. After each step, the batch is mixed for at least 2 min and then for at least 10 min once all components are added. The materials for each mix were measured by weight and mixed with an industrial stand mixer in 60 pound batches; two batches were mixed for each beam and between 2 and 3 batches were mixed for a set of cylinders. Consistency between batches was ensured during measurement of each component and mixing sequence. Filled molds were vibrated for 2 min to remove air bubbles and consolidate the samples. In some NWC cylinders, the admixtures contained in the mixes reduced viscosity of the freshly mixed concrete so much that vibration settled the coarse and fine aggregates into clear layers, but this was not observed in the beams (see relevant discussion of
After 1 week, the samples are removed from their molds and the first round of a series of testing is completed. The beam and cylinder samples are stored in the same conditions for the remainder of the testing period, which spans about 8 weeks for each mix (7, 14, 28, and 56 days). Each test includes both nondestructive GPR radargrams of the beams and direct destructive testing of the cylinders for density
Direct testing
Three cylinders are tested at each test date; a total of 219 cylinders were tested (Figure 2). First, the density of all three cylinders was determined geometrically using the measured mass and volume. Two of the samples were then tested in direct compression with an Instron load frame, and the third was used to estimate the fractional porosity (

(a) Samples during covered and exposed curing, (b) cylinders ready for testing, (c) fractional porosity testing by cold water saturation, and (d) direct compressive testing of cylinder.
A summary of the physical properties in relation to the final data set and the distribution of those properties is presented in Table 2 and Figure 3. Note that the large range of values, particularly for porosity and compressive strength, are due in part to the development of properties as the samples age (Figure 3). The density is relatively constant over time; while porosity follows the expected increasing trend, compressive strength values did not uniformly increase over time and had high variability. The distribution of material property values across all the mixes is shown in Figure 4.
Summary of physical tests performed on samples.
NWC: normal weight concrete; HPC: high-performance concrete; AA: alkali activated; CP: cement mortar.
The standard deviation reflects all tests for each mix.

Density, strength, and porosity trends visible over time.

Histogram of the three physical properties. Note the size of each experimental sample: for each mix, three samples are tested for density, two for compressive strength, and one for porosity.
The experimental data can also be visualized using a pair-plot between the various material properties to determine any correlation between the measured quantities (Figure 5). The pair-plot shows distribution of a single material property (the diagonal terms) and the correlation between pairs of properties (non-diagonal terms). Though there are general trends, such as decreasing porosity with decreasing compressive strength, the non-diagonal terms show that no direct correlations can be found between any pair of material properties and hence the correlation between the electromagnetic properties and the physical properties of concrete will be developed individually.

Pair-plot of the material properties. Since the number of samples for each of the physical properties is different, the pair-plot is created using only those samples for which all the three property values are known.
Data processing and model development
GPR data collection
GPR traces were collected in tandem with the direct material testing using the available 900-MHz antenna. At each date when physical tests were performed, the beams were simultaneously scanned while resting on the ground (with a visible reflection at that depth, Figure 6). For each beam, two radargrams or B-scans were collected parallel to the rebar in the longitudinal direction (for repeatability) and one stationary transient set of traces or A-scans were collected with the antenna oriented perpendicularly to the rebar and main axis of the beam. In all cases, the antenna and collection settings were held constant. The 900-MHz antenna was selected to give a good recording of the data within the concrete rather than to resolve features such as pores and aggregate in the concrete itself; the main reflections in the radargrams are the direct or ground wave, the bottom of the beam, and sometimes the reflection from the rebar. The resolution of the antenna and the sampling rate are relatively low for capturing a distinct reflection from the top of the beam and the rebar; in beams where the dielectric constant is high and the signal velocity is low, such as at very early ages, the rebar reflection can be distinguished. In the case of mature concrete, the reflections are from the top and bottom of the beam. The rebar in Figure 6 is not clearly visible, as the radargram was collected more than 14 days after pouring.

(a) GPR scanning configuration for beams, showing reading unit (gray) and antenna (orange) in transverse orientation and longitudinal orientation (inset), (b) preprocessed GPR radargram, and (c) variance in one attribute shown for a single beam.
GPR preprocessing and attribute calculation
GPR data were preprocessed and trimmed before computing the attributes. Gaining or amplitude recovery was not performed to preserve the relative reflection amplitudes. The initial reflections were trimmed at the “first break” according to the STALTA algorithm, as in Wong et al. 86 and Allen 87 and the ends of the traces were truncated at the second arrival of the beam reflection, which is where the signal amplitude has decayed to the average noise level in the radargrams. The traces were resampled to 512 samples and then dewowed by removing a running average along the trace (in time) to center them at 0. 8 Because the radargrams were dominated by reflections from the top and bottom surfaces of the beams and the rebar reflection has low visibility, no attempt was made to account for the rebar reflection during attribute calculations. A total of 177 radargrams were collected, some of which correspond to beams that contained two different mixes. The longitudinal radargrams that contain two different mixes were split and then matched with the correct values of the properties (labels). The three radargrams for each sample and for each test date were preprocessed and then the attributes of the traces in those radargrams were computed and used in the independent predictions of density, porosity, and compressive strength. The attributes are computed from 150 traces from the center of each radargram or from the center of one-half of the radargram for two-mix beams, then averaged. In the case of the instantaneous amplitudes, this produces the average envelope amplitude as in Pettinelli et al. 32 The three GPR radargram attributes were concatenated to form the feature vectors. For attributes that are the same length as the traces (trace attributes), the feature vector contained 1536 values, while summary attributes contained 150 values from each of the radargrams for 450 values.
A summary of the 23 attributes considered in the algorithm is presented in Table 3 and some are shown in Figure 7. This selection of attributes includes attributes that are shown in the literature to have either correlations with properties of interest or reliable performance relative to environmental variation. The selected attributes capture the attenuation, surface reflection, and scattering of the GPR signal. Each attribute was used independently to train a model, except in the case of a set of summary attributes, which were used together and shown in bold in Table 3. All attributes are described in the Appendix 1 and the most successful attributes are presented in the results. Each attribute was used individually to train a model and one additional model was trained on the list of summary attributes. This group of summary attributes contained a list of the average attribute value from each radargram (of 150 values) for each summary attribute. Note that the centroid frequency is the weighted average of all the frequencies found in the traces (using a Fourier transform), while the centroid of instantaneous frequency is the centroid of the instantaneous frequency (found with the time derivative of phase). The attenuation ratio
Summary of included attributes with summary attributes shown in bold (see Appendix 1).

Sample processed trace and the corresponding selected attributes. Additional summary attributes are listed with the most closely related attributes.
Development of regression models
To explore the correlation between the electromagnetic properties of GPR reflections in concrete with its material properties, ML models were developed and tested. In general, an ML pipeline includes two steps, namely, a training phase and a prediction phase. In the training phase, a subset of the data called the training set is used to learn the model. In the prediction phase, the learned model is evaluated based on its ability to predict values (labels) for the remaining testing or validation data that were withheld from the training phase. The ML model and associated software developed by the authors used the training data and cross validation to train five different regressors selected based on their performance with small data sets. 68 The baseline model predictions of the property values from the most important features were used to further tune the model with random search and grid search before arriving at the final model (Figure 8).

Overview of the ML pipeline for training and testing the prediction models.
To understand the problem at hand, consider the GPR traces as the data set
The biggest challenge with the current approach was the small data set available to create regression models for the prediction of material properties. In order to have enough samples to create a reliable model for porosity and compressive strength, the following data imputation approaches were adopted (also see Table 2). In the case of compressive strength, the missing measurement was replaced with the mean. Mean substitution is a standard practice in data science even though it results in statistically correlated samples. 89 For porosity, since only one value of measurement per specimen is available, the authors had impute missing values in order to have a minimum number of samples for supervised ML models as reported in the literature. 90 While this introduces error in porosity, the small variability of GPR measurements (see Figure 6(c)) as well as averaging the attributes over the entire length of the specimen minimizes this error and does not affect the results significantly.
The major components of the ML model and associated software are as follows:
Train–test split: The new imputed samples were divided into training and testing data sets in a stratified manner. Splitting the data into a testing and training set does have some impact on the predictions, so different methods were tested to divide the training and testing data.
91
Proportions of 80% training and 20% testing data were used because it provided the best models by
Feature selection: The attributes computed from the GPR radargrams have a large number of features (e.g. 512, except in the case of summary attributes calculated from individual traces like total energy). To improve computational efficiency, feature selection was performed to reduce the size of the training set. The feature importance was computed based on the
Training the models: Though uncommon in material science, a large training set is beneficial to learn the model.
93
In this work, the following five algorithms in sklearn, which are better suited for small data sets, were used for the material property prediction based on GPR attributes.
68
K-nearest neighbor (KNN): KNN is a non-parametric method used for regression. The output value, Support vector regression (SVR): SVR is also a non-parametric technique based on SVM analysis, first identified in 1992.
95
It uses kernel functions for regression and classification, with this work using linear and radial basis functions as the kernels.
68
Ridge regression (RR): RR is a linear regression model where the loss function is the linear least squares function and regularization is given by the Random forest (RF): RF is an ensemble regression method that fits decision trees on a subset of the whole data set. This overcomes the issue of overfitting commonly faced by decision trees and improves the accuracy of the model. The predictions are based on the mean of the output of all the decision trees.68,97 AdaBoost (ADA): ADA is also an ensemble method and works on the principle of combining the results of a number of weak regressors to obtain a strong learner.98,99 The regressor is considered adaptive as it modifies the weights of the different learning models in subsequent training steps based on the error of current step predictions.
68
All the above models were trained using the reduced training set by choosing a baseline set of hyper-parameters, which are parameters of the algorithm itself rather than the particular model. For example, the random forest model allows hyper-parameters such as the number of estimators and the depth of the tree to be set. 68 The number of estimators was chosen as 1000 for the baseline model and other hyper-parameters were left at default values.
Fine tuning models: The best baseline models were fine tuned to improve the model predictions. First, the feasible set of hyper-parameters was reduced using a randomized search.
100
Then, the values for the reduced set of hyper-parameters were tuned with a more exhaustive grid search. These methods used the cross-validation score to find the best parameters for each model, thereby enabling determination of the correlation between the GPR radargram attributes and physical properties. The best model was used to predict the material properties of the training set. The efficiency of the model was evaluated using the
Validation: The most successful models selected for each property based on the cross validation scores were used to predict the material properties of the test set

Distribution of the train–test split (stratified approach, 80/20) for model creation for (a) density (kg/m3), (b) compressive strength (MPa), and (c) porosity (volume fraction). Compressive strength and porosity values are shown after data imputation so that the total number of samples for each property is the same.
Since one aim of this work is to establish that there is a correlation between the attributes and the properties, the existence of a successful model demonstrates that there is correlation (i.e. the function
Results and discussion
The predictions of the ML models are presented in this section. Once models are trained and the relationship between GPR attributes and measured material properties is established with the training data, the testing data can be used to make predictions of the material properties.
Successful attributes
Of the complete set of 23 attributes, models trained on about half of them are successful (see Table 3 and Appendix 1):
Instantaneous amplitude and maximum instantaneous amplitude
Instantaneous absolute amplitude
Intensity and total energy
Total phase
Centroid of instantaneous frequency
Attenuation ratio and constant for top and bottom reflections (
Successful attributes are those that provide meaningful predictions when the appropriate model is applied to the validation data. The successful attributes are described in more detail here and can broadly be considered in two categories: summary attributes such as total phase or centroid frequency present one value per trace and are averaged over the whole radargram, and trace attributes such as intensity or instantaneous amplitude present one average attribute with the same length as a trace. The variance within a single attribute is shown in Figure 6(c), demonstrating that the mean “trace” is a good representation to use and that the features have a reasonable level of variance and consistency. Most of the successful attributes are directly related to the instantaneous amplitudes of the traces. The most successful attributes are derived from the complex trace found using the Hilbert transform, such as the instantaneous amplitude, which is the real component, and the intensity, which is the squared magnitude.
32
The corresponding summary attributes are also successful, including the maximum real amplitude, the maximum instantaneous amplitude, and total energy, which is normalized and calculated by finding the total intensity of the trace.
30
Four further summary attributes are also successful: total phase, instantaneous frequency centroid, and the attenuation ratio and constants. Total phase is found by summing the unwrapped phase of the instantaneous signal.
7
The instantaneous frequency is the time rate of change of the unwrapped phase; the centroid of the instantaneous frequency (not to be confused with the centroid frequency) is the weighted average of the instantaneous frequencies in the trace. The centroid frequency is the centroid of the frequency components of the signal obtained from the Fourier transform, weighted by the frequencies present in the signal. Two slightly different measures of attenuation are also successful attributes: the attenuation constant
Correlation between GPR attributes and physical properties
The results of the model development process demonstrate that there is a nonlinear relationship between the GPR attributes and the measured physical properties. The RF models are able to explain variation in the measured properties (Figure 10). In Figure 10, the value of the property measured in the lab is plotted against the value of the property predicted by the attribute-based RF model. The dashed line represents a perfect prediction, where the measured value is the same as the value predicted from the GPR attribute. Because regression models were found in a robust manner that controls for overfitting, the high accuracy of the models during training indicates that there is indeed a correlation between the GPR attributes and the physical property being predicted. In the case of porosity, the data imputation procedure results in one true value of porosity corresponding to three GPR radargrams/sets of attributes and predicted values. Because of variation between the radargrams themselves, the predicted values for those three samples are different.

In the training set, high correlation between the attributes and material properties is demonstrated by the tuned regression models. The x-axis shows the measured (“true”) value of the property and the y-axis shows the predicted value, meaning that the dashed line represents perfect prediction
Model validation
The ability of the models to predict material properties is demonstrated using the testing data,

Comparison of the baseline prediction of the model for three different training–testing data sets obtained by choosing three different random states. The random states used for the three figures were (a) 0, (b) 10, and (c) 42.
The relative prediction error and the percent error (relative error × 100) for each property are shown in Figure 12. In general, there are larger errors for smaller values of porosity, but for density the opposite is true. In the case of porosity, this is likely due to the limited resolution of the cold water saturation method, which has been shown to estimate 10%–20% lower porosity than the vacuum saturation method because it does not capture closed or dead-end pores. 85 The relative error is across all the mixes and not dependent on the mix used for training and testing as shown in Figure 14. The models would be able to predict the porosity better if a different approach for porosity measurement, especially one that accounts for dead-end pores, was used. Furthermore, because the values are very low, small deviations between the measured and predicted value produce large percent errors in excess of 50%. For higher values of fractional porosity (above 0.05), the percent error decreases to less than 25% (Figure 12). In the case of density, the main source of errors is the predictions rather than the measured values. 45 Still, the relative error for density is fairly low, around 6% (Figure 12). There are demonstrated challenges (errors up to 50%) in determining the compressive strength from nondestructive methods. 59 In this work, the maximum percent error (omitting outliers) for compressive strength is around 35% (Figure 12).

Column (a) presents the prediction error as a function of the predicted property (porosity is plotted as fractional porosity
Overall, the predictions present a general trend where large values are overpredicted and small values are underpredicted, particularly in the case of compressive strength (Figure 13). This is a weakness of the modeling approach that is likely explained by the combination of a noisy and stratified set of labels. The model assumes a homogeneous distribution that is not actually present in the stratified data. The set of labels could be improved by removing outliers, but this could reduce the transfer of the models to other data sets and could introduce unnecessary bias.

Column (a) shows baseline results of the best performing models. Column (b) shows final results of the best performing model after the parameters have been tuned. Results are shown for each property: fractional porosity, compressive strength (MPa), and density (kg/m3).
The range of measurements of material properties (see Figure 3) in the chosen specimens affects the accuracy and sensitivity of the predicted properties and helps explain the relative success of each prediction. Both the measurement range and the modeling approach affect the predictions. Density has the smallest measurement range and highest repeatability of the three considered property measurements; in this case, the model accounts for a small amount of variance from the measured labels. GPR attributes and reflections comprise the bulk of the variability in the density predictions (i.e. mortar vs NWC samples have density variation controlled by the aggregate content, which changes the GPR reflections and attributes). The porosity measurements, while limited by the accuracy of the cold water saturation method, have relatively high precision or repeatability. Because of the data imputation, the porosity models account for variance primarily from the features rather than the labels. Compressive strength, however, has higher measurement range and much lower repeatability because the method is highly sensitive to flaws in individual samples. Compressive strength models therefore must account for high variability in the measured labels as well as the variance in the features. Since all models use the same set of features, the better predictions for porosity and density can be attributed to the lower measurement error of those properties. Density predictions experience higher variance in the GPR attribute labels than the property measurements. Compressive strength is poorly predicted, which is primarily influenced by the measurement range and high variance in those labels.
Another factor in the relative success of the predictions is fundamentally related to the attenuation of GPR signals. The attenuation of GPR signals in concrete is a dominant effect that is well described by the amplitude attributes in the concrete beams and can be related to the physical properties. Because the GPR
Because amplitude attributes capture attenuation, they can used to make predictions and explain why porosity and density are better predicted by the attributes. Porosity and density are two properties that directly influence signal attenuation. The experiment design, consistent sample geometry, and scanning protocol allows attenuation to be captured in the amplitude attributes. The models suggest that porosity can be more directly measured (and better predicted) by amplitude attributes because there is more scattering and higher attenuation caused by the higher porosity (i.e. lower amplitudes, especially later in time). This finding is consistent with amplitude attributes in concrete in other work14,57 and encourages further investigation to refine the correlation between attributes and attenuation to be used in predictions of physical properties.
Conclusion
This work identified preliminary fundamental relationships between the physical properties of concrete and GPR signal attenuation phenomena, which are recorded in the amplitudes of GPR reflections and robustly captured by GPR attributes like intensity and instantaneous amplitude. These relationships can be leveraged to make predictions of two of the three measured properties in this work (porosity and density) from GPR attributes, but prediction is not possible for compressive strength with

Porosity prediction using different mixes for the baseline model.
Future work will validate these models on real field data and further define the relationships by finding attributes or combinations of attributes that better capture the strength of the concrete, making predictions of compressive strength based on the attribute-predicted porosity and density 102 or by combining the GPR attributes with other nondestructive testing data. In addition, higher frequency antennas could be used to scan the beams and obtain different attenuation characteristics and better reflections from the rebar, which would enable better estimates of the dielectric constant, which in turn has a strong effect on both the GPR signal and is indicated in physical properties.31,66
Footnotes
Appendix 1
Acknowledgements
The authors would like to acknowledge the support of Claire White and Joe Vocaturo in designing and conducting the experiment and Victor Charpentier and Jack Reilly for their assistance managing the samples.
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: I.M.M. was financially supported by NSF GRFP (No. 1148900). Any opinions, findings, conclusions, or recommendations expressed herein are those of the authors and do not necessarily reflect the views of the funding agency.
