Abstract
Vehicle routing optimization has proven effective in reducing enterprise operating costs and improving customer satisfaction in logistics management. However, existing studies rarely incorporate customer value differentiation into operational decision-making processes, resulting in a disconnect between service priority and customer importance in cold chain logistics. To bridge this gap, this paper proposes a customer-centered optimization model that prioritizes service for high-value customers through differentiated penalty functions for time window violations. The objective function minimizes the total cost, including carbon emission, energy consumption, fixed cost, refrigeration, cargo damage, courier waiting, and customer penalty costs. A genetic algorithm (GA) was developed and validated against CPLEX on small-scale instances, achieving an absolute optimal solution gap within 2.0%. For medium- and large-scale instances, CPLEX encountered memory errors, while the GA efficiently obtained feasible solutions for all instances. Experiments were conducted on instances adapted from Solomon benchmarks with three customer distribution patterns and five random seeds. A comprehensive sensitivity analysis confirmed the robustness of the optimization approach, ensuring that high-value customers do not experience time window violations. The proposed penalty function for different customer types achieved the best performance in relation to total cost. These findings provide decision support and a theoretical basis for customer-centered cold chain logistics operations.
Keywords
Introduction
As a crucial link in ensuring the quality of fresh products, cold chain logistics directly affects customer satisfaction through factors such as transportation efficiency, delivery timeliness, and product quality ( 1 ). In recent years, the rapid development of e-commerce and emerging retail models has led to a continuous increase in demand for cold chain logistics within new business formats such as community group-buying ( 2 ). Meanwhile, consumers’ demands for the freshness of fresh products and delivery timeliness have further compounded the challenges faced by cold chain logistics in relation to operational efficiency and service quality. In addition, against the backdrop of carbon emission reduction targets, the low-carbon transformation of cold chain logistics has become an imperative requirement for the industry’s development ( 3 ). In this context, vehicle routing optimization, as a core component of logistics system optimization, directly affects enterprises’ operational costs, service levels, and the progress of low-carbon transformation, thereby serving as a critical breakthrough point for addressing the prevailing development dilemmas in cold chain logistics.
The vehicle routing problem (VRP) aims to achieve optimal objectives, such as minimizing transportation costs and maximizing service quality, by optimizing vehicle routes. Over the past decades, VRP research has progressively evolved from single-objective models with limited constraints to complex scenarios involving multiple objectives and diverse constraints, resulting in rich theoretical achievements. To address the diverse requirements of real-world logistics scenarios, the VRP has evolved into numerous representative variants. These mainly include the capacitated vehicle routing problem (CVRP) ( 4 ), the vehicle routing problem with time windows (VRPTW) ( 5 ), the green vehicle routing problem (GVRP) ( 6 ), the vehicle routing problem for cold chain logistics (VRPCCL) ( 7 ), and the multidepot vehicle routing problem (MDVRP) ( 8 ). For cold chain logistics, the VRP must also consider additional factors such as refrigeration requirements and cargo damage, thereby forming the VRPCCL.
Although significant advancements have been made in model development, there is still a crucial shortcoming in the existing literature, namely the homogeneous treatment of customers. Most VRPCCL models assume that all customers have the same importance to the enterprise and therefore adopt uniform service priorities and penalty functions. This assumption fundamentally contradicts the reality of customer relationship management, as enterprises usually differentiate customers based on their value. High-value customers (HVC) expect and deserve priority services, while lower-value customers may tolerate a certain degree of delay without causing serious damage to the enterprise. Neglecting customer value differentiation in operational decision-making thus creates a significant disconnect between customer management strategy and vehicle routing optimization.
To bridge this gap, this paper proposes a customer-centered cold chain vehicle routing optimization model to reflect the heterogeneity of customer value. Specifically, we introduce a differentiated penalty function that imposes higher penalties for time window violations by HVC, thereby directly embedding service priority into the model. To realize this contribution in the cold chain logistics scenarios, this model considers a series of complex operational factors that interact with customer service priority. Specifically, the model incorporates: (i) a heterogeneous fleet consisting of fuel-powered and electric vehicles; (ii) simultaneous pick-up and delivery requirements; and (iii) refrigeration, cargo damage, and carbon emissions. Although these elements are important, they are all auxiliary components that enable the customer-centered vehicle routing model to accurately reflect real-world application scenarios. In addition, a genetic algorithm (GA) has been developed to solve the proposed model. To evaluate its performance, we conducted experiments on small-, medium-, and large-scale instances.
This paper is composed of six sections. The initial section is the introduction. The second section is a literature review. The third section presents the methodology. The fourth section describes the algorithm design. The fifth section covers the numerical case application. The final section presents the conclusions and suggestions for future research.
Literature Review
The VRPCCL is an extension of the VRP to cold chain logistics scenarios. This problem requires additional consideration of refrigeration, cargo damage, and other factors. Aligned with the research focus of this paper, this section presents a systematic review of the CVRP, the VRPTW, the GVRP, the vehicle routing problem with simultaneous pick-up and delivery (VRPSPD), and the heterogeneous fleet vehicle routing problem (HFVRP).
Capacitated Vehicle Routing Problem
As a fundamental extension of the VRP, the CVRP incorporates vehicle capacity constraints, thereby more accurately reflecting resource limitations in real-world logistics distribution. Dantzig and Ramser ( 9 ) first formulated the mathematical programming model for the CVRP, establishing a theoretical foundation for subsequent research. The classic CVRP usually assumes that customer requirements are deterministic, the objective function is single, and only a single capacity constraint is considered. However, such simplified assumptions are insufficient to fully depict complex logistics scenarios. To enhance the applicability of the model, researchers gradually introduced various constraints and proposed a series of extended CVRP models. Gounaris et al. ( 10 ) investigated the robust CVRP under demand uncertainty. To minimize delivery costs, a tabu search algorithm was employed, and the relationship between the chance-constrained CVRP and the robust CVRP was analyzed. Alesiani et al. ( 11 ) employed clustering methods to reduce the problem scale of the CVRP. The findings reveal that this approach can effectively improve solution efficiency and is thus well suited for the rapid resolution of large-scale CVRP instances. However, the majority of existing research on the CVRP focuses either on a single constraint or on the integration of a set of constraints, while largely overlooking customer value heterogeneity. At the same time, there is a lack of deep integration with heterogeneous fleets and simultaneous pick-up and delivery in cold chain logistics scenarios, which limits the adaptability and practicality of models that require differentiated customer service.
Vehicle Routing Problem with Time Windows
The VRPTW, as a significant extension of the VRP, incorporates time window constraints for each customer node. Time windows can be classified into two categories according to the strictness of their constraints: hard time windows and soft time windows ( 12 ). In 1987, Solomon conducted the first systematic study on the VRPTW, proposing an insertion-type heuristic algorithm and validating its effectiveness through extensive numerical experiments ( 13 ). However, in practical logistics operations, hard time windows often prove inflexible in addressing real-world uncertainties because of their strict constraint requirements. Consequently, the VRPTW considering more flexible soft time window constraints has gradually become an important research direction.
In the VRPTW model with soft time windows, vehicles are permitted to arrive at customer nodes outside the predefined time windows. However, penalty costs must be incurred for either early or delayed arrivals, and such costs should be integrated into the optimization objective function. The design of the penalty function is crucial, as it can reflect the relative importance of timeliness for each customer. However, most studies adopt a uniform penalty function applicable to all customers. Russell and Urban ( 14 ) developed a linear penalty function for the VRP with soft time windows and employed a tabu search algorithm. The results showed that the proposed model is applicable to a broad range of practical scenarios. Fang et al. ( 15 ) adopted a linear penalty function for the VRPCCL and designed a hybrid ant colony optimization algorithm. The effectiveness of their approach was further validated through comparative analysis on benchmark instances. Zhang et al. ( 16 ) investigated the multivehicle routing problem with soft time windows, proposed a piecewise linear penalty function, and employed a multiagent reinforcement learning approach. Their experimental results demonstrate that this method outperforms both Google OR-Tools and traditional methods in relation to solution quality. Fu et al. ( 17 ) designed a unified penalty function and adopted a tabu search algorithm to address various VRPTW variants, further confirming the superiority of their method.
A notable exception is the research by Yu et al. ( 18 ), who proposed a customer classification method based on Recency, Frequency, Monetary Value analysis of actual transaction data. They employed the Density-Based Spatial Clustering of Applications with Noise algorithm to identify three distinct customer groups and designed corresponding penalty functions for real-time delivery routing. Their results indicated that a customer-centered delivery strategy significantly improved the service timeliness for key customers. However, their model was developed for urban real-time delivery and did not address the complexities of cold chain logistics, such as refrigeration, cargo damage, heterogeneous fleets, and carbon emissions. Our research extends this customer-centered logic to the more complex context of cold chain logistics.
Green Vehicle Routing Problem
With the continuous advancement of dual carbon strategic goals, the carbon emissions of cold chain logistics have drawn increasing attention because of characteristics such as the continuous operation of refrigeration systems and high-energy consumption during transportation. Integrating carbon emission considerations into the VRP has emerged as a significant research field. Erdoğan and Miller-Hooks ( 19 ) proposed the GVRP model and developed a corresponding solution algorithm, aiming to assist enterprises operating alternative fuel-powered vehicle fleets in overcoming operational challenges arising from limited driving ranges and insufficient refueling infrastructure. Liu et al. ( 20 ) further considered the impact of carbon tax policy by developing a joint distribution GVRP model for cold chain logistics enterprises, employing a simulated annealing algorithm. Their study shows that compared with a single distribution model, joint distribution has significant advantages in reducing total operating costs and carbon emissions. Ge et al. ( 21 ) developed a multivehicle routing optimization model with time windows that incorporates carbon emission factors and proposed a hybrid genetic algorithm. The results indicate that variations in key parameters of the carbon trading mechanism significantly influence the total cost of logistics distribution. Wang et al. ( 22 ) established a heterogeneous fleet GVRP model with soft time windows, specifically incorporating urban traffic restriction constraints, and proposed an improved ant colony optimization algorithm. Their research findings provide valuable decision-making support for government authorities in formulating traffic management policies, while simultaneously offering effective guidance for logistics enterprises. While carbon emissions have been extensively incorporated into GVRP research, the integration of environmental objectives with customer value considerations remains unexplored. Current models optimize both costs and emissions simultaneously but treat all customers equally, which may force HVC to bear delays. This highlights the need to establish models that can balance multiple objectives while also considering differences in customer values.
Vehicle Routing Problem with Simultaneous Pick-up and Delivery
VRPSPD represents a significant variant of the VRP. Its primary objective is to efficiently fulfill customers’ simultaneous demands for both pick-up and delivery services. This model effectively reduces deadhead travel distances, enhances vehicle utilization efficiency, and reduces operational costs. Angelelli and Mansini ( 23 ) investigated the VRPSPD in a single depot with a homogeneous vehicle fleet, aiming to minimize the total vehicle travel distance, and employed a branch-and-bound algorithm. Wang et al. ( 24 ) established a mixed integer programming model integrating simultaneous pick-up and delivery to minimize total costs and proposed a parallel simulated annealing algorithm. Lei and Hao ( 25 ) proposed a memetic algorithm for the VRPSPD, further enriching the solution methods. Liu et al. ( 26 ) established an optimization model aimed at minimizing vehicle cost and travel distance cost for the VRPSPD under time window constraints. An adaptive brainstorm algorithm was employed, and the effectiveness and practicality of the proposed approach were validated through extensive experiments, including small and large-scale instances as well as a real-world case. Existing VRPSPD research shares the common limitation of customer homogeneity. The distinction between pick-up and delivery demands adds operational complexity, but the fundamental question of which customers should receive service priority remains unaddressed. Moreover, most studies assume homogeneous fleets, ignoring differences among vehicle types that affect cost and carbon emissions. As a result, these models are difficult to adapt the operation needs of logistics enterprises.
Heterogeneous Fleet Vehicle Routing Problem
In existing VRP research, some scholars assume a homogeneous fleet in which vehicles are identical in key parameters such as capacity, operating cost, and speed. However, in real-world logistics scenarios, enterprises often operate multiple types of vehicles, thus forming the HFVRP. Song et al. ( 27 ) investigated the VRPCCL considering time window constraints, different vehicle types, and energy consumption, and developed an improved artificial fish swarm algorithm to minimize total operational costs. The effectiveness of their approach was validated through numerical experiments. Kopfer and Vornhusen ( 28 ) developed a mixed integer programming model for VRPs incorporating time windows, charging infrastructure, and heterogeneous fleets, which was subsequently solved using commercial optimization solvers. Wang et al. ( 29 ) investigated the electric VRP for heterogeneous fleets incorporating nonlinear charging functions, formulated a mixed integer linear programming model, and validated the effectiveness of their approach through numerical experiments of varying scales. Zhao et al. ( 30 ) investigated the VRP with time windows for heterogeneous fleets under stochastic demand conditions and proposed a hybrid algorithm integrating simulated annealing with variable neighborhood search. Although existing research on hybrid VRPs more accurately reflects the actual vehicle fleets of logistics enterprises, these studies have not incorporated customer value differences into the models, nor have they designed differentiated penalty mechanisms for different customer types. In addition, most studies fail to realize the integrated optimization of heterogeneous fleets and simultaneous pick-up and delivery, which makes it difficult to effectively adapt these models to modern logistics operations.
Research Gaps and Contributions
Although existing VRP research has gradually expanded from single constraint to multiple constraints and from homogeneous fleets to heterogeneous fleets, gaps still remain. First, most existing research generally adopts the assumption of customer homogeneity, ignoring the core factor of customer value differentiation. It fails to classify customer types or design corresponding differentiated penalty functions according to their importance to the enterprises, which cannot adapt to the needs of differentiated customer management in logistics enterprises. Second, most vehicle routing optimization models fail to differentiate between cargo damage costs incurred during the pick-up and delivery process, leading to an inaccurate cost calculation. Third, existing models lack an integration of customer value with operational constraints. The question of how to design penalty functions that reflect customer importance while accounting for heterogeneous fleets, SPD, carbon emission, refrigeration, and cargo damage has not been addressed.
To address the aforementioned research gaps, with customer value differentiation as the core point and HVC service guarantee as the primary goal, this paper integrates the actual operation requirements of cold chain logistics and makes the following key contributions:
A customer-centered optimization model was constructed. We designed the VRPCCL objective to incorporate the diversity of customer value. Differentiated penalty functions were designed and directly embedded into the model to ensure that HVCs are not disadvantaged.
Integration with the complexities of cold chain logistics. To ensure that the customer-centered model can operate effectively in real-world decision-making scenarios, we have systematically integrated it with the key characteristics of cold chain logistics: heterogeneous fleets, SPD, and comprehensive cost components.
A genetic algorithm and comprehensive sensitivity analysis. We develop a genetic algorithm to solve the proposed model and conduct systematic experiments on instances generated from Solomon benchmarks, covering three customer distribution patterns (C-type, R-type, RC-type) with five random seeds. Subsequently, we carry out a comprehensive sensitivity analysis.
In summary, the research findings offer valuable decision-making support and a theoretical foundation for cold chain logistics enterprises aiming to realize differentiated customer service.
Methodology
This section elaborates on the methodology of the VRPCCL model, which incorporates various customer types and the SPD of a heterogeneous fleet under carbon emissions. First, the problem description is provided. Second, we categorize customers and establish corresponding penalty functions. Third, the notations used in the model are introduced. Then, a VRPCCL optimization model is established to minimize total cost. Finally, the constraints related to the VRPCCL model are formulated.
Problem Description
This paper extends previous studies on the VRPCCL by incorporating additional factors such as heterogeneous fleets, customer types, and SPD. We focus on a single distribution center (DC) for fresh goods equipped with both electric and fuel-powered vehicles and develop an optimization model to determine the optimal vehicle routes with the objective of minimizing total cost. The model integrates several key factors: service time, customer demand (the demands of the recipients and senders of goods), penalty functions for different customer types, courier waiting time, carbon emissions, and vehicle capacity throughout the pick-up and delivery process.
Figure 1 presents a schematic overview of the VRPCCL, illustrating the DC, customer nodes, routes for electric and fuel-powered vehicles, time windows, and customer demands. Customers are classified into three categories: high-value, potential-value, and low-value. In addition, there are three distinct types of demand: pick-up only, delivery only, and simultaneous pick-up and delivery. Vehicles depart from the DC and return to it after fulfilling all customer demands.

Vehicle routing considering customer types and the simultaneous pick-up and delivery of heterogeneous fleets.
The model is based on the following assumptions:
Parameters including pick-up and delivery demands, service time, and time windows are given. Each customer node must be served exactly once by a single vehicle, and its demands cannot be split. Customers arrive at the time window lower bound (TWLB) precisely.
All vehicle information at the DC is given. Each vehicle departs with sufficient energy to complete its assigned tasks at a constant speed, without exceeding its capacity.
The quality of fresh products is maintained under fixed refrigeration temperature requirements.
Customer Type and Penalty Function
Customer value heterogeneity is a crucial factor in real-world logistics operations, as different customers contribute differently to the long-term profitability of the enterprise. Based on the customer classification established by Yu et al. ( 18 ), who demonstrated the effectiveness of value-based classification in vehicle routing optimization using transaction data, we adopt three types of customers: HVC, potential-value customers (PVC), and low-value customers (LVC).
Ideally, customer classification should be determined based on the observed consumption frequency and amount ( 18 ). However, since customer data cannot be obtained in cold chain logistics scenario, we cannot directly use the Solomon benchmark instances. Therefore, in this study, we simulate customer types by randomly assigning each customer to one of three categories, following a reasonable distribution inspired by the Pareto principle and the research results of Yu et al. ( 18 ). Specifically, we classify approximately 30% of the customers as HVC, 40% as PVC, and 30% as LVC.
Based on the three customer types, we have designed differentiated penalty functions for time window violation. Following the common practice in VRPTW research ( 15 , 31 ), we adopt penalty functions with different coefficients for each customer type. The design of the penalty function aims to reflect the different tolerances of each customer type. The specific forms are defined as follows:
(1) High-value customers
HVC are crucial to logistics enterprise and require strict punctuality in pick-up and delivery services, with substantial penalties imposed for time window violations. The time window for a customer is given as
(2) Potential-value customers
PVC are the focus of the logistics enterprise’s development, with the potential to evolve into HVC. Accordingly, their time windows should be satisfied to the greatest extent possible. If the vehicle arrives after the TWUB, that is,
(3) Low-value customer
LVC occupy a marginal position within the operational priorities of logistics enterprises; therefore, it is unnecessary to allocate excessive resources to guarantee service within their designated time windows. When the vehicle arrives after the TWUB, that is,
(4) The relationship between penalty functions
If the vehicle arrives after the TWUB, that is,
Notations
Table 1 presents the parameters, symbols, and decision variables involved in the model.
Notations of the Model
Note: CNY = renminbi; DC = distribution center.
Model Formulation
This section presents the formulation of the VRPCCL, including the objective function and constraints.
Objective Function
This study aims to minimize the total cost
Energy Consumption Cost (C1)
Vehicles consume energy during operation. Given the heterogeneous fleet, energy consumption costs are calculated separately for each vehicle type. For fuel-powered vehicles, the energy consumption cost corresponds to the fuel consumption cost, estimated using a load-based fuel consumption model ( 32 ). For electric vehicles, the energy consumption cost is defined as the electricity consumption cost, calculated through a distance-dependent electricity consumption ( 33 ), as specified in Equations 5 to 7.
Carbon Emission Cost (C2)
Carbon emissions from fuel-powered vehicles are primarily calculated based on fuel consumption and the carbon emission coefficient ( 34 ). In contrast, electric vehicles operate on electrical energy and produce zero direct emissions. However, given that electricity generation is predominantly reliant on thermal power sources, indirect carbon emissions are still generated ( 35 ). The corresponding carbon emission costs are formulated in Equations 8 and 9.
Fixed Cost (C3)
Fixed costs primarily comprise vehicle depreciation, employee wages, and insurance expenses, which are independent of energy consumption, travel distance, and travel time ( 36 ), as formulated in Equations 10 and 11.
Refrigeration Cost (C4)
Throughout the vehicle journey, which includes driving, loading, and unloading, refrigeration must be continuously applied to maintain the required temperature. The refrigeration cost is directly related to the total refrigeration time ( 15 ), as shown in Equation 12.
Cargo Damage Cost (C5)
Fresh goods are prone to damage during transport, loading, and unloading processes. Consequently, cargo damage costs must be incorporated into the total cost. This study addresses SPD, where delivery demands decrease from the DC to customer nodes, while pick-up demands increase from customers back to the DC. Building on the model by Fang et al. ( 15 ), pick-up and delivery demands are treated separately, as formulated in Equations 13 to 15.
Courier Waiting Cost (C6)
If a courier arrives at a customer node before the TWLB, a waiting cost is incurred, as defined in Equation 16.
Customer Penalty Cost (C7)
If a courier arrives at a customer node after the TWUB, a customer penalty cost is incurred, as shown in Equation 17.
In summary, Equation 18 establishes the total cost by incorporating all the aforementioned cost components.
Constraints
As specified in Equation 19, each customer node is visited exactly once. Equation 20 ensures that every vehicle departs from the DC to carry out its operations and returns to the DC after completing all assigned tasks. Equation 21 represents the flow conservation constraint. Equation 22 ensures that each customer is served by exactly one vehicle. The time continuity constraints between any two consecutive customer nodes are formulated in Equations 23 to 25. Load constraints on departure from and return to the DC are described in Equations 26 and 27. The load relationship that must be maintained between two consecutive customer nodes is defined in Equations 28 and 29. Finally, the constraints on the decision variables are provided in Equations 30 to 32.
Solution Algorithm
The VRPCCL proposed in this study is an integer programming model involving numerous binary variables and constraints. Wang and Chen ( 37 ) demonstrated that the VRP with simultaneous pick-up and delivery under time windows is NP-hard. Building on this foundation, our work extends the model by incorporating a heterogeneous fleet, carbon emissions, cargo damage, and refrigeration, which further expands the solution space and increases its complexity. Consequently, the extended model is also NP-hard. Although commercial solvers such as Gurobi and CPLEX can be applied to solve integer programming models, their computational time increases significantly as the problem scale grows. Heuristic algorithms, known for their strong applicability, are widely adopted. In this paper, we employ a genetic algorithm because of its proven effectiveness in handling large-scale and complex combinatorial optimization problems ( 38 ). The key steps of the algorithm are shown in the following section.
Chromosome Coding and Initial Population
This paper adopts a real-coded chromosome, illustrated in Figure 2. The initial population is generated based on the heterogeneous fleets, pick-up, and delivery demands. For a given Route1, a candidate customer node

Chromosome coding.

Population initialization.
Genetic Operators
Selection Operator
Crossover Operator
Put the numbers of
The crossover operator is illustrated in Figure 4.

Crossover operation.
Mutation Operator
The mutation operator employs a self-mutation mechanism applied directly to the chromosome structure. An example of its implementation is shown in Figure 5.

Mutation operation.
Fitness Function
The VRPCCL model proposed in this paper is formulated as a single-objective optimization problem aimed at minimizing total cost. The fitness function is defined by Equation 33, which exhibits an inverse relationship between the objective value and fitness.
Application
In this section, we employ small-, medium-, and large-scale numerical cases to further validate the effectiveness of the proposed methodology. All experiments were implemented in Python using the CPLEX solver API. The computational environment consisted of an Intel(R) Core (TM) i5-11400 processor 2.6 GHz, Intel(R) UHD Graphics 730, and 32 GB of RAM.
Numerical Cases
Since no established benchmark is available for our specific problem, this study generates instances by adapting the well-known Solomon benchmarks for the VRPTW. To ensure statistical robustness and address the randomness inherent in the adaptation procedure, we adopt a three-dimensional instance generation approach.
Consequently, for each problem scale (10, 25, 50, and 100 customers), we generate a set of 15 instances (3 benchmark types × 5 random seeds). For example, for the medium scale (R-type, C-type, RC-type 25 customers), the instance set includes: C101_25 (Seed 1 to 5), R101_25 (Seed 1 to 5), and RC101_25 (Seed 1 to 5). The experiments are conducted on these 60-instance sets to capture the variability introduced by both customer distribution and random splitting.
The model parameters and their corresponding values are summarized in Table 2, with data sourced from the literature ( 15 , 30 , 39 , 40 ).
Parameter Values in the Model
Note: CNY = renminbi.
Results Analysis of Genetic Algorithm
The model is solved using a genetic algorithm, with parameter settings determined based on established methods from the literature ( 41 ). The population size is 50, while the crossover probability values are selected as 0.6, 0.7, 0.8, and 0.9, and the mutation probability values are chosen as 0.05, 0.15, 0.25, and 0.35. A full combination of these values yields 16 distinct parameter configurations. Each configuration is independently executed 10 times using the Solomon C101_25 benchmark instance, and the corresponding results are summarized in Table 3.
Results of Combinations of Crossover and Mutation Probability for GA
Note: CNY = renminbi; GA = genetic algorithm; Pc = crossover probability; Pm = mutation probability.
As shown in Table 3, the range of the objective function is between 9,400 and 11,500, and almost all the results are obtained within 120 s. In the preceding results, the smaller values of the objective function can be obtained in the following combinations: (0.7, 0.25), (0.7, 0.35), (0.8, 0.25), and (0.9, 0.25). Furthermore, the average values of the objective function under the combinations of (0.6, 0.35), (0.8, 0.25), (0.8, 0.35), and (0.9, 0.35) are relatively small. Based on the preceding results, the crossover probability is ultimately set to 0.8 and the mutation probability to 0.25, as this combination yields the best performance in relation to both the smaller and the averaged objective values.
Taking the Solomon C101_25 benchmark instance as an example, the convergence trend shown in Figure 6 indicates that the proposed GA exhibits stable convergence performance. Therefore, the genetic algorithm is suitable for solving the VRPCCL model proposed in this paper and shows good performance.

Trend of the objective function value iteration in genetic algorithm (GA).
Comparison of Results between GA and CPLEX
Before applying the GA to medium- and large-scale problems, we first validate its correctness on small-scale instances where optimal solutions can be obtained using CPLEX. The GA is implemented in the same computational environment as CPLEX (Version 12.6.3). For CPLEX, the “timelimit” is set to 86,400 s and the “MIN gap” tolerance to 0.01. For GA, the time limit is set to 1,000 s and the number of iterations to 1,000.
The objective value of CPLEX is the average over five random seeds with a MIP gap tolerance of 0.01. For the genetic algorithm, the objective value is obtained by first averaging 10 independent runs for each random seed and subsequently averaging across the five random seeds.
For small-scale instances, we compare the GA solutions with optimal solutions obtained from CPLEX across all instances (3 benchmark types × 5 random seeds). As shown in Table 4, the GA obtains solutions that are close to the optimal values for all benchmark types, with absolute gap values within 2.0% and standard deviations between 0.9% and 3.9%.
Performance Comparison of GA and CPLEX
Note: CNY = renminbi; GA = genetic algorithm; OOM = out of memory; NA = not available. The gap is computed using a formula: (GA_objective value - CPLEX_objective value) / (CPLEX_objective value) × 100%.
For RC101_10, the gap is 1.2% (standard deviation 0.9%), indicating that the GA solutions are slightly higher than those obtained by CPLEX. For C101_10 and R101_10, small negative gaps are observed. These negative gaps do not imply that the GA outperforms the optimal solutions obtained by CPLEX. Instead, they are mainly caused by statistical averaging effects and the numerical tolerances adopted by the CPLEX solver, which may lead to minor numerical discrepancies between averaged results. For medium- to large-scale instances, CPLEX encounters memory errors and fails to obtain feasible solutions, while the GA successfully obtains feasible solutions for all instances with reasonable computation times (e.g., 205.6 s for 100-customer instances).
Sensitivity Analysis
Since customer types in our experiments are randomly generated and the penalty coefficients involve subjective judgment, it is essential to verify that our conclusions are robust to these choices. Therefore, we conduct a comprehensive sensitivity analysis examining four key aspects: (i) random customer type assignment; (ii) penalty coefficient magnitudes; (iii) the proportion of HVC; and (iv) the penalty function form.
Sensitivity to Random Customer Type Assignment
To determine whether the random assignment of customer types affects our results, we generate 10 distinct customer type configurations for the RC101_25 instance by varying the random seed, while holding the demand and service time split constant. We solve the optimization model for each configuration and record the results. Across the 10 random seeds, the number of HVCs ranges from 7 to 9, consistent with the expected approximate proportion of 30%.
Table 5 presents the results across 10 random customer type assignments. HVC maintains zero timeouts in all cases, whereas PVC and LVC exhibit timeout counts ranging from 1 to 7 (mean 3.5) and 2 to 8 (mean 3.2), respectively. The total cost remains stable between 4,014.3 and 4,255.7 CNY, demonstrating that model performance is robust to the specific realization of customer type assignment.
Sensitivity to Random Customer Type Assignment
Note: HVC = high-value customers; PVC = potential-value customers; LVC = low-value customers; SD = standard deviation.
Sensitivity to Penalty Coefficient Magnitudes
The penalty coefficients (
Table 6 presents the sensitivity results on the RC101_25 instance. HVC achieves zero timeouts across all penalty-scaling factors, demonstrating that service prioritization depends on the relative ordering of penalties (
Sensitivity to Penalty Coefficient Magnitudes
Note: HVC = high-value customers; PVC = potential-value customers; LVC = low-value customers.
Sensitivity to the Proportion of High-Value Customers
To examine the sensitivity of our results to the proportion of HVC, we vary the number of HVCs in the RC101_25 instance while holding the total number of customers constant. To accommodate the discrete nature of customer counts, we select five integer HVC counts. The baseline of 8 HVCs (32%) approximates the 30% target described in the section “Customer Type and Penalty Function,” as 7.5 is not an integer. The other scenarios, namely 4 (16%), 12 (48%), 15 (60%), and 18 HVCs (72%), are chosen to span a reasonable range around this baseline, allowing us to assess whether the model’s ability to prioritize HVC is sensitive to the number of customers classified as HVC.
As illustrated in Table 7, HVC achieves zero timeouts across all five scenarios, with HVC proportions ranging from 16% to 72%. This demonstrates that the model’s ability to prioritize HVC is robust to variations in the number of customers classified as HVC. Total cost increases with the HVC proportion, as more customers require priority service, but the main conclusion remains unchanged.
Sensitivity to the Proportion of HVCs
Note: HVC = high-value customers; PVC = potential-value customers; LVC = low-value customers.
Sensitivity to Penalty Function Form
To assess the sensitivity of our results to the mathematical form of the penalty functions, we conduct a comparative analysis on the RC101_25 instance. The original mixed form defined in Equations 1–3 is evaluated against three alternative formulations applied uniformly across all customer types. For each alternative, we maintain the same penalty coefficients to preserve relative importance, using
Original mixed form: as defined in Equations 1–3.
Linear form:
Logarithm form:
Exponential form:
As presented in Table 8, the zero-timeout guarantee for HVCs holds across all three formulations, demonstrating that prioritization of HVC is driven by the relative ordering of penalties (
Sensitivity to Penalty Function Form
Note: HVC = high-value customers; PVC = potential-value customers; LVC = low-value customers.
The logarithmic form achieves a total cost of 5,188.7 CNY, falling between the original mixed form and the linear form. None of the HVC experience any timeouts. In contrast, all other customer types incur timeouts, reflecting a clear differentiation in service priority. Specifically, PVC receive higher priority than LVC, which is consistent with the logarithmic structure embedded in the original mixed form.
The linear form, serving as a conventional benchmark, achieves zero HVC timeouts at a total cost of 6,354.1 CNY, showing that even the simplest penalty structure can protect HVCs when the penalty coefficients are properly distinguished. However, its cost is significantly higher than that of the original mixed form.
Under the exponential form, only LVC experiences timeouts; this further confirms that HVCs never experience delays, but the total cost increases significantly because of the additional resources required to ensure timely delivery for all customer types. While theoretically feasible, such an extreme scenario is often impractical in real-world operations because of the excessive cost. In contrast, the original mixed form guarantees zero delays for HVCs while allowing a certain degree of delay for lower-value customers, thereby achieving a lower total cost.
Summary of Sensitivity Analysis
In summary, although the customer classification in this study relies on simulated data because of the absence of real-world transaction records, the robustness of the proposed customer-centered optimization model has been comprehensively validated through sensitivity analysis across random customer type assignments, penalty coefficient magnitudes, HVC proportions, and penalty function forms. This provides strong evidence that the optimization approach can perform reliably in practical applications under reasonable parameter choices.
Conclusions and Future Research
This paper proposes a customer-centered vehicle routing optimization model for cold chain logistics that incorporates customer value differentiation through differentiated penalty functions. The model integrates heterogeneous fleets, simultaneous pick-up and delivery, refrigeration, cargo damage, and carbon emissions to ensure its practical applicability. A genetic algorithm is developed and validated against CPLEX on small-scale instances (absolute optimal solution gap ≤2.0%), demonstrating its reliability for medium- and large-scale problems where CPLEX fails because of memory errors. In addition, a comprehensive sensitivity analysis is conducted to assess the robustness of the model. The key findings are summarized as follows:
HVC consistently achieve zero delays in all experiments. A comprehensive sensitivity analysis confirms that this result is robust to variations in random customer type assignment, penalty coefficient magnitudes, HVC proportion, and penalty function form.
Compared with linear, logarithm, and exponential alternatives, the original mixed-form penalty function achieves the lowest total cost. It guarantees zero delays for HVC while allowing a certain degree of delay for lower-value customers.
The genetic algorithm exhibits reliable and scalable performance. It obtains feasible solutions for instances with reasonable computation times. The algorithm’s stability is confirmed by its low optimal solution gap on small instances and low standard deviations across random splits.
Future research will be carried out from the following perspectives. First, given that practical operations of logistics enterprises often involve multiple depots and multiechelon distribution structures, extending the proposed model to incorporate these complexities constitutes a promising research direction. Second, future research could address more real-world scenarios that incorporate demand uncertainty and dynamic customer behavior. Specifically, investigating how stochastic demand and evolving customer preferences influence vehicle routing decisions would be valuable.
Footnotes
Author Contributions
The authors confirm contribution to the paper as follows: study conception and design: Wanchen Gao, Shichang Lu, Junying Yue; data collection: Wanchen Gao; analysis and interpretation of results: Wanchen Gao, Junying Yue; draft manuscript preparation: Wanchen Gao, Shichang Lu, Jun Zhao. All authors reviewed the results and approved the final version of the manuscript.
Declaration of Conflicting Interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This research is funded by the Department of Education of Liaoning Province, China (JYTMS20231007).
