Model name: EV Range Estimation Model
Goal: To evaluate which Electrical Vehicles (EV) might be able to replace an Internal Combustion Engine (ICE) vehicle for a given duty cycle
Base model: Linear Regression
Model type: Supervised
Model version: 1.0 in production
Developed by: Geotab Sustainability team
Primary intended uses:
Out-of-scope uses: The model is specifically designed and trained using data from light-duty vehicles, such as passenger cars and delivery vans. It is not suitable for heavy-duty vehicles.
Target users/User groups: Fleet managers.
This section outlines the key aspects of the data used to develop and evaluate the model. We first describe the training and testing data, and then detail the data pipeline and preprocessing steps used to prepare the data for modeling. Lastly, we discuss the privacy considerations and protections implemented to ensure responsible handling of sensitive data.
The EV energy estimation model works by using linear regression on historical trip data to predict the energy consumption of EVs (in kWh) based on key trip characteristics and environmental factors. It takes specific inputs such as total trip distance, total trip duration, average trip speed, and average outside air temperature.
The total energy consumption is assumed to be a weighted sum of these components: energy to overcome friction, energy to overcome drag, baseline auxiliary power (lights, seat heating, etc.), and energy to power HVAC. Different inputs are used to estimate each part. The linear regression model approximates how much each component contributes to the total energy consumption.
The data is split into a 75% training set and a 25% testing set through random selection.
The pipeline for the input data is broken up into five stages, each capturing specific data points which are used in the Energy Prediction Model:
Selection criteria for distance are as follows:
In this section, we highlight some ethical challenges that were encountered during the model development, including bias and fairness considerations. Additionally, we provide the assumptions and constraints of our model, including any limitations in the data or the model's scope that could affect its performance, in order to foster the understanding of the model's strengths and limitations to the stakeholders which is crucial to use the model responsibly and interpreting its results.
Assumptions:
Constraints:
Mean Absolute Error (MAE) and R-squared (R²) are used as evaluation metrics to assess the performance of the model on different make-models (e.g. Renault Zoe):
Performance is also assessed across different trip lengths (short, medium, long) to ensure a balanced and more nuanced accuracy.