01
Project Overview
AI-driven site selection PoC to accelerate bp's EV charging network expansion in China
Problem Statement
China's EV market hit 13.1M vehicles in 2022 (+67% YoY), creating urgent demand for high-value charging locations. Existing site selection was manual, inefficient, costly, and difficult to assess at scale. No structured AI modelling tool existed in the industry to systematically identify optimal sites.
Manual & inefficient
High assessment cost
No AI tooling
Fast-moving market
Solution & PoC Objectives
Build a modelling platform combining GIS spatial AI, ML classification, and forecast algorithms to score, rank, and predict performance of candidate EV charging sites β enabling business teams to make faster, data-driven expansion decisions. Pilot city: Guangzhou. Validation against business development on-site investigation results.
AI site scoring
Performance forecast
GIS visualisation
Validated vs. BD data
02
System Architecture
5-layer AI platform: raw data β derived features β ML engine β AI modules β user interface
USER LAYER
πΊοΈ Heat Map
Density visualisation
Density visualisation
π Site Evaluation
Score + grade
Score + grade
π Site Recommendation
Ranked list
Ranked list
β Good Sites
Factor analysis
Factor analysis
π Perf. Forecast
Revenue Β· break-even
Revenue Β· break-even
β SaaS Platform API (Alading)
AI MODULES
π― Location Evaluation
Classification algorithms
Classification algorithms
π‘ Location Recommendation
Clustering + CF
Clustering + CF
π Performance Forecast
Regression algorithms
Regression algorithms
β Spark MLlib Β· Model Training Pipeline Β· Feature Selection
ML ENGINE
π Kernel Density
Spatial distribution
Spatial distribution
π OD/PA Model
Population flow
Population flow
π Spatial Interpolation
Feature coverage
Feature coverage
πΈοΈ Force Layout
Feature correlation
Feature correlation
ποΈ Business District
Commercial analysis
Commercial analysis
π Value Factors
High-value site ID
High-value site ID
β Data Processing Β· Deduplication Β· Normalisation Β· Derived Feature Engineering
DATA LAYER
π₯ Population Portrait
Demographics Β· finance
Demographics Β· finance
π AOI / POI
Location attributes
Location attributes
π NEV Data
Static vehicle counts
Static vehicle counts
π£οΈ Traffic Flow
Dynamic + real-time
Dynamic + real-time
β‘ Existing EVC Sites
Competitor landscape
Competitor landscape
β Multi-source Data Ingestion Β· Crawler Β· Telecom APIs Β· Map APIs
SOURCE DATA
π‘ Telecom Providers
πΊοΈ Map / GIS Companies
π Ride-hailing Platforms
π¦ Insurance Data
πΈοΈ Web Crawlers
Stack:
Spark MLlib
Python
GIS / Spatial AI
SaaS Platform
REST APIs
03
Data Dashboard & Market Context
Market scale, data coverage, and PoC resource breakdown
EV Site Planning PoC β Project Dashboard
2023 Β· Guangzhou Pilot
13.1M
EVs in China (2022)
β +67% year-on-year growth
6
Core ML Models Deployed
β Kernel density Β· OD/PA Β· Spatial Β· β¦
5+
Data Source Categories
β Population Β· NEV Β· POI Β· Traffic Β· EVC
$40K
PoC Investment (DS&E)
β SaaS + validation report
ML Algorithm Coverage by Module
Data Source Diversity
04
Core ML Models
6 spatial AI and statistical models forming the site-scoring engine
Model 01
π Kernel Density Model
Collects feature data within a defined analysis radius, calculates the density of relevant features (population, POIs, NEVs) in the surrounding area. Produces spatial heatmaps to surface demand hotspots.
Spatial statisticsBandwidth selection
Model 02
π OD / PA Flow Model
Processes tens of millions of smart mobile device records to model city-level population travel patterns. Derives Origin-Destination and Production-Attraction matrices to estimate actual demand flows to candidate sites.
Big data processingFlow estimation
Model 03
π Spatial Interpolation Model
Analyses the spatial extent of feature distributions and finds the nearest input sample subset to predict feature values at unsampled locations across the entire study area.
IDW / KrigingGIS interpolation
Model 04
πΈοΈ Force-Oriented Layout Model
Draws a topological relationship network by calculating correlation degrees between all features β visualises the aggregation and compactness of site-relevant factors to reveal key driver clusters.
Graph layoutCorrelation analysis
Model 05
ποΈ Business District Analysis Model
Analyses customer situation, site conditions, and surrounding operational factors within a defined commercial area to identify the optimal location address within a business district.
Commercial circleBoundary analysis
Model 06
π Value Factors Analysis Model
Identifies high-value sites by analysing key success factors from existing good-performing sites, then forms a location selection model to systematically discover new high-potential sites.
Feature importanceGood-site learning
05
Data Sources & Feature Engineering
5 data categories across demand signals, supply data, and geospatial intelligence
Population Portrait
Population size, demographics (gender, age), lifestyle preferences (travel, shopping), and financial profiles β used to model demand density and user segment affinity.
Source: Telecom companies Β· Insurance companies
AOI / POI (Area & Point of Interest)
Name, address, coordinates, telephone and other attributes for commercial zones, residential areas, transit hubs, and destinations β core inputs for spatial feature extraction.
Source: Map companies Β· Crawler & data processing
New Energy Vehicle Data (Static)
NEV count by brand and model segmented by geographic area β identifies actual EV ownership concentration and reveals high-demand zones.
Source: Insurance companies Β· Crawler
Traffic Flow Data (Dynamic)
Road grade, time-of-day segmented traffic volumes, real-time NEV counts on road, e-hailing NEV booking counts, and GPS tracking within each POI β provides demand dynamics.
Source: Ride-hailing platforms Β· Telecom companies
Existing EV Charging Site Data
Competitor and existing site attributes: name, pile count, pricing, real-time availability (free/busy), and power rating β informs supply gap and competitive landscape analysis.
Source: Public charging networks Β· Web crawlers
Derived Features (computed)
Commercial circle boundaries Β· Opportunity zones Β· Peer radiation areas Β· Competitive zones Β· Equidistance circles Β· Surrounding business density β all derived via the 6 ML models from raw source data.
06
Feature Selection & Model Training Pipeline
Rigorous ML pipeline from raw data preprocessing to model optimisation
Filter Methods
Statistical Relevance
Variance selection (low-variance removal)
Pearson & Spearman correlation
Chi-square test
ANOVA (F-statistic)
Kendall's tau
Mutual information method
Wrapper Methods
Model-guided Selection
Recursive Feature Elimination (RFE)
Cross-validated RFE with estimator
Embedded Methods
Penalty-based
L1 / Lasso regularisation
Tree-based feature importance
Dimensionality Reduction
Compression & Decorrelation
PCA (Principal Component Analysis)
LDA (Linear Discriminant Analysis)
Model Evaluation Metrics
KS statistic & PSI
Accuracy, Recall, F1 Score
AUC / ROC curve
Training pipeline:
Data preprocessingβ
Feature extraction & selectionβ
Model training/buildingβ
Model testingβ
Model evaluationβ
Model optimisation
07
AI Platform β 5 User Modules
End-to-end visual interface for business teams to evaluate and plan EV charging sites
Heat Map
Input: filter conditions (area, EV density, POI type)
β
Output: colour-coded demand density heat map across city
Site Evaluation
Input: candidate site address
β
Output: overall composite score + recommendation grade (A/B/C)
Site Recommendation
Input: filter conditions (district, budget, site type)
β
Output: ranked list of recommended sites sorted by score
Good Sites Analysis
Input: known high-performing site address
β
Output: key success factors and their relative weights
Performance Forecast
Input: site address + investment cost
β
Output: customer unit price Β· annual revenue split Β· break-even point
08
Project Management & Delivery
End-to-end PoC ownership from vendor selection to model validation
Phase 1 β Market Research
Benchmarking & Vendor Selection
Evaluated multiple China technology providers on data quality, coverage, prior practice experience, and cost. Selected Alading as prioritised partner following structured benchmarking.
Phase 2 β PoC Design
Solution Architecture & Scope Definition
Defined 3-module AI system (Evaluation Β· Recommendation Β· Forecast), identified 5 data source categories, scoped Guangzhou as pilot city. Secured $40K DS&E global funding.
Phase 3 β Contract & Build
Contract Signed β First Deployment
Signed contract with Alading. Oversaw SaaS platform configuration with heat map, site evaluation, site recommendation, good-site models, and performance forecast modules.
Phase 4 β Validation
Trial Launch & Model Accuracy Validation
Compared AI modelling results against Business Development team's on-site investigation findings. Generated validation report covering functions, performance, and model accuracy.
PoC Deliverables
SaaS platform modules5 functional
ML models deployed6 models
Validation reportFunction + accuracy
Total budget$40K USD
Pilot cityGuangzhou
Risk Β· High
Data Availability
Real-time traffic and NEV tracking data subject to provider access restrictions and coverage gaps.
β Multi-source fallback Β· data SLA in contract
Risk Β· Medium
Model Accuracy
Prediction vs. BD on-site investigation divergence β new domain with limited labelled training data.
β Iterative retraining Β· accuracy KPI gates
09
Technology Stack
Spatial AI, distributed ML, and GIS-integrated analytics platform
Spark MLlib
Distributed ML
Python
ML + Data Pipeline
GIS / Spatial AI
Geospatial analysis
PCA / LDA
Dimensionality reduction
Kernel Density
Spatial statistics
Graph / Network
Feature correlation
SaaS Platform
Alading / partner
REST APIs
Data ingestion
Collab. Filtering
Recommendation
Regression Models
Performance forecast
Big Data Pipeline
ETL Β· preprocessing
RFE / Feature Sel.
Model optimisation