Problem Statement
Knowledge was fragmented across 8+ siloed systems — internal portal, OA, multiple knowledge bases, and data platforms — with no unified search or recommendation layer. Agents and staff had to switch between apps repeatedly, losing productivity. No intelligence layer existed to surface relevant content proactively.
Solution & Mandate
Build an enterprise-grade NLP platform — branded as "Taiping Encyclopedia" — that crawls, indexes, and semantically understands content from all internal and external sources, then surfaces it intelligently across all user touchpoints (mobile, PC portal, OA) via smart search and AI recommendation.
Staff Mobile App
Agent Sales App
Customer App / WeChat
Internal Portal
Intent + Full-text
Daily Picks · Hot Rank
ASR / TTS
FAQ + NLU dialog
Unified Content Card
BERT-based routing
Finance-domain tuned
Vector embeddings
Collab filtering
R&D innovation
Full-text search
Semantic embeddings
Content relations
Content taxonomy
User action logs
Org-level permissions
| API Module | Category | Core Technology | Description | Consumers |
|---|---|---|---|---|
/search/semantic |
Search | ES + BM25 (finance-tuned) + vector similarity | Multi-source full-text + semantic search with intent routing | All 4 touchpoints |
/intent/classify |
NLP | BERT fine-tuned on insurance domain | Query intent recognition → routes to Q&A, search, or report lookup | Search gateway |
/recommend/feed |
Rec Engine | Collaborative filtering + content-based + feature correlation | Personalised content feed using 4-strategy ensemble model | Mobile apps, portal |
/content/penetrate |
Infra | Custom crawl framework + permission binding | Proprietary content penetration — crawls source systems without exposing raw access; permission-bound at org level. | Indexing pipeline |
/tags/extract |
NLP | Jieba + domain lexicon + TF-IDF | Auto-extracts semantic tags and entity labels from indexed content | Indexing pipeline |
/behaviour/track |
Infra | Event probe + Kafka + Flink (streaming) | Real-time user behaviour ingestion across all apps — feeds recommendation model | Rec engine, analytics |
Front-End Touchpoints (Consumer Systems)
Back-End Data Sources (10+ Feeds)
Annual Savings Calculation (Primary Portal Pilot)
25,000 DAU × (1 rec @50s saved + 4 searches @10s saved) × 22 working days ÷ 60 = 33 min/user/month saved. Total = 21,000 person-days/year. At ¥360K/person-year all-in cost → ¥25.83M gross saving minus ¥2.29M project cost = ¥23.5M net ROI from primary portal alone.