[Patent 4] Modular GenAI System Design
The Most Common Mistake in Enterprise LLMs: Equating “Powerful” with “Useful”
When I first started building enterprise GenAI products, I was also enamored by the belief that “the bigger the model, the better; the more features, the stronger.”
Until I saw the actual deployment costs.
If an LLM service deployed on an enterprise intranet uses a full-featured model for every scenario, the GPU costs, inference latency, and maintenance complexity… all skyrocket.
More importantly: most business scenarios simply don’t need all those features.
A customer service bot needs natural language understanding + dialogue management + knowledge base querying. A financial analysis assistant needs data comprehension + report generation. A personalized recommendation system needs user personas + preference matching.
This insight led to my patent M671449 “Customizable Generative Artificial Intelligence System”—a truly modular GenAI architecture.
Core Design: “Building Block” AI Systems
The fundamental modules of this system include:
| Module | Function |
|---|---|
| NLU Module | Converts user input into structured, processed messages. |
| Dialogue Management | Decides action strategies based on processed messages. |
| Knowledge Base Module | Queries structured and unstructured data (supports knowledge graphs). |
| Text Generation | Generates response text based on decisions and query results. |
Advanced optional modules:
- Sentiment Analysis Module: Adds emotional warmth to responses.
- Personalization Module: Connects to the internal server’s customer history data.
- Learning & Adaptation Module: Continuously optimizes generation quality from interaction feedback.
- Central Control Module: Orchestrates execution sequence and info passing among modules.
Depending on the business scenario, you can deploy only the required combination of modules, significantly lowering hardware requirements and system complexity.
What Does This Mean for a GenAI Product Owner?
As a GenAI PO, the greatest inspiration this architecture gave me is:
The architectural design of an AI product dictates its marginal cost curve.
The marginal cost of full-feature deployments is almost fixed (regardless of how many features are used, the cost remains). Modular deployment allows you to dynamically allocate resources based on usage volume, horizontally scale when business grows, and experiment with new scenarios at an extremely low cost.
This is actually very traditional software engineering thinking—Separation of Concerns—just applied to AI components today.
Three GenAI Architecture Design Principles
From this patent, I organized three architectural principles useful for GenAI Product Managers:
1. Assemble on demand, not all at once Don’t try to implement every feature in version 1. Find the core modules first, validate them, and then stack on more.
2. Every module should be independently replaceable LLM technology evolves incredibly fast. If your text generation module is “hardcoded” into the system, the cost of swapping models later will crush you.
3. The knowledge base module is your competitive moat Other modules are easily replicated by competitors, but your knowledge base—customer data, business rules, institutional memory—is the true, uncopyable moat.
M671449 Customizable Generative Artificial Intelligence System | Grant Date: 2025/06/11 | Sole Inventor: Nils Liu
💬 Read more: 2025 Year in Review (English)
[Patent 6] Building a Knowledge Graph from a GenAI...
How does a bank GenAI Product Manager design an LLM system that automatically builds a knowledge graph from business pain points, and successfully obtain a...
[Patent 5] GenAI Security Blind Spots
When a GenAI system queries sensitive data, how do you prevent malicious users from bypassing security? This article details how a bank AI Product Manager d...
[Patent 6] Building a Knowledge Graph from a GenAI...
How does a bank GenAI Product Manager design an LLM system that automatically builds a knowledge graph from business pain points, and successfully obtain a...
[Patent 5] GenAI Security Blind Spots
When a GenAI system queries sensitive data, how do you prevent malicious users from bypassing security? This article details how a bank AI Product Manager d...