← Back to Insights

[Patent 4] Modular GenAI System Design

Nils Liu
Patents Blog GenAI Architecture
[Patent 4] Modular GenAI System Design

The Most Common Mistake in Enterprise LLMs: Equating “Powerful” with “Useful”

When I first started building enterprise GenAI products, I was also enamored by the belief that “the bigger the model, the better; the more features, the stronger.”

Until I saw the actual deployment costs.

If an LLM service deployed on an enterprise intranet uses a full-featured model for every scenario, the GPU costs, inference latency, and maintenance complexity… all skyrocket.

More importantly: most business scenarios simply don’t need all those features.

A customer service bot needs natural language understanding + dialogue management + knowledge base querying. A financial analysis assistant needs data comprehension + report generation. A personalized recommendation system needs user personas + preference matching.

This insight led to my patent M671449 “Customizable Generative Artificial Intelligence System”—a truly modular GenAI architecture.


Core Design: “Building Block” AI Systems

The fundamental modules of this system include:

ModuleFunction
NLU ModuleConverts user input into structured, processed messages.
Dialogue ManagementDecides action strategies based on processed messages.
Knowledge Base ModuleQueries structured and unstructured data (supports knowledge graphs).
Text GenerationGenerates response text based on decisions and query results.

Advanced optional modules:

  • Sentiment Analysis Module: Adds emotional warmth to responses.
  • Personalization Module: Connects to the internal server’s customer history data.
  • Learning & Adaptation Module: Continuously optimizes generation quality from interaction feedback.
  • Central Control Module: Orchestrates execution sequence and info passing among modules.

Depending on the business scenario, you can deploy only the required combination of modules, significantly lowering hardware requirements and system complexity.


What Does This Mean for a GenAI Product Owner?

As a GenAI PO, the greatest inspiration this architecture gave me is:

The architectural design of an AI product dictates its marginal cost curve.

The marginal cost of full-feature deployments is almost fixed (regardless of how many features are used, the cost remains). Modular deployment allows you to dynamically allocate resources based on usage volume, horizontally scale when business grows, and experiment with new scenarios at an extremely low cost.

This is actually very traditional software engineering thinking—Separation of Concerns—just applied to AI components today.


Three GenAI Architecture Design Principles

From this patent, I organized three architectural principles useful for GenAI Product Managers:

1. Assemble on demand, not all at once Don’t try to implement every feature in version 1. Find the core modules first, validate them, and then stack on more.

2. Every module should be independently replaceable LLM technology evolves incredibly fast. If your text generation module is “hardcoded” into the system, the cost of swapping models later will crush you.

3. The knowledge base module is your competitive moat Other modules are easily replicated by competitors, but your knowledge base—customer data, business rules, institutional memory—is the true, uncopyable moat.


M671449 Customizable Generative Artificial Intelligence System | Grant Date: 2025/06/11 | Sole Inventor: Nils Liu

💬 Read more: 2025 Year in Review (English)

Get the latest insights

Join the newsletter to receive my latest articles on GenAI, AI Agents, and architecture.

No spam. Unsubscribe anytime.