KNOWLEDGE PORTAL

Transforming eCommerce Search and Analytics with Knowledge Graphs

Summary

A Knowledge Graph Approach to Intelligent Product Discovery in eCommerce

A white paper by Matthew de La Fe, Senior Manager of AI.

An approach for eCommerce and digital commerce leaders. Download the “Knowledge Graphs for eCommerce” whitepaper


Executive Summary

eCommerce search continues to underperform expectations, leading to lower sale conversions and depressing revenues. Maintaining a catalog of changing products to meet ever-changing demands and marketplace trends becomes a costly procedure each cycle. Despite adoption of AI-assisted product tagging, search continues to consistently return zero or inaccurate results, and customers regularly encounter irrelevant or incomplete search outcomes.

These failures are not primarily caused by poor models or insufficient data. They are architectural.

Most modern AI tagging systems process products independently, extracting attributes from images and descriptions without a shared semantic foundation. As a result, two functionally identical products may be described and tagged differently depending on marketing language, image emphasis, or data availability.

This white paper presents an alternative architecture: knowledge graph driven product discovery. Rather than managing thousands of products as isolated records, a knowledge graph models the catalog as interconnected entities of products, materials, occasions, styles, and properties, which are governed by explicit business relationships and rules. When a product is connected to a material, it can inherit relevant properties through controlled semantic rules rather than ad-hoc tagging.

Knowledge graphs require upfront ontology design, ongoing governance, and do not eliminate every search failure. They are not a replacement for lexical or vector search engines, but a semantic augmentation layer that improves recall, consistency, explainability, and long-term maintenance economics.

Organizations that adopt this architecture reduce unproductive zero-result searches, significantly improve search consistency across large catalogs, and lower merchandising overhead as attributes shift from product-level tagging to ontology-level management.


1. Current State: AI-Assisted Tagging Limitations

1.1 How AI-Assisted Tagging Works Today

Many eCommerce platforms now rely on AI to automate product tagging. Product images and descriptions are processed by vision models and large language models.  Attributes such as color, style, and material are extracted. Tags are generated and applied to product records. AI-assisted tagging has reduced manual labor and improved baseline consistency for visible attributes such as color, pattern, and style.

1.2 Structural Limitations: Isolated Processing

Despite efficiency gains, AI-assisted tagging remains constrained. Each product is processed independently, without reference to a shared domain knowledge. As a result, attributes depend heavily on product descriptions. Semantically equivalent products may surface for different queries. Search relevance can vary across the catalog.

Two linen shirts may be tagged differently depending on whether the description emphasizes “summer wear” or “relaxed fit.” It is not inferred that linen, by its nature, is commonly breathable and cooling fabric.

Product A: “Classic linen shirt perfect for warm weather”
AI Tags: linen, classic, warm weather, summer

Product B: “Relaxed fit shirt in premium linen”
AI Tags: linen, relaxed fit, premium

A customer searching “summer shirt” finds Product A but not Product B. Advanced post-processing rules and co-occurrence models can address these issues, but they effectively recreate ontology-like behavior implicitly based on training data. Synonym lists can balloon to unmanageable levels, and explanation is hidden behind probabilities. Maintaining a large data catalog becomes burdensome, especially in industries with short cycles. The conclusion is not that AI cannot reduce this gap, but that doing so at the product level is uneconomical and ungovernable at scale.

1.3 The Impact on Search Performance

These architectural constraints manifest directly in search. Consumers are leaving sites after negative search experiences. In an era of digital dominance, eCommerce players must treat the online experience with as much reverence as in-store experiences of the past. A poorly architected search is the equivalent unorganized, unkempt displays.

2. The Knowledge Graph Alternative

Knowledge graphs offer a fundamentally different architecture. Major retailers such as, Amazon, Walmart, Ikea, and Zalando have adopted a graph-based approach for product catalogs.

2.1 Core Concepts: Entities, Properties, and Relationships

A knowledge graph represents catalog knowledge as a network of entities governed by an ontology:

Entities (Nodes): Products, materials, categories, occasions, climates, styles
Properties (Node Attributes): Attributes attached to entities (e.g., breathable, lightweight, formal)
Relationships (Edges): Explicit links between entities (e.g., hasMaterial, suitableFor, idealFor)

This structure allows knowledge to be defined once and reused consistently across the catalog.

Graph showing the nodes and relationships for a fashion Knowledge Graph.

2.2 Controlled Property Inheritance

Property inheritance is the key differentiator. When a product is linked to a material such as linen, relevant properties may be inherited subject to rules, such as:

Fabric Weights: Heavy linen may not qualify as “lightweight”
Construction: Linen + lining ≠ breathable; linen blazer ≠ summer shirt
Blends: A 20% linen blend does not inherit full linen properties
Category: Same material may have different property applicability by garment type

Properties are inherited based on material relationships, subject to ontology rules and exclusions. Constraint validation frameworks prevent inappropriate propagation. This rule-governed inheritance ensures semantic accuracy while preserving consistency.

2.3 Maintenance Economics

Traditional tagging scales linearly with catalog size. More importantly, when customer language changes (for example, “eco-friendly” becomes a dominant search), updates occur once at the ontology level and propagate systematically without touching product records.

3. AI’s Role In A Knowledge Graph Architecture

Knowledge graphs do not eliminate AI; they reposition it. Instead of asking AI to generate open-ended tags per product, AI is deployed for constrained tasks that operate at the semantic level.

3.1 Entity Classification (Constrained AI)

Instead of AI-assisted tagging, AI classifies products against known ontology entities. Predictions are confidence-scored, routed through human-in-the-loop review workflows, and continuously evaluated. This is a fundamentally easier AI task with higher accuracy because the output space is constrained to known entities with clear definitions.

Traditional AI: “What tags describe this product?”
Output: Unlimited possibilities, inconsistent across products, no governance

Knowledge Graph AI: “Classify against our ontology:”
Material: Linen (94%) → review queue → approved → all linen properties inherited
Output: Constrained, verifiable, auditable, governed

3.2 Ontology Enrichment

Our approach includes analyzing search logs, product descriptions, and external content to propose new properties, emerging terminology, and gaps in semantic coverage. Behavior data suggests new relationships between nodes.

AI Ontology Analysis Example

Detection: “moisture-wicking shirts” has 10,000 monthly searches with no ontology match
Analysis: 847 products reference this in descriptions; linked to Polyester (412), Merino (203), Nylon (232)
Recommendation: Add property “moisture-wicking” to 3 material nodes
Governance: Review and approve. Property propagates to 847 products.

3.3 Query-to-Ontology Mapping

Our approach maps natural language queries to ontology concepts, enabling disambiguation, multi-concept resolution, and explainable query interpretation. This mapping layer improves continuously through machine learning on click-through data and query refinement patterns.

4. Knowledge Graph Impact and Implementation

4.1 Zero-Result Reduction


Knowledge graphs reduce zero-result searches by creating multiple semantic paths to products.

ZERO-RESULT CAUSEKG IMPACTIMPROVEMENT
Synonym GapsProperty InheritanceHigh
Incomplete TaggingRelationship HoppingHigh
Novel TerminologyAI maps to existing conceptsMedium-High
MisspellingsRequires lexical fuzzy matchingNone (same)
Trend-driven NoiseOntology enrichment (delayed)Medium

4.2 Hybrid Search Enablement

In production, knowledge graphs augment existing infrastructure

Lexical search (BM25, inverted indexes): Handles keyword matching, typo tolerance, stemming
Vector retrieval (embeddings): Captures semantic similarity for long-tail queries
Knowledge graph reasoning: Provides governed semantic expansion and property inheritance
Ranking, personalization, merchandising rules: Business logic preserved exactly as-is

Knowledge graphs complement lexical and vector retrieval layers. The graph provides a semantic backbone to deliver auditable, controlled, and business-aligned interpretation.

4.3 Where Knowledge Graphs Can Fail

Poor Design: Ontologies must reflect the true nature of the consumer search
Missing query mapping layer: Queries must map to ontology concepts
Over-connection: Trade-offs between precision and recall must be made
Novel Concepts: Unique search trends require ontology manipulation
Ontology errors: Governance structure is essential for maintenance

5. Business Impact

5.1 Operational Efficiency



Faster product onboarding: Classify against ontology vs. comprehensive tagging
Faster trend response: Update ontology structure (one change) vs. retag products (thousands of changes)
Consistency guaranteed by structure: Not dependent on individual tagging accuracy or AI-assisted training
Improved auditability and governance: All semantic decisions are traceable and reviewable

6. Implementation Considerations

Ontology design requires cross-functional input from search, merchandising, data science, and domain experts. Governance ownership must be explicit.

Measurement should track relevance and meaningful query resolution, not just zero-result counts. Continuous refinement is expected.

6.1 Governance and Ownership

A knowledge graph requires governance.

Ontology Stewards: Approve structural changes to the graph
Workflow Reviewers: Approve AI proposed propagation
Management Trackers: Measure KPIs against traditional search conversion

7. Architecture: Traditional vs. Knowledge Graph

The following diagrams compare the traditional eCommerce search architecture with the knowledge graph–augmented approach. The knowledge graph does not replace existing infrastructure. It augments it by providing a governed semantic layer.

7.1 Traditional eCommerce Search Architecture

Graphic showing a traditional eCommerce search architecture.

7.2 Knowledge Graph Augmented Architecture

The knowledge graph–augmented stack preserves all existing infrastructure while adding a semantic layer that improves recall, consistency, and explainability.

Graphic showing the Knowledge Graph under an augmented search architecture.

7.3 Architectural Benefits

Recall without noise: Increases findability and preserve precision
• Explainability: Results can be traced to explicit semantic relationships
Resilience: Customer language evolves without requiring product re-tagging
Composable: Works with existing commerce platforms and search engines

DIMENSIONTRADITIONALKG-AUGMENTED
Retrieval depends onPer-product tags + dictionariesGoverned entities + relationships
Failure modeOne missed tag = missed productOntology gap = fix once, propagate
MaintenanceN products × M tagsX entities × M properties + N classifications
ExplainabilityOpaque score blendingTraceable semantic relationships
Trend responseManual retag + synonymsOntology update, auto-propagation
AI roleOpen-ended tag generationConstrained classification + enrichment

8. Addressing Common Objections

Objection 1: “Isn’t this just moving the tagging problem somewhere else?”

Response: The problem is not tagging itself, but where tagging occurs. Product-level tagging scales linearly with catalog size and is inherently inconsistent. Ontology-level definition scales with domain complexity, not SKU count, and is governed by explicit rules and review workflows. One ontology update to a material node propagates to every product using that material without touching product records.

Objection 2: “Won’t this over-generalize products and hurt precision?”


Response
: Only if implemented incorrectly. Property inheritance is rule-governed, contextual, and category-aware. Constraint validation prevents inappropriate propagation (e.g., a lined linen blazer does not inherit “summer-only” attributes). Precision is preserved through validation rules and downstream ranking layers.

Objection 3: “We already use embeddings. Don’t they solve this?”


Response
: Embeddings improve recall but lack explicit semantics, governance, and explainability. A vector search may surface a linen shirt for “breathable,” but it cannot explain why, cannot be audited, and cannot be governed by business rules. Knowledge graphs provide the semantic backbone that embeddings alone cannot: auditable meaning, controlled inheritance, and business-aligned interpretation. They are complementary, not competitive.

Objection 4: “This sounds expensive to maintain.”


Response
: The opposite is typically true. Maintenance shifts from thousands of individual product records to a small, governed set of entities and rules. Over time, operational effort decreases as ontology changes propagate automatically.

Objection 5: “Our catalog and brands are too unique for a single ontology.”


Response: Ontologies are extensible by design. Brand-specific semantics, exclusions, and overrides are expected and supported. A shared core ontology enables cross-catalog consistency, while extensions preserve differentiation.

Objection 6: “What happens when the ontology is wrong?”


Response: Ontology errors are visible, traceable, and correctable in one place. Compare this to traditional tagging, where silent errors are distributed across thousands of SKUs with no auditability. A governed ontology with stewardship workflows is fundamentally safer than unmonitored product-level tagging at scale.

Conclusion

AI-assisted tagging has reached diminishing returns in eCommerce search. The limitation is not model quality, but architecture.

Knowledge graphs provide a governed semantic foundation that enables AI to operate at the structural level to improve consistency, explainability, and long-term economics. When implemented as part of a hybrid search stack, they deliver meaningful reductions in search failure while positioning organizations for continuous improvement as customer language evolves.

The honest assessment: this is not a magic solution. It requires upfront investment in ontology design, ongoing governance, and clear organizational ownership. But it fundamentally changes the economics of catalog management and creates a resilient, auditable architecture that scales with domain complexity rather than SKU count.

Next Steps

Cyberhill Partners specializes in ontology-driven eCommerce architectures. We can assess your current search architecture, quantify semantic gaps, design a custom ontology, and implement AI-powered governance systems.

Contact us for a complimentary eCommerce search and semantic architecture assessment.


Schedule an AI eCommerce Strategy Session

Download the “Knowledge Graphs for eCommerce” white paper


Appendix: Technical Reference

A.1 Sample SPARQL Query

Sample of a SPARQL query used for eCommerce search and analytics.

A.2 AI Roles Summary

AI ROLEFUNCTIONGOVERNANCE
Entity ClassificationConstrained classification against ontologyReview workflows, brand-specific overrides
Ontology EnrichmentDiscover new properties from search logsSteward approval before propagation
Relationship DiscoveryIdentify connections from behavioral dataAdvisory only; domain expert approval
Query MappingMap natural language to ontology conceptsClick-through learning, continuous monitoring

A.3 Glossary

  • Knowledge Graph: A structured representation of entities, properties, and relationships stored as triples
  • Ontology: A formal specification of entity types, relationship types, property definitions, and constraint rules
  • Triple: Subject → Predicate → Object (e.g., Linen_Shirt → hasMaterial → Linen)
  • Node: An entity in the graph (product, material, occasion, property)
  • Edge: A relationship connecting two nodes (hasMaterial, hasProperty, idealFor)
  • Property Inheritance: Rule-governed transfer of properties through relationships, subject to constraints
  • Graph Traversal: Navigating connected nodes to expand recall beyond keyword matching
  • Ontology Steward: Named individual or team responsible for reviewing and approving ontology changes
  • Constraint Validation: Rules that prevent inappropriate property propagation (e.g., SHACL)
You might also be interested in
Palantir AIP vs. Cerebro: Cost, Lock-In, and Who Actually Owns Your AI
In this article:  Palantir’s commercial momentum is real. AIP and Foundry are genuinely powerful platforms, and for certain organizations, they’re the right call. But “powerful” and “right for your enterprise”…
LEARN MORE
The Hidden Cost of Enterprise AI Platforms
The Hidden Cost of Enterprise AI Platforms: Why the Biggest Risk Isn’t the Technology
Only 6% of enterprises could walk away from their AI vendor without disruption. The real cost of enterprise AI isn't the license — it's what you give up.
LEARN MORE
Abstract visualization of interconnected data nodes and knowledge graph pathways on a dark navy background — representing the AI solutions marketplace structure and ontology-driven enterprise AI architecture
The AI Solutions Marketplace White Paper: Market Structure, Capital Flows, and Cyberill’s Position
AI VC funding surged 85% to $211 billion in 2025 — but the investment thesis has fundamentally changed. This whitepaper maps the five-layer marketplace structure, exposes the fault lines enterprises…
LEARN MORE