Tuesday, March 31, 2026

What is Amazon Kendra?

Amazon Kendra is an AI-powered document search service from AWS.

๐Ÿ‘‰ In simple terms:

It lets you index documents from multiple sources into a central repository and enables natural language search over them.

Unlike basic keyword search, Kendra uses ML/NLP to understand intent and return context-aware answers.


๐Ÿ“š 1. Kendra as a Document Search Service

Kendra acts like:

“Google for your enterprise documents”

Key capabilities:

  • Centralized document indexing

  • Natural language querying

  • Extracts answers (not just links)

  • Role-based access filtering


๐Ÿง  2. Does it create a central index?

๐Ÿ‘‰ Yes — this is core to Kendra

  • You create an Index

  • All documents are ingested into this index

  • Search queries run against this index


Architecture:

Data Sources → Kendra Index → Search API → Application / UI

๐Ÿ“„ 3. Supported Document Types

Kendra supports a wide range of formats:

๐Ÿ“ Common formats:

  • PDF

  • Word (DOC, DOCX)

  • Excel (XLS, XLSX)

  • PowerPoint (PPT, PPTX)

  • HTML

  • XML

  • JSON

  • Plain text


๐Ÿงพ Structured + semi-structured:

  • FAQs

  • Knowledge base articles

  • Wiki pages

  • Emails (via connectors)


๐Ÿ–ผ️ Images?

  • Not directly searchable

  • But can be indexed if:

    • Text is extracted using:

      • Amazon Textract


๐Ÿ’ฌ 4. Natural Language Search

๐Ÿ‘‰ One of Kendra’s strongest features

Example queries:

  • “What is the leave policy for contractors?”

  • “How to reset VPN password?”

  • “Show SLA for premium customers”


What happens internally:

  • Query understanding (NLP)

  • Semantic matching (not just keywords)

  • Ranking based on relevance


๐Ÿ‘‰ Output:

  • Direct answers (highlighted)

  • Ranked documents


๐Ÿ”— 5. Integrations (Very Powerful)

Kendra integrates with many enterprise systems:


๐Ÿ“ฆ AWS-native sources:

  • Amazon S3

  • Amazon RDS

  • Amazon DynamoDB


๐Ÿข SaaS / enterprise tools:

  • SharePoint

  • OneDrive

  • Google Drive

  • Confluence

  • Salesforce

  • ServiceNow

๐Ÿ‘‰ (via built-in connectors)


๐Ÿ”Œ Custom sources:

  • Use:

    • Kendra APIs

    • Custom connectors


๐Ÿ–ฅ️ 6. How to Use from AWS Console

Step-by-step:

1️⃣ Create Index

  • Go to Kendra → Create index

  • Configure:

    • Name

    • IAM role

    • Capacity


2️⃣ Add Data Sources

  • Choose connector:

    • S3 / SharePoint / etc.

  • Configure access

  • Start sync


3️⃣ Indexing

  • Documents are:

    • Crawled

    • Parsed

    • Indexed


4️⃣ Search

  • Use:

    • Console search UI

    • API (Query API)


5️⃣ Build Application

  • Integrate search into:

    • Web apps

    • Chatbots

    • Internal tools


๐Ÿ” 7. Authentication & Security

Kendra supports multiple auth mechanisms:


๐Ÿ”‘ 1. IAM (Primary)

  • Access via:

    • AWS SDK / CLI

  • Controlled via IAM roles & policies


๐Ÿง‘‍๐Ÿ’ผ 2. User Context Filtering

  • Document-level permissions

  • Integrated with:

    • Active Directory

    • SSO systems

๐Ÿ‘‰ Ensures:

Users only see documents they are allowed to


๐ŸŒ 3. API Access

  • Signed requests (SigV4)

  • Used by applications


๐Ÿ” 4. Identity Providers

  • SAML-based SSO

  • Integration with enterprise identity systems


⚙️ 8. How Kendra Works Internally (Simplified)

Ingestion → Parsing → NLP Enrichment → Indexing → Query Engine
  • Extracts metadata

  • Understands document structure

  • Builds semantic index


๐Ÿงฉ 9. Advanced Features

๐Ÿ”น FAQ support

  • Direct Q&A matching


๐Ÿ”น Relevance tuning

  • Boost certain documents


๐Ÿ”น Custom metadata

  • Filter search results


๐Ÿ”น Incremental sync

  • Only updates changed documents


๐Ÿš€ 10. Where Kendra Fits (Important Insight)

Given your background (RAG, GenAI, document parsing):

๐Ÿ‘‰ Kendra can replace parts of your pipeline:

Instead of:

Parsing → Chunking → Embedding → Vector DB → Retrieval

You can use:

Kendra Index → Query API → Results

๐Ÿ‘‰ Or combine:

๐Ÿ”ฅ Kendra + LLM (Best pattern)

  • Kendra → retrieval

  • LLM → summarization / reasoning


๐Ÿ Final Summary

  • Amazon Kendra = intelligent document search engine

  • Creates a central index

  • Supports:

    • Multiple document formats

    • Natural language queries

  • Integrates with:

    • AWS + enterprise tools

  • Access via:

    • Console

    • APIs

    • IAM / SSO


๐Ÿ’ฌ One-line takeaway:

Kendra is a managed enterprise search + semantic retrieval system, ideal for building internal knowledge search and RAG-style applications.


If you want, I can:

  • Compare Kendra vs OpenSearch vs Vector DB (very useful for your use case)

  • Show architecture for Kendra + LLM chatbot

  • Or suggest when NOT to use Kendra (cost/performance tradeoffs)

No comments:

Post a Comment