wisedev-code · wisedev-pstach · Aug 28, 2025 · Aug 25, 2025 · Aug 27, 2025 · Aug 27, 2025
diff --git a/src/assets/docs/docs-index.json b/src/assets/docs/docs-index.json
@@ -105,6 +105,26 @@
                }
             ]
          },
+          {
+            "title": "Knowledge base",
+            "children": [
+              {
+                "title": "Overview",
+                "path": "examples/agent-knowledge-overview"
+              },
+              {
+                "title": "With File source",
+                "path": "examples/example-agent-knowledge-file"
+              },
+              {
+                "title": "With Web source",
+                "path": "examples/example-agent-knowledge-web"
+              },
+              {
+                "title": "With Mcp source",
+                "path": "examples/example-agent-knowledge-mcp"
+              }
+              ]},
          {
             "title":"Flow",
             "children":[

diff --git a/src/assets/docs/examples/agent-knowledge-overview.md b/src/assets/docs/examples/agent-knowledge-overview.md
@@ -0,0 +1,131 @@
+# Knowledge Base Feature Overview
+
+## Introduction
+
+The Knowledge Base feature in MaIN.NET 0.5.0 enables agents to access and utilize external information sources intelligently. This feature allows agents to provide accurate, context-aware responses by leveraging persisted knowledge from files, web sources, and MCP (Model Context Protocol) servers.
+
+## How Knowledge Base Works
+
+### Core Concept
+The knowledge base system creates an `index.json` file that contains a set of sources with assigned tags. When an agent receives a query, it can:
+
+1. **Evaluate Query Relevance** - Determine if external knowledge is needed
+2. **Tag Matching** - Find relevant knowledge sources based on tags
+3. **Source Selection** - Choose appropriate knowledge items (1-4 tags maximum)
+4. **Context Integration** - Use selected knowledge to enhance responses
+
+### Knowledge Storage Structure
+Knowledge is persisted as an index with the following structure:
+- **Name**: Identifier for the knowledge item
+- **Type**: File, URL, Text, or MCP
+- **Value**: Actual content or reference
+- **Tags**: Array of searchable keywords
+
+## Knowledge Decision Process
+
+The `AnswerCommandHandler` implements intelligent knowledge usage through three modes:
+
+### KnowledgeUsage.UseMemory
+Uses traditional chat memory without external knowledge sources.
+
+### KnowledgeUsage.UseKnowledge
+Intelligently decides whether to use knowledge based on query analysis:
+- Evaluates if the question requires external knowledge
+- Uses knowledge unless certain the question needs only basic facts (like "What is 2+2?" or "Capital of France?")
+- When in doubt, defaults to using external knowledge
+
+### KnowledgeUsage.AlwaysUseKnowledge
+Always processes queries through the knowledge base system.
+
+## Knowledge Types Supported
+
+### File Sources (KnowledgeItemType.File)
+- Local files (Markdown, text, etc.)
+- Added to `ChatMemoryOptions.FilesData`
+- Example: Company documentation, policies, manuals
+
+### Web Sources (KnowledgeItemType.Url)
+- Web pages and online resources
+- Added to `ChatMemoryOptions.WebUrls`
+- Example: Online tutorials, documentation sites
+
+### Text Sources (KnowledgeItemType.Text)
+- Direct text content
+- Added to `ChatMemoryOptions.TextData`
+- Example: Structured data, snippets
+
+### MCP Sources (KnowledgeItemType.Mcp)
+- Model Context Protocol servers
+- Handled through `mcpService.Prompt()`
+- Example: GitHub integration, filesystem access, web search
+
+## Tag-Based Retrieval System
+
+The system uses JSON-serialized tag matching:
+
+1. **Index Creation**: Available knowledge sources with their tags
+2. **Query Analysis**: LLM determines relevant tags for the user query
+3. **Source Filtering**: Finds knowledge items where tags intersect with query tags or match item names
+4. **Context Building**: Selected sources are integrated into the chat context
+
+## KnowledgeBuilder API
+
+The `KnowledgeBuilder` provides a fluent API for constructing knowledge bases:
+
+### File Sources
+```csharp
+KnowledgeBuilder.Instance
+    .AddFile("filename", "./path/to/file.md", tags: ["tag1", "tag2"])
+```
+
+### Web Sources
+```csharp
+KnowledgeBuilder.Instance
+    .AddUrl("source_name", "https://example.com", tags: ["web", "tutorial"])
+```
+
+### MCP Sources
+```csharp
+KnowledgeBuilder.Instance
+    .AddMcp(new Mcp
+    {
+        Name = "ServerName",
+        Command = "npx",
+        Arguments = ["server-package"],
+        Backend = BackendType.OpenAi,
+        Model = "gpt-4"
+    }, ["mcp", "external"])
+```
+
+## Integration with Agents
+
+Knowledge bases integrate seamlessly with MaIN.NET agents:
+
+```csharp
+var context = await AIHub.Agent()
+    .WithModel("your-model")
+    .WithKnowledge(KnowledgeBuilder.Instance
+        .AddFile(...)
+        .AddUrl(...)
+        .AddMcp(...))
+    .WithSteps(StepBuilder.Instance
+        .AnswerUseKnowledge()
+        .Build())
+    .CreateAsync();
+```
+
+## Performance and Limitations
+
+- **Tag Limit**: Returns 1-4 tags maximum per query to maintain focus
+- **MCP Limitation**: Cannot combine responses from multiple MCP servers in one request
+- **Grammar Control**: Uses structured grammars for decision-making and tag selection
+- **Memory Integration**: Knowledge sources are added to chat memory for processing
+
+## Notification System
+
+The system provides progress notifications showing:
+- Selected knowledge items (Name|Type format)
+- Model being used for processing
+- Real-time updates during knowledge processing
+
+This knowledge base implementation provides a lightweight yet powerful way to enhance agent capabilities with external information sources while maintaining performance and accuracy.
diff --git a/src/assets/docs/examples/example-agent-knowledge-file.md b/src/assets/docs/examples/example-agent-knowledge-file.md
@@ -0,0 +1,170 @@
+# Agent with Knowledge File Example
+
+## Overview
+
+This example demonstrates how to create an agent that can access and query local file-based knowledge sources. The agent acts as a company assistant for "TechVibe Solutions," helping employees find answers about company information stored in multiple markdown files.
+
+## Code Example
+
+```csharp
+using Examples.Utils;
+using MaIN.Core.Hub;
+using MaIN.Core.Hub.Utils;
+using MaIN.Domain.Entities.Agents.AgentSource;
+
+namespace Examples.Agents;
+
+public class AgentWithKnowledgeFileExample : IExample
+{
+    public async Task Start()
+    {
+        Console.WriteLine("Agent with knowledge base example");
+        AIHub.Extensions.DisableLLamaLogs();
+
+        var context = await AIHub.Agent()
+            .WithModel("gemma3:4b")
+            .WithInitialPrompt("""
+                You are a helpful assistant that answers questions about a company. Try to
+                help employees find answers to their questions. Company you work for is TechVibe Solutions.
+                """)
+            .WithKnowledge(KnowledgeBuilder.Instance
+                .AddFile("people.md", "./Files/Knowledge/people.md", 
+                    tags: ["workers", "employees", "company"])
+                .AddFile("organization.md", "./Files/Knowledge/organization.md",
+                    tags:["company structure", "company policy", "company culture", "company overview"])
+                .AddFile("events.md", "./Files/Knowledge/events.md",
+                    tags: ["company events", "company calendar", "company agenda"])
+                .AddFile("office_layout.md", "./Files/Knowledge/office_layout.md",
+                    tags: ["company layout", "company facilities", "company environment", "office items", "supplies"]))
+            .WithSteps(StepBuilder.Instance
+                .AnswerUseKnowledge()
+                .Build())
+            .CreateAsync();
+
+        var result = await context
+            .ProcessAsync("Hey! Where I can find some printer paper?");
+        Console.WriteLine(result.Message.Content);
+    }
+}
+```
+
+## Key Components
+
+### Model Configuration
+- **Model**: `gemma3:4b` - A lightweight local model suitable for company assistance tasks
+- **Initial Prompt**: Establishes the agent's role as a TechVibe Solutions company assistant
+
+### Knowledge Sources
+
+The example demonstrates four different file-based knowledge sources:
+
+#### 1. People Directory (`people.md`)
+- **Purpose**: Employee information and contact details
+- **Tags**: `["workers", "employees", "company"]`
+- **Use Cases**: Finding colleagues, contact information, organizational structure
+
+#### 2. Organization Information (`organization.md`)
+- **Purpose**: Company structure, policies, and culture documentation
+- **Tags**: `["company structure", "company policy", "company culture", "company overview"]`
+- **Use Cases**: Understanding company hierarchy, policies, cultural guidelines
+
+#### 3. Events Calendar (`events.md`)
+- **Purpose**: Company events and calendar information
+- **Tags**: `["company events", "company calendar", "company agenda"]`
+- **Use Cases**: Upcoming meetings, company events, important dates
+
+#### 4. Office Layout (`office_layout.md`)
+- **Purpose**: Physical office information, facilities, and supplies
+- **Tags**: `["company layout", "company facilities", "company environment", "office items", "supplies"]`
+- **Use Cases**: Finding office resources, navigation, facility information
+
+### Step Configuration
+- Uses `AnswerUseKnowledge()` which intelligently determines when to access knowledge sources
+- The system evaluates each query to decide if external knowledge is needed
+
+## Example Query Flow
+
+When a user asks: **"Hey! Where I can find some printer paper?"**
+
+1. **Query Analysis**: The system analyzes the question and identifies relevant tags
+2. **Knowledge Matching**: Matches tags like "office items" and "supplies" to `office_layout.md`
+3. **Content Retrieval**: Loads the relevant file content into memory
+4. **Response Generation**: Uses the file content to provide specific location information
+
+## File Structure Requirements
+
+The example expects the following file structure:
+```
+./Files/Knowledge/
+├── people.md
+├── organization.md
+├── events.md
+└── office_layout.md
+```
+
+## Use Cases
+
+This pattern is ideal for:
+
+### Corporate Knowledge Bases
+- Employee handbooks
+- Policy documentation
+- Organizational charts
+- Office information
+
+### Documentation Systems
+- Technical documentation
+- User manuals
+- FAQ collections
+- Standard operating procedures
+
+### Educational Content
+- Course materials
+- Reference guides
+- Learning resources
+- Curriculum information
+
+## Best Practices
+
+### Tag Selection
+- Use specific, descriptive tags that match how users might ask questions
+- Include both broad categories (`"company"`) and specific terms (`"supplies"`)
+- Consider synonyms and alternative ways users might phrase queries
+
+### File Organization
+- Keep files focused on specific topics
+- Use clear, descriptive filenames
+- Maintain consistent formatting within files
+- Regular updates to keep information current
+
+### Model Selection
+- Local models like `gemma3:4b` work well for company-specific knowledge
+- Consider model size vs. response quality based on your needs
+- Test with your specific knowledge content to ensure good performance
+
+## Integration Tips
+
+### Error Handling
+Consider adding error handling for missing files:
+```csharp
+// Verify files exist before creating agent
+var filePaths = new[] { 
+    "./Files/Knowledge/people.md",
+    "./Files/Knowledge/organization.md",
+    // ... other files
+};
+
+foreach (var path in filePaths)
+{
+    if (!File.Exists(path))
+        throw new FileNotFoundException($"Knowledge file not found: {path}");
+}
+```
+
+### Dynamic Content
+For frequently updated content, consider:
+- Implementing file watchers to detect changes
+- Periodic knowledge base rebuilding
+- Version control integration for content management
+
+This example provides a solid foundation for building file-based knowledge systems that can scale to handle extensive corporate or domain-specific information repositories.