Agentic RAG - Advanced Retrieval-Augmented Generation

Agentic RAG brings autonomous, self-improving capabilities to RAG systems through intelligent decorators and adaptive strategies.

Overview

Agentic RAG transforms traditional RAG into an intelligent, self-improving system that:

Plans complex queries automatically
Reflects on result quality and improves
Adapts retrieval strategies dynamically
Reasons across multiple documents
Learns from user feedback
Verifies sources and generates citations

Quick Start

import { AgenticRAGService } from '@hazeljs/rag/agentic';
import { MemoryVectorStore } from '@hazeljs/rag';
import { OpenAIEmbeddings } from '@hazeljs/ai';

// Initialize
const vectorStore = new MemoryVectorStore(new OpenAIEmbeddings());
const agenticRAG = new AgenticRAGService({ vectorStore });

// Use with all agentic features
const results = await agenticRAG.retrieve('complex query about AI');

Features

1. Query Planning & Decomposition

Automatically breaks complex queries into manageable sub-queries.

@QueryPlanner({
  decompose: true,
  maxSubQueries: 5,
  parallel: true
})
async retrieve(query: string): Promise<SearchResult[]> {
  // Automatically decomposes and executes sub-queries
}

Benefits:

Handles complex multi-part questions
Parallel execution for speed
Better coverage of query aspects

2. Self-Reflection & Correction

Evaluates result quality and iteratively improves.

@SelfReflective({
  maxIterations: 3,
  qualityThreshold: 0.8,
  enableAutoImprovement: true
})
async retrieve(query: string): Promise<SearchResult[]> {
  // Automatically reflects and improves results
}

Benefits:

Higher quality results
Self-correcting errors
Confidence scoring

3. Adaptive Retrieval Strategy

Dynamically selects the best retrieval method.

@AdaptiveRetrieval({
  strategies: ['similarity', 'hybrid', 'mmr'],
  autoSelect: true,
  contextAware: true
})
async retrieve(query: string): Promise<SearchResult[]> {
  // Automatically chooses best strategy
}

Strategies:

Similarity: Semantic search
Hybrid: Keyword + semantic
MMR: Diverse results

4. Multi-Hop Reasoning

Chains multiple retrieval steps for complex reasoning.

@MultiHop({
  maxHops: 3,
  strategy: 'breadth-first'
})
async deepRetrieve(query: string): Promise<ReasoningChain> {
  // Performs multi-hop reasoning
}

Benefits:

Answers complex questions
Connects information across documents
Provides reasoning chain

5. HyDE (Hypothetical Document Embeddings)

Generates hypothetical answers to improve retrieval.

@HyDE({
  generateHypothesis: true,
  numHypotheses: 3
})
async hydeRetrieve(query: string): Promise<SearchResult[]> {
  // Uses hypothetical documents
}

Benefits:

Better retrieval for abstract queries
Improved semantic matching
State-of-the-art technique

6. Corrective RAG (CRAG)

Self-corrects retrieval errors with fallback mechanisms.

@CorrectiveRAG({
  relevanceThreshold: 0.7,
  fallbackToWeb: true
})
async correctiveRetrieve(query: string): Promise<SearchResult[]> {
  // Self-corrects low-quality results
}

Benefits:

Detects low-quality results
Automatic correction
Fallback mechanisms

7. Context-Aware Retrieval

Maintains conversation context for better results.

@ContextAware({
  windowSize: 5,
  entityTracking: true,
  topicModeling: true
})
async conversationalRetrieve(query: string, sessionId: string): Promise<SearchResult[]> {
  // Uses conversation context
}

Benefits:

Conversational memory
Entity tracking
Topic continuity

8. Query Rewriting

Expands and rewrites queries for better coverage.

@QueryRewriter({
  techniques: ['expansion', 'clarification', 'synonym'],
  llmBased: true
})
async rewriteQuery(query: string): Promise<SearchResult[]> {
  // Generates query variations
}

Techniques:

Expansion: Add context
Synonyms: Alternative terms
Clarification: More specific

9. Source Verification

Verifies sources and generates citations.

@SourceVerification({
  checkFreshness: true,
  verifyAuthority: true,
  requireCitations: true
})
async verifiedRetrieve(query: string): Promise<VerifiedResponse> {
  // Returns verified sources with citations
}

Checks:

Source authority
Content freshness
Relevance scores

10. Active Learning

Learns from user feedback to improve over time.

@ActiveLearning({
  feedbackEnabled: true,
  retrainThreshold: 100
})
async learningRetrieve(query: string): Promise<SearchResult[]> {
  // Learns from feedback
}

@Feedback()
async provideFeedback(resultId: string, rating: number): Promise<void> {
  // Store feedback
}

Benefits:

Continuous improvement
Personalization
Adaptive ranking

11. Caching

Smart caching for performance.

@Cached({
  ttl: 3600,
  maxSize: 100
})
async retrieve(query: string): Promise<SearchResult[]> {
  // Cached results
}

Features:

LRU eviction
TTL expiration
Hit rate tracking

Complete Example

import {
  AgenticRAGService,
  QueryPlanner,
  SelfReflective,
  AdaptiveRetrieval,
  MultiHop,
  HyDE,
  CorrectiveRAG,
  ContextAware,
  QueryRewriter,
  SourceVerification,
  ActiveLearning,
  Cached,
} from '@hazeljs/rag/agentic';

class ResearchAssistant {
  constructor(private vectorStore: VectorStore) {}

  /**
   * Production-ready retrieval with all features
   */
  @QueryPlanner({ decompose: true, maxSubQueries: 5, parallel: true })
  @SelfReflective({ maxIterations: 3, qualityThreshold: 0.8 })
  @AdaptiveRetrieval({ autoSelect: true, contextAware: true })
  @HyDE({ generateHypothesis: true, numHypotheses: 3 })
  @CorrectiveRAG({ relevanceThreshold: 0.7, fallbackToWeb: true })
  @ContextAware({ windowSize: 5, entityTracking: true, topicModeling: true })
  @QueryRewriter({ techniques: ['expansion', 'synonym'], llmBased: true })
  @SourceVerification({ checkFreshness: true, verifyAuthority: true, requireCitations: true })
  @ActiveLearning({ feedbackEnabled: true, retrainThreshold: 100 })
  @Cached({ ttl: 3600 })
  async research(query: string, sessionId: string): Promise<SearchResult[]> {
    return this.vectorStore.search(query, { sessionId } as any);
  }

  @Feedback()
  async provideFeedback(resultId: string, rating: number, relevant: boolean): Promise<void> {
    // Feedback stored automatically
  }
}

// Usage
const assistant = new ResearchAssistant(vectorStore);

// Execute research with all agentic features
const results = await assistant.research(
  'Compare machine learning approaches for natural language processing',
  'session-123'
);

// Provide feedback
await assistant.provideFeedback(results[0].id, 5, true);

Key Differentiators

Decorator-First Design - Clean, composable API
Self-Improving - Learns and adapts automatically
Production-Ready - Built-in caching, error handling
Type-Safe - Full TypeScript support
Observable - Metadata for monitoring
Extensible - Easy to add custom decorators

Performance

Query Planning: 2-3x better coverage on complex queries
Self-Reflection: 15-20% improvement in result quality
HyDE: 10-15% better retrieval for abstract queries
Caching: 10x faster for repeated queries
Active Learning: Continuous improvement over time

Configuration

Global Configuration

const config = {
  vectorStore: myVectorStore,
  llmProvider: myLLMProvider,
  enableAllFeatures: true,
};

const agenticRAG = new AgenticRAGService(config);

Per-Method Configuration

Each decorator accepts its own configuration:

@QueryPlanner({ decompose: true, maxSubQueries: 5 })
@SelfReflective({ maxIterations: 3, qualityThreshold: 0.8 })
@Cached({ ttl: 3600, maxSize: 100 })
async retrieve(query: string): Promise<SearchResult[]> {
  // Custom configuration per method
}

Monitoring

Access metadata from decorators:

import { getQueryPlan, getReflections, getAdaptiveStrategy } from '@hazeljs/rag/agentic';

// Get query plan
const plan = getQueryPlan(target, 'retrieve');

// Get reflection results
const reflections = getReflections(target, 'retrieve');

// Get adaptive strategy
const strategy = getAdaptiveStrategy(target, 'retrieve');

Best Practices

Start Simple - Begin with basic decorators, add more as needed
Monitor Performance - Track metrics and adjust thresholds
Provide Feedback - Enable active learning for continuous improvement
Cache Wisely - Balance freshness vs performance
Verify Sources - Always verify for production use cases

Advanced Topics

Custom Decorators

Create your own agentic decorators:

export function CustomDecorator(config: CustomConfig) {
  return function (target: any, propertyKey: string, descriptor: PropertyDescriptor) {
    const originalMethod = descriptor.value;
    
    descriptor.value = async function (...args: any[]) {
      // Pre-processing
      const result = await originalMethod.apply(this, args);
      // Post-processing
      return result;
    };
    
    return descriptor;
  };
}

Combining with Agent Runtime

Integrate Agentic RAG with the Agent Runtime for stateful, persistent research workflows:

import { Agent, Tool, AgentRuntime } from '@hazeljs/agent';
import { DatabaseStateManager } from '@hazeljs/agent/state';
import { AgenticRAGService } from '@hazeljs/rag/agentic';
import { PrismaClient } from '@prisma/client';

@Agent({ name: 'research-agent' })
class ResearchAgent {
  constructor(private agenticRAG: AgenticRAGService) {}

  @Tool({ description: 'Research a topic with multi-hop reasoning' })
  async research(query: string): Promise<any> {
    return this.agenticRAG.retrieve(query);
  }
  
  @Tool({ description: 'Deep research with context awareness' })
  async deepResearch(query: string, context: any): Promise<any> {
    // Uses conversation history from agent state
    return this.agenticRAG.retrieve(query);
  }
}

// Setup with database persistence
const prisma = new PrismaClient();
const stateManager = new DatabaseStateManager({
  client: prisma,
  softDelete: true,
  autoArchive: true,
});

const runtime = new AgentRuntime({
  stateManager,
  // ... other config
});

// Execute with persistent state
const result = await runtime.execute('research-agent', 'complex research query', {
  sessionId: 'user-123',
  enableMemory: true,
  enableRAG: true,
});

// State is persisted to database - can resume later
await runtime.resume(result.executionId, 'follow-up question');

Benefits of Database Persistence:

Stateful Research Sessions: Maintain context across multiple research queries
Resume Long-Running Research: Pause and resume complex multi-hop reasoning
Audit Trail: Full history of research steps and decisions
Session Management: Track research sessions per user
Query History: Access previous research results and context

Use Cases

Research Assistants - Deep research with multi-hop reasoning
Customer Support - Context-aware conversational retrieval
Legal Research - Verified sources with citations
Medical Q&A - High-quality, self-correcting answers
Knowledge Management - Adaptive, learning-enabled search

Integration with RAG Package

Agentic RAG extends the core RAG package with intelligent capabilities:

import { RAGPipeline, MemoryVectorStore } from '@hazeljs/rag';
import { AgenticRAGService } from '@hazeljs/rag/agentic';

// Use with standard RAG pipeline
const pipeline = new RAGPipeline({
  vectorStore: new MemoryVectorStore(embeddings),
  // ... other config
});

// Enhance with agentic capabilities
const agenticRAG = new AgenticRAGService({
  vectorStore: pipeline.getVectorStore(),
});

License

MIT