How to configure RAG generation settings in IntelliWP, complete WordPress AI agent setup

June 11, 2025

Configuring your WordPress RAG system’s generation settings correctly is crucial for creating a WordPress AI agent that truly understands your business and speaks with your brand’s voice. In this comprehensive guide, we’ll walk you through every aspect of the Generation Settings tab in IntelliWP, showing you exactly how to customize your WordPress AI agent’s behavior for optimal performance.

What are WordPress RAG generation settings?

The Generation Settings control how your WordPress AI agent formulates and delivers responses to users. Unlike the retrieval phase that finds relevant content, the generation phase determines the style, tone, and format of the final answer your visitors receive. Think of it as training your WordPress RAG system’s communication skills.

Accessing WordPress RAG generation settings in IntelliWP

To configure your WordPress AI agent generation settings:

  1. Navigate to your WordPress admin dashboard
  2. Go to IntelliWP → RAG Configuration
  3. Click on the Generation Settings tab
  4. You’ll see the main configuration panel with four key sections

LLM model selection and fine-tuning for WordPress AI agent

Available model options

The first setting you’ll encounter is the LLM Model dropdown, which offers two categories:

Base Models:

  • gpt-3.5-turbo: Fast, cost-effective, ideal for straightforward customer service
  • gpt-4: More sophisticated reasoning, better for complex queries
  • gpt-4-turbo: Latest version with improved performance and larger context window

Fine-Tuned Models:

  • Custom LLM fine-tuning models trained specifically on your content
  • Organization-specific adaptations you’ve created through fine-tuning
  • Industry-specialized versions developed via LLM fine-tuning processes

Understanding LLM fine-tuning for WordPress RAG

LLM fine-tuning allows you to create specialized models that understand your specific business domain, terminology, and communication style. When you use fine-tuning with your WordPress RAG system, you’re essentially teaching the AI to speak your language more naturally.

Choosing the right model for your WordPress RAG system

For most WordPress sites, gpt-3.5-turbo provides excellent results at a reasonable cost. LLM fine-tuning allows you to create specialized models that understand your specific business domain, terminology, and communication style. When you use LLM fine-tuning with your WordPress RAG system, you’re essentially teaching the AI to speak your language more naturally.

Consider LLM fine-tuning when:

  • Complex technical explanations
  • Multi-step problem solving
  • Nuanced understanding of industry-specific terms

Temperature control, adjusting WordPress AI agent response creativity

Understanding the temperature slider for WordPress RAG

The temperature setting appears as a slider ranging from 0 to 1. This critical parameter controls how creative or consistent your WordPress AI agent responses will be.

Default Setting: 0.3 (recommended for most business applications)

Temperature values explained

0.0 – 0.2 (Highly Deterministic)

  • Produces nearly identical responses to similar questions
  • Best for: FAQ responses, pricing information, technical specifications
  • Trade-off: Can feel robotic or repetitive

0.3 – 0.4 (Balanced Professional)

  • Maintains consistency while allowing natural variation
  • Best for: Customer support, product information, general business queries
  • Trade-off: Optimal balance for most use cases

0.5 – 0.7 (Creative Variation)

  • More personality and linguistic variety
  • Best for: Marketing content, engaging conversations, brand storytelling
  • Trade-off: Slightly less predictable responses

0.8 – 1.0 (Highly Creative)

  • Maximum creativity and variation
  • Best for: Content creation, brainstorming, creative writing
  • Trade-off: May occasionally produce unexpected responses

Maximum length configuration

Setting token limits

The Maximum Length field controls how long your AI responses can be, measured in tokens (roughly 3-4 characters per token).

Configuration Options:

  • Minimum: 100 tokens
  • Maximum: 4000 tokens
  • Default: 1000 tokens
  • Increment: 100 tokens

Recommended token ranges

Short Responses (200-400 tokens):

  • Ideal for: Quick answers, contact information, simple explanations
  • Example use: “What are your business hours?”

Medium Responses (500-800 tokens):

  • Ideal for: Product descriptions, process explanations, moderate detail
  • Example use: “How does your return policy work?”

Long Responses (1000-1500 tokens):

  • Ideal for: Detailed guides, comprehensive comparisons, tutorials
  • Example use: “Explain your onboarding process”

Extended Responses (1500+ tokens):

  • Ideal for: Technical documentation, in-depth analysis
  • Example use: “Compare all your service plans in detail”

System prompt, defining your WordPress AI agent’s personality

The WordPress RAG system prompt interface

The System Prompt is a large text area where you define your WordPress AI agent’s personality, behavior, and response guidelines. This is the most important setting for customizing your WordPress RAG system’s voice.

Default system prompt

IntelliWP starts with this basic prompt:

“You are a specialized assistant for the website. Respond to queries based only on the information in the following context. If the information is not in the context, honestly indicate that you don’t have that information.”

Customizing your WordPress RAG system prompt

Here’s how to create effective system prompts for different business types using WordPress AI agent:

Professional services system prompt

You are a professional customer service representative for [Your Company Name]. 
Maintain a courteous, helpful tone while being solution-oriented. 
Always thank customers for their inquiries and provide clear, accurate information. 
If you don't have specific information, offer to connect them with a specialist. 
Use professional language and keep responses focused and concise.
Answer only based on the provided context.

E-commerce store system prompt

You are a knowledgeable sales assistant for [Your Store Name]. 
Be enthusiastic about our products while being honest and helpful. 
Help customers find the right products for their needs and provide 
detailed information about features, pricing, and availability. 
When you don't have specific product information, acknowledge this 
and suggest contacting our sales team.
Answer only based on the provided context.

Technical support system prompt

You are a technical support specialist for [Your Company Name]. 
Provide clear, step-by-step instructions and accurate technical information. 
Use appropriate technical terminology but explain complex concepts simply. 
When troubleshooting, ask clarifying questions if needed. 
If you cannot resolve an issue with available information, 
escalate to technical support.
Answer only based on the provided context.

Advanced WordPress RAG configuration tips

Testing your WordPress AI agent configuration

After setting up your WordPress RAG generation parameters, test with these scenarios:

  1. Simple factual question: “What are your contact details?”
  2. Complex product query: “Which service plan would work best for a small business?”
  3. Technical question: “How do I integrate your API?”
  4. Out-of-scope question: “What’s the weather today?”

Optimizing WordPress AI agent for your industry

SaaS Companies:

  • Model: gpt-4
  • Temperature: 0.2-0.3
  • Max Tokens: 800-1200
  • Focus: Technical accuracy, clear explanations for WordPress RAG

E-commerce Sites:

  • Model: gpt-3.5-turbo
  • Temperature: 0.4-0.5
  • Max Tokens: 400-800
  • Focus: Product knowledge, sales assistance with WordPress AI agent

Professional Services:

  • Model: gpt-3.5-turbo or gpt-4
  • Temperature: 0.3-0.4
  • Max Tokens: 600-1000
  • Focus: Expertise demonstration, trust building through WordPress RAG

Common configuration mistakes

Setting temperature too high: Results in inconsistent, unpredictable responses Setting temperature too low: Creates robotic, repetitive interactions Overly long max tokens: Increases costs and may overwhelm users Vague system prompts: Leads to generic, unhelpful WordPress AI agent responses Not testing configurations: Deploys WordPress RAG that doesn’t match brand voice

Monitoring and adjusting WordPress RAG settings

Key performance indicators for WordPress AI agent

Track these metrics to evaluate your WordPress RAG configuration effectiveness:

  • Response Relevance: Are answers staying on-topic and helpful?
  • User Engagement: Are visitors continuing conversations?
  • Query Resolution: Are users finding the information they need?
  • Cost Efficiency: Are token usage and API costs within budget?

When to adjust settings

Increase Temperature When:

  • Responses feel too robotic or repetitive
  • Users want more engaging, conversational interactions
  • Brand voice needs more personality

Decrease Temperature When:

  • Responses are inconsistent or off-brand
  • Accuracy is more important than creativity
  • Dealing with sensitive topics requiring precision

Adjust Max Tokens When:

  • Responses are consistently too short for complex queries
  • Costs are exceeding budget due to long responses
  • Users prefer shorter, more concise answers

Saving and implementing changes

Once you’ve configured your WordPress RAG generation settings:

  1. Click Save Settings at the bottom of the form
  2. Wait for the confirmation message
  3. Test your WordPress AI agent on the frontend to verify changes
  4. Monitor performance for 24-48 hours
  5. Make incremental adjustments as needed

Conclusion

Properly configured WordPress RAG generation settings transform your IntelliWP chatbot from a generic AI assistant into a specialized WordPress AI agent representative of your brand. By carefully tuning the model selection, temperature, token limits, and system prompt, you create a WordPress AI agent that understands your business context and communicates effectively with your audience.

Remember that WordPress RAG configuration is an iterative process. Start with conservative settings, test thoroughly, and gradually optimize based on real user interactions and feedback. With the right generation settings, your IntelliWP WordPress RAG system becomes a powerful tool for customer engagement and support.