The following samples demonstrate advanced usage of the ChatCompletionAgent
.
When configuring chat history management, there are two important settings to consider:
- Purpose: Defines the target number of messages to retain after applying truncation or summarization.
- Controls: Determines how much recent conversation history is preserved, while older messages are either discarded or summarized.
- Recommendations for adjustment:
- Smaller values: Ideal for memory-constrained environments or scenarios where brief context is sufficient.
- Larger values: Useful when retaining extensive conversational context is critical for accurate responses or complex dialogue.
- Purpose: Provides a buffer to prevent premature reduction when the message count slightly exceeds
reducer_msg_count
. - Controls: Ensures essential message pairs (e.g., a user query and the assistant’s response) aren't unintentionally truncated.
- Recommendations for adjustment:
- Smaller values: Use to enforce stricter message reduction criteria, potentially truncating older message pairs sooner.
- Larger values: Recommended for preserving critical conversation segments, particularly in sensitive interactions involving API function calls or detailed responses.
The combination of these parameters determines when history reduction occurs and how much of the conversation is retained.
Example:
- If
reducer_msg_count = 10
andreducer_threshold = 5
, message history won't be truncated until the total message count exceeds 15. This strategy maintains conversational context flexibility while respecting memory limitations.
-
Performance-focused environments:
- Lower
reducer_msg_count
to conserve memory and accelerate processing.
- Lower
-
Context-sensitive scenarios:
- Higher
reducer_msg_count
andreducer_threshold
help maintain continuity across multiple interactions, crucial for multi-turn conversations or complex workflows.
- Higher
-
Iterative Experimentation:
- Start with default values (
reducer_msg_count = 10
,reducer_threshold = 10
), and adjust according to the specific behavior and response quality required by your application.
- Start with default values (