docs: add guide for bot reasoning guardrails #1479

Pouyanpi · 2025-10-28T11:57:03Z

Related Issue(s)

#1427
#1431
#1432
#1434

Checklist

I've read the CONTRIBUTING guidelines.
I've updated the documentation if applicable.
I've added tests if applicable.
@mentions of the person or team responsible for reviewing proposed changes.

github-actions · 2025-10-28T11:58:28Z

Documentation preview

https://nvidia-nemo.github.io/Guardrails/review/pr-1479

codecov-commenter · 2025-10-28T12:04:07Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

greptile-apps

Greptile Overview

Greptile Summary

This PR adds documentation for a new feature that allows developers to access and guardrail bot reasoning traces (exposed via the bot_thinking variable). The documentation covers three access methods - Colang flows, Python actions, and prompt templates - with examples ranging from simple pattern matching to complete self-check output implementations. The guide fits naturally into the advanced user guides section alongside other specialized guardrail features like bot-message-instructions and tools-integration, following the established pattern of documenting complex features with progressive examples and reference implementations.

PR Description Notes:

The description, related issues, and checklist are all empty/unchecked. Consider adding a brief explanation of what bot thinking/reasoning guardrails are and why this documentation was added.

Important Files Changed

Filename	Score	Overview
docs/index.md	1/5	Adds new bot-thinking-guardrails page to TOC but contains critical typo: 'advaced' instead of 'advanced' in file path
docs/user-guides/advanced/bot-thinking-guardrails.md	4/5	Comprehensive new documentation guide explaining bot_thinking variable access patterns with examples and reference implementations

Confidence score: 1/5

This PR contains a critical typo that will break the documentation build and must be fixed before merging.
Score reflects a single-character typo in the TOC file path ('advaced' vs 'advanced') that will prevent Sphinx/MkDocs from locating the new documentation file, causing build failures.
The docs/index.md file requires immediate attention to correct line 71 from user-guides/advaced/bot-thinking-guardrails to user-guides/advanced/bot-thinking-guardrails to match the actual file location.
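A path typo like the one flagged above can be caught before the build with a small check. The following is a minimal sketch, assuming TOC entries are written without file extensions and are relative to the docs directory; the helper name `find_missing_toc_entries` and the `tools-integration` path are hypothetical and not part of the repository's tooling.

```python
from pathlib import PurePosixPath


def find_missing_toc_entries(toc_entries, existing_docs):
    """Return TOC entries that do not match any known document path."""
    # TOC entries usually omit the file extension, so strip it from the
    # known files before comparing.
    known = {str(PurePosixPath(path).with_suffix("")) for path in existing_docs}
    return [entry for entry in toc_entries if entry not in known]


# Hypothetical inputs: the first entry reproduces the 'advaced' typo.
toc = [
    "user-guides/advaced/bot-thinking-guardrails",
    "user-guides/advanced/tools-integration",
]
files = [
    "user-guides/advanced/bot-thinking-guardrails.md",
    "user-guides/advanced/tools-integration.md",
]
missing = find_missing_toc_entries(toc, files)
```

Here `missing` contains only the misspelled entry, which is the one the build would fail to resolve.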

Sequence Diagram

sequenceDiagram
    participant User
    participant LLMRails
    participant ReasoningLLM as Reasoning LLM<br/>(Main Model)
    participant OutputRails as Output Rails
    participant ColangFlow as Colang Flow<br/>(check_reasoning)
    participant CustomAction as Custom Action<br/>(check_reasoning_quality)
    participant SelfCheckLLM as Self-Check LLM<br/>(Moderation)
    participant PromptTemplate as Prompt Template

    User->>LLMRails: Send user message
    LLMRails->>ReasoningLLM: Generate response with reasoning
    ReasoningLLM-->>LLMRails: Return response + reasoning trace
    LLMRails->>LLMRails: Extract reasoning to $bot_thinking variable
    
    alt Output Rails with Colang Flow
        LLMRails->>OutputRails: Trigger output rails
        OutputRails->>ColangFlow: Execute flow with $bot_thinking
        ColangFlow->>ColangFlow: Check if "confidential" in $bot_thinking
        alt Contains sensitive content
            ColangFlow-->>OutputRails: Block response
            OutputRails-->>LLMRails: bot refuse to respond
        else Safe content
            ColangFlow-->>OutputRails: Allow response
        end
    end
    
    alt Output Rails with Custom Action
        LLMRails->>OutputRails: Trigger output rails
        OutputRails->>CustomAction: execute check_reasoning_quality(context)
        CustomAction->>CustomAction: Get context.get("bot_thinking")
        CustomAction->>CustomAction: Check forbidden patterns
        alt Forbidden pattern found
            CustomAction-->>OutputRails: Return False
            OutputRails-->>LLMRails: bot refuse to respond
        else Safe reasoning
            CustomAction-->>OutputRails: Return True
        end
    end
    
    alt Self-Check Output with Reasoning
        LLMRails->>OutputRails: Trigger self check output
        OutputRails->>PromptTemplate: Render prompt with {{ bot_thinking }}
        PromptTemplate-->>OutputRails: Prompt includes bot_response and bot_thinking
        OutputRails->>SelfCheckLLM: Send prompt for moderation
        SelfCheckLLM->>SelfCheckLLM: Evaluate response and reasoning
        alt Should block
            SelfCheckLLM-->>OutputRails: "Yes"
            OutputRails-->>LLMRails: Block response
        else Should allow
            SelfCheckLLM-->>OutputRails: "No"
            OutputRails-->>LLMRails: Allow response
        end
    end
    
    LLMRails-->>User: Return final response or refusal
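The custom-action branch of the diagram can be sketched in plain Python. This is a minimal sketch rather than the guide's implementation: the function name and the `context.get("bot_thinking")` lookup come from the diagram, the `FORBIDDEN_PATTERNS` values are hypothetical, and registering the function as a NeMo Guardrails action is omitted so the snippet runs on its own.

```python
from typing import Optional

# Hypothetical phrases that should never appear in a reasoning trace.
FORBIDDEN_PATTERNS = ("confidential", "internal only")


def check_reasoning_quality(context: Optional[dict] = None) -> bool:
    """Return True when the reasoning trace looks safe, False to block."""
    # The trace may be missing or None when the model produced no reasoning,
    # so fall back to an empty string.
    bot_thinking = (context or {}).get("bot_thinking") or ""
    lowered = bot_thinking.lower()
    # Case-insensitive substring match against each forbidden phrase.
    return not any(pattern in lowered for pattern in FORBIDDEN_PATTERNS)
```

Returning `False` corresponds to the "Forbidden pattern found" branch, where the output rail refuses to respond. Substring matching is a toy check, so a production rail would likely need something more robust.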

2 files reviewed, 3 comments

docs/index.md

docs/user-guides/advanced/bot-thinking-guardrails.md

greptile-apps

Greptile Overview

Greptile Summary

This review covers only the changes made since the last review, not the entire PR. A critical syntax error was introduced on line 10 of docs/index.md, where an HTML comment closing tag is malformed: <!-- end-documentation-reuse -- is missing its closing >. This will almost certainly break the documentation build process. The previously identified typo on line 71 (path correction from 'advaced' to 'advanced') appears to remain unfixed. No other files were modified in this iteration.

Important Files Changed

Filename	Score	Overview
docs/index.md	1/5	Introduced a malformed HTML comment tag on line 10 that will likely break documentation builds; typo fix on line 71 may still be pending

Confidence score: 0/5

This PR will almost certainly cause immediate documentation build failures and should not be merged in its current state.
Score reflects a critical syntax error (malformed HTML comment on line 10) that will break the Sphinx/MkDocs build pipeline, making the documentation generation fail completely.
Pay immediate attention to docs/index.md line 10: the comment tag must be properly closed as <!-- end-documentation-reuse --> before this can be merged.

Additional Comments (1)

docs/index.md, line 10 (link)

syntax: incomplete closing comment tag: missing closing >
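An unterminated HTML comment of this kind can be detected with a short scan. The sketch below is an illustration only: the helper name `find_unclosed_comments` is hypothetical, and it assumes comments do not nest. It will also miss a malformed comment that is followed later by a well-formed one, because the later `-->` appears to close it.

```python
def find_unclosed_comments(text):
    """Return 1-based line numbers where an HTML comment opens but never closes."""
    unclosed = []
    pos = 0
    while True:
        start = text.find("<!--", pos)
        if start == -1:
            break
        # Look for the terminator after the opening marker.
        end = text.find("-->", start + 4)
        if end == -1:
            # Convert the character offset to a line number.
            unclosed.append(text.count("\n", 0, start) + 1)
            break
        pos = end + 3
    return unclosed


sample = "<!-- start -->\ntext\n<!-- end-documentation-reuse --\n"
broken_lines = find_unclosed_comments(sample)
```

For `sample`, the opening marker on the third line has no matching `-->`, so that line is reported.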

1 file reviewed, 1 comment

greptile-apps

Greptile Overview

Greptile Summary

This review covers only the changes made since the last review, not the entire PR. The developer has addressed previous feedback by updating GitHub repository links from commit-specific URLs to branch-based URLs (develop). Specifically, the links on lines 146 and 200 were changed to match the branch-based format already used on line 194, ensuring consistency and maintainability across all three reference links in the documentation. This change ensures that all documentation links point to the current codebase rather than historical snapshots.
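Commit-pinned links such as the ones replaced here can be found with a regular expression. This is a rough sketch under stated assumptions: a ref made of 7 to 40 hexadecimal characters is treated as a commit hash, so a branch whose name happens to be all hex would be a false positive. The helper name and the `example-org/example-repo` URLs are hypothetical.

```python
import re

# GitHub blob/tree URLs whose ref looks like an abbreviated or full commit hash.
COMMIT_URL = re.compile(
    r"https://github\.com/[\w.-]+/[\w.-]+/(?:blob|tree)/"
    r"[0-9a-f]{7,40}(?=/|\s|$)(?:/\S*)?"
)


def find_commit_pinned_links(text):
    """Return every GitHub link in text that points at a specific commit."""
    return [match.group(0) for match in COMMIT_URL.finditer(text)]


doc = (
    "See https://github.com/example-org/example-repo/blob/abc1234/examples/config.yml "
    "and https://github.com/example-org/example-repo/blob/develop/examples/config.yml"
)
pinned = find_commit_pinned_links(doc)
```

Only the first URL is reported, because `develop` is not a hex string.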

Important Files Changed

Filename	Score	Overview
docs/user-guides/advanced/bot-thinking-guardrails.md	5/5	Updated two GitHub links from commit-specific to branch-based URLs for consistency

Confidence score: 5/5

This PR is safe to merge with minimal risk
All previous style and consistency issues have been resolved; the changes are purely documentation link updates with no code impact
No files require special attention

1 file reviewed, 1 comment

docs/user-guides/advanced/bot-thinking-guardrails.md

miyoungc · 2025-10-30T20:37:11Z

Doc edit PR: #1484

greptile-apps

Greptile Overview

Greptile Summary

Adds comprehensive documentation for the bot reasoning guardrails feature, explaining how to access and control LLM reasoning traces through the bot_thinking variable.

Key additions:

Introduction to reasoning-capable LLMs and the bot_thinking variable
Three access patterns: Colang flows, custom actions, and prompt templates
Three guardrailing approaches: output rails with pattern matching, custom actions, and self-check output
Complete working examples aligned with existing self_check_thinking configuration
Proper cross-references to related documentation and examples

Quality indicators:

All code examples are syntactically correct and match existing patterns in the codebase
All referenced files and examples exist and are accurate
Clear disclaimers about toy examples vs production code
Consistent formatting and proper use of admonitions (note, important)
Previous feedback addressed (typos fixed in docs/index.md)

Confidence Score: 5/5

This documentation PR is safe to merge with no issues found
This is a documentation-only PR that adds a well-structured guide with accurate code examples, proper cross-references to existing files, and clear disclaimers. All referenced paths exist, syntax is correct, and previous feedback has been addressed.
No files require special attention

Important Files Changed

File Analysis

Filename	Score	Overview
docs/user-guides/advanced/bot-thinking-guardrails.md	5/5	New comprehensive documentation guide for bot reasoning guardrails with clear examples and proper structure

Sequence Diagram

sequenceDiagram
    participant User
    participant NeMo as NeMo Guardrails
    participant LLM as Reasoning LLM
    participant Rail as Output Rail
    participant Action as Custom Action

    User->>NeMo: Send user message
    NeMo->>LLM: Generate response
    LLM-->>NeMo: Response + reasoning trace
    NeMo->>NeMo: Extract reasoning to bot_thinking
    
    alt Output Rail with Pattern Matching
        NeMo->>Rail: Check bot_thinking variable
        Rail->>Rail: Match patterns (e.g., "confidential")
        alt Pattern found
            Rail-->>NeMo: Block response
            NeMo-->>User: Refusal message
        else Pattern not found
            Rail-->>NeMo: Allow response
            NeMo-->>User: Original response
        end
    end
    
    alt Output Rail with Custom Action
        NeMo->>Action: Execute check_reasoning_quality(context)
        Action->>Action: Access context.get("bot_thinking")
        Action->>Action: Validate against forbidden patterns
        Action-->>NeMo: Return True/False
        alt Action returns False
            NeMo-->>User: Refusal message
        else Action returns True
            NeMo-->>User: Original response
        end
    end
    
    alt Self-Check Output
        NeMo->>Rail: Trigger self check output
        Rail->>Rail: Render prompt with bot_thinking
        Rail->>LLM: Send moderation request
        LLM-->>Rail: Should block? (Yes/No)
        alt Should block
            Rail-->>NeMo: Block response
            NeMo-->>User: Refusal message
        else Should not block
            Rail-->>NeMo: Allow response
            NeMo-->>User: Original response
        end
    end
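The self-check branch of the diagram amounts to rendering a prompt that includes the reasoning trace and then interpreting a Yes/No answer. The sketch below illustrates that shape with the standard library only. The template wording is hypothetical, and the tiny `{{ name }}` renderer is a stand-in for whatever template engine the real prompt configuration uses. No LLM call is made.

```python
import re

# Hypothetical prompt text; the wording is illustrative, not the shipped prompt.
SELF_CHECK_TEMPLATE = (
    "Bot response: {{ bot_response }}\n"
    "Bot reasoning: {{ bot_thinking }}\n"
    "Should the response be blocked (Yes or No)?"
)

PLACEHOLDER = re.compile(r"\{\{\s*(\w+)\s*\}\}")


def render_prompt(template, variables):
    """Substitute {{ name }} placeholders; missing or None values become ''."""
    return PLACEHOLDER.sub(
        lambda match: str(variables.get(match.group(1)) or ""), template
    )


def should_block(moderation_answer):
    """Per the diagram, an answer of "Yes" means the response is blocked."""
    return moderation_answer.strip().lower().startswith("yes")


prompt = render_prompt(
    SELF_CHECK_TEMPLATE,
    {"bot_response": "Here is the summary.", "bot_thinking": "Step 1: recall the policy."},
)
```

The rendered `prompt` would be sent to the moderation model, and `should_block` would be applied to its reply.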

1 file reviewed, no comments

Pouyanpi force-pushed the docs/bot-thinking-rails branch 2 times, most recently from 0501e6a to aca68bd on October 29, 2025 09:28

docs: add guide for bot reasoning guardrails

8033392

update update simplify cleanup

Pouyanpi force-pushed the docs/bot-thinking-rails branch from aca68bd to 8033392 on October 29, 2025 09:31

Pouyanpi added 3 commits October 29, 2025 10:38

docs: clarify Colang version for bot reasoning guide

d99abff

Add a note specifying that bot reasoning guardrails are supported only in Colang 1.0. Update example file references for improved clarity.

add bot thinking guardrails to toctree

af66c7d

docs: update self-check config link to develop branch

028c635

Pouyanpi self-assigned this Oct 29, 2025

Pouyanpi added this to the v0.18.0 milestone Oct 29, 2025

Pouyanpi added the documentation label (Improvements or additions to documentation) Oct 29, 2025

Pouyanpi marked this pull request as ready for review October 29, 2025 09:44

greptile-apps bot reviewed Oct 29, 2025

docs/index.md (outdated, resolved)

docs/user-guides/advanced/bot-thinking-guardrails.md (outdated, resolved)

docs/user-guides/advanced/bot-thinking-guardrails.md (outdated, resolved)

fix typo

dee9607

greptile-apps bot reviewed Oct 29, 2025

fix references to use develop branch

9808aa2

greptile-apps bot reviewed Oct 29, 2025

docs/user-guides/advanced/bot-thinking-guardrails.md (outdated, resolved)

Pouyanpi requested a review from miyoungc October 29, 2025 10:00

miyoungc mentioned this pull request Oct 30, 2025

docs: edit #1479 #1484

Merged

4 tasks

docs: edit #1479 (#1484)

f16f545

greptile-apps bot reviewed Oct 31, 2025

miyoungc approved these changes Oct 31, 2025

miyoungc merged commit d380fe1 into develop Oct 31, 2025
10 checks passed

miyoungc deleted the docs/bot-thinking-rails branch October 31, 2025 15:58

tgasser-nv mentioned this pull request Nov 3, 2025

feat(benchmark): Add Procfile to run Guardrails and mock LLMs #1490

Draft

4 tasks

4 participants
