Fix assure match detection and add coverage for parallel batches #185

xfhg · 2025-09-17T14:37:52Z

Summary

add a shared runParallelRipgrep helper that detects actual matches while streaming batch output
report assure execution results through a policy status handler and cover parallel batches with new tests
tidy API status messages and SSH prompts flagged by go vet so go test ./... succeeds

Testing

go test ./...

https://chatgpt.com/codex/tasks/task_e_68cab288b49c832496bfad4568dc4760

Important

The pull request refactors the executeAssure and executeParallelAssure functions for better clarity and performance, introduces a new runParallelRipgrep function to handle parallel execution, and adds tests for the executeAssure function to ensure correct behavior. Additionally, it improves logging and status reporting mechanisms.
^{This description was created by}^{for 5ab833f.}

cursor

This PR is being reviewed by Cursor Bugbot

Details

You are on the Bugbot Free tier. On this plan, Bugbot will review limited PRs each billing cycle.

To receive Bugbot reviews on all of your PRs, visit the Cursor dashboard to activate Pro and start your 14-day free trial.

cursor · 2025-09-17T14:42:35Z

cmd/rg_executor.go

+
+			if !found && bytes.Contains(output, []byte(`"type":"match"`)) {
+				found = true
+			}


Bug: Ripgrep Match Detection Flaw

The runParallelRipgrep function's match detection logic is contradictory. It checks for "type":"match" in ripgrep's output even when ripgrep exits with code 1, which explicitly indicates no matches. This can lead to false positives and is fragile due to relying on a literal string check in the raw JSON output.

llamapreview

Auto Pull Request Review from LlamaPReview

🎯 TL;DR & Recommendation

Recommendation: Approve with suggestions

This PR introduces a shared parallel execution helper and adds tests for better coverage, but includes minor maintainability and architectural suggestions to enhance robustness and reduce duplication.

📄 Documentation Diagram

This diagram documents the new parallel ripgrep execution workflow introduced by the PR.

sequenceDiagram
    participant User
    participant executeAssure
    participant runParallelRipgrep
    participant StatusHandler
    User->>executeAssure: Trigger execution
    executeAssure->>runParallelRipgrep: Call with batch args
    note over runParallelRipgrep: PR #35;185 added this helper
    runParallelRipgrep->>runParallelRipgrep: Process batches in parallel
    runParallelRipgrep-->>executeAssure: Return matches found
    executeAssure->>StatusHandler: Report policy status
    note over StatusHandler: New status reporting mechanism
    StatusHandler-->>User: Final result

🌟 Strengths

Adds comprehensive tests for parallel execution, ensuring reliability.
Centralizes batch sizing logic, improving consistency across the codebase.

Priority	File	Category	Impact Summary	Anchors
P2	cmd/scan.go	Maintainability	Duplication of batch size constant increases maintenance risk	symbol:executeParallelScans, constant:parallelBatchSize
P2	cmd/rg_executor.go	Maintainability	Fragile string matching could break on JSON format changes
P2	cmd/assure.go	Architecture	Asymmetric function interfaces suggest divergence in responsibilities	symbol:executeSingleAssure, constant:parallelBatchSize
P2	cmd/assure.go	Architecture	Global handler pattern adds indirect state mutation	symbol:reportPolicyStatus
P2	cmd/api.go	Maintainability	Hard-coded array indices risk panics on format changes

🔍 Notable Themes

Maintainability improvements needed to reduce code duplication and fragility in string parsing and array access.
Architectural consistency suggested for function interfaces to align responsibilities and state management.

⚠️ **Unanchored Suggestions (Manual Review Recommended)**

The following suggestions could not be precisely anchored to a specific line in the diff. This can happen if the code is outside the changed lines, has been significantly refactored, or if the suggestion is a general observation. Please review them carefully in the context of the full file.

📁 File: `cmd/scan.go`

The new parallelBatchSize constant (value 25) in rg_executor.go centralizes batch sizing logic, but the related context shows executeParallelScans still uses a hard-coded batchSize of 25. This creates duplication and maintenance risk—if batch sizing logic changes, it must be updated in two places. The new parallel execution helper (runParallelRipgrep) should be adopted here for consistency.

Suggestion:

func executeParallelScans(rgPath string, baseArgs []string, filesToScan []string, writer *bufio.Writer) error {
	var wg sync.WaitGroup
	errChan := make(chan error, (len(filesToScan)+parallelBatchSize-1)/parallelBatchSize)
	// Use runParallelRipgrep or refactor to share batch logic

Related Code:

func executeParallelScans(rgPath string, baseArgs []string, filesToScan []string, writer *bufio.Writer) error {
	const batchSize = 25
	var wg sync.WaitGroup
	errChan := make(chan error, len(filesToScan)/batchSize+1)

💡 LlamaPReview Community
Have feedback on this AI Code review tool? Join our GitHub Discussions to share your thoughts and help shape the future of LlamaPReview.

llamapreview · 2025-09-17T15:57:42Z

cmd/assure.go

 	}
-	if len(filesToAssure) > 25 {
+	if len(filesToAssure) > parallelBatchSize {
 		matchesFound, err = executeParallelAssure(rgPath, codePatternAssureJSON, filesToAssure, writer)
 	} else {
 		matchesFound, err = executeSingleAssure(rgPath, codePatternAssureJSON, filesToAssure, targetDir, policy, writer)


P2 | Confidence: High

The parallel execution threshold now uses the centralized parallelBatchSize, improving consistency. However, executeSingleAssure still receives a targetDir and policy parameter while executeParallelAssure does not—this architectural asymmetry suggests the helper functions have diverged responsibilities. Consider aligning their interfaces or merging logic.

llamapreview · 2025-09-17T15:57:42Z

cmd/assure.go


 	}

+	reportPolicyStatus(policy, matchesFound)


P2 | Confidence: High

The new status reporting mechanism introduces a global handler pattern. While tests validate its behavior, the related context doesn't show other callers of executeAssure. This pattern is extensible but adds indirect state mutation—ensure future maintainers understand this side effect.

llamapreview · 2025-09-17T15:57:42Z

cmd/api.go

 			if sarifReport.Runs[0].Invocations[0].Properties.ReportCompliant {
-
-				resultMsg = fmt.Sprintf("🟢 %s : %s", "Compliant")
-
+				resultMsg = "🟢 Compliant"
 			} else {
-				resultMsg = fmt.Sprintf("🔴 %s : %s", "Non Compliant")
+				resultMsg = "🔴 Non Compliant"
 			}


P2 | Confidence: High

Hard-coded array indices (e.g., [0]) assume a specific SARIF report structure. This is brittle—if the report format changes, this code may panic or misbehave. Add bounds checking or use helper methods to access nested properties safely.

Suggested change

if sarifReport.Runs[0].Invocations[0].Properties.ReportCompliant {

resultMsg = fmt.Sprintf("🟢 %s : %s", "Compliant")

resultMsg = "🟢 Compliant"

} else {

resultMsg = fmt.Sprintf("🔴 %s : %s", "Non Compliant")

resultMsg = "🔴 Non Compliant"

}

if len(sarifReport.Runs) > 0 && len(sarifReport.Runs[0].Invocations) > 0 {

if props := sarifReport.Runs[0].Invocations[0].Properties; props != nil {

if props.ReportCompliant {

resultMsg = "🟢 Compliant"

} else {

resultMsg = "🔴 Non Compliant"

}

}

}

llamapreview · 2025-09-17T15:57:42Z

cmd/rg_executor.go

+			if !found && bytes.Contains(output, []byte(`"type":"match"`)) {
+				found = true
+			}


P2 | Confidence: Medium

Match detection relies on fragile string matching against JSON output. While functional, this could break if ripgrep's JSON format changes. A more robust approach would parse the JSON to check for match objects. The current implementation may also false-positive on content containing the literal string "type":"match".

Code Suggestion:

// Consider proper JSON parsing for robustness var outputs []map[string]interface{} if err := json.Unmarshal(output, &outputs); err == nil { for _, entry := range outputs { if t, ok := entry["type"].(string); ok && t == "match" { found = true break } } }

Fix parallel assure match detection and add tests

5ab833f

xfhg added the codex label Sep 17, 2025 — with ChatGPT Codex Connector

cursor bot reviewed Sep 17, 2025

View reviewed changes

xfhg assigned xfhg and unassigned xfhg Sep 17, 2025

llamapreview bot reviewed Sep 17, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Fix assure match detection and add coverage for parallel batches #185

Fix assure match detection and add coverage for parallel batches #185

Uh oh!

xfhg commented Sep 17, 2025 •

edited by ghost

Loading

Uh oh!

cursor bot left a comment

Uh oh!

cursor bot Sep 17, 2025

Uh oh!

llamapreview bot left a comment

Uh oh!

llamapreview bot Sep 17, 2025

Uh oh!

llamapreview bot Sep 17, 2025

Uh oh!

llamapreview bot Sep 17, 2025

Uh oh!

llamapreview bot Sep 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Fix assure match detection and add coverage for parallel batches #185

Are you sure you want to change the base?

Fix assure match detection and add coverage for parallel batches #185

Uh oh!

Conversation

xfhg commented Sep 17, 2025 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Testing

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

This PR is being reviewed by Cursor Bugbot

Uh oh!

cursor bot Sep 17, 2025

Choose a reason for hiding this comment

Bug: Ripgrep Match Detection Flaw

Uh oh!

llamapreview bot left a comment

Choose a reason for hiding this comment

Auto Pull Request Review from LlamaPReview

🎯 TL;DR & Recommendation

📄 Documentation Diagram

🌟 Strengths

🔍 Notable Themes

📁 File: cmd/scan.go

Uh oh!

llamapreview bot Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

llamapreview bot Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

llamapreview bot Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

llamapreview bot Sep 17, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

xfhg commented Sep 17, 2025 •

edited by ghost

Loading

📁 File: `cmd/scan.go`