
Conversation

@JashG
Contributor

@JashG JashG commented Oct 20, 2025

Description

This PR logs the model name and base URL before an LLM is invoked.

Some context: in the NeMo Guardrails microservice, we've run into various errors that trace back to an incorrect base URL being used for a model. These errors are difficult to root-cause, since neither the URL nor the model name is logged at any point in the call chain.

Related Issue(s)

Checklist

  • I've read the CONTRIBUTING guidelines.
  • I've updated the documentation if applicable.
  • I've added tests if applicable.
  • @mentions of the person or team responsible for reviewing proposed changes.

@codecov-commenter

codecov-commenter commented Oct 20, 2025

Codecov Report

❌ Patch coverage is 90.47619% with 4 lines in your changes missing coverage. Please review.

Files with missing lines | Patch % | Lines
nemoguardrails/logging/callbacks.py | 69.23% | 4 Missing ⚠️


@JashG JashG marked this pull request as ready for review October 20, 2025 17:11
@JashG
Contributor Author

JashG commented Oct 20, 2025

Looks like the failing test_stats_logging_interval_timing test is flaky; there's a fix in #1463

@Pouyanpi
Collaborator

Thanks for the PR @JashG! I agree, this feature looks really useful for debugging!

I have one suggestion: it might be better to move this logging logic to the LoggingCallbackHandler in nemoguardrails/logging/callbacks.py instead of utils.py.
Here's why:

  1. All LLM-related logs already go through the callbacks (on_llm_start, on_chat_model_start, etc.), so it keeps things consistent and easier to maintain.
  2. The current approach logs twice (once for string prompts and once for message lists), while the callbacks handle each LLM call exactly once.
  3. More context is available: the serialized parameter in the callbacks includes metadata like the model configuration and base URL.

Implementation details:
You can get the base URL directly from the serialized parameter:

  • For ChatOpenAI: serialized['kwargs']['openai_api_base']
  • For ChatNVIDIA: extract it from serialized['repr'] (the exact format is provider-dependent).

That way, the log would appear right before the existing “Invocation Params” entry in on_llm_start() and on_chat_model_start(), keeping all the LLM call logs together.
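
For example (hypothetical values), the kwargs path is a direct dictionary lookup:

    # Illustrative serialized payload for a ChatOpenAI-style model:
    serialized = {"kwargs": {"model_name": "gpt-4", "openai_api_base": "https://api.openai.com/v1"}}
    base_url = serialized["kwargs"]["openai_api_base"]  # "https://api.openai.com/v1"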

What do you think?

@Pouyanpi Pouyanpi self-requested a review October 21, 2025 09:17
@Pouyanpi
Collaborator

Looks like the failing test_stats_logging_interval_timing test is flaky; there's a fix in #1463

Yes, we merged the fix to develop. You might want to do a rebase 👍🏻

@JashG JashG force-pushed the jgulabrai/add-llm-invocation-logging branch from 54d2f96 to 617ac25 on November 4, 2025 17:19
Contributor

@greptile-apps greptile-apps bot left a comment


Greptile Overview

Greptile Summary

Adds logging of model name and base URL before LLM invocations to aid in debugging incorrect URL configurations.

Key Changes:

  • Created new utility function extract_model_name_and_base_url() in nemoguardrails/logging/utils.py that extracts model information from serialized LLM parameters via kwargs (for ChatOpenAI) or repr string parsing (for other providers)
  • Modified LoggingCallbackHandler to log model name and URL in both on_llm_start and on_chat_model_start callbacks
  • Added comprehensive test coverage (9 test cases) for various provider formats and edge cases
  • Minor type annotation improvement to _infer_model_name() in actions/llm/utils.py

Implementation Details:
The extraction logic handles multiple URL attribute patterns (api_base, api_host, azure_endpoint, base_url, endpoint, endpoint_url, openai_api_base) to support various LLM providers. The code prioritizes kwargs values over repr string values to ensure accurate extraction.
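
A rough sketch of that priority order (illustrative; not the PR's exact code):

    import re
    from typing import Any, Dict, Optional, Tuple

    # Attribute names tried for the base URL, per the list above.
    URL_ATTRS = (
        "api_base", "api_host", "azure_endpoint", "base_url",
        "endpoint", "endpoint_url", "openai_api_base",
    )

    def extract_model_name_and_base_url(
        serialized: Dict[str, Any],
    ) -> Tuple[Optional[str], Optional[str]]:
        kwargs = serialized.get("kwargs") or {}
        repr_str = serialized.get("repr") or ""
        # Prefer structured kwargs (e.g. ChatOpenAI) over repr parsing.
        model = kwargs.get("model_name")
        url = next((kwargs[a] for a in URL_ATTRS if kwargs.get(a)), None)
        if model is None:
            m = re.search(r"model(?:_name)?=['\"]([^'\"]+)['\"]", repr_str)
            model = m.group(1) if m else None
        if url is None:
            for attr in URL_ATTRS:
                m = re.search(rf"{attr}=['\"]([^'\"]+)['\"]", repr_str)
                if m:
                    url = m.group(1)
                    break
        return model, url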

Confidence Score: 4/5

  • This PR is safe to merge with minor code quality considerations
  • The implementation is solid with comprehensive test coverage and addresses a legitimate debugging need. Score reduced by 1 point due to code duplication in the callback handlers (the same logging logic appears twice), but this doesn't affect functionality or introduce bugs.
  • Pay attention to nemoguardrails/logging/callbacks.py - contains duplicated logging code that could be refactored

Important Files Changed

File Analysis

Filename | Score | Overview
nemoguardrails/logging/utils.py | 5/5 | New utility file for extracting model name and base URL from serialized LLM parameters with comprehensive test coverage
nemoguardrails/logging/callbacks.py | 4/5 | Added logging of model name and base URL before LLM invocation in both on_llm_start and on_chat_model_start callbacks
tests/test_callbacks.py | 5/5 | Added comprehensive unit tests for the extract_model_name_and_base_url utility function covering various providers and edge cases
nemoguardrails/actions/llm/utils.py | 5/5 | Minor type annotation update to _infer_model_name to accept Runnable in addition to BaseLanguageModel, plus unused logger addition

Sequence Diagram

sequenceDiagram
    participant Client as LLM Client
    participant CB as LoggingCallbackHandler
    participant Utils as extract_model_name_and_base_url
    participant Logger as log.info
    
    Note over Client,Logger: String Prompt Flow
    Client->>CB: on_llm_start(serialized, prompts)
    CB->>Utils: extract_model_name_and_base_url(serialized)
    Utils->>Utils: Check kwargs for model_name, openai_api_base
    Utils->>Utils: Parse repr string for model/url patterns
    Utils-->>CB: return (model_name, base_url)
    alt base_url exists
        CB->>Logger: log.info("Invoking LLM: model={model}, url={url}")
    else only model_name exists
        CB->>Logger: log.info("Invoking LLM: model={model}")
    else neither exists
        CB->>Logger: log.info("Invoking LLM")
    end
    CB->>Logger: log.info("Invocation Params")
    CB->>Logger: log.info("Prompt")
    
    Note over Client,Logger: Chat Messages Flow
    Client->>CB: on_chat_model_start(serialized, messages)
    CB->>Utils: extract_model_name_and_base_url(serialized)
    Utils->>Utils: Check kwargs for model_name, openai_api_base
    Utils->>Utils: Parse repr string for model/url patterns
    Utils-->>CB: return (model_name, base_url)
    alt base_url exists
        CB->>Logger: log.info("Invoking LLM: model={model}, url={url}")
    else only model_name exists
        CB->>Logger: log.info("Invoking LLM: model={model}")
    else neither exists
        CB->>Logger: log.info("Invoking LLM")
    end
    CB->>Logger: log.info("Invocation Params")
    CB->>Logger: log.info("Prompt Messages")

4 files reviewed, 1 comment


Comment on lines +68 to +75
# Log model name and base URL
model_name, base_url = extract_model_name_and_base_url(serialized)
if base_url:
    log.info(f"Invoking LLM: model={model_name}, url={base_url}")
elif model_name:
    log.info(f"Invoking LLM: model={model_name}")
else:
    log.info("Invoking LLM")
Contributor


style: The exact same logging logic appears in both on_llm_start (lines 68-75) and on_chat_model_start (lines 118-125). Extract to a helper method to reduce duplication and improve maintainability.
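
One possible shape for that helper (a sketch against the names in this diff, not the PR's actual code):

    from typing import Any, Dict

    # Sketch: a private method on LoggingCallbackHandler that both
    # on_llm_start and on_chat_model_start would call.
    def _log_llm_invocation(self, serialized: Dict[str, Any]) -> None:
        model_name, base_url = extract_model_name_and_base_url(serialized)
        if base_url:
            log.info(f"Invoking LLM: model={model_name}, url={base_url}")
        elif model_name:
            log.info(f"Invoking LLM: model={model_name}")
        else:
            log.info("Invoking LLM")

With real values filled in, the resulting log line would read something like Invoking LLM: model=gpt-4, url=https://api.openai.com/v1 (values illustrative).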

@JashG
Contributor Author

JashG commented Nov 4, 2025

@Pouyanpi thanks for that pointer! I agree, centralizing the logic in LoggingCallbackHandler makes more sense. The one minor drawback: repr is the string representation of the provider class, rather than an instance of the class, so my logic parses that string for the model and base URL properties. Let me know if you have any concerns with this approach.
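
For illustration, serialized['repr'] might look like the following string (hypothetical values), so attribute access isn't possible and the values have to be parsed out:

    import re

    # Hypothetical repr string; the exact format varies by provider.
    repr_str = "ChatNVIDIA(model='meta/llama-3.1-8b-instruct', base_url='http://localhost:8000/v1')"
    m = re.search(r"base_url='([^']+)'", repr_str)
    print(m.group(1) if m else None)  # http://localhost:8000/v1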

Contributor

@greptile-apps greptile-apps bot left a comment


Greptile Overview

Greptile Summary

This PR adds logging of model name and base URL before LLM invocations to help debug issues with incorrect base URLs being used for models. The implementation extracts these values from serialized LLM configuration using both structured kwargs and regex parsing of repr strings.

Key Changes:

  • Created extract_model_name_and_base_url() utility function in new nemoguardrails/logging/utils.py file
  • Integrated logging calls in both on_llm_start() and on_chat_model_start() callback handlers
  • Added comprehensive test coverage including edge cases for various LLM providers
  • Updated type hint for _infer_model_name() to accept Union[BaseLanguageModel, Runnable]
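
The last item amounts to a signature change along these lines (a sketch; import paths assumed from langchain_core):

    from typing import Union

    from langchain_core.language_models import BaseLanguageModel
    from langchain_core.runnables import Runnable

    def _infer_model_name(llm: Union[BaseLanguageModel, Runnable]) -> str:
        """Infer a human-readable model name from an LLM object (body elided)."""
        ...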

Implementation Approach:

  • Primary extraction from kwargs (for ChatOpenAI and similar providers)
  • Fallback to regex parsing of repr strings (for other providers like ChatNIM)
  • Supports multiple URL attribute names: api_base, api_host, azure_endpoint, base_url, endpoint, endpoint_url, openai_api_base
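
A test along those lines might look like this (a sketch; the PR's actual tests in tests/test_callbacks.py may differ):

    from nemoguardrails.logging.utils import extract_model_name_and_base_url

    def test_extracts_model_and_url_from_kwargs():
        # Illustrative serialized payload with structured kwargs.
        serialized = {
            "kwargs": {
                "model_name": "gpt-4",
                "openai_api_base": "https://api.openai.com/v1",
            }
        }
        model, url = extract_model_name_and_base_url(serialized)
        assert model == "gpt-4"
        assert url == "https://api.openai.com/v1"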

Confidence Score: 4/5

  • This PR is safe to merge with minor improvements recommended
  • The implementation is solid with comprehensive test coverage and addresses a real debugging need. Score reduced by 1 point due to: (1) a minor typo in comments, and (2) duplicate logging logic in two callback methods that could be refactored into a helper method for better maintainability. The core functionality is sound and well-tested.
  • nemoguardrails/logging/callbacks.py has duplicate logging code that could be refactored, but this is a minor maintainability concern

Important Files Changed

File Analysis

Filename | Score | Overview
nemoguardrails/logging/utils.py | 5/5 | New utility function added to extract model name and base URL from serialized LLM config using kwargs and regex parsing
nemoguardrails/logging/callbacks.py | 4/5 | Added logging of model name and base URL in both on_llm_start and on_chat_model_start handlers with duplicate logic

Sequence Diagram

sequenceDiagram
    participant App as Application
    participant CB as LoggingCallbackHandler
    participant Util as extract_model_name_and_base_url
    participant LLM as LLM Provider
    participant Log as Logger

    App->>CB: on_llm_start(serialized, prompts, ...)
    CB->>Util: extract_model_name_and_base_url(serialized)
    
    alt Has kwargs
        Util->>Util: Extract from kwargs["model_name"]
        Util->>Util: Extract from kwargs["openai_api_base"]
    end
    
    alt No kwargs or missing values
        Util->>Util: Parse serialized["repr"] string
        Util->>Util: Regex search for model/model_name
        Util->>Util: Regex search for url attributes
    end
    
    Util-->>CB: return (model_name, base_url)
    
    alt base_url exists
        CB->>Log: log.info("Invoking LLM: model={model_name}, url={base_url}")
    else only model_name exists
        CB->>Log: log.info("Invoking LLM: model={model_name}")
    else neither exists
        CB->>Log: log.info("Invoking LLM")
    end
    
    CB->>Log: log.info("Invocation Params :: ...")
    CB->>Log: log.info("Prompt :: ...")
    CB->>LLM: Continue with LLM invocation

1 file reviewed, 1 comment


if match:
    model_name = match.group(1)

# Extract base URL. The propety name may vary between providers, so try common attribute patterns.
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

syntax: Typo: "propety" should be "property"

Suggested change
# Extract base URL. The propety name may vary between providers, so try common attribute patterns.
# Extract base URL. The property name may vary between providers, so try common attribute patterns.
