
@Frankstein73
Collaborator

  • Updated the batch size validation logic in `ActivationBatchler` to account for the data dimension size, ensuring consistent shapes across tensors and lists.
  • Improved assertion error messages to provide clearer feedback on batch size discrepancies.
  • Updated the total count in the `tqdm` progress bar to account for the device mesh size, ensuring accurate progress tracking during activation chunk processing.
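
For readers without the diff at hand, here is a rough sketch of what a batch size check that accounts for the data dimension might look like. It is not the code from this PR: `validate_batch_size` and `data_dim_size` are hypothetical names, and the sketch assumes that array-like entries report a global leading dimension through `.shape` while plain lists hold only the local shard. The actual convention in `ActivationBatchler` may differ.

```python
from types import SimpleNamespace
from typing import Any, Mapping


def validate_batch_size(batch: Mapping[str, Any], data_dim_size: int = 1) -> int:
    """Return the batch size shared by every entry in `batch`.

    Assumption (not confirmed by the PR): entries with a `.shape` attribute
    report the global leading dimension, while plain lists hold only the
    local shard, so a list's length is scaled by `data_dim_size`.
    """
    sizes: dict[str, int] = {}
    for key, value in batch.items():
        if hasattr(value, "shape"):
            sizes[key] = int(value.shape[0])
        elif isinstance(value, list):
            sizes[key] = len(value) * data_dim_size
        else:
            raise TypeError(f"Unsupported entry type for {key!r}: {type(value).__name__}")

    if not sizes:
        raise ValueError("Cannot validate an empty batch")

    # Compare every entry against the first one and report all mismatches.
    expected = next(iter(sizes.values()))
    mismatched = {k: v for k, v in sizes.items() if v != expected}
    assert not mismatched, (
        f"Inconsistent batch sizes: expected {expected} "
        f"(data_dim_size={data_dim_size}), got {mismatched}"
    )
    return expected


# Usage: a stand-in for a tensor with 8 rows, plus 4 local metadata records.
example = {
    "activations": SimpleNamespace(shape=(8, 16)),
    "meta": [{"id": i} for i in range(4)],
}
global_batch = validate_batch_size(example, data_dim_size=2)
```

With `data_dim_size=1` the same batch would fail the assertion, and the message names the offending key and its computed size, which is the kind of clearer feedback the second bullet describes.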

@Frankstein73 requested a review from @dest1n1s on July 7, 2025, 11:01
@dest1n1s merged commit 13576f8 into main on Jul 7, 2025
2 checks passed
dest1n1s added a commit that referenced this pull request Nov 8, 2025
Fix batch size validation for data parallelism and adjust total count for activation processing in CachedActivationLoader
