Skip to content

Conversation

@hnyls2002
Copy link

@hnyls2002 hnyls2002 commented Sep 29, 2025

This is a hack PR to support DeepseekV32Config as the key deepseek_v32 is missing in transformers. Also see #41196 and sgl-project/sglang#11060 (comment)

@github-actions
Copy link
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

@Rocketknight1
Copy link
Member

cc @ArthurZucker for text models

Copy link
Collaborator

@ArthurZucker ArthurZucker left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey! thanks, #41251 will be here in no time to fix!

@hnyls2002
Copy link
Author

@ArthurZucker Great thanks!!! Can not wait for your PR

@hnyls2002 hnyls2002 closed this Oct 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants