Fix _tied_weights_keys mapping for Transformers v5

#10

This PR fixes the _tied_weights_keys compatibility issue with Transformers v5.0.0+.

Problem

  • _tied_weights_keys was a list, but Transformers v5+ expects a dict-like mapping
  • This caused AttributeError: 'list' object has no attribute 'keys'

Solution

  • Changed _tied_weights_keys from list to dict format
  • Maps lm_head.weight to transformer.wte.weight
    (for models used with Transformers v5.0.0 and later)
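A minimal sketch of the change as it would sit in the modeling file (the class name here is illustrative, not the actual one; only the attribute value is from this PR):

```python
class ExaoneForCausalLM:  # class name is a placeholder for the real modeling class
    # Before (Transformers v4): a plain list of tied parameter names.
    # _tied_weights_keys = ["lm_head.weight"]

    # After (Transformers v5): a mapping from each tied parameter to the
    # parameter whose storage it shares.
    _tied_weights_keys = {"lm_head.weight": "transformer.wte.weight"}
```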

Related

  • Discussion #9

If backward compatibility with v4 is needed, I can add a version check.
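Such a check could look roughly like this (a sketch only; the helper name and the major-version comparison are mine, not part of the PR):

```python
def tied_weights_keys_for(transformers_version: str):
    """Return the v4 list form or the v5 dict form of _tied_weights_keys.

    Comparing only the major version is enough to distinguish the two
    behaviors described above; in the real modeling file this would key
    off transformers.__version__.
    """
    major = int(transformers_version.split(".")[0])
    if major >= 5:
        return {"lm_head.weight": "transformer.wte.weight"}
    return ["lm_head.weight"]
```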

LG AI Research org

Thank you for your contribution! Could you show a simple demonstration of this change?

Here's the demonstration:

The change:

  • Before: _tied_weights_keys = ["lm_head.weight"]
  • After: _tied_weights_keys = {"lm_head.weight": "transformer.wte.weight"}
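The failure mode is easy to reproduce without loading a model at all: v5 iterates the attribute via .keys(), which the old list form does not provide.

```python
before = ["lm_head.weight"]                           # v4-style list
after = {"lm_head.weight": "transformer.wte.weight"}  # v5-style mapping

try:
    before.keys()  # what Transformers v5 effectively does
except AttributeError as exc:
    print(exc)  # 'list' object has no attribute 'keys'

# The dict form supports the lookup v5 expects.
assert list(after.keys()) == ["lm_head.weight"]
assert after["lm_head.weight"] == "transformer.wte.weight"
```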

Test result:
Model loading works with Transformers v5.0.0 βœ…


To clarify: this PR fixes the initial Transformers v5+ loading crash caused by _tied_weights_keys being a list (v5 expects a dict-like mapping, i.e., one with .keys()).

With this change, loading proceeds past the tied-weights stage / weight materialization on Transformers v5+.

The DynamicCache.from_legacy_cache error shown in the log is a separate Transformers v5 API change and is not related to _tied_weights_keys. I can open a follow-up PR for the DynamicCache update if desired.

https://github.com/LG-AI-EXAONE/EXAONE-3.5/pull/7

Update: Also fixed the DynamicCache compatibility issue.

All changes in this PR:

  1. _tied_weights_keys: list β†’ dict
  2. DynamicCache.from_legacy_cache() β†’ DynamicCache()
  3. Removed to_legacy_cache() call
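A hedged sketch of items 2 and 3 above; the helper function is mine, and the cache class is passed in so the snippet stays self-contained without importing transformers:

```python
def prepare_cache(past_key_values, cache_cls):
    """Replace the removed DynamicCache.from_legacy_cache() call.

    In Transformers v5 the cache is constructed directly (item 2), and the
    cache object is returned as-is rather than converted back with
    to_legacy_cache() (item 3). cache_cls stands in for
    transformers.DynamicCache here.
    """
    if past_key_values is None:
        past_key_values = cache_cls()
    return past_key_values
```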

Updated: https://github.com/LG-AI-EXAONE/EXAONE-3.5/pull/7

Tested with Transformers 5.0.0: the model loads and runs inference successfully βœ…

LG AI Research org

It seems the generation output is broken. This might be another issue caused by an incompatibility.

I think it would be better to use the example from our quickstart.

Also, you’ll need to update this PR to apply the changes rather than opening a new PR on GitHub,
since we don’t manage the modeling code or related scripts in our official GitHub repository.

This documentation may be helpful:
https://huggingface.co/docs/hub/repositories-pull-requests-discussions#pull-requests-advanced-usage

LG AI Research org

We've started integrating the modeling code for Transformers v5 using the modular Transformers framework.
The overall model structure will remain the same, even if some class names change.

Given this, no further work is needed for now, so it would be best to merge this PR and open a new one for the integration.
Thank you for your effort and contribution to EXAONE 3.5 πŸ˜€

nuxlear changed pull request status to merged

Thank you for merging! πŸŽ‰

If you need any help with the Transformers v5 integration in the future, feel free to let me know.
