@chhzh123 (Collaborator)
This PR fixes #241 by adding a lazy_init function that defers weight initialization, so no random weights are materialized when the model is first created.
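The deferred-initialization idea can be sketched with `jax.eval_shape`. The function and layer sizes below are illustrative, not the PR's actual code: the point is that tracing the init function abstractly records parameter shapes and dtypes without allocating any device memory.

```python
import jax
import jax.numpy as jnp

# Hypothetical eager init: allocates random weights on device immediately.
def init_params(key):
    k1, _ = jax.random.split(key)
    return {
        "w": jax.random.normal(k1, (4096, 4096)),
        "b": jnp.zeros((4096,)),
    }

def lazy_init(key):
    # jax.eval_shape traces init_params abstractly: no device buffers are
    # allocated and no RNG work runs; only shape/dtype metadata
    # (jax.ShapeDtypeStruct leaves) is returned.
    return jax.eval_shape(init_params, key)

abstract = lazy_init(jax.random.PRNGKey(0))
print(abstract["w"].shape)  # (4096, 4096)
```

Real weights loaded from a checkpoint can later be placed into this abstract structure, so the large random-init allocation never happens.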


@chhzh123 requested a review from @Prayer3th on October 15, 2025 01:59
@chhzh123 (Collaborator, Author)

cc @Kipsora

@chhzh123 force-pushed the deferred_weight_loading branch from 94a683f to 805cc56 on October 16, 2025 04:28
@chhzh123 (Collaborator, Author)

Memory usage and latency comparison with and without lazy init:

[Screenshot: memory usage and latency comparison, 2025-10-16]

@Prayer3th (Collaborator) commented Oct 20, 2025

We have now switched to using nnx.eval_shape for model initialization. This approach avoids materializing placeholder tensors during init entirely, and the changes do not interfere with the forward-pass logic. This PR will therefore be closed. For details on the nnx.eval_shape changes, see #248.

@Prayer3th Prayer3th closed this Oct 20, 2025


Linked issue: [Feature] Deferred Weight Initialization