[BUG] We may need to remove max_memory arg #115

Qubitium · 2024-06-29T09:09:24Z

For very large models, multiple GPU may be needed for quantization but max_memory arg appears to be broken. Everything should be handled by accelerate and there should be no need for this arg. Investigate.

delete max_memory=max_memory can run.

Originally posted by @Xu-Chen in #48 (comment)

The text was updated successfully, but these errors were encountered:

Qubitium · 2024-07-01T13:14:11Z

We will remove this. Advanced users should just use and pass device_map fo accelerate. We should not be a arg modifying proxy for acclerate. Passing accelerate config args to accelerate is cleaner, more powerful, and we dont have to maintain compat.

CL-ModelCloud self-assigned this Jul 1, 2024

CL-ModelCloud mentioned this issue Jul 2, 2024

[REFRACTOR] remove max_memory arg #144

Merged

Qubitium closed this as completed Jul 2, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG] We may need to remove max_memory arg #115

[BUG] We may need to remove max_memory arg #115

Qubitium commented Jun 29, 2024

Qubitium commented Jul 1, 2024 •

edited

Loading

[BUG] We may need to remove max_memory arg #115

[BUG] We may need to remove max_memory arg #115

Comments

Qubitium commented Jun 29, 2024

Qubitium commented Jul 1, 2024 • edited Loading

Qubitium commented Jul 1, 2024 •

edited

Loading