1 file changed: +32 -14 lines changed
@@ -14,27 +14,45 @@ Main Quantization APIs
     :nosignatures:
 
     quantize_
-    autoquant
+    autoquant
 
-Quantization APIs for quantize_
+Inference APIs for quantize\_
 -------------------------------
 
 .. autosummary::
     :toctree: generated/
     :nosignatures:
 
-    int4_weight_only
-    int8_weight_only
-    int8_dynamic_activation_int4_weight
-    int8_dynamic_activation_int8_weight
-    uintx_weight_only
-    gemlite_uintx_weight_only
-    intx_quantization_aware_training
-    from_intx_quantization_aware_training
-    float8_weight_only
-    float8_dynamic_activation_float8_weight
-    float8_static_activation_float8_weight
-    fpx_weight_only
+    Int4WeightOnlyConfig
+    Float8DynamicActivationFloat8WeightConfig
+    Float8WeightOnlyConfig
+    Float8StaticActivationFloat8WeightConfig
+    Int8DynamicActivationInt4WeightConfig
+    GemliteUIntXWeightOnlyConfig
+    Int8WeightOnlyConfig
+    Int8DynamicActivationInt8WeightConfig
+    UIntXWeightOnlyConfig
+    FPXWeightOnlyConfig
+
+.. currentmodule:: torchao.quantization.qat
+
+QAT APIs
+----------------------
+
+.. autosummary::
+    :toctree: generated/
+    :nosignatures:
+
+    IntXQuantizationAwareTrainingConfig
+    FromIntXQuantizationAwareTrainingConfig
+    FakeQuantizeConfig
+    Int4WeightOnlyQATQuantizer
+    Int8DynActInt4WeightQATQuantizer
+    Int4WeightOnlyEmbeddingQATQuantizer
+    ComposableQATQuantizer
+    initialize_fake_quantizers
+
+.. currentmodule:: torchao.quantization
 
 Quantization Primitives
 -----------------------
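
The hunk above removes twelve function-style entries and lists `*Config` classes in their place. The sketch below pairs each removed name with the added name it appears to correspond to. The pairing is an assumption inferred from the names alone, not something the diff states, and `OLD_TO_NEW`, `normalize`, and `check_pairing` are hypothetical helpers introduced only for illustration. It does not import torchao.

```python
# Hypothetical lookup table: removed autosummary entry -> added entry.
# The pairing is inferred from name similarity only (an assumption).
OLD_TO_NEW = {
    "int4_weight_only": "Int4WeightOnlyConfig",
    "int8_weight_only": "Int8WeightOnlyConfig",
    "int8_dynamic_activation_int4_weight": "Int8DynamicActivationInt4WeightConfig",
    "int8_dynamic_activation_int8_weight": "Int8DynamicActivationInt8WeightConfig",
    "uintx_weight_only": "UIntXWeightOnlyConfig",
    "gemlite_uintx_weight_only": "GemliteUIntXWeightOnlyConfig",
    "intx_quantization_aware_training": "IntXQuantizationAwareTrainingConfig",
    "from_intx_quantization_aware_training": "FromIntXQuantizationAwareTrainingConfig",
    "float8_weight_only": "Float8WeightOnlyConfig",
    "float8_dynamic_activation_float8_weight": "Float8DynamicActivationFloat8WeightConfig",
    "float8_static_activation_float8_weight": "Float8StaticActivationFloat8WeightConfig",
    "fpx_weight_only": "FPXWeightOnlyConfig",
}


def normalize(name: str) -> str:
    """Drop a trailing 'Config', remove underscores, and lowercase."""
    if name.endswith("Config"):
        name = name[: -len("Config")]
    return name.replace("_", "").lower()


def check_pairing(mapping: dict) -> list:
    """Return old names whose paired new name differs beyond casing and suffix."""
    return [old for old, new in mapping.items() if normalize(old) != normalize(new)]


# An empty list means every pair differs only in casing and the 'Config' suffix.
mismatches = check_pairing(OLD_TO_NEW)
```

Two of the paired names, `IntXQuantizationAwareTrainingConfig` and `FromIntXQuantizationAwareTrainingConfig`, are listed under the new QAT APIs section (`torchao.quantization.qat`) rather than under the inference section.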