Commit 4c06318

Update Quantization docs to show newer AOConfigs (#2317)
1 parent 70f2b85 commit 4c06318

1 file changed, +32 -14 lines changed

docs/source/api_ref_quantization.rst

Lines changed: 32 additions & 14 deletions
@@ -14,27 +14,45 @@ Main Quantization APIs
     :nosignatures:

     quantize_
-    autoquant
+    autoquant

-Quantization APIs for quantize_
+Inference APIs for quantize\_
 -------------------------------

 .. autosummary::
     :toctree: generated/
     :nosignatures:

-    int4_weight_only
-    int8_weight_only
-    int8_dynamic_activation_int4_weight
-    int8_dynamic_activation_int8_weight
-    uintx_weight_only
-    gemlite_uintx_weight_only
-    intx_quantization_aware_training
-    from_intx_quantization_aware_training
-    float8_weight_only
-    float8_dynamic_activation_float8_weight
-    float8_static_activation_float8_weight
-    fpx_weight_only
+    Int4WeightOnlyConfig
+    Float8DynamicActivationFloat8WeightConfig
+    Float8WeightOnlyConfig
+    Float8StaticActivationFloat8WeightConfig
+    Int8DynamicActivationInt4WeightConfig
+    GemliteUIntXWeightOnlyConfig
+    Int8WeightOnlyConfig
+    Int8DynamicActivationInt8WeightConfig
+    UIntXWeightOnlyConfig
+    FPXWeightOnlyConfig
+
+.. currentmodule:: torchao.quantization.qat
+
+QAT APIs
+----------------------
+
+.. autosummary::
+    :toctree: generated/
+    :nosignatures:
+
+    IntXQuantizationAwareTrainingConfig
+    FromIntXQuantizationAwareTrainingConfig
+    FakeQuantizeConfig
+    Int4WeightOnlyQATQuantizer
+    Int8DynActInt4WeightQATQuantizer
+    Int4WeightOnlyEmbeddingQATQuantizer
+    ComposableQATQuantizer
+    initialize_fake_quantizers
+
+.. currentmodule:: torchao.quantization

 Quantization Primitives
 -----------------------
