Update compression notes

g-despot · g-despot · commit dbd92bd8a897 · 2025-09-25T12:45:48.000+02:00
diff --git a/_includes/compression-by-default.mdx b/_includes/compression-by-default.mdx
@@ -0,0 +1,5 @@
+:::info Compression by Default
+
+Starting with `v1.33`, Weaviate enables **8-bit [RQ quantization](/weaviate/configuration/compression/rq-compression) by default** when creating new collections to ensure efficient resource utilization and faster performance. This behavior can be changed through the [`DEFAULT_QUANTIZATION`](/deploy/configuration/env-vars#DEFAULT_QUANTIZATION) environment variable. Note that once enabled, quantization can't be disabled for a collection.
+
+:::
diff --git a/_includes/configuration/pq-compression/overview-text.mdx b/_includes/configuration/pq-compression/overview-text.mdx
@@ -1 +1 @@
-Product quantization (PQ) is a form of data compression for vectors. PQ reduces the HNSW index's memory footprint so you can work with larger datasets. For a discussion of how PQ saves memory, see [Product quantization](/weaviate/concepts/vector-quantization#product-quantization).
+[**Product quantization (PQ)**](/weaviate/concepts/vector-quantization#product-quantization) is a form of data compression for vectors. PQ reduces the HNSW index's memory footprint so you can work with larger datasets. For a discussion of how PQ saves memory, see [Product quantization](/weaviate/concepts/vector-quantization#product-quantization).
diff --git a/docs/weaviate/best-practices/index.md b/docs/weaviate/best-practices/index.md
@@ -104,11 +104,9 @@ If you have a large number of vectors, consider using vector quantization to red
 
 For HNSW indexes, we suggest enabling [rotational quantization (RQ)](../configuration/compression/rq-compression.md) as a starting point. It provides significant memory usage benefits and almost no loss in query accuracy. 
 
-:::info Compression by Default
+import CompressionByDefault from '/_includes/compression-by-default.mdx';
 
-Starting with `v1.33`, Weaviate enables **8-bit RQ quantization by default** when creating new collections to ensure efficient resource utilization and faster performance. This behavior can be changed through the [`DEFAULT_QUANTIZATION`](/deploy/configuration/env-vars/index.md#DEFAULT_QUANTIZATION) environment variable.
-
-:::
+<CompressionByDefault/>
 
 :::tip Further resources
 - [How-to: Configure vector quantization](../configuration/compression/index.md)
diff --git a/docs/weaviate/concepts/vector-quantization.md b/docs/weaviate/concepts/vector-quantization.md
@@ -15,6 +15,10 @@ Weaviate currently offers four vector quantization techniques:
 - [Scalar quantization (SQ)](#scalar-quantization)
 - [Rotational quantization (RQ)](#rotational-quantization)
 
+import CompressionByDefault from '/\_includes/compression-by-default.mdx';
+
+<CompressionByDefault/>
+
 ## What is quantization?
 
 In general, quantization techniques reduce the memory footprint by representing numbers with lower precision numbers, like rounding a number to the nearest integer. In neural networks, quantization reduces the values of the weights or activations of the model stored as a 32-bit floating-point number (4 bytes) to a lower precision number, such as an 8-bit integer (1 byte).
diff --git a/docs/weaviate/configuration/compression/bq-compression.md b/docs/weaviate/configuration/compression/bq-compression.md
@@ -13,11 +13,11 @@ import TSCodeBQOptions from '!!raw-loader!/\_includes/code/howto/configure.bq-co
 import GoCode from '!!raw-loader!/\_includes/code/howto/go/docs/configure/compression.bq_test.go';
 import JavaCode from '!!raw-loader!/\_includes/code/howto/java/src/test/java/io/weaviate/docs/bq-compression.java';
 
-:::info Added in `v1.23`
-BQ is available for the [`flat` index](/weaviate/concepts/indexing/vector-index.md#flat-index) type from `v1.23` onwards and for the [`hnsw` index](/weaviate/config-refs/indexing/vector-index.mdx#hnsw-index) type from `v1.24`.
-:::
+import CompressionByDefault from '/\_includes/compression-by-default.mdx';
+
+<CompressionByDefault/>
 
-Binary quantization (BQ) is a vector compression technique that can reduce the size of a vector.
+[**Binary quantization (BQ)**](/weaviate/concepts/vector-quantization#binary-quantization) is a vector compression technique that can reduce the size of a vector.
 
 To use BQ, enable it as shown below and add data to the collection.
 
diff --git a/docs/weaviate/configuration/compression/index.md b/docs/weaviate/configuration/compression/index.md
@@ -18,11 +18,9 @@ To balance resource costs and system performance, consider one of these options:
 
 You can also [disable quantization](uncompressed.md) for a collection.
 
-:::info Compression by Default
+import CompressionByDefault from '/_includes/compression-by-default.mdx';
 
-Starting with `v1.33`, Weaviate enables **8-bit RQ quantization by default** when creating new collections to ensure efficient resource utilization and faster performance. This behavior can be changed through the [`DEFAULT_QUANTIZATION`](/deploy/configuration/env-vars/index.md#DEFAULT_QUANTIZATION) environment variable.
-
-:::
+<CompressionByDefault/>
 
 ## Multi-vector encoding
 
diff --git a/docs/weaviate/configuration/compression/pq-compression.md b/docs/weaviate/configuration/compression/pq-compression.md
@@ -13,9 +13,9 @@ import TSCodeManualPQ from '!!raw-loader!/\_includes/code/howto/configure.pq-com
 import GoCode from '!!raw-loader!/\_includes/code/howto/go/docs/configure/compression.pq_test.go';
 import JavaCode from '!!raw-loader!/\_includes/code/howto/java/src/test/java/io/weaviate/docs/pq-compression.java';
 
-:::note
-Starting in v1.23, AutoPQ simplifies configuring PQ on new collections.
-:::
+import CompressionByDefault from '/\_includes/compression-by-default.mdx';
+
+<CompressionByDefault/>
 
 import PQOverview from '/\_includes/configuration/pq-compression/overview-text.mdx' ;
 
diff --git a/docs/weaviate/configuration/compression/rq-compression.md b/docs/weaviate/configuration/compression/rq-compression.md
@@ -12,32 +12,29 @@ import GoCode from '!!raw-loader!/\_includes/code/howto/go/docs/configure/compre
 import TSCode from '!!raw-loader!/\_includes/code/howto/configure-rq/rq-compression-v3.ts';
 import JavaCode from '!!raw-loader!/\_includes/code/howto/java/src/test/java/io/weaviate/docs/rq-compression.java';
 
-:::info Added in `v1.32`
-
-**8-bit Rotational quantization (RQ)** was added in **`v1.32`**.
-
-:::
-
-:::caution Preview
+import CompressionByDefault from '/\_includes/compression-by-default.mdx';
 
-**1-bit Rotational quantization (RQ)** was added in **`v1.33`** as a **preview**.<br/>
-
-This means that the feature is still under development and may change in future releases, including potential breaking changes.
-**We do not recommend using this feature in production environments at this time.**
-
-:::
+<CompressionByDefault/>
 
 [**Rotational quantization (RQ)**](../../concepts/vector-quantization.md#rotational-quantization) is a fast vector compression technique that offers significant performance benefits. Two RQ variants are available in Weaviate:
 
 - **8-bit RQ**: Up to 4x compression while retaining almost perfect recall (98-99% on most datasets). **Recommended** for most use cases.
 - **1-bit RQ**: Close to 32x compression as dimensionality increases with moderate recall across various datasets.
 
 :::note HNSW only
+
 RQ is currently not supported for the flat index type.
+
 :::
 
 ## 8-bit RQ
 
+:::info Added in `v1.32`
+
+**8-bit Rotational quantization (RQ)** was added in **`v1.32`**.
+
+:::
+
 [8-bit RQ](../../concepts/vector-quantization.md#8-bit-rq) provides up-to 4x compression while maintaining 98-99% recall in internal testing. It is generally recommended for most use cases as the default quantization techniques.
 
 ### Enable compression for new collection
@@ -112,6 +109,15 @@ RQ can also be enabled for an existing collection by updating the collection def
 
 ## 1-bit RQ
 
+:::caution Preview
+
+**1-bit Rotational quantization (RQ)** was added in **`v1.33`** as a **preview**.<br/>
+
+This means that the feature is still under development and may change in future releases, including potential breaking changes.
+**We do not recommend using this feature in production environments at this time.**
+
+:::
+
 [1-bit RQ](../../concepts/vector-quantization.md#1-bit-rq) is an quantization technique that provides close to 32x compression as dimensionality increases. 1-bit RQ serves as a more robust and accurate alternative to [BQ](./bq-compression.md) with only a slight performance trade-off. While more performant than PQ in terms of encoding time and distance calculations, 1-bit RQ typically offers slightly lower recall than well-tuned [PQ](./pq-compression.md).
 
 ### Enable compression for new collection
diff --git a/docs/weaviate/configuration/compression/sq-compression.md b/docs/weaviate/configuration/compression/sq-compression.md
@@ -13,11 +13,11 @@ import TSCodeSQOptions from '!!raw-loader!/\_includes/code/howto/configure-sq/sq
 import GoCode from '!!raw-loader!/\_includes/code/howto/go/docs/configure/compression.sq_test.go';
 import JavaCode from '!!raw-loader!/\_includes/code/howto/java/src/test/java/io/weaviate/docs/sq-compression.java';
 
-:::info Added in v1.26.0
+import CompressionByDefault from '/\_includes/compression-by-default.mdx';
 
-:::
+<CompressionByDefault/>
 
-[Scalar quantization (SQ)](/weaviate/concepts/vector-quantization#scalar-quantization) is a vector compression technique that can reduce the size of a vector.
+[**Scalar quantization (SQ)**](/weaviate/concepts/vector-quantization#scalar-quantization) is a vector compression technique that can reduce the size of a vector.
 
 To use SQ, enable it in the collection definition, then add data to the collection.
 
diff --git a/docs/weaviate/configuration/compression/uncompressed.md b/docs/weaviate/configuration/compression/uncompressed.md
@@ -13,7 +13,11 @@ import GoCode from '!!raw-loader!/\_includes/code/howto/go/docs/configure/compre
 import TSCode from '!!raw-loader!/\_includes/code/howto/configure-rq/rq-compression-v3.ts';
 import JavaCode from '!!raw-loader!/\_includes/code/howto/java/src/test/java/io/weaviate/docs/rq-compression.java';
 
-You can opt-out of using vector quantization to compress your vector data. 
+import CompressionByDefault from '/\_includes/compression-by-default.mdx';
+
+<CompressionByDefault/>
+
+You can opt-out of using vector quantization to compress your vector data.
 
 ## Disable compression for new collection
 

-Original file line number
+Diff line change
@@ @@ -0,0 +1,5 @@ @@
 +:::info Compression by Default
++
 +Starting with `v1.33`, Weaviate enables **8-bit [RQ quantization](/weaviate/configuration/compression/rq-compression) by default** when creating new collections to ensure efficient resource utilization and faster performance. This behavior can be changed through the [`DEFAULT_QUANTIZATION`](/deploy/configuration/env-vars#DEFAULT_QUANTIZATION) environment variable. Note that once enabled, quantization can't be disabled for a collection.
++
 +:::
Original file line number	Diff line number	Diff line change
`@@ -1 +1 @@`
`1`		`-Product quantization (PQ) is a form of data compression for vectors. PQ reduces the HNSW index's memory footprint so you can work with larger datasets. For a discussion of how PQ saves memory, see [Product quantization](/weaviate/concepts/vector-quantization#product-quantization).`
	`1`	`+[Product quantization (PQ)](/weaviate/concepts/vector-quantization#product-quantization) is a form of data compression for vectors. PQ reduces the HNSW index's memory footprint so you can work with larger datasets. For a discussion of how PQ saves memory, see [Product quantization](/weaviate/concepts/vector-quantization#product-quantization).`