CINECA IRIS Institutional Research Information System

STanH : Parametric Quantization for Variable Rate Learned Image Compression

Alberto Presta;Enzo Tartaglione;Attilio Fiandrotti;Marco Grangetto

2025-01-01

Abstract

In end-to-end learned image compression, the encoder and decoder are jointly trained to minimize an R + λD cost function, where λ controls the trade-off between the rate R of the quantized latent representation and the image quality, measured by the distortion D. Unfortunately, a distinct encoder-decoder pair with millions of parameters must be trained for each λ, so the user device must store multiple encoders and decoders and switch between them for every target rate. This paper proposes a differentiable quantizer, called STanH, designed around a parametric sum of hyperbolic tangents that relaxes the step-wise quantization function. STanH is implemented as a differentiable activation layer with learnable quantization parameters; it can be plugged into a pre-trained fixed-rate model and refined to achieve different target bitrates. Experimental results show that the method enables variable-rate coding with efficiency comparable to the state of the art, while significantly easing deployment and reducing training time and storage costs.
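As a rough illustration of the idea in the abstract, the sketch below builds a soft quantizer from a weighted sum of shifted hyperbolic tangents. The exact parameterization used in the paper is not given on this page, so the form chosen here, and the names `stanh`, `weights`, `thresholds`, and `beta`, are assumptions made for illustration only. In this sketch, `thresholds` sets where each step sits, `weights` sets each step's height, and `beta` sets how sharp the steps are.

```python
import math


def stanh(x, weights, thresholds, beta):
    """Hypothetical soft quantizer: a weighted sum of shifted tanh functions.

    Each term contributes one smooth step of height w centred at threshold b.
    Larger beta makes every step steeper, so the curve approaches a staircase.
    """
    return sum(
        0.5 * w * math.tanh(beta * (x - b))
        for w, b in zip(weights, thresholds)
    )


# Illustrative values (not from the paper): unit steps at half-integers,
# which in the sharp limit reproduce rounding to the levels -2, ..., 2.
thresholds = [-1.5, -0.5, 0.5, 1.5]
weights = [1.0, 1.0, 1.0, 1.0]

if __name__ == "__main__":
    samples = (-1.2, 0.2, 0.7, 1.8)
    for beta in (1.0, 10.0, 1000.0):
        values = [round(stanh(x, weights, thresholds, beta), 3) for x in samples]
        print(f"beta={beta}: {values}")
```

Because every term is differentiable in `x`, `weights`, and `thresholds`, such a layer can in principle be trained with gradient descent. Under the same assumptions, refining only these few parameters for a new target rate would be far cheaper than retraining a full encoder-decoder pair.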

Year	2025
Journal title	IEEE TRANSACTIONS ON IMAGE PROCESSING
Volume	34
First page	639
Last page	651
DOI	https://dx.doi.org/10.1109/TIP.2025.3527883
Product URL (open-access archives, full text on publisher site, etc.)	https://ieeexplore.ieee.org/document/10843163
Keywords	differentiable quantization, learned image compression, quantizer annealing, variable rate image coding
All authors	Alberto Presta; Enzo Tartaglione; Attilio Fiandrotti; Marco Grangetto
Appears in types	03A - Journal article

Files in this item:

File	Access	File type	Size	Format
THIRD_REBUTTAL_StanH.pdf	Open access	Postprint (author's final version)	386.64 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2318/2047970
