

TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with State Space Model

Jiaxu Liu, Li Li, Hubert P. H. Shum and Toby P. Breckon
Proceedings of the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2026

Abstract

Diffusion models currently demonstrate impressive performance across various generative tasks. Recent work on image diffusion highlights the strong capabilities of Mamba (state space models), owing to its efficient handling of long-range dependencies and sequential data modeling. Unfortunately, joint consideration of state space models with 3D point cloud generation remains limited. To harness the powerful capabilities of the Mamba model for 3D point cloud generation, we propose a novel diffusion framework containing a dual latent Mamba block (DM-Block) and a time-variant frequency encoder (TF-Encoder). The DM-Block applies a space-filling curve to reorder points into sequences suitable for Mamba state-space modeling, while operating in a latent space to mitigate the computational overhead that arises from direct 3D data processing. Meanwhile, the TF-Encoder takes advantage of the ability of the diffusion model to refine fine details in later recovery stages by prioritizing key points within the U-Net architecture. This frequency-based mechanism ensures enhanced detail quality in the final stages of generation. Experimental results on the ShapeNet-v2 dataset demonstrate that our method achieves state-of-the-art performance on certain metrics for specific categories (0.14% on 1-NNA-Abs50 EMD and 57.90% on COV EMD) while reducing computational parameters and inference time by up to 10X and 9X, respectively. The source code is included in the supplementary material and will be released.
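
As an illustration of the reordering step mentioned in the abstract, the following is a minimal sketch of how a space-filling curve can turn an unordered 3D point set into a 1D sequence. The abstract does not state which curve the DM-Block uses, so the Z-order (Morton) curve here is an assumption chosen for brevity, and the names `morton_code` and `z_order_permutation` as well as the `bits` grid resolution are hypothetical. It is not the authors' implementation.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def _spread_bits(v: int, bits: int) -> int:
    """Place bit i of v at position 3*i, leaving two zero bits in between."""
    out = 0
    for i in range(bits):
        out |= ((v >> i) & 1) << (3 * i)
    return out


def morton_code(ix: int, iy: int, iz: int, bits: int = 10) -> int:
    """Interleave the bits of three grid coordinates into one Z-order key."""
    return (
        _spread_bits(ix, bits)
        | (_spread_bits(iy, bits) << 1)
        | (_spread_bits(iz, bits) << 2)
    )


def z_order_permutation(points: Sequence[Point], bits: int = 10) -> List[int]:
    """Return the indices that sort `points` along a Z-order curve.

    Assumption: points are quantized onto a (2**bits)^3 grid spanning
    their bounding cube; this is an illustrative choice, not the paper's.
    """
    if not points:
        return []
    lo = [min(p[d] for p in points) for d in range(3)]
    hi = [max(p[d] for p in points) for d in range(3)]
    # A single shared scale preserves the aspect ratio of the shape.
    extent = max(h - l for l, h in zip(lo, hi)) or 1.0
    cells = (1 << bits) - 1

    def code(p: Point) -> int:
        q = [min(cells, int((p[d] - lo[d]) / extent * cells)) for d in range(3)]
        return morton_code(q[0], q[1], q[2], bits)

    return sorted(range(len(points)), key=lambda i: code(points[i]))


pts = [(1.0, 1.0, 1.0), (0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
order = z_order_permutation(pts)      # [1, 2, 3, 0]
sequence = [pts[i] for i in order]    # points in curve order
```

Points that are close along the curve tend to be close in 3D, which is what makes such an ordering a reasonable input for a sequence model. A Hilbert curve would preserve locality better at the cost of a more involved key computation.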

Downloads

Paper (4.7MB)

arXiv

Cite This Research

Plain Text

Jiaxu Liu, Li Li, Hubert P. H. Shum and Toby P. Breckon, "TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with State Space Model," in Proceedings of the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop, Colorado, USA, IEEE/CVF, 2026.

BibTeX

@inproceedings{liu26tfdm,
author={Liu, Jiaxu and Li, Li and Shum, Hubert P. H. and Breckon, Toby P.},
booktitle={Proceedings of the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop},
title={TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with State Space Model},
year={2026},
publisher={IEEE/CVF},
location={Colorado, USA},
}

RIS

TY  - CONF
AU  - Liu, Jiaxu
AU  - Li, Li
AU  - Shum, Hubert P. H.
AU  - Breckon, Toby P.
T2  - Proceedings of the 2026 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshop
TI  - TFDM: Time-Variant Frequency-Based Point Cloud Diffusion with State Space Model
PY  - 2026
PB  - IEEE/CVF
ER  -

Similar Research

Shuang Chen, Haozheng Zhang, Amir Atapour-Abarghouei and Hubert P. H. Shum, "SEM-Net: Efficient Pixel Modelling for Image Inpainting with Spatially Enhanced SSM", Proceedings of the 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025

Shuang Chen, Amir Atapour-Abarghouei, Haozheng Zhang and Hubert P. H. Shum, "MxT: Mamba x Transformer for Image Inpainting", Proceedings of the 2024 British Machine Vision Conference (BMVC), 2024

Shuang Chen, Amir Atapour-Abarghouei and Hubert P. H. Shum, "HINT: High-quality INpainting Transformer with Mask-Aware Encoding and Enhanced Attention", IEEE Transactions on Multimedia (TMM), 2024

Jamie Stirling, Noura Al Moubayed and Hubert P. H. Shum, "Investigating Permutation-Invariant Discrete Representation Learning for Spatially Aligned Images", Proceedings of the 2026 International Conference on Pattern Recognition (ICPR), 2026


Last updated on 4 May 2026