Session Assignment

Paper ID

Paper Title

Session

Primary Subject Area

18

One Shot Object Detection Via Hierarchical Adaptive Alignment

DRT-1

Machine Learning for Multimedia

115

ACCR: Auto-labeling for Ancient Chinese Handwritten Characters Recognition on CNN

DRT-1

Machine Learning for Multimedia

95

BAM: A Bi-directional Attention Module for Masked Face Recognition

DRT-1

Machine Learning for Multimedia

141

Improved PSP-Net Segmentation Network for Automatic Detection of Neovascularization in Color Fundus Images

DRT-1

Machine Learning for Multimedia

101

MCascade R-CNN: A Modified Cascade R-CNN for Detection of Calcified on Coronary Artery Angiography Images

DRT-1

Machine Learning for Multimedia

27

CNN-Based Post-Processing Filter for Video Compression with Multi-Scale Feature Representation

LBC-1

Emerging Techniques for Image and Video Coding Standards

14

Learned Lossless JPEG Transcoding via Joint Lossy and Residual Compression

LBC-1

Emerging Techniques for Image and Video Coding Standards

159

Frequency-aware Learned Image Compression for Quality Scalability

LBC-1

Image and Video Compression Beyond Standards

119

A Learning-based Approach for Martian Image Compression

LBC-1

Image and Video Compression Beyond Standards

85

Neural Frank-Wolfe Policy Optimization for Region-of-Interest Intra-Frame Coding with HEVC/H.265

LBC-1

Image and Video Compression Beyond Standards

127

On Pre-chewing Compression Degradation for Learned Video Compression

LBC-2

Image and Video Compression Beyond Standards

84

Autoencoder-based intra prediction with auxiliary feature

LBC-2

Image and Video Compression Beyond Standards

151

End-to-end Image Compression with Swin-Transformer

LBC-2

Image and Video Compression Beyond Standards

59

Reducing The Mismatch Between Marginal and Learned Distributions in Neural Video Compression

LBC-2

Image and Video Compression Beyond Standards

70

High-frequency guided CNN for video compression artifacts reduction

LBC-2

Emerging Techniques for Image and Video Coding Standards

56

Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding

LBC-3

Emerging Techniques for Image and Video Coding Standards

152

Improving Latent Quantization of Learned Image Compression with Gradient Scaling

LBC-3

Image and Video Compression Beyond Standards

114

A new way of video compression via forward-referencing using deep learning

LBC-3

Image and Video Compression Beyond Standards

31

Rate Controllable Learned Image Compression Based on RFL Model

LBC-3

Image and Video Compression Beyond Standards

165

Multi-stage locally and long-range correlated feature fusion for Learned In-loop Filter in VVC

LBC-3

Image and Video Compression Beyond Standards

50

A efficient predictive wavelet transform for LiDAR point cloud attribute compression

PCC-1

(Special Session) 3D Point Cloud Acquisition, Processing and Communication (3DPC-APC)

103

PCGFormer: Lossy Point Cloud Geometry Compression via Local Self-Attention

PCC-1

(Special Session) Immersive Visual Volumetric Content Representation and Compression

43

Residual-based Near-lossless Point Cloud Geometry Compression

PCC-1

Dynamic Point Cloud Capture and Compression

90

RGBD-based Real-time Volumetric Reconstruction System: Architecture Design and Implementation

PCC-1

Dynamic Point Cloud Capture and Compression

71

Geometry Reconstruction for Spatial Scalability in Point Cloud Compression Based on the Prediction of Neighbours’ Weights

PCC-1

Dynamic Point Cloud Capture and Compression

23

Reduced Reference Quality Assessment for Point Cloud Compression

QOE-1

Evaluation of Image and Video Coding Standards

63

Video Quality Assessment based on Quality Aggregation Networks

QOE-1

Machine Learning for Multimedia

108

No Reference Stereoscopic Video Quality Assessment based on Human Vision System

QOE-1

Multimedia Content Analysis, Representation, and Understanding

41

Generalized Gaussian Distribution Based Distortion Model for the H.266/VVC Video Coder

QOE-1

Emerging Techniques for Image and Video Coding Standards

73

No-reference Stereoscopic Image Quality Assessment Based on Parallel Multi-scale Perception

QOE-1

Multimedia Content Analysis, Representation, and Understanding

100

MSCI: A Multi-source Compound Image Database for Compression Distortion Quality Assessment

QOE-1

Visual Communications

135

Controllable Space-Time Video Super-Resolution via Enhanced Bidirectional Flow Warping

SR-1

Automated Machine Learning for Visual Signal Processing

20

Single Image Super-Resolution Using ConvNeXt

SR-1

Machine Learning for Multimedia

89

Refine-PU: A Graph Convolutional Point Cloud Upsampling Network using Spatial Refinement

SR-1

Machine Learning for Multimedia

72

Face Super Resolution based on Contrastive Learning

SR-1

Multimedia Content Analysis, Representation, and Understanding

2

Visual Analysis motivated Super-Resolution Model for Image Reconstruction

SR-1

Video Coding for Machines

198

A Fast Motion Estimation Method With Hamming Distance for LiDAR Point Cloud Compression

SS-1

(Special Session) 3D Point Cloud Acquisition, Processing and Communication (3DPC-APC)

199

PointNetGeM: Simple and Efficient Point Cloud Based Network for Place Recognition

SS-1

(Special Session) 3D Point Cloud Acquisition, Processing and Communication (3DPC-APC)

200

SparseARFM-SI: Rotary Point Cloud Place Recognition Based on Multi-Resolution and Attention Mechanism

SS-1

(Special Session) 3D Point Cloud Acquisition, Processing and Communication (3DPC-APC)

171

Augmented Normalizing Flow for Point Cloud Geometry Coding

SS-1

(Special Session) 3D Point Cloud Acquisition, Processing and Communication (3DPC-APC)

203

Azimuth Adjustment Considering LiDAR Calibration for the Predictive Geometry Compression in G-PCC

SS-1

(Special Session) 3D Point Cloud Acquisition, Processing and Communication (3DPC-APC)

117

Fast Inter Prediction Mode Decision Method Based On Random Forest For H.266/VVC

VC-1

Emerging Techniques for Image and Video Coding Standards

148

Global Homography Motion Compensation for Versatile Video Coding

VC-1

Emerging Techniques for Image and Video Coding Standards

29

Performance Analysis of WebRTC Embedding Optimized HEVC CodeC

VC-1

Emerging Techniques for Image and Video Coding Standards

102

An Efcient Content-aware Downsampling-based Video Compression Framework

VC-1

Image and Video Compression Beyond Standards

170

Adaptive boundary width of Geometric Partitioning Mode for Beyond Versatile Video Coding

VC-1

Image and Video Compression Beyond Standards

160

Block Importance Mapping for Video Encoding

VC-2

Image and Video Compression Beyond Standards

28

History-parameter-based Affine Model Inheritance

VC-2

Image and Video Compression Beyond Standards

118

Fast CU Partition Method Based on Extra Trees for VVC Intra Coding

VC-2

Image and Video Compression Beyond Standards

155

Efficient Interpolation Filters for Chroma Motion Compensation in Video Coding

VC-2

Image and Video Compression Beyond Standards

80

Enhanced motion list reordering for video coding

VC-2

Image and Video Compression Beyond Standards

69

3D Tensor Display for Non-Lambertian Content

SS-2

(Special Session) Immersive Visual Volumetric Content Representation and Compression

26

DYNAMIC MESH COMMONALITY MODELING USING THE CUBOIDAL PARTITIONING

SS-2

(Special Session) Immersive Visual Volumetric Content Representation and Compression

195

Low Light RAW Image Enhancement Using Paired Fast Fourier Convolution and Transformer

SS-3

(Special Session) Low level vision and signal recovery

133

Spike Signal Reconstruction Based on Inter-Spike Similarity

SS-3

(Special Session) Low level vision and signal recovery

201

Recurrent Multi-connection Fusion Network for Single Image Deraining

SS-3

(Special Session) Low level vision and signal recovery

54

A Large-scale Sports Tracking Dataset and Progressive Re-detection Based Sports Tracking

DRT-2

Machine Learning for Multimedia

174

Clothing Retrieval from Class Aware Attention Embedding to KN Loss Learning

DRT-2

Machine Learning for Multimedia

91

ML-FDA: Meta-Learning via Feature Distribution Alignment for Few-Shot Learning

DRT-2

Machine Learning for Multimedia

40

Weakly Supervised Region-Level Contrastive Learning for Efficient Object Detection

DRT-2

Machine Learning for Multimedia

61

PickDet: A Detection Framework for Aerial-view Scene

DRT-2

Multimedia Content Analysis, Representation, and Understanding

139

ERINet: Effective Rotation Invariant Network for Point Cloud based Place Recognition

DRT-3

Multimedia Content Analysis, Representation, and Understanding

13

DE-CrossDet: Divisible and Extensible Crossline Representation for Object Detection

DRT-3

Multimedia Content Analysis, Representation, and Understanding

126

Mask-Guided Transformer for Human-Object Interaction Detection

DRT-3

Multimedia Content Analysis, Representation, and Understanding

206

Asynchronous Autoregressive Prediction for Satellite Anomaly Detection

DRT-3

Multimedia Content Analysis, Representation, and Understanding

154

CdCLR: Clip-Driven Contrastive Learning for Skeleton-Based Action Recognition

DRT-3

Multimedia Content Analysis, Representation, and Understanding

49

Semantic Compensation Based Dual-Stream Feature Interaction Network for Multi-oriented Scene Text Detection

DRT-4

Multimedia Content Analysis, Representation, and Understanding

169

Blood Volume Pulse Signal Extraction based on Spatio-Temporal Low-Rank Approximation for Heart Rate Estimation

DRT-4

Multimedia Content Analysis, Representation, and Understanding

111

On Data Annotation Efficiency for Image Based Crowd Counting

DRT-4

Multimedia Content Analysis, Representation, and Understanding

77

Annotating Only at Definite Pixels: A Novel Weakly Supervised Semantic Segmentation Method for Sea Fog Recognition

DRT-4

Multimedia Content Analysis, Representation, and Understanding

79

Cross-Layer Feature based Multi-Granularity Visual Classification

DRT-4

Multimedia Content Analysis, Representation, and Understanding

35

Space and Level Cooperation Framework for Pathological Cancer Grading

DRT-5

Multimedia Content Analysis, Representation, and Understanding

66

Dual-stream Self-attention Network for Image Captioning

DRT-5

Multimedia Content Analysis, Representation, and Understanding

88

STSI: Efficiently Mine Spatio-Temporal Semantic Information between Different Multimodal for Video Captioning

DRT-5

Multimedia Content Analysis, Representation, and Understanding

99

Texture-aware Network for Smoke Density Estimation

DRT-5

Multimedia Content Analysis, Representation, and Understanding

24

A Fast and Effective Framework for Camera Calibration in Sport Videos

QOE-2

Machine Learning for Multimedia

153

Blind Gaussian Deep Denoiser Network using Multi-Scale Pixel Attention

QOE-2

Machine Learning for Multimedia

186

Distribution-aware Low-bit Quantization for 3D Point Cloud Networks

QOE-2

Multimedia Content Analysis, Representation, and Understanding

175

Multi-information Aggregation Network for Fundus Image Quality Assessment

QOE-2

Multimedia Content Analysis, Representation, and Understanding

124

Ultra-High Resolution Image Segmentation with Efficient Multi-Scale Collective Fusion

QOE-2

Multimedia Content Analysis, Representation, and Understanding

37

Semantic Attribute Guided Image Aesthetics Assessment

QOE-3

Multimedia Content Analysis, Representation, and Understanding

137

Spectral Analysis of Aerial Light Field for Optimization Sampling and Rendering of Unmanned Aerial Vehicle

QOE-3

Visual Communications

134

A Sparsity Analysis of Light Field Signal For Capturing Optimization of Multi-view Images

QOE-3

Visual Communications

62

Quality Assessment of Screen Content Images Based on Multi-Pathway Convolutional Neural Network

QOE-3

Multimedia Content Analysis, Representation, and Understanding

176

DesnowFormer: an effective transformer-based image desnowing network

QOE-3

Multimedia Content Analysis, Representation, and Understanding

19

CFNet: A Coarse-to-Fine Network for Few Shot Semantic Segmentation

SSP-1

Multimedia Content Analysis, Representation, and Understanding

30

Robust Dynamic Background Modeling for Foreground Estimation

SSP-1

Multimedia Content Analysis, Representation, and Understanding

110

ENDE-GNN: An Encoder-decoder GNN Framework for Sketch Semantic Segmentation

SSP-1

Multimedia Content Analysis, Representation, and Understanding

39

Mining Regional Relation from Pixel-wise Annotation for Scene Parsing

SSP-1

Multimedia Content Analysis, Representation, and Understanding

6

Hierarchical Reinforcement Learning Based Video Semantic Coding for Segmentation

SSP-1

Video Coding for Machines

53

MRIQA: Subjective Method and Objective Model for Magnetic Resonance Image Quality Assessment

QOE-4

Multimedia Content Analysis, Representation, and Understanding

15

High-Speed Scene Reconstruction from Low-Light Spike Streams

QOE-4

Multimedia Content Analysis, Representation, and Understanding

121

SAD360: Spherical Viewport-Aware Dynamic Tiling for 360-Degree Video Streaming

QOE-4

Multimedia Delivery, Multimedia Broadcasting

75

Recurrent Network with Enhanced Alignment and Attention-Guided Aggregation for Compressed Video Quality Enhancement

QOE-4

Automated Machine Learning for Visual Signal Processing

113

On the Importance of Temporal Dependencies of Weight Updates in Communication Efficient Federated Learning

QOE-4

Compression of Neural Networks for Visual Communications

158

Image Inpainting with Frequency Domain Wavelet Convolution

QOE-5

Multimedia Content Analysis, Representation, and Understanding

125

Flocking Birds of a Feather Together:\\ Dual-step GAN Distillation via Realer-Fake Samples

QOE-5

Compression of Neural Networks for Visual Communications

149

A Privacy-Preserving and End-to-End-Based Encrypted Image Retrieval Scheme

QOE-5

Image/Video Privacy

130

A Comparative Study of Cross-Model Universal Adversarial Perturbation for Face Forgery

QOE-5

Image/Video Privacy

10

Distinguishing Computer-generated Images from Photographic Images: A Texture-Aware deep learning-based Method

QOE-5

Machine Learning for Multimedia

VC-1,2

Video Coding

LBC-1,2,3

Learning Based Compression

SR-1

Super Resolution

SS-1

Special Session

PCC-1

Point Cloud Compression

DRT-1

Detection, Recognition and Tracking

QOE-1

Quality of Experience

PS-1

Privacy and Security

SSP-1

Segmentation and Scene Parsing