List of Accepted Papers

Following is the list of accepted ICIP 2023 papers, sorted by paper title. You can use the search feature of your web browser to find your paper number. Notifications to all authors have also been sent by email. If you have not received your notification of the results by email, please contact us at icip2023@cmsworkshops.com.

Paper Number Paper Title
28393D brain registration with intensity shift robustness
33953D Face Reconstruction based on Weakly-Supervised Learning Morphable Face Model
27003D Facial Expression Generator Based on Transformer VAE
25143D HIPPOCAMPUS SEGMENTATION USING A HOG BASED LOSS FUNCTION WITH MAJORITY POOLING
17203D Human Motion Prediction via Activity-driven Attention-MLP Association
28273D Unsupervised Region-Aware Registration Transformer
28753D-CSL: SELF-SUPERVISED 3D CONTEXT SIMILARITY LEARNING FOR NEAR-DUPLICATE VIDEO RETRIEVAL
31103D-DDA: 3D Dual-Domain Attention For Brain Tumor Segmentation
23073M3D: Multi-view, Multi-path, Multi-representation for 3D Object Detection
2640A 3D Label Stereo Matching Method Using Underwater Energy Function
3130A BASELINE ON CONTINUAL LEARNING METHODS FOR VIDEO ACTION RECOGNITION
2576A CAM-enhancing Generative Person Re-ID Method based Global and Local Features
2206A CONTRARIO DETECTION OF H.264 VIDEO DOUBLE COMPRESSION
1866A CONTRASTIVE LEARNING APPROACH FOR SCREENSHOT DEMOIRÉING
2059A CONVERGENT NEURAL NETWORK FOR NON-BLIND IMAGE DEBLURRING
1099A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness
2195A DECOUPLED SPATIAL-CHANNEL INVERTED BOTTLENECK FOR IMAGE COMPRESSION
2770A DIFFERENTIABLE GAUSSIAN PROTOTYPE LAYER FOR EXPLAINABLE FRUIT SEGMENTATION
2377A FEATURE REFINEMENT MODULE FOR LIGHT-WEIGHT SEMANTIC SEGMENTATION NETWORK
2613A GLOBAL-LOCAL CONTRASTIVE LEARNING FRAMEWORK FOR VIDEO CAPTIONING
1495A Joint Model-Driven Unfolding Network For Degraded Low-Quality Color-Depth Images Enhancement
1616A Key Feature-Enhanced Network for Remote Sensing Object Detection
1929A LARGE SCALE MULTI-VIEW RGBD VISUAL AFFORDANCE LEARNING DATASET
2873A LIGHTWEIGHT HYBRID REPRESENTATION FOR VIRTUAL COMPLEX SCENES
3269A Multichannel Localization Method for Camouflaged Object Detection
2480A MULTI-MODAL TRANSFORMER APPROACH FOR FOOTBALL EVENT CLASSIFICATION
3463A Multiscale Approach to Deep Blind Image Quality Assessment
1166A MULTI-SCALE CELL SEGMENTATION METHOD FOR DETECTING HEMATOLOGICAL DISORDERS
1180A MULTISCALE RESIDUAL SOLVER FOR TOTAL VARIATION MODELS
1959A MULTI-STREAM NETWORK FOR MESH DENOISING VIA GRAPH NEURAL NETWORKS WITH GAUSSIAN CURVATURE
1036A NO-REFERENCE QUALITY ASSESSMENT METHOD FOR DIGITAL HUMAN HEAD
2738A NOVEL CLASS ACTIVATION MAP FOR VISUAL EXPLANATIONS IN MULTI-OBJECT SCENES
1510A NOVEL PSEUDO-LABEL GENERATION METHOD FOR SEMI-SUPERIVISED SAR TARGET RECOGNITION BASED ON DEEP LEARNING
1875A NOVEL SELECTIVE ENCRYPTION SCHEME FOR H.266/VVC VIDEO
3451A NOVEL WEAKLY SUPERVISED SEGMENTATION APPROACH FOR RAPID LEFT VENTRICLE ANNOTATION
2773A PENALIZED MODIFIED HUBER REGULARIZATION TO IMPROVE ADVERSARIAL ROBUSTNESS
1793A Privacy-Preserving Approach for Multi-Source Domain Adaptive Object Detection
2536A PROBABILITY-BASED ALL-ZERO BLOCK EARLY TERMINATION ALGORITHM FOR QSHVC
1578A SALIENCY-AWARE METHOD FOR ARBITRARY STYLE TRANSFER
2484A Semi-Paired Approach For Label-to-Image Translation
3098A SHALLOW U-NET WITH SPLIT-FUSED ATTENTION MECHANISM FOR RETINAL VESSEL SEGMENTATION
2082A STRUCTURE-FUSION NETWORK FOR MEDICAL IMAGE CLASSIFICATION
1127A TOPOLOGY BASED DENOISING APPROACH FOR 2D SCALAR FIELDS
1236A two-dimensional difference histogram equalization with fuzzy cumulative distribution correction for dark images
1504A Unified Framework for Static and Dynamic Functional Connectivity Augmentation for Multi-Domain Brain Disorder Classification
2350A VISIBLE AND INFRARED IMAGE FUSION FRAMEWORK BASED ON DUAL-PATH ENCODER-DECODER AND MULTI-SCALE DISCRETE WAVELET TRANSFORM
2313AAFACE: ATTRIBUTE-AWARE ATTENTIONAL NETWORK FOR FACE RECOGNITION
1555ABNORMAL-AWARE LOSS AND FULL DISTILLATION FOR UNSUPERVISED ANOMALY DETECTION BASED ON KNOWLEDGE DISTILLATION
2811ACCURATE REGISTRATION BETWEEN ULTRA-WIDE-FIELD AND NARROW ANGLE RETINA IMAGES WITH 3D EYEBALL SHAPE OPTIMIZATION
3049ACCURATE SEGMENTATION FOR PATHOLOGICAL LUNG BASED ON INTEGRATION OF 3D APPEARANCE AND SURFACE MODELS
1587ACCURATE SINGLE-IMAGE DEFOCUS DEBLURRING BASED ON IMPROVED INTEGRATION WITH DEFOCUS MAP ESTIMATION
2142ACTION ANTICIPATION WITH GOAL CONSISTENCY
3481Activating Frequency and ViT for 3D Point Cloud Quality Assessment without Reference
1291ADAPTIVE ANCHOR LABEL PROPAGATION FOR TRANSDUCTIVE FEW-SHOT LEARNING
1408ADAPTIVE AND ROBUST MMWAVE-BASED 3D HUMAN MESH ESTIMATION FOR DIVERSE POSES
2146ADAPTIVE CAMOUFLAGE PATTERN GENERATION TO DIFFERENT ENVIRONMENTS VIA CONTENT-AWARE STYLE TRANSFER
2003ADAPTIVE GRAPH CONVOLUTION MODULE FOR SALIENT OBJECT DETECTION
1388ADAPTIVE SEMI-SUPERVISED MIXUP WITH IMPLICIT LABEL LEARNING AND SAMPLE RATIO BALANCING
1341ADA-VIT: ATTENTION-GUIDED DATA AUGMENTATION FOR VISION TRANSFORMERS
1895ADDING DISTANCE INFORMATION TO SELF-SUPERVISED LEARNING FOR RICH REPRESENTATIONS
1221ADFA: Attention-augmented Differentiable top-k Feature Adaptation for unsupervised medical anomaly detection
1378Adopting Self-supervised Learning into Unsupervised Video Summarization through Restorative score.
2908ADVANCING THE RATE-DISTORTION-COMPUTATION FRONTIER FOR NEURAL IMAGE COMPRESSION
1965ADVERSARIAL DEFECT SYNTHESIS FOR INDUSTRIAL PRODUCTS IN LOW DATA REGIME
2907Adversarial Defense via Perturbation-Disentanglement in Hyperspectral Image Classification
2183ADVERSARIAL EXAMPLE DETECTION BAYESIAN GAME
1144AFNET-M: ADAPTIVE FUSION NETWORK WITH MASKS FOR 2D+3D FACIAL EXPRESSION RECOGNITION
1159AICT: AN ADAPTIVE IMAGE COMPRESSION TRANSFORMER
2815All-intra rate control using low complexity video features for Versatile Video Coding
2784AN ADJUSTABLE FAST DECISION METHOD FOR AFFINE MOTION ESTIMATION IN VVC
2331AN ALTERNATIVE TO BILINEAR AND NEAREST-NEIGHBOUR ENLARGING FOR MONITOR DISPLAYS
2834AN AUTOMATIC COLORECTAL POLYPS DETECTION APPROACH FOR CT COLONOGRAPHY
2263AN EFFICIENT DEEP UNROLLING SUPER-RESOLUTION NETWORK FOR LIDAR AUTOMOTIVE SCENES
1316An Efficient Deep Video Model for Deepfake Detection
3340AN ENHANCED NEURON ATTRIBUTION-BASED ATTACK VIA PIXEL DROPPING
1237AN IMPROVED UPPER BOUND ON THE RATE-DISTORTION FUNCTION OF IMAGES
2731An Inter-observer consistent deep adversarial training for visual scanpath prediction
2287AN L2-NORMALIZED SPATIAL ATTENTION NETWORK FOR ACCURATE AND FAST CLASSIFICATION OF BRAIN TUMORS IN 2D T1-WEIGHTED CE-MRI IMAGES
2038ARBITRARY POINT CLOUD UPSAMPLING VIA DUAL BACK-PROJECTION NETWORK
2472ArtiFact: A Large-Scale Dataset with Artificial and Factual Images for Generalizable and Robust Synthetic Image Detection
3097ASVFI: Audio-driven Speaker Video Frame Interpolation
1285ASYMMETRIC SCALABLE CROSS-MODAL HASHING
1891ATTEN-ADAPTER: A UNIFIED ATTENTION-BASED ADAPTER FOR EFFICIENT TUNING
2659ATTENTION-GUIDED CONTRASTIVE MASKED IMAGE MODELING FOR TRANSFORMER-BASED Self-SUPERVISED LEARNING
2047ATTENTIVE DEEP K-SVD NETWORK FOR PATCH CORRELATED IMAGE DENOISING
2184ATTRIBUTE LEARNING WITH KNOWLEDGE ENHANCED PARTIAL ANNOTATIONS
2055Audio-Visual Quality Assessment for User Generated Content: Database and Method
2924AUTOMATED DIAGNOSIS OF BREAST CANCER USING DEEP LEARNING-BASED WHOLE SLIDE IMAGE ANALYSIS OF MOLECULAR BIOMARKERS
1900AUTONOMOUS POLYCRYSTALLINE MATERIAL DECOMPOSITION FOR HYPERSPECTRAL NEUTRON TOMOGRAPHY
2179BACKGROUND CLUSTERING PRE-TRAINING FOR FEW-SHOT SEGMENTATION
1073BackGround Masked Guided Network for Skin Lesion Segmentation in Dermoscopy Image
3178Base Layer Efficiency in Scalable Human-Machine Coding
1594BATINET: BACKGROUND-AWARE TEXT TO IMAGE SYNTHESIS AND MANIPULATION NETWORK
2421BAYESIAN HYBRID LOSS FOR HYPERSPECTRAL SISR USING 3D WIDE RESIDUAL CNN
3114BCKD: BLOCK-CORRELATION KNOWLEDGE DISTILLATION
3131BITRATE-PERFORMANCE OPTIMIZED MODEL TRAINING FOR THE NEURAL NETWORK CODING (NNC) STANDARD
1334BITS-NET: BLIND IMAGE TRANSPARENCY SEPARATION NETWORK
2628BLACKBOX FACE RECONSTRUCTION FROM DEEP FACIAL EMBEDDINGS USING A DIFFERENT FACE RECOGNITION MODEL
2009Blind Omnidirectional Image Quality Assessment: Integrating Local Statistics and Global Semantics
2588BLIND QUALITY ASSESSMENT OF LIGHT FIELD IMAGE BASED ON SPATIO-ANGULAR TEXTURAL VARIATION
3346BLOCK-BASED MOTION ESTIMATION FOR DEEP-LEARNED VIDEO CODING
3485BPQA: A BLIND POINT CLOUD QUALITY ASSESSMENT METHOD
2582BS-YOLOV5S: INSULATOR DEFECT DETECTION WITH ATTENTION MECHANISM AND MULTI-SCALE FUSION
2187CAN HUMAN ATTRIBUTE SEGMENTATION BE MORE ROBUST TO OPERATIONAL CONTEXTS WITHOUT NEW LABELS?
1488CAN WE DISTILL KNOWLEDGE FROM POWERFUL TEACHERS DIRECTLY?
2430Capsule Transformer Network for Dynamic Hand Gesture Recognition using Multimodal Data
2949CDNET: CLUSTER DECISION FOR DEEPFAKE DETECTION GENERALIZATION
1493cDPMSR: CONDITIONAL DIFFUSION PROBABILISTIC MODELS FOR SINGLE IMAGE SUPER-RESOLUTION
1649Change Detection for Remote Sensing Images based on Semantic Prototypes and Contrastive Learning
2347CHANNEL PRUNING VIA ATTENTION MODULE AND MEMORY CURVE
1245CKT: CROSS-IMAGE KNOWLEDGE TRANSFER FOR TEXTURE ANOMALY DETECTION
3176CLASSIFICATION TASK ASSISTED SEGMENTATION NETWORK FOR BREAST TUMOR SEGMENTATION IN ULTRASOUND IMAGES
1130CLIP4STEREO: REVISITING DOMAIN GENERALIZED STEREO MATCHING VIA CLIP
1476CLIP-FG:SELECTING DISCRIMINATIVE IMAGE PATCHES BY CONTRASTIVE LANGUAGE-IMAGE PRE-TRAINING FOR FINE-GRAINED IMAGE CLASSIFICATION
3123CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection
2068CLOT: Contrastive Learning-Driven and Optimal Transport-Based Training for Simultaneous Clustering
3366ClothFit: Cloth-Human-Attribute Guided Virtual Try-On Network Using 3D Simulated Dataset
3134CNN-BASED ESTIMATION OF WATER DEPTH FROM MULTISPECTRAL DRONE IMAGERY FOR MOSQUITO CONTROL
1951Coarse-to-Fine Pyramid Feature Mining for Wheat Head Detection
1480COCCA: POINT CLOUD COMPLETION THROUGH CAD CROSS-ATTENTION
2349Coco-Teach: A CONTRASTIVE CO-TEACHING NETWORK FOR INCREMENTAL 3D OBJECT DETECTION
2564COLOR LEARNING FOR IMAGE COMPRESSION
1491Combining Self-Supervised and Supervised Learning with Noisy Labels
2573COMPACT SELECTIVE TRANSFORMER BASED ON INFORMATION ENTROPY FOR FACIAL EXPRESSION RECOGNITION IN THE WILD
2772Comparative Study of Saliency- and Scanpath-Based Approaches for Patch Selection in Image Quality Assessment
2294COMPLEXITY REDUCTION OF GRAPH SIGNAL DENOISING BASED ON FAST GRAPH FOURIER TRANSFORM
2271COMPLEXITY SCALABLE LEARNING-BASED IMAGE DECODING
2160COMPLEXITY-EFFICIENT QUANTIZER SELECTION FOR HEVC ENCODER
3272COMPOUND MULTI-BRANCH FEATURE FUSION FOR IMAGE DERAINDROP
3462Conditional Injective Flows for Bayesian Imaging
2912CONFIDENCE-AWARE CLUSTERED LANDMARK FILTERING FOR HYBRID 3D FACE TRACKING
2338CONSISTENT AND DIVERSE HUMAN MOTION PREDICTION USING CONDITIONAL VARIATIONAL AUTOENCODER WITH CONTEXT-AWARE LATENT SPACE
1200CONSISTENT AND MULTI-SCALE SCENE GRAPH TRANSFORMER FOR SEMANTIC-GUIDED IMAGE OUTPAINTING
3096CONTENT-ADAPTIVE PARALLEL ENTROPY CODING FOR END-TO-END IMAGE COMPRESSION
1010CONTEXT-AWARE DATA AUGMENTATION FOR LIDAR 3D OBJECT DETECTION
1268Context-Aware Inpainter-Refiner for Skeleton-Based Human Motion Completion
2655Context-Aware Multi-Stream Networks for Dimensional Emotion Prediction in Images
2696CONTEXT-AWARE PEDESTRIAN TRAJECTORY PREDICTION WITH MULTIMODAL TRANSFORMER
3431CONTEXT-AWARE TRANSFORMERS FOR WEAKLY SUPERVISED BAGGAGE THREAT LOCALIZATION
2781CONTINUAL LEARNING FOR OUT-OF-DISTRIBUTION PEDESTRIAN DETECTION
3373CONTOUR ARTIFACT REMOVAL FOR EXPANDED HDR CONTENT
2468CONTOUR-ASSISTED LONG-RANGE PERCEPTUAL NETWORK FOR CAMOUFLAGED INSTANCE SEGMENTATION
1313CONTROLLING FACIAL ATTRIBUTE SYNTHESIS BY DISENTANGLING ATTRIBUTE FEATURE AXES IN LATENT SPACE
3063CORRELATION AND FOREGROUND ATTENTION TO IMPROVE OBJECT DETECTION
1364COST-EFFICIENT MULTI-INSTANCE MULTI-LABEL ACTIVE LEARNING VIA CORRELATION OF FEATURES
2409COUPLING SPATIAL AND CHANNEL TRANSFORMER FOR SINGLE IMAGE DERAINING
1613COVARIANCE-AWARE FEATURE ALIGNMENT WITH PRE-COMPUTED SOURCE STATISTICS FOR TEST-TIME ADAPTATION TO MULTIPLE IMAGE CORRUPTIONS
2982CPU MICROARCHITECTURAL PERFORMANCE ANALYSIS OF SVT-AV1 ENCODER
1233CROSS SPECTRAL IMAGE RECONSTRUCTION USING A DEEP GUIDED NEURAL NETWORK
1477Cross-Domain Few-Shot Classification via Inter-Source Stylization
1125Cross-Inferential Networks for Source-free Unsupervised Domain Adaptation
1462CROSS-LAYER PATCH ALIGNMENT AND INTRA-AND-INTER PATCH RELATIONS FOR KNOWLEDGE DISTILLATION
1473CROSS-SCALE QUERY-SUPPORT ALIGNMENT APPROACH FOR SMALL OBJECT DETECTION IN THE FEW-SHOT REGIME
1847CR-UNIT: Unsupervised Image-to-Image Translation with Content Reconstruction
1721CSSBA: A CLEAN LABEL SAMPLE-SPECIFIC BACKDOOR ATTACK
1575CTI-UNET: HYBRID LOCAL FEATURES AND GLOBAL REPRESENTATIONS EFFICIENTLY
1731Curriculum Knowledge Switching for Pancreas Segmentation
2497CXRMIM: MASKED IMAGE MODELING PRE-TRAINING PARADIGM FOR CHEST X-RAY IMAGES ANALYSIS
1624Data Augmentation using Corner CutMix and an Auxiliary Self-supervised Loss
2693DATA GENERATION WITH STRUCTURE ENFORCING ADVERSARIAL LEARNING
2292DATA POISONING ATTACK AIMING THE VULNERABILITY OF CONTINUAL LEARNING
1349DATASET-LEVEL DIRECTED IMAGE TRANSLATION FOR CROSS-DOMAIN CROWD COUNTING
2251DAUT: UNDERWATER IMAGE ENHANCEMENT USING DEPTH AWARE U-SHAPE TRANSFORMER
1619DEEP ACTIVE LEARNING BASED ON SALIENCY-GUIDED DATA AUGMENTATION FOR IMAGE CLASSIFICATION
1548DEEP BAYESIAN BLIND COLOR DECONVOLUTION OF HISTOLOGICAL IMAGES
3443DEEP CNN-BASED PRE-ENCODING PERCEPTUAL QUALITY CONTROL AND PREDICTION
1865DEEP CROSS-MODAL STEGANOGRAPHY USING NEURAL REPRESENTATIONS
2937DEEP LEARNING BASED WORKFLOW FOR ACCELERATED INDUSTRIAL X-RAY COMPUTED TOMOGRAPHY
3363DEEP LEARNING MEETS PARTICLE SWARM OPTIMIZATION FOR AORTIC VALVE CALCIUM SCORING FROM CARDIAC COMPUTED TOMOGRAPHY
2394DEEP LEARNING RECONSTRUCTION FOR SINGLE PIXEL IMAGING WITH GENERATIVE ADVERSARIAL NETWORKS
2742DEEP LEARNING-BASED COMPRESSED DOMAIN POINT CLOUD CLASSIFICATION
2958DEEP OC-SORT: MULTI-PEDESTRIAN TRACKING BY ADAPTIVE RE-IDENTIFICATION
1600Deep robust image restoration using the Moore-Penrose blur inverse
1042DEEP UNFOLDING NETWORK WITH PHYSICS-BASED PRIORS FOR UNDERWATER IMAGE ENHANCEMENT
1831DEEP UNROLLING SHRINKAGE NETWORK FOR DYNAMIC MR IMAGING
1986DEEP UNSUPERVISED HASHING WITH SEMANTIC CONSISTENCY LEARNING
2387DEEP UNSUPERVISED REFLECTION REMOVAL USING DIFFUSION MODELS
2469DEEP VARIATIONAL SEGMENTATION OF TOPOLOGY-CONSTRAINED OBJECT SETS, WITH CORRELATED UNCERTAINTY MODELS, FOR ROBUSTNESS TO DEGRADATIONS
2373Deepfake Face Provenance for Proactive Forensics
1486DEEP-LEARNING-BASED ENERGY AWARE IMAGES
3450Deformation Robust Text Spotting with Geometric Prior
1457DEGRADATION CONDITIONED GAN FOR DEGRADATION GENERALIZATION OF FACE RESTORATION MODELS
2952DENOISING POINT CLOUDS WITH INTENSITY AND SPATIAL FEATURES IN RAINY WEATHER
2486DENSE DEPTH ESTIMATION FOR SURGICAL ENDOSCOPE ROBOT WITH MULTI-BASELINE DEPTH MAP FUSION
2902DENSECL: HAZE MITIGATION USING DENSE BLOCKS AND CONTRASTIVE LOSS REGULARIZATION
1698Densely Connected Swin-UNet for Multiscale Information Aggregation in Medical Image Segmentation
2836DEPTH ESTIMATION OF MULTI-MODAL SCENE BASED ON MULTI-SCALE MODULATION
2918DEPTH MAP ESTIMATION FROM MULTI-VIEW IMAGES WITH NERF-BASED REFINEMENT
1463DESIGNING STRONG BASELINES FOR TERNARY NEURAL NETWORK QUANTIZATION THROUGH SUPPORT AND MASS EQUALIZATION
2264DETECTING STABLE DIFFUSION GENERATED IMAGES USING FREQUENCY ARTIFACTS: A CASE STUDY ON DISNEY-STYLE ART
2674DETECTION TRANSFORMER WITH DIVERSIFIED OBJECT QUERIES
2417DF-Net: Diversity-Focused Network for Video Object Detection
1438DFT-CAM: DISCRETE FOURIER TRANSFORM DRIVEN CLASS ACTIVATION MAP
1461DIFFERENTIAL ENHANCED SIAMESE SEGMENTATION NETWORK FOR PRINTED LABEL DEFECT DETECTION
2110DIFFUSIONSTR: DIFFUSION MODEL FOR SCENE TEXT RECOGNITION
1593DISPLAY POWER MODELING FOR ENERGY CONSUMPTION CONTROL
2449DISTILLING KNOWLEDGE OF BIDIRECTIONAL LANGUAGE MODEL FOR SCENE TEXT RECOGNITION
3332DLAHSD: Dynamic Label adopted in Auxiliary Head for SAR Detection
2426DLEN: DEEP LAPLACIAN ENHANCEMENT NETWORKS FOR LOW-LIGHT IMAGES
1492DNC-NET: DUAL-NEIGHBOURHOOD CONSENSUS NETWORK FOR FEATURE MATCHING
1521Document binarization with Multi-branch Gated Convolutional Generative Adversarial Networks
1514DOCUMENT CHANGE DETECTION WITH HIERARCHICAL PATCH COMPARISON
2137DODGING THE DOUBLE DESCENT IN DEEP NEURAL NETWORKS
1163DOG ACCURACY VIA EQUIVARIANCE: GET THE INTERPOLATION RIGHT
2247DOMAIN ADAPTATION IN POWER LINE SEGMENTATION: A NEW SYNTHETIC DATASET
2239Domain Adaptation of Digital Pathology Images using Joint Stain Color and Image Quality Constraints
1647DOMAIN GENERALIZATION METHOD FOR PERSON RE-ID USING METABIN AND MIXSTYLE
2079Domain Invariant Regularization By Disentangling content and style features for Visual Domain Generalization
1620DOMAIN-GENERALIZED FACE ANTI-SPOOFING WITH UNKNOWN ATTACKS
1969DPDM: FEATURE-BASED POSE REFINEMENT WITH DEEP POSE AND DEEP MATCH FOR MONOCULAR VISUAL ODOMETRY
1877DP-NET: LEARNING DISCRIMINATIVE PARTS FOR IMAGE RECOGNITION
2855DSG-PL: ROI EXTRACTION BASED ON DUAL SALIENCY GUIDED PROGRESSIVE LEARNING FOR WEAKLY LABELED REMOTE SENSING IMAGES
1367Dual Temporal Transformers for Fine-Grained Dangerous Action Recognition
1538DUAL TRANSFORMER ENCODER MODEL FOR MEDICAL IMAGE CLASSIFICATION
1520DYNAMIC DUAL-GRAPH FUSION CONVOLUTIONAL NETWORK FOR ALZHEIMER'S DISEASE DIAGNOSIS
2400DYNAMIC POINT CLOUD COMPRESSION APPROACH USING HEXAHEDRON SEGMENTATION
2980DYNAMIC RANGE TRANSFORMER (DRT): LEARNING ENHANCED LOG-PERCEPTUAL INFORMATION WITH SWIN-FOURIER CONVOLUTION NETWORK FOR HDR IMAGING
1826Dynamic Unilateral Dual Learning for Text to Image Synthesis
1325Early Detection of Cars Exiting Road-side Parking
2888EARLY DIAGNOSIS OF PROSTATE CANCER USING PARAMETRIC ESTIMATION OF IVIM FROM DW-MRI
1685EDGE SYNTHESIS BLOCK: A BUILDING UNIT FOR REAL-TIME SINGLE IMAGE SUPER RESOLUTION
1494EFFICIENT AERIAL IMAGE OBJECT DETECTION WITH IMAGING CONDITION DECOMPOSITION
3499EFFICIENT ANOMALY DETECTION USING SELF-SUPERVISED MULTI-CUE TASKS
3206EFFICIENT ANY-TARGET BACKDOOR ATTACK WITH PSEUDO POISONED SAMPLES
1774EFFICIENT CONVOLUTION AND TRANSFORMER-BASED NETWORK FOR VIDEO FRAME INTERPOLATION
2274EFFICIENT JOINT VIDEO DENOISING AND SUPER-RESOLUTION
2262EFFICIENT PER-SHOT TRANSFORMER-BASED BITRATE LADDER PREDICTION FOR ADAPTIVE VIDEO STREAMING
1561EFFICIENT PREDICTION OF MODEL TRANSFERABILITY IN SEMANTIC SEGMENTATION TASKS
3362EFFICIENT PRUNING METHOD FOR LEARNED LOSSY IMAGE COMPRESSION MODELS BASED ON SIDE INFORMATION
2769Efficient Transfer by Robust Label Selection and Learning with Pseudo-Labels
1787EFFICIENT-HDRTV: EFFICIENT SDR TO HDR CONVERSION FOR HDR TV
3023ELEGAN: AN EFFICIENT LOW LIGHT ENHANCEMENT GAN FOR UNPAIRED SUPERVISION
2619ENABLING HIGH-RESOLUTION POSE ESTIMATION IN REAL TIME USING ACTIVE PERCEPTION
2022ENABLING THE ENCODER-EMPOWERED GAN-BASED VIDEO GENERATORS FOR LONG VIDEO GENERATION
3017ENCODER COMPLEXITY CONTROL IN SVT-AV1 BY SPEED-ADAPTIVE PRESET SWITCHING
1318ENCODING-AWARE DEEP VIDEO SUPER-RESOLUTION FRAMEWORK
2445END TO END GENERATIVE META CURRICULUM LEARNING FOR MEDICAL DATA AUGMENTATION
1888Endoscopic Feature Enhancement for Stomach 3D Reconstruction without Dyeing
2309END-TO-END LEARNED LIGHT FIELD IMAGE RESCALING USING JOINT SPATIAL-ANGULAR AND EPIPOLAR INFORMATION
1435END-TO-END TRAINABLE WEAKLY NON-NEGATIVE FACTORIZATION
1934ENHANCED TEMPORAL MOTION DERIVATION BEYOND VVC
2927ENHANCED U-TRANSFORMER NETWORKS FOR AUTOMATIC PULMONARY VESSEL SEGMENTATION IN CT IMAGES
1412ENHANCING LOW-LIGHT IMAGES USING INFRARED ENCODED IMAGES
2032Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention
3181ENHANCING TARGETED TRANSFERABILITY VIA SUPPRESSING HIGH-CONFIDENCE LABELS
2490EPIGRAPHICALLY-RELAXED LINEARLY-INVOLVED GENERALIZED MOREAU-ENHANCED MODEL FOR LAYERED MIXED NORM REGULARIZATION
2643ERROR CONCEALMENT FOR SCALABLE VIDEO CODING BASED ON DEFORMABLE CONVOLUTION NETWORK
2819ESTIMATED DEPTH BASED PROGRESSIVE INTERACTIVE FRAMEWORK FOR RGB SALIENT OBJECT DETECTION IN IMAGES
3045EVENT DATA STREAM COMPRESSION BASED ON POINT CLOUD REPRESENTATION
1269EVENT-BASED CAMERA SIMULATION USING MONTE CARLO PATH TRACING WITH ADAPTIVE DENOISING
2018EXPLORING ANATOMICAL SIMILARITY IN CARDIAC-GATED SPECT IMAGES FOR MOTION COMPENSATION WITH A DEEP LEARNING NETWORK
2698EXPLORING DIFFUSION MODELS FOR UNSUPERVISED VIDEO ANOMALY DETECTION
1599EXPLORING EFFECTIVE KNOWLEDGE DISTILLATION FOR TINY OBJECT DETECTION
2026EXPLORING SELF-SUPERVISED REPRESENTATION LEARNING FOR LOW-RESOURCE MEDICAL IMAGE ANALYSIS
1585EXPLORING THE CONNECTION BETWEEN NEURON COVERAGE AND ADVERSARIAL ROBUSTNESS IN DNN CLASSIFIERS
1067FACE PHOTO-SKETCH SYNTHESIS VIA DOMAIN-INVARIANT FEATURE EMBEDDING
2977FACET-LEVEL SEGMENTATION OF 3D TEXTURES ON CULTURAL HERITAGE OBJECTS
3208FACIAL EXPRESSION RECOGNITION USING LIGHT FIELD CAMERAS: A COMPARATIVE STUDY OF DEEP LEARNING ARCHITECTURES
1750FALSE CORRESPONDENCE REMOVAL VIA REVISITING SEMANTIC CONTEXT WITH POSITION-ATTENTIVE LEARNING
2547FAST LEARNING-BASED SPLIT TYPE PREDICTION ALGORITHM FOR VVC
2242FAST OPTIMAL TRANSPORT FOR LATENT DOMAIN ADAPTATION
1490FAST QTMT PARTITION FOR VVC INTRA CODING USING U-NET FRAMEWORK
2459FAST-CONVERGENT FEDERATED LEARNING VIA CYCLIC AGGREGATION
1512FAT: FIELD-AWARE TRANSFORMER FOR 3D POINT CLOUD SEMANTIC SEGMENTATION
1723FEATURE ADVERSARIAL DISTILLATION FOR POINT CLOUD CLASSIFICATION
1912FEATURE ENHANCEMENT AND FUSION FOR RGB-T SALIENT OBJECT DETECTION
2851Feature Fusion Enhanced Super Resolution for Low Bitrate Screen Content Compression
3047FEATURE INTEGRATION VIA BACK-PROJECTION ORDERING MULTI-MODAL GAUSSIAN PROCESS LATENT VARIABLE MODEL FOR RATING PREDICTION
1481FEATURE SPACE DATA AUGMENTATION FOR VIEWPOINT-ROBUST ACTION RECOGNITION IN VIDEOS
2044FEATURE STRUCTURE SIMILARITY INDEX FOR HYBRID HUMAN AND MACHINE VISION
1766FEATURE-AWARE PROHIBITED ITEMS DETECTION FOR X-RAY IMAGES
2042FEATURE-DOMAIN PROXIMAL HIGH-DIMENSIONAL GRADIENT DESCENT NETWORK FOR IMAGE COMPRESSED SENSING
2462FEDMBP: MULTI-BRANCH PROTOTYPE FEDERATED LEARNING ON HETEROGENEOUS DATA
1971FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION BASED ON CROSS-DOMAIN SPECTRAL SEMANTIC RELATION TRANSFORMER
3074FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION WITH SPECTRAL-SPATIAL FEATURE FUSION BASED ON FUZZY BROAD LEARNING SYSTEM
2337Few-Shot Lip-Password Based Speaker Verification
3079FGC-VC: FLOW-GUIDED CONTEXT VIDEO COMPRESSION
1725FGCVQA: FINE-GRAINED CROSS-ATTENTION FOR MEDICAL VQA
1948Fibonet: A Light-weight and Efficient Neural Network for Image Segmentation
1479FIGHTING OVER-FITTING WITH QUANTIZATION FOR LEARNING DEEP NEURAL NETWORKS ON NOISY LABELS
2073FILM GRAIN REMOVAL USING METADATA
1837FINALIZATION OF VVENC'S SCREEN CONTENT DETECTOR AND TWO-PASS RATE CONTROL USING PRE-FILTERING STATISTICS
2112FINDING CAMOUFLAGED OBJECT GUIDED BY CONTOUR AND ATTENTION
3371Fine-to-coarse Object Classification of Very Large Images
2268Fisheye Multiple Object Tracking by Learning Distortions without Dewarping
3043FLASH COMPENSATED LOW-LIGHT ENHANCEMENT VIA HIERARCHICAL NETWORK PREDICTION
3364Flow-based one-class anomaly detection with Multi-frequency Feature fusion
1344FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE VIDEO SUPER-RESOLUTION
2656FLOW-GUIDED TRANSFORMER FOR VIDEO COLORIZATION
3084FORWARD DIFFUSION GUIDED RECONSTRUCTION AS A MULTI-MODAL MULTI-TASK LEARNING SCHEME
2928FOURIER SERIES AND LAPLACIAN NOISE-BASED QUANTIZATION ERROR COMPENSATION FOR END-TO-END LEARNING-BASED IMAGE COMPRESSION
2479FPGA-ACCELERATED HEVC ENCODER FOR ENERGY-EFFICIENT MULTI-ACCESS EDGE COMPUTING
3497FRACTIONAL FOURIER TRANSFORM MEETS TRANSFORMER ENCODER
2841FREQUENCY DISENTANGLED FEATURES IN NEURAL IMAGE COMPRESSION
1623FREQUENCY ENHANCEMENT NETWORK FOR EFFICIENT COMPRESSED VIDEO ACTION RECOGNITION
2542Frequency-Aware Re-parameterization for Over-fitting Based Image Compression
3257FROM FELINE CLASSIFICATION TO SKILLS EVALUATION: A MULTITASK LEARNING FRAMEWORK FOR EVALUATING MICRO SUTURING NEUROSURGICAL SKILLS
1259FULLY AUTOMATED SCAN-TO-BIM VIA POINT CLOUD INSTANCE SEGMENTATION
2886FULLY AUTOMATIC CERVICAL VERTEBRAE SEGMENTATION VIA ENHANCED U2-NET
3230FUNCTIONAL KNOWLEDGE TRANSFER WITH SELF-SUPERVISED REPRESENTATION LEARNING
2297Fusing Explicit and Implicit Flow for Optical Flow Estimation
1235FUZZY-CONDITIONED DIFFUSION AND DIFFUSION PROJECTION ATTENTION APPLIED TO FACIAL IMAGE CORRECTION
1631GAITMM: MULTI-GRANULARITY MOTION SEQUENCE LEARNING FOR GAIT RECOGNITION
2699Generalizable Embeddings with Cross-batch Metric Learning
1459GENERALIZED PSEUDO-LABELING IN CONSISTENCY REGULARIZATION FOR SEMI-SUPERVISED LEARNING
3273GEOMETRIC MAGNIFICATION-BASED ATTENTION GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED MICRO-GESTURE RECOGNITION
1807GEOMETRIC PRIOR-ASSISTED FEATURE PRESENTATION ENHANCEMENT FOR OBJECT DETECTION IN AERIAL IMAGES
1968GEOMETRY-AWARE VIDEO QUALITY ASSESSMENT FOR DYNAMIC DIGITAL HUMAN
3316GLOBAL BALANCED NETWORKS FOR MULTI-VIEW STEREO
1833GLOBAL-LOCAL AWARENESS NETWORK FOR IMAGE SUPER-RESOLUTION
2428GMML is All you Need
3036GNP ATTACK: TRANSFERABLE ADVERSARIAL EXAMPLES VIA GRADIENT NORM PENALTY
1422GPCGC: A GREEN POINT CLOUD GEOMETRY CODING METHOD
3057GRAD-FEC: UNEQUAL LOSS PROTECTION OF DEEP FEATURES IN COLLABORATIVE INTELLIGENCE
1677GRAPHRPE: RELATIVE POSITION ENCODING GRAPH TRANSFORMER FOR 3D HUMAN POSE ESTIMATION
1591GRID-TRANSFORMER FOR FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION
2732Group Masked Model Learning for General Audio Representation
3350GS-NET: GLOBAL SELF-ATTENTION GUIDED CNN FOR MULTI-STAGE GLAUCOMA CLASSIFICATION
1060HALF OF AN IMAGE IS ENOUGH FOR QUALITY ASSESSMENT
2705HANDS IN FOCUS: SIGN LANGUAGE RECOGNITION VIA TOP-DOWN ATTENTION
3385HARD SAMPLES BASED MARGIN LOSS FOR FACE VERIFICATION
2371HDR-LMDA: A LOCAL AREA-BASED MIXED DATA AUGMENTATION METHOD FOR HDR VIDEO RECONSTRUCTION
2465HDTC: Hybrid Model OF DUAL-TRANSFORMER AND CONVOLUTIONAL NEURAL NETWORK FROM RGB-D FOR DETECTION OF LETTUCE GROWTH TRAITS
2667HER2-SISH HISTOPATHOLOGY IMAGE CLASSIFICATION USING DEEP NEURAL NETWORKS
2181HETEROGENEOUS IMAGE CHANGE DETECTION BASED ON DEEP IMAGE TRANSLATION AND FEATURE REFINEMENT-AGGREGATION
2861HIERARCHICAL ARITHMETIC CODING OF DISPLACEMENTS FOR DYNAMIC MESH COMPRESSION
2282HIERARCHICAL CONDITIONAL SEMI-PAIRED IMAGE-TO-IMAGE TRANSLATION FOR MULTI-TASK IMAGE DEFECT CORRECTION ON SHOPPING WEBSITES
2475HIERARCHICAL FEATURE FUSION TRANSFORMER FOR NO-REFERENCE IMAGE QUALITY ASSESSMENT
3164HIERARCHICAL MULTI-TASK LEARNING VIA TASK AFFINITY GROUPINGS
2584HIERARCHICAL TERRAIN ATTENTION AND MULTI-SCALE RAINFALL GUIDANCE FOR FLOOD IMAGE PREDICTION
1232HIGH DYNAMIC RANGE IMAGE TONE MAPPING BASED ON LAYER DECOMPOSITION AND IMAGE FUSION
2368HIGH DYNAMIC RANGE IMAGING WITH MULTI-EXPOSURE BINNING ON QUAD BAYER COLOR FILTER ARRAY
2045HIGH-ACCURACY GESTURE RECOGNITION USING MM-WAVE RADAR BASED ON CONVOLUTIONAL BLOCK ATTENTION MODULE
2031HIGH-PRECISION MOTION VECTOR REFINEMENT FOR BI-DIRECTIONAL OPTICAL FLOW
2774HIGH-THROUGHPUT AND MULTIPLIERLESS HARDWARE DESIGN FOR THE AV1 LOCAL WARPED MC INTERPOLATION
1817HINTING PIPELINE AND MULTIVARIATE REGRESSION CNN FOR MAIZE KERNEL COUNTING ON THE EAR
1905Hi-Res ACG: Towards High-Resolution Anime Characters Generation
2518HM-PCGC: A HUMAN-MACHINE BALANCED POINT CLOUD GEOMETRY COMPRESSION SCHEME
1249HOKEM: HUMAN AND OBJECT KEYPOINT-BASED EXTENSION MODULE FOR HUMAN-OBJECT INTERACTION DETECTION
1391HQRetouch: Learning Professional Face Retouching via Masked Feature Fusion and Semantic-Aware Modulation
3075HRFNET: HIGH-RESOLUTION FORGERY NETWORK FOR LOCALIZING SATELLITE IMAGE MANIPULATION
3368HUMAN-INTERPRETABLE AND DEEP FEATURES FOR IMAGE PRIVACY CLASSIFICATION
3456Hybrid Contrastive Prototypical Network for Few-Shot Scene Classification
1803ICCL: SELF-SUPERVISED INTRA- AND CROSS-MODAL CONTRASTIVE LEARNING WITH 2D-3D PAIRS FOR 3D SCENE UNDERSTANDING
3487ICIP 2023 CHALLENGE: FULL-REFERENCE AND NON-REFERENCE POINT CLOUD QUALITY ASSESSMENT METHODS WITH SUPPORT VECTOR REGRESSION
3491IEEE ICIP 2023 CHALLENGE ON THE AUTOMATIC DETECTION OF MOSQUITO BREEDING GROUNDS
2608IKD+: RELIABLE LOW COMPLEXITY DEEP MODELS FOR RETINOPATHY CLASSIFICATION
2663IMAGE CODING VIA PERCEPTUALLY INSPIRED GRAPH LEARNING
2494IMAGE DEHAZING GUIDED BY LOW-PASS REINFORCED AIRLIGHT
2380IMAGE INPAINTING BY MSCSWIN TRANSFORMER ADVERSARIAL AUTOENCODER
1848IMAGE INPAINTING WITH INFORMATION LOSS REDUCTION AND TEXTURE-STRUCTURE FEATURE FUSION
2885IMAGE STITCHING BASED ON MULTI-SCALE MESHES
1932IMAGE TRANSLATION-BASED DENIABLE ENCRYPTION AGAINST MODEL EXTRACTION ATTACK
2673IMAGE-COUPLED VOLUME PROPAGATION FOR STEREO MATCHING
2037IMBALANCE-AWARE ADAPTIVE MARGIN LOSS FOR FAIR MULTI-LABEL FACE ATTRIBUTE RECOGNITION
2954Implicit Attention-based Cross-modal Collaborative Learning for Action Recognition
1181IMPOSING TOTAL VARIATION PRIOR INTO GUIDED FILTER
1609IMPROVE UNSUPERVISED DEEP HASHING VIA MASKED CONTRASTIVE LEARNING
3460Improved Bilinear Pooling With Pseudo Square-Rooted Matrix
3486IMPROVED YOLOV7 WITH TRANSFORMER PREDICTION HEAD FOR AUTOMATED DETECTION OF MOSQUITO BREEDING GROUNDS
2275IMPROVEMENT OF IMAGE SEGMENTATION MODEL FOR HANDWRITTEN NOTEBOOK ANALYTICS
3251Improving Adversarial Transferability via Feature Translation
2878Improving CNN-based Person Re-identification using score Normalization
3318IMPROVING GENERALIZATION IN FACIAL MANIPULATION DETECTION USING IMAGE NOISE RESIDUALS AND TEMPORAL FEATURES
3238IMPROVING LEARNED INVERTIBLE CODING WITH INVERTIBLE ATTENTION AND BACK-PROJECTION
1696IMPROVING NERF WITH HEIGHT DATA FOR UTILIZATION OF GIS DATA
2435IMPROVING ROBUSTNESS OF SINGLE IMAGE SUPER-RESOLUTION MODELS WITH MONTE CARLO METHOD
2189IMPROVING SPHERICAL IMAGE RESAMPLING THROUGH VIEWPORT-ADAPTIVITY
1700IMPROVING TRANSLATION INVARIANCE IN CONVOLUTIONAL NEURAL NETWORKS WITH PERIPHERAL PREDICTION PADDING
1188Improving Video Colorization by Test-Time Tuning
2799INDUCTIVE GRAPH NEURAL NETWORKS FOR MOVING OBJECT SEGMENTATION
2642INFERENCE ACCELERATION OF DEEP LEARNING CLASSIFIERS BASED ON RNN
3358INFRARED SMALL TARGET DETECTION BASED ON SALIENCY GUIDED MULTI-TASK LEARNING
2813INTEGER QUANTIZED LEARNED IMAGE COMPRESSION
2508INTELLIGENT PAINTER: PICTURE COMPOSITION WITH RESAMPLING DIFFUSION MODEL
2353INTER-FRAME CODING FOR DYNAMIC MESHES VIA TEMPORALLY-CONSISTENT RE-MESHING
2437INTERPRETABLE VISUAL QUESTION ANSWERING REFERRING TO OUTSIDE KNOWLEDGE
2687INTERPRETABLE VISUAL QUESTION ANSWERING VIA REASONING SUPERVISION
2169INTERPRETING CONVOLUTIONAL NEURAL NETWORKS BY EXPLAINING THEIR PREDICTIONS
1413INTERPRETING LATENT REPRESENTATION IN NEURAL RADIANCE FIELDS FOR MANIPULATING OBJECT SEMANTICS
3015INTER-SCALE SURE-LET IMAGE RESTORATION WITH DEEP UNROLLED IMAGE PRIOR
3142Introducing a Framework for Single-Human Tracking using Event-based Cameras
3484IR-SETNET: SPARSITY AWARE ENSEMBLE NETWORK FOR INFRARED IMAGING BASED DRONE LOCALIZATION AND TRACKING IN DISTORTED SURVEILLANCE VIDEOS
2820IT WASN'T ME: IRREGULAR IDENTITY IN DEEPFAKE VIDEOS
1240Joint Demosaicing and Denoising with Gradient Guidance in Quad Bayer CFA
1854JOINT OPTIMIZED POINT CLOUD COMPRESSION FOR 3D OBJECT DETECTION
1736Joint Probability Distribution Regression for Image Cropping
1571JOINT UNDER-SAMPLING PATTERN OPTIMIZATION AND CONTENT-BASED RECONSTRUCTION NETWORK FOR FAST MRI RECONSTRUCTION
2279JPEG COMPLIANT COMPRESSION FOR DNN VISION
1339JPEG INFORMATION REGULARIZED DEEP IMAGE PRIOR FOR DENOISING
2284JPEG PLENO LEARNING-BASED POINT CLOUD CODING: A PERFORMANCE ANALYSIS
2416JPEG PLENO LIGHT FIELD ENCODER WITH MESH BASED VIEW WARPING
1308KD-FIXMATCH: KNOWLEDGE DISTILLATION SIAMESE NEURAL NETWORKS
2115KEYPOINTS DICTIONARY LEARNING FOR FAST AND ROBUST ALIGNMENT
2610L2FUSION: LOW-LIGHT ORIENTED INFRARED AND VISIBLE IMAGE FUSION
1226LADDER SIAMESE NETWORK: A METHOD AND INSIGHTS FOR MULTI-LEVEL SELF-SUPERVISED LEARNING
2101LANGUAGE IDENTIFICATION AS IMPROVEMENT FOR LIP-BASED BIOMETRIC VISUAL SYSTEMS
2991LAPTRAN: TRANSFORMER EMBEDDING GRAPH LAPLACIAN FOR POINT CLOUD PART SEGMENTATION
2226LATENTPATCH: A NON-PARAMETRIC APPROACH FOR FACE GENERATION AND EDITING
1686LATENT-SHIFT: GRADIENT OF ENTROPY HELPS NEURAL CODECS
1152LDCFORMER: INCORPORATING LEARNABLE DESCRIPTIVE CONVOLUTION TO VISION TRANSFORMER FOR FACE ANTI-SPOOFING
1431Learn more: Sub-significant area learning for fine-grained visual classification
2897LEARNABLE SNAKE R-CNN FOR INSTANCE-LEVEL BIOMEDICAL IMAGE SEGMENTATION
2244LEARNED IMAGE COMPRESSION GUIDED ADAPTIVE QUANTIZATION FOR PERCEPTUAL QUALITY
2145Learned Image Compression with Large Capacity and Low Redundancy of Latent Representation
2734LEARNED IMAGE COMPRESSION WITH MULTI-SCAN BASED CHANNEL FUSION
1828LEARNING DISENTANGLED FEATURES FOR NERF-BASED FACE RECONSTRUCTION
2266Learning Extended Depth of Field Hyperspectral Imaging
2099LEARNING MULTI-SCALE FEATURES FOR JPEG IMAGE ARTIFACTS REMOVAL
2291LEARNING MUTUALLY IN CROWD SCENES FOR PEDESTRIAN DETECTION
2782LEARNING RAW IMAGE DENOISING USING A PARAMETRIC COLOR IMAGE MODEL
2864Learning Spatially-Adaptive Squeeze-Excitation Networks for Few Shot Image Synthesis
2035Learning Spatially-Adaptive Style-Modulation Networks for Single Image Synthesis
1618LEARNING SPATIAL-TEMPORAL EMBEDDINGS FOR SEQUENTIAL POINT CLOUD FRAME INTERPOLATION
1884LEARNING TO DRAW THROUGH A MULTI-STAGE ENVIRONMENT MODEL BASED REINFORCEMENT LEARNING
1007LEARNING TORSO PRIOR FOR CO-SPEECH GESTURE GENERATION WITH BETTER HAND SHAPE
1239LEARNING-BASED RATE CONTROL FOR LEARNING-BASED POINT CLOUD GEOMETRY CODING
2616LEARNT DEEP HYPERPARAMETER SELECTION IN ADVERSARIAL TRAINING FOR COMPRESSED VIDEO ENHANCEMENT WITH A PERCEPTUAL CRITIC
2019LEVERAGING EFFICIENT TRAINING AND FEATURE FUSION IN TRANSFORMERS FOR MULTIMODAL CLASSIFICATION
1290LEVERAGING OPTICAL FLOW FEATURES FOR HIGHER GENERALIZATION POWER IN VIDEO OBJECT SEGMENTATION
1528LEVERAGING VISUAL PROMPTS TO GUIDE LANGUAGE MODELING FOR REFERRING VIDEO OBJECT SEGMENTATION
2295LGSQE: LIGHTWEIGHT GENERATED SAMPLE QUALITY EVALUATION
2144Lightweight CNN-Based In-loop Filter for VVC Intra Coding
2990Lightweight Deep Deblurring Model with Discriminative Multi-scale Feature Fusion
3294LIGHTWEIGHT MULTI-VIEW-GROUP NEURAL NETWORK FOR 3D SHAPE CLASSIFICATION
2522Lightweight Network Towards Real-time Image Denoising on Mobile Devices
2058LITE-HRNET PLUS: FAST AND ACCURATE FACIAL LANDMARK DETECTION
2213LKBQ: PUSHING THE LIMIT OF POST-TRAINING QUANTIZATION TO EXTREME 1 BIT
3111LLA-FLOW: A LIGHTWEIGHT LOCAL AGGREGATION ON COST VOLUME FOR OPTICAL FLOW ESTIMATION
1917LLDE: ENHANCING LOW-LIGHT IMAGES WITH DIFFUSION MODEL
1863LLIEFORMER: A LOW-LIGHT IMAGE ENHANCEMENT TRANSFORMER NETWORK WITH A DEGRADED RESTORATION MODEL
1998LMPDNET: TOF-PET LIST-MODE IMAGE RECONSTRUCTION USING MODEL-BASED DEEP LEARNING METHOD
1927LOCAL CONTEXT AND DIMENSIONAL RELATION AWARE TRANSFORMER NETWORK FOR CONTINUOUS AFFECT ESTIMATION
2398LOCAL TEXTURE COMPLEXITY GUIDED ADVERSARIAL ATTACK
2570LOCAL-AWARE INTRA TEMPLATE MATCHING PREDICTION
1051LOCAL-GLOBAL CONTRAST FOR LEARNING VOICE-FACE REPRESENTATIONS
2607LOCALLY ACCUMULATED ADAM FOR DISTRIBUTED TRAINING WITH SPARSE UPDATES
2366LONG-TAILED FEDERATED LEARNING VIA AGGREGATED META MAPPING
3380LOSSY LIDAR POINT CLOUD COMPRESSION VIA CYLINDRICAL 3D CONVOLUTION NETWORKS
1889LOW LIGHT RGB AND IR IMAGE FUSION WITH SELECTIVE CNN-TRANSFORMER NETWORK
2411LOW-SAMPLING-FREQUENCY PLANE WAVE MEDICAL ULTRASOUND IMAGING BASED ON ADVERSARIAL LEARNING
2333LSR: A Light-Weight Super-Resolution Method
2708LT-VIT: A VISION TRANSFORMER FOR MULTI-LABEL CHEST X-RAY CLASSIFICATION
1834LUMINANCE-PRESERVING VISIBLE AND NEAR-INFRARED IMAGE FUSION NETWORK WITH EDGE GUIDANCE
2080M3FPOLYPSEGNET: SEGMENTATION NETWORK WITH MULTI-FREQUENCY FEATURE FUSION FOR POLYP LOCALIZATION IN COLONOSCOPY IMAGES
3438MACHINE LEARNING DETECTS A BIOPSY NEEDLE IN ULTRASOUND IMAGES
2785MACHINE-ATTENTION-BASED VIDEO CODING FOR MACHINES
2447MAP-informed Unrolled Algorithms for Hyper-parameter Estimation
1815MCTE: MARRYING CONVOLUTION AND TRANSFORMER EFFICIENTLY FOR END-TO-END MEDICAL IMAGE SEGMENTATION
2539MDFD: STUDY OF DISTRIBUTED NON-IID SCENARIOS AND FRECHET DISTANCE-BASED EVALUATION
3436MEASURE4DHAND: DYNAMIC HAND MEASUREMENT EXTRACTION FROM 4D SCANS
2758MEGL: MULTI-EXPERTS GUIDED LEARNING NETWORK FOR SINGLE CAMERA TRAINING PERSON RE-IDENTIFICATION
3261MENAS: MULTI-TRIAL EVOLUTIONARY NEURAL ARCHITECTURE SEARCH WITH LOTTERY TICKETS
1250METAGRAD: ADAPTIVE GRADIENT QUANTIZATION WITH HYPERNETWORKS
2197MGT-PC: MEMORY-GUIDED TRANSFORMER FOR ROBUST POINT CLOUD CLASSIFICATION
2477Micro-Expression Recognition with Layered Relations and More Input Frames
2167MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION
2186Mitigating Dataset Bias in Image Captioning through CLIP Confounder-free Captioning Network
2438MIX-NET: AUTOMATIC SEGMENTATION OF COVID-19 CT IMAGES BASED ON PARALLEL DESIGN
2345Modality Meets Long-term Tracker: A Siamese Dual Fusion Framework for Tracking UAV
1765MODALITY-AWARE OOD SUPPRESSION USING FEATURE DISCREPANCY FOR MULTI-MODAL EMOTION RECOGNITION
1816MODEL DOCTOR FOR DIAGNOSING AND TREATING SEGMENTATION ERROR
2212Model-agnostic visual explanations via approximate bilinear models
2563Modeling and Interpreting 6-D Object Pose Estimation
2939MODELING HIERARCHICAL TOPOLOGICAL STRUCTURE IN SCIENTIFIC IMAGES WITH GRAPH NEURAL NETWORKS
2001MORE SYNERGY, LESS REDUNDANCY: EXPLOITING JOINT MUTUAL INFORMATION FOR SELF-SUPERVISED LEARNING
1659MOTION PLANE ADAPTIVE MOTION MODELING FOR SPHERICAL VIDEO CODING IN H.266/VVC
3179MQ-CODER INSPIRED ARITHMETIC CODER FOR SYNTHETIC DNA DATA STORAGE
3347MSV-RGNN: MULTISCALE VOXEL GRAPH NEURAL NETWORK FOR 3D OBJECT DETECTION
1885MTJND: MULTI-TASK DEEP LEARNING FRAMEWORK FOR IMPROVED JND PREDICTION
3078MULTI HYBRID EXTRACTOR NETWORK FOR 3D HUMAN POSE ESTIMATION
1745MULTI TASK-BASED FACIAL EXPRESSION SYNTHESIS WITH SUPERVISION LEARNING AND FEATURE DISENTANGLEMENT OF IMAGE STYLE
2322MULTI-CLASSIFICATION OF RETINAL DISEASES USING A PYRAMIDAL ENSEMBLE DEEP FRAMEWORK
3087MULTI-DIMENSIONAL PRUNED SPARSE CONVOLUTION FOR EFFICIENT 3D OBJECT DETECTION
2852MULTI-EXIT VISION TRANSFORMER WITH CUSTOM FINE-TUNING FOR FINE-GRAINED IMAGE RECOGNITION
2390MULTI-LABEL ADVERSARIAL ATTACK BASED ON LABEL CORRELATION
2418MULTILAYER ATTENTION MECHANISM FOR CHANGE DETECTION IN SAR IMAGE SPATIAL-FREQUENCY DOMAIN
2090MULTIMODAL GRAPH SIGNAL DENOISING WITH SIMULTANEOUS GRAPH LEARNING USING DEEP ALGORITHM UNROLLING
1415MULTI-MODAL HIERARCHICAL ATTENTION-BASED DENSE VIDEO CAPTIONING
1439MULTI-OBJECT TRACKING AS ATTENTION MECHANISM
1008MULTI-OBJECT TRACKING BY ITERATIVELY ASSOCIATING DETECTIONS WITH UNIFORM APPEARANCE FOR TRAWL-BASED FISHING BYCATCH MONITORING
2723MULTIPLE DESCRIPTION VIDEO CODING FOR REAL-TIME APPLICATIONS USING HEVC
2651MULTI-SCALE DEFORMABLE ALIGNMENT AND CONTENT-ADAPTIVE INFERENCE FOR FLEXIBLE-RATE BI-DIRECTIONAL VIDEO COMPRESSION
3242MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION
2225Multi-scale temporal feature fusion for few-shot action recognition
2180MULTI-SCALE TRANSFORMER NETWORK FOR SALIENCY PREDICTION ON 360-DEGREE IMAGES
2412MULTI-SEMANTIC ALIGNMENT CO-REASONING NETWORK FOR VIDEO QUESTION ANSWERING
3500Multi-Surface Multi-Technique (MUST) Latent Fingerprint Database
2986MULTI-TASK MODEL BASED ON VISION TASK LEVEL FOR SALIENCY OBJECT DETECTION IN FOGGY CONDITION
2281MULTITHREADED ALGORITHMS FOR LOSSLESS INTRA COMPRESSION OF POINT CLOUD GEOMETRY BASED ON THE SILHOUETTE 3D CODER
3496MULTIVARIATE TIME SERIES IMPUTATION WITH TRANSFORMERS
2768MULTI-VIEW 3D COMPTON IMAGE RECONSTRUCTION WITH A GENERALIZED LIST-MODE MLEM ALGORITHM
2899MULTI-VIEW VARIATIONAL RECURRENT NEURAL NETWORK FOR HUMAN EMOTION RECOGNITION USING MULTI-MODAL BIOLOGICAL SIGNALS
1256MUTUAL RELATIVE POSITION LEARNING TRANSFORMER FOR CROSS-VIEW GEO-LOCALIZATION
2932MUTUALLY SUPERVISED LEARNING VIA INTERACTIVE CONSISTENCY FOR GEOGRAPHIC OBJECT SEGMENTATION FROM WEAKLY LABELED REMOTE SENSING IMAGERY
2193NERD: NEURAL FIELD-BASED DEMOSAICKING
1190NEURAL AUGMENTED EXPOSURE INTERPOLATION FOR HDR IMAGING
1497NEURAL FIELD REAL-TIME TRANSMISSION USING MULTIPLE DESCRIPTION CODING WITH RANDOM POSITION SAMPLING
2108NEURAL GLOBAL ILLUMINATION FOR INVERSE RENDERING
2795NEV-NCD: NEGATIVE LEARNING, ENTROPY, AND VARIANCE REGULARIZATION BASED NOVEL ACTION CATEGORIES DISCOVERY
3408NIGHTTIME HAZE REMOVAL WITH SPATIALLY VARIANT AMBIENT LIGHT AND SALIENCY-WEIGHTED FUSED TRANSMISSION
2106NOISE-AVOIDANCE SAMPLING FOR ANNOTATION MISSING OBJECT DETECTION
1776NONLOCAL LOW-RANK RESIDUAL MODELING FOR IMAGE COMPRESSIVE SENSING RECONSTRUCTION
2376NOVEL ANNOTATION AND METRICS FOR MANGROVE SPECIES CLASSIFICATION USING BOUNDING BOX OBJECT DETECTION
1681NTRANS-NET: A MULTI-SCALE NEUTROSOPHIC-UNCERTAINTY GUIDED TRANSFORMER NETWORK FOR INDOOR DEPTH COMPLETION
1630NUCQ: NON-UNIFORM CONDITIONAL QUANTIZATION FOR LEARNED IMAGE COMPRESSION
2831Object Detection and Counting Challenges in Real Street Monitoring: Case Study of Homeless Encampments
1478OBJECT-CENTRIC VIDEO PREDICTION VIA DECOUPLING OF OBJECT DYNAMICS AND INTERACTIONS
2152OCVOS: Object-Centric Representation for Video Object Segmentation
2945ODD: ONE-CLASS ANOMALY DETECTION VIA THE DIFFUSION MODEL
2454OEST: OUTLIER EXPOSURE BY SIMPLE TRANSFORMATIONS FOR OUT-OF-DISTRIBUTION DETECTION
1547OMISSION-FREE INPAINTING: A THREE-STAGE APPROACH TO ENSURE OBJECT GENERATION
3461Omnidirectional Video Super-Resolution using Deep Learning
2219ONDA-DETR: ONLINE DOMAIN ADAPTATION FOR DETECTION TRANSFORMERS WITH SELF-TRAINING FRAMEWORK
3387ONLINE PEDESTRIAN TRACKING USING A DENSE FISHEYE CAMERA NETWORK WITH EDGE COMPUTING
1890OOD ATTACK: GENERATING OVERCONFIDENT OUT-OF-DISTRIBUTION EXAMPLES TO FOOL DEEP NEURAL CLASSIFIERS
1606OPEN-SET RECOGNITION FOR FACIAL-EXPRESSION RECOGNITION
3136OPTICAL CHARACTER RECOGNITION FOR MEDICAL RECORDS DIGITIZATION WITH DEEP LEARNING
1789Optimized Coded Aperture Design in Compressive Spectral Imaging via Coherence Minimization
1850OPTIMIZING TRANSFORMER FOR LARGE-HOLE IMAGE INPAINTING
2887OVERLAP LOSS: RETHINKING WEAKLY SUPERVISED INSTANCE SEGMENTATION IN CROWDED SCENES
1274PAIRWISE FEATURE LEARNING FOR UNSEEN PLANT DISEASE RECOGNITION
1881PALMPRINT ANTI-SPOOFING BASED ON DOMAIN-ADVERSARIAL TRAINING AND ONLINE TRIPLET MINING
2870PANCREATIC CANCER DETECTION USING HYPERSPECTRAL IMAGING AND MACHINE LEARNING
1874Parallel Gradient Blend for Class Incremental Learning
1898Parameter-efficient Vision Transformer with Linear Attention
3135PART AWARE GRAPH CONVOLUTION NETWORK WITH TEMPORAL ENHANCEMENT FOR SKELETON-BASED ACTION RECOGNITION
3011PARTS BASED ATTENTION FOR HIGHLY OCCLUDED PEDESTRIAN DETECTION WITH TRANSFORMERS
1289PAST INFORMATION AGGREGATION FOR MULTI-PERSON TRACKING
1653Patch-wise Auto-Encoder for Visual Anomaly Detection
3454PERCEPTION-ORIENTED OMNIDIRECTIONAL IMAGE SUPER-RESOLUTION BASED ON TRANSFORMER NETWORK
1849PFC-UNIT: UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION WITH PRE-TRAINED FINE-GRAINED CLASSIFICATION
1215PFTA-Net: Progressive Feature Alignment and Temporal Attention Fusion Networks for Video Inpainting
2129PHYSICS-INFORMED DEEP DEBLURRING: OVER-PARAMETERIZED VS. UNDER-PARAMETERIZED
1242PL-UNEXT: PER-STAGE EDGE DETAIL AND LINE FEATURE GUIDED SEGMENTATION FOR POWER LINE DETECTION
1185Point Cloud Denoising via Momentum Ascent in Gradient Fields
2724POINT CLOUD GEOMETRY AND COLOR CODING IN A LEARNING-BASED ECOSYSTEM FOR JPEG CODING STANDARDS
3144POINT CLOUD UPSAMPLING WITH DYNAMIC GRAPH SCATTERING TRANSFORM
3482POINTPCA+: EXTENDING POINTPCA OBJECTIVE QUALITY ASSESSMENT METRIC
1217POLSAR IMAGE CLASSIFICATION BASED-ON SEMI-SUPERVISED POLARIMETRIC FEATURE SELECTION
3490Positronium lifetime image reconstruction for TOF PET
3139PREDICTING MECHANICAL PROPERTIES OF CARBON NANOTUBE (CNT) IMAGES USING MULTI-LAYER SYNTHETIC FINITE ELEMENT MODEL SIMULATIONS
2854PREDICTION OF DEEP ICE LAYER THICKNESS USING ADAPTIVE RECURRENT GRAPH NEURAL NETWORKS
2840PREDICTIVE CODING FOR ANIMATION-BASED VIDEO COMPRESSION
2126PREFAB-GEN : AD HOC IMAGE GENERATION FOR PRE-MANUFACTURING OF TIRES USING IMAGE-TO-IMAGE TRANSLATION
2749PRE-TRAINING WITH FRACTAL IMAGES FACILITATES LEARNED IMAGE QUALITY ESTIMATION
1505PRNET: A PROGRESSIVE REGRESSION NETWORK FOR NO-REFERENCE USER-GENERATED-CONTENT (UGC) VIDEO QUALITY ASSESSMENT
2600PROCESSING ENERGY MODELING FOR NEURAL NETWORK BASED IMAGE COMPRESSION
2714Product Image Representation Learning on Large Scale Noisy Datasets
2968PROGRESSIVE MIXUP AUGMENTED TEACHER-STUDENT LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION
2116PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION
2929PROGRESSIVE REFINEMENT LEARNING BASED ON FEATURE INTERACTIVE FUSION FOR SEMANTIC SEGMENTATION OF REMOTE SENSING LIMITED DATASET
3128PROMPT PROTOTYPE LEARNING BASED ON RANKING INSTRUCTION FOR FEW-SHOT VISUAL TASKS
1690PSCO: A POINT CLOUD SCENE CLASSIFICATION MODEL BASED ON CONTRAST LEARNING
1320PSEUDO LABELS REFINEMENT WITH INTRA-CAMERA SIMILARITY FOR UNSUPERVISED PERSON RE-IDENTIFICATION
1037PS-NERV: PATCH-WISE STYLIZED NEURAL REPRESENTATIONS FOR VIDEOS
2726PUSHING THE LIMITS OF THE WIENER FILTER IN IMAGE DENOISING
2163PYRAMID MASKED IMAGE MODELING FOR TRANSFORMER-BASED AERIAL OBJECT DETECTION
2574PYRAMID TRANSFORMER DRIVEN MULTIBRANCH FUSION FOR POLYP SEGMENTATION IN COLONOSCOPIC VIDEO IMAGES
2122QUANTIFIABLE ROBUSTNESS ESTIMATION FOR OBJECT DETECTION WITH CNNS USING INTRINSIC DIMENSIONALITY
1472Query by Activity Video in the Wild
2023Query-based Video Summarization with Pseudo Label Supervision
1925QVRF: A QUANTIZATION-ERROR-AWARE VARIABLE RATE FRAMEWORK FOR LEARNED IMAGE COMPRESSION
2682RADAR HRRP UNSEEN CLASS RECOGNITION BASED ON THE JOINT DICTIONARY LEARNING
3494RAY-SPACE MOTION COMPENSATION FOR LENSLET PLENOPTIC VIDEO CODING
2586RDEPD: RE-EXPLORING DEPTH ESTIMATION FOR PEDESTRIAN DETECTION
2877REALIZATION OF DIGRAPH FILTERS VIA AUGMENTED GFT
3480REAL-TIME DRONE DETECTION AND TRACKING IN DISTORTED INFRARED IMAGES
1374REAL-TIME SUPERMARKET PRODUCT RECOGNITION ON MOBILE DEVICES USING SCALABLE PIPELINES
2010REAL-TIME WHEEL DETECTION AND RIM CLASSIFICATION IN AUTOMOTIVE PRODUCTION
1017REAPER: ARTICULATED OBJECT 6D POSE ESTIMATION WITH DEEP REINFORCEMENT LEARNING
2755RECOVERING QUALITY SCORES IN NOISY PAIRWISE SUBJECTIVE EXPERIMENTS USING NEGATIVE LOG-LIKELIHOOD
2837RECTANGULAR-OUTPUT IMAGE STITCHING
1691REDUCED COMPLEXITY MULTISCALE CNN FOR IN-LOOP VIDEO RESTORATION
2549Regularizing Neural Radiance Fields from Sparse RGB-D Inputs
1516REPRESENTATION LEARNING OF VERTEX HEATMAPS FOR 3D HUMAN MESH RECONSTRUCTION FROM MULTI-VIEW IMAGES
2894RESIDENTIAL EXTRACTION BASED ON WEAKLY-SUPERVISED SIMILARITY-AWARE MULTI-SOURCE ALIGNMENT STRATEGY WITH LIMITED SAR DATA
2824RESSCAL3D: RESOLUTION SCALABLE 3D SEMANTIC SEGMENTATION
2093RESTORABLE VISIBLE AND INFRARED IMAGE FUSION
1855RESTORATION OF EXTREMELY COMPRESSED BACKGROUND FOR VCM USING GUIDED GENERATIVE PRIORS
1389RETHINKING LONG-TAILED VISUAL RECOGNITION WITH DYNAMIC PROBABILITY SMOOTHING AND FREQUENCY WEIGHTED FOCUSING
2787RETINEX-BASED IMAGE DENOISING / CONTRAST ENHANCEMENT USING GRADIENT GRAPH LAPLACIAN REGULARIZER
1823RETRIEVE THE VISIBLE FEATURE TO IMPROVE THERMAL PEDESTRIAN DETECTION USING DISCREPANCY PRESERVING MEMORY NETWORK
1540REUSE NON-TERRAIN POLICIES FOR LEARNING TERRAIN-ADAPTIVE HUMANOID LOCOMOTION SKILLS
2199REVISITING MODALITY IMBALANCE IN MULTIMODAL PEDESTRIAN DETECTION
2800Revolutionizing Thermal Imaging: GAN-based Vision Transformers for Image Enhancement
2808RFID-ASSISTED VISUAL MULTIPLE OBJECT TRACKING WITHOUT USING VISUAL APPEARANCE AND MOTION
1292RINGING ARTIFACT REDUCTION METHOD FOR ULTRASOUND RECONSTRUCTION USING MULTI-AGENT CONSENSUS EQUILIBRIUM
2535ROBUST BOUNDING BOX REGRESSION FOR SMALL OBJECT DETECTION
1753ROBUST FACE ANTI-SPOOFING FRAMEWORK WITH CONVOLUTIONAL VISION TRANSFORMER
2487ROBUST FEATURE LEARNING AGAINST NOISY LABEL
1278Robust graph neural diffusion for image matching
3014ROBUST GRAPH-BASED SEGMENTATION OF NOISY POINT CLOUDS
2227ROBUST MULTISPECTRAL PEDESTRIAN DETECTION VIA SPECTRAL POSITION-FREE FEATURE MAPPING
3312ROBUST NUCLEUS CLASSIFICATION WITH ITERATIVE GRAPH REPRESENTATIONAL LEARNING
1451Robust RGB-T tracking via consistency regulated scene perception
1760ROBUST WIND TURBINE BLADE SEGMENTATION FROM RGB IMAGES IN THE WILD
1680ROTATION XGBOOST BASED METHOD FOR HYPERSPECTRAL IMAGE CLASSIFICATION WITH LIMITED TRAINING SAMPLES
2706RSFDM-NET: REAL-TIME SPATIAL AND FREQUENCY DOMAINS MODULATION NETWORK FOR UNDERWATER IMAGE ENHANCEMENT
2393SANDWICHED VIDEO COMPRESSION: EFFICIENTLY EXTENDING THE REACH OF STANDARD CODECS WITH NEURAL WRAPPERS
3069SAR TARGET EXTRACTION BASED ON SALIENCY-GUIDED CROSS-DOMAIN DISCREPANCY ALIGNMENT STRATEGY
3223SATPLATE: A GERMANY LICENSE PLATE DETECTION DATASET AND BASELINES
3253SCAPEGOAT GENERATION FOR PRIVACY PROTECTION FROM DEEPFAKE
1204SCENE FLOW ESTIMATION FROM POINT CLOUDS WITH CONTRASTIVE LOSS AND DUAL PSEUDO LABELS
1507SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES
1470SCENE TEXT SEGMENTATION BY PAIRED DATA SYNTHESIS
1131SCORE-BASED DIFFUSION MODELS FOR BAYESIAN IMAGE RECONSTRUCTION
2176SCRATCHHOI: TRAINING HUMAN-OBJECT INTERACTION DETECTORS FROM SCRATCH
2205SDAT-FORMER: FOGGY SCENE SEMANTIC SEGMENTATION VIA A STRONG DOMAIN ADAPTATION TEACHER
2300SDWD: STYLE DIVERSITY WEIGHTED DISTANCE EVALUATES THE INTRA-CLASS DATA DIVERSITY OF DISTRIBUTED GANS
2671SEGMENTATION AND CLASSIFICATION-BASED DIAGNOSIS OF TUMORS FROM BREAST ULTRASOUND IMAGES USING MULTIBRANCH UNET
1869Segmentation of the Left Ventricle by SDD double threshold selection and CHT
1864SEGMENTATION OF THE LEFT VENTRICLE FOR THE CARDIAC PHASES BETWEEN END-DIASTOLE AND END-SYSTOLE
2646SELECTING A DIVERSE SET OF AESTHETICALLY-PLEASING AND REPRESENTATIVE VIDEO THUMBNAILS USING REINFORCEMENT LEARNING
2524SELF ADAPTIVE GLOBAL-LOCAL FEATURE ENHANCEMENT FOR RADIOLOGY REPORT GENERATION
2756SELF PATCH LABELING USING QUALITY DISTRIBUTION ESTIMATION FOR CNN-BASED 360-IQA TRAINING
1928SELF-COMPENSATING LEARNING FOR FEW-SHOT SEGMENTATION
2987Self-enhanced training framework for referring expression grounding
1511SELF-REINFORCING FOR FEW-SHOT MEDICAL IMAGE SEGMENTATION
1058SELF-SUPERVISED 3D SKELETON REPRESENTATION LEARNING WITH ACTIVE SAMPLING AND ADAPTIVE RELABELING FOR ACTION RECOGNITION
1742SELF-SUPERVISED CONTRASTIVE LEARNING FOR AUDIO-VISUAL ACTION RECOGNITION
3224SELF-SUPERVISED DENOISING OF OPTICAL COHERENCE TOMOGRAPHY WITH INTER-FRAME REPRESENTATION
2529SELF-SUPERVISED FOCUS MEASURE FUSING FOR DEPTH ESTIMATION FROM COMPUTER-GENERATED HOLOGRAMS
1626Self-supervised Learning for Context-independent DfD Network using Multi-view Rank Supervision
2764SELF-SUPERVISED LEARNING FOR SCANNED HALFTONE CLASSIFICATION WITH NOVEL AUGMENTATION TECHNIQUES
1016SEMANTIC AND INSTANCE-AWARE PIXEL-ADAPTIVE CONVOLUTION FOR PANOPTIC SEGMENTATION
1805SEMANTIC CIRCLE DETECTION AND CIRCLE-INNER SEGMENTATION FOR TREE-WISE CITRUS SUMMER SHOOT MANAGEMENT IN AERIAL IMAGES
1664SEMANTIC LEARNING NETWORK FOR CONTROLLABLE VIDEO CAPTIONING
2869SEMANTIC MAPPING OF INCREMENTAL 3D POINT CLOUDS BASED ON MULTI-HOP GRAPH ATTENTION NETWORK
3255Semantic Scene Completion with Point Cloud Representation and Transformer-based Feature Fusion
2577SEMANTIC-EMBEDDED KNOWLEDGE ACQUISITION AND REASONING FOR IMAGE SEGMENTATION
1346SEM-CS: SEMANTIC CLIPSTYLER FOR TEXT-BASED IMAGE STYLE TRANSFER
1642SEM-FCNET: SEMANTIC FEATURE ENHANCEMENT AND FULLY CONVOLUTIONAL NETWORK MODEL FOR REMOTE SENSING OBJECT DETECTION
1018SEMI-SUPERVISED CONTRASTIVE LEARNING OF GLOBAL AND LOCAL REPRESENTATION FOR 3D MEDICAL IMAGE SEGMENTATION
2084SEMI-SUPERVISED FEW-SHOT SEGMENTATION WITH NOISY SUPPORT IMAGES
1727SGSR: A Saliency-Guided Image Super-Resolution Network
2237SIAMCLIM: TEXT-BASED PEDESTRIAN SEARCH VIA MULTI-MODAL SIAMESE CONTRASTIVE LEARNING
1161Siamese Network Representation for Active Learning
3474SIMPLE BASELINES FOR PROJECTION-BASED FULL-REFERENCE AND NO-REFERENCE POINT CLOUD QUALITY ASSESSMENT
1612SIMPLE SELF-DISTILLATION LEARNING FOR NOISY IMAGE CLASSIFICATION
1570SIMULTANEOUS WATERMARKING AND DRACO 3D OBJECT COMPRESSION METHOD
3426SINGLE IMAGE LDR TO HDR CONVERSION USING CONDITIONAL DIFFUSION
2567SINGLE-DOMAIN GENERALIZATION FOR SEMANTIC SEGMENTATION VIA DUAL-LEVEL DOMAIN AUGMENTATION
1111SINGLE-IMAGE HDR RECONSTRUCTION BASED ON TWO-STAGE GAN STRUCTURE
1821SINGLE-STAGE HEAVY-TAILED FOOD CLASSIFICATION
3159SKELETON ACTION RECOGNITION BASED ON SPATIO-TEMPORAL FEATURES
1610SKETCHFFUSION: SKETCH-GUIDED IMAGE EDITING WITH DIFFUSION MODEL
2802SMOOTH AND STEPWISE SELF-DISTILLATION FOR OBJECT DETECTION
2036SOFT-INTROVAE FOR CONTINUOUS LATENT SPACE IMAGE SUPER-RESOLUTION
3442SPATIAL-FREQUENCY NETWORK FOR THE SEGMENTATION OF REMOTE SENSING IMAGES
2159SPATIALLY-ADAPTIVE LEARNING-BASED IMAGE COMPRESSION WITH HIERARCHICAL MULTI-SCALE LATENT SPACES
2541SPATIAL-TEMPORAL TRANSFORMER NETWORK FOR HUMAN MOCAP DATA RECOVERY
2007SPATIO-TEMPORAL PERCEPTION-DISTORTION TRADE-OFF IN LEARNED VIDEO SR
3103SPECTRAL GROUPING DRIVEN HYPERSPECTRAL SUPER-RESOLUTION
1716SPIKING GLOM: BIO-INSPIRED ARCHITECTURE FOR NEXT-GENERATION OBJECT RECOGNITION
2786STAGE OF DECAY ESTIMATION EXPLOITING EXOGENOUS AND ENDOGENOUS IMAGE ATTRIBUTES TO MINIMIZE MANUAL LABELING EFFORTS AND MAXIMIZE CLASSIFICATION PERFORMANCE
3324STANet: Spatiotemporal Adaptive Network For Remote Sensing Images
1767ST-MFNET MINI: KNOWLEDGE DISTILLATION-DRIVEN FRAME INTERPOLATION
2083STRENGTHENING DEEP LEARNING MODEL FOR ROBUST SCREENING OF VOLUMETRIC CHEST RADIOGRAPHIC SCANS
2402STRUCTURE-AWARE GENERATIVE ADVERSARIAL NETWORK FOR TEXT-TO-IMAGE GENERATION
1822STYLE TRANSFER BETWEEN MICROSCOPY AND MAGNETIC RESONANCE IMAGING VIA GENERATIVE ADVERSARIAL NETWORK IN SMALL SAMPLE SIZE SETTINGS
2735SUBJECTIVE ASSESSMENT OF THE IMPACT OF A CONTENT ADAPTIVE OPTIMISER FOR COMPRESSING 4K HDR CONTENT WITH AV1
2948SUBJECTIVE QUALITY ASSESSMENT OF ENHANCED RETINAL IMAGES
2148SUPER-RESOLUTION OF BVOC MAPS BY ADAPTING DEEP LEARNING METHODS
1234SWINAT-UNET: A NEW BACKBONE FOR PRECIPITATION NOWCASTING
2161TAMM: A TASK-ADAPTIVE MULTI-MODAL FUSION NETWORK FOR FACIAL-RELATED HEALTH ASSESSMENTS ON 3D FACIAL IMAGES
2198TAQ: TOP-K ATTENTION-AWARE QUANTIZATION FOR VISION TRANSFORMERS
1083TARGET-DISCRIMINABILITY-INDUCED MULTI-SOURCE-FREE DOMAIN ADAPTATION
3056TASK-ADAPTIVE FEATURE MATCHING LOSS FOR IMAGE DEBLURRING
1034TASK-AGNOSTIC OPEN-SET PROTOTYPE FOR FEW-SHOT OPEN-SET RECOGNITION
1437TASK-AWARE GRAPH CONVOLUTIONAL NETWORK FOR ACTIVE LEARNING
3365TEACHER-STUDENT NETWORK FOR REAL-WORLD FACE SUPER-RESOLUTION WITH PROGRESSIVE EMBEDDING OF EDGE INFORMATION
1406TEAM DETR: GUIDE QUERIES AS A PROFESSIONAL TEAM IN DETECTION TRANSFORMERS
1455Tell Your Story: Text-Driven Face Video Synthesis With High Diversity via Adversarial Learning
1319TEXT-GUIDED FACIAL IMAGE MANIPULATION FOR WILD IMAGES VIA MANIPULATION DIRECTION-BASED LOSS
3266THE ELLIPTIC ENERGY LOSS FOR ROTATED OBJECT DETECTION IN AERIAL IMAGES
2846THE FIRST COMPREHENSIVE DATASET WITH MULTIPLE DISTORTION TYPES FOR VISUAL JUST-NOTICEABLE DIFFERENCES
3471THE FIRST PLACE SOLUTION FOR ICIP2023 CHALLENGE INFRARED IMAGING-BASED DRONE DETECTION AND TRACKING IN DISTORTED SURVEILLANCE VIDEOS
1829THE MULTIVARIATE TRANSFORMER NETWORK FOR MILD COGNITIVE IMPAIRMENT IDENTIFICATION
2064The Oil and Water Separation Phenomenon Inspired Loss for Feature Learning
2647THERMAL INFRARED GUIDED COLOR IMAGE DEHAZING
1764Token-consistent Dropout for Calibrated Vision Transformers
1399Towards Modeling 3D Dense Shape Correspondence from Category-Specific Multi-View Images
1786TOWARDS QUERY EFFICIENT AND GENERALIZABLE BLACK-BOX FACE RECONSTRUCTION ATTACK
2025Towards Robustness: Enhancing Deep Learning Models through Meta-Learning and Bilevel Optimization for Accurate Car Damage Classification
3271TP-YOLO: A Lightweight Attention-based Architecture for Tiny Pest Detection
1251TR3D: TOWARDS REAL-TIME INDOOR 3D OBJECT DETECTION
3489TRACKING AIDED DRONE BIRD CLASSIFICATION USING YOLO AND LSTM
3085TRAINING CARTOONIZATION NETWORK WITHOUT CARTOON
1737TRAINING-FREE LOCATION-AWARE TEXT-TO-IMAGE SYNTHESIS
1410TRANSBUILDING: AN END-TO-END POLYGONAL BUILDING EXTRACTION WITH TRANSFORMERS
1220TRANSFORMATION CONSISTENCY FOR REMOTE SENSING IMAGE SUPER-RESOLUTION
2921TRANSFORMER-BASED VARIABLE-RATE IMAGE COMPRESSION WITH REGION-OF-INTEREST CONTROL
1541TRANSFORMING MULTIDIMENSIONAL DATA INTO IMAGES TO OVERCOME THE CURSE OF DIMENSIONALITY
1684TRANSPOINTFLOW: LEARNING SCENE FLOW FROM POINT CLOUDS WITH TRANSFORMER
3004TRG-DQA: TEXTURE RESIDUAL-GUIDED DEHAZED IMAGE QUALITY ASSESSMENT
2623TRICKVOS: A BAG OF TRICKS FOR VIDEO OBJECT SEGMENTATION
2250Truncated Weighted Nuclear Norm Regularization and Sparsity for Image Denoising
2081TSANET: TEMPORAL AND SCALE ALIGNMENT FOR UNSUPERVISED VIDEO OBJECT SEGMENTATION
3147TSFC: TEXTURE AND STRUCTURE FEATURES COUPLING FOR IMAGE INPAINTING
2414ULCOMPRESS: A UNIFIED LOW BIT-RATE IMAGE COMPRESSION FRAMEWORK VIA INVERTIBLE IMAGE REPRESENTATION
2636UNCERTAINTY AWARE IMPLICIT IMAGE FUNCTION FOR ARBITRARY-SCALE SUPER-RESOLUTION
1911UNDERWATER IMAGE ENHANCEMENT AND SUPER-RESOLUTION USING IMPLICIT NEURAL NETWORKS
1231UNIFIED LEARNING-BASED LOSSY AND LOSSLESS JPEG RECOMPRESSION
1361Unknown Class Feature Transformation for Open Set Domain Adaptation without Source Data
2791UNROLLED IPPG: VIDEO HEART RATE ESTIMATION VIA UNROLLING PROXIMAL GRADIENT DESCENT
2134UNSUPERVISED ANOMALY DETECTION USING VARIATIONAL AUTOENCODER WITH GAUSSIAN RANDOM FIELD PRIOR
1794UNSUPERVISED ANOMALY DETECTION WITH LOCAL-SENSITIVE VQVAE AND GLOBAL-SENSITIVE TRANSFORMERS
2527UNSUPERVISED DEEP HASHING WITH DEEP SEMANTIC DISTILLATION
3370UNSUPERVISED DOMAIN ADAPTATION WITH IMBALANCED CHARACTER DISTRIBUTION FOR SCENE TEXT RECOGNITION
2985UNSUPERVISED DOMAIN ADAPTIVE LEARNING FOR IMAGE DESNOWING WITH REAL-WORLD DATA
1636UNSUPERVISED DOMAIN ADAPTIVE PERSON RE-IDENTIFICATION WITH ADAPTIVE STRUCTURE LEARNING
2721USGG: UNION MESSAGE BASED SCENE GRAPH GENERATION
3182Using classifier discrepancy for cross-domain image retrieval
2441USURP: UNIVERSAL SINGLE-SOURCE ADVERSARIAL PERTURBATIONS ON MULTIMODAL EMOTION RECOGNITION
2346UT-GAN: A NOVEL UNPAIRED TEXTUAL-ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR LOW-LIGHT TEXT IMAGE ENHANCEMENT
3444UTILIZING SUPER-RESOLUTION FOR ENHANCED AUTOMOTIVE RADAR OBJECT DETECTION
3449VARIATIONAL DEEP ATMOSPHERIC TURBULENCE CORRECTION FOR VIDEO
2866VARIATIONAL FEATURE DISENTANGLEMENT FOR FEW-SHOT DOMAIN ADAPTATION
1096Video Question Answering using Clip-guided Visual-text Attention
1173VIDEO SUMMARIZATION THROUGH FINE-GRAINED HIERARCHICAL MODELING WITH MULTI-DIMENSIONAL FEATURES
2915VIDEO SUPER-RESOLUTION VIA EVENT-DRIVEN TEMPORAL ALIGNMENT
2359VIDEO-MUSIC RETRIEVAL WITH FINE-GRAINED CROSS-MODAL ALIGNMENT
2650VIDEO-SWINUNET: SPATIO-TEMPORAL DEEP LEARNING FRAMEWORK FOR VFSS INSTANCE SEGMENTATION
1717VISUAL AND SPATIAL CONTEXT FUSION FOR IMPLICIT HUMAN RECONSTRUCTION
1902VIVA: A Variational Image Vectorization Algorithm on Dual-Primal Graph Pairs
2615WAVELET-BASED FREQUENCY-DIVIDING INTERACTIVE CNN FOR IMAGE CLASSIFICATION
2856WCANET: WAVELET CHANNEL ATTENTION NETWORK FOR CITRUS VARIETY IDENTIFICATION
3006WEAKLY SEMI-SUPERVISED ORIENTED OBJECT DETECTION WITH POINTS
2585WEAKLY SUPERVISED DISENTANGLEMENT WITH TRIPLET NETWORK
1759WEIGHTED ANISOTROPIC -- ISOTROPIC TOTAL VARIATION FOR POISSON DENOISING
3235WHAT MODALITY MATTERS? EXPLOITING HIGHLY RELEVANT FEATURES FOR VIDEO ADVERTISEMENT INSERTION
1203WHEN VISIBLE-TO-THERMAL FACIAL GAN BEATS CONDITIONAL DIFFUSION
2796XI-NET: TRANSFORMER BASED SEISMIC WAVEFORM RECONSTRUCTOR
1674X-Ray spectral estimation using Dictionary Learning
3062YOLO-MAXVOD FOR REAL-TIME VIDEO OBJECT DETECTION
3488YOLOV7 FOR MOSQUITO BREEDING GROUNDS DETECTION AND TRACKING
1671YOU ONLY NEED 80K PARAMETERS TO ENHANCE IMAGE: LEARNING PERIODIC FEATURES FOR IMAGE ENHANCEMENT
2343Zero-shot Human-Object Interaction (HOI) Classification by Bridging Generative and Contrastive Image-Language Models
1936ZERO-SHOT HYPERSPECTRAL IMAGE DENOISING WITH SELF-COMPLETION WITH PATTERNED MASKS
2751ZREC: ROBUST RECOVERY OF MEAN AND PERCENTILE OPINION SCORES