List of Accepted Papers
The following is the list of accepted ICIP 2023 papers, sorted by paper title. Use your web browser's search feature to find your paper number. Notifications have also been sent to all authors by email. If you have not received your notification, please contact us at icip2023@cmsworkshops.com.
Paper Number | Paper Title |
---|---|
2839 | 3D brain registration with intensity shift robustness |
3395 | 3D Face Reconstruction based on Weakly-Supervised Learning Morphable Face Model |
2700 | 3D Facial Expression Generator Based on Transformer VAE |
2514 | 3D HIPPOCAMPUS SEGMENTATION USING A HOG BASED LOSS FUNCTION WITH MAJORITY POOLING |
1720 | 3D Human Motion Prediction via Activity-driven Attention-MLP Association |
2827 | 3D Unsupervised Region-Aware Registration Transformer |
2875 | 3D-CSL: SELF-SUPERVISED 3D CONTEXT SIMILARITY LEARNING FOR NEAR-DUPLICATE VIDEO RETRIEVAL |
3110 | 3D-DDA: 3D Dual-Domain Attention For Brain Tumor Segmentation |
2307 | 3M3D: Multi-view, Multi-path, Multi-representation for 3D Object Detection |
2640 | A 3D Label Stereo Matching Method Using Underwater Energy Function |
3130 | A BASELINE ON CONTINUAL LEARNING METHODS FOR VIDEO ACTION RECOGNITION |
2576 | A CAM-enhancing Generative Person Re-ID Method based Global and Local Features |
2206 | A CONTRARIO DETECTION OF H.264 VIDEO DOUBLE COMPRESSION |
1866 | A CONTRASTIVE LEARNING APPROACH FOR SCREENSHOT DEMOIRÉING |
2059 | A CONVERGENT NEURAL NETWORK FOR NON-BLIND IMAGE DEBLURRING |
1099 | A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness |
2195 | A DECOUPLED SPATIAL-CHANNEL INVERTED BOTTLENECK FOR IMAGE COMPRESSION |
2770 | A DIFFERENTIABLE GAUSSIAN PROTOTYPE LAYER FOR EXPLAINABLE FRUIT SEGMENTATION |
2377 | A FEATURE REFINEMENT MODULE FOR LIGHT-WEIGHT SEMANTIC SEGMENTATION NETWORK |
2613 | A GLOBAL-LOCAL CONTRASTIVE LEARNING FRAMEWORK FOR VIDEO CAPTIONING |
1495 | A Joint Model-Driven Unfolding Network For Degraded Low-Quality Color-Depth Images Enhancement |
1616 | A Key Feature-Enhanced Network for Remote Sensing Object Detection |
1929 | A LARGE SCALE MULTI-VIEW RGBD VISUAL AFFORDANCE LEARNING DATASET |
2873 | A LIGHTWEIGHT HYBRID REPRESENTATION FOR VIRTUAL COMPLEX SCENES |
3269 | A Multichannel Localization Method for Camouflaged Object Detection |
2480 | A MULTI-MODAL TRANSFORMER APPROACH FOR FOOTBALL EVENT CLASSIFICATION |
3463 | A Multiscale Approach to Deep Blind Image Quality Assessment |
1166 | A MULTI-SCALE CELL SEGMENTATION METHOD FOR DETECTING HEMATOLOGICAL DISORDERS |
1180 | A MULTISCALE RESIDUAL SOLVER FOR TOTAL VARIATION MODELS |
1959 | A MULTI-STREAM NETWORK FOR MESH DENOISING VIA GRAPH NEURAL NETWORKS WITH GAUSSIAN CURVATURE |
1036 | A NO-REFERENCE QUALITY ASSESSMENT METHOD FOR DIGITAL HUMAN HEAD |
2738 | A NOVEL CLASS ACTIVATION MAP FOR VISUAL EXPLANATIONS IN MULTI-OBJECT SCENES |
1510 | A NOVEL PSEUDO-LABEL GENERATION METHOD FOR SEMI-SUPERVISED SAR TARGET RECOGNITION BASED ON DEEP LEARNING |
1875 | A NOVEL SELECTIVE ENCRYPTION SCHEME FOR H.266/VVC VIDEO |
3451 | A NOVEL WEAKLY SUPERVISED SEGMENTATION APPROACH FOR RAPID LEFT VENTRICLE ANNOTATION |
2773 | A PENALIZED MODIFIED HUBER REGULARIZATION TO IMPROVE ADVERSARIAL ROBUSTNESS |
1793 | A Privacy-Preserving Approach for Multi-Source Domain Adaptive Object Detection |
2536 | A PROBABILITY-BASED ALL-ZERO BLOCK EARLY TERMINATION ALGORITHM FOR QSHVC |
1578 | A SALIENCY-AWARE METHOD FOR ARBITRARY STYLE TRANSFER |
2484 | A Semi-Paired Approach For Label-to-Image Translation |
3098 | A SHALLOW U-NET WITH SPLIT-FUSED ATTENTION MECHANISM FOR RETINAL VESSEL SEGMENTATION |
2082 | A STRUCTURE-FUSION NETWORK FOR MEDICAL IMAGE CLASSIFICATION |
1127 | A TOPOLOGY BASED DENOISING APPROACH FOR 2D SCALAR FIELDS |
1236 | A two-dimensional difference histogram equalization with fuzzy cumulative distribution correction for dark images |
1504 | A Unified Framework for Static and Dynamic Functional Connectivity Augmentation for Multi-Domain Brain Disorder Classification |
2350 | A VISIBLE AND INFRARED IMAGE FUSION FRAMEWORK BASED ON DUAL-PATH ENCODER-DECODER AND MULTI-SCALE DISCRETE WAVELET TRANSFORM |
2313 | AAFACE: ATTRIBUTE-AWARE ATTENTIONAL NETWORK FOR FACE RECOGNITION |
1555 | ABNORMAL-AWARE LOSS AND FULL DISTILLATION FOR UNSUPERVISED ANOMALY DETECTION BASED ON KNOWLEDGE DISTILLATION |
2811 | ACCURATE REGISTRATION BETWEEN ULTRA-WIDE-FIELD AND NARROW ANGLE RETINA IMAGES WITH 3D EYEBALL SHAPE OPTIMIZATION |
3049 | ACCURATE SEGMENTATION FOR PATHOLOGICAL LUNG BASED ON INTEGRATION OF 3D APPEARANCE AND SURFACE MODELS |
1587 | ACCURATE SINGLE-IMAGE DEFOCUS DEBLURRING BASED ON IMPROVED INTEGRATION WITH DEFOCUS MAP ESTIMATION |
2142 | ACTION ANTICIPATION WITH GOAL CONSISTENCY |
3481 | Activating Frequency and ViT for 3D Point Cloud Quality Assessment without Reference |
1291 | ADAPTIVE ANCHOR LABEL PROPAGATION FOR TRANSDUCTIVE FEW-SHOT LEARNING |
1408 | ADAPTIVE AND ROBUST MMWAVE-BASED 3D HUMAN MESH ESTIMATION FOR DIVERSE POSES |
2146 | ADAPTIVE CAMOUFLAGE PATTERN GENERATION TO DIFFERENT ENVIRONMENTS VIA CONTENT-AWARE STYLE TRANSFER |
2003 | ADAPTIVE GRAPH CONVOLUTION MODULE FOR SALIENT OBJECT DETECTION |
1388 | ADAPTIVE SEMI-SUPERVISED MIXUP WITH IMPLICIT LABEL LEARNING AND SAMPLE RATIO BALANCING |
1341 | ADA-VIT: ATTENTION-GUIDED DATA AUGMENTATION FOR VISION TRANSFORMERS |
1895 | ADDING DISTANCE INFORMATION TO SELF-SUPERVISED LEARNING FOR RICH REPRESENTATIONS |
1221 | ADFA: Attention-augmented Differentiable top-k Feature Adaptation for unsupervised medical anomaly detection |
1378 | Adopting Self-supervised Learning into Unsupervised Video Summarization through Restorative score |
2908 | ADVANCING THE RATE-DISTORTION-COMPUTATION FRONTIER FOR NEURAL IMAGE COMPRESSION |
1965 | ADVERSARIAL DEFECT SYNTHESIS FOR INDUSTRIAL PRODUCTS IN LOW DATA REGIME |
2907 | Adversarial Defense via Perturbation-Disentanglement in Hyperspectral Image Classification |
2183 | ADVERSARIAL EXAMPLE DETECTION BAYESIAN GAME |
1144 | AFNET-M: ADAPTIVE FUSION NETWORK WITH MASKS FOR 2D+3D FACIAL EXPRESSION RECOGNITION |
1159 | AICT: AN ADAPTIVE IMAGE COMPRESSION TRANSFORMER |
2815 | All-intra rate control using low complexity video features for Versatile Video Coding |
2784 | AN ADJUSTABLE FAST DECISION METHOD FOR AFFINE MOTION ESTIMATION IN VVC |
2331 | AN ALTERNATIVE TO BILINEAR AND NEAREST-NEIGHBOUR ENLARGING FOR MONITOR DISPLAYS |
2834 | AN AUTOMATIC COLORECTAL POLYPS DETECTION APPROACH FOR CT COLONOGRAPHY |
2263 | AN EFFICIENT DEEP UNROLLING SUPER-RESOLUTION NETWORK FOR LIDAR AUTOMOTIVE SCENES |
1316 | An Efficient Deep Video Model for Deepfake Detection |
3340 | AN ENHANCED NEURON ATTRIBUTION-BASED ATTACK VIA PIXEL DROPPING |
1237 | AN IMPROVED UPPER BOUND ON THE RATE-DISTORTION FUNCTION OF IMAGES |
2731 | An Inter-observer consistent deep adversarial training for visual scanpath prediction |
2287 | AN L2-NORMALIZED SPATIAL ATTENTION NETWORK FOR ACCURATE AND FAST CLASSIFICATION OF BRAIN TUMORS IN 2D T1-WEIGHTED CE-MRI IMAGES |
2038 | ARBITRARY POINT CLOUD UPSAMPLING VIA DUAL BACK-PROJECTION NETWORK |
2472 | ArtiFact: A Large-Scale Dataset with Artificial and Factual Images for Generalizable and Robust Synthetic Image Detection |
3097 | ASVFI: Audio-driven Speaker Video Frame Interpolation |
1285 | ASYMMETRIC SCALABLE CROSS-MODAL HASHING |
1891 | ATTEN-ADAPTER: A UNIFIED ATTENTION-BASED ADAPTER FOR EFFICIENT TUNING |
2659 | ATTENTION-GUIDED CONTRASTIVE MASKED IMAGE MODELING FOR TRANSFORMER-BASED SELF-SUPERVISED LEARNING |
2047 | ATTENTIVE DEEP K-SVD NETWORK FOR PATCH CORRELATED IMAGE DENOISING |
2184 | ATTRIBUTE LEARNING WITH KNOWLEDGE ENHANCED PARTIAL ANNOTATIONS |
2055 | Audio-Visual Quality Assessment for User Generated Content: Database and Method |
2924 | AUTOMATED DIAGNOSIS OF BREAST CANCER USING DEEP LEARNING-BASED WHOLE SLIDE IMAGE ANALYSIS OF MOLECULAR BIOMARKERS |
1900 | AUTONOMOUS POLYCRYSTALLINE MATERIAL DECOMPOSITION FOR HYPERSPECTRAL NEUTRON TOMOGRAPHY |
2179 | BACKGROUND CLUSTERING PRE-TRAINING FOR FEW-SHOT SEGMENTATION |
1073 | BackGround Masked Guided Network for Skin Lesion Segmentation in Dermoscopy Image |
3178 | Base Layer Efficiency in Scalable Human-Machine Coding |
1594 | BATINET: BACKGROUND-AWARE TEXT TO IMAGE SYNTHESIS AND MANIPULATION NETWORK |
2421 | BAYESIAN HYBRID LOSS FOR HYPERSPECTRAL SISR USING 3D WIDE RESIDUAL CNN |
3114 | BCKD: BLOCK-CORRELATION KNOWLEDGE DISTILLATION |
3131 | BITRATE-PERFORMANCE OPTIMIZED MODEL TRAINING FOR THE NEURAL NETWORK CODING (NNC) STANDARD |
1334 | BITS-NET: BLIND IMAGE TRANSPARENCY SEPARATION NETWORK |
2628 | BLACKBOX FACE RECONSTRUCTION FROM DEEP FACIAL EMBEDDINGS USING A DIFFERENT FACE RECOGNITION MODEL |
2009 | Blind Omnidirectional Image Quality Assessment: Integrating Local Statistics and Global Semantics |
2588 | BLIND QUALITY ASSESSMENT OF LIGHT FIELD IMAGE BASED ON SPATIO-ANGULAR TEXTURAL VARIATION |
3346 | BLOCK-BASED MOTION ESTIMATION FOR DEEP-LEARNED VIDEO CODING |
3485 | BPQA: A BLIND POINT CLOUD QUALITY ASSESSMENT METHOD |
2582 | BS-YOLOV5S: INSULATOR DEFECT DETECTION WITH ATTENTION MECHANISM AND MULTI-SCALE FUSION |
2187 | CAN HUMAN ATTRIBUTE SEGMENTATION BE MORE ROBUST TO OPERATIONAL CONTEXTS WITHOUT NEW LABELS? |
1488 | CAN WE DISTILL KNOWLEDGE FROM POWERFUL TEACHERS DIRECTLY? |
2430 | Capsule Transformer Network for Dynamic Hand Gesture Recognition using Multimodal Data |
2949 | CDNET: CLUSTER DECISION FOR DEEPFAKE DETECTION GENERALIZATION |
1493 | cDPMSR: CONDITIONAL DIFFUSION PROBABILISTIC MODELS FOR SINGLE IMAGE SUPER-RESOLUTION |
1649 | Change Detection for Remote Sensing Images based on Semantic Prototypes and Contrastive Learning |
2347 | CHANNEL PRUNING VIA ATTENTION MODULE AND MEMORY CURVE |
1245 | CKT: CROSS-IMAGE KNOWLEDGE TRANSFER FOR TEXTURE ANOMALY DETECTION |
3176 | CLASSIFICATION TASK ASSISTED SEGMENTATION NETWORK FOR BREAST TUMOR SEGMENTATION IN ULTRASOUND IMAGES |
1130 | CLIP4STEREO: REVISITING DOMAIN GENERALIZED STEREO MATCHING VIA CLIP |
1476 | CLIP-FG: SELECTING DISCRIMINATIVE IMAGE PATCHES BY CONTRASTIVE LANGUAGE-IMAGE PRE-TRAINING FOR FINE-GRAINED IMAGE CLASSIFICATION |
3123 | CLIP-TSA: CLIP-Assisted Temporal Self-Attention for Weakly-Supervised Video Anomaly Detection |
2068 | CLOT: Contrastive Learning-Driven and Optimal Transport-Based Training for Simultaneous Clustering |
3366 | ClothFit: Cloth-Human-Attribute Guided Virtual Try-On Network Using 3D Simulated Dataset |
3134 | CNN-BASED ESTIMATION OF WATER DEPTH FROM MULTISPECTRAL DRONE IMAGERY FOR MOSQUITO CONTROL |
1951 | Coarse-to-Fine Pyramid Feature Mining for Wheat Head Detection |
1480 | COCCA: POINT CLOUD COMPLETION THROUGH CAD CROSS-ATTENTION |
2349 | Coco-Teach: A CONTRASTIVE CO-TEACHING NETWORK FOR INCREMENTAL 3D OBJECT DETECTION |
2564 | COLOR LEARNING FOR IMAGE COMPRESSION |
1491 | Combining Self-Supervised and Supervised Learning with Noisy Labels |
2573 | COMPACT SELECTIVE TRANSFORMER BASED ON INFORMATION ENTROPY FOR FACIAL EXPRESSION RECOGNITION IN THE WILD |
2772 | Comparative Study of Saliency- and Scanpath-Based Approaches for Patch Selection in Image Quality Assessment |
2294 | COMPLEXITY REDUCTION OF GRAPH SIGNAL DENOISING BASED ON FAST GRAPH FOURIER TRANSFORM |
2271 | COMPLEXITY SCALABLE LEARNING-BASED IMAGE DECODING |
2160 | COMPLEXITY-EFFICIENT QUANTIZER SELECTION FOR HEVC ENCODER |
3272 | COMPOUND MULTI-BRANCH FEATURE FUSION FOR IMAGE DERAINDROP |
3462 | Conditional Injective Flows for Bayesian Imaging |
2912 | CONFIDENCE-AWARE CLUSTERED LANDMARK FILTERING FOR HYBRID 3D FACE TRACKING |
2338 | CONSISTENT AND DIVERSE HUMAN MOTION PREDICTION USING CONDITIONAL VARIATIONAL AUTOENCODER WITH CONTEXT-AWARE LATENT SPACE |
1200 | CONSISTENT AND MULTI-SCALE SCENE GRAPH TRANSFORMER FOR SEMANTIC-GUIDED IMAGE OUTPAINTING |
3096 | CONTENT-ADAPTIVE PARALLEL ENTROPY CODING FOR END-TO-END IMAGE COMPRESSION |
1010 | CONTEXT-AWARE DATA AUGMENTATION FOR LIDAR 3D OBJECT DETECTION |
1268 | Context-Aware Inpainter-Refiner for Skeleton-Based Human Motion Completion |
2655 | Context-Aware Multi-Stream Networks for Dimensional Emotion Prediction in Images |
2696 | CONTEXT-AWARE PEDESTRIAN TRAJECTORY PREDICTION WITH MULTIMODAL TRANSFORMER |
3431 | CONTEXT-AWARE TRANSFORMERS FOR WEAKLY SUPERVISED BAGGAGE THREAT LOCALIZATION |
2781 | CONTINUAL LEARNING FOR OUT-OF-DISTRIBUTION PEDESTRIAN DETECTION |
3373 | CONTOUR ARTIFACT REMOVAL FOR EXPANDED HDR CONTENT |
2468 | CONTOUR-ASSISTED LONG-RANGE PERCEPTUAL NETWORK FOR CAMOUFLAGED INSTANCE SEGMENTATION |
1313 | CONTROLLING FACIAL ATTRIBUTE SYNTHESIS BY DISENTANGLING ATTRIBUTE FEATURE AXES IN LATENT SPACE |
3063 | CORRELATION AND FOREGROUND ATTENTION TO IMPROVE OBJECT DETECTION |
1364 | COST-EFFICIENT MULTI-INSTANCE MULTI-LABEL ACTIVE LEARNING VIA CORRELATION OF FEATURES |
2409 | COUPLING SPATIAL AND CHANNEL TRANSFORMER FOR SINGLE IMAGE DERAINING |
1613 | COVARIANCE-AWARE FEATURE ALIGNMENT WITH PRE-COMPUTED SOURCE STATISTICS FOR TEST-TIME ADAPTATION TO MULTIPLE IMAGE CORRUPTIONS |
2982 | CPU MICROARCHITECTURAL PERFORMANCE ANALYSIS OF SVT-AV1 ENCODER |
1233 | CROSS SPECTRAL IMAGE RECONSTRUCTION USING A DEEP GUIDED NEURAL NETWORK |
1477 | Cross-Domain Few-Shot Classification via Inter-Source Stylization |
1125 | Cross-Inferential Networks for Source-free Unsupervised Domain Adaptation |
1462 | CROSS-LAYER PATCH ALIGNMENT AND INTRA-AND-INTER PATCH RELATIONS FOR KNOWLEDGE DISTILLATION |
1473 | CROSS-SCALE QUERY-SUPPORT ALIGNMENT APPROACH FOR SMALL OBJECT DETECTION IN THE FEW-SHOT REGIME |
1847 | CR-UNIT: Unsupervised Image-to-Image Translation with Content Reconstruction |
1721 | CSSBA: A CLEAN LABEL SAMPLE-SPECIFIC BACKDOOR ATTACK |
1575 | CTI-UNET: HYBRID LOCAL FEATURES AND GLOBAL REPRESENTATIONS EFFICIENTLY |
1731 | Curriculum Knowledge Switching for Pancreas Segmentation |
2497 | CXRMIM: MASKED IMAGE MODELING PRE-TRAINING PARADIGM FOR CHEST X-RAY IMAGES ANALYSIS |
1624 | Data Augmentation using Corner CutMix and an Auxiliary Self-supervised Loss |
2693 | DATA GENERATION WITH STRUCTURE ENFORCING ADVERSARIAL LEARNING |
2292 | DATA POISONING ATTACK AIMING THE VULNERABILITY OF CONTINUAL LEARNING |
1349 | DATASET-LEVEL DIRECTED IMAGE TRANSLATION FOR CROSS-DOMAIN CROWD COUNTING |
2251 | DAUT: UNDERWATER IMAGE ENHANCEMENT USING DEPTH AWARE U-SHAPE TRANSFORMER |
1619 | DEEP ACTIVE LEARNING BASED ON SALIENCY-GUIDED DATA AUGMENTATION FOR IMAGE CLASSIFICATION |
1548 | DEEP BAYESIAN BLIND COLOR DECONVOLUTION OF HISTOLOGICAL IMAGES |
3443 | DEEP CNN-BASED PRE-ENCODING PERCEPTUAL QUALITY CONTROL AND PREDICTION |
1865 | DEEP CROSS-MODAL STEGANOGRAPHY USING NEURAL REPRESENTATIONS |
2937 | DEEP LEARNING BASED WORKFLOW FOR ACCELERATED INDUSTRIAL X-RAY COMPUTED TOMOGRAPHY |
3363 | DEEP LEARNING MEETS PARTICLE SWARM OPTIMIZATION FOR AORTIC VALVE CALCIUM SCORING FROM CARDIAC COMPUTED TOMOGRAPHY |
2394 | DEEP LEARNING RECONSTRUCTION FOR SINGLE PIXEL IMAGING WITH GENERATIVE ADVERSARIAL NETWORKS |
2742 | DEEP LEARNING-BASED COMPRESSED DOMAIN POINT CLOUD CLASSIFICATION |
2958 | DEEP OC-SORT: MULTI-PEDESTRIAN TRACKING BY ADAPTIVE RE-IDENTIFICATION |
1600 | Deep robust image restoration using the Moore-Penrose blur inverse |
1042 | DEEP UNFOLDING NETWORK WITH PHYSICS-BASED PRIORS FOR UNDERWATER IMAGE ENHANCEMENT |
1831 | DEEP UNROLLING SHRINKAGE NETWORK FOR DYNAMIC MR IMAGING |
1986 | DEEP UNSUPERVISED HASHING WITH SEMANTIC CONSISTENCY LEARNING |
2387 | DEEP UNSUPERVISED REFLECTION REMOVAL USING DIFFUSION MODELS |
2469 | DEEP VARIATIONAL SEGMENTATION OF TOPOLOGY-CONSTRAINED OBJECT SETS, WITH CORRELATED UNCERTAINTY MODELS, FOR ROBUSTNESS TO DEGRADATIONS |
2373 | Deepfake Face Provenance for Proactive Forensics |
1486 | DEEP-LEARNING-BASED ENERGY AWARE IMAGES |
3450 | Deformation Robust Text Spotting with Geometric Prior |
1457 | DEGRADATION CONDITIONED GAN FOR DEGRADATION GENERALIZATION OF FACE RESTORATION MODELS |
2952 | DENOISING POINT CLOUDS WITH INTENSITY AND SPATIAL FEATURES IN RAINY WEATHER |
2486 | DENSE DEPTH ESTIMATION FOR SURGICAL ENDOSCOPE ROBOT WITH MULTI-BASELINE DEPTH MAP FUSION |
2902 | DENSECL: HAZE MITIGATION USING DENSE BLOCKS AND CONTRASTIVE LOSS REGULARIZATION |
1698 | Densely Connected Swin-UNet for Multiscale Information Aggregation in Medical Image Segmentation |
2836 | DEPTH ESTIMATION OF MULTI-MODAL SCENE BASED ON MULTI-SCALE MODULATION |
2918 | DEPTH MAP ESTIMATION FROM MULTI-VIEW IMAGES WITH NERF-BASED REFINEMENT |
1463 | DESIGNING STRONG BASELINES FOR TERNARY NEURAL NETWORK QUANTIZATION THROUGH SUPPORT AND MASS EQUALIZATION |
2264 | DETECTING STABLE DIFFUSION GENERATED IMAGES USING FREQUENCY ARTIFACTS: A CASE STUDY ON DISNEY-STYLE ART |
2674 | DETECTION TRANSFORMER WITH DIVERSIFIED OBJECT QUERIES |
2417 | DF-Net: Diversity-Focused Network for Video Object Detection |
1438 | DFT-CAM: DISCRETE FOURIER TRANSFORM DRIVEN CLASS ACTIVATION MAP |
1461 | DIFFERENTIAL ENHANCED SIAMESE SEGMENTATION NETWORK FOR PRINTED LABEL DEFECT DETECTION |
2110 | DIFFUSIONSTR: DIFFUSION MODEL FOR SCENE TEXT RECOGNITION |
1593 | DISPLAY POWER MODELING FOR ENERGY CONSUMPTION CONTROL |
2449 | DISTILLING KNOWLEDGE OF BIDIRECTIONAL LANGUAGE MODEL FOR SCENE TEXT RECOGNITION |
3332 | DLAHSD: Dynamic Label adopted in Auxiliary Head for SAR Detection |
2426 | DLEN: DEEP LAPLACIAN ENHANCEMENT NETWORKS FOR LOW-LIGHT IMAGES |
1492 | DNC-NET: DUAL-NEIGHBOURHOOD CONSENSUS NETWORK FOR FEATURE MATCHING |
1521 | Document binarization with Multi-branch Gated Convolutional Generative Adversarial Networks |
1514 | DOCUMENT CHANGE DETECTION WITH HIERARCHICAL PATCH COMPARISON |
2137 | DODGING THE DOUBLE DESCENT IN DEEP NEURAL NETWORKS |
1163 | DOG ACCURACY VIA EQUIVARIANCE: GET THE INTERPOLATION RIGHT |
2247 | DOMAIN ADAPTATION IN POWER LINE SEGMENTATION: A NEW SYNTHETIC DATASET |
2239 | Domain Adaptation of Digital Pathology Images using Joint Stain Color and Image Quality Constraints |
1647 | DOMAIN GENERALIZATION METHOD FOR PERSON RE-ID USING METABIN AND MIXSTYLE |
2079 | Domain Invariant Regularization By Disentangling content and style features for Visual Domain Generalization |
1620 | DOMAIN-GENERALIZED FACE ANTI-SPOOFING WITH UNKNOWN ATTACKS |
1969 | DPDM: FEATURE-BASED POSE REFINEMENT WITH DEEP POSE AND DEEP MATCH FOR MONOCULAR VISUAL ODOMETRY |
1877 | DP-NET: LEARNING DISCRIMINATIVE PARTS FOR IMAGE RECOGNITION |
2855 | DSG-PL: ROI EXTRACTION BASED ON DUAL SALIENCY GUIDED PROGRESSIVE LEARNING FOR WEAKLY LABELED REMOTE SENSING IMAGES |
1367 | Dual Temporal Transformers for Fine-Grained Dangerous Action Recognition |
1538 | DUAL TRANSFORMER ENCODER MODEL FOR MEDICAL IMAGE CLASSIFICATION |
1520 | DYNAMIC DUAL-GRAPH FUSION CONVOLUTIONAL NETWORK FOR ALZHEIMER'S DISEASE DIAGNOSIS |
2400 | DYNAMIC POINT CLOUD COMPRESSION APPROACH USING HEXAHEDRON SEGMENTATION |
2980 | DYNAMIC RANGE TRANSFORMER (DRT): LEARNING ENHANCED LOG-PERCEPTUAL INFORMATION WITH SWIN-FOURIER CONVOLUTION NETWORK FOR HDR IMAGING |
1826 | Dynamic Unilateral Dual Learning for Text to Image Synthesis |
1325 | Early Detection of Cars Exiting Road-side Parking |
2888 | EARLY DIAGNOSIS OF PROSTATE CANCER USING PARAMETRIC ESTIMATION OF IVIM FROM DW-MRI |
1685 | EDGE SYNTHESIS BLOCK: A BUILDING UNIT FOR REAL-TIME SINGLE IMAGE SUPER RESOLUTION |
1494 | EFFICIENT AERIAL IMAGE OBJECT DETECTION WITH IMAGING CONDITION DECOMPOSITION |
3499 | EFFICIENT ANOMALY DETECTION USING SELF-SUPERVISED MULTI-CUE TASKS |
3206 | EFFICIENT ANY-TARGET BACKDOOR ATTACK WITH PSEUDO POISONED SAMPLES |
1774 | EFFICIENT CONVOLUTION AND TRANSFORMER-BASED NETWORK FOR VIDEO FRAME INTERPOLATION |
2274 | EFFICIENT JOINT VIDEO DENOISING AND SUPER-RESOLUTION |
2262 | EFFICIENT PER-SHOT TRANSFORMER-BASED BITRATE LADDER PREDICTION FOR ADAPTIVE VIDEO STREAMING |
1561 | EFFICIENT PREDICTION OF MODEL TRANSFERABILITY IN SEMANTIC SEGMENTATION TASKS |
3362 | EFFICIENT PRUNING METHOD FOR LEARNED LOSSY IMAGE COMPRESSION MODELS BASED ON SIDE INFORMATION |
2769 | Efficient Transfer by Robust Label Selection and Learning with Pseudo-Labels |
1787 | EFFICIENT-HDRTV: EFFICIENT SDR TO HDR CONVERSION FOR HDR TV |
3023 | ELEGAN: AN EFFICIENT LOW LIGHT ENHANCEMENT GAN FOR UNPAIRED SUPERVISION |
2619 | ENABLING HIGH-RESOLUTION POSE ESTIMATION IN REAL TIME USING ACTIVE PERCEPTION |
2022 | ENABLING THE ENCODER-EMPOWERED GAN-BASED VIDEO GENERATORS FOR LONG VIDEO GENERATION |
3017 | ENCODER COMPLEXITY CONTROL IN SVT-AV1 BY SPEED-ADAPTIVE PRESET SWITCHING |
1318 | ENCODING-AWARE DEEP VIDEO SUPER-RESOLUTION FRAMEWORK |
2445 | END TO END GENERATIVE META CURRICULUM LEARNING FOR MEDICAL DATA AUGMENTATION |
1888 | Endoscopic Feature Enhancement for Stomach 3D Reconstruction without Dyeing |
2309 | END-TO-END LEARNED LIGHT FIELD IMAGE RESCALING USING JOINT SPATIAL-ANGULAR AND EPIPOLAR INFORMATION |
1435 | END-TO-END TRAINABLE WEAKLY NON-NEGATIVE FACTORIZATION |
1934 | ENHANCED TEMPORAL MOTION DERIVATION BEYOND VVC |
2927 | ENHANCED U-TRANSFORMER NETWORKS FOR AUTOMATIC PULMONARY VESSEL SEGMENTATION IN CT IMAGES |
1412 | ENHANCING LOW-LIGHT IMAGES USING INFRARED ENCODED IMAGES |
2032 | Enhancing Next Active Object-based Egocentric Action Anticipation with Guided Attention |
3181 | ENHANCING TARGETED TRANSFERABILITY VIA SUPPRESSING HIGH-CONFIDENCE LABELS |
2490 | EPIGRAPHICALLY-RELAXED LINEARLY-INVOLVED GENERALIZED MOREAU-ENHANCED MODEL FOR LAYERED MIXED NORM REGULARIZATION |
2643 | ERROR CONCEALMENT FOR SCALABLE VIDEO CODING BASED ON DEFORMABLE CONVOLUTION NETWORK |
2819 | ESTIMATED DEPTH BASED PROGRESSIVE INTERACTIVE FRAMEWORK FOR RGB SALIENT OBJECT DETECTION IN IMAGES |
3045 | EVENT DATA STREAM COMPRESSION BASED ON POINT CLOUD REPRESENTATION |
1269 | EVENT-BASED CAMERA SIMULATION USING MONTE CARLO PATH TRACING WITH ADAPTIVE DENOISING |
2018 | EXPLORING ANATOMICAL SIMILARITY IN CARDIAC-GATED SPECT IMAGES FOR MOTION COMPENSATION WITH A DEEP LEARNING NETWORK |
2698 | EXPLORING DIFFUSION MODELS FOR UNSUPERVISED VIDEO ANOMALY DETECTION |
1599 | EXPLORING EFFECTIVE KNOWLEDGE DISTILLATION FOR TINY OBJECT DETECTION |
2026 | EXPLORING SELF-SUPERVISED REPRESENTATION LEARNING FOR LOW-RESOURCE MEDICAL IMAGE ANALYSIS |
1585 | EXPLORING THE CONNECTION BETWEEN NEURON COVERAGE AND ADVERSARIAL ROBUSTNESS IN DNN CLASSIFIERS |
1067 | FACE PHOTO-SKETCH SYNTHESIS VIA DOMAIN-INVARIANT FEATURE EMBEDDING |
2977 | FACET-LEVEL SEGMENTATION OF 3D TEXTURES ON CULTURAL HERITAGE OBJECTS |
3208 | FACIAL EXPRESSION RECOGNITION USING LIGHT FIELD CAMERAS: A COMPARATIVE STUDY OF DEEP LEARNING ARCHITECTURES |
1750 | FALSE CORRESPONDENCE REMOVAL VIA REVISITING SEMANTIC CONTEXT WITH POSITION-ATTENTIVE LEARNING |
2547 | FAST LEARNING-BASED SPLIT TYPE PREDICTION ALGORITHM FOR VVC |
2242 | FAST OPTIMAL TRANSPORT FOR LATENT DOMAIN ADAPTATION |
1490 | FAST QTMT PARTITION FOR VVC INTRA CODING USING U-NET FRAMEWORK |
2459 | FAST-CONVERGENT FEDERATED LEARNING VIA CYCLIC AGGREGATION |
1512 | FAT: FIELD-AWARE TRANSFORMER FOR 3D POINT CLOUD SEMANTIC SEGMENTATION |
1723 | FEATURE ADVERSARIAL DISTILLATION FOR POINT CLOUD CLASSIFICATION |
1912 | FEATURE ENHANCEMENT AND FUSION FOR RGB-T SALIENT OBJECT DETECTION |
2851 | Feature Fusion Enhanced Super Resolution for Low Bitrate Screen Content Compression |
3047 | FEATURE INTEGRATION VIA BACK-PROJECTION ORDERING MULTI-MODAL GAUSSIAN PROCESS LATENT VARIABLE MODEL FOR RATING PREDICTION |
1481 | FEATURE SPACE DATA AUGMENTATION FOR VIEWPOINT-ROBUST ACTION RECOGNITION IN VIDEOS |
2044 | FEATURE STRUCTURE SIMILARITY INDEX FOR HYBRID HUMAN AND MACHINE VISION |
1766 | FEATURE-AWARE PROHIBITED ITEMS DETECTION FOR X-RAY IMAGES |
2042 | FEATURE-DOMAIN PROXIMAL HIGH-DIMENSIONAL GRADIENT DESCENT NETWORK FOR IMAGE COMPRESSED SENSING |
2462 | FEDMBP: MULTI-BRANCH PROTOTYPE FEDERATED LEARNING ON HETEROGENEOUS DATA |
1971 | FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION BASED ON CROSS-DOMAIN SPECTRAL SEMANTIC RELATION TRANSFORMER |
3074 | FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION WITH SPECTRAL-SPATIAL FEATURE FUSION BASED ON FUZZY BROAD LEARNING SYSTEM |
2337 | Few-Shot Lip-Password Based Speaker Verification |
3079 | FGC-VC: FLOW-GUIDED CONTEXT VIDEO COMPRESSION |
1725 | FGCVQA: FINE-GRAINED CROSS-ATTENTION FOR MEDICAL VQA |
1948 | Fibonet: A Light-weight and Efficient Neural Network for Image Segmentation |
1479 | FIGHTING OVER-FITTING WITH QUANTIZATION FOR LEARNING DEEP NEURAL NETWORKS ON NOISY LABELS |
2073 | FILM GRAIN REMOVAL USING METADATA |
1837 | FINALIZATION OF VVENC'S SCREEN CONTENT DETECTOR AND TWO-PASS RATE CONTROL USING PRE-FILTERING STATISTICS |
2112 | FINDING CAMOUFLAGED OBJECT GUIDED BY CONTOUR AND ATTENTION |
3371 | Fine-to-coarse Object Classification of Very Large Images |
2268 | Fisheye Multiple Object Tracking by Learning Distortions without Dewarping |
3043 | FLASH COMPENSATED LOW-LIGHT ENHANCEMENT VIA HIERARCHICAL NETWORK PREDICTION |
3364 | Flow-based one-class anomaly detection with Multi-frequency Feature fusion |
1344 | FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE VIDEO SUPER-RESOLUTION |
2656 | FLOW-GUIDED TRANSFORMER FOR VIDEO COLORIZATION |
3084 | FORWARD DIFFUSION GUIDED RECONSTRUCTION AS A MULTI-MODAL MULTI-TASK LEARNING SCHEME |
2928 | FOURIER SERIES AND LAPLACIAN NOISE-BASED QUANTIZATION ERROR COMPENSATION FOR END-TO-END LEARNING-BASED IMAGE COMPRESSION |
2479 | FPGA-ACCELERATED HEVC ENCODER FOR ENERGY-EFFICIENT MULTI-ACCESS EDGE COMPUTING |
3497 | FRACTIONAL FOURIER TRANSFORM MEETS TRANSFORMER ENCODER |
2841 | FREQUENCY DISENTANGLED FEATURES IN NEURAL IMAGE COMPRESSION |
1623 | FREQUENCY ENHANCEMENT NETWORK FOR EFFICIENT COMPRESSED VIDEO ACTION RECOGNITION |
2542 | Frequency-Aware Re-parameterization for Over-fitting Based Image Compression |
3257 | FROM FELINE CLASSIFICATION TO SKILLS EVALUATION: A MULTITASK LEARNING FRAMEWORK FOR EVALUATING MICRO SUTURING NEUROSURGICAL SKILLS |
1259 | FULLY AUTOMATED SCAN-TO-BIM VIA POINT CLOUD INSTANCE SEGMENTATION |
2886 | FULLY AUTOMATIC CERVICAL VERTEBRAE SEGMENTATION VIA ENHANCED U2-NET |
3230 | FUNCTIONAL KNOWLEDGE TRANSFER WITH SELF-SUPERVISED REPRESENTATION LEARNING |
2297 | Fusing Explicit and Implicit Flow for Optical Flow Estimation |
1235 | FUZZY-CONDITIONED DIFFUSION AND DIFFUSION PROJECTION ATTENTION APPLIED TO FACIAL IMAGE CORRECTION |
1631 | GAITMM: MULTI-GRANULARITY MOTION SEQUENCE LEARNING FOR GAIT RECOGNITION |
2699 | Generalizable Embeddings with Cross-batch Metric Learning |
1459 | GENERALIZED PSEUDO-LABELING IN CONSISTENCY REGULARIZATION FOR SEMI-SUPERVISED LEARNING |
3273 | GEOMETRIC MAGNIFICATION-BASED ATTENTION GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED MICRO-GESTURE RECOGNITION |
1807 | GEOMETRIC PRIOR-ASSISTED FEATURE PRESENTATION ENHANCEMENT FOR OBJECT DETECTION IN AERIAL IMAGES |
1968 | GEOMETRY-AWARE VIDEO QUALITY ASSESSMENT FOR DYNAMIC DIGITAL HUMAN |
3316 | GLOBAL BALANCED NETWORKS FOR MULTI-VIEW STEREO |
1833 | GLOBAL-LOCAL AWARENESS NETWORK FOR IMAGE SUPER-RESOLUTION |
2428 | GMML is All you Need |
3036 | GNP ATTACK: TRANSFERABLE ADVERSARIAL EXAMPLES VIA GRADIENT NORM PENALTY |
1422 | GPCGC: A GREEN POINT CLOUD GEOMETRY CODING METHOD |
3057 | GRAD-FEC: UNEQUAL LOSS PROTECTION OF DEEP FEATURES IN COLLABORATIVE INTELLIGENCE |
1677 | GRAPHRPE: RELATIVE POSITION ENCODING GRAPH TRANSFORMER FOR 3D HUMAN POSE ESTIMATION |
1591 | GRID-TRANSFORMER FOR FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION |
2732 | Group Masked Model Learning for General Audio Representation |
3350 | GS-NET: GLOBAL SELF-ATTENTION GUIDED CNN FOR MULTI-STAGE GLAUCOMA CLASSIFICATION |
1060 | HALF OF AN IMAGE IS ENOUGH FOR QUALITY ASSESSMENT |
2705 | HANDS IN FOCUS: SIGN LANGUAGE RECOGNITION VIA TOP-DOWN ATTENTION |
3385 | HARD SAMPLES BASED MARGIN LOSS FOR FACE VERIFICATION |
2371 | HDR-LMDA: A LOCAL AREA-BASED MIXED DATA AUGMENTATION METHOD FOR HDR VIDEO RECONSTRUCTION |
2465 | HDTC: HYBRID MODEL OF DUAL-TRANSFORMER AND CONVOLUTIONAL NEURAL NETWORK FROM RGB-D FOR DETECTION OF LETTUCE GROWTH TRAITS |
2667 | HER2-SISH HISTOPATHOLOGY IMAGE CLASSIFICATION USING DEEP NEURAL NETWORKS |
2181 | HETEROGENEOUS IMAGE CHANGE DETECTION BASED ON DEEP IMAGE TRANSLATION AND FEATURE REFINEMENT-AGGREGATION |
2861 | HIERARCHICAL ARITHMETIC CODING OF DISPLACEMENTS FOR DYNAMIC MESH COMPRESSION |
2282 | HIERARCHICAL CONDITIONAL SEMI-PAIRED IMAGE-TO-IMAGE TRANSLATION FOR MULTI-TASK IMAGE DEFECT CORRECTION ON SHOPPING WEBSITES |
2475 | HIERARCHICAL FEATURE FUSION TRANSFORMER FOR NO-REFERENCE IMAGE QUALITY ASSESSMENT |
3164 | HIERARCHICAL MULTI-TASK LEARNING VIA TASK AFFINITY GROUPINGS |
2584 | HIERARCHICAL TERRAIN ATTENTION AND MULTI-SCALE RAINFALL GUIDANCE FOR FLOOD IMAGE PREDICTION |
1232 | HIGH DYNAMIC RANGE IMAGE TONE MAPPING BASED ON LAYER DECOMPOSITION AND IMAGE FUSION |
2368 | HIGH DYNAMIC RANGE IMAGING WITH MULTI-EXPOSURE BINNING ON QUAD BAYER COLOR FILTER ARRAY |
2045 | HIGH-ACCURACY GESTURE RECOGNITION USING MM-WAVE RADAR BASED ON CONVOLUTIONAL BLOCK ATTENTION MODULE |
2031 | HIGH-PRECISION MOTION VECTOR REFINEMENT FOR BI-DIRECTIONAL OPTICAL FLOW |
2774 | HIGH-THROUGHPUT AND MULTIPLIERLESS HARDWARE DESIGN FOR THE AV1 LOCAL WARPED MC INTERPOLATION |
1817 | HINTING PIPELINE AND MULTIVARIATE REGRESSION CNN FOR MAIZE KERNEL COUNTING ON THE EAR |
1905 | Hi-Res ACG: Towards High-Resolution Anime Characters Generation |
2518 | HM-PCGC: A HUMAN-MACHINE BALANCED POINT CLOUD GEOMETRY COMPRESSION SCHEME |
1249 | HOKEM: HUMAN AND OBJECT KEYPOINT-BASED EXTENSION MODULE FOR HUMAN-OBJECT INTERACTION DETECTION |
1391 | HQRetouch: Learning Professional Face Retouching via Masked Feature Fusion and Semantic-Aware Modulation |
3075 | HRFNET: HIGH-RESOLUTION FORGERY NETWORK FOR LOCALIZING SATELLITE IMAGE MANIPULATION |
3368 | HUMAN-INTERPRETABLE AND DEEP FEATURES FOR IMAGE PRIVACY CLASSIFICATION |
3456 | Hybrid Contrastive Prototypical Network for Few-Shot Scene Classification |
1803 | ICCL: SELF-SUPERVISED INTRA- AND CROSS-MODAL CONTRASTIVE LEARNING WITH 2D-3D PAIRS FOR 3D SCENE UNDERSTANDING |
3487 | ICIP 2023 CHALLENGE: FULL-REFERENCE AND NON-REFERENCE POINT CLOUD QUALITY ASSESSMENT METHODS WITH SUPPORT VECTOR REGRESSION |
3491 | IEEE ICIP 2023 CHALLENGE ON THE AUTOMATIC DETECTION OF MOSQUITO BREEDING GROUNDS |
2608 | IKD+: RELIABLE LOW COMPLEXITY DEEP MODELS FOR RETINOPATHY CLASSIFICATION |
2663 | IMAGE CODING VIA PERCEPTUALLY INSPIRED GRAPH LEARNING |
2494 | IMAGE DEHAZING GUIDED BY LOW-PASS REINFORCED AIRLIGHT |
2380 | IMAGE INPAINTING BY MSCSWIN TRANSFORMER ADVERSARIAL AUTOENCODER |
1848 | IMAGE INPAINTING WITH INFORMATION LOSS REDUCTION AND TEXTURE-STRUCTURE FEATURE FUSION |
2885 | IMAGE STITCHING BASED ON MULTI-SCALE MESHES |
1932 | IMAGE TRANSLATION-BASED DENIABLE ENCRYPTION AGAINST MODEL EXTRACTION ATTACK |
2673 | IMAGE-COUPLED VOLUME PROPAGATION FOR STEREO MATCHING |
2037 | IMBALANCE-AWARE ADAPTIVE MARGIN LOSS FOR FAIR MULTI-LABEL FACE ATTRIBUTE RECOGNITION |
2954 | Implicit Attention-based Cross-modal Collaborative Learning for Action Recognition |
1181 | IMPOSING TOTAL VARIATION PRIOR INTO GUIDED FILTER |
1609 | IMPROVE UNSUPERVISED DEEP HASHING VIA MASKED CONTRASTIVE LEARNING |
3460 | Improved Bilinear Pooling With Pseudo Square-Rooted Matrix |
3486 | IMPROVED YOLOV7 WITH TRANSFORMER PREDICTION HEAD FOR AUTOMATED DETECTION OF MOSQUITO BREEDING GROUNDS |
2275 | IMPROVEMENT OF IMAGE SEGMENTATION MODEL FOR HANDWRITTEN NOTEBOOK ANALYTICS |
3251 | Improving Adversarial Transferability via Feature Translation |
2878 | Improving CNN-based Person Re-identification using score Normalization |
3318 | IMPROVING GENERALIZATION IN FACIAL MANIPULATION DETECTION USING IMAGE NOISE RESIDUALS AND TEMPORAL FEATURES |
3238 | IMPROVING LEARNED INVERTIBLE CODING WITH INVERTIBLE ATTENTION AND BACK-PROJECTION |
1696 | IMPROVING NERF WITH HEIGHT DATA FOR UTILIZATION OF GIS DATA |
2435 | IMPROVING ROBUSTNESS OF SINGLE IMAGE SUPER-RESOLUTION MODELS WITH MONTE CARLO METHOD |
2189 | IMPROVING SPHERICAL IMAGE RESAMPLING THROUGH VIEWPORT-ADAPTIVITY |
1700 | IMPROVING TRANSLATION INVARIANCE IN CONVOLUTIONAL NEURAL NETWORKS WITH PERIPHERAL PREDICTION PADDING |
1188 | Improving Video Colorization by Test-Time Tuning |
2799 | INDUCTIVE GRAPH NEURAL NETWORKS FOR MOVING OBJECT SEGMENTATION |
2642 | INFERENCE ACCELERATION OF DEEP LEARNING CLASSIFIERS BASED ON RNN |
3358 | INFRARED SMALL TARGET DETECTION BASED ON SALIENCY GUIDED MULTI-TASK LEARNING |
2813 | INTEGER QUANTIZED LEARNED IMAGE COMPRESSION |
2508 | INTELLIGENT PAINTER: PICTURE COMPOSITION WITH RESAMPLING DIFFUSION MODEL |
2353 | INTER-FRAME CODING FOR DYNAMIC MESHES VIA TEMPORALLY-CONSISTENT RE-MESHING |
2437 | INTERPRETABLE VISUAL QUESTION ANSWERING REFERRING TO OUTSIDE KNOWLEDGE |
2687 | INTERPRETABLE VISUAL QUESTION ANSWERING VIA REASONING SUPERVISION |
2169 | INTERPRETING CONVOLUTIONAL NEURAL NETWORKS BY EXPLAINING THEIR PREDICTIONS |
1413 | INTERPRETING LATENT REPRESENTATION IN NEURAL RADIANCE FIELDS FOR MANIPULATING OBJECT SEMANTICS |
3015 | INTER-SCALE SURE-LET IMAGE RESTORATION WITH DEEP UNROLLED IMAGE PRIOR |
3142 | Introducing a Framework for Single-Human Tracking using Event-based Cameras |
3484 | IR-SETNET: SPARSITY AWARE ENSEMBLE NETWORK FOR INFRARED IMAGING BASED DRONE LOCALIZATION AND TRACKING IN DISTORTED SURVEILLANCE VIDEOS |
2820 | IT WASN'T ME: IRREGULAR IDENTITY IN DEEPFAKE VIDEOS |
1240 | Joint Demosaicing and Denoising with Gradient Guidance in Quad Bayer CFA |
1854 | JOINT OPTIMIZED POINT CLOUD COMPRESSION FOR 3D OBJECT DETECTION |
1736 | Joint Probability Distribution Regression for Image Cropping |
1571 | JOINT UNDER-SAMPLING PATTERN OPTIMIZATION AND CONTENT-BASED RECONSTRUCTION NETWORK FOR FAST MRI RECONSTRUCTION |
2279 | JPEG COMPLIANT COMPRESSION FOR DNN VISION |
1339 | JPEG INFORMATION REGULARIZED DEEP IMAGE PRIOR FOR DENOISING |
2284 | JPEG PLENO LEARNING-BASED POINT CLOUD CODING: A PERFORMANCE ANALYSIS |
2416 | JPEG PLENO LIGHT FIELD ENCODER WITH MESH BASED VIEW WARPING |
1308 | KD-FIXMATCH: KNOWLEDGE DISTILLATION SIAMESE NEURAL NETWORKS |
2115 | KEYPOINTS DICTIONARY LEARNING FOR FAST AND ROBUST ALIGNMENT |
2610 | L2FUSION: LOW-LIGHT ORIENTED INFRARED AND VISIBLE IMAGE FUSION |
1226 | LADDER SIAMESE NETWORK: A METHOD AND INSIGHTS FOR MULTI-LEVEL SELF-SUPERVISED LEARNING |
2101 | LANGUAGE IDENTIFICATION AS IMPROVEMENT FOR LIP-BASED BIOMETRIC VISUAL SYSTEMS |
2991 | LAPTRAN: TRANSFORMER EMBEDDING GRAPH LAPLACIAN FOR POINT CLOUD PART SEGMENTATION |
2226 | LATENTPATCH: A NON-PARAMETRIC APPROACH FOR FACE GENERATION AND EDITING |
1686 | LATENT-SHIFT: GRADIENT OF ENTROPY HELPS NEURAL CODECS |
1152 | LDCFORMER: INCORPORATING LEARNABLE DESCRIPTIVE CONVOLUTION TO VISION TRANSFORMER FOR FACE ANTI-SPOOFING |
1431 | Learn more: Sub-significant area learning for fine-grained visual classification |
2897 | LEARNABLE SNAKE R-CNN FOR INSTANCE-LEVEL BIOMEDICAL IMAGE SEGMENTATION |
2244 | LEARNED IMAGE COMPRESSION GUIDED ADAPTIVE QUANTIZATION FOR PERCEPTUAL QUALITY |
2145 | Learned Image Compression with Large Capacity and Low Redundancy of Latent Representation |
2734 | LEARNED IMAGE COMPRESSION WITH MULTI-SCAN BASED CHANNEL FUSION |
1828 | LEARNING DISENTANGLED FEATURES FOR NERF-BASED FACE RECONSTRUCTION |
2266 | Learning Extended Depth of Field Hyperspectral Imaging |
2099 | LEARNING MULTI-SCALE FEATURES FOR JPEG IMAGE ARTIFACTS REMOVAL |
2291 | LEARNING MUTUALLY IN CROWD SCENES FOR PEDESTRIAN DETECTION |
2782 | LEARNING RAW IMAGE DENOISING USING A PARAMETRIC COLOR IMAGE MODEL |
2864 | Learning Spatially-Adaptive Squeeze-Excitation Networks for Few Shot Image Synthesis |
2035 | Learning Spatially-Adaptive Style-Modulation Networks for Single Image Synthesis |
1618 | LEARNING SPATIAL-TEMPORAL EMBEDDINGS FOR SEQUENTIAL POINT CLOUD FRAME INTERPOLATION |
1884 | LEARNING TO DRAW THROUGH A MULTI-STAGE ENVIRONMENT MODEL BASED REINFORCEMENT LEARNING |
1007 | LEARNING TORSO PRIOR FOR CO-SPEECH GESTURE GENERATION WITH BETTER HAND SHAPE |
1239 | LEARNING-BASED RATE CONTROL FOR LEARNING-BASED POINT CLOUD GEOMETRY CODING |
2616 | LEARNT DEEP HYPERPARAMETER SELECTION IN ADVERSARIAL TRAINING FOR COMPRESSED VIDEO ENHANCEMENT WITH A PERCEPTUAL CRITIC |
2019 | LEVERAGING EFFICIENT TRAINING AND FEATURE FUSION IN TRANSFORMERS FOR MULTIMODAL CLASSIFICATION |
1290 | LEVERAGING OPTICAL FLOW FEATURES FOR HIGHER GENERALIZATION POWER IN VIDEO OBJECT SEGMENTATION |
1528 | LEVERAGING VISUAL PROMPTS TO GUIDE LANGUAGE MODELING FOR REFERRING VIDEO OBJECT SEGMENTATION |
2295 | LGSQE: LIGHTWEIGHT GENERATED SAMPLE QUALITY EVALUATION |
2144 | Lightweight CNN-Based In-loop Filter for VVC Intra Coding |
2990 | Lightweight Deep Deblurring Model with Discriminative Multi-scale Feature Fusion |
3294 | LIGHTWEIGHT MULTI-VIEW-GROUP NEURAL NETWORK FOR 3D SHAPE CLASSIFICATION |
2522 | Lightweight Network Towards Real-time Image Denoising on Mobile Devices |
2058 | LITE-HRNET PLUS: FAST AND ACCURATE FACIAL LANDMARK DETECTION |
2213 | LKBQ: PUSHING THE LIMIT OF POST-TRAINING QUANTIZATION TO EXTREME 1 BIT |
3111 | LLA-FLOW: A LIGHTWEIGHT LOCAL AGGREGATION ON COST VOLUME FOR OPTICAL FLOW ESTIMATION |
1917 | LLDE: ENHANCING LOW-LIGHT IMAGES WITH DIFFUSION MODEL |
1863 | LLIEFORMER: A LOW-LIGHT IMAGE ENHANCEMENT TRANSFORMER NETWORK WITH A DEGRADED RESTORATION MODEL |
1998 | LMPDNET: TOF-PET LIST-MODE IMAGE RECONSTRUCTION USING MODEL-BASED DEEP LEARNING METHOD |
1927 | LOCAL CONTEXT AND DIMENSIONAL RELATION AWARE TRANSFORMER NETWORK FOR CONTINUOUS AFFECT ESTIMATION |
2398 | LOCAL TEXTURE COMPLEXITY GUIDED ADVERSARIAL ATTACK |
2570 | LOCAL-AWARE INTRA TEMPLATE MATCHING PREDICTION |
1051 | LOCAL-GLOBAL CONTRAST FOR LEARNING VOICE-FACE REPRESENTATIONS |
2607 | LOCALLY ACCUMULATED ADAM FOR DISTRIBUTED TRAINING WITH SPARSE UPDATES |
2366 | LONG-TAILED FEDERATED LEARNING VIA AGGREGATED META MAPPING |
3380 | LOSSY LIDAR POINT CLOUD COMPRESSION VIA CYLINDRICAL 3D CONVOLUTION NETWORKS |
1889 | LOW LIGHT RGB AND IR IMAGE FUSION WITH SELECTIVE CNN-TRANSFORMER NETWORK |
2411 | LOW-SAMPLING-FREQUENCY PLANE WAVE MEDICAL ULTRASOUND IMAGING BASED ON ADVERSARIAL LEARNING |
2333 | LSR: A Light-Weight Super-Resolution Method |
2708 | LT-VIT: A VISION TRANSFORMER FOR MULTI-LABEL CHEST X-RAY CLASSIFICATION |
1834 | LUMINANCE-PRESERVING VISIBLE AND NEAR-INFRARED IMAGE FUSION NETWORK WITH EDGE GUIDANCE |
2080 | M3FPOLYPSEGNET: SEGMENTATION NETWORK WITH MULTI-FREQUENCY FEATURE FUSION FOR POLYP LOCALIZATION IN COLONOSCOPY IMAGES |
3438 | MACHINE LEARNING DETECTS A BIOPSY NEEDLE IN ULTRASOUND IMAGES |
2785 | MACHINE-ATTENTION-BASED VIDEO CODING FOR MACHINES |
2447 | MAP-informed Unrolled Algorithms for Hyper-parameter Estimation |
1815 | MCTE: MARRYING CONVOLUTION AND TRANSFORMER EFFICIENTLY FOR END-TO-END MEDICAL IMAGE SEGMENTATION |
2539 | MDFD: STUDY OF DISTRIBUTED NON-IID SCENARIOS AND FRECHET DISTANCE-BASED EVALUATION |
3436 | MEASURE4DHAND: DYNAMIC HAND MEASUREMENT EXTRACTION FROM 4D SCANS |
2758 | MEGL: MULTI-EXPERTS GUIDED LEARNING NETWORK FOR SINGLE CAMERA TRAINING PERSON RE-IDENTIFICATION |
3261 | MENAS: MULTI-TRIAL EVOLUTIONARY NEURAL ARCHITECTURE SEARCH WITH LOTTERY TICKETS |
1250 | METAGRAD: ADAPTIVE GRADIENT QUANTIZATION WITH HYPERNETWORKS |
2197 | MGT-PC: MEMORY-GUIDED TRANSFORMER FOR ROBUST POINT CLOUD CLASSIFICATION |
2477 | Micro-Expression Recognition with Layered Relations and More Input Frames |
2167 | MINING FALSE POSITIVE EXAMPLES FOR TEXT-BASED PERSON RE-IDENTIFICATION |
2186 | Mitigating Dataset Bias in Image Captioning through CLIP Confounder-free Captioning Network |
2438 | MIX-NET: AUTOMATIC SEGMENTATION OF COVID-19 CT IMAGES BASED ON PARALLEL DESIGN |
2345 | Modality Meets Long-term Tracker: A Siamese Dual Fusion Framework for Tracking UAV |
1765 | MODALITY-AWARE OOD SUPPRESSION USING FEATURE DISCREPANCY FOR MULTI-MODAL EMOTION RECOGNITION |
1816 | MODEL DOCTOR FOR DIAGNOSING AND TREATING SEGMENTATION ERROR |
2212 | Model-agnostic visual explanations via approximate bilinear models |
2563 | Modeling and Interpreting 6-D Object Pose Estimation |
2939 | MODELING HIERARCHICAL TOPOLOGICAL STRUCTURE IN SCIENTIFIC IMAGES WITH GRAPH NEURAL NETWORKS |
2001 | MORE SYNERGY, LESS REDUNDANCY: EXPLOITING JOINT MUTUAL INFORMATION FOR SELF-SUPERVISED LEARNING |
1659 | MOTION PLANE ADAPTIVE MOTION MODELING FOR SPHERICAL VIDEO CODING IN H.266/VVC |
3179 | MQ-CODER INSPIRED ARITHMETIC CODER FOR SYNTHETIC DNA DATA STORAGE |
3347 | MSV-RGNN: MULTISCALE VOXEL GRAPH NEURAL NETWORK FOR 3D OBJECT DETECTION |
1885 | MTJND: MULTI-TASK DEEP LEARNING FRAMEWORK FOR IMPROVED JND PREDICTION |
3078 | MULTI HYBRID EXTRACTOR NETWORK FOR 3D HUMAN POSE ESTIMATION |
1745 | MULTI TASK-BASED FACIAL EXPRESSION SYNTHESIS WITH SUPERVISION LEARNING AND FEATURE DISENTANGLEMENT OF IMAGE STYLE |
2322 | MULTI-CLASSIFICATION OF RETINAL DISEASES USING A PYRAMIDAL ENSEMBLE DEEP FRAMEWORK |
3087 | MULTI-DIMENSIONAL PRUNED SPARSE CONVOLUTION FOR EFFICIENT 3D OBJECT DETECTION |
2852 | MULTI-EXIT VISION TRANSFORMER WITH CUSTOM FINE-TUNING FOR FINE-GRAINED IMAGE RECOGNITION |
2390 | MULTI-LABEL ADVERSARIAL ATTACK BASED ON LABEL CORRELATION |
2418 | MULTILAYER ATTENTION MECHANISM FOR CHANGE DETECTION IN SAR IMAGE SPATIAL-FREQUENCY DOMAIN |
2090 | MULTIMODAL GRAPH SIGNAL DENOISING WITH SIMULTANEOUS GRAPH LEARNING USING DEEP ALGORITHM UNROLLING |
1415 | MULTI-MODAL HIERARCHICAL ATTENTION-BASED DENSE VIDEO CAPTIONING |
1439 | MULTI-OBJECT TRACKING AS ATTENTION MECHANISM |
1008 | MULTI-OBJECT TRACKING BY ITERATIVELY ASSOCIATING DETECTIONS WITH UNIFORM APPEARANCE FOR TRAWL-BASED FISHING BYCATCH MONITORING |
2723 | MULTIPLE DESCRIPTION VIDEO CODING FOR REAL-TIME APPLICATIONS USING HEVC |
2651 | MULTI-SCALE DEFORMABLE ALIGNMENT AND CONTENT-ADAPTIVE INFERENCE FOR FLEXIBLE-RATE BI-DIRECTIONAL VIDEO COMPRESSION |
3242 | MULTISCALE REPRESENTATIONS LEARNING TRANSFORMER FRAMEWORK FOR POINT CLOUD CLASSIFICATION |
2225 | Multi-scale temporal feature fusion for few-shot action recognition |
2180 | MULTI-SCALE TRANSFORMER NETWORK FOR SALIENCY PREDICTION ON 360-DEGREE IMAGES |
2412 | MULTI-SEMANTIC ALIGNMENT CO-REASONING NETWORK FOR VIDEO QUESTION ANSWERING |
3500 | Multi-Surface Multi-Technique (MUST) Latent Fingerprint Database |
2986 | MULTI-TASK MODEL BASED ON VISION TASK LEVEL FOR SALIENCY OBJECT DETECTION IN FOGGY CONDITION |
2281 | MULTITHREADED ALGORITHMS FOR LOSSLESS INTRA COMPRESSION OF POINT CLOUD GEOMETRY BASED ON THE SILHOUETTE 3D CODER |
3496 | MULTIVARIATE TIME SERIES IMPUTATION WITH TRANSFORMERS |
2768 | MULTI-VIEW 3D COMPTON IMAGE RECONSTRUCTION WITH A GENERALIZED LIST-MODE MLEM ALGORITHM |
2899 | MULTI-VIEW VARIATIONAL RECURRENT NEURAL NETWORK FOR HUMAN EMOTION RECOGNITION USING MULTI-MODAL BIOLOGICAL SIGNALS |
1256 | MUTUAL RELATIVE POSITION LEARNING TRANSFORMER FOR CROSS-VIEW GEO-LOCALIZATION |
2932 | MUTUALLY SUPERVISED LEARNING VIA INTERACTIVE CONSISTENCY FOR GEOGRAPHIC OBJECT SEGMENTATION FROM WEAKLY LABELED REMOTE SENSING IMAGERY |
2193 | NERD: NEURAL FIELD-BASED DEMOSAICKING |
1190 | NEURAL AUGMENTED EXPOSURE INTERPOLATION FOR HDR IMAGING |
1497 | NEURAL FIELD REAL-TIME TRANSMISSION USING MULTIPLE DESCRIPTION CODING WITH RANDOM POSITION SAMPLING |
2108 | NEURAL GLOBAL ILLUMINATION FOR INVERSE RENDERING |
2795 | NEV-NCD: NEGATIVE LEARNING, ENTROPY, AND VARIANCE REGULARIZATION BASED NOVEL ACTION CATEGORIES DISCOVERY |
3408 | NIGHTTIME HAZE REMOVAL WITH SPATIALLY VARIANT AMBIENT LIGHT AND SALIENCY-WEIGHTED FUSED TRANSMISSION |
2106 | NOISE-AVOIDANCE SAMPLING FOR ANNOTATION MISSING OBJECT DETECTION |
1776 | NONLOCAL LOW-RANK RESIDUAL MODELING FOR IMAGE COMPRESSIVE SENSING RECONSTRUCTION |
2376 | NOVEL ANNOTATION AND METRICS FOR MANGROVE SPECIES CLASSIFICATION USING BOUNDING BOX OBJECT DETECTION |
1681 | NTRANS-NET: A MULTI-SCALE NEUTROSOPHIC-UNCERTAINTY GUIDED TRANSFORMER NETWORK FOR INDOOR DEPTH COMPLETION |
1630 | NUCQ: NON-UNIFORM CONDITIONAL QUANTIZATION FOR LEARNED IMAGE COMPRESSION |
2831 | Object Detection and Counting Challenges in Real Street Monitoring: Case Study of Homeless Encampments |
1478 | OBJECT-CENTRIC VIDEO PREDICTION VIA DECOUPLING OF OBJECT DYNAMICS AND INTERACTIONS |
2152 | OCVOS: Object-Centric Representation for Video Object Segmentation |
2945 | ODD: ONE-CLASS ANOMALY DETECTION VIA THE DIFFUSION MODEL |
2454 | OEST: OUTLIER EXPOSURE BY SIMPLE TRANSFORMATIONS FOR OUT-OF-DISTRIBUTION DETECTION |
1547 | OMISSION-FREE INPAINTING: A THREE-STAGE APPROACH TO ENSURE OBJECT GENERATION |
3461 | Omnidirectional Video Super-Resolution using Deep Learning |
2219 | ONDA-DETR: ONLINE DOMAIN ADAPTATION FOR DETECTION TRANSFORMERS WITH SELF-TRAINING FRAMEWORK |
3387 | ONLINE PEDESTRIAN TRACKING USING A DENSE FISHEYE CAMERA NETWORK WITH EDGE COMPUTING |
1890 | OOD ATTACK: GENERATING OVERCONFIDENT OUT-OF-DISTRIBUTION EXAMPLES TO FOOL DEEP NEURAL CLASSIFIERS |
1606 | OPEN-SET RECOGNITION FOR FACIAL-EXPRESSION RECOGNITION |
3136 | OPTICAL CHARACTER RECOGNITION FOR MEDICAL RECORDS DIGITIZATION WITH DEEP LEARNING |
1789 | Optimized Coded Aperture Design in Compressive Spectral Imaging via Coherence Minimization |
1850 | OPTIMIZING TRANSFORMER FOR LARGE-HOLE IMAGE INPAINTING |
2887 | OVERLAP LOSS: RETHINKING WEAKLY SUPERVISED INSTANCE SEGMENTATION IN CROWDED SCENES |
1274 | PAIRWISE FEATURE LEARNING FOR UNSEEN PLANT DISEASE RECOGNITION |
1881 | PALMPRINT ANTI-SPOOFING BASED ON DOMAIN-ADVERSARIAL TRAINING AND ONLINE TRIPLET MINING |
2870 | PANCREATIC CANCER DETECTION USING HYPERSPECTRAL IMAGING AND MACHINE LEARNING |
1874 | Parallel Gradient Blend for Class Incremental Learning |
1898 | Parameter-efficient Vision Transformer with Linear Attention |
3135 | PART AWARE GRAPH CONVOLUTION NETWORK WITH TEMPORAL ENHANCEMENT FOR SKELETON-BASED ACTION RECOGNITION |
3011 | PARTS BASED ATTENTION FOR HIGHLY OCCLUDED PEDESTRIAN DETECTION WITH TRANSFORMERS |
1289 | PAST INFORMATION AGGREGATION FOR MULTI-PERSON TRACKING |
1653 | Patch-wise Auto-Encoder for Visual Anomaly Detection |
3454 | PERCEPTION-ORIENTED OMNIDIRECTIONAL IMAGE SUPER-RESOLUTION BASED ON TRANSFORMER NETWORK |
1849 | PFC-UNIT: UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION WITH PRE-TRAINED FINE-GRAINED CLASSIFICATION |
1215 | PFTA-Net: Progressive Feature Alignment and Temporal Attention Fusion Networks for Video Inpainting |
2129 | PHYSICS-INFORMED DEEP DEBLURRING: OVER-PARAMETERIZED VS. UNDER-PARAMETERIZED |
1242 | PL-UNEXT: PER-STAGE EDGE DETAIL AND LINE FEATURE GUIDED SEGMENTATION FOR POWER LINE DETECTION |
1185 | Point Cloud Denoising via Momentum Ascent in Gradient Fields |
2724 | POINT CLOUD GEOMETRY AND COLOR CODING IN A LEARNING-BASED ECOSYSTEM FOR JPEG CODING STANDARDS |
3144 | POINT CLOUD UPSAMPLING WITH DYNAMIC GRAPH SCATTERING TRANSFORM |
3482 | POINTPCA+: EXTENDING POINTPCA OBJECTIVE QUALITY ASSESSMENT METRIC |
1217 | POLSAR IMAGE CLASSIFICATION BASED-ON SEMI-SUPERVISED POLARIMETRIC FEATURE SELECTION |
3490 | Positronium lifetime image reconstruction for TOF PET |
3139 | PREDICTING MECHANICAL PROPERTIES OF CARBON NANOTUBE (CNT) IMAGES USING MULTI-LAYER SYNTHETIC FINITE ELEMENT MODEL SIMULATIONS |
2854 | PREDICTION OF DEEP ICE LAYER THICKNESS USING ADAPTIVE RECURRENT GRAPH NEURAL NETWORKS |
2840 | PREDICTIVE CODING FOR ANIMATION-BASED VIDEO COMPRESSION |
2126 | PREFAB-GEN: AD HOC IMAGE GENERATION FOR PRE-MANUFACTURING OF TIRES USING IMAGE-TO-IMAGE TRANSLATION |
2749 | PRE-TRAINING WITH FRACTAL IMAGES FACILITATES LEARNED IMAGE QUALITY ESTIMATION |
1505 | PRNET: A PROGRESSIVE REGRESSION NETWORK FOR NO-REFERENCE USER-GENERATED-CONTENT (UGC) VIDEO QUALITY ASSESSMENT |
2600 | PROCESSING ENERGY MODELING FOR NEURAL NETWORK BASED IMAGE COMPRESSION |
2714 | Product Image Representation Learning on Large Scale Noisy Datasets |
2968 | PROGRESSIVE MIXUP AUGMENTED TEACHER-STUDENT LEARNING FOR UNSUPERVISED DOMAIN ADAPTATION |
2116 | PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION |
2929 | PROGRESSIVE REFINEMENT LEARNING BASED ON FEATURE INTERACTIVE FUSION FOR SEMANTIC SEGMENTATION OF REMOTE SENSING LIMITED DATASET |
3128 | PROMPT PROTOTYPE LEARNING BASED ON RANKING INSTRUCTION FOR FEW-SHOT VISUAL TASKS |
1690 | PSCO: A POINT CLOUD SCENE CLASSIFICATION MODEL BASED ON CONTRAST LEARNING |
1320 | PSEUDO LABELS REFINEMENT WITH INTRA-CAMERA SIMILARITY FOR UNSUPERVISED PERSON RE-IDENTIFICATION |
1037 | PS-NERV: PATCH-WISE STYLIZED NEURAL REPRESENTATIONS FOR VIDEOS |
2726 | PUSHING THE LIMITS OF THE WIENER FILTER IN IMAGE DENOISING |
2163 | PYRAMID MASKED IMAGE MODELING FOR TRANSFORMER-BASED AERIAL OBJECT DETECTION |
2574 | PYRAMID TRANSFORMER DRIVEN MULTIBRANCH FUSION FOR POLYP SEGMENTATION IN COLONOSCOPIC VIDEO IMAGES |
2122 | QUANTIFIABLE ROBUSTNESS ESTIMATION FOR OBJECT DETECTION WITH CNNS USING INTRINSIC DIMENSIONALITY |
1472 | Query by Activity Video in the Wild |
2023 | Query-based Video Summarization with Pseudo Label Supervision |
1925 | QVRF: A QUANTIZATION-ERROR-AWARE VARIABLE RATE FRAMEWORK FOR LEARNED IMAGE COMPRESSION |
2682 | RADAR HRRP UNSEEN CLASS RECOGNITION BASED ON THE JOINT DICTIONARY LEARNING |
3494 | RAY-SPACE MOTION COMPENSATION FOR LENSLET PLENOPTIC VIDEO CODING |
2586 | RDEPD: RE-EXPLORING DEPTH ESTIMATION FOR PEDESTRIAN DETECTION |
2877 | REALIZATION OF DIGRAPH FILTERS VIA AUGMENTED GFT |
3480 | REAL-TIME DRONE DETECTION AND TRACKING IN DISTORTED INFRARED IMAGES |
1374 | REAL-TIME SUPERMARKET PRODUCT RECOGNITION ON MOBILE DEVICES USING SCALABLE PIPELINES |
2010 | REAL-TIME WHEEL DETECTION AND RIM CLASSIFICATION IN AUTOMOTIVE PRODUCTION |
1017 | REAPER: ARTICULATED OBJECT 6D POSE ESTIMATION WITH DEEP REINFORCEMENT LEARNING |
2755 | RECOVERING QUALITY SCORES IN NOISY PAIRWISE SUBJECTIVE EXPERIMENTS USING NEGATIVE LOG-LIKELIHOOD |
2837 | RECTANGULAR-OUTPUT IMAGE STITCHING |
1691 | REDUCED COMPLEXITY MULTISCALE CNN FOR IN-LOOP VIDEO RESTORATION |
2549 | Regularizing Neural Radiance Fields from Sparse RGB-D Inputs |
1516 | REPRESENTATION LEARNING OF VERTEX HEATMAPS FOR 3D HUMAN MESH RECONSTRUCTION FROM MULTI-VIEW IMAGES |
2894 | RESIDENTIAL EXTRACTION BASED ON WEAKLY-SUPERVISED SIMILARITY-AWARE MULTI-SOURCE ALIGNMENT STRATEGY WITH LIMITED SAR DATA |
2824 | RESSCAL3D: RESOLUTION SCALABLE 3D SEMANTIC SEGMENTATION |
2093 | RESTORABLE VISIBLE AND INFRARED IMAGE FUSION |
1855 | RESTORATION OF EXTREMELY COMPRESSED BACKGROUND FOR VCM USING GUIDED GENERATIVE PRIORS |
1389 | RETHINKING LONG-TAILED VISUAL RECOGNITION WITH DYNAMIC PROBABILITY SMOOTHING AND FREQUENCY WEIGHTED FOCUSING |
2787 | RETINEX-BASED IMAGE DENOISING / CONTRAST ENHANCEMENT USING GRADIENT GRAPH LAPLACIAN REGULARIZER |
1823 | RETRIEVE THE VISIBLE FEATURE TO IMPROVE THERMAL PEDESTRIAN DETECTION USING DISCREPANCY PRESERVING MEMORY NETWORK |
1540 | REUSE NON-TERRAIN POLICIES FOR LEARNING TERRAIN-ADAPTIVE HUMANOID LOCOMOTION SKILLS |
2199 | REVISITING MODALITY IMBALANCE IN MULTIMODAL PEDESTRIAN DETECTION |
2800 | Revolutionizing Thermal Imaging: GAN-based Vision Transformers for Image Enhancement |
2808 | RFID-ASSISTED VISUAL MULTIPLE OBJECT TRACKING WITHOUT USING VISUAL APPEARANCE AND MOTION |
1292 | RINGING ARTIFACT REDUCTION METHOD FOR ULTRASOUND RECONSTRUCTION USING MULTI-AGENT CONSENSUS EQUILIBRIUM |
2535 | ROBUST BOUNDING BOX REGRESSION FOR SMALL OBJECT DETECTION |
1753 | ROBUST FACE ANTI-SPOOFING FRAMEWORK WITH CONVOLUTIONAL VISION TRANSFORMER |
2487 | ROBUST FEATURE LEARNING AGAINST NOISY LABEL |
1278 | Robust graph neural diffusion for image matching |
3014 | ROBUST GRAPH-BASED SEGMENTATION OF NOISY POINT CLOUDS |
2227 | ROBUST MULTISPECTRAL PEDESTRIAN DETECTION VIA SPECTRAL POSITION-FREE FEATURE MAPPING |
3312 | ROBUST NUCLEUS CLASSIFICATION WITH ITERATIVE GRAPH REPRESENTATIONAL LEARNING |
1451 | Robust RGB-T tracking via consistency regulated scene perception |
1760 | ROBUST WIND TURBINE BLADE SEGMENTATION FROM RGB IMAGES IN THE WILD |
1680 | ROTATION XGBOOST BASED METHOD FOR HYPERSPECTRAL IMAGE CLASSIFICATION WITH LIMITED TRAINING SAMPLES |
2706 | RSFDM-NET: REAL-TIME SPATIAL AND FREQUENCY DOMAINS MODULATION NETWORK FOR UNDERWATER IMAGE ENHANCEMENT |
2393 | SANDWICHED VIDEO COMPRESSION: EFFICIENTLY EXTENDING THE REACH OF STANDARD CODECS WITH NEURAL WRAPPERS |
3069 | SAR TARGET EXTRACTION BASED ON SALIENCY-GUIDED CROSS-DOMAIN DISCREPANCY ALIGNMENT STRATEGY |
3223 | SATPLATE: A GERMANY LICENSE PLATE DETECTION DATASET AND BASELINES |
3253 | SCAPEGOAT GENERATION FOR PRIVACY PROTECTION FROM DEEPFAKE |
1204 | SCENE FLOW ESTIMATION FROM POINT CLOUDS WITH CONTRASTIVE LOSS AND DUAL PSEUDO LABELS |
1507 | SCENE TEXT RECOGNITION MODELS EXPLAINABILITY USING LOCAL FEATURES |
1470 | SCENE TEXT SEGMENTATION BY PAIRED DATA SYNTHESIS |
1131 | SCORE-BASED DIFFUSION MODELS FOR BAYESIAN IMAGE RECONSTRUCTION |
2176 | SCRATCHHOI: TRAINING HUMAN-OBJECT INTERACTION DETECTORS FROM SCRATCH |
2205 | SDAT-FORMER: FOGGY SCENE SEMANTIC SEGMENTATION VIA A STRONG DOMAIN ADAPTATION TEACHER |
2300 | SDWD: STYLE DIVERSITY WEIGHTED DISTANCE EVALUATES THE INTRA-CLASS DATA DIVERSITY OF DISTRIBUTED GANS |
2671 | SEGMENTATION AND CLASSIFICATION-BASED DIAGNOSIS OF TUMORS FROM BREAST ULTRASOUND IMAGES USING MULTIBRANCH UNET |
1869 | Segmentation of the Left Ventricle by SDD double threshold selection and CHT |
1864 | SEGMENTATION OF THE LEFT VENTRICLE FOR THE CARDIAC PHASES BETWEEN END-DIASTOLE AND END-SYSTOLE |
2646 | SELECTING A DIVERSE SET OF AESTHETICALLY-PLEASING AND REPRESENTATIVE VIDEO THUMBNAILS USING REINFORCEMENT LEARNING |
2524 | SELF ADAPTIVE GLOBAL-LOCAL FEATURE ENHANCEMENT FOR RADIOLOGY REPORT GENERATION |
2756 | SELF PATCH LABELING USING QUALITY DISTRIBUTION ESTIMATION FOR CNN-BASED 360-IQA TRAINING |
1928 | SELF-COMPENSATING LEARNING FOR FEW-SHOT SEGMENTATION |
2987 | Self-enhanced training framework for referring expression grounding |
1511 | SELF-REINFORCING FOR FEW-SHOT MEDICAL IMAGE SEGMENTATION |
1058 | SELF-SUPERVISED 3D SKELETON REPRESENTATION LEARNING WITH ACTIVE SAMPLING AND ADAPTIVE RELABELING FOR ACTION RECOGNITION |
1742 | SELF-SUPERVISED CONTRASTIVE LEARNING FOR AUDIO-VISUAL ACTION RECOGNITION |
3224 | SELF-SUPERVISED DENOISING OF OPTICAL COHERENCE TOMOGRAPHY WITH INTER-FRAME REPRESENTATION |
2529 | SELF-SUPERVISED FOCUS MEASURE FUSING FOR DEPTH ESTIMATION FROM COMPUTER-GENERATED HOLOGRAMS |
1626 | Self-supervised Learning for Context-independent DfD Network using Multi-view Rank Supervision |
2764 | SELF-SUPERVISED LEARNING FOR SCANNED HALFTONE CLASSIFICATION WITH NOVEL AUGMENTATION TECHNIQUES |
1016 | SEMANTIC AND INSTANCE-AWARE PIXEL-ADAPTIVE CONVOLUTION FOR PANOPTIC SEGMENTATION |
1805 | SEMANTIC CIRCLE DETECTION AND CIRCLE-INNER SEGMENTATION FOR TREE-WISE CITRUS SUMMER SHOOT MANAGEMENT IN AERIAL IMAGES |
1664 | SEMANTIC LEARNING NETWORK FOR CONTROLLABLE VIDEO CAPTIONING |
2869 | SEMANTIC MAPPING OF INCREMENTAL 3D POINT CLOUDS BASED ON MULTI-HOP GRAPH ATTENTION NETWORK |
3255 | Semantic Scene Completion with Point Cloud Representation and Transformer-based Feature Fusion |
2577 | SEMANTIC-EMBEDDED KNOWLEDGE ACQUISITION AND REASONING FOR IMAGE SEGMENTATION |
1346 | SEM-CS: SEMANTIC CLIPSTYLER FOR TEXT-BASED IMAGE STYLE TRANSFER |
1642 | SEM-FCNET: SEMANTIC FEATURE ENHANCEMENT AND FULLY CONVOLUTIONAL NETWORK MODEL FOR REMOTE SENSING OBJECT DETECTION |
1018 | SEMI-SUPERVISED CONTRASTIVE LEARNING OF GLOBAL AND LOCAL REPRESENTATION FOR 3D MEDICAL IMAGE SEGMENTATION |
2084 | SEMI-SUPERVISED FEW-SHOT SEGMENTATION WITH NOISY SUPPORT IMAGES |
1727 | SGSR: A Saliency-Guided Image Super-Resolution Network |
2237 | SIAMCLIM: TEXT-BASED PEDESTRIAN SEARCH VIA MULTI-MODAL SIAMESE CONTRASTIVE LEARNING |
1161 | Siamese Network Representation for Active Learning |
3474 | SIMPLE BASELINES FOR PROJECTION-BASED FULL-REFERENCE AND NO-REFERENCE POINT CLOUD QUALITY ASSESSMENT |
1612 | SIMPLE SELF-DISTILLATION LEARNING FOR NOISY IMAGE CLASSIFICATION |
1570 | SIMULTANEOUS WATERMARKING AND DRACO 3D OBJECT COMPRESSION METHOD |
3426 | SINGLE IMAGE LDR TO HDR CONVERSION USING CONDITIONAL DIFFUSION |
2567 | SINGLE-DOMAIN GENERALIZATION FOR SEMANTIC SEGMENTATION VIA DUAL-LEVEL DOMAIN AUGMENTATION |
1111 | SINGLE-IMAGE HDR RECONSTRUCTION BASED ON TWO-STAGE GAN STRUCTURE |
1821 | SINGLE-STAGE HEAVY-TAILED FOOD CLASSIFICATION |
3159 | SKELETON ACTION RECOGNITION BASED ON SPATIO-TEMPORAL FEATURES |
1610 | SKETCHFFUSION: SKETCH-GUIDED IMAGE EDITING WITH DIFFUSION MODEL |
2802 | SMOOTH AND STEPWISE SELF-DISTILLATION FOR OBJECT DETECTION |
2036 | SOFT-INTROVAE FOR CONTINUOUS LATENT SPACE IMAGE SUPER-RESOLUTION |
3442 | SPATIAL-FREQUENCY NETWORK FOR THE SEGMENTATION OF REMOTE SENSING IMAGES |
2159 | SPATIALLY-ADAPTIVE LEARNING-BASED IMAGE COMPRESSION WITH HIERARCHICAL MULTI-SCALE LATENT SPACES |
2541 | SPATIAL-TEMPORAL TRANSFORMER NETWORK FOR HUMAN MOCAP DATA RECOVERY |
2007 | SPATIO-TEMPORAL PERCEPTION-DISTORTION TRADE-OFF IN LEARNED VIDEO SR |
3103 | SPECTRAL GROUPING DRIVEN HYPERSPECTRAL SUPER-RESOLUTION |
1716 | SPIKING GLOM: BIO-INSPIRED ARCHITECTURE FOR NEXT-GENERATION OBJECT RECOGNITION |
2786 | STAGE OF DECAY ESTIMATION EXPLOITING EXOGENOUS AND ENDOGENOUS IMAGE ATTRIBUTES TO MINIMIZE MANUAL LABELING EFFORTS AND MAXIMIZE CLASSIFICATION PERFORMANCE |
3324 | STANet: Spatiotemporal Adaptive Network For Remote Sensing Images |
1767 | ST-MFNET MINI: KNOWLEDGE DISTILLATION-DRIVEN FRAME INTERPOLATION |
2083 | STRENGTHENING DEEP LEARNING MODEL FOR ROBUST SCREENING OF VOLUMETRIC CHEST RADIOGRAPHIC SCANS |
2402 | STRUCTURE-AWARE GENERATIVE ADVERSARIAL NETWORK FOR TEXT-TO-IMAGE GENERATION |
1822 | STYLE TRANSFER BETWEEN MICROSCOPY AND MAGNETIC RESONANCE IMAGING VIA GENERATIVE ADVERSARIAL NETWORK IN SMALL SAMPLE SIZE SETTINGS |
2735 | SUBJECTIVE ASSESSMENT OF THE IMPACT OF A CONTENT ADAPTIVE OPTIMISER FOR COMPRESSING 4K HDR CONTENT WITH AV1 |
2948 | SUBJECTIVE QUALITY ASSESSMENT OF ENHANCED RETINAL IMAGES |
2148 | SUPER-RESOLUTION OF BVOC MAPS BY ADAPTING DEEP LEARNING METHODS |
1234 | SWINAT-UNET: A NEW BACKBONE FOR PRECIPITATION NOWCASTING |
2161 | TAMM: A TASK-ADAPTIVE MULTI-MODAL FUSION NETWORK FOR FACIAL-RELATED HEALTH ASSESSMENTS ON 3D FACIAL IMAGES |
2198 | TAQ: TOP-K ATTENTION-AWARE QUANTIZATION FOR VISION TRANSFORMERS |
1083 | TARGET-DISCRIMINABILITY-INDUCED MULTI-SOURCE-FREE DOMAIN ADAPTATION |
3056 | TASK-ADAPTIVE FEATURE MATCHING LOSS FOR IMAGE DEBLURRING |
1034 | TASK-AGNOSTIC OPEN-SET PROTOTYPE FOR FEW-SHOT OPEN-SET RECOGNITION |
1437 | TASK-AWARE GRAPH CONVOLUTIONAL NETWORK FOR ACTIVE LEARNING |
3365 | TEACHER-STUDENT NETWORK FOR REAL-WORLD FACE SUPER-RESOLUTION WITH PROGRESSIVE EMBEDDING OF EDGE INFORMATION |
1406 | TEAM DETR: GUIDE QUERIES AS A PROFESSIONAL TEAM IN DETECTION TRANSFORMERS |
1455 | Tell Your Story: Text-Driven Face Video Synthesis With High Diversity via Adversarial Learning |
1319 | TEXT-GUIDED FACIAL IMAGE MANIPULATION FOR WILD IMAGES VIA MANIPULATION DIRECTION-BASED LOSS |
3266 | THE ELLIPTIC ENERGY LOSS FOR ROTATED OBJECT DETECTION IN AERIAL IMAGES |
2846 | THE FIRST COMPREHENSIVE DATASET WITH MULTIPLE DISTORTION TYPES FOR VISUAL JUST-NOTICEABLE DIFFERENCES |
3471 | THE FIRST PLACE SOLUTION FOR ICIP2023 CHALLENGE INFRARED IMAGING-BASED DRONE DETECTION AND TRACKING IN DISTORTED SURVEILLANCE VIDEOS |
1829 | THE MULTIVARIATE TRANSFORMER NETWORK FOR MILD COGNITIVE IMPAIRMENT IDENTIFICATION |
2064 | The Oil and Water Separation Phenomenon Inspired Loss for Feature Learning |
2647 | THERMAL INFRARED GUIDED COLOR IMAGE DEHAZING |
1764 | Token-consistent Dropout for Calibrated Vision Transformers |
1399 | Towards Modeling 3D Dense Shape Correspondence from Category-Specific Multi-View Images |
1786 | TOWARDS QUERY EFFICIENT AND GENERALIZABLE BLACK-BOX FACE RECONSTRUCTION ATTACK |
2025 | Towards Robustness: Enhancing Deep Learning Models through Meta-Learning and Bilevel Optimization for Accurate Car Damage Classification |
3271 | TP-YOLO: A Lightweight Attention-based Architecture for Tiny Pest Detection |
1251 | TR3D: TOWARDS REAL-TIME INDOOR 3D OBJECT DETECTION |
3489 | TRACKING AIDED DRONE BIRD CLASSIFICATION USING YOLO AND LSTM |
3085 | TRAINING CARTOONIZATION NETWORK WITHOUT CARTOON |
1737 | TRAINING-FREE LOCATION-AWARE TEXT-TO-IMAGE SYNTHESIS |
1410 | TRANSBUILDING: AN END-TO-END POLYGONAL BUILDING EXTRACTION WITH TRANSFORMERS |
1220 | TRANSFORMATION CONSISTENCY FOR REMOTE SENSING IMAGE SUPER-RESOLUTION |
2921 | TRANSFORMER-BASED VARIABLE-RATE IMAGE COMPRESSION WITH REGION-OF-INTEREST CONTROL |
1541 | TRANSFORMING MULTIDIMENSIONAL DATA INTO IMAGES TO OVERCOME THE CURSE OF DIMENSIONALITY |
1684 | TRANSPOINTFLOW: LEARNING SCENE FLOW FROM POINT CLOUDS WITH TRANSFORMER |
3004 | TRG-DQA: TEXTURE RESIDUAL-GUIDED DEHAZED IMAGE QUALITY ASSESSMENT |
2623 | TRICKVOS: A BAG OF TRICKS FOR VIDEO OBJECT SEGMENTATION |
2250 | Truncated Weighted Nuclear Norm Regularization and Sparsity for Image Denoising |
2081 | TSANET: TEMPORAL AND SCALE ALIGNMENT FOR UNSUPERVISED VIDEO OBJECT SEGMENTATION |
3147 | TSFC: TEXTURE AND STRUCTURE FEATURES COUPLING FOR IMAGE INPAINTING |
2414 | ULCOMPRESS: A UNIFIED LOW BIT-RATE IMAGE COMPRESSION FRAMEWORK VIA INVERTIBLE IMAGE REPRESENTATION |
2636 | UNCERTAINTY AWARE IMPLICIT IMAGE FUNCTION FOR ARBITRARY-SCALE SUPER-RESOLUTION |
1911 | UNDERWATER IMAGE ENHANCEMENT AND SUPER-RESOLUTION USING IMPLICIT NEURAL NETWORKS |
1231 | UNIFIED LEARNING-BASED LOSSY AND LOSSLESS JPEG RECOMPRESSION |
1361 | Unknown Class Feature Transformation for Open Set Domain Adaptation without Source Data |
2791 | UNROLLED IPPG: VIDEO HEART RATE ESTIMATION VIA UNROLLING PROXIMAL GRADIENT DESCENT |
2134 | UNSUPERVISED ANOMALY DETECTION USING VARIATIONAL AUTOENCODER WITH GAUSSIAN RANDOM FIELD PRIOR |
1794 | UNSUPERVISED ANOMALY DETECTION WITH LOCAL-SENSITIVE VQVAE AND GLOBAL-SENSITIVE TRANSFORMERS |
2527 | UNSUPERVISED DEEP HASHING WITH DEEP SEMANTIC DISTILLATION |
3370 | UNSUPERVISED DOMAIN ADAPTATION WITH IMBALANCED CHARACTER DISTRIBUTION FOR SCENE TEXT RECOGNITION |
2985 | UNSUPERVISED DOMAIN ADAPTIVE LEARNING FOR IMAGE DESNOWING WITH REAL-WORLD DATA |
1636 | UNSUPERVISED DOMAIN ADAPTIVE PERSON RE-IDENTIFICATION WITH ADAPTIVE STRUCTURE LEARNING |
2721 | USGG: UNION MESSAGE BASED SCENE GRAPH GENERATION |
3182 | Using classifier discrepancy for cross-domain image retrieval |
2441 | USURP: UNIVERSAL SINGLE-SOURCE ADVERSARIAL PERTURBATIONS ON MULTIMODAL EMOTION RECOGNITION |
2346 | UT-GAN: A NOVEL UNPAIRED TEXTUAL-ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR LOW-LIGHT TEXT IMAGE ENHANCEMENT |
3444 | UTILIZING SUPER-RESOLUTION FOR ENHANCED AUTOMOTIVE RADAR OBJECT DETECTION |
3449 | VARIATIONAL DEEP ATMOSPHERIC TURBULENCE CORRECTION FOR VIDEO |
2866 | VARIATIONAL FEATURE DISENTANGLEMENT FOR FEW-SHOT DOMAIN ADAPTATION |
1096 | Video Question Answering using Clip-guided Visual-text Attention |
1173 | VIDEO SUMMARIZATION THROUGH FINE-GRAINED HIERARCHICAL MODELING WITH MULTI-DIMENSIONAL FEATURES |
2915 | VIDEO SUPER-RESOLUTION VIA EVENT-DRIVEN TEMPORAL ALIGNMENT |
2359 | VIDEO-MUSIC RETRIEVAL WITH FINE-GRAINED CROSS-MODAL ALIGNMENT |
2650 | VIDEO-SWINUNET: SPATIO-TEMPORAL DEEP LEARNING FRAMEWORK FOR VFSS INSTANCE SEGMENTATION |
1717 | VISUAL AND SPATIAL CONTEXT FUSION FOR IMPLICIT HUMAN RECONSTRUCTION |
1902 | VIVA: A Variational Image Vectorization Algorithm on Dual-Primal Graph Pairs |
2615 | WAVELET-BASED FREQUENCY-DIVIDING INTERACTIVE CNN FOR IMAGE CLASSIFICATION |
2856 | WCANET: WAVELET CHANNEL ATTENTION NETWORK FOR CITRUS VARIETY IDENTIFICATION |
3006 | WEAKLY SEMI-SUPERVISED ORIENTED OBJECT DETECTION WITH POINTS |
2585 | WEAKLY SUPERVISED DISENTANGLEMENT WITH TRIPLET NETWORK |
1759 | WEIGHTED ANISOTROPIC–ISOTROPIC TOTAL VARIATION FOR POISSON DENOISING |
3235 | WHAT MODALITY MATTERS? EXPLOITING HIGHLY RELEVANT FEATURES FOR VIDEO ADVERTISEMENT INSERTION |
1203 | WHEN VISIBLE-TO-THERMAL FACIAL GAN BEATS CONDITIONAL DIFFUSION |
2796 | XI-NET: TRANSFORMER BASED SEISMIC WAVEFORM RECONSTRUCTOR |
1674 | X-Ray spectral estimation using Dictionary Learning |
3062 | YOLO-MAXVOD FOR REAL-TIME VIDEO OBJECT DETECTION |
3488 | YOLOV7 FOR MOSQUITO BREEDING GROUNDS DETECTION AND TRACKING |
1671 | YOU ONLY NEED 80K PARAMETERS TO ENHANCE IMAGE: LEARNING PERIODIC FEATURES FOR IMAGE ENHANCEMENT |
2343 | Zero-shot Human-Object Interaction (HOI) Classification by Bridging Generative and Contrastive Image-Language Models |
1936 | ZERO-SHOT HYPERSPECTRAL IMAGE DENOISING WITH SELF-COMPLETION WITH PATTERNED MASKS |
2751 | ZREC: ROBUST RECOVERY OF MEAN AND PERCENTILE OPINION SCORES |