List of Accepted Papers

Following is the list of accepted ICIP 2024 papers, sorted by paper title. You can use the search feature of your web browser to find your paper number. Notifications to all authors have also been sent by email. If you have not received your notification of the results by email, please contact us at papers@2024.ieeeicip.org.

Paper Number Paper Title
11483D Clothed Human Reconstruction From One In-the-wild RGB Image
10483D SEMANTIC SCENE COMPLETION FROM A DEPTH MAP WITH UNSUPERVISED LEARNING FOR SEMANTICS PRIORITISATION
25733D-COCO: EXTENSION OF MS-COCO DATASET FOR SCENE UNDERSTANDING AND 3D RECONSTRUCTION
21263DLaneFormer: Rethinking Learning Views for 3D Lane Detection
20073F-PnP: Compressive Sensing using Nonlocal Self-Similarity and Deep Learning Priors
1616A 1D PLUG-AND-PLAY SYNTHETIC DATA DEEP LEARNING FOR UNDERSAMPLED MAGNETIC RESONANCE IMAGE RECONSTRUCTION
2265A benchmark of variance of opinion scores in image quality assessment
2832A Channel-Wise Multi-Scale Network for Single Image Super-Resolution
1896A CNN-TRANSFORMER NETWORK BASED SNR GUIDED HIGH FREQUENCY RECONSTRUCTION FOR LOW LIGHT IMAGE ENHANCEMENT
1801A comparative study of perceptual quality metrics for audio-driven talking head videos
1119A CONFIDENCE-AWARE MATCHING STRATEGY FOR GENERALIZED MULTI-OBJECT TRACKING
1085A CONTEXT-ORIENTED MULTI-SCALE NEURAL NETWORK FOR FIRE SEGMENTATION
2136A CROSS DOMAIN GENERATIVE NETWORK FOR ACCELERATED MRI
2567A DATASET FOR UNDERSTANDING OPEN UGC VIDEO DATASETS
1989A DECODING SCHEME WITH SUCCESSIVE AGGREGATION OF MULTI-LEVEL FEATURES FOR LIGHT-WEIGHT SEMANTIC SEGMENTATION
1588A DICTIONARY BASED APPROACH FOR REMOVING OUT-OF-FOCUS BLUR
1294A DUAL-DOMAIN COLLABORATION NETWORK FOR VCS RECONSTRUCTION
2662A FUSION-BASED APPROACH FOR BLIND CONTRAST-ENHANCED IMAGE RANKING
1571A HARD CONVEX-SHAPE CONSTRAINT IN DNNS FOR OBJECT SEGMENTATION
2439A HUE-PRESERVING CONTRAST ENHANCEMENT METHOD USING HISTOGRAM SPECIFICATION FOR EACH RGB COMPONENT
2147A Large-capacity data hiding scheme in encrypted VVC video
1700A LEARNABLE RADAR IMAGING PARADIGM DRIVEN BY DEEP GENERATIVE MODEL
1477A MODULAR AND ROBUST PHYSICS-BASED APPROACH FOR LENSLESS IMAGE RECONSTRUCTION
1261A MULTI-MODALITY FEATURE ENHANCEMENT METHOD BASED ON FEATURE DISENTANGLEMENT FOR SAR IMAGE TARGET DETECTION
1745A MULTI-SCALE FEATURE FUSION NETWORK FOR CHIP SURFACE DEFECT DETECTION
2437A NEEDLE IN A (MEDICAL) HAYSTACK: DETECTING A BIOPSY NEEDLE IN ULTRASOUND IMAGES USING VISION TRANSFORMERS
2533A Neuroimaging YOLOv8-Based CAD Framework for Anosmia Grading in COVID-19
2838A New Approach in Automated Fingerprint Presentation Attack Detection Using Optical Coherence Tomography
2434A NEW EFFICIENT SPLIT & MERGE ALGORITHM FOR EMBEDDED SYSTEMS
1696A new fingerprinting technique for engraved binary matrix authentication
1282A NEW PEOPLE-OBJECT INTERACTION DATASET AND NVS BENCHMARKS
2731A NOVEL APPROACH FOR 3D RENAL SEGMENTATION USING A MODIFIED GAN MODEL AND TEXTURE ANALYSIS
1753A Novel architecture for image vectorization with increasing granularity
2466A Practical Calibration Method for Cameras and Multiple Line-Lasers in Light Sectioning Systems for Underwater Environments
2449A PRECONDITIONING APPROACH TO OPTIMIZING SENSING MATRIX FOR IMPROVED COMPRESSED SENSING CT RECONSTRUCTION
1966A REAL-WORLD SATELLITE VIDEO SUBJECTIVE QOE DATABASE
2327A SELF-SUPERVISED DIFFUSION FRAMEWORK FOR FACIAL EMOTION RECOGNITION
1596A SINGLE GRAPH CONVOLUTION IS ALL YOU NEED: EFFICIENT GRAYSCALE IMAGE CLASSIFICATION
2087A Sparse Graph Formulation for Efficient Spectral Image Segmentation
2531A SPATIO-TEMPORAL ALIGNED SUNET MODEL FOR LOW-LIGHT VIDEO ENHANCEMENT
1747A STATISTICAL IMAGE REALISM SCORE FOR DEEPFAKE DETECTION
1701A STUDY ON THE EFFECT OF COLOR SPACES IN LEARNED IMAGE COMPRESSION
2116A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality
2424A Text Detector Based On The Specific Text Prompt
2241A TOOLKIT TO BENCHMARK POINT CLOUD QUALITY METRICS WITH MULTI-TRACK EVALUATION CRITERIA
2392A Trustworthy Authentication Against Visual Master Face Dictionary Attacks (Trauma)
1913AAGF: AN EFFICIENT TRANSFORMER WITH MIX-FEATURES FOR VISUAL PLACE RECOGNITION
2411ACCELERATING CASCADE CLASSIFIER TRAINING WITH GENETIC ALGORITHMS FOR EDGE ML APPLICATIONS
2370Accurate colon segmentation using 2D convolutional neural networks with 3D contextual information
1731ACML: Attention-based Cross-Modality Learning for Cloth-Changing and Occluded Person Re-Identification
2227ADAPROMPT: PROMPT TUNING WITH ADAPTIVE NEIGHBOURS FOR GENERALIZED CATEGORY DISCOVERY
1590Adaptative Context Normalization: A Boost for Deep Learning in Image Processing
1721ADAPTING LEARNED IMAGE CODECS TO SCREEN CONTENT VIA ADJUSTABLE TRANSFORMATIONS
1969ADAPTIVE ADVERSARIAL CROSS-ENTROPY LOSS FOR SHARPNESS-AWARE MINIMIZATION
2309Adaptive downsampling and spatial upconversion for point cloud compression
2556ADAPTIVE SAMPLING METHOD FOR WHOLE-BODY LOW-DOSE PET RECONSTRUCTION BASED ON RECONSTRUCTION DIFFICULTY
2455ADAPTIVE SPATIAL-TEMPORAL MODELLING FOR HUMAN MOTION PREDICTION
1040ADAPTIVE TILT-SERIES ALIGNMENT WITH FEATURE RESAMPLING IN CRYO-ELECTRON TOMOGRAPHY
1917ADAPTIVELY HIERARCHICAL QUANTIZATION VARIATIONAL AUTOENCODER BASED ON FEATURE DECOUPLING AND SEMANTIC CONSISTENCY FOR IMAGE GENERATION
1435ADAPTRACK: ADAPTIVE THRESHOLDING-BASED MATCHING FOR MULTI-OBJECT TRACKING
1519ADAPTXRAY: VISION TRANSFORMER AND ADAPTER IN X-RAY IMAGES FOR PROHIBITED ITEMS DETECTION
1743ADAVIPRO: REGION-BASED ADAPTIVE VISUAL PROMPT FOR LARGE-SCALE MODELS ADAPTING
1485Advanced Object Detection in Multibeam Forward-looking Sonar Images Using Linear Cross-Attention Techniques
1961ADVANCING COLORECTAL POLYP SEGMENTATION WITH WATERSHED ALGORITHM-ENHANCED PARALLEL SELF-SUPERVISED LEARNING
2195AdvART: Adversarial Art for Camouflaged Object Detection Attacks
1713ADVERSARIAL DETECTION TRANSFORMER FOR KUZUSHIJI RECOGNITION
1469Adversarial EM for Partially-Supervised Image-Quality Enhancement: Application to Low-Dose PET Imaging
1367ADVERSARIAL ROBUSTNESS FOR DEEP METRIC LEARNING
2545ADVERSARIALLY ROBUST CONTINUAL LEARNING WITH ANTI-FORGETTING LOSS
1560AERIAL VIEW RIVER LANDFORM VIDEO SEGMENTATION: A WEAKLY SUPERVISED CONTEXT-AWARE TEMPORAL CONSISTENCY DISTILLATION APPROACH
2110AGENT-GUIDED GAZE ESTIMATION NETWORK BY TWO-EYE ASYMMETRY EXPLORATION
2471AIGCOIQA2024: PERCEPTUAL QUALITY ASSESSMENT OF AI GENERATED OMNIDIRECTIONAL IMAGES
2009AI-GENERATED IMAGE DETECTION WITH WASSERSTEIN DISTANCE COMPRESSION AND DYNAMIC AGGREGATION
1879ALIGNFACE: ENHANCING FACE VERIFICATION MODELS THROUGH ADAPTIVE ALIGNMENT OF POSE, EXPRESSION, AND ILLUMINATION
2188ALL SKELETONS ARE CREATED EQUAL! A DOMAIN ADAPTATION TRANSFORMER TO HANDLE MULTIPLE TOPOLOGIES
2209An Alpha-Divergence Approach to Robust Canonical Correlation Analysis
1852An Anchor-free Contour-based Method for Instance Segmentation
2561AN EXPLAINABLE SPECTRAL ANALYSIS FOR LIGHT FIELD IMAGE QUALITY ASSESSMENT
2532AN IMAGE DECOMPOSITION-GUIDED NETWORK FOR IMAGE INTERPOLATION
1681AN INDOOR SCENE LOCALIZATION METHOD USING GRAPHICAL SUMMARY OF MULTI-VIEW RGB-D IMAGES
2519AN INTERNATIONAL STANDARD FOR ASSESSING TRUSTWORTHINESS IN MEDIA
1565AN INTERPRETABLE DEEP GRAPH NEURAL NETWORK BASED ON ATTENTIONAL MULTI-SCALE FEATURE FUSION FOR FMRI ANALYSIS
1289An Optimal Transport-based Method for Medical Image Generation
2155ANALYZING VISIBLE ARTICULATORY MOVEMENTS IN SPEECH PRODUCTION FOR SPEECH-DRIVEN 3D FACIAL ANIMATION
2415ANOMALY DETECTION FOR THE IDENTIFICATION OF VOLCANIC UNREST IN SATELLITE IMAGERY
2544ANOMALY UNVEILED: SECURING IMAGE CLASSIFICATION AGAINST ADVERSARIAL PATCH ATTACKS
1388APNET: GENERATING PRECISE ANOMALY PRIOR INFORMATION FOR MIXED-SUPERVISED DEFECT DETECTION
2681ARE OBJECTIVE EXPLANATORY EVALUATION METRICS TRUSTWORTHY? AN ADVERSARIAL ANALYSIS
2606ASSESSING VIDEO SHAKINESS: A NOVEL DATA AND PROTOCOLS FRAMEWORK
2078ATAC-NET: ZOOMED VIEW WORKS BETTER FOR ANOMALY DETECTION
2791ATTENTION DOWN-SAMPLING TRANSFORMER, RELATIVE RANKING AND SELF-CONSISTENCY FOR BLIND IMAGE QUALITY ASSESSMENT
1511ATTENTION ENHANCEMENT WITH PARALLEL GROUPS FOR REMOTE SENSING OBJECT DETECTION
2072ATTENTION-BASED FEW-SHOT DIAGNOSIS OF CHEST X-RAYS USING SEMANTIC SIGNATURES
1413ATU-NET: AN ADAPTIVE TRANSFORMATION-BASED U-NET FOR MEDICAL IMAGE SEGMENTATION
2582Automated Segmentation of Lung Regions in 3D CT Scans Using Hybrid Unsupervised-Supervised Models
2821Automatic Point Cloud Registration for 3D Virtualto-Real Registration Using Macro and Micro Structures
1587BAYESIAN BLIND IMAGE DECONVOLUTION USING AN HYPERBOLIC-SECANT PRIOR
2823BAYGO: DECENTRALIZED BAYESIAN LEARNING AND INFORMATION-AWARE GRAPH OPTIMIZATION FRAMEWORK
1882BIDFUSE: HARNESSING BI-DIRECTIONAL ATTENTION WITH MODALITY-SPECIFIC ENCODERS FOR INFRARED-VISIBLE IMAGE FUSION
2779BI-DIRECTIONAL TRACKLET EMBEDDING FOR MULTI-OBJECT TRACKING
1151BINARY-DECOMPOSED VISION TRANSFORMER: COMPRESSING AND ACCELERATING VISION TRANSFORMER BY BINARY DECOMPOSITION
1352BI-PREDICTIVE INTRA BLOCK COPY FOR ENHANCED VIDEO CODING BEYOND VVC
2169BLEND & PREDICT: DOMAIN-ADAPTABLE FEW-SHOT LEARNING FOR MICROSCOPY IMAGING
2139BMT-BENCH: A BENCHMARK SPORTS DATASET FOR VIDEO GENERATION
1547BOX-LEVEL CLASS-BALANCED SAMPLING FOR ACTIVE OBJECT DETECTION
1880BRI3L: A BRIGHTNESS ILLUSION IMAGE DATASET FOR IDENTIFICATION AND LOCALIZATION OF REGIONS OF ILLUSORY PERCEPTION
2311BuRnSNet: BURN REGION SEGMENTATION NETWORK FROM COLOR IMAGES WITH TWO-WAY CNN
2201B-WALK: BERNOULLI PRINCIPLE GUIDED BIASED RANDOM WALK FOR CURVE CONNECTION
2529CAFCT-NET: A CNN-TRANSFORMER HYBRID NETWORK WITH CONTEXTUAL AND ATTENTIONAL FEATURE FUSION FOR LIVER TUMOR SEGMENTATION
1923CAMERA CALIBRATION THROUGH GEOMETRIC CONSTRAINTS FROM ROTATION AND PROJECTION MATRICES
1903CAMOUFLAGED OBJECT DETECTION VIA STYLE TRANSFER-BASED DATA AUGMENTATION
1946CAPTIV8 : A comprehensive large scale CAPsule endoscopy dataset for Integrated diagnosis
2070CASCADING UNKNOWN DETECTION WITH KNOWN CLASSIFICATION FOR OPEN SET RECOGNITION
1304CASeg: CLIP-based Action Segmentation with learnable text prompt
1911Category-Agnostic Pose Estimation for Point Clouds
1281CELL CYCLE STATE PREDICTION USING GRAPH NEURAL NETWORKS
1984CENTERRADARNET: JOINT 3D OBJECT DETECTION AND TRACKING FRAMEWORK USING 4D FMCW RADAR
1771CHARACTERIZATION OF DIM LIGHT RESPONSE IN DVS PIXEL: DISCONTINUITY OF EVENT TRIGGERING TIME
2588ChatGPT and Biometrics: An Assessment of Face Recognition, Gender Detection, and Age Estimation Capabilities
2566Class-Specific Channel Attention for Few Shot Learning
1739ClearDepth: Addressing Depth Distortions Caused by Eyelashes for Accurate Geometric Gaze Estimation on Mobile Devices
2351CLIFS: CLIP-DRIVEN FEW-SHOT LEARNING FOR BAGGAGE THREAT CLASSIFICATION
1617CLIP-BASED COMPOSITION-AWARE IMAGE CROPPING
2274CLIP-MEDFAKE: SYNTHETIC DATA AUGMENTATION WITH AI-GENERATED CONTENT FOR IMPROVED MEDICAL IMAGE CLASSIFICATION
2640CLOUDS AND HAZE CO-REMOVAL BASED ON WEIGHT-TUNED OVERLAP REFINEMENT DIFFUSION MODEL FOR REMOTE SENSING IMAGES
1869CM²-NET: CONTINUAL CROSS-MODAL MAPPING NETWORK FOR DRIVER ACTION RECOGNITION
1920CO2WOUNDS-V2: EXTENDED CHRONIC WOUNDS DATASET FROM LEPROSY PATIENTS
1155COARSE-FINE SPECTRAL-AWARE DEFORMABLE CONVOLUTION FOR HYPERSPECTRAL IMAGE RECONSTRUCTION
1156COARSE-TO-FINE SPATIO-TEMPORAL LUMINANCE-AWARE RECONSTRUCTION FOR HIGH-SPEED MOTION SCENE
1970CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes
2474Collaborative Intelligence for Vision Transformers: A Token Sparsity-Driven Edge-Cloud Framework
2555COMBINING RAFT-BASED STEREO DISPARITY AND OPTICAL FLOW MODELS FOR SCENE FLOW ESTIMATION
2686COMPARISON OF CROWDSOURCING AND LABORATORY SETTINGS FOR SUBJECTIVE ASSSESSMENT OF VIDEO QUALITY AND ACCEPTABILITY & ANNOYANCE
2030COMPETITIVE LEARNING FOR ACHIEVING CONTENT-SPECIFIC FILTERS IN VIDEO CODING FOR MACHINES
2615COMPRESSION-AWARE TUNING FOR COMPRESSING VOLUMETRIC RADIANCE FIELDS
1722COMPUTATIONALLY EFFICIENT KALMAN FILTER FRAMEWORK FOR INTRA-FRAME IMAGE RECONSTRUCTION WITH A ROLLING SHUTTER CAMERA
2042CONDITIONAL OPTIMAL FILTER SELECTION FOR MULTISPECTRAL OBJECT CLASSIFICATION
1197Conditional Past Experience Generation for Dark Continual Learning
1861Confidence Aware Stereo Matching for Realistic Cluttered Scenario
2621CONSTRUCTING AN INTERPRETABLE DEEP DENOISER BY UNROLLING GRAPH LAPLACIAN REGULARIZER
2031Content-Aware Supervision for Diffusion-Based Restoration of Extremely Compressed Background for VCM
1635Context-adaptive Entropy Model With Adapters For Lossless Point Cloud Geometry Compression
1569CONTEXTUALITY HELPS REPRESENTATION LEARNING FOR GENERALIZED CATEGORY DISCOVERY
1509CONTINUAL ROAD-SCENE SEMANTIC SEGMENTATION VIA FEATURE-ALIGNED SYMMETRIC MULTI-MODAL NETWORK
1524CONTOUR-WEIGHTED LOSS FOR CLASS-IMBALANCED IMAGE SEGMENTATION
1323CONTRAST-GUIDED WIREFRAME PARSING
1024Controllable Unsupervised Event-based Video Generation
1420Convex-hull Estimation using XPSNR for Versatile Video Coding
2570CONVOLUTIONAL NEURAL NETWORK WITH LEARNABLE MASKS FOR EIT BASED TACTILE SENSING
1443CORRELATION-AWARE JOINT PRUNING-QUANTIZATION USING GRAPH NEURAL NETWORKS
1170COUNTING REPETITIVE ACTIONS IN EVENT STREAM
2380CROCOS-V1: ENHANCING MASK LEAKAGE AND BOUNDING BOX LOCALIZATION FOR REAL-TIME CROP/WEED INSTANCE SEGMENTATION
1069CROSS-ACTION CROSS-SUBJECT SKELETON ACTION RECOGNITION VIA SIMULTANEOUS ACTION-SUBJECT LEARNING WITH TWO-STEP FEATURE REMOVAL
1994CROSS-DOMAIN FEW-SHOT IN-CONTEXT LEARNING FOR ENHANCING TRAFFIC SIGN RECOGNITION
2175CROSS-FUSION OF BAND-SPECIFIC SPECTRAL FEATURES FOR MULTI-BAND NIR COLORIZATION
2740CROSS-MODAL ALIGNMENT OF LOCAL AND GLOBAL FEATURES FOR ZERO-SHOT CHINESE CHARACTER RECOGNITION
1041CROWDASSIGN: A LABEL ASSIGNMENT SCHEME FOR PEDESTRIAN DETECTION IN CROWDED SCENES
2542CST-YOLO: A NOVEL METHOD FOR BLOOD CELL DETECTION BASED ON IMPROVED YOLOV7 AND CNN-SWIN TRANSFORMER
1321DALSM: A DIRECTION-AWARE LINE SEGMENT MATCHING METHOD
2368DAPlankton: Benchmark Dataset for Multi-instrument Plankton Recognition via Fine-grained Domain Adaptation
2574DCCM: Dual Data Consistency Guided Consistency Model for Inverse Problems
1803DCCTNET: KIDNEY TUMORS SEGMENTATION BASED ON DUAL-LEVEL COMBINATION OF CNN AND TRANSFORMER
2494DECLOUDING OF SATELLITE IMAGES FOR CROP GROWTH MONITORING VIA UNROLLING OF GRADIENT GRAPH LAPLACIAN REGULARIZER
1251DECOMPL: DECOMPOSITIONAL LEARNING WITH ATTENTION POOLING FOR GROUP ACTIVITY RECOGNITION FROM A SINGLE VOLLEYBALL IMAGE
1765Decoupling Domain Invariance and Variance with Tailored Prompts for Open-Set Domain Adaptation
1860DEEP CONVOLUTIONAL NEURAL NETWORK PREDICTION FOR GLAUCOMA DETECTION USING OCT AND OCT-ANGIOGRAPHY DISC- AND MACULA-CENTERED IMAGES AND THEIR COMBINED POWER
1152Deep Fusion of Visible and Near Infrared Images for Registration and Defogging Using Cross Modal Transformer
1599DEEP LEARNING APPROACH FOR RENAL CELL CARCINOMA DETECTION, SUBTYPING, AND GRADING
2347Deep Learning-Based Leaf Image Analysis for Tomato Plant Disease Detection and Classification
2285Deep Multi-Graph Embedded Clustering for Community Detection in fMRI Functional Brain Networks Across Individuals
1685Deep optical flow learning with deformable large-kernel Cross-attention
2366DEEP REGULARIZATION FOR SCALE-AGNOSTIC SUPERRESOLUTION OF MR IMAGES
2481Deep Spectral Siamese Network for Heterogeneous Object Verification in Amazon Robotic Warehouse
2565DEEPFAKE DETECTION VIA SEPARABLE SELF-CONSISTENCY LEARNING
1496DEEPFAKE DETECTION WITH COMBINED UNSUPERVISED-SUPERVISED CONTRASTIVE LEARNING
1273DEEP-LEARNING-BASED MAGNETIC RESONANCE SIMULTANEOUS MULTISLICE IMAGING USING HOLOGRAPHIC IMAGE DECODING
2213DEEPSKINFORMER: SKIN LESION SEGMENTATION USING HIERARCHICAL TRANSFORMERS AND EDGE ENHANCEMENT
1132DEFENDING AGAINST PHYSICAL ADVERSARIAL PATCH ATTACKS ON INFRARED HUMAN DETECTION
2670DELVING INTO THE EXPLAINABILITY OF PROTOTYPE-BASED CNN FOR BIOLOGICAL CELL ANALYSIS
1444DENSITY-GUIDED DENSE PSEUDO LABEL SELECTION FOR SEMI-SUPERVISED ORIENTED OBJECT DETECTION
1208Detectability of Defects in the Presence of Linear Nuisance Parameters and Images Signal-Dependent Noise
2103Detecting Biomedical Copy-Move Forgery by Attention-based Multiscale Deep Descriptors
2596DIRECTIONAL AND TOPOLOGICAL TRANSFORMER WITH TOPOLOGY PRIORS FOR 4D CELLULAR IMAGE SEGMENTATION
1047DIRECTIONAL ANTENNA SYSTEMS FOR LONG-RANGE THROUGH-WALL HUMAN ACTIVITY RECOGNITION
1356DISENTANGLED KNOWLEDGE DISTILLATION FOR UNIFIED MULTI-CLASS ANOMALY DETECTION
2319Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning
1359DIVERSIFIED TASK AUGMENTATION WITH REDUNDANCY REDUCTION FOR CROSS-DOMAIN FEW-SHOT LEARNING
1094DIVERSIFYING DEEP ENSEMBLES: A SALIENCY MAP APPROACH FOR ENHANCED OOD DETECTION, CALIBRATION, AND ACCURACY
1194DOMAIN DILATION FOR SINGLE DOMAIN GENERALIZATION
2092DRAFT - DISTILLED RECURRENT ALL-PAIRS FIELD TRANSFORMS FOR OPTICAL FLOW
2350Driving through Graphs: A Bipartite Graph for Traffic Scene Analysis
2534DTPose: Learning Disentangled Token Representation for Effective Human Pose Estimation
1009DTSN: NO-REFERENCE IMAGE QUALITY ASSESSMENT VIA DEFORMABLE TRANSFORMER AND SEMANTIC NETWORK
1606DUAL ATTENTION ENHANCED TRANSFORMER FOR IMAGE DEFOCUS DEBLURRING
1637DUAL MULTI-MODAL FEATURE FUSION NETWORK FOR THE EVALUATION OF OSTEOSARCOMA
1360Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction
2194DYNAMIC ACTIVATION FUNCTION BASED ON THE BRANCHING PROCESS AND ITS APPLICATION IN IMAGE CLASSIFICATION
1490Dynamic MRI reconstruction using low-rank plus sparse decomposition with smoothness regularization
1236E2GS: EVENT ENHANCED GAUSSIAN SPLATTING
1776E2SIFT: NEUROMORPHIC SIFT VIA DIRECT FEATURE PYRAMID RECOVERY FROM EVENTS
2021Early prediction of the transferability of bovine embryos from videomicroscopy
1825EarthquakeNet: A High-Resolution UAV-Based Dataset for Earthquake Damage Assessment
1399ECAP: EXTENSIVE CUT-AND-PASTE AUGMENTATION FOR UNSUPERVISED DOMAIN ADAPTIVE SEMANTIC SEGMENTATION
1870EDGE-GUIDED PIXEL LEVEL CONNECTED COMPONENT ASSISTED CAMOUFLAGED OBJECT DETECTION
1677Edge-Reserved Knowledge Distillation for Image Matting
1609EFFICIENT BLACK-BOX ADVERSARIAL ATTACK ON DEEP CLUSTERING MODELS
1070EFFICIENT CIRCULAR AND CONFOCAL NON-LINE-OF-SIGHT IMAGING WITH TRANSIENT SINOGRAM SUPER RESOLUTION
1527EFFICIENT LEARNED WAVELET IMAGE AND VIDEO CODING
1958EFFICIENT SEMANTIC SEGMENTATION FOR AERIAL IMAGERY USING QUERY POINTS AND SUPERPIXEL SUPERVISION
2672EFFICIENT VISUAL QUESTION ANSWERING ON EMBEDDED DEVICES: CROSS-MODALITY ATTENTION WITH EVOLUTIONARY QUANTIZATION
1960EMBEDDING ATTENTION BLOCKS FOR ANSWER GROUNDING
1769Empirical Research on Quantization for 3D Multi-Modal ViT models
2102End-to-end Learned Lossy Dynamic Point Cloud Attribute Compression
1835End-to-End Learned Scalable Multilayer Feature Compression for Machine Vision Tasks
2359ENERGY REDUCTION OPPORTUNITIES IN HDR VIDEO ENCODING
1234Enhanced Detection of Small Objects in Aerial Imagery: A High-Resolution Neural Network Approach with Amplified Feature Pyramid and Sigmoid Re-weighting
2745ENHANCED FACIAL RESTORATION WITH MISINFORMATION-FILTERED GUIDE-DENOISING DIFFUSION PROBABILISTIC MODELS
1810ENHANCED PROTOTYPICAL PART NETWORK (EPPNET) FOR EXPLAINABLE IMAGE CLASSIFICATION VIA PROTOTYPES
2282Enhancing Intubation Accuracy: Advanced Tracheal Segmentation Techniques in Video Endoscopy
2726ENHANCING PERCEPTUAL QUALITY ASSESSMENT FOR 360-DEGREE IMAGES BASED ON ADAPTIVE PATCH LABELING AND MULTI-LABEL LEARNING
2337ENHANCING TMIV PERFORMANCE THROUGH PROXIMITY-AWARE GROUPING AND PRESERVATION OF SMALL CLUSTERS
2817ENN: A NEURAL NETWORK WITH DCT ADAPTIVE ACTIVATION FUNCTIONS
1899ENSEMBLE OF DEEP VARIATIONAL MIXTURE MODELS FOR UNSUPERVISED CLUSTERING
1384ESTATE: EXPERT-GUIDED STATE TEXT ENHANCEMENT FOR ZERO-SHOT INDUSTRIAL ANOMALY DETECTION
1836ESTIMATING INDOOR SCENE DEPTH MAPS FROM ULTRASONIC ECHOES
1673ET: EXPLAIN TO TRAIN: LEVERAGING EXPLANATIONS TO ENHANCE THE TRAINING OF A MULTIMODAL TRANSFORMER
2698EVALUATING 3D HUMAN POSE ESTIMATION IN OCCLUDED MULTI-SENSOR SCENARIOS: DATASET AND ANNOTATION APPROACH.
1295EVENT-SPECIFIC EEG-FNIRS FEATURE FUSION FOR ALZHEIMER’S DISEASE CLASSIFICATION
1385EXPLAINING 3D OBJECT DETECTION THROUGH SHAPLEY VALUE-BASED ATTRIBUTION MAP
2442EXPLAINING REPRESENTATION LEARNING WITH PERCEPTUAL COMPONENTS
2397EXPLOITING CHANGE BLINDNESS TO REDUCE BITRATE AND DISPLAY LUMINANCE IN VIDEO STREAMING
1580EXPLORING ATTENTION MECHANISMS IN INTEGRATION OF MULTI-MODAL INFORMATION FOR SIGN LANGUAGE RECOGNITION AND TRANSLATION
1274EXPLORING SALIENCY BIAS IN MANIPULATION DETECTION
2736EXPLORING THE IMPACT OF MOIRE PATTERN ON DEEPFAKE DETECTORS
2196EXPLORING THE POTENTIAL OF RECURRENCE QUANTIFICATION ANALYSIS FOR VIDEO ANALYSIS AND MOTION DETECTION
2355Exploring the Potential of Synthetic Data to Replace Real Data
2775EXPOSING THE LIMITS OF DEEPFAKE DETECTION USING NOVEL FACIAL MOLE ATTACK: A PERCEPTUAL BLACK-BOX ADVERSARIAL ATTACK STUDY
1999Extended multiple cross-component linear models with adaptive thresholding and overlapped averaging beyond VVC
1252EXTENDING SEGMENT ANYTHING MODEL INTO AUDITORY AND TEMPORAL DIMENSIONS FOR AUDIO-VISUAL SEGMENTATION
1900Face Drawing GAN by Channel Attention and Matrix Product Attention
1077FACE MORPHING DETECTION IN SOCIAL MEDIA CONTENT
1615FACTORIZED EMBEDDING GRAPH MATCHING NETWORK FOR LEARNING LAWLER’S QUADRATIC ASSIGNMENT PROBLEM
2051FANET: FEATURE AMPLIFICATION NETWORK FOR SEMANTIC SEGMENTATION IN CLUTTERED BACKGROUND
2707FANTOM: Federated Adversarial Network for Training Multi-sequence Magnetic Resonance Imaging in Semantic Segmentation
1526FAST CODING MODE PREDICTION FOR INTRA PREDICTION IN VVC SCC
1950FAST CONSTANT-QUALITY VIDEO ENCODING USING VVENC WITH RATE CAPPING BASED ON PRE-ANALYSIS STATISTICS
1531FAST EDGE-AWARE OCCLUSION DETECTION IN THE CONTEXT OF MULTISPECTRAL CAMERA ARRAYS
1343FAST INTER MODE DECISION WITH RESOLUTION SAMPLING FOR VVC 360-DEGREE VIDEO CODING
1318FAST TEMPLATE MATCHING-BASED REFERENCE PICTURE PADDING FOR VIDEO CODING
1162FAST UNSUPERVISED TENSOR RESTORATION VIA LOW-RANK DECONVOLUTION
2665FAWN: FLOOR-AND-WALLS NORMAL REGULARIZATION FOR DIRECT NEURAL TSDF RECONSTRUCTION
2690FC3DNET: A FULLY CONNECTED ENCODER-DECODER FOR EFFICIENT DEMOIRÉING
2024Feature Decomposition Transformers for Infrared and Visible Image Fusion
1438FEATURE ENHANCED LEARNING IMAGE COMPRESSION WITH RECURRENT CRISS-CROSS ATTENTION
1530FEATURES DISENTANGLEMENT FOR EXPLAINABLE CONVOLUTIONAL NEURAL NETWORKS
1337FedAwa: Aggregation Weight Adjustment in Federated Domain Generalization
1791FedMI: A FEDERATED LEARNING FRAMEWOEK FOR SECURE SHARING OF MEDICAL IMAGES
1216FINE-DETAILED NEURAL INDOOR SCENE RECONSTRUCTION USING MULTI-LEVEL IMPORTANCE SAMPLING AND MULTI-VIEW CONSISTENCY
1451FINE-TUNING TEXT-TO-IMAGE DIFFUSION MODELS FOR CLASS-WISE SPURIOUS FEATURE GENERATION
2593FISHEYE STEREO CAMERA USING FISHEYE VERTICAL STEREO METHOD
1909FLEXAE: A SELF-CONDITIONED DETECTOR TO PREVENT MODEL OVERFITTING FOR UNSUPERVISED VIDEO ANOMALY DETECTION
1878FOOD: FACIAL AUTHENTICATION AND OUT-OF-DISTRIBUTION DETECTION WITH SHORT-RANGE FMCW RADAR
1732FOOTBOTS: A TRANSFORMER-BASED ARCHITECTURE FOR MOTION PREDICTION IN SOCCER
2727FOURIER PTYCHOGRAPHY MICROSCOPY WITH INTEGRATED POSITIONAL MISALIGNMENT CORRECTION
2386Fourier Ptychography with Information Entropy Based No-Reference Image Quality Assessment Learning
2705FREQ-MIP-AA : FREQUENCY MIP REPRESENTATION FOR ANTI-ALIASING NEURAL RADIANCE FIELDS
2381FREQUENCY-SPATIAL DOMAIN INFORMATION FUSION NETWORK FOR PAN-SHARPENING
2182FULL-REFERENCE POINT CLOUD QUALITY ASSESSMENT USING SPECTRAL GRAPH WAVELETS
1826FUSION OF INDEPENDENT AND INTERACTIVE FEATURES FOR HUMAN-OBJECT INTERACTION DETECTION
2388GABIC: GRAPH-BASED ATTENTION BLOCK FOR IMAGE COMPRESSION
1508GABOR FEATURE NETWORK FOR TRANSFORMER-BASED BUILDING CHANGE DETECTION MODEL IN REMOTE SENSING
1850GaitGS: Temporal Feature Learning in Granularity and Span Dimension for Gait Recognition
2205GEEG-YOLOV8: GAUSSIAN ENHANCED EUCLIDEAN NORM GHOST ATTENTION FOR REAL-TIME POLYP DETECTION
1075GENERALIZED NESTED LATENT VARIABLE MODELS FOR LOSSY CODING APPLIED TO WIND TURBINE SCENARIOS
2216GENERATE DSLR-LIKE IMAGE WITH GLOBAL INFORMATION AND PRIOR GUIDED ISP
1821Generative Visual Compression: A Review
1270GENGMM: GENERALIZED GAUSSIAN-MIXTURE-BASED DOMAIN ADAPTATION MODEL FOR SEMANTIC SEGMENTATION
1231GIRAFFE: A GENETIC PROGRAMMING ALGORITHM TO BUILD DEEP LEARNING ENSEMBLES FOR ECG ARRHYTHMIA CLASSIFICATION
1222GradTrans: Transformer-based Gradient Guidance for Image Generation
1646GRAPH CONVOLUTIONAL NETWORKS WITH MINIMAL APPEARANCE INFORMATION FOR ACTION RECOGNITION
1592GRAPHIC - Graph-based Representation for Analyzing People's High-level Interactions in Crowds
1605GUIDED CONTEXT GATING: LEARNING TO LEVERAGE SALIENT LESIONS IN RETINAL FUNDUS IMAGES
1453GUMBEL-NERF: REPRESENTING UNSEEN OBJECTS AS PART-COMPOSITIONAL NEURAL RADIANCE FIELDS
2689HAND-OBJECT RECONSTRUCTION VIA INTERACTION-AWARE GRAPH ATTENTION MECHANISM
2043HDPLIFTER: HIERARCHICAL DYNAMICS PERCEPTION FOR 2D-TO-3D HUMAN POSE LIFTING
2477Hierarchical Vertex-wise Intensification Graph Convolution for Skeleton-based Activity Recognition
1405HIGHLY CONSTRAINED CODED APERTURE IMAGING SYSTEMS DESIGN VIA A KNOWLEDGE DISTILLATION APPROACH
1947HistoHDR-Net: Histogram Equalization for Single LDR to HDR Image Translation
1089HoloGesture: A Multimodal Dataset for Hand Gesture Recognition Robust to Hand Textures on Head-Mounted Mixed-Reality Devices
2487HOW TO TRAIN YOUR VAE
2688HYBRID SINGLE INPUT AND MULTIPLE OUTPUT METHOD FOR COMPRESSING FEATURES TOWARDS MACHINE VISION TASKS
2044HYPERSPECTRAL IMAGE CLASSIFICATION WITH FUZZY SPATIAL-SPECTRAL CLASS DISCRIMINATE INFORMATION
1651ILLUMINATION-ENHANCED INFRARED AND LOW-LIGHT VISIBLE IMAGE FUSION
2266IMAGE CODING FOR MACHINE VIA ANALYTICS-DRIVEN APPEARANCE REDUNDANCY REDUCTION
2589Image Coding for Machines with Edge Information Learning Using Segment Anything
2338Imbalanced data robust online continual learning based on evolving class aware memory selection and built-in contrastive representation learning
1611IMPROVEMENT OF IMAGE RECONSTRUCTION FOR MRI USING PHASE-SCRAMBLING FOURIER TRANSFORM AND DUAL-DOMAIN STRATEGY
2094Improving Automatic Target Recognition with Infrared Imagery using Vision Transformers and Focused Data Augmentation
2669IMPROVING IMAGE CODING FOR MACHINES THROUGH OPTIMIZING ENCODER VIA AUXILIARY LOSS
1403IMPROVING IMAGE DE-RAINING USING REFERENCE-GUIDED TRANSFORMERS
1345IMPROVING REAL-TIME NEAR-INFRARED FACE ALIGNMENT WITH A PAIRED VIS-NIR DATASET AND DATA AUGMENTATION THROUGH IMAGE-TO-IMAGE TRANSLATION
1584IMPROVING SELF-SUPERVISED VISION TRANSFORMERS FOR VISUAL CONTROL
1245IMU-ASSISTED TARGET-FREE EXTRINSIC CALIBRATION OF HETEROGENEOUS LIDARS BASED ON CONTINUOUS-TIME OPTIMIZATION
1724INCREASING TRUST IN IMAGE ANALYSIS BY DETECTING TRELLIS QUANTIZATION IN JPEG IMAGES
1991IN-LOOP FILTER FOR OBJECT MASK CODING IN VERSATILE VIDEO CODING
1907INSTANCE-AWARE UNCERTAINTY FOR ACTIVE LEARNING IN OBJECT DETECTION
1190Integrating Vision-Language Supervision for Uniform Appearance Tracking
2423INTELLIGENT MULTI-VIEW TEST TIME AUGMENTATION
2528INTERACTIVE TEACHING FOR FINE-GRANULAR FEW-SHOT OBJECT RECOGNITION USING VISION TRANSFORMERS
2099INTERPRETING THE FRAUDULENCE LEVEL OF DIFFERENT FINGER PHOTO PRESENTATION ATTACK INSTRUMENTS
2637INTRINSIC IMAGE DECOMPOSITION BASED ON QUANTIZED PRIOR CODEBOOK
2819INVERTIBLE ENERGY-AWARE IMAGES
2425INVESTIGATING AND REDUCING THE IMPAIRMENT OF POINT SPREAD EFFECT FOR SPATIOTEMPORAL FUSION OF REMOTE SENSING IMAGERY
2771Investigating Self-Supervised Methods for Label-Efficient Learning
2353Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs
1211JOINT IMAGE RESTORATION FOR DOMAIN ADAPTIVE OBJECT DETECTION IN FOGGY WEATHER CONDITION
1649JOINTRF: END-TO-END JOINT OPTIMIZATION FOR DYNAMIC NEURAL RADIANCE FIELD REPRESENTATION AND COMPRESSION
1474JPEG Image Ciphering based on Chaotic Encryption
2577KNOWLEDGE-INFUSED LEARNING FOR FINE-GRAINED PLANT DISEASE RECOGNITION
2074Koopcon: A new approach towards smarter and less complex learning
2177LAND USE CLASSIFICATION VIA MULTI-MODAL COMPLEMENTARY FEATURE FUSION AND CONTEXT INFORMATION ENHANCEMENT FOR OPTICAL AND SAR IMAGES
2256LATENT ENHANCING AUTOENCODER FOR OCCLUDED IMAGE CLASSIFICATION
1380Learn by an Example Transformer for Domain Generalization in Video Object Segmentation
1823Learned Compression of Encoding Distributions
2218LEARNED IMAGE COMPRESSION FOR BOTH HUMANS AND MACHINES VIA DYNAMIC ADAPTATION
1749Learned Image Compression Using a Long and Short Attention Module
1172Learned Image Compression with Text Quality Enhancement
2272Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression
1840LEARNING A RAIN-INVARIANT NETWORK FOR INSTANCE SEGMENTATION IN THE RAIN
2081Learning Orthonormal Features in Self-Supervised Learning using Functional Maximal Correlation
1042Learning Temporal Cues for Fine-grained Action Recognition
2260LEARNING WITH INSTANCE-DEPENDENT NOISY LABELS BY ANCHOR HALLUCINATION AND HARD SAMPLE LABEL CORRECTION
2428LEARNING-BASED POINT CLOUD DECODING WITH INDEPENDENT AND SCALABLE REDUCED COMPLEXITY
1610LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING
1814LEGIT: TEXT LEGIBILITY FOR USER-GENERATED MEDIA
2082LENSLESS PHASE RETRIEVAL WITH REGULARIZATION BY BLIND NOISE MAP ESTIMATION AND DENOISING
2630LERCPOSE: LEARNED RANKING AND CONTRASTIVE LOSS FOR ROBUST HEAD POSE ESTIMATION
1521LEVERAGING GENERATED IMAGE CAPTIONS FOR VISUAL COMMONSENSE REASONING
1634LFGN: Low-level Feature-Guided Network for Adversarial Defense
1927LiCAF: LiDAR-Camera Asymmetric Fusion for Gait Recognition
2613LIDAR DEPTH MAP GUIDED IMAGE COMPRESSION MODEL
2086LIGHTWEIGHT RECURRENT NEURAL NETWORK FOR IMAGE SUPER-RESOLUTION
1657LIGHT-WEIGHT SELF-SUPERVISED CONTRASTIVE LEARNING NETWORK FOR SMALL SAMPLE HYPERSPECTRAL IMAGE CLASSIFICATION
2118LIGHTWEIGHT UNDERWATER IMAGE ENHANCEMENT VIA IMPULSE RESPONSE OF LOW-PASS FILTER BASED ATTENTION NETWORK
1445LIPFACE: LIPSCHITZ-CONDITIONED FOR RESOLUTION ROBUST FACE RECOGNITION
1039LISD: AN EFFICIENT MULTI-TASK LEARNING FRAMEWORK FOR LIDAR SEGMENTATION AND DETECTION
1979LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network for Multifeatures Segmentation
2154Localization of Image Splicing Under Segment Anything Model with Integrated Compression and Edge Artifacts
2389LOCALIZING MOMENTS OF ACTIONS IN UNTRIMMED VIDEOS OF INFANTS WITH AUTISM SPECTRUM DISORDER
2248LONG-TERM GEO-POSITIONED RE-IDENTIFICATION DATASET OF URBAN ELEMENTS
1259LOW-RANK MATRIX AND TENSOR DECOMPOSITION USING RANDOMIZED TWO-SIDED SUBSPACE ITERATION WITH APPLICATION TO VIDEO RECONSTRUCTION
1442LRDif: Diffusion Models for Under-Display Camera Emotion Recognition
2304LSDM-PCB: A Lightweight Small Defect Detection Model For Printed Circuit Board
2129LUMINATE: LINGUISTIC UNDERSTANDING AND MULTI-GRANULARITY INTERACTION FOR VIDEO OBJECT SEGMENTATION
2597LWIRPOSE: A novel Long Wave Infrared Thermal Image Pose Dataset and Benchmark
1604M3T: MULTI-MODAL MEDICAL TRANSFORMER TO BRIDGE CLINICAL CONTEXT WITH VISUAL INSIGHTS FOR RETINAL IMAGE MEDICAL DESCRIPTION GENERATION
2633MAMBA-PCGC: MAMBA-BASED POINT CLOUD GEOMETRY COMPRESSION
1663MASK-BASED INVISIBLE BACKDOOR ATTACKS ON OBJECT DETECTION
1577Masked Momentum Contrastive Learning for Semantic Understanding by Observation
2602Masked Signal Modeling for Plastic Waste Resin Classification
1489MAVAD: AUDIO-VISUAL DATASET AND METHOD FOR ANOMALY DETECTION IN TRAFFIC VIDEOS
2592MCT-NET: A LIGHTWEIGHT MULTISCALE CONVOLUTIONAL TRANSFORMER NETWORK FOR POLYP SEGMENTATION
1204MDBFUSION: A VISIBLE AND INFRARED IMAGE FUSION FRAMEWORK CAPABLE FOR MOTION DEBLURRING
2187MEDeA: Multi-view Efficient Depth Adjustment
2165MEDICAL KNOWLEDGE-GUIDED SEMI-SUPERVISED BI-VENTRICULAR SEGMENTATION
1427MEMSVD: LONG-RANGE TEMPORAL STRUCTURE CAPTURING USING INCREMENTAL SVD
1137META-DM: APPLICATIONS OF DIFFUSION MODELS ON FEW-SHOT LEARNING
1941METAHEURISTIC CAMERA CALIBRATION FOR OPTICAL TOMOGRAPHIC IMAGING IN INDUSTRIAL ENVIRONMENTS
1135MFLFC:MULTI-FRAME FUSION BASED LOW-RESOLUTION FEATURE COMPRESSION FOR OBJECT TRACKING
1680MGRQ: POST-TRAINING QUANTIZATION FOR VISION TRANSFORMER WITH MIXED GRANULARITY RECONSTRUCTION
2715MICRO-EXPRESSION RECOGNITION BASED ON 3DCNN COMBINED WITH GRU AND NEW ATTENTION MECHANISM
2564MINIMIZATION OF SUBMESH BOUNDARY ERRORS IN DYNAMIC MESH CODING
2059MIX-DOMAIN CONTRASTIVE LEARNING FOR UNPAIRED H&E-TO-IHC STAIN TRANSLATION
2551MMAQ: A Multi-modal Self-supervised Approach For Estimating Air Quality From Remote Sensing Data
2296MODIPHY: MULTIMODAL OBSCURED DETECTION FOR IOT USING PHANTOM CONVOLUTION-ENABLED FASTER YOLO
1865MOTION-ADAPTIVE INFERENCE FOR FLEXIBLE LEARNED B-FRAME COMPRESSION
1538Motion-Lie Transformer : Geometric Attention for 3D Human Pose Motion Prediction
1636MSD-CRFS: MULTI-SCALE DUAL AGGREGATION CONDITIONAL RANDOM FIELDS FOR MONOCULAR DEPTH ESTIMATION
1335MSGAT: MULTI-STAGE GRAPH ATTENTION NETWORK FOR HUMAN MOTION PREDICTION
2708MSSPG-AL: FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION WITH ACTIVE LEARNING UPDATED MULTI-SCALE SUPERPIXEL GRAPH FUSION
2450MTA-PS: TOWARDS PRACTICAL PERSON SEARCH IN VIDEOS
2722MULTI-ATTRIBUTE VISION TRANSFORMERS ARE EFFICIENT AND ROBUST LEARNERS
1964MULTICLASSIFICATION OF VOCAL FOLDS DISORDERS FROM VIDEOS BY SPATIO-TEMPORAL DEEP FEATURES
1573MULTI-MODAL MEDICAL IMAGE FUSION FOR NON-SMALL CELL LUNG CANCER CLASSIFICATION
1915MULTIMODAL TRANSFORMER USING CROSS-CHANNEL ATTENTION FOR OBJECT DETECTION IN REMOTE SENSING IMAGES
1809MULTIMODAL-ENHANCED OBJECTNESS LEARNER FOR CORNER CASE DETECTION IN AUTONOMOUS DRIVING
1744Multi-path Interference Mitigation for Indirect Time-of-Flight Camera by the Distortion of Coding Curve
1781MULTI-REFERENCE FLOW-GUIDED CROSS-DOMAIN RECONSTRUCTION FOR GENERAL OBJECT 6D POSE ESTIMATION
1690MULTI-TASK AFFINITY PROPAGATION BASED NATURAL IMAGE MATTING
2750MULTI-VIEW MULTI-FOCUS IMAGE FUSION: A NOVEL BENCHMARK DATASET AND METHOD
2399MULTI-VIEW NETWORK FOR COLORECTAL POLYPS DETECTION IN CT COLONOGRAPHY
1796MVAFormer: RGB-Based Multi-View Spatio-Temporal Action Recognition with Transformer
1729MVCrackViT: Robust Multi-View Crack Detection for Point Cloud Segmentation using View Attention
2587MWIRSTD: A MWIR SMALL TARGET DETECTION DATASET
1468NAVIGATING LIMITATIONS WITH PRECISION: A FINE-GRAINED ENSEMBLE APPROACH TO WRIST PATHOLOGY RECOGNITION ON A LIMITED X-RAY DATASET
1529NEURAL MESH FUSION: UNSUPERVISED 3D PLANAR SURFACE UNDERSTANDING
1855NEURAL RADIANCE FIELD-ASSISTED STATIC-SCENE VIDEO CODING
2254NN-BASED IN-LOOP FILTERING WITH INPUTS TRANSFORMED
2410Non-Separable Wavelet Transform using Learnable Convolutional Lifting Steps
1314NORM-INTEGRATED SOFTMAX LOSS FOR DEEP FACE RECOGNITION
2224NOVEL META ATTENTION GUIDED FRAMEWORK FOR BREAST ABNORMALITY CLASSIFICATION WITH COMBINATION OF FSL AND DA
2463NYCTALE: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction
2115OBJECT DETECTION FRAMEWORK USING MULTIPLE TONE MAPPINGS ON HIGH-DYNAMIC-RANGE IMAGES
2011OBJECT-AWARE ADAPTIVE IMAGE RETARGETING VIA IMPORTANCE MAP FUSION
2324ODVista: An Omnidirectional Video Dataset for Super-Resolution and Quality Enhancement Tasks
2575OMRA: Online Motion Resolution Adaptation to Remedy Domain Shift in Learned Hierarchical B-frame Coding
1497ON ANNOTATION-FREE OPTIMIZATION OF VIDEO CODING FOR MACHINES
1945ON EFFICIENT NEURAL NETWORK ARCHITECTURES FOR IMAGE COMPRESSION
2341ON THE CLOUD DETECTION FROM BACKSCATTERED IMAGES GENERATED FROM A LIDAR-BASED CEILOMETER: CURRENT STATE AND OPPORTUNITIES
2080ON THE DETECTION OF IMAGES GENERATED FROM TEXT
1459On the Exploitation of DCT-Traces in the Generative-AI Domain
2184ONE-HOT LOGISTIC REGRESSION FOR RADIOMICS-BASED CLASSIFICATION
2185ONE-SHOT MULTI-RATE PRUNING OF GRAPH CONVOLUTIONAL NETWORKS FOR SKELETON-BASED RECOGNITION
1486ONLINE ANCHOR-BASED TRAINING FOR IMAGE CLASSIFICATION TASKS
1402OPEN WORLD OBJECT DETECTION VIA COOPERATIVE FOUNDATION MODELS FOR DRIVING SCENES
2141OpenAnimalTracks: A Dataset for Animal Track Recognition
1393Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model
2267OPTIMIZED DECOUPLED STRUCTURE WITH NON-LOCAL ATTENTION FOR DEEP IMAGE COMPRESSION
2653OPTIMIZING LEARNED IMAGE COMPRESSION ON SCALAR AND ENTROPY-CONSTRAINT QUANTIZATION
2802PAON: A NEW NEURON MODEL USING PADE APPROXIMANTS
1348Parallel Task-Prompts ICM: A Versatile Feature Codec for Machine Vision
1897PARTIAL INTER-FRAME CODING FOR DYNAMIC MESHES
1540PCA-UNet for Object Segmentation
1912Perceptual Learned Image Compression via End-To-End JND-Based Optimization
1285PersonaTalk: Preserving Personalized Dynamic Speech Style In Talking Face Generation
1974PHYSIOLOGICAL MODELING WITH MULTISPECTRAL IMAGING FOR HEART RATE ESTIMATION
1105PICTURE PARTITIONING DESIGN OF NEURAL NETWORK-BASED INTRA CODING FOR VIDEO CODING FOR MACHINES
2163Pilot-Free Semantic Communication over Multi-User MIMO Fading Channels
1772PIXEL-WISE COLOR CONSTANCY VIA SMOOTHNESS TECHNIQUES IN MULTI-ILLUMINANT SCENES
2013POINT CLOUD GEOMETRY SCALABLE CODING WITH A QUALITY-CONDITIONED LATENTS PROBABILITY ESTIMATOR
1326POSE-INVARIANT LEARNING FOR EFFICIENT PERSON IDENTIFICATION FROM HYPERSPECTRAL HAND IMAGES
1038POWER-LLAVA: LARGE LANGUAGE AND VISION ASSISTANT FOR POWER TRANSMISSION LINE INSPECTION
2549PRIORFORMER : A UGC-VQA METHOD WITH CONTENT AND DISTORTION PRIORS
1858Privacy-Preserving Visual Cues Communication for Hearing-Impaired People Using Deep Learning
1325Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression
1856PROJECT, SKATE, AND REFRESH: IMPROVED SCHRODINGER BRIDGE SAMPLER FOR IMAGE RESTORATION
1066Prompt Performance Prediction for Image Generation
1987Prune Channel and Distill: Discriminative Knowledge Distillation for Semantic Segmentation
2005PUAD: FRUSTRATINGLY SIMPLE METHOD FOR ROBUST ANOMALY DETECTION
1808PVDN-Urban - A Dataset for Provident Vehicle Detection at Night in Urban Scenarios
2790PWISeg: Weakly-supervised Surgical Instrument Instance Segmentation
1127PYRAMID CODER: HIERARCHICAL CODE GENERATOR FOR COMPOSITIONAL VISUAL QUESTION ANSWERING
2375Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse Annotations
1614QUALITY OF EXPERIENCE OF VIEWPORT ADAPTIVE OMNIDIRECTIONAL VIDEO STREAMING
1463QUANTIZATION AFTER INTER PREDICTION IN DISPLACEMENT CODING OF DYNAMIC MESHES
1667RAFMNET: REINFORCED ATTENTION FUSION AND MULTISCALE NETWORK FOR NOISY INFRARED AND VISIBLE IMAGE FUSION
2008RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications
1829RATE-COMPLEXITY OPTIMIZATION IN LOSSLESS NEURAL-BASED IMAGE COMPRESSION
2088Rate-Quality or Energy-Quality Pareto Fronts for Adaptive Video Streaming?
1263RDSSD: 3D Single Stage Object Detector for Roadside LiDAR Sensors
2521Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification
2507REAL-TIME AND RESOURCE-EFFICIENT MULTI-SCALE ADAPTIVE ROBOTICS VISION FOR UNDERWATER OBJECT DETECTION AND DOMAIN GENERALIZATION
1955Real-time Monocular Depth Estimation on Embedded Systems
1574REAL-TIME SEMANTIC VIDEO COMMUNICATION OF GENERAL SCENES
1973REAL-TIME VIDEO PREDICTION WITH FAST VIDEO INTERPOLATION MODEL AND PREDICTION TRAINING
2489REAL-WORLD ATMOSPHERIC TURBULENCE CORRECTION VIA DOMAIN ADAPTATION
2140RECONSTRUCT DYNAMIC SCENE FOR SPIKE CAMERA BASED ON 3D SPACE TIME SIMILARITY
2064Recurrent 3-D Multi-level Visual Transformer for Joint Classification of Heterogeneous 2-D and 3-D Radiographic Data
2693REDEFINING CYSTOSCOPY WITH AI: BLADDER CANCER DIAGNOSIS USING AN EFFICIENT HYBRID CNN-TRANSFORMER MODEL
2207REDEFINING VISUAL QUALITY: THE IMPACT OF LOSS FUNCTIONS ON INR-BASED IMAGE COMPRESSION
2343Reducing motion artifacts in brain MRI using vision transformers and self-supervised learning
2624REFERRING IMAGE SEGMENTATION WITH TWO-STAGE MULTI-MODAL INTERACTION
2048Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification
1241REINFORCEMENT LEARNING-BASED SECURE VIDEO TRANSMISSION FOR IOV SYSTEMS
2699REINFORCING PRE-TRAINED MODELS USING COUNTERFACTUAL IMAGES
1267REMOTE SENSING IMAGE UNEVEN HAZE REMOVAL BASED ON HAZE DENSITY ESTIMATION AND SALIENCY-DRIVEN DUAL CHANNEL FUSION
1659REMOVING REFLECTIVE FLARE IN REAL-WORLD CONDITIONS
2240ReSet: A Residual Set-Transformer approach to tackle the ugly-duckling sign in melanoma detection
1948RESNERF-PCAC: SUPER RESOLVING RESIDUAL LEARNING NERF FOR HIGH EFFICIENCY POINT CLOUD ATTRIBUTES CODING
1711RES-NERV : RESIDUAL BLOCKS FOR A PRACTICAL IMPLICIT NEURAL VIDEO DECODER
2107RESSCAL3D++: JOINT ACQUISITION AND SEMANTIC SEGMENTATION OF 3D POINT CLOUDS
2251Rethinking Domain Adaptation and Generalization in the Era of CLIP
2400RETHINKING TEMPORAL SELF-SIMILARITY FOR REPETITIVE ACTION COUNTING
2143RFG-HDR: REPRESENTATIVE FEATURE-GUIDED TRANSFORMER FOR MULTI-EXPOSURE HIGH DYNAMIC RANGE IMAGING
1100RFNET: REFINED FUSION THREE-BRANCH RGB-D SALIENT OBJECT DETECTION NETWORK
1793ROBUST 3D SEMANTIC SEGMENTATION WITH INCOMPLETE POINT CLOUDS BASED ON SEQUENTIAL FRAME SAMPLING
1665ROBUST REPRESENTATION LEARNING WITH SELF-DISTILLATION FOR DOMAIN GENERALIZATION
2190ROBUST SKIN COLOR DRIVEN PRIVACY-PRESERVING FACE RECOGNITION VIA FUNCTION SECRET SHARING
2269Robustness of tensor decomposition-based neural network compression
2173ROI-DVC: A REGION-OF-INTEREST BASED DEEP VIDEO CODING FRAMEWORK
1037ROTATED R-CNN: A TWO-STAGE OBJECT DETECTION METHOD ADAPTED TO ORIENTED BOUNDING BOXES
1053RSUD20K: A Dataset for Road Scene Understanding In Autonomous Driving
1303S³GCN: SPORT SCORING SIAMESE GRAPH CONVOLUTION NETWORK
1737Saliency as a Schedule: Intuitive Image Attribution
1520SALIENCY-AWARE END-TO-END LEARNED VARIABLE-BITRATE 360-DEGREE IMAGE COMPRESSION
1654SALIENT GUIDED TEXT DETECTION IN E-COMMERCE IMAGES
1956Sample Domain Prediction and Transform Skip for Region Adaptive Hierarchical Transform in Geometric Point Cloud Compression
1347SANeRV: Scene-Adaptive Neural Representation for Videos
1391SCALABLE HYPERSPHERE EMBEDDING FOR SEMANTIC METRIC LEARNING
1740SCENE GENERALIZED MULTI-VIEW PEDESTRIAN DETECTION WITH ROTATION-BASED AUGMENTATION AND REGULARIZATION
1849SCENE TEXT RECOGNITION USING PROGRESSIVE RECTIFICATION NETWORK AND SPELLING ERROR CORRECTION LANGUAGE MODEL
2050SE3D: A FRAMEWORK FOR SALIENCY METHOD EVALUATION IN 3D IMAGING
1676SEGGUARD: DEFENDING SCENE SEGMENTATION AGAINST ADVERSARIAL PATCH ATTACK
2354Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation
1221SEGMENTATION OF HARD EXUDATES AND HEMORRHAGES FROM DIABETIC RETINOPATHY IMAGES USING RESIDUAL U-NET WITH SQUEEZE AND EXCITE BLOCKS
2376SELF-SUPERVISED ANOMALY DETECTION AND A NEW BENCHMARK FOR X-RAY CARGO IMAGES
1906SELF-SUPERVISED MULTI-VIEW STEREO WITH ADAPTIVE DEPTH PRIORS
2316SEMANTIC ENHANCED FEW-SHOT OBJECT DETECTION
2635SEMANTIC-ENHANCED POINT-BOX JOINT PROMPTING FOR VIDEO OBJECT SEGMENTATION
2036Semantic-Region Specific Lookup Tables for Image Enhancement via Unpaired Learning
1365SEMI-SUPERVISED 3D OBJECT DETECTION WITH CHANNEL AUGMENTATION USING TRANSFORMATION EQUIVARIANCE
2352SEMI-SUPERVISED ACTION RECOGNITION FROM NEWBORN RESUSCITATION VIDEOS
1202SEMI-SUPERVISED GRAPHICAL DEEP DICTIONARY LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION FROM LIMITED SAMPLES
2057SET-NAS: Sample-Efficient Training for Neural Architecture Search with Strong Predictor and Stratified Sampling
1837SFD: SIMILAR FRAME DATASET FOR CONTENT-BASED VIDEO RETRIEVAL
2170SFNET - A SPATIAL-FREQUENCY DOMAIN NEURAL NETWORK FOR IMAGE LENS FLARE REMOVAL
1820SG-JND: SEMANTIC-GUIDED JUST NOTICEABLE DISTORTION PREDICTOR FOR IMAGE COMPRESSION
2516Shadow-Aware Makeup Transfer with Lighting Adaptation
2651SIMILARITY-WEIGHTED IOU (SIOU): A COMPREHENSIVE METRIC FOR EVALUATING MODEL PERFORMANCE THROUGH SIMILARITY-WEIGHTED CLASS OVERLAPS
1726SIMPLE IMAGE SIGNAL PROCESSING USING GLOBAL CONTEXT GUIDANCE
1183SimSAM: Simple Siamese Representation-Based Semantic Affinity Matrix for unsupervised image segmentation
1164SINGLE-PANORAMA CLASSIFICATION OF 3D OBJECTS USING HORIZONTALLY STACKED DILATED CONVOLUTIONS
2276Sino-CT-Fusion-Net: A Lightweight Deep Learning Framework for Detection and Classification of Intracranial Hemorrhages
1707SKETCH2MANGA: SHADED MANGA SCREENING FROM SKETCH WITH DIFFUSION MODELS
1032SLNL: Soft Label Regularization for Semi-Supervised Facial Expression Recognition with Negative Label Learning
2732Smo-CLIP: Enhancing Anomalous Smoke Density Assessment using A Hybrid LLM-VLM Approach
1440SN-NET: SEMISMOOTH NEWTON DRIVEN LIGHTWEIGHT NETWORK FOR REAL-WORLD IMAGE DENOISING
2342SODA: A DATASET FOR SMALL OBJECT DETECTION IN UAV CAPTURED IMAGERY
2283SOME CAN BE BETTER THAN ALL: MULTIMODAL STAR TRANSFORMER FOR VISUAL DIALOG
2290SOURCE-FREE CONTINUAL ADAPTIVE LEARNING WITH LIMITED LABELS ON EVOLVING DATA DRIFTS
1800SOVASEG-NET: SCALE INVARIANT OVARIAN TUMORS SEGMENTATION FROM ULTRASOUND IMAGES
1072SPARSE TRANSFORMER REFINEMENT SIMILARITY MAP FOR AERIAL TRACKING
2091SPATIAL PLAID ATTENTION DECODER FOR SEMANTIC SEGMENTATION
1876SPATIAL-CHANNEL COLLABORATED ATTENTION FOR CROSS-SCALE CROWD COUNTING
2504SPATIALITY-AWARE PROMPT TUNING FOR FEW-SHOT SMALL OBJECT DETECTION
2617SPATIO-TEMPORAL ADAPTATION WITH DILATED NEIGHBOURHOOD ATTENTION FOR ACCIDENT ANTICIPATION
2792SS-CXR: SELF-SUPERVISED PRETRAINING USING CHEST X-RAYS TOWARDS A DOMAIN SPECIFIC FOUNDATION MODEL
1141Standard compliant video coding using low complexity, switchable neural wrappers
1092START-TV: A CLOSED-FORM INITIALIZATION FOR TOTAL VARIATION MODELS
2249STATISTICS-AWARE AUDIO-VISUAL DEEPFAKE DETECTOR
1253STAY FOCUS ON OBJECT: CROSS-DOMAIN DETECTION USING DOMAIN-INVARIANT OBJECT REPRESENTATION
2830STEGANALYSIS OF AI MODELS LSB ATTACKS
2714Streaming Neural Images
2085STREAMLINED HYBRID ANNOTATION FRAMEWORK USING SCALABLE CODESTREAM FOR BANDWIDTH-RESTRICTED UAV OBJECT DETECTION
2607STRUCTURED PRUNING AND QUANTIZATION FOR LEARNED IMAGE COMPRESSION
2055SUBBLOCK-BASED COMBINED INTER AND INTRA PREDICTION BEYOND VVC
2315Subgroups for Detection Transformer
1815SUBJECTIVE PORTRAIT REGION CROPPING ON LANDSCAPE VIDEO STUDY
1080SUBJECTIVE QUALITY ASSESSMENT OF THERMAL INFRARED IMAGES
2694SUPER: SELFIE UNDISTORTION AND HEAD POSE EDITING WITH IDENTITY PRESERVATION
1921SUPERPIXEL MIXING: A DATA AUGMENTATION TECHNIQUE FOR ROBUST DEEP VISUAL RECOGNITION MODELS
2465SUPER-RESOLUTION FOR NEAR-EYE LIGHT FIELD DISPLAY IN FOURIER SPACE
2298SURFACE ANOMALY DETECTION WITH ANOMALOUS FEATURE RESTRICTION AND DIFFERENCE-AWARE ENHANCEMENT
2340SYNTHMANTICLIDAR: A SYNTHETIC DATASET FOR SEMANTIC SEGMENTATION ON LIDAR IMAGING
1687TALKING-HEAD VIDEO COMPRESSION WITH MOTION SEMANTIC ENHANCEMENT MODEL
2252TAXES ARE ALL YOU NEED: INTEGRATION OF TAXONOMICAL HIERARCHY RELATIONSHIPS INTO THE CONTRASTIVE LOSS
2100TCA-NET: TRIPLET CONCATENATED-ATTENTIONAL NETWORK FOR MULTIMODAL ENGAGEMENT ESTIMATION
1334TDAD: TRIDENT DISTILLATIONS FOR ANOMALY DETECTION
1461TEMPORAL CLUSTERING AND TEMPORAL REFERENCE BASED SPECULAR DETECTION FOR 1-MS VISUAL FEEDBACK SYSTEM
2279TEMPORAL REGULARIZATION FOR ROBUST MOTION COMPENSATION IN REDUCED DOSE CARDIAC-GATED SPECT IMAGES
2536TEMPORAL SCALABLE CODING FOR DYNAMIC MESHES
2293TEMPORAL TRANSFORMER ENCODER FOR VIDEO CLASS INCREMENTAL LEARNING
2675TEMPORAL-SPATIAL SPDAGG NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION FROM AERIAL PERSPECTIVES
2816The Bjøntegaard Bible: Why Your Way of Comparing Video Codecs May Be Wrong
2029THERMAL VIDEODIFF (TVD): A DIFFUSION ARCHITECTURE FOR THERMAL VIDEO SYNTHESIS
1437THQA: A Perceptual Quality Assessment Database for Talking Heads
1431THROUGH-WALL IMAGING BASED ON WIFI CHANNEL STATE INFORMATION
2406Toward Efficient Deep Blind RAW Image Restoration
2142TOWARD LOW ARTIFACT VIRTUAL TRY-ON VIA PRE-WARPING PARTITIONED CLOTHING ALIGNMENT
2284Towards Better Control of Latent Spaces for Face Editing
1926TOWARDS GENERALIZABLE REFERRING IMAGE SEGMENTATION VIA TARGET PROMPT AND VISUAL COHERENCE
2826Towards Generated Image Provenance Analysis Via Conceptual-Similar-Guided-SLIP Retrieval
2071TOWARDS PRIVACY-ENHANCING PROVENANCE ANNOTATIONS FOR IMAGES
1639TOWARDS ROBUST PERSON RE-IDENTIFICATION VIA EFFICIENT AND GENERALIZED ADVERSARIAL TRAINING
2263TOWARDS ROBUST VISUAL LOCALIZATION USING MULTI-VIEW IMAGES AND HD VECTOR MAP
1954Towards the Detection of AI-Synthesized Human Face Images
1811TOWARDS UNIFYING ANATOMY SEGMENTATION: AUTOMATED GENERATION OF A FULL-BODY CT DATASET
2840Trainable Fractional Fourier Transform
1248Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval
1937TRUSTWORTHY SR: RESOLVING AMBIGUITY IN IMAGE SUPER-RESOLUTION VIA DIFFUSION MODELS AND HUMAN FEEDBACK
2329TSF-Net3D: TSF-Net for 3D Point Cloud Attribute Compression Artifacts Removal
1495TWO HEADS BETTER THAN ONE: DUAL DEGRADATION REPRESENTATION FOR BLIND SUPER-RESOLUTION
1628TWO-LEVEL INTRA PREDICTION USING HIGH-ORDER MACROPIXEL NEIGHBORS FOR PLENOPTIC VIDEO CODING
1838TWO-STAGE TRIPLETNET: LIGHT WEIGHT REMOTE SENSING SCENE CLASSIFICATION
1311U-Convnext Network for Infrared Small Target Detection
1423UIMT: A framework for improving unimodal inference via multimodal training
2407UNCALIBRATED AND UNSUPERVISED PHOTOMETRIC STEREO WITH PIECEWISE REGULARIZER
2079UNCERTAINTY-AWARE AB3DMOT BY VARIATIONAL 3D OBJECT DETECTION
2318Uncovering communities of pipelines in the task-fMRI analytical space
2496Underwater Change Detection using Multiple Sampling-based Probabilistic Learner and Feature Preservance Discriminator
2628UniCrowd Simulator: Visual and Behavioral Fidelity for the Generation of Crowd Datasets
1257Universal Black-box Adversarial Patch Attack with Optimized Genetic Algorithm
1824UNLEASHING FINE-COARSE CURVE PERCEPTION VIA TRUNK-BRANCH PERTURBATION
2744UNLEASHING THE POWER OF GENERALIZED ITERATIVE CLOSEST POINT FOR SWIFT AND EFFECTIVE POINT CLOUD REGISTRATION
2234UNROLLED PROJECTED GRADIENT ALGORITHM FOR STAIN SEPARATION IN DIGITAL HISTOPATHOLOGICAL IMAGES
1965UNSUPERVISED COORDINATE-BASED VIDEO DENOISING
1169UNSUPERVISED DOMAIN ADAPTIVE SEMANTIC SEGMENTATION BASED ON CLIP-GUIDED PROTOTYPICAL CONTRASTIVE LEARNING
1720U-TELL: UNSUPERVISED TASK EXPERT LIFELONG LEARNING
2015UTrCGAN:Uncertainty-Driven Cycle-Consistent Generative Adversarial Network for Low-Light Image Enhancement
1968VAG: VOXEL ATTENUATION GRID FOR SPARSE-VIEW CBCT RECONSTRUCTION
1562VCDSET: A NEW VEHICLE COLLISION DATASET IN ASIA COUNTRIES FOR ANTICIPATING ACCIDENTS
1050VF-NET: ROBUSTNESS VIA UNDERSTANDING DISTORTIONS AND TRANSFORMATIONS
1889Video Class-Incremental Learning with CLIP based Transformer
1417VITO: VISION TRANSFORMER OPTIMIZATION VIA KNOWLEDGE DISTILLATION ON DECODERS
1462VIZECGNET: VISUAL ECG IMAGE NETWORK FOR CARDIOVASCULAR DISEASES CLASSIFICATION WITH MULTI-MODAL TRAINING AND KNOWLEDGE DISTILLATION
1556VR-based generation of photorealistic synthetic data for training hand-object tracking models
1936WAVELET-ENHANCED CNN FOR DEPRESSION CLASSIFICATION BASED ON MRI IMAGES
2595WEATHER-AWARE DRONE-VIEW OBJECT DETECTION VIA ENVIRONMENTAL CONTEXT UNDERSTANDING
1669WHEN SELF-SUPERVISED PRE-TRAINING MEETS SINGLE IMAGE DENOISING
2093WRAPPINGNET: MESH AUTOENCODER VIA DEEP SPHERE DEFORMATION
1161YOLO-FEDER FUSIONNET: A NOVEL DEEP LEARNING ARCHITECTURE FOR DRONE DETECTION
2133YouTube SFV+HDR Quality Dataset
1988ZERO-SHOT COMPOSED IMAGE RETRIEVAL CONSIDERING QUERY-TARGET RELATIONSHIP LEVERAGING MASKED IMAGE-TEXT PAIRS
2839ZJUT-EIFD: A Synchronously Collected External and Internal Fingerprint Database