List of Accepted Papers
The following is the list of accepted ICIP 2024 papers, sorted by paper title. Use your web browser's search feature to find your paper number. Notifications have also been sent to all authors by email. If you have not received your notification by email, please contact us at papers@2024.ieeeicip.org.
Paper Number | Paper Title |
---|---|
1148 | 3D Clothed Human Reconstruction From One In-the-wild RGB Image |
1048 | 3D SEMANTIC SCENE COMPLETION FROM A DEPTH MAP WITH UNSUPERVISED LEARNING FOR SEMANTICS PRIORITISATION |
2573 | 3D-COCO: EXTENSION OF MS-COCO DATASET FOR SCENE UNDERSTANDING AND 3D RECONSTRUCTION |
2126 | 3DLaneFormer: Rethinking Learning Views for 3D Lane Detection |
2007 | 3F-PnP: Compressive Sensing using Nonlocal Self-Similarity and Deep Learning Priors |
1616 | A 1D PLUG-AND-PLAY SYNTHETIC DATA DEEP LEARNING FOR UNDERSAMPLED MAGNETIC RESONANCE IMAGE RECONSTRUCTION |
2265 | A benchmark of variance of opinion scores in image quality assessment |
2832 | A Channel-Wise Multi-Scale Network for Single Image Super-Resolution |
1896 | A CNN-TRANSFORMER NETWORK BASED SNR GUIDED HIGH FREQUENCY RECONSTRUCTION FOR LOW LIGHT IMAGE ENHANCEMENT |
1801 | A comparative study of perceptual quality metrics for audio-driven talking head videos |
1119 | A CONFIDENCE-AWARE MATCHING STRATEGY FOR GENERALIZED MULTI-OBJECT TRACKING |
1085 | A CONTEXT-ORIENTED MULTI-SCALE NEURAL NETWORK FOR FIRE SEGMENTATION |
2136 | A CROSS DOMAIN GENERATIVE NETWORK FOR ACCELERATED MRI |
2567 | A DATASET FOR UNDERSTANDING OPEN UGC VIDEO DATASETS |
1989 | A DECODING SCHEME WITH SUCCESSIVE AGGREGATION OF MULTI-LEVEL FEATURES FOR LIGHT-WEIGHT SEMANTIC SEGMENTATION |
1588 | A DICTIONARY BASED APPROACH FOR REMOVING OUT-OF-FOCUS BLUR |
1294 | A DUAL-DOMAIN COLLABORATION NETWORK FOR VCS RECONSTRUCTION |
2662 | A FUSION-BASED APPROACH FOR BLIND CONTRAST-ENHANCED IMAGE RANKING |
1571 | A HARD CONVEX-SHAPE CONSTRAINT IN DNNS FOR OBJECT SEGMENTATION |
2439 | A HUE-PRESERVING CONTRAST ENHANCEMENT METHOD USING HISTOGRAM SPECIFICATION FOR EACH RGB COMPONENT |
2147 | A Large-capacity data hiding scheme in encrypted VVC video |
1700 | A LEARNABLE RADAR IMAGING PARADIGM DRIVEN BY DEEP GENERATIVE MODEL |
1477 | A MODULAR AND ROBUST PHYSICS-BASED APPROACH FOR LENSLESS IMAGE RECONSTRUCTION |
1261 | A MULTI-MODALITY FEATURE ENHANCEMENT METHOD BASED ON FEATURE DISENTANGLEMENT FOR SAR IMAGE TARGET DETECTION |
1745 | A MULTI-SCALE FEATURE FUSION NETWORK FOR CHIP SURFACE DEFECT DETECTION |
2437 | A NEEDLE IN A (MEDICAL) HAYSTACK: DETECTING A BIOPSY NEEDLE IN ULTRASOUND IMAGES USING VISION TRANSFORMERS |
2533 | A Neuroimaging YOLOv8-Based CAD Framework for Anosmia Grading in COVID-19 |
2838 | A New Approach in Automated Fingerprint Presentation Attack Detection Using Optical Coherence Tomography |
2434 | A NEW EFFICIENT SPLIT & MERGE ALGORITHM FOR EMBEDDED SYSTEMS |
1696 | A new fingerprinting technique for engraved binary matrix authentication |
1282 | A NEW PEOPLE-OBJECT INTERACTION DATASET AND NVS BENCHMARKS |
2731 | A NOVEL APPROACH FOR 3D RENAL SEGMENTATION USING A MODIFIED GAN MODEL AND TEXTURE ANALYSIS |
1753 | A Novel architecture for image vectorization with increasing granularity |
2466 | A Practical Calibration Method for Cameras and Multiple Line-Lasers in Light Sectioning Systems for Underwater Environments |
2449 | A PRECONDITIONING APPROACH TO OPTIMIZING SENSING MATRIX FOR IMPROVED COMPRESSED SENSING CT RECONSTRUCTION |
1966 | A REAL-WORLD SATELLITE VIDEO SUBJECTIVE QOE DATABASE |
2327 | A SELF-SUPERVISED DIFFUSION FRAMEWORK FOR FACIAL EMOTION RECOGNITION |
1596 | A SINGLE GRAPH CONVOLUTION IS ALL YOU NEED: EFFICIENT GRAYSCALE IMAGE CLASSIFICATION |
2087 | A Sparse Graph Formulation for Efficient Spectral Image Segmentation |
2531 | A SPATIO-TEMPORAL ALIGNED SUNET MODEL FOR LOW-LIGHT VIDEO ENHANCEMENT |
1747 | A STATISTICAL IMAGE REALISM SCORE FOR DEEPFAKE DETECTION |
1701 | A STUDY ON THE EFFECT OF COLOR SPACES IN LEARNED IMAGE COMPRESSION |
2116 | A Subjective Quality Evaluation of 3D Mesh with Dynamic Level of Detail in Virtual Reality |
2424 | A Text Detector Based On The Specific Text Prompt |
2241 | A TOOLKIT TO BENCHMARK POINT CLOUD QUALITY METRICS WITH MULTI-TRACK EVALUATION CRITERIA |
2392 | A Trustworthy Authentication Against Visual Master Face Dictionary Attacks (Trauma) |
1913 | AAGF: AN EFFICIENT TRANSFORMER WITH MIX-FEATURES FOR VISUAL PLACE RECOGNITION |
2411 | ACCELERATING CASCADE CLASSIFIER TRAINING WITH GENETIC ALGORITHMS FOR EDGE ML APPLICATIONS |
2370 | Accurate colon segmentation using 2D convolutional neural networks with 3D contextual information |
1731 | ACML: Attention-based Cross-Modality Learning for Cloth-Changing and Occluded Person Re-Identification |
2227 | ADAPROMPT: PROMPT TUNING WITH ADAPTIVE NEIGHBOURS FOR GENERALIZED CATEGORY DISCOVERY |
1590 | Adaptative Context Normalization: A Boost for Deep Learning in Image Processing |
1721 | ADAPTING LEARNED IMAGE CODECS TO SCREEN CONTENT VIA ADJUSTABLE TRANSFORMATIONS |
1969 | ADAPTIVE ADVERSARIAL CROSS-ENTROPY LOSS FOR SHARPNESS-AWARE MINIMIZATION |
2309 | Adaptive downsampling and spatial upconversion for point cloud compression |
2556 | ADAPTIVE SAMPLING METHOD FOR WHOLE-BODY LOW-DOSE PET RECONSTRUCTION BASED ON RECONSTRUCTION DIFFICULTY |
2455 | ADAPTIVE SPATIAL-TEMPORAL MODELLING FOR HUMAN MOTION PREDICTION |
1040 | ADAPTIVE TILT-SERIES ALIGNMENT WITH FEATURE RESAMPLING IN CRYO-ELECTRON TOMOGRAPHY |
1917 | ADAPTIVELY HIERARCHICAL QUANTIZATION VARIATIONAL AUTOENCODER BASED ON FEATURE DECOUPLING AND SEMANTIC CONSISTENCY FOR IMAGE GENERATION |
1435 | ADAPTRACK: ADAPTIVE THRESHOLDING-BASED MATCHING FOR MULTI-OBJECT TRACKING |
1519 | ADAPTXRAY: VISION TRANSFORMER AND ADAPTER IN X-RAY IMAGES FOR PROHIBITED ITEMS DETECTION |
1743 | ADAVIPRO: REGION-BASED ADAPTIVE VISUAL PROMPT FOR LARGE-SCALE MODELS ADAPTING |
1485 | Advanced Object Detection in Multibeam Forward-looking Sonar Images Using Linear Cross-Attention Techniques |
1961 | ADVANCING COLORECTAL POLYP SEGMENTATION WITH WATERSHED ALGORITHM-ENHANCED PARALLEL SELF-SUPERVISED LEARNING |
2195 | AdvART: Adversarial Art for Camouflaged Object Detection Attacks |
1713 | ADVERSARIAL DETECTION TRANSFORMER FOR KUZUSHIJI RECOGNITION |
1469 | Adversarial EM for Partially-Supervised Image-Quality Enhancement: Application to Low-Dose PET Imaging |
1367 | ADVERSARIAL ROBUSTNESS FOR DEEP METRIC LEARNING |
2545 | ADVERSARIALLY ROBUST CONTINUAL LEARNING WITH ANTI-FORGETTING LOSS |
1560 | AERIAL VIEW RIVER LANDFORM VIDEO SEGMENTATION: A WEAKLY SUPERVISED CONTEXT-AWARE TEMPORAL CONSISTENCY DISTILLATION APPROACH |
2110 | AGENT-GUIDED GAZE ESTIMATION NETWORK BY TWO-EYE ASYMMETRY EXPLORATION |
2471 | AIGCOIQA2024: PERCEPTUAL QUALITY ASSESSMENT OF AI GENERATED OMNIDIRECTIONAL IMAGES |
2009 | AI-GENERATED IMAGE DETECTION WITH WASSERSTEIN DISTANCE COMPRESSION AND DYNAMIC AGGREGATION |
1879 | ALIGNFACE: ENHANCING FACE VERIFICATION MODELS THROUGH ADAPTIVE ALIGNMENT OF POSE, EXPRESSION, AND ILLUMINATION |
2188 | ALL SKELETONS ARE CREATED EQUAL! A DOMAIN ADAPTATION TRANSFORMER TO HANDLE MULTIPLE TOPOLOGIES |
2209 | An Alpha-Divergence Approach to Robust Canonical Correlation Analysis |
1852 | An Anchor-free Contour-based Method for Instance Segmentation |
2561 | AN EXPLAINABLE SPECTRAL ANALYSIS FOR LIGHT FIELD IMAGE QUALITY ASSESSMENT |
2532 | AN IMAGE DECOMPOSITION-GUIDED NETWORK FOR IMAGE INTERPOLATION |
1681 | AN INDOOR SCENE LOCALIZATION METHOD USING GRAPHICAL SUMMARY OF MULTI-VIEW RGB-D IMAGES |
2519 | AN INTERNATIONAL STANDARD FOR ASSESSING TRUSTWORTHINESS IN MEDIA |
1565 | AN INTERPRETABLE DEEP GRAPH NEURAL NETWORK BASED ON ATTENTIONAL MULTI-SCALE FEATURE FUSION FOR FMRI ANALYSIS |
1289 | An Optimal Transport-based Method for Medical Image Generation |
2155 | ANALYZING VISIBLE ARTICULATORY MOVEMENTS IN SPEECH PRODUCTION FOR SPEECH-DRIVEN 3D FACIAL ANIMATION |
2415 | ANOMALY DETECTION FOR THE IDENTIFICATION OF VOLCANIC UNREST IN SATELLITE IMAGERY |
2544 | ANOMALY UNVEILED: SECURING IMAGE CLASSIFICATION AGAINST ADVERSARIAL PATCH ATTACKS |
1388 | APNET: GENERATING PRECISE ANOMALY PRIOR INFORMATION FOR MIXED-SUPERVISED DEFECT DETECTION |
2681 | ARE OBJECTIVE EXPLANATORY EVALUATION METRICS TRUSTWORTHY? AN ADVERSARIAL ANALYSIS |
2606 | ASSESSING VIDEO SHAKINESS: A NOVEL DATA AND PROTOCOLS FRAMEWORK |
2078 | ATAC-NET: ZOOMED VIEW WORKS BETTER FOR ANOMALY DETECTION |
2791 | ATTENTION DOWN-SAMPLING TRANSFORMER, RELATIVE RANKING AND SELF-CONSISTENCY FOR BLIND IMAGE QUALITY ASSESSMENT |
1511 | ATTENTION ENHANCEMENT WITH PARALLEL GROUPS FOR REMOTE SENSING OBJECT DETECTION |
2072 | ATTENTION-BASED FEW-SHOT DIAGNOSIS OF CHEST X-RAYS USING SEMANTIC SIGNATURES |
1413 | ATU-NET: AN ADAPTIVE TRANSFORMATION-BASED U-NET FOR MEDICAL IMAGE SEGMENTATION |
2582 | Automated Segmentation of Lung Regions in 3D CT Scans Using Hybrid Unsupervised-Supervised Models |
2821 | Automatic Point Cloud Registration for 3D Virtual-to-Real Registration Using Macro and Micro Structures |
1587 | BAYESIAN BLIND IMAGE DECONVOLUTION USING AN HYPERBOLIC-SECANT PRIOR |
2823 | BAYGO: DECENTRALIZED BAYESIAN LEARNING AND INFORMATION-AWARE GRAPH OPTIMIZATION FRAMEWORK |
1882 | BIDFUSE: HARNESSING BI-DIRECTIONAL ATTENTION WITH MODALITY-SPECIFIC ENCODERS FOR INFRARED-VISIBLE IMAGE FUSION |
2779 | BI-DIRECTIONAL TRACKLET EMBEDDING FOR MULTI-OBJECT TRACKING |
1151 | BINARY-DECOMPOSED VISION TRANSFORMER: COMPRESSING AND ACCELERATING VISION TRANSFORMER BY BINARY DECOMPOSITION |
1352 | BI-PREDICTIVE INTRA BLOCK COPY FOR ENHANCED VIDEO CODING BEYOND VVC |
2169 | BLEND & PREDICT: DOMAIN-ADAPTABLE FEW-SHOT LEARNING FOR MICROSCOPY IMAGING |
2139 | BMT-BENCH: A BENCHMARK SPORTS DATASET FOR VIDEO GENERATION |
1547 | BOX-LEVEL CLASS-BALANCED SAMPLING FOR ACTIVE OBJECT DETECTION |
1880 | BRI3L: A BRIGHTNESS ILLUSION IMAGE DATASET FOR IDENTIFICATION AND LOCALIZATION OF REGIONS OF ILLUSORY PERCEPTION |
2311 | BuRnSNet: BURN REGION SEGMENTATION NETWORK FROM COLOR IMAGES WITH TWO-WAY CNN |
2201 | B-WALK: BERNOULLI PRINCIPLE GUIDED BIASED RANDOM WALK FOR CURVE CONNECTION |
2529 | CAFCT-NET: A CNN-TRANSFORMER HYBRID NETWORK WITH CONTEXTUAL AND ATTENTIONAL FEATURE FUSION FOR LIVER TUMOR SEGMENTATION |
1923 | CAMERA CALIBRATION THROUGH GEOMETRIC CONSTRAINTS FROM ROTATION AND PROJECTION MATRICES |
1903 | CAMOUFLAGED OBJECT DETECTION VIA STYLE TRANSFER-BASED DATA AUGMENTATION |
1946 | CAPTIV8: A comprehensive large scale CAPsule endoscopy dataset for Integrated diagnosis |
2070 | CASCADING UNKNOWN DETECTION WITH KNOWN CLASSIFICATION FOR OPEN SET RECOGNITION |
1304 | CASeg: CLIP-based Action Segmentation with learnable text prompt |
1911 | Category-Agnostic Pose Estimation for Point Clouds |
1281 | CELL CYCLE STATE PREDICTION USING GRAPH NEURAL NETWORKS |
1984 | CENTERRADARNET: JOINT 3D OBJECT DETECTION AND TRACKING FRAMEWORK USING 4D FMCW RADAR |
1771 | CHARACTERIZATION OF DIM LIGHT RESPONSE IN DVS PIXEL: DISCONTINUITY OF EVENT TRIGGERING TIME |
2588 | ChatGPT and Biometrics: An Assessment of Face Recognition, Gender Detection, and Age Estimation Capabilities |
2566 | Class-Specific Channel Attention for Few Shot Learning |
1739 | ClearDepth: Addressing Depth Distortions Caused by Eyelashes for Accurate Geometric Gaze Estimation on Mobile Devices |
2351 | CLIFS: CLIP-DRIVEN FEW-SHOT LEARNING FOR BAGGAGE THREAT CLASSIFICATION |
1617 | CLIP-BASED COMPOSITION-AWARE IMAGE CROPPING |
2274 | CLIP-MEDFAKE: SYNTHETIC DATA AUGMENTATION WITH AI-GENERATED CONTENT FOR IMPROVED MEDICAL IMAGE CLASSIFICATION |
2640 | CLOUDS AND HAZE CO-REMOVAL BASED ON WEIGHT-TUNED OVERLAP REFINEMENT DIFFUSION MODEL FOR REMOTE SENSING IMAGES |
1869 | CM²-NET: CONTINUAL CROSS-MODAL MAPPING NETWORK FOR DRIVER ACTION RECOGNITION |
1920 | CO2WOUNDS-V2: EXTENDED CHRONIC WOUNDS DATASET FROM LEPROSY PATIENTS |
1155 | COARSE-FINE SPECTRAL-AWARE DEFORMABLE CONVOLUTION FOR HYPERSPECTRAL IMAGE RECONSTRUCTION |
1156 | COARSE-TO-FINE SPATIO-TEMPORAL LUMINANCE-AWARE RECONSTRUCTION FOR HIGH-SPEED MOTION SCENE |
1970 | CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost Microscopes |
2474 | Collaborative Intelligence for Vision Transformers: A Token Sparsity-Driven Edge-Cloud Framework |
2555 | COMBINING RAFT-BASED STEREO DISPARITY AND OPTICAL FLOW MODELS FOR SCENE FLOW ESTIMATION |
2686 | COMPARISON OF CROWDSOURCING AND LABORATORY SETTINGS FOR SUBJECTIVE ASSESSMENT OF VIDEO QUALITY AND ACCEPTABILITY & ANNOYANCE |
2030 | COMPETITIVE LEARNING FOR ACHIEVING CONTENT-SPECIFIC FILTERS IN VIDEO CODING FOR MACHINES |
2615 | COMPRESSION-AWARE TUNING FOR COMPRESSING VOLUMETRIC RADIANCE FIELDS |
1722 | COMPUTATIONALLY EFFICIENT KALMAN FILTER FRAMEWORK FOR INTRA-FRAME IMAGE RECONSTRUCTION WITH A ROLLING SHUTTER CAMERA |
2042 | CONDITIONAL OPTIMAL FILTER SELECTION FOR MULTISPECTRAL OBJECT CLASSIFICATION |
1197 | Conditional Past Experience Generation for Dark Continual Learning |
1861 | Confidence Aware Stereo Matching for Realistic Cluttered Scenario |
2621 | CONSTRUCTING AN INTERPRETABLE DEEP DENOISER BY UNROLLING GRAPH LAPLACIAN REGULARIZER |
2031 | Content-Aware Supervision for Diffusion-Based Restoration of Extremely Compressed Background for VCM |
1635 | Context-adaptive Entropy Model With Adapters For Lossless Point Cloud Geometry Compression |
1569 | CONTEXTUALITY HELPS REPRESENTATION LEARNING FOR GENERALIZED CATEGORY DISCOVERY |
1509 | CONTINUAL ROAD-SCENE SEMANTIC SEGMENTATION VIA FEATURE-ALIGNED SYMMETRIC MULTI-MODAL NETWORK |
1524 | CONTOUR-WEIGHTED LOSS FOR CLASS-IMBALANCED IMAGE SEGMENTATION |
1323 | CONTRAST-GUIDED WIREFRAME PARSING |
1024 | Controllable Unsupervised Event-based Video Generation |
1420 | Convex-hull Estimation using XPSNR for Versatile Video Coding |
2570 | CONVOLUTIONAL NEURAL NETWORK WITH LEARNABLE MASKS FOR EIT BASED TACTILE SENSING |
1443 | CORRELATION-AWARE JOINT PRUNING-QUANTIZATION USING GRAPH NEURAL NETWORKS |
1170 | COUNTING REPETITIVE ACTIONS IN EVENT STREAM |
2380 | CROCOS-V1: ENHANCING MASK LEAKAGE AND BOUNDING BOX LOCALIZATION FOR REAL-TIME CROP/WEED INSTANCE SEGMENTATION |
1069 | CROSS-ACTION CROSS-SUBJECT SKELETON ACTION RECOGNITION VIA SIMULTANEOUS ACTION-SUBJECT LEARNING WITH TWO-STEP FEATURE REMOVAL |
1994 | CROSS-DOMAIN FEW-SHOT IN-CONTEXT LEARNING FOR ENHANCING TRAFFIC SIGN RECOGNITION |
2175 | CROSS-FUSION OF BAND-SPECIFIC SPECTRAL FEATURES FOR MULTI-BAND NIR COLORIZATION |
2740 | CROSS-MODAL ALIGNMENT OF LOCAL AND GLOBAL FEATURES FOR ZERO-SHOT CHINESE CHARACTER RECOGNITION |
1041 | CROWDASSIGN: A LABEL ASSIGNMENT SCHEME FOR PEDESTRIAN DETECTION IN CROWDED SCENES |
2542 | CST-YOLO: A NOVEL METHOD FOR BLOOD CELL DETECTION BASED ON IMPROVED YOLOV7 AND CNN-SWIN TRANSFORMER |
1321 | DALSM: A DIRECTION-AWARE LINE SEGMENT MATCHING METHOD |
2368 | DAPlankton: Benchmark Dataset for Multi-instrument Plankton Recognition via Fine-grained Domain Adaptation |
2574 | DCCM: Dual Data Consistency Guided Consistency Model for Inverse Problems |
1803 | DCCTNET: KIDNEY TUMORS SEGMENTATION BASED ON DUAL-LEVEL COMBINATION OF CNN AND TRANSFORMER |
2494 | DECLOUDING OF SATELLITE IMAGES FOR CROP GROWTH MONITORING VIA UNROLLING OF GRADIENT GRAPH LAPLACIAN REGULARIZER |
1251 | DECOMPL: DECOMPOSITIONAL LEARNING WITH ATTENTION POOLING FOR GROUP ACTIVITY RECOGNITION FROM A SINGLE VOLLEYBALL IMAGE |
1765 | Decoupling Domain Invariance and Variance with Tailored Prompts for Open-Set Domain Adaptation |
1860 | DEEP CONVOLUTIONAL NEURAL NETWORK PREDICTION FOR GLAUCOMA DETECTION USING OCT AND OCT-ANGIOGRAPHY DISC- AND MACULA-CENTERED IMAGES AND THEIR COMBINED POWER |
1152 | Deep Fusion of Visible and Near Infrared Images for Registration and Defogging Using Cross Modal Transformer |
1599 | DEEP LEARNING APPROACH FOR RENAL CELL CARCINOMA DETECTION, SUBTYPING, AND GRADING |
2347 | Deep Learning-Based Leaf Image Analysis for Tomato Plant Disease Detection and Classification |
2285 | Deep Multi-Graph Embedded Clustering for Community Detection in fMRI Functional Brain Networks Across Individuals |
1685 | Deep optical flow learning with deformable large-kernel Cross-attention |
2366 | DEEP REGULARIZATION FOR SCALE-AGNOSTIC SUPERRESOLUTION OF MR IMAGES |
2481 | Deep Spectral Siamese Network for Heterogeneous Object Verification in Amazon Robotic Warehouse |
2565 | DEEPFAKE DETECTION VIA SEPARABLE SELF-CONSISTENCY LEARNING |
1496 | DEEPFAKE DETECTION WITH COMBINED UNSUPERVISED-SUPERVISED CONTRASTIVE LEARNING |
1273 | DEEP-LEARNING-BASED MAGNETIC RESONANCE SIMULTANEOUS MULTISLICE IMAGING USING HOLOGRAPHIC IMAGE DECODING |
2213 | DEEPSKINFORMER: SKIN LESION SEGMENTATION USING HIERARCHICAL TRANSFORMERS AND EDGE ENHANCEMENT |
1132 | DEFENDING AGAINST PHYSICAL ADVERSARIAL PATCH ATTACKS ON INFRARED HUMAN DETECTION |
2670 | DELVING INTO THE EXPLAINABILITY OF PROTOTYPE-BASED CNN FOR BIOLOGICAL CELL ANALYSIS |
1444 | DENSITY-GUIDED DENSE PSEUDO LABEL SELECTION FOR SEMI-SUPERVISED ORIENTED OBJECT DETECTION |
1208 | Detectability of Defects in the Presence of Linear Nuisance Parameters and Images Signal-Dependent Noise |
2103 | Detecting Biomedical Copy-Move Forgery by Attention-based Multiscale Deep Descriptors |
2596 | DIRECTIONAL AND TOPOLOGICAL TRANSFORMER WITH TOPOLOGY PRIORS FOR 4D CELLULAR IMAGE SEGMENTATION |
1047 | DIRECTIONAL ANTENNA SYSTEMS FOR LONG-RANGE THROUGH-WALL HUMAN ACTIVITY RECOGNITION |
1356 | DISENTANGLED KNOWLEDGE DISTILLATION FOR UNIFIED MULTI-CLASS ANOMALY DETECTION |
2319 | Distinctive Image Captioning: Leveraging Ground Truth Captions in CLIP Guided Reinforcement Learning |
1359 | DIVERSIFIED TASK AUGMENTATION WITH REDUNDANCY REDUCTION FOR CROSS-DOMAIN FEW-SHOT LEARNING |
1094 | DIVERSIFYING DEEP ENSEMBLES: A SALIENCY MAP APPROACH FOR ENHANCED OOD DETECTION, CALIBRATION, AND ACCURACY |
1194 | DOMAIN DILATION FOR SINGLE DOMAIN GENERALIZATION |
2092 | DRAFT - DISTILLED RECURRENT ALL-PAIRS FIELD TRANSFORMS FOR OPTICAL FLOW |
2350 | Driving through Graphs: A Bipartite Graph for Traffic Scene Analysis |
2534 | DTPose: Learning Disentangled Token Representation for Effective Human Pose Estimation |
1009 | DTSN: NO-REFERENCE IMAGE QUALITY ASSESSMENT VIA DEFORMABLE TRANSFORMER AND SEMANTIC NETWORK |
1606 | DUAL ATTENTION ENHANCED TRANSFORMER FOR IMAGE DEFOCUS DEBLURRING |
1637 | DUAL MULTI-MODAL FEATURE FUSION NETWORK FOR THE EVALUATION OF OSTEOSARCOMA |
1360 | Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction |
2194 | DYNAMIC ACTIVATION FUNCTION BASED ON THE BRANCHING PROCESS AND ITS APPLICATION IN IMAGE CLASSIFICATION |
1490 | Dynamic MRI reconstruction using low-rank plus sparse decomposition with smoothness regularization |
1236 | E2GS: EVENT ENHANCED GAUSSIAN SPLATTING |
1776 | E2SIFT: NEUROMORPHIC SIFT VIA DIRECT FEATURE PYRAMID RECOVERY FROM EVENTS |
2021 | Early prediction of the transferability of bovine embryos from videomicroscopy |
1825 | EarthquakeNet: A High-Resolution UAV-Based Dataset for Earthquake Damage Assessment |
1399 | ECAP: EXTENSIVE CUT-AND-PASTE AUGMENTATION FOR UNSUPERVISED DOMAIN ADAPTIVE SEMANTIC SEGMENTATION |
1870 | EDGE-GUIDED PIXEL LEVEL CONNECTED COMPONENT ASSISTED CAMOUFLAGED OBJECT DETECTION |
1677 | Edge-Reserved Knowledge Distillation for Image Matting |
1609 | EFFICIENT BLACK-BOX ADVERSARIAL ATTACK ON DEEP CLUSTERING MODELS |
1070 | EFFICIENT CIRCULAR AND CONFOCAL NON-LINE-OF-SIGHT IMAGING WITH TRANSIENT SINOGRAM SUPER RESOLUTION |
1527 | EFFICIENT LEARNED WAVELET IMAGE AND VIDEO CODING |
1958 | EFFICIENT SEMANTIC SEGMENTATION FOR AERIAL IMAGERY USING QUERY POINTS AND SUPERPIXEL SUPERVISION |
2672 | EFFICIENT VISUAL QUESTION ANSWERING ON EMBEDDED DEVICES: CROSS-MODALITY ATTENTION WITH EVOLUTIONARY QUANTIZATION |
1960 | EMBEDDING ATTENTION BLOCKS FOR ANSWER GROUNDING |
1769 | Empirical Research on Quantization for 3D Multi-Modal ViT models |
2102 | End-to-end Learned Lossy Dynamic Point Cloud Attribute Compression |
1835 | End-to-End Learned Scalable Multilayer Feature Compression for Machine Vision Tasks |
2359 | ENERGY REDUCTION OPPORTUNITIES IN HDR VIDEO ENCODING |
1234 | Enhanced Detection of Small Objects in Aerial Imagery: A High-Resolution Neural Network Approach with Amplified Feature Pyramid and Sigmoid Re-weighting |
2745 | ENHANCED FACIAL RESTORATION WITH MISINFORMATION-FILTERED GUIDE-DENOISING DIFFUSION PROBABILISTIC MODELS |
1810 | ENHANCED PROTOTYPICAL PART NETWORK (EPPNET) FOR EXPLAINABLE IMAGE CLASSIFICATION VIA PROTOTYPES |
2282 | Enhancing Intubation Accuracy: Advanced Tracheal Segmentation Techniques in Video Endoscopy |
2726 | ENHANCING PERCEPTUAL QUALITY ASSESSMENT FOR 360-DEGREE IMAGES BASED ON ADAPTIVE PATCH LABELING AND MULTI-LABEL LEARNING |
2337 | ENHANCING TMIV PERFORMANCE THROUGH PROXIMITY-AWARE GROUPING AND PRESERVATION OF SMALL CLUSTERS |
2817 | ENN: A NEURAL NETWORK WITH DCT ADAPTIVE ACTIVATION FUNCTIONS |
1899 | ENSEMBLE OF DEEP VARIATIONAL MIXTURE MODELS FOR UNSUPERVISED CLUSTERING |
1384 | ESTATE: EXPERT-GUIDED STATE TEXT ENHANCEMENT FOR ZERO-SHOT INDUSTRIAL ANOMALY DETECTION |
1836 | ESTIMATING INDOOR SCENE DEPTH MAPS FROM ULTRASONIC ECHOES |
1673 | ET: EXPLAIN TO TRAIN: LEVERAGING EXPLANATIONS TO ENHANCE THE TRAINING OF A MULTIMODAL TRANSFORMER |
2698 | EVALUATING 3D HUMAN POSE ESTIMATION IN OCCLUDED MULTI-SENSOR SCENARIOS: DATASET AND ANNOTATION APPROACH |
1295 | EVENT-SPECIFIC EEG-FNIRS FEATURE FUSION FOR ALZHEIMER’S DISEASE CLASSIFICATION |
1385 | EXPLAINING 3D OBJECT DETECTION THROUGH SHAPLEY VALUE-BASED ATTRIBUTION MAP |
2442 | EXPLAINING REPRESENTATION LEARNING WITH PERCEPTUAL COMPONENTS |
2397 | EXPLOITING CHANGE BLINDNESS TO REDUCE BITRATE AND DISPLAY LUMINANCE IN VIDEO STREAMING |
1580 | EXPLORING ATTENTION MECHANISMS IN INTEGRATION OF MULTI-MODAL INFORMATION FOR SIGN LANGUAGE RECOGNITION AND TRANSLATION |
1274 | EXPLORING SALIENCY BIAS IN MANIPULATION DETECTION |
2736 | EXPLORING THE IMPACT OF MOIRE PATTERN ON DEEPFAKE DETECTORS |
2196 | EXPLORING THE POTENTIAL OF RECURRENCE QUANTIFICATION ANALYSIS FOR VIDEO ANALYSIS AND MOTION DETECTION |
2355 | Exploring the Potential of Synthetic Data to Replace Real Data |
2775 | EXPOSING THE LIMITS OF DEEPFAKE DETECTION USING NOVEL FACIAL MOLE ATTACK: A PERCEPTUAL BLACK-BOX ADVERSARIAL ATTACK STUDY |
1999 | Extended multiple cross-component linear models with adaptive thresholding and overlapped averaging beyond VVC |
1252 | EXTENDING SEGMENT ANYTHING MODEL INTO AUDITORY AND TEMPORAL DIMENSIONS FOR AUDIO-VISUAL SEGMENTATION |
1900 | Face Drawing GAN by Channel Attention and Matrix Product Attention |
1077 | FACE MORPHING DETECTION IN SOCIAL MEDIA CONTENT |
1615 | FACTORIZED EMBEDDING GRAPH MATCHING NETWORK FOR LEARNING LAWLER’S QUADRATIC ASSIGNMENT PROBLEM |
2051 | FANET: FEATURE AMPLIFICATION NETWORK FOR SEMANTIC SEGMENTATION IN CLUTTERED BACKGROUND |
2707 | FANTOM: Federated Adversarial Network for Training Multi-sequence Magnetic Resonance Imaging in Semantic Segmentation |
1526 | FAST CODING MODE PREDICTION FOR INTRA PREDICTION IN VVC SCC |
1950 | FAST CONSTANT-QUALITY VIDEO ENCODING USING VVENC WITH RATE CAPPING BASED ON PRE-ANALYSIS STATISTICS |
1531 | FAST EDGE-AWARE OCCLUSION DETECTION IN THE CONTEXT OF MULTISPECTRAL CAMERA ARRAYS |
1343 | FAST INTER MODE DECISION WITH RESOLUTION SAMPLING FOR VVC 360-DEGREE VIDEO CODING |
1318 | FAST TEMPLATE MATCHING-BASED REFERENCE PICTURE PADDING FOR VIDEO CODING |
1162 | FAST UNSUPERVISED TENSOR RESTORATION VIA LOW-RANK DECONVOLUTION |
2665 | FAWN: FLOOR-AND-WALLS NORMAL REGULARIZATION FOR DIRECT NEURAL TSDF RECONSTRUCTION |
2690 | FC3DNET: A FULLY CONNECTED ENCODER-DECODER FOR EFFICIENT DEMOIRÉING |
2024 | Feature Decomposition Transformers for Infrared and Visible Image Fusion |
1438 | FEATURE ENHANCED LEARNING IMAGE COMPRESSION WITH RECURRENT CRISS-CROSS ATTENTION |
1530 | FEATURES DISENTANGLEMENT FOR EXPLAINABLE CONVOLUTIONAL NEURAL NETWORKS |
1337 | FedAwa: Aggregation Weight Adjustment in Federated Domain Generalization |
1791 | FedMI: A FEDERATED LEARNING FRAMEWORK FOR SECURE SHARING OF MEDICAL IMAGES |
1216 | FINE-DETAILED NEURAL INDOOR SCENE RECONSTRUCTION USING MULTI-LEVEL IMPORTANCE SAMPLING AND MULTI-VIEW CONSISTENCY |
1451 | FINE-TUNING TEXT-TO-IMAGE DIFFUSION MODELS FOR CLASS-WISE SPURIOUS FEATURE GENERATION |
2593 | FISHEYE STEREO CAMERA USING FISHEYE VERTICAL STEREO METHOD |
1909 | FLEXAE: A SELF-CONDITIONED DETECTOR TO PREVENT MODEL OVERFITTING FOR UNSUPERVISED VIDEO ANOMALY DETECTION |
1878 | FOOD: FACIAL AUTHENTICATION AND OUT-OF-DISTRIBUTION DETECTION WITH SHORT-RANGE FMCW RADAR |
1732 | FOOTBOTS: A TRANSFORMER-BASED ARCHITECTURE FOR MOTION PREDICTION IN SOCCER |
2727 | FOURIER PTYCHOGRAPHY MICROSCOPY WITH INTEGRATED POSITIONAL MISALIGNMENT CORRECTION |
2386 | Fourier Ptychography with Information Entropy Based No-Reference Image Quality Assessment Learning |
2705 | FREQ-MIP-AA: FREQUENCY MIP REPRESENTATION FOR ANTI-ALIASING NEURAL RADIANCE FIELDS |
2381 | FREQUENCY-SPATIAL DOMAIN INFORMATION FUSION NETWORK FOR PAN-SHARPENING |
2182 | FULL-REFERENCE POINT CLOUD QUALITY ASSESSMENT USING SPECTRAL GRAPH WAVELETS |
1826 | FUSION OF INDEPENDENT AND INTERACTIVE FEATURES FOR HUMAN-OBJECT INTERACTION DETECTION |
2388 | GABIC: GRAPH-BASED ATTENTION BLOCK FOR IMAGE COMPRESSION |
1508 | GABOR FEATURE NETWORK FOR TRANSFORMER-BASED BUILDING CHANGE DETECTION MODEL IN REMOTE SENSING |
1850 | GaitGS: Temporal Feature Learning in Granularity and Span Dimension for Gait Recognition |
2205 | GEEG-YOLOV8: GAUSSIAN ENHANCED EUCLIDEAN NORM GHOST ATTENTION FOR REAL-TIME POLYP DETECTION |
1075 | GENERALIZED NESTED LATENT VARIABLE MODELS FOR LOSSY CODING APPLIED TO WIND TURBINE SCENARIOS |
2216 | GENERATE DSLR-LIKE IMAGE WITH GLOBAL INFORMATION AND PRIOR GUIDED ISP |
1821 | Generative Visual Compression: A Review |
1270 | GENGMM: GENERALIZED GAUSSIAN-MIXTURE-BASED DOMAIN ADAPTATION MODEL FOR SEMANTIC SEGMENTATION |
1231 | GIRAFFE: A GENETIC PROGRAMMING ALGORITHM TO BUILD DEEP LEARNING ENSEMBLES FOR ECG ARRHYTHMIA CLASSIFICATION |
1222 | GradTrans: Transformer-based Gradient Guidance for Image Generation |
1646 | GRAPH CONVOLUTIONAL NETWORKS WITH MINIMAL APPEARANCE INFORMATION FOR ACTION RECOGNITION |
1592 | GRAPHIC - Graph-based Representation for Analyzing People's High-level Interactions in Crowds |
1605 | GUIDED CONTEXT GATING: LEARNING TO LEVERAGE SALIENT LESIONS IN RETINAL FUNDUS IMAGES |
1453 | GUMBEL-NERF: REPRESENTING UNSEEN OBJECTS AS PART-COMPOSITIONAL NEURAL RADIANCE FIELDS |
2689 | HAND-OBJECT RECONSTRUCTION VIA INTERACTION-AWARE GRAPH ATTENTION MECHANISM |
2043 | HDPLIFTER: HIERARCHICAL DYNAMICS PERCEPTION FOR 2D-TO-3D HUMAN POSE LIFTING |
2477 | Hierarchical Vertex-wise Intensification Graph Convolution for Skeleton-based Activity Recognition |
1405 | HIGHLY CONSTRAINED CODED APERTURE IMAGING SYSTEMS DESIGN VIA A KNOWLEDGE DISTILLATION APPROACH |
1947 | HistoHDR-Net: Histogram Equalization for Single LDR to HDR Image Translation |
1089 | HoloGesture: A Multimodal Dataset for Hand Gesture Recognition Robust to Hand Textures on Head-Mounted Mixed-Reality Devices |
2487 | HOW TO TRAIN YOUR VAE |
2688 | HYBRID SINGLE INPUT AND MULTIPLE OUTPUT METHOD FOR COMPRESSING FEATURES TOWARDS MACHINE VISION TASKS |
2044 | HYPERSPECTRAL IMAGE CLASSIFICATION WITH FUZZY SPATIAL-SPECTRAL CLASS DISCRIMINATE INFORMATION |
1651 | ILLUMINATION-ENHANCED INFRARED AND LOW-LIGHT VISIBLE IMAGE FUSION |
2266 | IMAGE CODING FOR MACHINE VIA ANALYTICS-DRIVEN APPEARANCE REDUNDANCY REDUCTION |
2589 | Image Coding for Machines with Edge Information Learning Using Segment Anything |
2338 | Imbalanced data robust online continual learning based on evolving class aware memory selection and built-in contrastive representation learning |
1611 | IMPROVEMENT OF IMAGE RECONSTRUCTION FOR MRI USING PHASE-SCRAMBLING FOURIER TRANSFORM AND DUAL-DOMAIN STRATEGY |
2094 | Improving Automatic Target Recognition with Infrared Imagery using Vision Transformers and Focused Data Augmentation |
2669 | IMPROVING IMAGE CODING FOR MACHINES THROUGH OPTIMIZING ENCODER VIA AUXILIARY LOSS |
1403 | IMPROVING IMAGE DE-RAINING USING REFERENCE-GUIDED TRANSFORMERS |
1345 | IMPROVING REAL-TIME NEAR-INFRARED FACE ALIGNMENT WITH A PAIRED VIS-NIR DATASET AND DATA AUGMENTATION THROUGH IMAGE-TO-IMAGE TRANSLATION |
1584 | IMPROVING SELF-SUPERVISED VISION TRANSFORMERS FOR VISUAL CONTROL |
1245 | IMU-ASSISTED TARGET-FREE EXTRINSIC CALIBRATION OF HETEROGENEOUS LIDARS BASED ON CONTINUOUS-TIME OPTIMIZATION |
1724 | INCREASING TRUST IN IMAGE ANALYSIS BY DETECTING TRELLIS QUANTIZATION IN JPEG IMAGES |
1991 | IN-LOOP FILTER FOR OBJECT MASK CODING IN VERSATILE VIDEO CODING |
1907 | INSTANCE-AWARE UNCERTAINTY FOR ACTIVE LEARNING IN OBJECT DETECTION |
1190 | Integrating Vision-Language Supervision for Uniform Appearance Tracking |
2423 | INTELLIGENT MULTI-VIEW TEST TIME AUGMENTATION |
2528 | INTERACTIVE TEACHING FOR FINE-GRANULAR FEW-SHOT OBJECT RECOGNITION USING VISION TRANSFORMERS |
2099 | INTERPRETING THE FRAUDULENCE LEVEL OF DIFFERENT FINGER PHOTO PRESENTATION ATTACK INSTRUMENTS |
2637 | INTRINSIC IMAGE DECOMPOSITION BASED ON QUANTIZED PRIOR CODEBOOK |
2819 | INVERTIBLE ENERGY-AWARE IMAGES |
2425 | INVESTIGATING AND REDUCING THE IMPAIRMENT OF POINT SPREAD EFFECT FOR SPATIOTEMPORAL FUSION OF REMOTE SENSING IMAGERY |
2771 | Investigating Self-Supervised Methods for Label-Efficient Learning |
2353 | Joint End-to-End Image Compression and Denoising: Leveraging Contrastive Learning and Multi-Scale Self-ONNs |
1211 | JOINT IMAGE RESTORATION FOR DOMAIN ADAPTIVE OBJECT DETECTION IN FOGGY WEATHER CONDITION |
1649 | JOINTRF: END-TO-END JOINT OPTIMIZATION FOR DYNAMIC NEURAL RADIANCE FIELD REPRESENTATION AND COMPRESSION |
1474 | JPEG Image Ciphering based on Chaotic Encryption |
2577 | KNOWLEDGE-INFUSED LEARNING FOR FINE-GRAINED PLANT DISEASE RECOGNITION |
2074 | Koopcon: A new approach towards smarter and less complex learning |
2177 | LAND USE CLASSIFICATION VIA MULTI-MODAL COMPLEMENTARY FEATURE FUSION AND CONTEXT INFORMATION ENHANCEMENT FOR OPTICAL AND SAR IMAGES |
2256 | LATENT ENHANCING AUTOENCODER FOR OCCLUDED IMAGE CLASSIFICATION |
1380 | Learn by an Example Transformer for Domain Generalization in Video Object Segmentation |
1823 | Learned Compression of Encoding Distributions |
2218 | LEARNED IMAGE COMPRESSION FOR BOTH HUMANS AND MACHINES VIA DYNAMIC ADAPTATION |
1749 | Learned Image Compression Using a Long and Short Attention Module |
1172 | Learned Image Compression with Text Quality Enhancement |
2272 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression |
1840 | LEARNING A RAIN-INVARIANT NETWORK FOR INSTANCE SEGMENTATION IN THE RAIN |
2081 | Learning Orthonormal Features in Self-Supervised Learning using Functional Maximal Correlation |
1042 | Learning Temporal Cues for Fine-grained Action Recognition |
2260 | LEARNING WITH INSTANCE-DEPENDENT NOISY LABELS BY ANCHOR HALLUCINATION AND HARD SAMPLE LABEL CORRECTION |
2428 | LEARNING-BASED POINT CLOUD DECODING WITH INDEPENDENT AND SCALABLE REDUCED COMPLEXITY |
1610 | LEARNING-BASED VIDEO COMPRESSION WITH CONTINUOUSLY VARIABLE BITRATE CODING |
1814 | LEGIT: TEXT LEGIBILITY FOR USER-GENERATED MEDIA |
2082 | LENSLESS PHASE RETRIEVAL WITH REGULARIZATION BY BLIND NOISE MAP ESTIMATION AND DENOISING |
2630 | LERCPOSE: LEARNED RANKING AND CONTRASTIVE LOSS FOR ROBUST HEAD POSE ESTIMATION |
1521 | LEVERAGING GENERATED IMAGE CAPTIONS FOR VISUAL COMMONSENSE REASONING |
1634 | LFGN: Low-level Feature-Guided Network for Adversarial Defense |
1927 | LiCAF: LiDAR-Camera Asymmetric Fusion for Gait Recognition |
2613 | LIDAR DEPTH MAP GUIDED IMAGE COMPRESSION MODEL |
2086 | LIGHTWEIGHT RECURRENT NEURAL NETWORK FOR IMAGE SUPER-RESOLUTION |
1657 | LIGHT-WEIGHT SELF-SUPERVISED CONTRASTIVE LEARNING NETWORK FOR SMALL SAMPLE HYPERSPECTRAL IMAGE CLASSIFICATION |
2118 | LIGHTWEIGHT UNDERWATER IMAGE ENHANCEMENT VIA IMPULSE RESPONSE OF LOW-PASS FILTER BASED ATTENTION NETWORK |
1445 | LIPFACE: LIPSCHITZ-CONDITIONED FOR RESOLUTION ROBUST FACE RECOGNITION |
1039 | LISD: AN EFFICIENT MULTI-TASK LEARNING FRAMEWORK FOR LIDAR SEGMENTATION AND DETECTION |
1979 | LMBF-Net: A Lightweight Multipath Bidirectional Focal Attention Network for Multifeatures Segmentation |
2154 | Localization of Image Splicing Under Segment Anything Model with Integrated Compression and Edge Artifacts |
2389 | LOCALIZING MOMENTS OF ACTIONS IN UNTRIMMED VIDEOS OF INFANTS WITH AUTISM SPECTRUM DISORDER |
2248 | LONG-TERM GEO-POSITIONED RE-IDENTIFICATION DATASET OF URBAN ELEMENTS |
1259 | LOW-RANK MATRIX AND TENSOR DECOMPOSITION USING RANDOMIZED TWO-SIDED SUBSPACE ITERATION WITH APPLICATION TO VIDEO RECONSTRUCTION |
1442 | LRDif: Diffusion Models for Under-Display Camera Emotion Recognition |
2304 | LSDM-PCB: A Lightweight Small Defect Detection Model For Printed Circuit Board |
2129 | LUMINATE: LINGUISTIC UNDERSTANDING AND MULTI-GRANULARITY INTERACTION FOR VIDEO OBJECT SEGMENTATION |
2597 | LWIRPOSE: A novel Long Wave Infrared Thermal Image Pose Dataset and Benchmark |
1604 | M3T: MULTI-MODAL MEDICAL TRANSFORMER TO BRIDGE CLINICAL CONTEXT WITH VISUAL INSIGHTS FOR RETINAL IMAGE MEDICAL DESCRIPTION GENERATION |
2633 | MAMBA-PCGC: MAMBA-BASED POINT CLOUD GEOMETRY COMPRESSION |
1663 | MASK-BASED INVISIBLE BACKDOOR ATTACKS ON OBJECT DETECTION |
1577 | Masked Momentum Contrastive Learning for Semantic Understanding by Observation |
2602 | Masked Signal Modeling for Plastic Waste Resin Classification |
1489 | MAVAD: AUDIO-VISUAL DATASET AND METHOD FOR ANOMALY DETECTION IN TRAFFIC VIDEOS |
2592 | MCT-NET: A LIGHTWEIGHT MULTISCALE CONVOLUTIONAL TRANSFORMER NETWORK FOR POLYP SEGMENTATION |
1204 | MDBFUSION: A VISIBLE AND INFRARED IMAGE FUSION FRAMEWORK CAPABLE FOR MOTION DEBLURRING |
2187 | MEDeA: Multi-view Efficient Depth Adjustment |
2165 | MEDICAL KNOWLEDGE-GUIDED SEMI-SUPERVISED BI-VENTRICULAR SEGMENTATION |
1427 | MEMSVD: LONG-RANGE TEMPORAL STRUCTURE CAPTURING USING INCREMENTAL SVD |
1137 | META-DM: APPLICATIONS OF DIFFUSION MODELS ON FEW-SHOT LEARNING |
1941 | METAHEURISTIC CAMERA CALIBRATION FOR OPTICAL TOMOGRAPHIC IMAGING IN INDUSTRIAL ENVIRONMENTS |
1135 | MFLFC: MULTI-FRAME FUSION BASED LOW-RESOLUTION FEATURE COMPRESSION FOR OBJECT TRACKING |
1680 | MGRQ: POST-TRAINING QUANTIZATION FOR VISION TRANSFORMER WITH MIXED GRANULARITY RECONSTRUCTION |
2715 | MICRO-EXPRESSION RECOGNITION BASED ON 3DCNN COMBINED WITH GRU AND NEW ATTENTION MECHANISM |
2564 | MINIMIZATION OF SUBMESH BOUNDARY ERRORS IN DYNAMIC MESH CODING |
2059 | MIX-DOMAIN CONTRASTIVE LEARNING FOR UNPAIRED H&E-TO-IHC STAIN TRANSLATION |
2551 | MMAQ: A Multi-modal Self-supervised Approach For Estimating Air Quality From Remote Sensing Data |
2296 | MODIPHY: MULTIMODAL OBSCURED DETECTION FOR IOT USING PHANTOM CONVOLUTION-ENABLED FASTER YOLO |
1865 | MOTION-ADAPTIVE INFERENCE FOR FLEXIBLE LEARNED B-FRAME COMPRESSION |
1538 | Motion-Lie Transformer: Geometric Attention for 3D Human Pose Motion Prediction |
1636 | MSD-CRFS: MULTI-SCALE DUAL AGGREGATION CONDITIONAL RANDOM FIELDS FOR MONOCULAR DEPTH ESTIMATION |
1335 | MSGAT: MULTI-STAGE GRAPH ATTENTION NETWORK FOR HUMAN MOTION PREDICTION |
2708 | MSSPG-AL: FEW-SHOT HYPERSPECTRAL IMAGE CLASSIFICATION WITH ACTIVE LEARNING UPDATED MULTI-SCALE SUPERPIXEL GRAPH FUSION |
2450 | MTA-PS: TOWARDS PRACTICAL PERSON SEARCH IN VIDEOS |
2722 | MULTI-ATTRIBUTE VISION TRANSFORMERS ARE EFFICIENT AND ROBUST LEARNERS |
1964 | MULTICLASSIFICATION OF VOCAL FOLDS DISORDERS FROM VIDEOS BY SPATIO-TEMPORAL DEEP FEATURES |
1573 | MULTI-MODAL MEDICAL IMAGE FUSION FOR NON-SMALL CELL LUNG CANCER CLASSIFICATION |
1915 | MULTIMODAL TRANSFORMER USING CROSS-CHANNEL ATTENTION FOR OBJECT DETECTION IN REMOTE SENSING IMAGES |
1809 | MULTIMODAL-ENHANCED OBJECTNESS LEARNER FOR CORNER CASE DETECTION IN AUTONOMOUS DRIVING |
1744 | Multi-path Interference Mitigation for Indirect Time-of-Flight Camera by the Distortion of Coding Curve |
1781 | MULTI-REFERENCE FLOW-GUIDED CROSS-DOMAIN RECONSTRUCTION FOR GENERAL OBJECT 6D POSE ESTIMATION |
1690 | MULTI-TASK AFFINITY PROPAGATION BASED NATURAL IMAGE MATTING |
2750 | MULTI-VIEW MULTI-FOCUS IMAGE FUSION: A NOVEL BENCHMARK DATASET AND METHOD |
2399 | MULTI-VIEW NETWORK FOR COLORECTAL POLYPS DETECTION IN CT COLONOGRAPHY |
1796 | MVAFormer: RGB-Based Multi-View Spatio-Temporal Action Recognition with Transformer |
1729 | MVCrackViT: Robust Multi-View Crack Detection for Point Cloud Segmentation using View Attention |
2587 | MWIRSTD: A MWIR SMALL TARGET DETECTION DATASET |
1468 | NAVIGATING LIMITATIONS WITH PRECISION: A FINE-GRAINED ENSEMBLE APPROACH TO WRIST PATHOLOGY RECOGNITION ON A LIMITED X-RAY DATASET |
1529 | NEURAL MESH FUSION: UNSUPERVISED 3D PLANAR SURFACE UNDERSTANDING |
1855 | NEURAL RADIANCE FIELD-ASSISTED STATIC-SCENE VIDEO CODING |
2254 | NN-BASED IN-LOOP FILTERING WITH INPUTS TRANSFORMED |
2410 | Non-Separable Wavelet Transform using Learnable Convolutional Lifting Steps |
1314 | NORM-INTEGRATED SOFTMAX LOSS FOR DEEP FACE RECOGNITION |
2224 | NOVEL META ATTENTION GUIDED FRAMEWORK FOR BREAST ABNORMALITY CLASSIFICATION WITH COMBINATION OF FSL AND DA |
2463 | NYCTALE: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction |
2115 | OBJECT DETECTION FRAMEWORK USING MULTIPLE TONE MAPPINGS ON HIGH-DYNAMIC-RANGE IMAGES |
2011 | OBJECT-AWARE ADAPTIVE IMAGE RETARGETING VIA IMPORTANCE MAP FUSION |
2324 | ODVista: An Omnidirectional Video Dataset for Super-Resolution and Quality Enhancement Tasks |
2575 | OMRA: Online Motion Resolution Adaptation to Remedy Domain Shift in Learned Hierarchical B-frame Coding |
1497 | ON ANNOTATION-FREE OPTIMIZATION OF VIDEO CODING FOR MACHINES |
1945 | ON EFFICIENT NEURAL NETWORK ARCHITECTURES FOR IMAGE COMPRESSION |
2341 | ON THE CLOUD DETECTION FROM BACKSCATTERED IMAGES GENERATED FROM A LIDAR-BASED CEILOMETER: CURRENT STATE AND OPPORTUNITIES |
2080 | ON THE DETECTION OF IMAGES GENERATED FROM TEXT |
1459 | On the Exploitation of DCT-Traces in the Generative-AI Domain |
2184 | ONE-HOT LOGISTIC REGRESSION FOR RADIOMICS-BASED CLASSIFICATION |
2185 | ONE-SHOT MULTI-RATE PRUNING OF GRAPH CONVOLUTIONAL NETWORKS FOR SKELETON-BASED RECOGNITION |
1486 | ONLINE ANCHOR-BASED TRAINING FOR IMAGE CLASSIFICATION TASKS |
1402 | OPEN WORLD OBJECT DETECTION VIA COOPERATIVE FOUNDATION MODELS FOR DRIVING SCENES |
2141 | OpenAnimalTracks: A Dataset for Animal Track Recognition |
1393 | Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model |
2267 | OPTIMIZED DECOUPLED STRUCTURE WITH NON-LOCAL ATTENTION FOR DEEP IMAGE COMPRESSION |
2653 | OPTIMIZING LEARNED IMAGE COMPRESSION ON SCALAR AND ENTROPY-CONSTRAINT QUANTIZATION |
2802 | PAON: A NEW NEURON MODEL USING PADE APPROXIMANTS |
1348 | Parallel Task-Prompts ICM: A Versatile Feature Codec for Machine Vision |
1897 | PARTIAL INTER-FRAME CODING FOR DYNAMIC MESHES |
1540 | PCA-UNet for Object Segmentation |
1912 | Perceptual Learned Image Compression via End-To-End JND-Based Optimization |
1285 | PersonaTalk: Preserving Personalized Dynamic Speech Style In Talking Face Generation |
1974 | PHYSIOLOGICAL MODELING WITH MULTISPECTRAL IMAGING FOR HEART RATE ESTIMATION |
1105 | PICTURE PARTITIONING DESIGN OF NEURAL NETWORK-BASED INTRA CODING FOR VIDEO CODING FOR MACHINES |
2163 | Pilot-Free Semantic Communication over Multi-User MIMO Fading Channels |
1772 | PIXEL-WISE COLOR CONSTANCY VIA SMOOTHNESS TECHNIQUES IN MULTI-ILLUMINANT SCENES |
2013 | POINT CLOUD GEOMETRY SCALABLE CODING WITH A QUALITY-CONDITIONED LATENTS PROBABILITY ESTIMATOR |
1326 | POSE-INVARIANT LEARNING FOR EFFICIENT PERSON IDENTIFICATION FROM HYPERSPECTRAL HAND IMAGES |
1038 | POWER-LLAVA: LARGE LANGUAGE AND VISION ASSISTANT FOR POWER TRANSMISSION LINE INSPECTION |
2549 | PRIORFORMER: A UGC-VQA METHOD WITH CONTENT AND DISTORTION PRIORS |
1858 | Privacy-Preserving Visual Cues Communication for Hearing-Impaired People Using Deep Learning |
1325 | Progressive Learning with Visual Prompt Tuning for Variable-Rate Image Compression |
1856 | PROJECT, SKATE, AND REFRESH: IMPROVED SCHRODINGER BRIDGE SAMPLER FOR IMAGE RESTORATION |
1066 | Prompt Performance Prediction for Image Generation |
1987 | Prune Channel and Distill: Discriminative Knowledge Distillation for Semantic Segmentation |
2005 | PUAD: FRUSTRATINGLY SIMPLE METHOD FOR ROBUST ANOMALY DETECTION |
1808 | PVDN-Urban - A Dataset for Provident Vehicle Detection at Night in Urban Scenarios |
2790 | PWISeg: Weakly-supervised Surgical Instrument Instance Segmentation |
1127 | PYRAMID CODER: HIERARCHICAL CODE GENERATOR FOR COMPOSITIONAL VISUAL QUESTION ANSWERING |
2375 | Quadruple-Consistency Vision Transformer for Medical Image Segmentation with Limited Number of Sparse Annotations |
1614 | QUALITY OF EXPERIENCE OF VIEWPORT ADAPTIVE OMNIDIRECTIONAL VIDEO STREAMING |
1463 | QUANTIZATION AFTER INTER PREDICTION IN DISPLACEMENT CODING OF DYNAMIC MESHES |
1667 | RAFMNET: REINFORCED ATTENTION FUSION AND MULTISCALE NETWORK FOR NOISY INFRARED AND VISIBLE IMAGE FUSION |
2008 | RAGE for the Machine: Image Compression with Low-Cost Random Access for Embedded Applications |
1829 | RATE-COMPLEXITY OPTIMIZATION IN LOSSLESS NEURAL-BASED IMAGE COMPRESSION |
2088 | Rate-Quality or Energy-Quality Pareto Fronts for Adaptive Video Streaming? |
1263 | RDSSD: 3D Single Stage Object Detector for Roadside LiDAR Sensors |
2521 | Reading Is Believing: Revisiting Language Bottleneck Models for Image Classification |
2507 | REAL-TIME AND RESOURCE-EFFICIENT MULTI-SCALE ADAPTIVE ROBOTICS VISION FOR UNDERWATER OBJECT DETECTION AND DOMAIN GENERALIZATION |
1955 | Real-time Monocular Depth Estimation on Embedded Systems |
1574 | REAL-TIME SEMANTIC VIDEO COMMUNICATION OF GENERAL SCENES |
1973 | REAL-TIME VIDEO PREDICTION WITH FAST VIDEO INTERPOLATION MODEL AND PREDICTION TRAINING |
2489 | REAL-WORLD ATMOSPHERIC TURBULENCE CORRECTION VIA DOMAIN ADAPTATION |
2140 | RECONSTRUCT DYNAMIC SCENE FOR SPIKE CAMERA BASED ON 3D SPACE TIME SIMILARITY |
2064 | Recurrent 3-D Multi-level Visual Transformer for Joint Classification of Heterogeneous 2-D and 3-D Radiographic Data |
2693 | REDEFINING CYSTOSCOPY WITH AI: BLADDER CANCER DIAGNOSIS USING AN EFFICIENT HYBRID CNN-TRANSFORMER MODEL |
2207 | REDEFINING VISUAL QUALITY: THE IMPACT OF LOSS FUNCTIONS ON INR-BASED IMAGE COMPRESSION |
2343 | Reducing motion artifacts in brain MRI using vision transformers and self-supervised learning |
2624 | REFERRING IMAGE SEGMENTATION WITH TWO-STAGE MULTI-MODAL INTERACTION |
2048 | Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification |
1241 | REINFORCEMENT LEARNING-BASED SECURE VIDEO TRANSMISSION FOR IOV SYSTEMS |
2699 | REINFORCING PRE-TRAINED MODELS USING COUNTERFACTUAL IMAGES |
1267 | REMOTE SENSING IMAGE UNEVEN HAZE REMOVAL BASED ON HAZE DENSITY ESTIMATION AND SALIENCY-DRIVEN DUAL CHANNEL FUSION |
1659 | REMOVING REFLECTIVE FLARE IN REAL-WORLD CONDITIONS |
2240 | ReSet: A Residual Set-Transformer approach to tackle the ugly-duckling sign in melanoma detection |
1948 | RESNERF-PCAC: SUPER RESOLVING RESIDUAL LEARNING NERF FOR HIGH EFFICIENCY POINT CLOUD ATTRIBUTES CODING |
1711 | RES-NERV: RESIDUAL BLOCKS FOR A PRACTICAL IMPLICIT NEURAL VIDEO DECODER |
2107 | RESSCAL3D++: JOINT ACQUISITION AND SEMANTIC SEGMENTATION OF 3D POINT CLOUDS |
2251 | Rethinking Domain Adaptation and Generalization in the Era of CLIP |
2400 | RETHINKING TEMPORAL SELF-SIMILARITY FOR REPETITIVE ACTION COUNTING |
2143 | RFG-HDR: REPRESENTATIVE FEATURE-GUIDED TRANSFORMER FOR MULTI-EXPOSURE HIGH DYNAMIC RANGE IMAGING |
1100 | RFNET: REFINED FUSION THREE-BRANCH RGB-D SALIENT OBJECT DETECTION NETWORK |
1793 | ROBUST 3D SEMANTIC SEGMENTATION WITH INCOMPLETE POINT CLOUDS BASED ON SEQUENTIAL FRAME SAMPLING |
1665 | ROBUST REPRESENTATION LEARNING WITH SELF-DISTILLATION FOR DOMAIN GENERALIZATION |
2190 | ROBUST SKIN COLOR DRIVEN PRIVACY-PRESERVING FACE RECOGNITION VIA FUNCTION SECRET SHARING |
2269 | Robustness of tensor decomposition-based neural network compression |
2173 | ROI-DVC: A REGION-OF-INTEREST BASED DEEP VIDEO CODING FRAMEWORK |
1037 | ROTATED R-CNN: A TWO-STAGE OBJECT DETECTION METHOD ADAPTED TO ORIENTED BOUNDING BOXES |
1053 | RSUD20K: A Dataset for Road Scene Understanding In Autonomous Driving |
1303 | S³GCN: SPORT SCORING SIAMESE GRAPH CONVOLUTION NETWORK |
1737 | Saliency as a Schedule: Intuitive Image Attribution |
1520 | SALIENCY-AWARE END-TO-END LEARNED VARIABLE-BITRATE 360-DEGREE IMAGE COMPRESSION |
1654 | SALIENT GUIDED TEXT DETECTION IN E-COMMERCE IMAGES |
1956 | Sample Domain Prediction and Transform Skip for Region Adaptive Hierarchical Transform in Geometric Point Cloud Compression |
1347 | SANeRV: Scene-Adaptive Neural Representation for Videos |
1391 | SCALABLE HYPERSPHERE EMBEDDING FOR SEMANTIC METRIC LEARNING |
1740 | SCENE GENERALIZED MULTI-VIEW PEDESTRIAN DETECTION WITH ROTATION-BASED AUGMENTATION AND REGULARIZATION |
1849 | SCENE TEXT RECOGNITION USING PROGRESSIVE RECTIFICATION NETWORK AND SPELLING ERROR CORRECTION LANGUAGE MODEL |
2050 | SE3D: A FRAMEWORK FOR SALIENCY METHOD EVALUATION IN 3D IMAGING |
1676 | SEGGUARD: DEFENDING SCENE SEGMENTATION AGAINST ADVERSARIAL PATCH ATTACK |
2354 | Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation |
1221 | SEGMENTATION OF HARD EXUDATES AND HEMORRHAGES FROM DIABETIC RETINOPATHY IMAGES USING RESIDUAL U-NET WITH SQUEEZE AND EXCITE BLOCKS |
2376 | SELF-SUPERVISED ANOMALY DETECTION AND A NEW BENCHMARK FOR X-RAY CARGO IMAGES |
1906 | SELF-SUPERVISED MULTI-VIEW STEREO WITH ADAPTIVE DEPTH PRIORS |
2316 | SEMANTIC ENHANCED FEW-SHOT OBJECT DETECTION |
2635 | SEMANTIC-ENHANCED POINT-BOX JOINT PROMPTING FOR VIDEO OBJECT SEGMENTATION |
2036 | Semantic-Region Specific Lookup Tables for Image Enhancement via Unpaired Learning |
1365 | SEMI-SUPERVISED 3D OBJECT DETECTION WITH CHANNEL AUGMENTATION USING TRANSFORMATION EQUIVARIANCE |
2352 | SEMI-SUPERVISED ACTION RECOGNITION FROM NEWBORN RESUSCITATION VIDEOS |
1202 | SEMI-SUPERVISED GRAPHICAL DEEP DICTIONARY LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION FROM LIMITED SAMPLES |
2057 | SET-NAS: Sample-Efficient Training for Neural Architecture Search with Strong Predictor and Stratified Sampling |
1837 | SFD: SIMILAR FRAME DATASET FOR CONTENT-BASED VIDEO RETRIEVAL |
2170 | SFNET - A SPATIAL-FREQUENCY DOMAIN NEURAL NETWORK FOR IMAGE LENS FLARE REMOVAL |
1820 | SG-JND: SEMANTIC-GUIDED JUST NOTICEABLE DISTORTION PREDICTOR FOR IMAGE COMPRESSION |
2516 | Shadow-Aware Makeup Transfer with Lighting Adaptation |
2651 | SIMILARITY-WEIGHTED IOU (SIOU): A COMPREHENSIVE METRIC FOR EVALUATING MODEL PERFORMANCE THROUGH SIMILARITY-WEIGHTED CLASS OVERLAPS |
1726 | SIMPLE IMAGE SIGNAL PROCESSING USING GLOBAL CONTEXT GUIDANCE |
1183 | SimSAM: Simple Siamese Representation-Based Semantic Affinity Matrix for unsupervised image segmentation |
1164 | SINGLE-PANORAMA CLASSIFICATION OF 3D OBJECTS USING HORIZONTALLY STACKED DILATED CONVOLUTIONS |
2276 | Sino-CT-Fusion-Net: A Lightweight Deep Learning Framework for Detection and Classification of Intracranial Hemorrhages |
1707 | SKETCH2MANGA: SHADED MANGA SCREENING FROM SKETCH WITH DIFFUSION MODELS |
1032 | SLNL: Soft Label Regularization for Semi-Supervised Facial Expression Recognition with Negative Label Learning |
2732 | Smo-CLIP: Enhancing Anomalous Smoke Density Assessment using A Hybrid LLM-VLM Approach |
1440 | SN-NET: SEMISMOOTH NEWTON DRIVEN LIGHTWEIGHT NETWORK FOR REAL-WORLD IMAGE DENOISING |
2342 | SODA: A DATASET FOR SMALL OBJECT DETECTION IN UAV CAPTURED IMAGERY |
2283 | SOME CAN BE BETTER THAN ALL: MULTIMODAL STAR TRANSFORMER FOR VISUAL DIALOG |
2290 | SOURCE-FREE CONTINUAL ADAPTIVE LEARNING WITH LIMITED LABELS ON EVOLVING DATA DRIFTS |
1800 | SOVASEG-NET: SCALE INVARIANT OVARIAN TUMORS SEGMENTATION FROM ULTRASOUND IMAGES |
1072 | SPARSE TRANSFORMER REFINEMENT SIMILARITY MAP FOR AERIAL TRACKING |
2091 | SPATIAL PLAID ATTENTION DECODER FOR SEMANTIC SEGMENTATION |
1876 | SPATIAL-CHANNEL COLLABORATED ATTENTION FOR CROSS-SCALE CROWD COUNTING |
2504 | SPATIALITY-AWARE PROMPT TUNING FOR FEW-SHOT SMALL OBJECT DETECTION |
2617 | SPATIO-TEMPORAL ADAPTATION WITH DILATED NEIGHBOURHOOD ATTENTION FOR ACCIDENT ANTICIPATION |
2792 | SS-CXR: SELF-SUPERVISED PRETRAINING USING CHEST X-RAYS TOWARDS A DOMAIN SPECIFIC FOUNDATION MODEL |
1141 | Standard compliant video coding using low complexity, switchable neural wrappers |
1092 | START-TV: A CLOSED-FORM INITIALIZATION FOR TOTAL VARIATION MODELS |
2249 | STATISTICS-AWARE AUDIO-VISUAL DEEPFAKE DETECTOR |
1253 | STAY FOCUS ON OBJECT: CROSS-DOMAIN DETECTION USING DOMAIN-INVARIANT OBJECT REPRESENTATION |
2830 | STEGANALYSIS OF AI MODELS LSB ATTACKS |
2714 | Streaming Neural Images |
2085 | STREAMLINED HYBRID ANNOTATION FRAMEWORK USING SCALABLE CODESTREAM FOR BANDWIDTH-RESTRICTED UAV OBJECT DETECTION |
2607 | STRUCTURED PRUNING AND QUANTIZATION FOR LEARNED IMAGE COMPRESSION |
2055 | SUBBLOCK-BASED COMBINED INTER AND INTRA PREDICTION BEYOND VVC |
2315 | Subgroups for Detection Transformer |
1815 | SUBJECTIVE PORTRAIT REGION CROPPING ON LANDSCAPE VIDEO STUDY |
1080 | SUBJECTIVE QUALITY ASSESSMENT OF THERMAL INFRARED IMAGES |
2694 | SUPER: SELFIE UNDISTORTION AND HEAD POSE EDITING WITH IDENTITY PRESERVATION |
1921 | SUPERPIXEL MIXING: A DATA AUGMENTATION TECHNIQUE FOR ROBUST DEEP VISUAL RECOGNITION MODELS |
2465 | SUPER-RESOLUTION FOR NEAR-EYE LIGHT FIELD DISPLAY IN FOURIER SPACE |
2298 | SURFACE ANOMALY DETECTION WITH ANOMALOUS FEATURE RESTRICTION AND DIFFERENCE-AWARE ENHANCEMENT |
2340 | SYNTHMANTICLIDAR: A SYNTHETIC DATASET FOR SEMANTIC SEGMENTATION ON LIDAR IMAGING |
1687 | TALKING-HEAD VIDEO COMPRESSION WITH MOTION SEMANTIC ENHANCEMENT MODEL |
2252 | TAXES ARE ALL YOU NEED: INTEGRATION OF TAXONOMICAL HIERARCHY RELATIONSHIPS INTO THE CONTRASTIVE LOSS |
2100 | TCA-NET: TRIPLET CONCATENATED-ATTENTIONAL NETWORK FOR MULTIMODAL ENGAGEMENT ESTIMATION |
1334 | TDAD: TRIDENT DISTILLATIONS FOR ANOMALY DETECTION |
1461 | TEMPORAL CLUSTERING AND TEMPORAL REFERENCE BASED SPECULAR DETECTION FOR 1-MS VISUAL FEEDBACK SYSTEM |
2279 | TEMPORAL REGULARIZATION FOR ROBUST MOTION COMPENSATION IN REDUCED DOSE CARDIAC-GATED SPECT IMAGES |
2536 | TEMPORAL SCALABLE CODING FOR DYNAMIC MESHES |
2293 | TEMPORAL TRANSFORMER ENCODER FOR VIDEO CLASS INCREMENTAL LEARNING |
2675 | TEMPORAL-SPATIAL SPDAGG NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION FROM AERIAL PERSPECTIVES |
2816 | The Bjøntegaard Bible: Why Your Way of Comparing Video Codecs May Be Wrong |
2029 | THERMAL VIDEODIFF (TVD): A DIFFUSION ARCHITECTURE FOR THERMAL VIDEO SYNTHESIS |
1437 | THQA: A Perceptual Quality Assessment Database for Talking Heads |
1431 | THROUGH-WALL IMAGING BASED ON WIFI CHANNEL STATE INFORMATION |
2406 | Toward Efficient Deep Blind RAW Image Restoration |
2142 | TOWARD LOW ARTIFACT VIRTUAL TRY-ON VIA PRE-WARPING PARTITIONED CLOTHING ALIGNMENT |
2284 | Towards Better Control of Latent Spaces for Face Editing |
1926 | TOWARDS GENERALIZABLE REFERRING IMAGE SEGMENTATION VIA TARGET PROMPT AND VISUAL COHERENCE |
2826 | Towards Generated Image Provenance Analysis Via Conceptual-Similar-Guided-SLIP Retrieval |
2071 | TOWARDS PRIVACY-ENHANCING PROVENANCE ANNOTATIONS FOR IMAGES |
1639 | TOWARDS ROBUST PERSON RE-IDENTIFICATION VIA EFFICIENT AND GENERALIZED ADVERSARIAL TRAINING |
2263 | TOWARDS ROBUST VISUAL LOCALIZATION USING MULTI-VIEW IMAGES AND HD VECTOR MAP |
1954 | Towards the Detection of AI-Synthesized Human Face Images |
1811 | TOWARDS UNIFYING ANATOMY SEGMENTATION: AUTOMATED GENERATION OF A FULL-BODY CT DATASET |
2840 | Trainable Fractional Fourier Transform |
1248 | Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval |
1937 | TRUSTWORTHY SR: RESOLVING AMBIGUITY IN IMAGE SUPER-RESOLUTION VIA DIFFUSION MODELS AND HUMAN FEEDBACK |
2329 | TSF-Net3D: TSF-Net for 3D Point Cloud Attribute Compression Artifacts Removal |
1495 | TWO HEADS BETTER THAN ONE: DUAL DEGRADATION REPRESENTATION FOR BLIND SUPER-RESOLUTION |
1628 | TWO-LEVEL INTRA PREDICTION USING HIGH-ORDER MACROPIXEL NEIGHBORS FOR PLENOPTIC VIDEO CODING |
1838 | TWO-STAGE TRIPLETNET: LIGHT WEIGHT REMOTE SENSING SCENE CLASSIFICATION |
1311 | U-Convnext Network for Infrared Small Target Detection |
1423 | UIMT: A framework for improving unimodal inference via multimodal training |
2407 | UNCALIBRATED AND UNSUPERVISED PHOTOMETRIC STEREO WITH PIECEWISE REGULARIZER |
2079 | UNCERTAINTY-AWARE AB3DMOT BY VARIATIONAL 3D OBJECT DETECTION |
2318 | Uncovering communities of pipelines in the task-fMRI analytical space |
2496 | Underwater Change Detection using Multiple Sampling-based Probabilistic Learner and Feature Preservance Discriminator |
2628 | UniCrowd Simulator: Visual and Behavioral Fidelity for the Generation of Crowd Datasets |
1257 | Universal Black-box Adversarial Patch Attack with Optimized Genetic Algorithm |
1824 | UNLEASHING FINE-COARSE CURVE PERCEPTION VIA TRUNK-BRANCH PERTURBATION |
2744 | UNLEASHING THE POWER OF GENERALIZED ITERATIVE CLOSEST POINT FOR SWIFT AND EFFECTIVE POINT CLOUD REGISTRATION |
2234 | UNROLLED PROJECTED GRADIENT ALGORITHM FOR STAIN SEPARATION IN DIGITAL HISTOPATHOLOGICAL IMAGES |
1965 | UNSUPERVISED COORDINATE-BASED VIDEO DENOISING |
1169 | UNSUPERVISED DOMAIN ADAPTIVE SEMANTIC SEGMENTATION BASED ON CLIP-GUIDED PROTOTYPICAL CONTRASTIVE LEARNING |
1720 | U-TELL: UNSUPERVISED TASK EXPERT LIFELONG LEARNING |
2015 | UTrCGAN: Uncertainty-Driven Cycle-Consistent Generative Adversarial Network for Low-Light Image Enhancement |
1968 | VAG: VOXEL ATTENUATION GRID FOR SPARSE-VIEW CBCT RECONSTRUCTION |
1562 | VCDSET: A NEW VEHICLE COLLISION DATASET IN ASIA COUNTRIES FOR ANTICIPATING ACCIDENTS |
1050 | VF-NET: ROBUSTNESS VIA UNDERSTANDING DISTORTIONS AND TRANSFORMATIONS |
1889 | Video Class-Incremental Learning with CLIP based Transformer |
1417 | VITO: VISION TRANSFORMER OPTIMIZATION VIA KNOWLEDGE DISTILLATION ON DECODERS |
1462 | VIZECGNET: VISUAL ECG IMAGE NETWORK FOR CARDIOVASCULAR DISEASES CLASSIFICATION WITH MULTI-MODAL TRAINING AND KNOWLEDGE DISTILLATION |
1556 | VR-based generation of photorealistic synthetic data for training hand-object tracking models |
1936 | WAVELET-ENHANCED CNN FOR DEPRESSION CLASSIFICATION BASED ON MRI IMAGES |
2595 | WEATHER-AWARE DRONE-VIEW OBJECT DETECTION VIA ENVIRONMENTAL CONTEXT UNDERSTANDING |
1669 | WHEN SELF-SUPERVISED PRE-TRAINING MEETS SINGLE IMAGE DENOISING |
2093 | WRAPPINGNET: MESH AUTOENCODER VIA DEEP SPHERE DEFORMATION |
1161 | YOLO-FEDER FUSIONNET: A NOVEL DEEP LEARNING ARCHITECTURE FOR DRONE DETECTION |
2133 | YouTube SFV+HDR Quality Dataset |
1988 | ZERO-SHOT COMPOSED IMAGE RETRIEVAL CONSIDERING QUERY-TARGET RELATIONSHIP LEVERAGING MASKED IMAGE-TEXT PAIRS |
2839 | ZJUT-EIFD: A Synchronously Collected External and Internal Fingerprint Database |