List of Accepted Papers

Following is the list of accepted ICASSP 2020 papers, sorted by paper title. You can use the search feature of your web browser to find your paper number. Notifications to all authors have also been sent by email. If you have not received your notification of the results by email, please contact us at icassp2020@cmsworkshops.com.

5666$\BETA$-NMF AND SPARSITY PROMOTING REGULARIZATIONS FOR COMPLEX MIXTURE UNMIXING. APPLICATION TO 2D HSQC NMR.
20351.5GBIT/S 4.9W HYPERSPECTRAL IMAGE ENCODERS ON A LOW-POWER PARALLEL HETEROGENEOUS PROCESSING PLATFORM
47282D-to-2D Mask Estimation for Speech Enhancement based on Fully Convolutional Neural Network
48333-D ACOUSTIC MODELING FOR FAR-FIELD MULTI-CHANNEL SPEECH RECOGNITION
30653D DEFORMATION SIGNATURE FOR DYNAMIC FACE RECOGNITION
57403D Unknown View Tomography via Rotation Invariants
6108A BEAMFORMING ALGORITHM BASED ON MAXIMUM LIKELIHOOD OF A COMPLEX GAUSSIAN DISTRIBUTION WITH TIME-VARYING VARIANCES FOR ROBUST SPEECH RECOGNITION
3486A BIDIRECTIONAL CONTEXT PROPAGATION NETWORK FOR URINE SEDIMENT PARTICLE DETECTION IN MICROSCOPIC IMAGES
6000A Bi-model Approach for Handling Unknown Slot Values in Dialogue State Tracking
1471A BIN ENCODING TRAINING OF A SPIKING NEURAL NETWORK BASED VOICE ACTIVITY DETECTION
3859A COMPARATIVE STUDY OF ESTIMATING ARTICULATORY MOVEMENTS FROM PHONEME SEQUENCES AND ACOUSTIC FEATURES
2890A COMPARATIVE STUDY OF WESTERN AND CHINESE CLASSICAL MUSIC BASED ON SOUNDSCAPE MODELS
4022A COMPARISON OF POOLING METHODS ON LSTM MODELS FOR RARE ACOUSTIC EVENT CLASSIFICATION
3557A Complexity Efficient DMT-Optimal Tree Pruning Based Sphere Decoding
2310A COMPOSITE DNN ARCHITECTURE FOR SPEECH ENHANCEMENT
6051A COMPREHENSIVE FRAMEWORK FOR 2D-JND EXTENSION TO 360-DEG IMAGES
3595A COMPREHENSIVE STUDY OF RESIDUAL CNNS FOR ACOUSTIC MODELING IN ASR
3252A COMPUTATIONALLY LIGHT ALGORITHM FOR BAYESIAN SPEECH ENHANCEMENT WITH SNR MARGINALIZATION
5175A connected auto-encoders based approach for image separation with side information: with applications to art investigation
1659A CONSTRAINED MAXIMUM LIKELIHOOD ESTIMATOR OF SPEECH AND NOISE SPECTRA WITH APPLICATION TO MULTI-MICROPHONE NOISE REDUCTION
1972A CROSS-TASK TRANSFER LEARNING APPROACH TO ADAPTING DEEP SPEECH ENHANCEMENT MODELS TO UNSEEN BACKGROUND NOISE USING PAIRED SENONE CLASSIFIERS
1929A DATA EFFICIENT END-TO-END SPOKEN LANGUAGE UNDERSTANDING ARCHITECTURE
2706A DATASET FOR MEASURING READING LEVELS IN INDIA AT SCALE
1261A DEEP GRADIENT BOOSTING NETWORK FOR OPTIC DISC AND CUP SEGMENTATION
3971A DEEP LEARNING APPROACH TO OBJECT AFFORDANCE SEGMENTATION
5815A DEEP LEARNING ARCHITECTURE FOR EPILEPTIC SEIZURE CLASSIFICATION BASED ON OBJECT AND ACTION RECOGNITION
4857A DEEP MULTIMODAL APPROACH FOR MAP IMAGE CLASSIFICATION
5952A Deep Neural Network-Driven Feature Learning Method for Polyphonic Acoustic Event Detection from Real-Life Recordings
5238A DENSE U-NET WITH CROSS-LAYER INTERSECTION FOR DETECTION AND LOCALIZATION OF IMAGE FORGERY
2606A DIALOGICAL EMOTION DECODER FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOG
1505A DIFFERENTIAL APPROACH FOR RAIN FIELD TOMOGRAPHIC RECONSTRUCTION USING MICROWAVE SIGNALS FROM LEO SATELLITES
1240A DISCRIMINATIVE CONDITION-AWARE BACKEND FOR SPEAKER VERIFICATION
1880A DSP ACCELERATION FRAMEWORK FOR SOFTWARE-DEFINED RADIOS ON X86_64
4409A DUAL-STAGED CONTEXT AGGREGATION METHOD TOWARDS EFFICIENT END-TO-END SPEECH ENHANCEMENT
2658A DYNAMIC STREAM WEIGHT BACKPROP KALMAN FILTER FOR AUDIOVISUAL SPEAKER TRACKING
5258A FAST AND ACCURATE FREQUENT DIRECTIONS ALGORITHM FOR LOW RANK APPROXIMATION VIA BLOCK KRYLOV ITERATION
1517A FAST AND ACCURATE SUPER-RESOLUTION NETWORK USING PROGRESSIVE RESIDUAL LEARNING
5377A Fast Non-contact Vital Signs Detection Method Based on Regional Hidden Markov Model in a 77GHz LFMCW Radar System
2875A FAST PROXIMAL POINT ALGORITHM FOR GENERALIZED GRAPH LAPLACIAN LEARNING
3751A FAST REDUCED-RANK SOUND ZONE CONTROL ALGORITHM USING THE CONJUGATE GRADIENT METHOD
5125A fast sparse covariance-based fitting method for DOA estimation via non-negative least squares
3710A FIFO BASED ACCELERATOR FOR CONVOLUTIONAL NEURAL NETWORKS
1895A FORWARD-BACKWARD ALGORITHM FOR REWEIGHTED PROCEDURES: APPLICATION TO RADIO-ASTRONOMICAL IMAGING
3072A FRAMEWORK FOR PARAMETERS ESTIMATION OF IMAGE OPERATOR CHAIN
1909A Framework for the Robust Evaluation of Sound Event Detection
2771A FREQUENCY-DOMAIN BSS METHOD BASED ON L1 NORM, UNITARY CONSTRAINT, AND CAYLEY TRANSFORM
2258A GATED HYPERNET DECODER FOR POLAR CODES
2144A General Difficulty Control Algorithm for Proof-of-Work Based Blockchains
3591A GENERAL TEST FOR THE LINEAR STRUCTURE OF COVARIANCE MATRICES OF GAUSSIAN POPULATIONS
4486A GENERALIZATION OF PRINCIPAL COMPONENT ANALYSIS
2919A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION
1164A GEOMETRIC APPROACH FOR UNSUPERVISED SIMILARITY LEARNING
1462A GRAPH NETWORK MODEL FOR DISTRIBUTED LEARNING WITH LIMITED BANDWIDTH LINKS AND PRIVACY CONSTRAINTS
5783A GREEDY SPARSE APPROXIMATION ALGORITHM BASED ON L1-NORM SELECTION RULES
5945A HARDWARE ARCHITECTURE FOR RECONFIGURABLE INTELLIGENT SURFACES WITH MINIMAL ACTIVE ELEMENTS FOR EXPLICIT PARTIAL CHANNEL ESTIMATION
3281A HIERARCHICAL MODEL FOR DIALOG ACT RECOGNITION CONSIDERING ACOUSTIC AND LEXICAL CONTEXT INFORMATION
4930A HIERARCHICAL TRACKER FOR MULTI-DOMAIN DIALOGUE STATE TRACKING
4135A HYBRID APPROACH FOR THERMOGRAPHIC IMAGING WITH DEEP LEARNING
4220A HYBRID MODEL FOR BIPOLAR DISORDER CLASSIFICATION FROM VISUAL INFORMATION
1483A Hybrid Structural Sparse Error Model for Image Deblocking
3282A HYBRID TEXT NORMALIZATION SYSTEM USING MULTI-HEAD SELF-ATTENTION FOR MANDARIN
4331A Large-Scale Deep Architecture for Personalized Grocery Basket Recommendations
4365A LEARNING APPROACH TO COOPERATIVE COMMUNICATION SYSTEM DESIGN
2321A LIGHTWEIGHT MULTI-LABEL SEGMENTATION NETWORK FOR MOBILE IRIS BIOMETRICS
4509A LINEAR TIME PARTITIONING ALGORITHM FOR FREQUENCY WEIGHTED IMPURITY FUNCTIONS
3310A LOW-COMPLEXITY MAP DETECTOR FOR DISTRIBUTED NETWORKS
3877A LOW-DIMENSIONALITY METHOD FOR DATA-DRIVEN GRAPH LEARNING
2245A LOW-LATENCY SUCCESSIVE CANCELLATION HYBRID DECODER FOR CONVOLUTIONAL POLAR CODES
3450A Low-Resolution ADC Proof-of-Concept Development for A Fully-Digital Millimeter-Wave Joint Communication-Radar
2544A MAXIMUM LIKELIHOOD APPROACH TO MULTI-OBJECTIVE LEARNING USING GENERALIZED GAUSSIAN DISTRIBUTIONS FOR DNN-BASED SPEECH ENHANCEMENT
2273A MEMORY AUGMENTED ARCHITECTURE FOR CONTINUOUS SPEAKER IDENTIFICATION IN MEETINGS
5672A Method for Millimeter-Wave Imaging of Concealed Objects via De-Aliasing
3413A MINIMAL PERSONALIZATION OF DYNAMIC BINAURAL SYNTHESIS WITH MIXED STRUCTURAL MODELING AND SCATTERING DELAY NETWORK
4802A MODEL OF DOUBLE DESCENT FOR HIGH-DIMENSIONAL LOGISTIC REGRESSION
5597A MODEL-BASED DEEP NETWORK FOR MRI RECONSTRUCTION USING APPROXIMATE MESSAGE PASSING ALGORITHM
3852A MODEL-FREE APPROACH TO DISTRIBUTED TRANSMIT BEAMFORMING
1861A MOMENT-BASED APPROACH FOR GUARANTEED TENSOR DECOMPOSITION
3731A Monte Carlo Search-based Triplet Sampling Method for Learning Disentangled Representation of Impulsive Noise on Steering Gear
1475A MULTICHANNEL KALMAN-BASED WIENER FILTER APPROACH FOR SPEAKER INTERFERENCE REDUCTION IN MEETINGS
4841A MULTI-DILATION AND MULTI-RESOLUTION FULLY CONVOLUTIONAL NETWORK FOR SINGING MELODY EXTRACTION
3234A MULTI-PHASE GAMMATONE FILTERBANK FOR SPEECH SEPARATION VIA TASNET
5146A MULTI-SCALED RECEPTIVE FIELD LEARNING APPROACH FOR MEDICAL IMAGE SEGMENTATION
2461A MULTITAPER REASSIGNED SPECTROGRAM FOR INCREASED TIME-FREQUENCY LOCALIZATION PRECISION
2323A multi-view approach for Mandarin non-native mispronunciation verification
3533A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL
3010A NEURAL NETWORK BASED ON FIRST PRINCIPLES
1887A NEURAL NETWORK FOR MONAURAL INTRUSIVE SPEECH INTELLIGIBILITY PREDICTION
3671A NEURAL NETWORK-BASED SPIKE SORTING FEATURE MAP THAT RESOLVES SPIKE OVERLAP IN THE FEATURE SPACE
5668A NEW APPLICATION OF ULTRASOUND SIGNAL PROCESSING FOR ARCHAEOLOGICAL CERAMIC CLASSIFICATION
3932A NEW MULTIHYPOTHESIS PREDICTION SCHEME FOR COMPRESSED VIDEO SENSING RECONSTRUCTION
2525A NEW PERSPECTIVE FOR FLEXIBLE FEATURE GATHERING IN SCENE TEXT RECOGNITION VIA CHARACTER ANCHOR POOLING
2499A NEW SAMPLING SCHEME FOR DISTRIBUTED BLIND SPECTRUM SENSING USING ENERGY DETECTORS
4347A NEW VARIATIONAL METHOD FOR DEEP SUPERVISED SEMANTIC IMAGE HASHING
1106A NONINVASIVE METHOD TO DETECT DIABETES MELLITUS AND LUNG CANCER USING THE STACKED SPARSE AUTOENCODER
4896A NOVEL APPROACH FOR INTELLIGIBILITY ASSESSMENT IN DYSARTHRIC SUBJECTS
2293A NOVEL METHOD FOR OBTAINING DIFFUSE FIELD MEASUREMENTS FOR MICROPHONE CALIBRATION
3972A NOVEL MOVING SPARSE ARRAY GEOMETRY WITH INCREASED DEGREES OF FREEDOM
4921A NOVEL PRUNING APPROACH FOR BAGGING ENSEMBLE REGRESSION BASED ON SPARSE REPRESENTATION
4877A NOVEL RANK SELECTION SCHEME IN TENSOR RING DECOMPOSITION BASED ON REINFORCEMENT LEARNING FOR DEEP NEURAL NETWORKS
4791A NOVEL SALIENCY-DRIVEN OIL TANK DETECTION METHOD FOR SYNTHETIC APERTURE RADAR IMAGES
1549A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK FOR 3D FACE RECONSTRUCTION
5274A PARTIAL RELAXATION DOA ESTIMATOR BASED ON ORTHOGONAL MATCHING PURSUIT
5926A Particle Gibbs Sampling Approach to Topology Inference in Gene Regulatory Networks
2349A PENALTY ALTERNATING DIRECTION METHOD OF MULTIPLIERS FOR DECENTRALIZED COMPOSITE OPTIMIZATION
4649A PRACTICAL TWO-STAGE TRAINING STRATEGY FOR MULTI-STREAM END-TO-END SPEECH RECOGNITION
4284A PRIORI ESTIMATES OF THE GENERALIZATION ERROR FOR AUTOENCODERS
4346A PROBABILISTIC SCHEME FOR REPRESENTATION LEARNING WITH RADIAL TRANSFORM IMAGES
3782A PROTOTYPICAL TRIPLET LOSS FOR COVER DETECTION
2338A PROXIMAL DUAL CONSENSUS METHOD FOR LINEARLY COUPLED MULTI-AGENT NON-CONVEX OPTIMIZATION
1097A RANDOM GOSSIP BMUF PROCESS FOR NEURAL LANGUAGE MODELING
1395A real time implementation of a Bayer domain image deblurring core for optical blur compensation
2844A REAL-TIME DEEP NETWORK FOR CROWD COUNTING
3086A RECURRENT VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT
2570A RECURSIVE BAYESIAN SOLUTION FOR THE EXCESS OVER THRESHOLD DISTRIBUTION WITH STOCHASTIC PARAMETERS
5823A RECURSIVE EDGE DETECTOR FOR COLOR FILTER ARRAY IMAGE
6112A REGULARIZATION FRAMEWORK FOR LEARNING OVER MULTITASK GRAPHS
5027A Regularized Attention Mechanism for Graph Attention Networks
4925A RETURN TO DEREVERBERATION IN THE FREQUENCY DOMAIN USING A JOINT LEARNING APPROACH
5460A ROBUST AUDIO-VISUAL SPEECH ENHANCEMENT MODEL
5784A ROBUST SPEAKER CLUSTERING METHOD BASED ON DISCRETE TIED VARIATIONAL AUTOENCODER
1286A SEGMENTATION BASED ROBUST DEEP LEARNING FRAMEWORK FOR MULTIMODAL RETINAL IMAGE REGISTRATION
3235A Self-Attentive Emotion Recognition Network
2874A SEMI-SUPERVISED APPROACH FOR IDENTIFYING ABNORMAL HEART SOUNDS USING VARIATIONAL AUTOENCODER
3160A SEMI-SUPERVISED RANK TRACKING ALGORITHM FOR ON-LINE UNMIXING OF HYPERSPECTRAL IMAGES
3583A SEQUENCE MATCHING NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION
2361A SIAMESE CONTENT-ATTENTIVE GRAPH CONVOLUTIONAL NETWORK FOR PERSONALITY RECOGNITION USING PHYSIOLOGY
2535A SIMPLE AND EFFICIENT ITERATIVE METHOD FOR TOA LOCALIZATION
4527A SIMPLE BUT EFFECTIVE BERT MODEL FOR DIALOG STATE TRACKING ON RESOURCE-LIMITED SYSTEMS
3834A SIMPLE DERIVATION OF AMP AND ITS STATE EVOLUTION VIA FIRST-ORDER CANCELLATION
4136A SINGLE-RF ARCHITECTURE FOR MULTIUSER MASSIVE MIMO VIA REFLECTING SURFACES
3004A Sparse Linear Array Approach in Automotive Radars Using Matrix Completion
3119A STACKED-AUTOENCODER BASED END-TO-END LEARNING FRAMEWORK FOR DECODE-AND-FORWARD RELAY NETWORKS
2024A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency
5020A STUDY OF CHILD SPEECH EXTRACTION USING JOINT SPEECH ENHANCEMENT AND SEPARATION IN REALISTIC CONDITIONS
5512A STUDY OF GENERALIZATION OF STOCHASTIC MIRROR DESCENT ALGORITHMS ON OVERPARAMETERIZED NONLINEAR MODELS
4529A STUDY ON THE TRANSFERABILITY OF ADVERSARIAL ATTACKS IN SOUND EVENT CLASSIFICATION
2539A SWITCHING TRANSMISSION GAME WITH LATENCY AS THE USER'S COMMUNICATION UTILITY
1202A THEORETICAL BASIS FOR PRACTITIONERS HEURISTIC 1/N AND LONG-ONLY QUINTILE PORTFOLIO
3461A TIME-BASED SAMPLING FRAMEWORK FOR FINITE-RATE-OF-INNOVATION SIGNALS
4747A TIME-FREQUENCY NETWORK WITH CHANNEL ATTENTION AND NON-LOCAL MODULES FOR ARTIFICIAL BANDWIDTH EXTENSION
5144A UNIFIED SEQUENCE-TO-SEQUENCE FRONT-END MODEL FOR MANDARIN TEXT-TO-SPEECH SYNTHESIS
2295A VARIATIONAL BAYESIAN APPROACH FOR MULTICHANNEL THROUGH-WALL RADAR IMAGING WITH LOW-RANK AND SPARSE PRIORS
1422A VISUAL-PILOT DEEP FUSION FOR TARGET SPEECH SEPARATION IN MULTI-TALKER NOISY ENVIRONMENT
1655A WHITENESS TEST BASED ON THE SPECTRAL MEASURE OF LARGE NON-HERMITIAN RANDOM MATRICES
2821A WIFI-BASED PASSIVE FALL DETECTION SYSTEM
2244A ZEROTH-ORDER LEARNING ALGORITHM FOR ERGODIC OPTIMIZATION OF WIRELESS SYSTEMS WITH NO MODELS AND NO GRADIENTS
3542ACCELERATING DISTRIBUTED DEEP LEARNING BY ADAPTIVE GRADIENT QUANTIZATION
4244ACCELERATING LINEAR ALGEBRA KERNELS ON A MASSIVELY PARALLEL RECONFIGURABLE ARCHITECTURE.
5267ACCENT ESTIMATION OF JAPANESE WORDS FROM THEIR SURFACES AND ROMANIZATIONS FOR BUILDING LARGE VOCABULARY ACCENT DICTIONARIES
3398Accounting for microprosody in modeling intonation
3815ACCURACY-ROBUSTNESS TRADE-OFF FOR POSITIVELY WEIGHTED NEURAL NETWORKS
4321Accurate 6D Object Pose Estimation by Pose Conditioned Mesh Reconstruction
3718ACCURATE AND SCALABLE VERSION IDENTIFICATION USING MUSICALLY-MOTIVATED EMBEDDINGS
2199ACCURATE LOCALIZATION OF AUV IN MOTION BY EXPLICIT SOLUTION USING TIME DELAYS
4359Accurate Semidefinite Relaxation Method for 3-D Rigid Body Localization Using AOA
4530ACHIEVING FULLY-DIGITAL PERFORMANCE BY HYBRID ANALOG/DIGITAL BEAMFORMING IN WIDE-BAND MASSIVE-MIMO SYSTEMS
5656ACHIEVING THE CAPACITY OF THE DNA STORAGE CHANNEL
5239ACOUSTIC MATCHING BY EMBEDDING IMPULSE RESPONSES
4018Acoustic Model Adaptation for Lecture Transcription and Intelligent Meeting Assistant Systems
5602ACOUSTIC SCENE CLASSIFICATION FOR MISMATCHED RECORDING DEVICES USING HEATED-UP SOFTMAX AND SPECTRUM CORRECTION
4208ACOUSTIC SCENE CLASSIFICATION USING DEEP RESIDUAL NETWORKS WITH LATE FUSION OF SEPARATED HIGH AND LOW FREQUENCY PATHS
1982A-CRNN: A DOMAIN ADAPTATION MODEL FOR SOUND EVENT DETECTION
1792ACTION-MANIPULATION ATTACKS ON STOCHASTIC BANDITS
2131ACTIVE CONTROL OF LINE SPECTRAL NOISE WITH SIMULTANEOUS SECONDARY PATH MODELING WITHOUT AUXILIARY NOISE
3880ACTIVE LEARNING WITH UNSUPERVISED ENSEMBLES OF CLASSIFIERS
2464ACTIVE NOISE CONTROL OVER MULTIPLE REGIONS: PERFORMANCE ANALYSIS
3903ACTIVE SEMI-SUPERVISED LEARNING FOR DIFFUSIONS ON GRAPHS
3129ACU-NET:A 3D ATTENTION CONTEXT U-NET FOR MULTIPLE SCLEROSIS LESION SEGMENTATION
1416ADAPTATION AND LEARNING IN MULTI-TASK DECISION SYSTEMS
4961ADAPTATION OF RNN TRANSDUCER WITH TEXT-TO-SPEECH TECHNOLOGY FOR KEYWORD SPOTTING
1967Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors
4580Adaptive Distributed Stochastic Gradient Descent for Minimizing Delay in the Presence of Stragglers
2854ADAPTIVE ELASTIC LOSS BASED ON PROGRESSIVE INTER-CLASS ASSOCIATION FOR CERVICAL HISTOLOGY IMAGE SEGMENTATION
1108ADAPTIVE FEATURE ENHANCEMENT FOR FASHION LANDMARK DETECTION
1508ADAPTIVE KNOWLEDGE DISTILLATION BASED ON ENTROPY
4751ADAPTIVE MATCHED FILTER USING NON-TARGET FREE TRAINING DATA
2163ADAPTIVE NORMALIZATION FOR FORECASTING LIMIT ORDER BOOK DATA USING CONVOLUTIONAL NEURAL NETWORKS
4304Adaptive prediction of financial time-series for decision-making using a tensorial aggregation approach
4371ADAPTIVE REGION AGGREGATION NETWORK: UNSUPERVISED DOMAIN ADAPTATION WITH ADVERSARIAL TRAINING FOR ECG DELINEATION
1268ADAPTIVE RESOLUTION CHANGE USING UNCODED AREAS AND DICTIONARY LEARNING-BASED SUPER-RESOLUTION IN VERSATILE VIDEO CODING
2478ADAPTIVE SEQUENTIAL INTERPOLATOR USING ACTIVE LEARNING FOR EFFICIENT EMULATION OF COMPLEX SYSTEMS
1947Adaptive Subspace Detectors for Off-Grid Mismatched Targets
5548ADDRESSING ACCENT MISMATCH IN MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION
4416ADDRESSING CHALLENGES IN BUILDING WEB-SCALE CONTENT CLASSIFICATION SYSTEMS
5240ADDRESSING THE CONFOUNDS OF INSTRUMENTATION IN SINGER IDENTIFICATION
1889ADDRESSING THE POLYSEMY PROBLEM IN LANGUAGE MODELING WITH ATTENTIONAL MULTI-SENSE EMBEDDINGS
3477ADI17: A FINE-GRAINED ARABIC DIALECT IDENTIFICATION DATASET
1528ADMM-BASED ONE-BIT QUANTIZED SIGNAL DETECTION FOR MASSIVE MIMO SYSTEMS WITH HARDWARE IMPAIRMENTS
3019ADRN: Attention-based Deep Residual Network for Hyperspectral Image Denoising
3627Adversarial Anomaly Detection for Marked Spatio-Temporal Streaming Data
1686Adversarial Attack inspired by K-Anonymity principles
2876ADVERSARIAL ATTACK ON GMM I-VECTOR BASED SPEAKER VERIFICATION SYSTEMS
5836Adversarial Attacks on Deep Unfolded Networks for Sparse Coding
1946ADVERSARIAL DETECTION OF COUNTERFEITED PRINTABLE GRAPHICAL CODES: TOWARDS ”ADVERSARIAL GAMES” IN PHYSICAL WORLD
5071ADVERSARIAL EXAMPLE DETECTION BY CLASSIFICATION FOR DEEP SPEECH RECOGNITION
2339ADVERSARIAL MIXUP SYNTHESIS TRAINING FOR UNSUPERVISED DOMAIN ADAPTATION
2405ADVERSARIAL MULTI-TASK LEARNING FOR SPEAKER NORMALIZATION IN REPLAY DETECTION
4693Adversarial Networks for Secure Wireless Communications
2505Adversarial Text Image Super-Resolution Using Sinkhorn Distance
2660ADVERSARIAL VIDEO COMPRESSION GUIDED BY SOFT EDGE DETECTION
2055ADVMS: A MULTI-SOURCE MULTI-COST DEFENSE AGAINST ADVERSARIAL ATTACKS
1189Age of Information with Finite Horizon and Partial Updates
4642AGE-BASED SCHEDULING POLICY FOR FEDERATED LEARNING IN MOBILE EDGE NETWORKS
2075AIPNET: GENERATIVE ADVERSARIAL PRE-TRAINING OF ACCENT-INVARIANT NETWORKS FOR END-TO-END SPEECH RECOGNITION
3143AL2: PROGRESSIVE ACTIVATION LOSS FOR LEARNING GENERAL REPRESENTATIONS IN CLASSIFICATION NEURAL NETWORKS
2248Algorithmic exploration of American English dialects
4006ALIGNMENT-LENGTH SYNCHRONOUS DECODING FOR RNN TRANSDUCER
5120ALIGNTTS: EFFICIENT FEED-FORWARD TEXT-TO-SPEECH SYSTEM WITHOUT EXPLICIT ALIGNMENT
5503ALL IN ONE NETWORK FOR DRIVER ATTENTION MONITORING
2476ALL YOU NEED IS A SECOND LOOK: TOWARDS TIGHTER ARBITRARY SHAPE TEXT DETECTION
1565ALLOCATION OF COMPUTING TASKS IN DISTRIBUTED MEC SERVERS CO-POWERED BY RENEWABLE SOURCES AND THE POWER GRID
3073ALTERNATIVE HALF-SAMPLE INTERPOLATION FILTERS FOR VERSATILE VIDEO CODING
6015AN ACOUSTIC MODELLING BASED REMOTE ERROR SENSING APPROACH FOR QUIET ZONE GENERATION IN A NOISY ENVIRONMENT
4423AN ADAPTIVE LINEAR ESTIMATOR BASED APPROACH TO BI-DIRECTIONAL MOTION COMPENSATED PREDICTION
6075AN ADMM-BASED APPROACH TO ROBUST ARRAY PATTERN SYNTHESIS
4574AN ALTERNATIVE SIGNATURE DESIGN USING L1 PRINCIPAL COMPONENTS FOR SPREAD-SPECTRUM STEGANOGRAPHY
3109AN ANALYSIS OF SPEECH ENHANCEMENT AND RECOGNITION LOSSES IN LIMITED RESOURCES MULTI-TALKER SINGLE CHANNEL AUDIO-VISUAL ASR
2018AN ANALYTICAL SOLUTION TO JACOBSEN ESTIMATOR FOR WINDOWED SIGNALS
1603An attention enhanced multi-task model for objective speech assessment in real-world environments
2291An Attention-Based Joint Acoustic and Text On-Device End-to-End Model
1662AN EARLY TERMINATION SCHEME FOR SUCCESSIVE CANCELLATION LIST DECODING OF POLAR CODES
5919AN EASY-IMPLEMENTIVE FRAMEWORK OF FAST SUBSPACE CLUSTERING FOR BIG DATA SETS
6120AN EFFECTIVE STYLE TOKEN WEIGHT CONTROL TECHNIQUE FOR END-TO-END EMOTIONAL SPEECH SYNTHESIS
2638AN EFFICIENT ALTERNATIVE TO NETWORK PRUNING THROUGH ENSEMBLE LEARNING
2358AN EFFICIENT AUGMENTED LAGRANGIAN-BASED METHOD FOR LINEAR EQUALITY-CONSTRAINED LASSO
6122AN EFFICIENT COUPLED DICTIONARY LEARNING METHOD
5399AN EFFICIENT EKF BASED TRAINING ALGORITHM FOR LSTM-BASED ONLINE LEARNING
4910AN EFFICIENT METHODOLOGY TO DE-ANONYMIZE THE 5G-NEW RADIO PHYSICAL DOWNLINK CONTROL CHANNEL
6103An Embedding Cost Learning Framework Using GAN
2692An Empirical Bayes Approach to Partially Labeled and Shuffled Data Sets
5319AN EMPIRICAL STUDY OF CONV-TASNET
4812An Empirical Study of Transformer-based Neural Language Model Adaptation
1078AN EMPIRICAL STUDY ON ACOUSTIC FEEDBACK PATH ACROSS HEARING AID USERS
3978AN ENHANCED DECODING ALGORITHM FOR CODED COMPRESSED SENSING
2715An ensemble Based Approach for Generalized Detection of Spoofing Attacks to Automatic Speaker Recognizers
2512AN IMPROVED DEEP NEURAL NETWORK FOR MODELING SPEAKER CHARACTERISTICS AT DIFFERENT TEMPORAL SCALES
1349AN IMPROVED FRAME-UNIT-SELECTION BASED VOICE CONVERSION SYSTEM WITHOUT PARALLEL TRAINING DATA
4528AN IMPROVED SELECTIVE ACTIVE NOISE CONTROL ALGORITHM BASED ON EMPIRICAL WAVELET TRANSFORM
1048AN IMPROVED SOLUTION TO THE FREQUENCY-INVARIANT BEAMFORMING WITH CONCENTRIC CIRCULAR MICROPHONE ARRAYS
2275AN LSTM BASED ARCHITECTURE TO RELATE SPEECH STIMULUS TO EEG
3991AN LSTM-BASED DYNAMIC CHORD PROGRESSION GENERATION SYSTEM FOR INTERACTIVE MUSIC PERFORMANCE
5031AN ODORANT ENCODING MACHINE FOR SAMPLING, RECONSTRUCTION AND ROBUST REPRESENTATION OF ODORANT IDENTITY
4421AN ONLINE KERNEL SCALAR QUANTIZATION SCHEME FOR SIGNAL CLASSIFICATION
6158An Online Plug-and-Play Algorithm for Regularized Image Reconstruction
2201AN ONLINE SPEAKER-AWARE SPEECH SEPARATION APPROACH BASED ON TIME-DOMAIN REPRESENTATION
3805AN ONTOLOGY-AWARE FRAMEWORK FOR AUDIO EVENT CLASSIFICATION
2618AN OPTIMAL CHANNEL ESTIMATION SCHEME FOR INTELLIGENT REFLECTING SURFACES BASED ON A MINIMUM VARIANCE UNBIASED ESTIMATOR
3768An optimal symmetric threshold strategy for remote estimation over the collision channel
4390AN UNSUPERVISED RETINAL VESSEL EXTRACTION AND SEGMENTATION METHOD BASED ON A TUBE MARKED POINT PROCESS MODEL
5935ANALYSIS OF ACOUSTIC FEATURES FOR SPEECH SOUND BASED CLASSIFICATION OF ASTHMATIC AND HEALTHY SUBJECTS
3865ANALYZING ASR PRETRAINING FOR LOW-RESOURCE SPEECH-TO-TEXT TRANSLATION
5885ANGULAR DISCRIMINATIVE DEEP FEATURE LEARNING FOR FACE VERIFICATION
6086ANISOTROPIC GUIDED FILTERING
5950ANOMALOUS SOUND DETECTION BASED ON INTERPOLATION DEEP NEURAL NETWORK
3774Anomaly Detection for Time Series Using VAE-LSTM Hybrid Model
2185ANOMALY DETECTION IN MIXED TIME-SERIES USING A CONVOLUTIONAL SPARSE REPRESENTATION WITH APPLICATION TO SPACECRAFT HEALTH MONITORING
2781Anomaly Detection With Training Data in Hyperspectral Imagery
3278AnomalyDAE: Dual autoencoder for anomaly detection on attributed networks
1440Anti-jamming Routing for Internet of Satellites: A Reinforcement Learning Approach
3876ANYTIME MINIBATCH WITH DELAYED GRADIENTS: SYSTEM PERFORMANCE AND CONVERGENCE ANALYSIS
2342APB2FACE: AUDIO-GUIDED FACE REENACTMENT WITH AUXILIARY POSE AND BLINK SIGNALS
4266Application Informed Motion Signal processing for finger motion tracking using wearable sensors
2093Approaching Optimal Embedding in Audio Steganography with GAN
3373APPROXIMATE BAYESIAN COMPUTATION WITH THE SLICED-WASSERSTEIN DISTANCE
3308APPROXIMATE INFERENCE BY KULLBACK-LEIBLER TENSOR BELIEF PROPAGATION
6085ARBITRARY LENGTH PERFECT INTEGER SEQUENCES USING ALL-PASS POLYNOMIAL
3312ARNET:ATTENTION-BASED REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION
3116ARRAY-GEOMETRY-AWARE SPATIAL ACTIVE NOISE CONTROL BASED ON DIRECTION-OF-ARRIVAL WEIGHTING
4285ARSM GRADIENT ESTIMATOR FOR SUPERVISED LEARNING TO RANK
5372ARTIFICIAL BANDWIDTH EXTENSION USING CONDITIONAL VARIATIONAL AUTO-ENCODERS AND ADVERSARIAL LEARNING
6013ASR ERROR CORRECTION AND DOMAIN ADAPTATION USING MACHINE TRANSLATION
3400ASR IS ALL YOU NEED: CROSS-MODAL DISTILLATION FOR LIP READING
1829ASSESSING THE SCOPE OF GENERALIZED COUNTERMEASURES FOR ANTI-SPOOFING
1756ASSIMILATION-BASED LEARNING OF CHAOTIC DYNAMICAL SYSTEMS FROM NOISY AND PARTIAL DATA
1998Asymptotic Stochastic Analysis of Partially Relaxed DML
1157ASYMPTOTICALLY OPTIMAL BLIND CALIBRATION OF ACOUSTIC VECTOR SENSOR UNIFORM LINEAR ARRAYS
5589ASYNCHROUNOUS DECENTRALIZED LEARNING OF A NEURAL NETWORK
4754ATOMIC NORM BASED LOCALIZATION OF FAR-FIELD AND NEAR-FIELD SIGNALS WITH GENERALIZED SYMMETRIC ARRAYS
2477ATOMIC NORM DENOISING IN BLIND TWO-DIMENSIONAL SUPER-RESOLUTION
2851ATTENTION DRIVEN FUSION FOR MULTI-MODAL EMOTION RECOGNITION
4360ATTENTION GUIDED REGION DIVISION FOR CROWD COUNTING
1803ATTENTION MECHANISM ENHANCED KERNEL PREDICTION NETWORKS FOR DENOISING OF BURST IMAGES
2357ATTENTIONAL FUSED TEMPORAL TRANSFORMATION NETWORK FOR VIDEO ACTION RECOGNITION
1863ATTENTION-BASED ASR WITH LIGHTWEIGHT AND DYNAMIC CONVOLUTIONS
2474ATTENTION-BASED CURIOSITY-DRIVEN EXPLORATION IN DEEP REINFORCEMENT LEARNING
4296ATTENTION-BASED GATED SCALING ADAPTIVE ACOUSTIC MODEL FOR CTC-BASED SPEECH RECOGNITION
3148Attention-guided Deraining Network via Stage-wise Learning
2632ATTENTION-MASK DENSE MERGER (ATTENDENSE) DEEP HDR FOR GHOST REMOVAL
2288ATTENTIVE CUTMIX: AN ENHANCED DATA AUGMENTATION APPROACH FOR DEEP LEARNING BASED IMAGE CLASSIFICATION
1086Attentive Item2vec: Neural Attentive User Representations
2940ATTENTIVE MODALITY HOPPING MECHANISM FOR SPEECH EMOTION RECOGNITION
1363AUDIO CODEC ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS
1997AUDIO FEATURE EXTRACTION FOR VEHICLE ENGINE NOISE CLASSIFICATION
4879Audio sound determination using feature space attention based convolution recurrent neural network
6118AUDIO SOURCE SEPARATION USING VARIATIONAL AUTOENCODERS AND WEAK CLASS SUPERVISION
5232AUDIO-ASSISTED IMAGE INPAINTING FOR TALKING FACES
4225Audio-attention discriminative language model for ASR rescoring
1569AUDIO-BASED AUTO-TAGGING WITH CONTEXTUAL TAGS FOR MUSIC
3140AUDIO-BASED DETECTION OF EXPLICIT CONTENT IN MUSIC
2647AUDIO-VISUAL CALIBRATION WITH POLYNOMIAL REGRESSION FOR 2-D PROJECTION USING SVD-PHAT
2305AUDIO-VISUAL RECOGNITION OF OVERLAPPED SPEECH FOR THE LRS2 DATASET
1119AUDITORY MODEL BASED SUBSETTING OF HEAD-RELATED TRANSFER FUNCTION DATASETS
3540AUGLABEL: EXPLOITING WORD REPRESENTATIONS TO AUGMENT LABELS FOR FACE ATTRIBUTE CLASSIFICATION
1263Augmentation Data Synthesis via GANs: Boosting Latent Fingerprint Reconstruction
2908Augmented Grad-CAM: heat-maps super resolution through augmentation
4599AUGMENTING MOLECULAR IMAGES WITH VECTOR REPRESENTATIONS AS A FEATURIZATION TECHNIQUE FOR DRUG CLASSIFICATION
2194Auto-FAS: Searching Lightweight Networks for Face Anti-Spoofing
2582AUTOMATIC AND SIMULTANEOUS ADJUSTMENT OF LEARNING RATE AND MOMENTUM FOR STOCHASTIC GRADIENT-BASED OPTIMIZATION METHODS
5909AUTOMATIC CLASSIFICATION OF VOLUMES OF WATER USING SWALLOW OUNDS FROM CERVICAL AUSCULTATION
3511AUTOMATIC DATA AUGMENTATION VIA DEEP REINFORCEMENT LEARNING FOR EFFECTIVE KIDNEY TUMOR SEGMENTATION
3131AUTOMATIC EPILEPTIC SEIZURE ONSET-OFFSET DETECTION BASED ON CNN IN SCALP EEG
4960AUTOMATIC EVENT DETECTION OF REM SLEEP WITHOUT ATONIA FROM POLYSOMNOGRAPHY SIGNALS USING DEEP NEURAL NETWORKS
4663AUTOMATIC FLUENCY EVALUATION OF SPONTANEOUS SPEECH USING DISFLUENCY-BASED FEATURES
5692AUTOMATIC IDENTIFICATION OF SPEAKERS FROM HEAD GESTURES IN A NARRATION
1109Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background music help?
2828AUTOMATIC PREDICTION OF SUICIDAL RISK IN MILITARY COUPLES USING MULTIMODAL INTERACTION CUES FROM COUPLES CONVERSATIONS
4049AUTOMATIC VOCAL TRACT LANDMARK TRACKING IN RTMRI USING FULLY CONVOLUTIONAL NETWORKS AND KALMAN FILTER
4587AUTOMOTIVE COLLISION RISK ESTIMATION UNDER COOPERATIVE SENSING
5354AUTOMOTIVE RADAR SIGNAL INTERFERENCE MITIGATION USING RNN WITH SELF ATTENTION
3924AUTOREGRESSIVE PARAMETER ESTIMATION WITH DNN-BASED PRE-PROCESSING
3753AUXILIARY CAPSULES FOR NATURAL LANGUAGE UNDERSTANDING
3041AV(SE)²: AUDIO-VISUAL SQUEEZE-EXCITE SPEECH ENHANCEMENT
2059AVA Active Speaker: An Audio-Visual Dataset for Active Speaker Detection
5069BACK-AND-FORTH PREDICTION FOR DEEP TENSOR COMPRESSION
1581Back-to-Back Butterfly Network, an Adaptive Permutation Network for New Communication Standards
5256BALANCED BINARY NEURAL NETWORKS WITH GATED RESIDUAL
2586BALANCING RATES AND VARIANCE VIA ADAPTIVE BATCH-SIZES IN FIRST-ORDER STOCHASTIC OPTIMIZATION
2486BANDIT SAMPLING FOR FASTER ACTIVITY AND DATA DETECTION IN MASSIVE RANDOM ACCESS
2125Bandwidth extension of musical audio signals with no side information using dilated convolutional neural networks
5699BANGLA VOICE COMMAND RECOGNITION IN END-TO-END SYSTEM USING TOPIC MODELING BASED CONTEXTUAL RESCORING
2554BATMAN: BAYESIAN TARGET MODELLING FOR ACTIVE INFERENCE
2053BAYESIAN ESTIMATION OF PLDA WITH NOISY TRAINING LABELS, WITH APPLICATIONS TO SPEAKER VERIFICATION
3103BAYESIAN MULTIPLE CHANGE-POINT DETECTION WITH LIMITED COMMUNICATION
2805BBAND INDEX: A NO-REFERENCE BANDING ARTIFACT PREDICTOR
4008BBA-NET: A bi-branch attention network for crowd counting
5502BEAM ELIMINATION BASED ON SEQUENTIALLY ESTIMATED A POSTERIORI PROBABILITIES OF WINNING
2324BEAMFORMED FEATURE FOR LEARNING-BASED DUAL-CHANNEL SPEECH SEPARATION
4743BEAMFORMING DESIGN FOR HIGH-RESOLUTION LOW-INTENSITY FOCUSED ULTRASOUND NEUROMODULATION
3827BEAMFORMING IN INTELLIGENT ENVIRONMENTS BASED ON ULTRA-MASSIVE MIMO PLATFORMS IN MILLIMETER WAVE AND TERAHERTZ BANDS
2363BEAM-TASNET: TIME-DOMAIN AUDIO SEPARATION NETWORK MEETS FREQUENCY-DOMAIN BEAMFORMER
5770BERT IS NOT ALL YOU NEED FOR COMMONSENSE INFERENCE
3825BETTER SAFE THAN SORRY: RISK-AWARE NONLINEAR BAYESIAN ESTIMATION
1481BEYOND THE DCASE 2017 CHALLENGE ON RARE SOUND EVENT DETECTION: A PROPOSAL FOR A MORE REALISTIC TRAINING AND TEST FRAMEWORK
1154BILATERAL RECURRENT NETWORK FOR SINGLE IMAGE DERAINING
6146BILEVEL OPTIMIZATION USING STATIONARY POINT OF LOWER-LEVEL OBJECTIVE FUNCTION
1415BINARY PROBABILITY MODEL FOR LEARNING BASED IMAGE COMPRESSION
5315BINAURAL AUDIO SOURCE REMIXING WITH MICROPHONE ARRAY LISTENING DEVICES
4569BIO-MIMETIC ATTENTIONAL FEEDBACK IN MUSIC SOURCE SEPARATION
5791BIPARTITE BELIEF PROPAGATION POLAR DECODING WITH BIT FLIPPING
4442BIT ALLOCATION FOR MULTI-TASK COLLABORATIVE INTELLIGENCE
5390BLASTER: An Off-Grid Method for Blind and Regularized Acoustic Echoes Retrieval
5800BLIND ADAPTIVE EQUALIZATION USING BIAS-COMPENSATED RLS METHOD
3885BLIND BOUNDED SOURCE SEPARATION USING NEURAL NETWORKS WITH LOCAL LEARNING RULES
6125Blind Constant Modulus Multiuser Detection via Low-Rank Approximation
6069BLIND DETERMINATION OF THE NUMBER OF SOURCES USING DISTANCE CORRELATION
1704Blind Hyperspectral Unmixing using Dual Branch Deep Autoencoder with Orthogonal Sparse Prior
3757BLIND INFERENCE OF CENTRALITY RANKINGS FROM GRAPH SIGNALS
4402BLIND MULTI-SPECTRAL IMAGE PAN-SHARPENING
3096BLIND QUALITY ASSESSMENT OF CAMERA IMAGES BASED ON STRUCTURE, TEXTURE AND COLOR INFORMATION
3388BLIND SOURCE SEPARATION OF GRAPH SIGNALS
1366BLOOD PRESSURE ESTIMATION FROM PPG SIGNALS USING CONVOLUTIONAL NEURAL NETWORKS AND SIAMESE NETWORK
3349Body movement generation for expressive violin performance applying neural networks
1836BOFFIN TTS: FEW-SHOT SPEAKER ADAPTATION BY BAYESIAN OPTIMIZATION
5840BOOSTED LOCALITY SENSITIVE HASHING: DISCRIMINATIVE BINARY CODES FOR SOURCE SEPARATION
1944BP-VB-EP BASED STATIC AND DYNAMIC SPARSE BAYESIAN LEARNING WITH KRONECKER STRUCTURED DICTIONARIES
4045BREATHING AND SPEECH PLANNING IN SPONTANEOUS SPEECH SYNTHESIS
1131Bridging Mixture Density Networks with Meta-learning for Automatic Speaker Identification
2812BRINGING IN THE OUTLIERS: A SPARSE SUBSPACE CLUSTERING APPROACH TO LEARN A DICTIONARY OF MOUSE ULTRASONIC VOCALIZATIONS
3893BUILDING FIRMLY NONEXPANSIVE CONVOLUTIONAL NEURAL NETWORKS
2731BUT System for the Second DIHARD Speech Diarization Challenge
4729Byzantine-Robust Decentralized Stochastic Optimization
1270C3DVQA: FULL-REFERENCE VIDEO QUALITY ASSESSMENT WITH 3D CONVOLUTIONAL NEURAL NETWORK
1494CAD-AEC: CONTEXT-AWARE DEEP ACOUSTIC ECHO CANCELLATION
1358CAMERA CONFIGURATION DESIGN IN COOPERATIVE ACTIVE VISUAL 3D RECONSTRUCTION: A STATISTICAL APPROACH
2597CAN EVERY ANALOG SYSTEM BE SIMULATED ON A DIGITAL COMPUTER?
4562Capacity of the Erasure Shuffling Channel
3011CARTOON-TEXTURE DECOMPOSITION-BASED VARIATIONAL PANSHARPENING
1337CELL-PHONE CLASSIFICATION: A CONVOLUTIONAL NEURAL NETWORK APPROACH EXPLOITING ELECTROMAGNETIC EMANATIONS
5930CGCNN: COMPLEX GABOR CONVOLUTIONAL NEURAL NETWORK ON RAW SPEECH
5556CHALLENGES AND PERSPECTIVES IN NEUROMORPHIC-BASED VISUAL IOT SYSTEMS AND NETWORKS
2038CHANNEL ADVERSARIAL TRAINING FOR SPEAKER VERIFICATION AND DIARIZATION
4456CHANNEL ATTENTION BASED GENERATIVE NETWORK FOR ROBUST VISUAL TRACKING
4383Channel Charting: An Euclidean Distance Matrix Completion Perspective
1138CHANNEL COVARIANCE ESTIMATION IN MULTIUSER MASSIVE MIMO SYSTEMS WITH AN APPROACH BASED ON INFINITE DIMENSIONAL HILBERT SPACES
5711CHANNEL INVARIANT SPEAKER EMBEDDING LEARNING WITH JOINT MULTI-TASK AND ADVERSARIAL TRAINING
5424CHANNEL SELECTION OVER RIEMANNIAN MANIFOLD WITH NON-STATIONARITY CONSIDERATION FOR BRAIN-COMPUTER INTERFACE APPLICATIONS
2294CHANNEL-ATTENTION DENSE U-NET FOR MULTICHANNEL SPEECH ENHANCEMENT
5307CHARACTERIZATION OF A SNAPSHOT FOURIER TRANSFORM IMAGINGSPECTROMETER BASED ON AN ARRAY OF FABRY-PEROT INTERFEROMETERS
2738Characterizing Adversarial Speech Examples Using Self-Attention U-Net Enhancement
1760Character-Level Lexical Emotion Recognition
4575CHIRPING UP THE RIGHT TREE: INCORPORATING BIOLOGICAL TAXONOMIES INTO DEEP BIOACOUSTIC CLASSIFIERS
6139Chronological Age Estimation Under the Guidance of Age-Related Facial Attributes
2177CIF: CONTINUOUS INTEGRATE-AND-FIRE FOR END-TO-END SPEECH RECOGNITION
4338CLASSIFICATION OF DEPTH AND SURFACE EDGES WITH DEEP FEATURES
4880CLASSIFICATION OF EPILEPTIC IEEG SIGNALS BY CNN AND DATA AUGMENTATION
2928CLASSIFICATION OF HIGH-DIMENSIONAL MOTOR IMAGERY TASKS BASED ON AN END-TO-END ROLE ASSIGNED CONVOLUTIONAL NEURAL NETWORK
4850Classify and explain: an interpretable convolutional neural network for lung cancer diagnosis
2822CLASSIFYING ANOMALIES FOR NETWORK SECURITY
5086CLASSIFYING PARTIALLY LABELED NETWORKED DATA VIA LOGISTIC NETWORK LASSO
1702CLCNET: DEEP LEARNING-BASED NOISE REDUCTION FOR HEARING AIDS USING COMPLEX LINEAR CODING
3267CLOCK SYNCHRONIZATION OVER NETWORKS USING SAWTOOTH MODELS
3327CLOTHO: AN AUDIO CAPTIONING DATASET
2183CLOUD-DRIVEN MULTI-WAY MULTIPLE-ANTENNA RELAY SYSTEMS: BEST-USER-LINK SELECTION AND JOINT MMSE DETECTION
1072CLUSTERING OF NONNEGATIVE DATA AND AN APPLICATION TO MATRIX COMPLETION
2740Clutter Identification Based on Sparse Recovery and L1-Type Probabilistic Distance Measures
5500CN-CELEB: A CHALLENGING CHINESE SPEAKER RECOGNITION DATASET
5369CNN-based Analog CSI Feedback in FDD MIMO-OFDM Systems
4151COCHLEAR SIGNAL PROCESSING: A PLATFORM FOR LEARNING THE FUNDAMENTALS OF DIGITAL SIGNAL PROCESSING
1727Coded Illumination and Multiplexing for Lensless Imaging
5356CODE-SWITCHED SPEECH SYNTHESIS USING BILINGUAL PHONETIC POSTERIORGRAM WITH ONLY MONOLINGUAL CORPORA
3549COGANS FOR UNSUPERVISED VISUAL SPEECH ADAPTATION TO NEW SPEAKERS
1728COINCIDENCE, CATEGORIZATION, AND CONSOLIDATION: LEARNING TO RECOGNIZE SOUNDS WITH MINIMAL SUPERVISION
1229COLOR AND ANGULAR RECONSTRUCTION OF LIGHT FIELDS FROM INCOMPLETE-COLOR CODED PROJECTIONS
3453COLOR STABILIZATION FOR MULTI-CAMERA LIGHT-FIELD IMAGING
1461COLOUR COMPRESSION OF PLENOPTIC POINT CLOUDS USING RAHT-KLT WITH PRIOR COLOUR CLUSTERING AND SPECULAR/DIFFUSE COMPONENT SEPARATION
4336COMBINING ACOUSTICS, CONTENT AND INTERACTION FEATURES TO FIND HOT SPOTS IN MEETINGS
2748COMBINING CGAN AND MIL FOR HOTSPOT SEGMENTATION IN BONE SCINTIGRAPHY
5284COMBINING DEEP EMBEDDINGS OF ACOUSTIC AND ARTICULATORY FEATURES FOR SPEAKER IDENTIFICATION
5312COMMUNICATION CONSTRAINED LEARNING WITH UNCERTAIN MODELS
4724COMMUTING CONDITIONAL GANS FOR MULTI-MODAL FUSION
2379COMPARE LEARNING: BI-ATTENTION NETWORK FOR FEW-SHOT LEARNING
3770Comparison of Glottal Closure Instants Detection Algorithms for Emotional Speech
3012COMPARISON OF USER MODELS BASED ON GMM-UBM AND I-VECTORS FOR SPEECH, HANDWRITING, AND GAIT ASSESSMENT OF PARKINSON'S DISEASE PATIENTS
5348COMPLEX PAIRWISE ACTIVITY ANALYSIS VIA INSTANCE LEVEL EVOLUTION REASONING
2366COMPLEX TRAINABLE ISTA FOR LINEAR AND NONLINEAR INVERSE PROBLEMS
2912COMPLEX TRANSFORMER: A FRAMEWORK FOR MODELING COMPLEX-VALUED SEQUENCE
1235COMPLEXITY REDUCTION METHODS FOR INDEX MODULATION BASED DUAL-FUNCTION RADAR COMMUNICATION SYSTEMS
2196COMPOSITE DYNAMIC TEXTURE SYNTHESIS USING HIERARCHICAL LINEAR DYNAMICAL SYSTEM
5628COMPRESSED SENSING BASED CHANNEL ESTIMATION AND OPEN-LOOP TRAINING DESIGN FOR HYBRID ANALOG-DIGITAL MASSIVE MIMO SYSTEMS
1952COMPRESSING FLOW FIELDS WITH EDGE-AWARE HOMOGENEOUS DIFFUSION INPAINTING
4337COMPRESSIVE 2-D OFF-GRID DOA ESTIMATION FOR PROPELLER CAVITATION LOCALIZATION
3211COMPRESSIVE ADAPTIVE BILATERAL FILTERING
5408COMPUTABILITY OF THE PEAK VALUE OF BANDLIMITED SIGNALS
4115COMPUTATION OF "BEST" INTERPOLANTS IN THE Lp SENSE
2605COMPUTING HILBERT TRANSFORM AND SPECTRAL FACTORIZATION FOR SIGNAL SPACES OF SMOOTH FUNCTIONS
4546CONCENTRATION-BASED POLYNOMIAL CALCULATIONS ON NICKED DNA
1303CONDITIONAL DENSITY DRIVEN GRID DESIGN IN POINT-MASS FILTER
5172CONDITIONAL DOMAIN ADVERSARIAL TRANSFER FOR ROBUST CROSS-SITE ADHD CLASSIFICATION USING FUNCTIONAL MRI
3288CONDITIONAL MUTUAL INFORMATION NEURAL ESTIMATOR
2710CONFIDENCE ESTIMATION FOR BLACK BOX AUTOMATIC SPEECH RECOGNITION SYSTEMS USING LATTICE RECURRENT NEURAL NETWORKS
5572CONFIRMNET: CONVOLUTIONAL FIRMNET AND APPLICATION TO IMAGE DENOISING AND INPAINTING
6117CONNECTIONS BETWEEN SPECTRAL PROPERTIES OF ASYMPTOTIC MAPPINGS AND SOLUTIONS TO WIRELESS NETWORK PROBLEMS
3420CONSENSUS-BASED DISTRIBUTED CLUSTERING FOR IOT
1169CONSISTENCY-AWARE MULTI-CHANNEL SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS
1919CONSTANT ENVELOPE MASSIVE MIMO-OFDM PRECODING: AN IMPROVED FORMULATION AND SOLUTION
4199Constant-Envelope Precoding for Satellite Systems
3643CONSTRAINED SPECTRAL CLUSTERING FOR DYNAMIC COMMUNITY DETECTION
2640CONTENT BASED SINGING VOICE EXTRACTION FROM A MUSICAL MIXTURE
5114Content VS Context: How about “Walking Hand-In-Hand" for Image Clustering?
6039CONTEXT AND UNCERTAINTY MODELING FOR ONLINE SPEAKER CHANGE DETECTION
5226CONTINUAL LEARNING FOR INFINITE HIERARCHICAL CHANGE-POINT DETECTION
1652CONTINUAL LEARNING THROUGH ONE-CLASS CLASSIFICATION USING VAE
4592CONTINUOUS SPEECH SEPARATION: DATASET AND ANALYSIS
5182CONTROL OF LINEAR DYNAMICAL SYSTEMS USING SPARSE INPUTS
2994CONTROLLABLE TIME-DELAY TRANSFORMER FOR REAL-TIME PUNCTUATION PREDICTION AND DISFLUENCY DETECTION
3189CONTROLLING THE PERCEIVED SOUND QUALITY FOR DIALOGUE ENHANCEMENT WITH DEEP LEARNING
3299CONVERGENCE-GUARANTEED INDEPENDENT POSITIVE SEMIDEFINITE TENSOR ANALYSIS BASED ON STUDENT'S T DISTRIBUTION
3049CONVERTING WRITTEN LANGUAGE TO SPOKEN LANGUAGE WITH NEURAL MACHINE TRANSLATION FOR LANGUAGE MODELING
1933CONVEX OPTIMISATION-BASED PRIVACY-PRESERVING DISTRIBUTED AVERAGE CONSENSUS IN WIRELESS SENSOR NETWORKS
1032CONVOLUTIONAL BEAMSPACE FOR ARRAY SIGNAL PROCESSING
2868COOPERATIVE LEARNING VIA FEDERATED DISTILLATION OVER FADING CHANNELS
2334CORRDROP: CORRELATION BASED DROPOUT FOR CONVOLUTIONAL NEURAL NETWORKS
4089CORRECTION OF AUTOMATIC SPEECH RECOGNITION WITH TRANSFORMER SEQUENCE-TO-SEQUENCE MODEL
3485CORRELATED MULTI-ARMED BANDITS WITH A LATENT RANDOM SOURCE
1991CORRGAN: SAMPLING REALISTIC FINANCIAL CORRELATION MATRICES USING GENERATIVE ADVERSARIAL NETWORKS
4236COST AWARE ADVERSARIAL LEARNING
4709COUNTING DENSE OBJECTS IN REMOTE SENSING IMAGES
4577COUPLED TRAINING OF SEQUENCE-TO-SEQUENCE MODELS FOR ACCENTED SPEECH RECOGNITION
5802CP-GAN: CONTEXT PYRAMID GENERATIVE ADVERSARIAL NETWORK FOR SPEECH ENHANCEMENT
2498CPWC: CONTEXTUAL POINT WISE CONVOLUTION FOR OBJECT RECOGNITION
4240CRA: A GENERIC COMPRESSION RATIO ADAPTER FOR END-TO-END DATA-DRIVEN IMAGE COMPRESSIVE SENSING RECONSTRUCTION FRAMEWORKS
5752Cramer-Rao bound on DOA Estimation of finite bandwidth signals using a Moving Sensor
6079CRAMÉR-RAO BOUND UNDER NORM CONSTRAINT
5276CRAMÉR-RAO BOUNDS FOR FLAW LOCALIZATION IN SUBSAMPLED MULTISTATIC MULTICHANNEL ULTRASOUND NDT DATA
2134CRNN-CTC BASED MANDARIN KEYWORDS SPOTTING
1435Cross Image Cubic Interpolator for Spatially Varying Exposures
3775CROSS LINGUAL TRANSFER LEARNING FOR ZERO-RESOURCE DOMAIN ADAPTATION
2190Cross-Domain Adaptation for Biometric Identification Using Photoplethysmogram
2677CROSS-DOMAIN JOINT DICTIONARY LEARNING FOR ECG RECONSTRUCTION FROM PPG
1423CROSS-LINGUAL TOPIC PREDICTION FOR SPEECH USING TRANSLATIONS
3066CROSS-SPEAKER SILENT-SPEECH COMMAND WORD RECOGNITION USING ELECTRO-OPTICAL STOMATOGRAPHY
5653CROSS-STAINED SEGMENTATION FROM RENAL BIOPSY IMAGES USING MULTI-LEVEL ADVERSARIAL LEARNING
4978Cross-VAE: Towards Disentangling Expression from Identity for Human Faces
5341Cross-view Attention Network for Breast Cancer Screening from Multi-view Mammograms
5060CROWDSOURCING-BASED RANKING AGGREGATION FOR PERSON RE-IDENTIFICATION
3403CS-R-FCN: CROSS-SUPERVISED LEARNING FOR LARGE-SCALE OBJECT DETECTION
2599Cumulant Slice Reconstruction from Compressive Measurements and Its Application to Line Spectrum Estimation
6159CURRICULUM LEARNING FOR SPEECH EMOTION RECOGNITION FROM CROWDSOURCED LABELS
3392D2NA: DAY-TO-NIGHT ADAPTATION FOR VISION BASED PARKING MANAGEMENT SYSTEM
4654DAMAGE-SENSITIVE AND DOMAIN-INVARIANT FEATURE EXTRACTION FOR VEHICLE-VIBRATION-BASED BRIDGE HEALTH MONITORING
2484DATA AUGMENTATION USING EMPIRICAL MODE DECOMPOSITION ON NEURAL NETWORKS TO CLASSIFY IMPACT NOISE IN VEHICLE
3478DATA SELECTION KERNEL CONJUGATE GRADIENT ALGORITHM
4243DATA-DRIVEN HARMONIC FILTERS FOR AUDIO REPRESENTATION LEARNING
1177DATA-DRIVEN MODEL SET DESIGN FOR MODEL AVERAGED PARTICLE FILTER
3664DATA-DRIVEN WIND SPEED ESTIMATION USING MULTIPLE MICROPHONES
1369DEBLURRING AND SUPER-RESOLUTION USING DEEP GATED FUSION ATTENTION NETWORKS FOR FACE IMAGES
1199DECENTRALIZED EXPECTED CONSISTENT SIGNAL RECOVERY FOR QUANTIZATION MEASUREMENTS
4942DECENTRALIZED MIN-MAX OPTIMIZATION: FORMULATIONS, ALGORITHMS AND APPLICATIONS IN NETWORK POISONING ATTACK
2630Decentralized optimization with non-identical sampling in presence of stragglers
4004Decentralized Stochastic Non-convex Optimization over Weakly Connected Time-varying Digraphs
1847DECIDABLE VARIABLE-RATE DATAFLOW FOR HETEROGENEOUS SIGNAL PROCESSING SYSTEMS
3138Decoding 5G-NR Communications via Deep Learning
2927DECODING MOVEMENT IMAGINATION AND EXECUTION FROM EEG SIGNALS USING BCI-TRANSFER LEARNING METHOD BASED ON RELATION NETWORK
2536DECOMPOSED CYCLEGAN FOR SINGLE IMAGE DERAINING WITH UNPAIRED DATA
5045DEEP AUDIO-VISUAL SPEECH SEPARATION WITH ATTENTION MECHANISM
2834DEEP AUTOTUNER: A PITCH CORRECTING NETWORK FOR SINGING PERFORMANCES
4230DEEP CASA FOR TALKER-INDEPENDENT MONAURAL SPEECH SEPARATION
4160DEEP CLUSTERING FOR DOMAIN ADAPTATION
4196DEEP CLUSTERING WITH CONCRETE K-MEANS
4175DEEP CONTEXTUALIZED ACOUSTIC REPRESENTATIONS FOR SEMI-SUPERVISED SPEECH RECOGNITION
5598DEEP ENCODED LINGUISTIC AND ACOUSTIC CUES FOR ATTENTION BASED END TO END SPEECH EMOTION RECOGNITION
5326DEEP EXPOSURE FUSION WITH DEGHOSTING VIA HOMOGRAPHY ESTIMATION AND ATTENTION LEARNING
5410DEEP FLOW COLLABORATIVE NETWORK FOR ONLINE VISUAL TRACKING
5192Deep geometric knowledge distillation with graphs
5087DEEP IMAGE DEBLURRING USING LOCAL CORRELATION BLOCK
4158DEEP JAMES-STEIN NEURAL NETWORKS FOR BRAIN-COMPUTER INTERFACES
4868DEEP JOINT SOURCE-CHANNEL CODING FOR WIRELESS IMAGE RETRIEVAL
4354DEEP JOINT-SOURCE CHANNEL CODING OF IMAGES WITH FEEDBACK
1557DEEP LEARNING ABILITIES TO CLASSIFY INTRICATE VARIATIONS IN TEMPORAL DYNAMICS OF MULTIVARIATE TIME SERIES
5554Deep Learning Based Bearing Fault Diagnosis Using 1D Convolutional Neural Network with Modified Octave Convolution
4247DEEP LEARNING BASED PREDICTION OF HYPERNASALITY FOR CLINICAL APPLICATIONS
1586DEEP LEARNING FOR ROBUST POWER CONTROL FOR WIRELESS NETWORKS
4197DEEP LEARNING-BASED BEAM ALIGNMENT IN MMWAVE VEHICULAR NETWORKS
2427DEEP MATRIX COMPLETION ON GRAPHS: APPLICATION IN DRUG TARGET INTERACTION PREDICTION
1558DEEP META-RELATION NETWORK FOR VISUAL FEW-SHOT LEARNING
4483DEEP METRIC LEARNING BASED ON CENTER-RANKED LOSS FOR GAIT RECOGNITION
1282Deep Monocular Video Depth Estimation Using Temporal Attention
3722DEEP MULTI-SCALE GABOR WAVELET NETWORK FOR IMAGE RESTORATION
2304DEEP NEURAL NETWORK BASED MATRIX COMPLETION FOR INTERNET OF THINGS NETWORK LOCALIZATION
5686DEEP NEURAL NETWORKS BASED AUTOMATIC SPEECH RECOGNITION FOR FOUR ETHIOPIAN LANGUAGES
4148DEEP PERCEPTUAL OPTIMIZER FOR VIDEO PRECODING
3174DEEP PRODUCT QUANTIZATION MODULE FOR EFFICIENT IMAGE RETRIEVAL
5116DEEP RAINRATE ESTIMATION FROM HIGHLY ATTENUATED DOWNLINK SIGNALS OF GROUND-BASED COMMUNICATIONS SATELLITE TERMINALS
1764DEEP RESIDUAL NETWORK FOR MSFA RAW IMAGE DENOISING
1237DEEP SOFT INTERFERENCE CANCELLATION FOR MIMO DETECTION
1685DEEP SPEECH EXTRACTION WITH TIME-VARYING SPATIAL FILTERING GUIDED BY DESIRED DIRECTION ATTRACTOR
5457DEEPMULTI-REGIONHASHING
2438Deep-Neural-Network based Fall-back Mechanism in Interference-Aware Receiver Design
4944DEEP-SST-EDDIES: A DEEP LEARNING FRAMEWORK TO DETECT OCEANIC EDDIES IN SEA SURFACE TEMPERATURE IMAGES
2726Defending Graph Convolutional Networks against Adversarial Attacks
3693Defense against adversarial attacks on spoofing countermeasures of ASV
3426Deja-vu: Double Feature Presentation in Deep Transformer Networks
3532DELIBERATION MODEL BASED TWO-PASS END-TO-END SPEECH RECOGNITION
2019DEMYSTIFYING TASNET: A DISSECTING APPROACH
1717DENOISING OF EVENT-BASED SENSORS WITH SPATIO-TEMPORAL CORRELATION
5028DENSE CROWD COUNTING WITH STACKED POOLING FOR BOOSTING SCALE
5830DENSE MAPPING OF INTRACELLULAR DIFFUSION AND DRIFT FROM SINGLE-PARTICLE TRACKING DATA ANALYSIS
1746DENSE RESIDUAL NETWORK FOR RETINAL VESSEL SEGMENTATION
4246DENSELY CONNECTED NEURAL NETWORK WITH DILATED CONVOLUTIONS FOR REAL-TIME SPEECH ENHANCEMENT IN THE TIME DOMAIN
1392DEPTH ESTIMATION FROM SINGLE IMAGE THROUGH MULTI-PATH-MULTI-RATE DIVERSE FEATURE EXTRACTOR
3799DEPTH MAP FINGERPRINTING AND SPLICING DETECTION
2634DEPTHWISE-STFT BASED SEPARABLE CONVOLUTIONAL NEURAL NETWORKS
4956Deriving Compact Feature Representations Via Annealed Contraction
3676DESIGN CONSIDERATIONS FOR HYPOTHESIS REJECTION MODULES IN SPOKEN LANGUAGE UNDERSTANDING SYSTEMS
1856DESIGN OF A CONVERGENCE-AWARE BASED EXPECTATION PROPAGATION ALGORITHM FOR UPLINK MIMO SCMA SYSTEMS
1351DESIGN-GAN: CROSS-CATEGORY FASHION TRANSLATION DRIVEN BY LANDMARK ATTENTION
3410DETECT INSIDER ATTACKS USING CNN IN DECENTRALIZED OPTIMIZATION
2696DETECTING ADVERSARIAL ATTACKS IN TIME-SERIES DATA
1378DETECTING AUTISM SPECTRUM DISORDER USING TOPOLOGICAL DATA ANALYSIS
3909DETECTING EMOTION PRIMITIVES FROM SPEECH AND THEIR USE IN DISCERNING CATEGORICAL EMOTIONS
2923Detecting Mismatch between Text Script and Voice-over Using Utterance Verification Based on Phoneme Recognition Ranking
1753DETECTING MULTIPLE SPEECH DISFLUENCIES USING A DEEP RESIDUAL NETWORK WITH BIDIRECTIONAL LONG SHORT-TERM MEMORY
3007DETECTION AND ANALYSIS OF T/D DELETION IN LIBRISPEECH
4467DETECTION OF ADVERSARIAL ATTACKS AND CHARACTERIZATION OF ADVERSARIAL SUBSPACE
2004DETECTION OF MALICIOUS VBSCRIPT USING STATIC AND DYNAMIC ANALYSIS WTIH RECURRENT DEEP LEARNING
3290DETECTION OF MILD DYSPNEA FROM PAIRS OF SPEECH RECORDINGS
3951DETECTION OF S1 AND S2 LOCATIONS IN PHONOCARDIOGRAM SIGNALS USING ZERO FREQUENCY FILTER
4239DETECTION OF SPEECH EVENTS AND SPEAKER CHARACTERISTICS THROUGH PHOTO-PLETHYSMOGRAPHIC SIGNAL NEURAL PROCESSING
6145Detection of Speech Smoothing on Very Short Clips
2783DETERMINED SOURCE SEPARATION USING THE SPARSITY OF IMPULSE RESPONSES
5691DETERMINISTIC FEATURE DECOUPLING BY SURFING INVARIANCE MANIFOLDS
4472DFSMN-SAN WITH PERSISTENT MEMORY MODEL FOR AUTOMATIC SPEECH RECOGNITION
1455DGAN: DISENTANGLED REPRESENTATION LEARNING FOR ANISOTROPIC BRDF RECONSTRUCTION
2285DIACRITIC-LEVEL PRONUNCIATION ANALYSIS USING PHONOLOGICAL FEATURES
1937DIAGONALIZABLE SHIFT AND FILTERS FOR DIRECTED GRAPHS BASED ON THE JORDAN-CHEVALLEY DECOMPOSITION
5433DIALOGUE HISTORY INTEGRATION INTO END-TO-END SIGNAL-TO-CONCEPT SPOKEN LANGUAGE UNDERSTANDING SYSTEMS
1402DIFFERENTIABLE BRANCHING IN DEEP NETWORKS FOR FAST INFERENCE
6088DIFFERENTIALLY MODULATED SPECTRALLY EFFICIENT FREQUENCY-DIVISION MULTIPLEXING
5633DIGITAL WATERMARKING FOR PROTECTING AUDIO CLASSIFICATION DATASETS
2810DILATED CONVOLUTIONAL NEURAL NETWORKS FOR PANORAMIC IMAGE SALIENCY PREDICTION
6134DIRECTION OF ARRIVAL ESTIMATION FOR REVERBERANT SPEECH BASED ON ENHANCED DECOMPOSITION OF THE DIRECT SOUND
4708DISCOVERING CAUSALITIES FROM CARDIOTOCOGRAPHY SIGNALS USING IMPROVED CONVERGENT CROSS MAPPING WITH GAUSSIAN PROCESSES
1629Discrete Wasserstein Autoencoders for Document Retrieval
1467Discriminant and sparsity based least squares regression with l1 regularization for feature representation
1426DISCRIMINANT GENERATIVE ADVERSARIAL NETWORKS WITH ITS APPLICATION TO EQUIPMENT HEALTH CLASSIFICATION
2861DISENTANGLED MULTIDIMENSIONAL METRIC LEARNING FOR MUSIC SIMILARITY
4364DISENTANGLED SPEECH EMBEDDINGS USING CROSS-MODAL SELF-SUPERVISION
3191Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning
5583DISENTANGLING TIMBRE AND SINGING STYLE WITH MULTI-SINGER SINGING SYNTHESIS SYSTEM
1661DISPERSIVE GRID-FREE ORTHOGONAL MATCHING PURSUIT FOR MODAL ESTIMATION IN OCEAN ACOUSTICS
2954DISTILLING ATTENTION WEIGHTS FOR CTC-BASED ASR SYSTEMS
1236DISTRIBUTED DETECTION OF SPARSE SIGNALS WITH 1-BIT DATA IN TWO-LEVEL TWO-DEGREE TREE-STRUCTURED SENSOR NETWORKS
5292DISTRIBUTED EQUALIZATION AND POWER ALLOCATION FOR MULTI-CARRIER BIDIRECTIONAL FILTER-AND-FORWARD RELAY NETWORKS
6082Distributed Nesterov gradient methods over arbitrary graphs
4867DISTRIBUTED NON-ORTHOGONAL PILOT DESIGN FOR MULTI-CELL MASSIVE MIMO SYSTEMS
1579Distributed Quantization for Sparse Time Sequences
3892DISTRIBUTED TENSOR COMPLETION OVER NETWORKS
5188DISTRIBUTED TRACKING AND CIRCUMNAVIGATION USING BEARING MEASUREMENTS
3070DISTRIBUTED VERIFICATION OF BELIEF PRECISIONS CONVERGENCE IN GAUSSIAN BELIEF PROPAGATION
2585DISTRIBUTED WAVE-DOMAIN ACTIVE NOISE CONTROL BASED ON THE DIFFUSION STRATEGY
5126DISTRIBUTION OF THE PRODUCT OF A COMPLEX GAUSSIAN MATRIX AND VECTOR AND ITS SUM WITH A COMPLEX GAUSSIAN VECTOR
2596DIVERGENCE-BASED ADAPTIVE EXTREME VIDEO COMPLETION
1247Diversity and Sparsity: A New Perspective on Index Tracking
5977dMazeRunner: OPTIMIZING CONVOLUTIONS ON DATAFLOW ACCELERATORS
3406DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
2917DNN-based Mask Estimation Integrating Spectral and Spatial Features for Robust Beamforming
5811DNN-BASED SPEECH PRESENCE PROBABILITY ESTIMATION FOR MULTI-FRAME SINGLE-MICROPHONE SPEECH ENHANCEMENT
5018DNN-BASED SPEECH RECOGNITION FOR GLOBALPHONE LANGUAGES
1680DNN-CHIP PREDICTOR: A MULTI-GRAINED GRAPH-BASED PERFORMANCE SIMULATOR FOR DNN ACCELERATORS
1813DNN-SUPPORTED MASK-BASED CONVOLUTIONAL BEAMFORMING FOR SIMULTANEOUS DENOISING, DEREVERBERATION, AND SOURCE SEPARATION
2721DOA ESTIMATION IN SYSTEMS WITH NONLINEARITIES FOR MMWAVE COMMUNICATIONS
1560DOA TRACKING VIA SIGNAL-SUBSPACE PROJECTOR UPDATE
4947DOMAIN ADAPTATION FOR GENERALIZATION OF FACE PRESENTATION ATTACK DETECTION IN MOBILE SETTINGS WITH MINIMAL INFORMATION
4058DOMAIN ROBUST, FAST, AND COMPACT NEURAL LANGUAGE MODELS
1343DRIFT DETECTION AND CORRECTION POST-TRACKING
3027DRSS-BASED LOCALISATION USING WEIGHTED INSTRUMENTAL VARIABLES AND SELECTIVE POWER MEASUREMENT
4174D-SLAM: DIFFUSION SOURCE LOCALIZATION AND TRAJECTORY MAPPING
1424DUAL-PATH RNN: EFFICIENT LONG SEQUENCE MODELING FOR TIME-DOMAIN SINGLE-CHANNEL SPEECH SEPARATION
1613DURATION ROBUST WEAKLY SUPERVISED SOUND EVENT DETECTION
5121DYNA-BOLT: DOMAIN ADAPTIVE BINARY FACTORIZATION OF CURRENT WAVEFORMS FOR ENERGY DISAGGREGATION
4755DYNAMIC ATTACK SCORING USING DISTRIBUTED LOCAL DETECTORS
3409Dynamic Channel Pruning for Correlation Filter based Object Tracking
2508DYNAMIC METASURFACE ANTENNAS FOR BIT-CONSTRAINED MIMO-OFDM RECEIVERS
1248DYNAMIC OVERSAMPLING IN 1-BIT QUANTIZED ASYNCHRONOUS LARGE-SCALE MULTIPLE-ANTENNA SYSTEMS FOR SUSTAINABLE IOT NETWORKS
4063Dynamic Resource Allocation for Wireless Edge Machine Learning with Latency and Accuracy Guarantees
5754DYNAMIC RESOURCE OPTIMIZATION AND ALTITUDE SELECTION IN UAV-BASED MULTI-ACCESS EDGE COMPUTING
4835DYNAMIC TEMPORAL RESIDUAL LEARNING FOR SPEECH RECOGNITION
1843DYNAMIC VARIATIONAL AUTOENCODERS FOR VISUAL PROCESS MODELING
4324DYNAMICALLY MODULATED DEEP METRIC LEARNING FOR VISUAL SEARCH
5632DYSARTHRIC SPEECH RECOGNITION WITH LATTICE-FREE MMI
5204E2E-SINCNET: TOWARD FULLY END-TO-END SPEECH RECOGNITION
3261ECG HEARTBEAT CLASSIFICATION BASED ON MULTI-SCALE WAVELET CONVOLUTIONAL NEURAL NETWORKS
5234EDGEFOOL: AN ADVERSARIAL IMAGE ENHANCEMENT FILTER
4748EDNFC-NET: CONVOLUTIONAL NEURAL NETWORK WITH NESTED FEATURE CONCATENATION FOR NUCLEI-INSTANCE SEGMENTATION
3758EEG CONNECTIVITY - INFORMED COOPERATIVE ADAPTIVE LINE ENHANCER FOR RECOGNITION OF BRAIN STATE
5317EEG FEATURE SELECTION USING ORTHOGONAL REGRESSION: APPLICATION TO EMOTION RECOGNITION
3209EFFECT OF CHOICE OF PROBABILITY DISTRIBUTION, RANDOMNESS, AND SEARCH METHODS FOR ALIGNMENT MODELING IN SEQUENCE-TO-SEQUENCE TEXT-TO-SPEECH SYNTHESIS USING HARD ALIGNMENT
5957EFFECT OF FRICATION DURATION AND FORMANT TRANSITIONS ON THE PERCEPTION OF FRICATIVES IN VCV UTTERANCES
5762EFFECT OF UNDERSAMPLING ON NON-NEGATIVE BLIND DECONVOLUTION WITH AUTOREGRESSIVE FILTERS
4034EFFECTIVE APPROXIMATE MAXIMUM LIKELIHOOD ESTIMATION OF ANGLES OF ARRIVAL FOR NON-COHERENT SUB-ARRAYS
3642EFFECTIVE APPROXIMATION OF BANDLIMITED SIGNALS AND THEIR SAMPLES
1025Effective Pipeline for Compressing Deep Object Detectors
5449EFFECTIVE WAVENET ADAPTATION FOR VOICE CONVERSION WITH LIMITED DATA
3729Effectiveness of Random Deep Feature Selection for securing image manipulation detectors against adversarial examples
4038Effectiveness of self-supervised pre-training for ASR
5859EFFECTS OF SPECTRAL TILT ON LISTENERS PREFERENCES AND INTELLIGIBILITY
1977EFFICIENT ALGORITHM TO IMPLEMENT SLIDING SINGULAR SPECTRUM ANALYSIS WITH APPLICATION TO BIOMEDICAL SIGNAL DENOISING
4927EFFICIENT AND SCALABLE NEURAL RESIDUAL WAVEFORM CODING WITH COLLABORATIVE QUANTIZATION
3824EFFICIENT BELIEF PROPAGATION FOR GRAPH MATCHING
5062EFFICIENT BIRD SOUND DETECTION ON THE BELA EMBEDDED SYSTEM
2222EFFICIENT CONSTRAINED ENCODERS CORRECTING A SINGLE NUCLEOTIDE EDIT IN DNA STORAGE
1454Efficient Decoupled Neural Architecture Search by Structure and Operation Sampling
2088EFFICIENT DEEP LEARNING-BASED LOSSY IMAGE COMPRESSION VIA ASYMMETRIC AUTOENCODER AND PRUNING
3319EFFICIENT ESTIMATION OF MIXING MATRIX USING A TWO-SENSOR ARRAY
2825EFFICIENT IMAGE SUPER RESOLUTION VIA CHANNEL DISCRIMINATIVE DEEP NEURAL NETWORK PRUNING
2012Efficient Multichannel Nonlinear Acoustic Echo Cancellation Based on a Cooperative strategy
6080EFFICIENT REPRESENTATION AND SPARSE SAMPLING OF HEAD-RELATED TRANSFER FUNCTIONS USING PHASE-CORRECTION BASED ON EAR ALIGNMENT
1509EFFICIENT SCENE TEXT DETECTION WITH TEXTUAL ATTENTION TOWER
5897EFFICIENT SHALLOW WAVENET VOCODER USING MULTIPLE SAMPLES OUTPUT BASED ON LAPLACIAN DISTRIBUTION AND LINEAR PREDICTION
5367Efficient Super-Resolution Two-Dimensional Harmonic Retrieval via Enhanced Low-Rank Structured Covariance Reconstruction
2167Efficient Techniques for In-Band System Information Broadcast in Multi-cell Massive MIMO
4524EFFICIENT TRAINABLE FRONT-ENDS FOR NEURAL SPEECH ENHANCEMENT
6090EIGENBEAM-ESPRIT FOR DOA-VECTOR ESTIMATION
6149EIGENDECOMPOSITION-FREE SAMPLING SET SELECTION FOR GRAPH SIGNALS
1368Electric Analog Circuit Design with Hypernetworks and a Differential Simulator
1901Electro-Magnetic Side-Channel Attack Through Learned Denoising and Classification
2453Eliminating Out-Of-Cell Interference in Cellular Massive MIMO With a Single Additional Transceiver
1761EMBEDDED LARGE–SCALE HANDWRITTEN CHINESE CHARACTER RECOGNITION
2900EMET: EMBEDDINGS FROM MULTILINGUAL-ENCODER TRANSFORMER FOR FAKE NEWS DETECTION
5365EMOTIONAL SPEECH SYNTHESIS WITH RICH AND GRANULARIZED CONTROL
4853EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH
5309Empirical SURE-guided microscopy super-resolution image reconstruction from confocal multi-array detectors
1223ENCODER-RECURRENT DECODER NETWORK FOR SINGLE IMAGE DEHAZING
3346ENCODING AND DECODING MIXED BANDLIMITED SIGNALS USING SPIKING INTEGRATE-AND-FIRE NEURONS
3738ENCODING TEMPORAL INFORMATION FOR AUTOMATIC DEPRESSION RECOGNITION FROM FACIAL ANALYSIS
4249END TO END SPEECH RECOGNITION ERROR PREDICTION WITH SEQUENCE TO SEQUENCE LEARNING
3034END-END SPEECH-TO-TEXT TRANSLATION WITH MODALITY AGNOSTIC META-LEARNING
3428END-TO-END ACCENT CONVERSION WITHOUT USING NATIVE UTTERANCES
3857END-TO-END ARCHITECTURES FOR ASR-FREE SPOKEN LANGUAGE UNDERSTANDING
4334END-TO-END ARTICULATORY MODELING FOR DYSARTHRIA ARTICULATORY ATTRIBUTE DETECTION
4142END-TO-END AUDITORY OBJECT RECOGNITION VIA INCEPTION NUCLEUS
2882END-TO-END AUTOMATIC SPEECH RECOGNITION INTEGRATED WITH CTC-BASED VOICE ACTIVITY DETECTION
4674END-TO-END CODE-SWITCHING TTS WITH CROSS-LINGUAL LANGUAGE MODEL
4493END-TO-END GENERATION OF TALKING FACES FROM NOISY SPEECH
4332END-TO-END MICROPHONE PERMUTATION AND NUMBER INVARIANT MULTI-CHANNEL SPEECH SEPARATION
2763END-TO-END MULTI-PERSON AUDIO/VISUAL AUTOMATIC SPEECH RECOGNITION
4758END-TO-END MULTI-SPEAKER SPEECH RECOGNITION WITH TRANSFORMER
4367END-TO-END MULTI-TALKER OVERLAPPING SPEECH RECOGNITION
4496End-to-end Non-Negative Autoencoders for Sound Source Separation
3159END-TO-END SPEECH TRANSLATION WITH SELF-CONTAINED VOCABULARY MANIPULATION
5939End-to-End Spoken Language Understanding Without Matched Language Speech Model Pretraining Data
3307END-TO-END TRAINING OF TIME DOMAIN AUDIO SEPARATION AND RECOGNITION
3618END-TO-END VOICE CONVERSION VIA CROSS-MODAL KNOWLEDGE DISTILLATION FOR DYSARTHRIC SPEECH RECONSTRUCTION
3572EnerGAN: A GENERATIVE ADVERSARIAL NETWORK FOR ENERGY DISAGGREGATION
2659ENERGY DISAGGREGATION FROM LOW SAMPLING FREQUENCY MEASUREMENTS USING MULTI-LAYER ZERO CROSSING RATE
2656ENERGY DISAGGREGATION USING FRACTIONAL CALCULUS
3178ENERGY EFFICIENT ACCELERATION OF FLOATING POINT APPLICATIONS ONTO CGRA
5193Energy-efficient 3D UAV trajectory design for data collection in wireless sensor networks
5681Energy-Efficient Bit Allocation for Resolution-Adaptive ADC in Multiuser Large-Scale MIMO Systems: Global Optimality
2717ENERGY-EFFICIENT DISTRIBUTED LEARNING WITH COARSELY QUANTIZED SIGNALS
4161ENHANCE FEATURE REPRESENTATION OF ELECTROENCEPHALOGRAM FOR SEIZURE DETECTION
1070ENHANCE PART-BASED MODEL FOR PERSON RE-IDENTIFICATION WITH FUSED MULTI-SCALE FEATURES
1466ENHANCED ACTION TUBELET DETECTOR FOR SPATIO-TEMPORAL VIDEO ACTION DETECTION
1210Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning
3015ENHANCED METHOD OF AUDIO CODING USING CNN-BASED SPECTRAL RECOVERY WITH ADAPTIVE STRUCTURE
3686Enhanced Mixture Population Monte Carlo Via Stochastic Optimization and Markov Chain Monte Carlo Sampling
5010ENHANCED NON-LOCAL CASCADING NETWORK WITH ATTENTION MECHANISM FOR HYPERSPECTRAL IMAGE DENOISING
3354Enhanced Safety of Autonomous Driving by Incorporating Terrestrial Signals of Opportunity
1897ENHANCEMENT OF CODED SPEECH USING A MASK-BASED POST-FILTER
4750ENHANCING END-TO-END MULTI-CHANNEL SPEECH SEPARATION VIA SPATIAL FEATURE LEARNING
1747ENHANCING THE LABELLING OF AUDIO SAMPLES FOR AUTOMATIC INSTRUMENT CLASSIFICATION BASED ON NEURAL NETWORKS
1870ENSEMBLE NETWORK FOR RANKING IMAGES BASED ON VISUAL APPEAL
2280ENVIRONMENT-AWARE RECONFIGURABLE NOISE SUPPRESSION
4976EPIGRAPHICAL REFORMULATION FOR NON-PROXIMABLE MIXED NORMS
3599EPI-NEIGHBORHOOD DISTRIBUTION BASED LIGHT FIELD DEPTH ESTIMATION
5453EPOCH EXTRACTION FROM A SPEECH SIGNAL USING GAMMATONE WAVELETS IN A SCATTERING NETWORK
1292EQUALIZATION OF OFDM WAVEFORMS WITH INSUFFICIENT CYCLIC PREFIX
1178ERNET FAMILY: HARDWARE-ORIENTED CNN MODELS FOR COMPUTATIONAL IMAGING USING BLOCK-BASED INFERENCE
5722ERROR ANALYSIS APPLIED TO END-TO-END SPOKEN LANGUAGE UNDERSTANDING
6127Error Preserving Correction: A Method for CP Decomposition at a Target Error Bound
2815ESPNET-TTS: UNIFIED, REPRODUCIBLE, AND INTEGRATABLE OPEN SOURCE END-TO-END TEXT-TO-SPEECH TOOLKIT
1999ESRGAN+ : FURTHER IMPROVING ENHANCED SUPER-RESOLUTION GENERATIVE ADVERSARIAL NETWORK
3541ESTIMATING CENTRALITY BLINDLY FROM LOW-PASS FILTERED GRAPH SIGNALS
1658ESTIMATING STRUCTURAL MISSING VALUES VIA LOW-TUBAL-RANK TENSOR COMPLETION
1720ESTIMATING THE DEGREE OF SLEEPINESS BY INTEGRATING ARTICULATORY FEATURE KNOWLEDGE IN RAW WAVEFORM BASED CNNS
5152Estimation of Information in Parallel Gaussian Channels via Model Order Selection
2995ESTIMATION OF POST-NONLINEAR CAUSAL MODELS USING AUTOENCODING STRUCTURE
2015EUROPARL-ST: A MULTILINGUAL CORPUS FOR SPEECH TRANSLATION OF PARLIAMENTARY DEBATES
5323EVALUATING VOICE CONVERSION-BASED PRIVACY PROTECTION AGAINST INFORMED ATTACKERS
2595EVALUATION OF DEEP-LEARNING-BASED VOICE ACTIVITY DETECTORS AND ROOM IMPULSE RESPONSE MODELS IN REVERBERANT ENVIRONMENTS
2802EVALUATION OF JOINT AUDITORY ATTENTION DECODING AND ADAPTIVE BINAURAL BEAMFORMING APPROACH FOR HEARING DEVICES WITH ATTENTION SWITCHING
2704EVALUATION OF SENSOR SELF-NOISE IN BINAURAL RENDERING OF SPHERICAL MICROPHONE ARRAY SIGNALS
4303EVENT-DRIVEN SIGNAL PROCESSING WITH NEUROMORPHIC COMPUTING SYSTEMS
3018EXACT SPARSE NONNEGATIVE LEAST SQUARES
5878EXEMPLAR TEACHING PRACTICES IN ENGINEERING COURSES IN U.S. UNIVERSITIES
2936EXOCENTRIC TO EGOCENTRIC IMAGE GENERATION VIA PARALLEL GENERATIVE ADVERSARIAL NETWORK
2914EXPERIMENTS IN CREATING ONLINE COURSE CONTENT FOR SIGNAL PROCESSING EDUCATION
5402EXPLOITATION OF 3D CITY MAPS FOR HYBRID 5G RTT AND GNSS POSITIONING SIMULATIONS
4182EXPLOITING CHANNEL LOCALITY FOR ADAPTIVE MASSIVE MIMO SIGNAL DETECTION
4781EXPLOITING COMMUTATIVITY CONDITION FOR CP DECOMPOSITION VIA APPROXIMATE SIMULTANEOUS DIAGONALIZATION
5194EXPLOITING PERIODICITY FEATURES FOR JOINT DETECTION AND DOA ESTIMATION OF SPEECH SOURCES USING CONVOLUTIONAL NEURAL NETWORKS
1878EXPLOITING RAYS IN BLIND LOCALIZATION OF DISTRIBUTED SENSOR ARRAYS
3126EXPLOITING SPARSITY FOR ROBUST SENSOR NETWORK LOCALIZATION IN MIXED LOS/NLOS ENVIRONMENTS
5108Exploiting Two-dimensional Symmetry and Unimodality for Model-free Source Localization in Harsh Environment
5799EXPLOITING VOCAL TRACT COORDINATION USING DILATED CNNS FOR DEPRESSION DETECTION IN NATURALISTIC ENVIRONMENTS
3904EXPLORATION METHODOLOGY FOR BTI-INDUCED FAILURES ON RRAM-BASED EDGE AI SYSTEMS
3700EXPLORING A ZERO-ORDER DIRECT HMM BASED ON LATENT ATTENTION FOR AUTOMATIC SPEECH RECOGNITION
3098EXPLORING APPROPRIATE ACOUSTIC AND LANGUAGE MODELLING CHOICES FOR CONTINUOUS DYSARTHRIC SPEECH RECOGNITION
2922EXPLORING BIO-BEHAVIORAL SIGNAL TRAJECTORIES OF STATE ANXIETY DURING PUBLIC SPEAKING
4555Exploring Energy Efficient Quantum-resistant Signal Processing Using Array Processors
1241Exploring Entity-level Spatial Relationships for Image-Text Matching
4178Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
1147EXPOSURE INTERPOLATION VIA HYBRID LEARNING
5658EXPRESSION-GUIDED EEG REPRESENTATION LEARNING FOR EMOTION RECOGNITION
2139EXTENDED CYCLIC COORDINATE DESCENT FOR ROBUST ROW-SPARSE SIGNAL RECONSTRUCTION IN THE PRESENCE OF OUTLIERS
4348Extended Object Tracking Using Hierarchical Truncated Gaussian Measurement Model
2378EXTRACTING UNIT EMBEDDINGS USING SEQUENCE-TO-SEQUENCE ACOUSTIC MODELS FOR UNIT SELECTION SPEECH SYNTHESIS
2615EXTRAPOLATED ALTERNATING ALGORITHMS FOR APPROXIMATE CANONICAL POLYADIC DECOMPOSITION
4796F0-CONSISTENT MANY-TO-MANY NON-PARALLEL VOICE CONVERSION VIA CONDITIONAL AUTOENCODER
3206FACE FEATURE RECOVERY VIA TEMPORAL FUSION FOR PERSON SEARCH
3764FACIAL EMOTION RECOGNITION USING LIGHT FIELD IMAGES WITH DEEP ATTENTION-BASED BIDIRECTIONAL LSTM
3244FACIAL FEATURE EMBEDDED CYCLEGAN FOR VIS-NIR TRANSLATION
4100FAR-FIELD LOCATION GUIDED TARGET SPEECH EXTRACTION USING END-TO-END SPEECH RECOGNITION OBJECTIVES
2631FAST ACOUSTIC SCATTERING USING CONVOLUTIONAL NEURAL NETWORKS
2397FAST AND ACCURATE EMBEDDED DCNN FOR RGB-D BASED SIGN LANGUAGE RECOGNITION
3660FAST AND HIGH-QUALITY SINGING VOICE SYNTHESIS SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORKS
4227FAST AND STABLE BLIND SOURCE SEPARATION WITH RANK-1 UPDATES
1151FAST BLOCK-SPARSE ESTIMATION FOR VECTOR NETWORKS
2315FAST CLUSTERING WITH CO-CLUSTERING VIA DISCRETE NON-NEGATIVE MATRIX FACTORIZATION FOR IMAGE IDENTIFICATION
4209FAST DIRECTION-OF-ARRIVAL ESTIMATION OF MULTIPLE TARGETS USING DEEP LEARNING AND SPARSE ARRAYS
1132FAST DOMAIN ADAPTATION FOR GOAL-ORIENTED DIALOGUE USING A HYBRID GENERATIVE-RETRIEVAL TRANSFORMER
2062Fast Graph Metric Learning via Gershgorin Disc Alignment
6133Fast High-Dimensional Kernel Filtering
1299FAST INDEPENDENT VECTOR EXTRACTION BY ITERATIVE SINR MAXIMIZATION
4155FAST INTENT CLASSIFICATION FOR SPOKEN LANGUAGE UNDERSTANDING
4302Fast Lattice-free Keyword Filtering for Accelerated Spoken Term Detection
5765FAST OPTICAL SYSTEM IDENTIFICATION BY NUMERICAL INTERFEROMETRY
1460FAST SINGLE-VIEW 3D OBJECT RECONSTRUCTION WITH FINE DETAILS THROUGH DILATED DOWNSAMPLE AND MULTI-PATH UPSAMPLE DEEP NEURAL NETWORK
2690Fast Start-Up Algorithm for Adaptive Noise Cancellers with Novel SNR Estimation and Stepsize Control
2169FAST TRAINING OF DEEP NEURAL NETWORKS FOR SPEECH RECOGNITION
1468Faster-than-Nyquist Signaling via Spatiotemporal Symbol-Level Precoding for Multi-User MISO Redundant Transmissions
3936Favorable Propagation and Linear Multiuser Detection for Distributed Antenna Systems
1396FCEM: A Novel Fast Correlation Extract Model For Real Time Steganalysis of VoIP Stream via Multi-head Attention
1905FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION
2687FEATURE AFFINE PROJECTION ALGORITHMS
3954Feature drift resilient tracking of the carotid artery wall using unscented Kalman filtering with data fusion
3948FEATURE ENHANCEMENT WITH DEEP FEATURE LOSSES FOR SPEAKER VERIFICATION
1306FEATURE SELECTION UNDER ORTHOGONAL REGRESSION WITH REDUNDANCY MINIMIZING
4718FEDERATED CLASSIFICATION WITH LOW COMPLEXITY REPRODUCING KERNEL HILBERT SPACE REPRESENTATIONS
1670FEDERATED LEARNING WITH MUTUALLY COOPERATING DEVICES: A CONSENSUS APPROACH TOWARDS SERVER-LESS MODEL OPTIMIZATION
1676FEDERATED LEARNING WITH QUANTIZATION CONSTRAINTS
3157FEDERATED NEUROMORPHIC LEARNING OF SPIKING NEURAL NETWORKS FOR LOW-POWER EDGE INTELLIGENCE
5329FEDERATED TRUTH INFERENCE OVER DISTRIBUTED CROWDSOURCING PLATFORMS
4381FEDERATING SOLAR, STORAGE AND COMMUNICATIONS IN THE ELECTRIC GRID AND INTERNET OF THINGS
4216FEEDBACK RECURRENT AUTOENCODER
4039FEEDBACK TURBO AUTOENCODER
2862FEW-SHOT ACOUSTIC EVENT DETECTION VIA META LEARNING
3586FEW-SHOT SOUND EVENT DETECTION
4667FG2Seq: Effectively Encoding Knowledge for End-to-End Task-Oriented Dialog
3446FILTERBANK DESIGN FOR END-TO-END SPEECH SEPARATION
5705FILTERING OUT TIME-FREQUENCY AREAS USING GABOR MULTIPLIERS
3040FINE-GRAINED ACTION RECOGNITION ON A NOVEL BASKETBALL DATASET
1849FINE-GRAINED GIANT PANDA IDENTIFICATION
2945FINITE SAMPLE DEVIATION AND VARIANCE BOUNDS FOR FIRST ORDER AUTOREGRESSIVE PROCESSES
2762FIR FILTER DESIGN AND IMPLEMENTATION FOR PHASE-BASED PROCESSING
5545FIR FILTERING OF DISCONTINUOUS SIGNALS: A RANDOM-STRATIFIED SAMPLING APPROACH
5411FIXED SMOOTH CONVOLUTIONAL LAYER FOR AVOIDING CHECKERBOARD ARTIFACTS IN CNNS
5647FIXED-POINT OPTIMIZATION OF TRANSFORMER NEURAL NETWORK
2428FLEXIBLY-TUNABLE BITCUBE-BASED PERCEPTUAL ENCRYPTION WITHIN JPEG COMPRESSION
3368FLOW-TTS: A NON-AUTOREGRESSIVE NETWORK FOR TEXT TO SPEECH BASED ON FLOW
2770FOCUS ON SEMANTIC CONSISTENCY FOR CROSS-DOMAIN CROWD UNDERSTANDING
2604FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS
5448FORECASTING MULTI-DIMENSIONAL PROCESSES OVER GRAPHS
2909FORECASTING SPARSE TRAFFIC CONGESTION PATTERNS USING MESSAGE-PASSING RNNS
4883FOREGROUND SIGNATURE EXTRACTION FOR AN INTIMATE MIXING MODEL IN HYPERSPECTRAL IMAGE CLASSIFICATION
6065FORENSIC SIMILARITY FOR DIGITAL IMAGES
2541FORMULATING DIVERGENCE FRAMEWORK FOR MULTICLASS MOTOR IMAGERY EEG BRAIN COMPUTER INTERFACE
5272FORWARD-BACKWARD SPLITTING FOR OPTIMAL TRANSPORT BASED PROBLEMS
1732Fourier Phase Retrieval with Arbitrary Reference Signal
1859FOURTH ORDER CUMULANT BASED ACTIVE DIRECTION OF ARRIVAL ESTIMATION USING COPRIME ARRAYS
5203Fractional Fourier Transform Based QRS Complex detection in ECG Signal
4733Frame-based overlapping speech detection using Convolutional Neural Networks
3571FRAME-LEVEL MMI AS A SEQUENCE DISCRIMINATIVE TRAINING CRITERION FOR LVCSR
4460FRAME-LEVEL PHONEME-INVARIANT SPEAKER EMBEDDING FORTEXT-INDEPENDENT SPEAKER RECOGNITION ON EXTREMELY SHORT UTTERANCES
1482FREQUENCY AND TEMPORAL CONVOLUTIONAL ATTENTION FOR TEXT-INDEPENDENT SPEAKER RECOGNITION
4968Frequency Diverse Array Radar: A Closed-form Solution to Design Weights for Desired Beampattern
3496Frequency-dependent Directional Feedback Delay Network
4377FROM SYMBOLS TO SIGNALS: SYMBOLIC VARIATIONAL AUTOENCODERS
4662FROM UNSUPERVISED MACHINE TRANSLATION TO ADVERSARIAL TEXT GENERATION
5044FROM VIDEO GAME TO REAL ROBOT: THE TRANSFER BETWEEN ACTION SPACES
1573FULL REFERENCE VIDEO QUALITY MEASURES IMPROVEMENT USING NEURAL NETWORKS
5201Full-Reference Speech Quality Estimation with Attentional Siamese Neural Networks
4013FULL-SUM DECODING FOR HYBRID HMM BASED SPEECH RECOGNITION USING LSTM LANGUAGE MODEL
1418FULLY CONVOLUTIONAL RECURRENT NETWORKS FOR SPEECH ENHANCEMENT
1338FULLY LEARNABLE FRONT-END FOR MULTI-CHANNEL ACOUSTIC MODELING USING SEMI-SUPERVISED LEARNING
3732Fully Pipelined Iteration Unrolled Decoders The Road to Tb/s Turbo Decoding
4804FULLY QUANTIZING A SIMPLIFIED TRANSFORMER FOR END-TO-END SPEECH RECOGNITION
4413Fully-hierarchical Fine-grained Prosody Modeling for Interpretable speech synthesis
3137FULLY-NEURAL APPROACH TO HEAVY VEHICLE DETECTION ON BRIDGES USING A SINGLE STRAIN SENSOR
3705Fusion approaches for emotion recognition from speech using acoustic and text-based features
5706FUSIONNDVI: A NOVEL FUSION METHOD FOR NDVI IN REMOTE SENSING
1766G2G: TTS-DRIVEN PRONUNCIATION LEARNING FOR GRAPHEMIC HYBRID ASR
2959GAIT PHASE SEGMENTATION USING WEIGHTED DYNAMIC TIME WARPING AND K-NEAREST NEIGHBORS GRAPH EMBEDDING
4891GATED ATTENTIVE CONVOLUTIONAL NETWORK DIALOGUE STATE TRACKER
4761GATED MECHANISM FOR ATTENTION BASED MULTIMODAL SENTIMENT ANALYSIS
3465Gated Multi-layer Convolutional Feature Extraction Network for Robust Pedestrian Detection
3882GAUSSIAN LPCNET FOR MULTISAMPLE SPEECH SYNTHESIS
2797GAUSSIAN PROCESS IMPUTATION OF MULTIPLE FINANCIAL SERIES
5936GAUSSIAN PROCESSES OVER GRAPHS
5595GCI DETECTION FROM RAW SPEECH USING A FULLY-CONVOLUTIONAL NETWORK
2869GENDER DIFFERENCES ON THE PERCEPTION AND PRODUCTION OF UTTERANCES WITH WILLINGNESS AND RELUCTANCE IN CHINESE
5472GENERALIZED COHERENCE-BASED SIGNAL ENHANCEMENT
5176GENERALIZED GRAPH SPECTRAL SAMPLING WITH STOCHASTIC PRIORS
1910Generalized Kernel-Based Dynamic Mode Decomposition
4099Generalized Linear Bandits with Safety Constraints
4032GENERALIZED SPATIAL MODULATION FOR WIRELESS TERABITS SYSTEMS UNDER SUB-THZ CHANNEL WITH RF IMPAIRMENTS
5388GENERATING AND PROTECTING AGAINST ADVERSARIAL ATTACKS FOR DEEP SPEECH-BASED EMOTION RECOGNITION MODELS
2089Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and autoregressive prosody prior
3301GENERATING EMPATHETIC RESPONSES BY LOOKING AHEAD THE USER’S SENTIMENT
2655GENERATING MULTILINGUAL VOICES USING SPEAKER SPACE TRANSLATION BASED ON BILINGUAL SPEAKER DATA
3900GENERATING SYNTHETIC AUDIO DATA FOR ATTENTION-BASED SPEECH RECOGNITION SYSTEMS
5923GENERATIVE ADVERSARIAL NETWORKS FOR GRAPH DATA IMPUTATION FROM SIGNED OBSERVATIONS
2253GENERATIVE PRE-TRAINING FOR SPEECH WITH AUTOREGRESSIVE PREDICTIVE CODING
6110Generative RNNs for OOV Keyword Search
1252GENETIC ALGORITHM OPTIMIZED SUPPORT VECTOR MACHINE IN NOMA-BASED SATELLITE NETWORKS WITH IMPERFECT CSI
2733GEOMETRICALLY CONSTRAINED INDEPENDENT VECTOR ANALYSIS FOR DIRECTIONAL SPEECH ENHANCEMENT
2736GEOMETRY CONSTRAINED PROGRESSIVE LEARNING FOR LSTM-BASED SPEECH ENHANCEMENT
2835GFCN: A NEW GRAPH CONVOLUTIONAL NETWORK BASED ON PARALLEL FLOWS
1554GFNET: A LIGHTWEIGHT GROUP FRAME NETWORK FOR EFFICIENT HUMAN ACTION RECOGNITION
3277GLOBAL AND LOCAL DISCRIMINATIVE PATCHES EXPLOITING FOR ACTION RECOGNITION
5418GLOBAL STRUCTURE GRAPH GUIDED FINE-GRAINED VEHICLE RECOGNITION
2572Global Traffic State Recovery via Local Observations with Generative Adversarial Networks
4641GPU-ACCELERATED VITERBI EXACT LATTICE DECODER FOR BATCHED ONLINE AND OFFLINE SPEECH RECOGNITION
4212GRADIENT DELAY ANALYSIS IN ASYNCHRONOUS DISTRIBUTED OPTIMIZATION
4132GRADIENT-BASED ALGORITHM WITH SPATIAL REGULARIZATION FOR OPTIMAL SENSOR PLACEMENT
5005GRAPH AUTO-ENCODER FOR GRAPH SIGNAL DENOISING
4391GRAPH CONSTRUCTION FROM DATA BY NON-NEGATIVE KERNEL REGRESSION
6004GRAPH CONVOLUTIONAL NEURAL NETWORKS TO CLASSIFY WHOLE SLIDE IMAGES
2060Graph Neural Net using Analytical Graph Filters and Topology Optimization for Image Denoising
2708Graph Regularized Tensor Train Decomposition
4939GRAPH VERTEX SAMPLING WITH ARBITRARY GRAPH SIGNAL HILBERT SPACES
5918GRAPHEM: EM ALGORITHM FOR BLIND KALMAN FILTERING UNDER GRAPHICAL SPARSITY CONSTRAINTS
3621GRAPHICAL EVOLUTIONARY GAME THEORETIC ANALYSIS OF SUPER USERS IN INFORMATION DIFFUSION
5704GRAPHTTS: GRAPH-TO-SEQUENCE MODELLING IN NEURAL TEXT-TO-SPEECH
2235GRAY-SCALE IMAGE COLORIZATION USING CYCLE-CONSISTENT GENERATIVE ADVERSARIAL NETWORKS WITH RESIDUAL STRUCTURE ENHANCER
2481GREEDY HYBRID RATE ADAPTATION IN DYNAMIC WIRELESS COMMUNICATION ENVIRONMENT
2251GREEDY SPARSE ARRAY DESIGN FOR OPTIMAL LOCALIZATION UNDER SPATIALLY PRIORITIZED SOURCE DISTRIBUTION
6106GRIFFIN–LIM LIKE PHASE RECOVERY VIA ALTERNATING DIRECTION METHOD OF MULTIPLIERS
3569GROUP-UTILITY METRIC FOR EFFICIENT SENSOR SELECTION AND REMOVAL IN LCMV BEAMFORMERS
2650Guided Learning for Weakly-labeled Semi-supervised Sound Event Detection
3990GYROSCOPE AIDED VIDEO STABILIZATION USING NONLINEAR REGRESSION ON SPECIAL ORTHOGONAL GROUP
3171Hand-3D-Studio: A New Multi-view System for 3D Hand Reconstruction
5477HARMONIC/PERCUSSIVE SOUND SEPARATION AND SPECTRAL COMPLEXITY REDUCTION OF MUSIC SIGNALS FOR COCHLEAR IMPLANT LISTENERS
4914HARMONICS BASED REPRESENTATION IN CLARINET TONE QUALITY EVALUATION
1668HDMFH: HYPERGRAPH BASED DISCRETE MATRIX FACTORIZATION HASHING FOR MULTIMODAL RETRIEVAL
5612Headless Horseman: Adversarial Attacks on Transfer Learning Models
1973HEARING AID RESEARCH DATA SET FOR ACOUSTIC ENVIRONMENT RECOGNITION
4875HEIGHT AND WEIGHT ESTIMATION FROM UNCONSTRAINED IMAGES
4714HETEROGENEOUS DOMAIN GENERALIZATION VIA DOMAIN MIXUP
4652HGFM : A HIERARCHICAL GRAINED AND FEATURE MODEL FOR ACOUSTIC EMOTION RECOGNITION
1409HIDDEN MARKOV MODELS FOR SEPSIS DETECTION IN PRETERM INFANTS
3534HIERARCHICAL ATTENTION TRANSFER NETWORKS FOR DEPRESSION ASSESSMENT FROM SPEECH
2266Hierarchical Caching via Deep Reinforcement Learning
4108HIERARCHICAL FEDERATED LEARNING ACROSS HETEROGENEOUS CELLULAR NETWORKS
5787Hierarchical Sequence Representation with Graph Network
4320HIGH DYNAMIC RANGE IMAGING USING DEEP IMAGE PRIORS
4318HIGH-ACCURACY AND LOW-LATENCY SPEECH RECOGNITION WITH TWO-HEAD CONTEXTUAL LAYER TRAJECTORY LSTM MODEL
1085HIGH-ACCURACY CLASSIFICATION OF ATTENTION DEFICIT HYPERACTIVITY DISORDER WITH L2,1-NORM LINEAR DISCRIMINANT ANALYSIS
5569HIGH-DIMENSIONAL NEURAL FEATURE USING RECTIFIED LINEAR UNIT AND RANDOM MATRIX INSTANCE
5839High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification
1851Hijacking Tracker: A Powerful Adversarial Attack on Visual Tracking
5259HI-MIA : A FAR-FIELD TEXT-DEPENDENT SPEAKER VERIFICATION DATABASE AND THE BASELINES
1709HKA: A HIERARCHICAL KNOWLEDGE ATTENTION MECHANISM FOR MULTI-TURN DIALOGUE SYSTEM
3811HOW CONFIDENT ARE YOU? EXPLORING THE ROLE OF FILLERS IN THE AUTOMATIC PREDICTION OF A SPEAKER’S CONFIDENCE
5260HOW MUCH SELF-ATTENTION DO WE NEED? TRADING ATTENTION FOR FEED-FORWARD LAYERS
4831HPRNN: A HIERARCHICAL SEQUENCE PREDICTION MODEL FOR LONG-TERM WEATHER RADAR ECHO EXTRAPOLATION
1122HUMANGAN: GENERATIVE ADVERSARIAL NETWORK WITH HUMAN-BASED DISCRIMINATOR AND ITS EVALUATION IN SPEECH PERCEPTION MODELING
3617Human-Machine Collaboration for Medical Image Segmentation
4088HUMBUG ZOONIVERSE: A CROWD-SOURCED ACOUSTIC MOSQUITO DATASET
3778H-VECTORS: UTTERANCE-LEVEL SPEAKER EMBEDDING USING A HIERARCHICAL ATTENTION MODEL
2768HYBRID ACTIVE CONTOUR DRIVEN BY DOUBLE-WEIGHTED SIGNED PRESSURE FORCE FOR IMAGE SEGMENTATION
5643HYBRID AUTOREGRESSIVE TRANSDUCER (HAT)
1487HYBRID DEEP-SEMANTIC MATRIX FACTORIZATION FOR TAG-AWARE PERSONALIZED RECOMMENDATION
5864HYBRID NEURAL-PARAMETRIC F0 MODEL FOR SINGING SYNTHESIS
4138HYBRID PRECODING FOR SECURE TRANSMISSION IN REFLECT-ARRAY-ASSISTED MASSIVE MIMO SYSTEMS
5364HydraNet: A real-time waveform separation network
2421IDENTIFICATION OF ESSENTIAL PROTEINS USING A NOVEL MULTI-OBJECTIVE OPTIMIZATION METHOD
4505IDENTIFYING TRUTHFUL LANGUAGE IN CHILD INTERVIEWS
2382IMAGE DE-RAINING VIA RDL: WHEN REWEIGHTED CONVOLUTIONAL SPARSE CODING MEETS DEEP LEARNING
5812IMAGE FUSION USING JOINT SPARSE REPRESENTATIONS AND COUPLED DICTIONARY LEARNING
4385IMAGE PROCESSING IN DNA
4248Image recovery from rotational and translational invariants
3683Image Restoration via Data-dependent Proximal Averaged Optimization
5112IMAGE SEGMENTATION BASED PRIVACY-PRESERVING HUMAN ACTION RECOGNITION FOR ANOMALY DETECTION
2212IMAGE SUPER-RESOLUTION USING RESIDUAL GLOBAL CONTEXT NETWORK
1172IMPACT OF A SHIFT-INVARIANT HARMONIC PHASE MODEL IN FULLY PARAMETRIC HARMONIC VOICE REPRESENTATION AND TIME/FREQUENCY SYNTHESIS
4270Improved End-to-End Spoken Utterance Classification with a Self-Attention Acoustic Classifier
3241IMPROVED LARGE-MARGIN SOFTMAX LOSS FOR SPEAKER DIARISATION
3697IMPROVED NEAREST NEIGHBOR DENSITY-BASED CLUSTERING TECHNIQUES WITH APPLICATION TO HYPERSPECTRAL IMAGES
1842IMPROVED PROBABILITY MODELLING FOR EXCEPTION HANDLING IN LOSSLESS SCREEN CONTENT CODING
5332IMPROVED REAL-TIME VISUAL TRACKING VIA ADVERSARIAL LEARNING
5769IMPROVED SPEAKER INDEPENDENT DYSARTHRIA INTELLIGIBILITY CLASSIFICATION USING DEEPSPEECH POSTERIORS
5708IMPROVING AUDITORY ATTENTION DECODING PERFORMANCE OF LINEAR AND NON-LINEAR METHODS USING STATE-SPACE MODEL
3519IMPROVING AUTOMATED SEGMENTATION OF RADIO SHOWS WITH AUDIO EMBEDDINGS
4525IMPROVING CONVERGENT CROSS MAPPING FOR CAUSAL DISCOVERY WITH GAUSSIAN PROCESSES
5846IMPROVING CROSS-DATASET PERFORMANCE OF FACE PRESENTATION ATTACK DETECTION SYSTEMS USING FACE RECOGNITION DATASETS
5082IMPROVING DEEP CNN NETWORKS WITH LONG TEMPORAL CONTEXT FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
2284IMPROVING DEEP LEARNING CLASSIFICATION OF JPEG2000 IMAGES OVER BANDLIMITED NETWORKS
3734Improving Device Directedness Classification of Utterances with Semantic Lexical Features
4538IMPROVING EFFICIENCY IN LARGE-SCALE DECENTRALIZED DISTRIBUTED TRAINING
2910IMPROVING END-TO-END SPEECH SYNTHESIS WITH LOCAL RECURRENT NEURAL NETWORK ENHANCED TRANSFORMER
1203IMPROVING FASHION ATTRIBUTE PREDICTION VIA GLOBAL SEMANTIC REASONING
3563IMPROVING LANGUAGE IDENTIFICATION FOR MULTILINGUAL SPEAKERS
5558Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network
2076IMPROVING MUSIC TRANSCRIPTION BY PRE-STACKING A U-NET
3567IMPROVING NOISE ROBUST AUTOMATIC SPEECH RECOGNITIONWITH SINGLE-CHANNEL TIME-DOMAIN ENHANCEMENT NETWORK
1742IMPROVING PROPER NOUN RECOGNITION IN END-TO-END ASR BY CUSTOMIZATION OF THE MWER LOSS CRITERION
3430IMPROVING PROSODY WITH LINGUISTIC AND BERT DERIVED FEATURES IN MULTI-SPEAKER BASED MANDARIN CHINESE NEURAL TTS
1243IMPROVING REVERBERANT SPEECH TRAINING USING DIFFUSE ACOUSTIC SIMULATION
4137IMPROVING ROBUSTNESS OF DEEP LEARNING BASED MONAURAL SPEECH ENHANCEMENT AGAINST PROCESSING ARTIFACTS
3699IMPROVING SAMPLE-EFFICIENCY IN REINFORCEMENT LEARNING FOR DIALOGUE SYSTEMS BY USING TRAINABLE-ACTION-MASK
4021Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation
4258IMPROVING SINGING VOICE SEPARATION WITH THE WAVE-U-NET USING MINIMUM HYPERSPHERICAL ENERGY
2735Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam
1874IMPROVING SPEAKER-ATTRIBUTE ESTIMATION BY VOTING BASED ON SPEAKER CLUSTER INFORMATION
1730IMPROVING SPEECH RECOGNITION USING CONSISTENT PREDICTIONS ON SYNTHESIZED SPEECH
5268Improving Spoken Question Answering using Contextualized Word Representation
3634IMPROVING THE CHRONOLOGICAL SORTING OF IMAGES THROUGH OCCLUSION: A STUDY ON THE NOTRE-DAME CATHEDRAL FIRE
3309IMPROVING THE PERFORMANCE OF TRANSFORMER BASED LOW RESOURCE SPEECH RECOGNITION FOR INDIAN LANGUAGES
4461IMPROVING THE SCALABILITY OF DEEP REINFORCEMENT LEARNING-BASED ROUTING WITH CONTROL ON PARTIAL NODES
2765IMPROVING UNIVERSAL SOUND SEPARATION USING SOUND CLASSIFICATION
2911IMPROVING VOICE SEPARATION BY INCORPORATING END-TO-END SPEECH RECOGNITION
1339IMPULSE RESPONSE DATA AUGMENTATION AND DEEP NEURAL NETWORKS FOR BLIND ROOM ACOUSTIC PARAMETER ESTIMATION
4035INCORPORATING WRITTEN DOMAIN NUMERIC GRAMMARS INTO END-TO-END CONTEXTUAL SPEECH RECOGNITION SYSTEMS FOR IMPROVED RECOGNITION OF NUMERIC SEQUENCES
3870INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION
3376INDEPENDENT LANGUAGE MODELING ARCHITECTURE FOR END-TO-END ASR
6076INDEPENDENT-VARIATION MATRIX FACTORIZATION WITH APPLICATION TO ENERGY DISAGGREGATION
4260INDIVIDUAL DISTANCE-DEPENDENT HRTFS MODELING THROUGH A FEW ANTHROPOMETRIC MEASUREMENTS
1364In-Domain and Out-of-Domain Data Augmentation to Improve Children's Speaker Verification System in Limited Data Scenario
3807INDOOR ALTITUDE ESTIMATION OF UNMANNED AERIAL VEHICLES USING A BANK OF KALMAN FILTERS
2667INDOOR HEADING DIRECTION ESTIMATION USING RF SIGNALS
5742IndyLSTMs: Independently Recurrent LSTMs
2271INFERRING DYNAMIC GROUP LEADERSHIP USING SEQUENTIAL BAYESIAN METHODS
5173INFORMATION FLOW OPTIMIZATION IN INFERENCE NETWORKS
5091INFORMATION MAXIMIZED VARIATIONAL DOMAIN ADVERSARIAL LEARNING FOR SPEAKER VERIFICATION
5667INFORMATION THEORETIC APPROACH FOR WAVEFORM DESIGN IN COEXISTING MIMO RADAR AND MIMO COMMUNICATIONS
2644In-network Caching For Hybrid Satellite-Terrestrial Networks Using Deep Reinforcement Learning
5405INSIGHTS INTO NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT
5342INSTANCE-BASED MODEL ADAPTATION FOR DIRECT SPEECH TRANSLATION
3324INSTANT ADAPTIVE LEARNING: AN ADAPTIVE FILTER BASED FAST LEARNING MODEL CONSTRUCTION FOR SENSOR SIGNAL TIME SERIES CLASSIFICATION ON EDGE DEVICES
2842INTEGRATING DISCRETE AND NEURAL FEATURES VIA MIXED-FEATURE TRANS-DIMENSIONAL RANDOM FIELD LANGUAGE MODELS
4229INTEGRATION OF MULTI-LOOK BEAMFORMERS FOR MULTI-CHANNEL KEYWORD SPOTTING
2320INTELLIGENT REFLECTING SURFACE FOR MASSIVE DEVICE CONNECTIVITY: JOINT ACTIVITY DETECTION AND CHANNEL ESTIMATION
4784INTELLIGENT STUDENT BEHAVIOR ANALYSIS SYSTEM FOR REAL CLASSROOMS
5439INTENSITY-IMAGE RECONSTRUCTION FOR EVENT CAMERAS USING CONVOLUTIONAL NEURAL NETWORK
1935Interpolation and Range Extrapolation of Sound Source Directivity Based on a Spherical Wave Propagation Model
5147INTERPRETABILITY-GUIDED CONVOLUTIONAL NEURAL NETWORKS FOR SEISMIC FAULT SEGMENTATION
2506Interpretable Machine Learning in Sustainable Edge Computing: A Case Study of Short-term Photovoltaic Power Output Prediction
4790INTERPRETABLE SELF-ATTENTION TEMPEROL REASONING FOR DRIVING BEHAVIOR UNDERSTANDING
3353INTERRUPTED AND CASCADED PERMUTATION INVARIANT TRAINING FOR SPEECH SEPARATION
4369INTRA FRAME RATE CONTROL FOR VERSATILE VIDEO CODING WITH QUADRATIC RATE-DISTORTION MODELLING
3779INVERSE MULTIPLE SCATTERING WITH PHASELESS MEASUREMENTS
5358INVERTIBLE DNN-BASED NONLINEAR TIME-FREQUENCY TRANSFORM FOR SPEECH ENHANCEMENT
5892INVESTIGATING GENERALIZATION IN NEURAL NETWORKS UNDER OPTIMALLY EVOLVED TRAINING PERTURBATIONS
5299INVESTIGATION OF METHODS TO IMPROVE THE RECOGNITION PERFORMANCE OF TAMIL-ENGLISH CODE-SWITCHED DATA IN TRANSFORMER FRAMEWORK
5884INVESTIGATION OF SPECAUGMENT FOR DEEP SPEAKER EMBEDDING LEARNING
5110IQ-STAN: IMAGE QUALITY GUIDED SPATIO-TEMPORAL ATTENTION NETWORK FOR LICENSE PLATE RECOGNITION
6089Irregular Array Manifold Aided Channel Estimation in Massive MIMO Communications
3766I-VECTOR TRANSFORMATION USING K-NEAREST NEIGHBORS FOR SPEAKER VERIFICATION
3942JHU-HLTCOE SYSTEM FOR THE VOXSRC SPEAKER RECOGNITION CHALLENGE
4792JOINT BEAMFORMING AND REVERBERATION CANCELLATION USING A CONSTRAINED KALMAN FILTER WITH MULTICHANNEL LINEAR PREDICTION
4376JOINT BLIND CALIBRATION AND TIME-DELAY ESTIMATION FOR MULTIBAND RANGING
3800JOINT CODING AND MODULATION IN THE ULTRA-SHORT BLOCKLENGTH REGIME FOR BERNOULLI-GAUSSIAN IMPULSIVE NOISE CHANNELS USING AUTOENCODERS
4559JOINT CONTEXTUAL MODELING FOR ASR CORRECTION and LANGUAGE UNDERSTANDING
3203Joint Enhancement and Denoising of Low Light Images Via JND Transform
5531Joint estimation of acoustic parameters from single-microphone speech observations
1758JOINT FREQUENCY DOMAIN CHANNEL ESTIMATION AND EQUALIZATION BASED ON EXPECTATION PROPAGATION FOR SINGLE CARRIER TRANSMISSIONS
5586Joint learning of assignment and representation for biometric group membership
3798JOINT LEARNING OF CARTESIAN UNDERSAMPLING AND RECONSTRUCTION FOR ACCELERATED MRI
5166JOINT MULTITARGET TRACKING AND DYNAMIC NETWORK LOCALIZATION IN THE UNDERWATER DOMAIN
4388Joint Optimization of Sampling Patterns and Deep Priors for Improved Parallel MRI
1938JOINT PHONEME ALIGNMENT AND TEXT-INFORMED SPEECH SEPARATION ON HIGHLY CORRUPTED SPEECH
2102JOINT PHONEME-GRAPHEME MODEL FOR END-TO-END SPEECH RECOGNITION
3809JOINT SCHEDULING AND BEAMFORMING FOR DELAY SENSITIVE TRAFFIC WITH PRIORITIES AND DEADLINES
2873JOINT SEMI-SUPERVISED FEATURE AUTO-WEIGHTING AND CLASSIFICATION MODEL FOR EEG-BASED CROSS-SUBJECT SLEEP QUALITY EVALUATION
1725Joint Software Defined Resource Allocation and Routing for Service Function Chaining with In-Subnetwork Processing
5349JOINT SOURCE-CHANNEL CODING AND BAYESIAN MESSAGE PASSING DETECTION FOR GRANT-FREE RADIO ACCESS IN IOT
3543Joint Sparse Recovery using Deep Unfolding With Application to Massive Random Access
1365JOINT TRAINING OF DEEP NEURAL NETWORKS FOR MULTI-CHANNEL DEREVERBERATION AND SPEECH SOURCE SEPARATION
1552JOINTLY OPTIMAL DEREVERBERATION AND BEAMFORMING
2989JPEG STEGANOGRAPHY WITH SIDE INFORMATION FROM THE PROCESSING PIPELINE
2855JUST NOTICEABLE DISTORTION BASED PERCEPTUALLY LOSSLESS INTRA CODING
3713KALM: KEY AREA LOCALIZATION MECHANISM FOR ABNORMALITY DETECTION IN MUSCULOSKELETAL RADIOGRAPHS
2351K-Autoencoders deep clustering
4935KERNEL COMPUTATIONS FROM LARGE-SCALE RANDOM FEATURES OBTAINED BY OPTICAL PROCESSING UNITS
1289KERNEL RIDGE REGRESSION WITH AUTOCORRELATION PRIOR: OPTIMAL MODEL AND CROSS-VALIDATION
5717KEY ACTION AND JOINT CTC-ATTENTION BASED SIGN LANGUAGE RECOGNITION
5420KEYWORD SEARCH FOR SIGN LANGUAGE
4110KNOWLEDGE DISTILLATION AND RANDOM ERASING DATA AUGMENTATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
2180KNOWLEDGE ENHANCED LATENT RELEVANCE MINING FOR QUESTION ANSWERING
4703KOREAN SINGING VOICE SYNTHESIS BASED ON AUTO-REGRESSIVE BOUNDARY EQUILIBRIUM GAN
4741K-SPACE TRAJECTORY DESIGN FOR REDUCED MRI SCAN TIME
5398L1-NORM HIGHER-ORDER ORTHOGONAL ITERATIONS FOR ROBUST TENSOR ANALYSIS
5338LABEL PROPAGATION ADAPTIVE RESONANCE THEORY FOR SEMI-SUPERVISED CONTINUOUS LEARNING
4454Label Reuse for Efficient Semi-supervised Learning
4716LAI-NET: LOCAL-ANCESTRY INFERENCE WITH NEURAL NETWORKS
2793LANCE: EFFICIENT LOW-PRECISION QUANTIZED WINOGRAD CONVOLUTION FOR NEURAL NETWORKS BASED ON GRAPHICS PROCESSING UNITS
4678LANGUAGE INDEPENDENT GENDER IDENTIFICATION FROM RAW WAVEFORM USING MULTI-SCALE CONVOLUTIONAL NEURAL NETWORKS
2086LANGUAGE-AGNOSTIC MULTILINGUAL MODELING
3601LAPLACE STATE SPACE FILTER WITH EXACT INFERENCE AND MOMENT MATCHING
4764LARGE DIMENSIONAL ASYMPTOTICS OF MULTI-TASK LEARNING
3057LARGE-CONTEXT POINTER-GENERATOR NETWORKS FOR SPOKEN-TO-WRITTEN STYLE CONVERSION
2980LARGE-SCALE FADING PRECODING FOR MAXIMIZING THE PRODUCT OF SINRS
1772LARGE-SCALE TIME SERIES CLUSTERING WITH k-ARs
2459Large-Scale Unsupervised Pre-training for End-to-End Spoken Language Understanding
1744LARGE-SCALE WEAKLY-SUPERVISED CONTENT EMBEDDINGS FOR MUSIC RECOMMENDATION AND TAGGING
2730LATENCY-MINIMIZED DESIGN OF SECURE TRANSMISSIONS IN UAV-AIDED COMMUNICATIONS
4083LATENT ATRIAL FIBRILLATION RISK PREDICTION FROM ELECTROCARDIOGRAM AND DEMOGRAPHIC DATA WITH CONVOLUTIONAL NEURAL NETWORK
4462LATENT FUSED LASSO
4165LATTICE-BASED IMPROVEMENTS FOR VOICE TRIGGERING USING GRAPH NEURAL NETWORKS
3544LAYER-NORMALIZED LSTM FOR HYBRID-HMM AND END-TO-END ASR
4114LEARN-BY-CALIBRATING: USING CALIBRATION AS A TRAINING OBJECTIVE
4443Learned Lossless Image Compression with a HyperPrior and Discretized Gaussian Mixture Likelihoods
4572LEARNING A COMMON GRANGER CAUSALITY NETWORK USING A NON-CONVEX REGULARIZATION
1335LEARNING A GENERIC ADAPTIVE WAVELET SHRINKAGE FUNCTION FOR DENOISING
4627LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK
4964LEARNING A SUBWORD INVENTORY JOINTLY WITH END-TO-END AUTOMATIC SPEECH RECOGNTION
5640LEARNING ASR-ROBUST CONTEXTUALIZED EMBEDDINGS FOR SPOKEN LANGUAGE UNDERSTANDING
5928LEARNING BASED RECONFIGURABLE SUB-NYQUIST SAMPLING FRAMEWORK FOR ULTRA-WIDEBAND ANGULAR SENSING
2519Learning Blind Denoising Network for Noisy Image Deblurring
4305LEARNING CONNECTIVITY AND HIGHER-ORDER INTERACTIONS IN RADIAL DISTRIBUTION GRIDS
5361LEARNING DATA REPRESENTATION AND EMOTION ASSESSMENT FROM PHYSIOLOGICAL DATA
4168LEARNING DIFFERENTIABLE SPARSE AND LOW RANK NETWORKS FOR AUDIO-VISUAL OBJECT LOCALIZATION
1608Learning diverse sub-policies via a task-agnostic regularization on action distributions.
3817LEARNING DOMAIN INVARIANT REPRESENTATIONS FOR CHILD-ADULT CLASSIFICATION FROM SPEECH
3884LEARNING EATING ENVIRONMENTS THROUGH SCENE CLUSTERING
3649LEARNING ENDMEMBER DYNAMICS IN MULTITEMPORAL HYPERSPECTRAL DATA USING A STATE-SPACE MODEL FORMULATION
5872LEARNING FRACTIONAL ORTHOGONAL LATENT CONSISTENT FEATURES FOR FACE HALLUCINATION AND RECOGNITION
4638LEARNING FROM DANCES: POSE-INVARIANT RE-IDENTIFICATION FOR MULTI-PERSON TRACKING
1133LEARNING GEOMETRIC FEATURES WITH DUAL-STREAM CNN FOR 3D ACTION RECOGNITION
3106LEARNING GRAPH INFLUENCE FROM SOCIAL INTERACTIONS
4979LEARNING LOCAL STRUCTURE OF REPRESENTATIVE POINTS FOR POINT CLOUD CLASSIFICATION AND SEMANTIC SEGMENTATION
4449LEARNING MULTI-SCALE ATTENTIVE FEATURES FOR SERIES PHOTO SELECTION
5162LEARNING NETWORK REPRESENTATION THROUGH REINFORCEMENT LEARNING
5426LEARNING NOISE INVARIANT FEATURES THROUGH TRANSFER LEARNING FOR ROBUST END-TO-END SPEECH RECOGNITION
4308LEARNING PARTIAL DIFFERENTIAL EQUATIONS FROM DATA USING NEURAL NETWORKS
1924Learning Perception and Planning with Deep Active Inference
3725LEARNING PLUG-AND-PLAY PROXIMAL QUASI-NEWTON DENOISERS
4735Learning Product Graphs from Multidomain Signals
4366LEARNING RECURRENT NEURAL NETWORK LANGUAGE MODELS WITH CONTEXT-SENSITIVE LABEL SMOOTHING FOR AUTOMATIC SPEECH RECOGNITION
5866LEARNING SAMPLING AND MODEL-BASED SIGNAL RECOVERY FOR COMPRESSED SENSING MRI
3111LEARNING SEMI-SUPERVISED ANONYMIZED REPRESENTATIONS BY MUTUAL INFORMATION
3418LEARNING SIGNED GRAPHS FROM DATA
3524LEARNING SPATIO-TEMPORAL CONVOLUTIONAL NETWORK FOR REAL-TIME OBJECT TRACKING
3658Learning Spatio-Temporal Representations with Temporal Squeeze Pooling
3006LEARNING SPECTRAL-SPATIAL PRIOR VIA 3DDNCNN FOR HYPERSPECTRAL IMAGE DECONVOLUTION
1673LEARNING TASK-BASED ANALOG-TO-DIGITAL CONVERSION FOR MIMO RECEIVERS
2318LEARNING THE HELIX TOPOLOGY OF MUSICAL PITCH
5775Learning the Spatio-Temporal Dynamics of Physical Processes from Partial Observations
2346LEARNING TO CHARACTERIZE ADVERSARIAL SUBSPACES
3769LEARNING TO DETECT KEYWORD PARTS AND WHOLE BY SMOOTHED MAX POOLING
3208Learning to Estimate Driver Drowsiness from Car Acceleration Sensors using Weakly Labeled Data
3146learning to fool the speaker recognition
4920LEARNING TO GENERATE DIVERSE QUESTIONS FROM KEYWORDS
3432LEARNING TO RANK MUSIC TRACKS USING TRIPLET LOSS
2847LEARNING TO SEPARATE SOUNDS FROM WEAKLY LABELED SCENES
2713LEARNING WITH OUT-OF-DISTRIBUTION DATA FOR AUDIO CLASSIFICATION
5366LEARNING-AIDED CONTENT PLACEMENT IN CACHING-ENABLED FOG COMPUTING SYSTEMS USING THOMPSON SAMPLING
5190LEARNING-BASED CONTENT CACHING AND USER CLUSTERING: A DEEP DETERMINISTIC POLICY GRADIENT APPROACH
5771LEAST-SQUARES DOA ESTIMATION WITH AN INFORMED PHASE UNWRAPPING AND FULL BANDWIDTH ROBUSTNESS
2010LEt-SNE: A HYBRID APPROACH TO DATA EMBEDDING AND VISUALIZATION OF HYPERSPECTRAL IMAGERY
2501LEVENBERG-MARQUARDT AND LINE-SEARCH EXTENDED KALMAN SMOOTHERS
4739LEVERAGING CUBOIDS FOR BETTER MOTION MODELING IN HIGH EFFICIENCY VIDEO CODING
1763LEVERAGING GANS TO IMPROVE CONTINUOUS PATH KEYBOARD INPUT MODELS
5344LEVERAGING ORDINAL REGRESSION WITH SOFT LABELS FOR 3D HEAD POSE ESTIMATION FROM POINT SETS
3844LEVERAGING UNPAIRED TEXT DATA FOR TRAINING END-TO-END SPEECH-TO-INTENT SYSTEMS
4422LIBRI-ADAPT: A NEW SPEECH DATASET FOR UNSUPERVISED DOMAIN ADAPTATION
1414Libri-Light: A (Large) Dataset for ASR with Limited or No Supervision
1519LIE GROUP STATE ESTIMATION VIA OPTIMAL TRANSPORT
3636LIFTER TRAINING AND SUB-BAND MODELING FOR COMPUTATIONALLY EFFICIENT AND HIGH-QUALITY VOICE CONVERSION USING SPECTRAL DIFFERENTIALS
3716LIGHTDET: A LIGHTWEIGHT AND ACCURATE OBJECT DETECTION NETWORK
3786LIGHT-FIELD RECONSTRUCTION AND DEPTH ESTIMATION FROM FOCAL STACK IMAGES USING CONVOLUTIONAL NEURAL NETWORKS
1739LIGHTWEIGHT AND EFFICIENT END-TO-END SPEECH RECOGNITION USING LOW-RANK TRANSFORMER
1328LIGHTWEIGHT HARDWARE IMPLEMENTATION OF VVC TRANSFORM BLOCK FOR ASIC DECODER
1356Lightweight V-Net for Liver segmentation
5865LIMITATIONS OF WEAK LABELS FOR EMBEDDING AND TAGGING
2675LINE SPECTRAL ESTIMATION WITH PALYNDROMIC KERNELS
1213LINEAR MODEL-BASED INTRA PREDICTION IN VVC TEST MODEL
3462LINEAR SPEEDUP IN SADDLE-POINT ESCAPE FOR DECENTRALIZED NON-CONVEX OPTIMIZATION
5198LINEAR THOMPSON SAMPLING UNDER UNKNOWN LINEAR CONSTRAINTS
5805Lipreading using Temporal Convolutional Networks
2978Load Management with Predictions of Solar Energy Production for Cloud Data Centers
1699LOCAL KEY ESTIMATION IN CLASSICAL MUSIC RECORDINGS: A CROSS-VERSION STUDY ON SCHUBERT’S WINTERREISE
4343LOCAL-GLOBAL FEATURE FOR VIDEO-BASED ONE-SHOT PERSON RE-IDENTIFICATION
6073Localized Linear Regression in Networked Data
3860Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis
5430Look globally, age locally: Face aging with An Attention Mechanism
4172LOOKAHEAD CONVERGES TO STATIONARY POINTS OF SMOOTH NON-CONVEX FUNCTIONS
3233LOOKING ENHANCES LISTENING: RECOVERING MISSING SPEECH USING IMAGES
1360LOW COMPLEXITY NLMS FOR MULTIPLE LOUDSPEAKER ACOUSTIC ECHO CANCELLER USING RELATIVE LOUDSPEAKER TRANSFER FUNCTIONS
2136LOW COMPLEXITY SINGLE IMAGE SUPER-RESOLUTION WITH CHANNEL SPLITTING AND FUSION NETWORK
1866LOW MUTUAL AND AVERAGE COHERENCE DICTIONARY LEARNING USING CONVEX APPROXIMATION
4111Low Rank Activations for Tensor-based Convolutional Sparse Coding
6111LOW RESOURCE KEYWORD SEARCH WITH SYNTHESIZED CROSSLINGUAL EXEMPLARS
3199Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers
1279Low-complexity 5G SLAM with CKF-PHD Filter
5895LOW-COMPLEXITY ACCURATE MMWAVE POSITIONING FOR SINGLE-ANTENNA USERS BASED ON ANGLE-OF-DEPARTURE AND ADAPTIVE BEAMFORMING
1191LOW-COMPLEXITY AND RELIABLE TRANSFORMS FOR PHYSICAL UNCLONABLE FUNCTIONS
5710LOW-COMPLEXITY COMPRESSED ALIGNMENT-AIDED COMPRESSIVE ANALYSIS FOR REAL-TIME ELECTROCARDIOGRAPHY TELEMONITORING
4419LOW-COMPLEXITY FIXED-POINT CONVOLUTIONAL NEURAL NETWORKS FOR AUTOMATIC TARGET RECOGNITION
4852Low-Complexity Levenberg-Marquardt Algorithm for Tensor Canonical Polyadic Decomposition
2993LOW-COMPLEXITY LSTM-ASSISTED BIT-FLIPPING ALGORITHM FOR SUCCESSIVE CANCELLATION LIST POLAR DECODER
4352LOW-FREQUENCY COMPENSATED SYNTHETIC IMPULSE RESPONSES FOR IMPROVED FAR-FIELD SPEECH RECOGNITION
5738LOW-LATENCY LIGHTWEIGHT STREAMING SPEECH RECOGNITION WITH 8-BIT QUANTIZED SIMPLE GATED CONVOLUTIONAL NEURAL NETWORKS
4448LOW-LATENCY SINGLE CHANNEL SPEECH ENHANCEMENT USING U-NET CONVOLUTIONAL NEURAL NETWORKS
2609LOW-RANK APPROXIMATION OF MATRICES VIA A RANK-REVEALING FACTORIZATION WITH RANDOMIZATION
2341LOW-RANK GRADIENT APPROXIMATION FOR MEMORY-EFFICIENT ON-DEVICE TRAINING OF DEEP NEURAL NETWORK
2830Low-rank mmWave MIMO channel estimation in one-bit receivers
1651LOW-RANK TENSOR RING MODEL FOR COMPLETING MISSING VISUAL DATA
4554LOW-RANK TOEPLITZ MATRIX ESTIMATION VIA RANDOM ULTRA-SPARSE RULERS
2803LOW-TUBAL-RANK TENSOR RECOVERY FROM ONE-BIT MEASUREMENTS
2496LQAID: Localized Quality Aware Image Denoising using Deep Convolutional Neural Networks
4077LSTM-BASED ONE-PASS DECODER FOR LOW-LATENCY STREAMING
5379LUPULUS: A FLEXIBLE HARDWARE ACCELERATOR FOR NEURAL NETWORKS
3123L-Vector: Neural Label Embedding for Domain Adaptation
5006MAHALANOBIS DISTANCE BASED ADVERSARIAL NETWORK FOR ANOMALY DETECTION
1538MANet: Multi-scale aggregated network for light field depth estimation
3829Mango: A Python Library for Parallel Hyperparameter Tuning
2240MANIFOLD GRADIENT DESCENT SOLVES MULTI-CHANNEL SPARSE BLIND DECONVOLUTION PROVABLY AND EFFICIENTLY
5488MANY-TO-MANY VOICE CONVERSION USING CONDITIONAL CYCLE-CONSISTENT ADVERSARIAL NETWORKS
4916MASK-DEPENDENT PHASE ESTIMATION FOR MONAURAL SPEAKER SEPARATION
5151MASKING AND INPAINTING: A TWO-STAGE SPEECH ENHANCEMENT APPROACH FOR LOW SNR AND NON-STATIONARY NOISE
4477MATCHING PURSUIT BASED DYNAMIC PHASE-AMPLITUDE COUPLING MEASURE
5339MAXIMALLY ENERGY-CONCENTRATED DIFFERENTIAL WINDOW FOR PHASE-AWARE SIGNAL PROCESSING USING INSTANTANEOUS FREQUENCY
6129Maximum Likelihood Estimation of a Low-Rank Probability Mass Tensor From Partial Observations
1647MAXIMUM LIKELIHOOD ESTIMATION OF THE INTERFERENCE-PLUS-NOISE CROSS POWER SPECTRAL DENSITY MATRIX FOR OWN VOICE RETRIEVAL
3911MAXIMUM LIKELIHOOD MULTI-SPEAKER DIRECTION OF ARRIVAL ESTIMATION UTILIZING A WEIGHTED HISTOGRAM
4085MAXPOLYNOMIAL DIVISION WITH APPLICATION TO NEURAL NETWORK SIMPLIFICATION
6128M-Channel Graph Filter Banks: Polyphase Analysis and Structures
1499MDR-SURV: a Multi-scale Deep Learning-based Radiomics for SURVival Prediction in Pulmonary Malignancies
2036MEDIA CLASSIFICATION WITH BAYESIAN OPTIMIZATION AND VAPNIK-CHERVONENKIS (VC) BOUNDS
3682MELLOTRON: MULTISPEAKER EXPRESSIVE VOICE SYNTHESIS BY CONDITIONING ON RHYTHM, PITCH AND GLOBAL STYLE TOKENS
4069Mental Fatigue Prediction from Multi-Channel ECoG Signal
3145MESSAGE TRANSMISSION THROUGH UNDERSPREAD TIME-VARYING LINEAR CHANNELS
5013M-estimators of scatter with eigenvalue shrinkage
2575META LEARNING FOR END-TO-END LOW-RESOURCE SPEECH RECOGNITION
3407META METRIC LEARNING FOR HIGHLY IMBALANCED AERIAL SCENE CLASSIFICATION
5491Meta-learning Extractors for Music Source Separation
3791META-LEARNING FOR ROBUST CHILD-ADULT CLASSIFICATION FROM SPEECH
5755META-LEARNING TO COMMUNICATE: FAST END-TO-END TRAINING FOR FADING CHANNELS
1692METRIC LEARNING WITH BACKGROUND NOISE CLASS FOR FEW-SHOT DETECTION OF RARE SOUND EVENTS
4031Metric Representations of Networks: A Uniqueness Result
5221Minimal Adversarial Perturbation in Mobile Health Applications: The Epileptic Brain Activity Case Study
2447Minimum latency training strategies for streaming sequence-to-sequence ASR
1771MINING EFFECTIVE NEGATIVE TRAINING SAMPLES FOR KEYWORD SPOTTING
2816MIRRORED ARRAYS FOR DIRECTION-OF-ARRIVAL ESTIMATION
1308MISSPECIFIED CRAMER-RAO BOUND FOR DELAY ESTIMATION WITH A MISMATCHED WAVEFORM: A CASE STUDY
3869MIXTURE FACTORIZED AUTO-ENCODER FOR UNSUPERVISED HIERARCHICAL DEEP FACTORIZATION OF SPEECH SIGNAL
4009MIXUP MULTI-ATTENTION MULTI-TASKING MODEL FOR EARLY STAGE LEUKEMIA IDENTIFICATION
1589MIXUP-BREAKDOWN: A CONSISTENCY TRAINING METHOD FOR IMPROVING GENERALIZATION OF SPEECH SEPARATION MODELS
4945ML AND EM ESTIMATION OF SAMPLING INTERVALS OF SENSOR DEVICES
4446MMSE-BASED CHANNEL ESTIMATION FOR HYBRID BEAMFORMING MASSIVE MIMO WITH CORRELATED CHANNELS
2127Mobility-aware Beam Steering in Metasurface-based Programmable Wireless Environments
1894MOCKINGJAY: UNSUPERVISED SPEECH REPRESENTATION LEARNING WITH DEEP BIDIRECTIONAL TRANSFORMER ENCODERS
6154MODAL DECOMPOSITION OF FEEDBACK DELAY NETWORKS
3294MODEL ORDER SELECTION IN DOA SCENARIOS VIA CROSS-ENTROPY BASED MACHINE LEARNING TECHNIQUES
4949Modeling Behavior as Mutual Dependency Between Physiological Signals and Indoor Location In Large-Scale Wearable Sensor Study
4238Modeling Behavioral Consistency In Large-Scale Wearable Recordings of Human Bio-behavioral Signals
5041MODELING PIECE-WISE STATIONARY TIME SERIES
2006MODELING PLATE AND SPRING REVERBERATION USING A DSP-INFORMED DEEP NEURAL NETWORK
4002MODELING THE ENVIRONMENT IN DEEP REINFORCEMENT LEARNING: THE CASE OF ENERGY HARVESTING BASE STATIONS
4206Modeling Uncertainty in Predicting Emotional Attributes from Spontaneous Speech
1384MODELLING SEA CLUTTER IN SAR IMAGES USING LAPLACE-RICIAN DISTRIBUTION
1094MoGA: Searching Beyond MobileNetV3
3538MONAURAL SPEECH ENHANCEMENT USING INTRA-SPECTRAL RECURRENT LAYERS IN THE MAGNITUDE AND PHASE RESPONSES
4996MOTION DYNAMICS IMPROVE SPEAKER-INDEPENDENT LIPREADING
3228MOTION FEEDBACK DESIGN FOR VIDEO FRAME INTERPOLATION
3980MSPEC-NET : MULTI-DOMAIN SPEECH CONVERSION NETWORK
3416MSPNET: MULTI-SUPERVISED PARALLEL NETWORK FOR CROWD COUNTING
4026MT-GCN FOR MULTI-LABEL AUDIO TAGGING WITH NOISY LABELS
3648MULTI IMAGE DEPTH FROM DEFOCUS NETWORK WITH BOUNDARY CUE FOR DUAL APERTURE CAMERA
1976MULTI-AGENT DEEP REINFORCEMENT LEARNING FOR DISTRIBUTED HANDOVER MANAGEMENT IN DENSE MMWAVE NETWORKS
3744Multi-Branch Learning for Weakly-Labeled Sound Event Detection
4608Multichannel Active Noise Control with Spatial Derivative Constraints to Enlarge the quiet zone
5721MULTICHANNEL SIGNAL CLASSIFICATION USING VECTOR AUTOREGRESSION
5507MULTICHANNEL SIGNAL PROCESSING FOR ROAD SURFACE IDENTIFICATION
1284MULTI-CHANNEL SPEECH SOURCE SEPARATION AND DEREVERBERATION WITH SEQUENTIAL INTEGRATION OF DETERMINED AND UNDERDETERMINED MODELS
5701MULTI-CONDITIONING AND DATA AUGMENTATION USING GENERATIVE NOISE MODEL FOR SPEECH EMOTION RECOGNITION IN NOISY CONDITIONS
5002MULTI-CONSTRAINT SPECTRAL CO-DESIGN FOR COLOCATED MIMO RADAR AND MIMO COMMUNICATIONS
5696MULTI-DEPTH COMPUTATIONAL PERISCOPY WITH AN ORDINARY CAMERA
3008MULTIGRAPH SPECTRAL CLUSTERING FOR JOINT CONTENT DELIVERY AND SCHEDULING IN BEAM-FREE SATELLITE COMMUNICATIONS
5142MULTI-HEAD ATTENTION FOR SPEECH EMOTION RECOGNITION WITH AUXILIARY LEARNING OF GENDER RECOGNITION
2373MULTI-LABEL CONSISTENT CONVOLUTIONAL TRANSFORM LEARNING: APPLICATION TO NON-INTRUSIVE LOAD MONITORING
3739MULTI-LABEL SOUND EVENT RETRIEVAL USING A DEEP LEARNING-BASED SIAMESE STRUCTURE WITH A PAIRWISE PRESENCE MATRIX
5942Multi-layer Content Interaction through Quaternion Product for Visual Question Answering
3043Multi-level deep neural network adaptation for speaker verification using MMD and consistency regularization
5452MULTILINEAR GENERALIZED SINGULAR VALUE DECOMPOSITION (ML-GSVD) WITH APPLICATION TO COORDINATED BEAMFORMING IN MULTI-USER MIMO SYSTEMS
1023MULTILINGUAL ACOUSTIC WORD EMBEDDING MODELS FOR PROCESSING ZERO-RESOURCE LANGUAGES
2063MULTILINGUAL GRAPHEME-TO-PHONEME CONVERSION WITH BYTE REPRESENTATION
4412MULTI-MICROPHONE COMPLEX SPECTRAL MAPPING FOR SPEECH DEREVERBERATION
5035MULTIMODAL ACTIVE SPEAKER DETECTION AND VIRTUAL CINEMATOGRAPHY FOR VIDEO CONFERENCING
4327MULTIMODAL LEARNING FOR CLASSROOM ACTIVITY DETECTION
5468MULTI-MODAL SELF-SUPERVISED PRE-TRAINING FOR JOINT OPTIC DISC AND CUP SEGMENTATION IN EYE FUNDUS IMAGES
4640MULTIMODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING D-VECTORS WITH SPATIAL FEATURES
3276MULTIMODAL TRANSFORMER FUSION FOR CONTINUOUS EMOTION RECOGNITION
3864Multimodal Violence Detection in Videos
5463MULTI-MOTIFGAN (MMGAN): MOTIF-TARGETED GRAPH GENERATION AND PREDICTION
5236MULTI-PATCH AGGREGATION MODELS FOR RESAMPLING DETECTION
5659MULTIPLE POINTS INPUT FOR CONVOLUTIONAL NEURAL NETWORKS IN REPLAY ATTACK DETECTION
4368Multi-polarization information fusion for object contour display in passive millimeter-wave and terahertz security imaging
1825MULTI-RESOLUTION MULTI-HEAD ATTENTION IN DEEP SPEAKER EMBEDDING
4265MULTI-RESOLUTION OVERLAPPING STRIPES NETWORK FOR PERSON RE-IDENTIFICATION
1898MULTI-SCALE DEEP FEATURE FUSION FOR VEHICLE RE-IDENTIFICATION
1375MULTI-SCALE FEATURE AGGREGATION NETWORK WITH WAVELET STRUCTURE SIMILARITY LOSS FUNCTION FOR SINGLE IMAGE DEHAZING
4990MULTI-SCALE OCTAVE CONVOLUTIONS FOR ROBUST SPEECH RECOGNITION
3297MULTI-SCALE RESIDUAL NETWORK FOR IMAGE CLASSIFICATION
5083MULTI-SPEAKER AND MULTI-DOMAIN EMOTIONAL VOICE CONVERSION USING FACTORIZED HIERARCHICAL VARIATIONAL AUTOENCODER
1100Multispectral Fusion of RGB and NIR Images Using Weighted Least Squares and Alternating Guidance
3993MULTI-STAGE RESIDUAL HIDING FOR IMAGE-INTO-AUDIO STEGANOGRAPHY
4576MULTISTATE ENCODING WITH END-TO-END SPEECH RNN TRANSDUCER NETWORK
2042MULTI-STEP ONLINE UNSUPERVISED DOMAIN ADAPTATION
4874MULTITAPER SPECTRAL GRANGER CAUSALITY WITH APPLICATION TO SSVEP
2681MULTI-TASK CENTER-OF-PRESSURE METRICS ESTIMATION FROM SKELETON USING GRAPH CONVOLUTIONAL NETWORK
1682MULTITASK LEARNING AND MULTISTAGE FUSION FOR DIMENSIONAL AUDIOVISUAL EMOTION RECOGNITION
2831MULTITASK LEARNING FOR DARPA LORELEI’S SITUATION FRAME EXTRACTION TASK
3039Multi-task Learning for Speaker Verification and Voice Trigger Detection
3064Multi-task Learning for Voice Trigger Detection
1406MULTI-TASK LEARNING IN AUTONOMOUS DRIVING SCENARIOS VIA ADAPTIVE FEATURE REFINEMENT NETWORKS
3776Multi-Task Learning via SA-FPN and EJ-Head
1865MULTITASK LEARNING WITH CAPSULE NETWORKS FOR SPEECH-TO-INTENT APPLICATIONS
2611Multi-task self-supervised learning for robust speech recognition
3999MULTI-TIME-SCALE CONVOLUTION FOR EMOTION RECOGNITION FROM SPEECH AUDIO SIGNALS
5725MULTIUSER MASSIVE MIMO DOWNLINK PRECODING USING SECOND-ORDER SPATIAL SIGMA-DELTA MODULATION
5552MULTIVARIATE TROPICAL REGRESSION AND PIECEWISE-LINEAR SURFACE FITTING
1271MULTI-VIEW BAYESIAN GENERATIVE MODEL FOR MULTI-SUBJECT FMRI DATA ON BRAIN DECODING OF VIEWED IMAGE CATEGORIES
5409Multi-View Clustering via Mixed Embedding Approximation
4131MULTI-VIEW SHAPE ESTIMATION OF TRANSPARENT CONTAINERS
1602Multi-view Wasserstein discriminant analysis with entropic regularized Wasserstein distance
2123MULTI-WAY MULTI-VIEW DEEP AUTOENCODER FOR IMAGE FEATURE LEARNING WITH MULTI-LEVEL GRAPH REGULARIZATION
1798MUTUAL-INFORMATION-BASED SENSOR PLACEMENT FOR SPATIAL SOUND FIELD RECORDING
3926NASIL : NEURAL ARCHITECTURE SEARCH WITH IMITATION LEARNING
3091Near capacity RCQD constellations for PAPR reduction of OFDM systems
3332NEAREST KRONECKER PRODUCT DECOMPOSITION BASED NORMALIZED LEAST MEAN SQUARE ALGORITHM
6153NEAR-FIELD ACOUSTIC SOURCE LOCALIZATION USING SPHERICAL HARMONIC FEATURES
4278Near-optimal Bayes Error Based Feature Selection
1389NEAR-OPTIMAL INTERFERENCE EXPLOITATION 1-BIT MASSIVE MIMO PRECODING VIA PARTIAL BRANCH-AND-BOUND
1942NEURAL ATTENTIVE MULTIVIEW MACHINES
1689NEURAL CODING STRATEGIES FOR EVENT-BASED VISION DATA
2333NEURAL LATTICE SEARCH FOR SPEECH RECOGNITION
4836NEURAL NETWORK TRAINING WITH APPROXIMATE LOGARITHMIC COMPUTATIONS
3488Neural Network Wiretap Code Design for Multi-Mode Fiber Optical Channels
4079NEURAL ORACLE SEARCH ON N-BEST HYPOTHESES
2689NEURAL PERCUSSIVE SYNTHESIS PARAMETERISED BY HIGH-LEVEL TIMBRAL FEATURES
1671NEURAL TIME WARPING FOR MULTIPLE SEQUENCE ALIGNMENT
4683NEUTRAL TO LOMBARD SPEECH CONVERSION WITH DEEP LEARNING
2306NEW METRICS FOR EVALUATING THE ACCURACY OF FUNDAMENTAL FREQUENCY ESTIMATION APPROACHES IN MUSICAL SIGNALS
2742NODE-ASYNCHRONOUS SPECTRAL CLUSTERING ON DIRECTED GRAPHS
6144NOISE STATISTICS OBLIVIOUS GARD FOR ROBUST REGRESSION WITH SPARSE OUTLIERS
2823NOISE-ROBUST KEY-PHRASE DETECTORS FOR AUTOMATED CLASSROOM FEEDBACK
5850Non Local Multi-Fiber Network for Action Anticipation in Videos
3558NONCOHERENT MAXIMUM-LIKELIHOOD DETECTION FOR AMBIENT BACKSCATTERING COMMUNICATIONS OVER AMBIENT OFDM SIGNALS
2837NON-EXPERTS OR EXPERTS? STATISTICAL ANALYSES OF MOS USING DSIS METHOD
2330Non-Gaussian BLE-based Indoor Localization via Gaussian Sum Filtering Coupled with Wasserstein Distance
2840NON-GRIFFIN–LIM TYPE SIGNAL RECOVERY FROM MAGNITUDE SPECTROGRAM
6121NON-ITERATIVE SUBSPACE-BASED DOA ESTIMATION IN THE PRESENCE OF NONUNIFORM NOISE
5615NONLINEAR SPATIAL FILTERING FOR MULTICHANNEL SPEECH ENHANCEMENT IN INHOMOGENEOUS NOISE FIELDS
2442Non-local Nested Residual Attention Network for Stereo Image Super-resolution
3359Non-parametric Community Change-points Detection in Streaming Graph Signals
1639NON-UNIFORM VIDEO TIME-LAPSE METHOD BASED ON MOTION SCENARIO AND STABILIZATION CONSTRAINT
4498NORMALIZED LEAST-MEAN-SQUARE ALGORITHMS WITH MINIMAX CONCAVE PENALTY
5128OBJECT DETECTION AND 3D ESTIMATION VIA AN FMCW RADAR USING A FULLY CONVOLUTIONAL NETWORK
1410OBJECT DETECTION WITH COLOR AND DEPTH IMAGES WITH MULTI-REDUCED REGION PROPOSAL NETWORK AND MULTI-POOLING
3745OBJECT SURFACE ESTIMATION FROM RADAR IMAGES
1721OBJECTIVE BAYESIAN DETECTION UNDER SPATIALLY CORRELATED GAUSSIAN OBSERVATIONS FOR MULTI-ANTENNA COGNITIVE RADIO NETWORK
5654OH, JEEZ! OR UH-HUH? A LISTENER-AWARE BACKCHANNEL PREDICTOR ON ASR TRANSCRIPTIONS
1480On Binary Sequence Set Design with Applications to Automotive Radar
2938ON CRAMÉR-RAO LOWER BOUNDS WITH RANDOM EQUALITY CONSTRAINTS
3665ON DESIGN OF OPTIMAL SMART METER PRIVACY CONTROL STRATEGY AGAINST ADVERSARIAL MAP DETECTION
3736ON DISTRIBUTED STOCHASTIC GRADIENT ALGORITHMS FOR GLOBAL OPTIMIZATION
4965On Distributed Stochastic Gradient Descent for Nonconvex Functions in the Presence of Byzantines
2485ON DIVERGENCE APPROXIMATIONS FOR UNSUPERVISED TRAINING OF DEEP DENOISERS BASED ON STEIN’S UNBIASED RISK ESTIMATOR
5184ON END-TO-END MULTI-CHANNEL TIME DOMAIN SPEECH SEPARATION IN REVERBERANT ENVIRONMENTS
2517ON EXPONENTIALLY CONSISTENCY OF LINKAGE-BASED HIERARCHICAL CLUSTERING ALGORITHM USING KOLMOGROV-SMIRNOV DISTANCE
1672ON HARMONIC APPROXIMATIONS OF INHARMONIC SIGNALS
4226ON MEASURING DOPPLER SHIFTS BETWEEN TAGS IN A BACKSCATTERING TAG-TO-TAG NETWORK WITH APPLICATIONS IN TRACKING
1205ON MODELING ASR WORD CONFIDENCE
2236On Network Science and Mutual Information for Explaining Deep Neural Networks
4464ON POLAR CODING FOR FINITE BLOCKLENGTH SECRET KEY GENERATION OVER WIRELESS CHANNELS
3479ON REGULARIZATION PARAMETER FOR L0-SPARSE COVARIANCE FITTING BASED DOA ESTIMATION
5855ON ROBUST VARIANCE FILTERING AND CHANGE OF VARIANCE DETECTION
3355ON THE BYZANTINE ROBUSTNESS OF CLUSTERED FEDERATED LEARNING
6066On the choice of graph neural network architectures
2345ON THE DEGREES OF FREEDOM IN TOTAL VARIATION MINIMIZATION
1174On the Determination of Window Length in the Short-Time Fourier Transform with Rényi Entropy
3781On the effect of BRDFs on Phasor Field NLOS imaging
3311ON THE FREQUENCY DOMAIN DETECTION OF HIGH DIMENSIONAL TIME SERIES
5501On The Impact of Language Familiarity In Talker Change Detection
1281ON THE IMPORTANCE OF VOCAL TRACT CONSTRICTION FOR SPEAKER CHARACTERIZATION: THE WHISPERED SPEECH STUDY
3033ON THE LIMIT DISTRIBUTION OF THE CANONICAL CORRELATION COEFFICIENTS BETWEEN THE PAST AND THE FUTURE OF A HIGH-DIMENSIONAL WHITE NOISE
2048ON THE OPPORTUNISTIC USE OF COMMERCIAL KU AND KA BAND SATCOM NETWORKS FOR RAIN RATE ESTIMATION: POTENTIALS AND CRITICAL ISSUES
5625ON THE STABILITY OF POLYNOMIAL SPECTRAL GRAPH FILTERS
1497On Throughput of Millimeter Wave MIMO Systems with Low Resolution ADCs
4262One-bit Compressed Sensing using Generative Models
5713One-bit DoA estimation via Sparse Linear Arrays
2583ONE-BIT NORMALIZED SCATTER MATRIX ESTIMATION FOR COMPLEX ELLIPTICALLY SYMMETRIC DISTRIBUTIONS
4323ONE-BIT SAMPLING IN FRACTIONAL FOURIER DOMAIN
4855One-shot Parametric Audio Production Style Transfer With Application to Frequency Equalization
3723ONE-SHOT VOICE CONVERSION BY VECTOR QUANTIZATION
5581ONE-SHOT VOICE CONVERSION USING STAR-GAN
1729ONLINE CHANNEL ESTIMATION FOR HYBRID BEAMFORMING ARCHITECTURES
2259Online Community Detection by Spectral CUSUM
4862ONLINE GRAPH TOPOLOGY INFERENCE WITH KERNELS FOR BRAIN CONNECTIVITY ESTIMATION
2838ONLINE META-LEARNING ON NON-CONVEX SETTING
5396ONLINE POSITRON EMISSION TOMOGRAPHY BY ONLINE PORTFOLIO SELECTION
3905ONLINE TENSOR COMPLETION AND FREE SUBMODULE TRACKING WITH THE T-SVD
1751ON-THE-FLY FEATURE SELECTION AND CLASSIFICATION WITH APPLICATION TO CIVIC ENGAGEMENT PLATFORMS
2276OOV RECOVERY WITH EFFICIENT 2ND PASS DECODING AND OPEN-VOCABULARY WORD-LEVEL RNNLM RESCORING FOR HYBRID ASR
3919OPEN SET VIDEO CAMERA MODEL VERIFICATION
1911OpenDenoising: an Extensible Benchmark for Building Comparative Studies of Image Denoisers
3574OPPORTUNISTIC USE OF GNSS SIGNALS TO CHARACTERIZE THE ENVIRONMENT BY MEANS OF MACHINE LEARNING BASED PROCESSING
3663OPTIMAL DESIGN OF ENERGY-EFFICIENT CELL-FREE MASSIVE MIMO: JOINT POWER ALLOCATION AND LOAD BALANCING
3957Optimal Joint Channel Estimation and Data Detection by L1-norm PCA for Streetscape IoT
2610OPTIMAL LAPLACIAN REGULARIZATION FOR SPARSE SPECTRAL COMMUNITY DETECTION
6150Optimal Leak Factor Selection for the Output-Constrained Leaky Filtered-Input Least Mean Square Algorithm
3989OPTIMAL POWER FLOW USING GRAPH NEURAL NETWORKS
3530OPTIMAL TRANSPORT BASED CHANGE POINT DETECTION AND TIME SERIES SEGMENT CLUSTERING
3314Optimal transport structure of cycleGAN for unsupervised learning for inverse problems
5373OPTIMAL WINDOW DESIGN FOR JOINT SPATIAL-SPECTRAL DOMAIN FILTERING OF SIGNALS ON THE SPHERE
5376OPTIMAL WINDOW DESIGN FOR W-OFDM
2009OPTIMIZED SENSOR SELECTION FOR JOINT RADAR-COMMUNICATION SYSTEMS
1965OPTIMIZED SINGLE CARRIER TRANSCEIVER FOR FUTURE SUB-TERAHERTZ APPLICATIONS
4694OPTIMIZING BACKSCATTERING COEFFICIENT DESIGN FOR MINIMIZING BER AT MONOSTATIC MIMO READER
2895OPTIMIZING BAYESIAN HMM BASED X-VECTOR CLUSTERING FOR THE SECOND DIHARD SPEECH DIARIZATION CHALLENGE
2031OPTIMUM KERNEL PARTICLE FILTER FOR ASYMMETRIC LAPLACE NOISE
3188ORDINAL LEARNING FOR EMOTION RECOGNITION IN CUSTOMER SERVICE CALLS
5901ORTHOGONAL TRAINING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
2027OVERCOMING HIGH NANOPORE BASECALLER ERROR RATES FOR DNA STORAGE VIA BASECALLER-DECODER INTEGRATION AND CONVOLUTIONAL CODES
1642OVERDETERMINED INDEPENDENT VECTOR ANALYSIS
4424OVERLAP LOCAL-SGD: AN ALGORITHMIC APPROACH TO HIDE COMMUNICATION DELAYS IN DISTRIBUTED SGD
4104OVERLAP-AWARE DIARIZATION: RESEGMENTATION USING NEURAL END-TO-END OVERLAPPED SPEECH DETECTION
3231Overlapped State Hidden Semi-Markov Model for Grouped Multiple Sequences
1731PACO AND PACO-DCT: PATCH CONSENSUS AND ITS APPLICATION TO INPAINTING
4510PAGAN: A PHASE-ADAPTED GENERATIVE ADVERSARIAL NETWORKS FOR SPEECH ENHANCEMENT
2402PAN: PHONEME-AWARE NETWORK FOR MONAURAL SPEECH ENHANCEMENT
5461PARALLEL WAVEGAN: A FAST WAVEFORM GENERATION MODEL BASED ON GENERATIVE ADVERSARIAL NETWORKS WITH MULTI-RESOLUTION SPECTROGRAM
5442PARALLELIZING ADAM OPTIMIZER WITH BLOCKWISE MODEL-UPDATE FILTERING
3285PARAMETER ESTIMATION OF IN-CITY FRONTAL RAINFALL PROPAGATION
4929PARSING MAP GUIDED MULTI-SCALE ATTENTION NETWORK FOR FACE HALLUCINATION
2545Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification
3793PARTICLE FILTER WITH REJECTION CONTROL AND UNBIASED ESTIMATOR OF THE MARGINAL LIKELIHOOD
1598PARTICLE FILTERING ON THE COMPLEX STIEFEL MANIFOLD WITH APPLICATION TO SUBSPACE TRACKING
4023PARTICLE GROUP METROPOLIS METHODS FOR TRACKING THE LEAF AREA INDEX
4635PASSIVE INTELLIGENT SURFACE ASSISTED MIMO POWERED SUSTAINABLE IOT
6084PASSIVE JOINT LOCALIZATION AND SYNCHRONIZATION OF DISTRIBUTED MICROPHONE ARRAYS
3441Patch-Level Selection and Breadth-First Prediction Strategy for Reversible Data Hiding
3113Pathloss Prediction using Deep Learning with Applications to Cellular Optimization and Efficient D2D Link Scheduling
2337Peer to Peer offloading with delayed feedback: An adversary bandit approach
2759Perception-Distortion Trade-Off with Restricted Boltzmann Machines
3362PERCEPTUAL LOSS FUNCTION FOR NEURAL MODELLING OF AUDIO SYSTEMS
6087PERFORMANCE ANALYSIS AND CONSTELLATION OPTIMIZATION OF STAR-QAM-AIDED DIFFERENTIAL FASTER-THAN-NYQUIST SIGNALING
3438PERFORMANCE ANALYSIS FOR PATH ATTENUATION ESTIMATION OF MICROWAVE SIGNALS DUE TO RAINFALL AND BEYOND
3823PERFORMANCE BOUNDS FOR DISPLACED SENSOR AUTOMOTIVE RADAR IMAGING
5965Performance Comparison of Lossless Compression Strategies for Dynamic Vision Sensor Data
1949PERFORMANCE STUDY OF A CONVOLUTIONAL TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME SPEECH DENOISING
6092PERMUTATIONS UNLABELED BEYOND SAMPLING UNKNOWN
4287Person Identification using Deep Convolutional Neural Networks on Short-Term Signals from Wearable Sensors
4537PEVD-BASED SPEECH ENHANCEMENT IN REVERBERANT ENVIRONMENTS
1979PHASE RECONSTRUCTION BASED ON RECURRENT PHASE UNWRAPPING WITH DEEP NEURAL NETWORKS
2511PHONEME BOUNDARY DETECTION USING LEARNABLE SEGMENTAL FEATURES
4064PHONETIC FEEDBACK FOR SPEECH ENHANCEMENT WITH AND WITHOUT PARALLEL SPEECH DATA
4116Phylogenetic Minimum Spanning Tree Reconstruction Using Autoencoders
2929PITCH ESTIMATION VIA SELF-SUPERVISION
2963PITCHNET: UNSUPERVISED SINGING VOICE CONVERSION WITH PITCH ADVERSARIAL NETWORK
3155PIXEL-LEVEL SELF-PACED LEARNING FOR SUPER-RESOLUTION
2425PIXEL-WISE LINEAR/NONLINEAR NONNEGATIVE MATRIX FACTORIZATION FOR UNMIXING OF HYPERSPECTRAL DATA
3670PLAYING TECHNIQUE RECOGNITION BY JOINT TIME–FREQUENCY SCATTERING
3168POLARIZATION PARAMETERS ESTIMATION WITH SCALAR SENSOR ARRAYS
4846POLARIZING FRONT ENDS FOR ROBUST CNNS
5946POLYPHONIC SOUND EVENT DETECTION USING TRANSPOSED CONVOLUTIONAL RECURRENT NEURAL NETWORK
5222PORTFOLIO CUTS: A GRAPH-THEORETIC FRAMEWORK TO DIVERSIFICATION
2121Pose Refinement: bridging the gap between Unsupervised Learning and Geometric Methods for Visual Odometry
1472POSITION CONSTRAINT LOSS FOR FASHION LANDMARK ESTIMATION
2776POSITIVE SEMIDEFINITE MATRIX FACTORIZATION: A LINK TO PHASE RETRIEVAL AND A BLOCK GRADIENT ALGORITHM
3042POSITIVE SOLUTIONS FOR LARGE RANDOM LINEAR SYSTEMS
2723POWER OPTIMIZATION USING EMBEDDED AUTOMATIC GAIN CONTROL ALGORITHM WITH PHOTOPLETHYSMOGRAPHY SIGNAL QUALITY CLASSIFICATION
2208POWER SPECTRUM OPTIMIZATION FOR CAPACITY OF THE EXTENDED SPECTRUM HYBRID FIBER COAX NETWORK
6071Precise Performance Analysis of the Box-Elastic Net Under Matrix Uncertainties
5157Preconditioned Ghost Imaging via Sparsity Constraint
4237Preconditioning ADMM for Fast Decentralized Optimization
2389PREDICTING PERFORMANCE OUTCOME WITH A CONVERSATIONAL GRAPH CONVOLUTIONAL NETWORK FOR SMALL GROUP INTERACTIONS
2336Predicting word error rate for reverberant speech
2077PREDICTION OF INDIVIDUAL PROGRESSION RATE IN PARKINSON’S DISEASE USING CLINICAL MEASURES AND BIOMECHANICAL MEASURES OF GAIT AND POSTURAL STABILITY
5464PREDICTION OF VESSEL TRAJECTORIES FROM AIS DATA VIA SEQUENCE-TO-SEQUENCE RECURRENT NEURAL NETWORKS
3059PREDICTION OF VOICING AND THE F0 CONTOUR FROM ELECTROMAGNETIC ARTICULOGRAPHY DATA FOR ARTICULATION-TO-SPEECH SYNTHESIS
2352Preference-aware Mask for Session-based Recommendation with Bidirectional Transformer
1808PRESERVATION OF ANOMALOUS SUBGROUPS ON VARIATIONAL AUTOENCODER TRANSFORMED DATA
5249PRE-TRAINING FOR QUERY REWRITING A SPOKEN LANGUAGE UNDERSTANDING SYSTEM
3238Primal-Dual Stochastic Subgradient Method for Log-determinant Optimization
1553Primary path estimator based on individual secondary path for ANC headphones
5482PRINCIPAL ANGLE DETECTOR FOR SUBSPACE SIGNAL WITH STRUCTURED UNKNOWN INTERFERENCE
2205PRINCIPLE-INSPIRED MULTI-SCALE AGGREGATION NETWORK FOR EXTREMELY LOW-LIGHT IMAGE ENHANCEMENT
2128Privacy aware acoustic scene synthesis using deep spectral feature inversion
2041PRIVACY-AWARE QUICKEST CHANGE DETECTION
4152PRIVACY-PRESERVING IMAGE SHARING VIA SPARSIFYING LAYERS ON CONVOLUTIONAL GROUPS
5304PRIVACY-PRESERVING PATTERN RECOGNITION USING ENCRYPTED SPARSE REPRESENTATIONS IN L0 NORM MINIMIZATION
3923PRIVACY-PRESERVING PHISHING WEB PAGE CLASSIFICATION VIA FULLY HOMOMORPHIC ENCRYPTION
2808PRIVATE FL-GAN: DIFFERENTIAL PRIVACY SYNTHETIC DATA GENERATION BASED ON FEDERATED LEARNING
3480PROBABILISTIC FILTER AND SMOOTHER FOR VARIATIONAL INFERENCE OF BAYESIAN LINEAR DYNAMICAL SYSTEMS
2479PROCESSING CONVOLUTIONAL NEURAL NETWORKS ON CACHE
3217Programmable Dataflow Accelerators: A 5G OFDM Modulation/Demodulation Case Study
3078PROGRESSIVE MULTI-TARGET NETWORK BASED SPEECH ENHANCEMENT WITH SNR-PRESELECTION FOR ROBUST SPEAKER DIARIZATION
5933PROJECTED WEIGHT REGULARIZATION TO IMPROVE NEURAL NETWORK GENERALIZATION
4689Projection Free Dynamic Online Learning
3318Propeller Noise Detection with Deep Learning
4685PROTOTYPICAL NETWORKS FOR SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION
4847PROXIMAL DISTANCE ALGORITHM FOR NONCONVEX QCQP WITH BEAMFORMING APPLICATIONS
2613PROXIMAL MULTITASK LEARNING OVER DISTRIBUTED NETWORKS WITH JOINTLY SPARSE STRUCTURE
2976Pseudo Labeling and Negative Feedback Learning for Large-scale Multi-label Domain Classification
5290PSEUDO LIKELIHOOD CORRECTION TECHNIQUE FOR LOW RESOURCE ACCENTED ASR
1575PYANNOTE.AUDIO: NEURAL BUILDING BLOCKS FOR SPEAKER DIARIZATION
4501Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning
4769Q-LEARNING BASED PREDICTIVE RELAY SELECTION FOR OPTIMAL RELAY BEAMFORMING
4430QOS-AWARE FLOW CONTROL FOR POWER-EFFICIENT DATA CENTER NETWORKS WITH DEEP REINFORCEMENT LEARNING
5905QUALITY-OF-SERVICE PREDICTION FOR PHYSICAL-LAYER SECURITY VIA SECRECY MAPS
2645QUANTIZED TENSOR ROBUST PRINCIPAL COMPONENT ANALYSIS
4407Quantum State Discrimination with Local Operations and Classical Communications
2249QUARTZNET: DEEP AUTOMATIC SPEECH RECOGNITION WITH 1D TIME-CHANNEL SEPARABLE CONVOLUTIONS.
3666QUICKEST CHANGE DETECTION IN ANONYMOUS HETEROGENEOUS SENSOR NETOWKRS
4003Quickest Detection of Growing Dynamic Anomalies in Networks
5009RATE ASSIGNMENT IN 360-DEGREE VIDEO TILED STREAMING USING RANDOM FOREST REGRESSION
6155Rate-Constrained Noise Reduction in Wireless Acoustic Sensor Networks
4431RATE-INVARIANT AUTOENCODING OF TIME-SERIES
5054RAW WAVEFORM BASED END-TO-END DEEP CONVOLUTIONAL NETWORK FOR SPATIAL LOCALIZATION OF MULTIPLE ACOUSTIC SOURCES
1049RAY SEPARATION AND SOURCE DEPTH ESTIMATION BASED ON SOUND PRESSURE FIELD TRANSFORMATION
4809RDE-MOGA: AUTOMATIC SELECTION OF RATE-DISTORTION-ENERGY CONTROL POINTS FOR VIDEO ENCODERS USING MUTI-OBJETIVE GENETIC ALGORITHM
3577REALIZABILITY OF PLANAR POINT EMBEDDINGS FROM ANGLE MEASUREMENTS
5123REAL-TIME BINAURAL SPEECH SEPARATION THAT PRESERVES SPATIAL CUES
3440REAL-TIME HAND GESTURE RECOGNITION USING TEMPORAL MUSCLE ACTIVATION MAPS OF MULTI-CHANNEL SEMG SIGNALS
5551REAL-TIME IMPLEMENTATION ASPECTS OF LARGE INTELLIGENT SURFACES
3637REAL-TIME SPEECH ENHANCEMENT USING EQUILIBRIATED RNN
2290Real-Time Task Offloading for Large-Scale Mobile Edge Computing
4447REAL-TIME, UNIVERSAL, AND ROBUST ADVERSARIAL ATTACKS AGAINST SPEAKER RECOGNITION SYSTEMS
4122RECEIVER DESIGN AND AGC OPTIMIZATION WITH SELF INTERFERENCE INDUCED SATURATION
2697RECEPTIVE FIELD PYRAMID NETWORK FOR OBJECT DETECTION
3763RECONSTRUCTION OF FRI SIGNALS USING DEEP NEURAL NETWORK APPROACHES
6083Recovery of Binary Sparse Signals From Compressed Linear Measurements via Polynomial Optimization
2001RECURRENT NEURAL AUDIOVISUAL WORD EMBEDDINGS FOR SYNCHRONIZED SPEECH AND REAL-TIME MRI
5185RECURSIVE PREDICTION OF GRAPH SIGNALS WITH INCOMING NODES
4665REDUCED-COMPLEXITY SINGULAR VALUE DECOMPOSITION FOR TUCKER DECOMPOSITION: ALGORITHM AND HARDWARE
2299REDUNDANT CONVOLUTIONAL NETWORK WITH ATTENTION MECHANISM FOR MONAURAL SPEECH ENHANCEMENT
1316REFLECTANCE-GUIDED, CONTRAST-ACCUMULATED HISTOGRAM EQUALIZATION
5008REGRESSION BEFORE CLASSIFICATION FOR TEMPORAL ACTION DETECTION
2975REGULARIZED BEAMFORMER FOR THE SPHERICAL MICROPHONE ARRAY TO COPE WITH THE WHITE NOISE AMPLIFICATION
3565REGULARIZED FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION WITH ILRMA-BASED PRIOR DISTRIBUTION OF JOINT-DIAGONALIZATION PROCESS
1990Regularized partial phase synchrony index applied to dynamical functional connectivity estimation
3283REINFORCED DEPTH-AWARE DEEP LEARNING FOR SINGLE IMAGE DEHAZING
6131Relative Acoustic Transfer Function Estimation in Wireless Acoustic Sensor Networks
3631RELATIVE COST BASED MODEL SELECTION FOR SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION MODELS
2641RELIABLE AND SECURE TRANSMISSION FOR FUTURE NETWORKS
3828Residual Attention Network for Wavelet Domain Super-Resolution
4564RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT
2728Resilient Distributed Recovery of Large Fields
2846Resilient to Byzantine Attacks Finite-Sum Optimization over Networks
5053RESOURCE MANAGEMENT IN THE MULTIBEAM NOMA-BASED SATELLITE DOWNLINK
4444RESTING-STATE EEG-BASED BIOMETRICS WITH SIGNALS FEATURES EXTRACTED BY MULTIVARIATE EMPIRICAL MODE DECOMPOSITION
4974RETHINKING RETINAL LANDMARK LOCALIZATION AS POSE ESTIMATION: NAIVE SINGLE STACKED NETWORK FOR OPTIC DISK AND FOVEA DETECTION
2392Rethinking Temporal-related Sample for Human Action Recognition
3597Retinal Vessel Segmentation via A Semantics and Multi-Scale Aggregation Network
5838RE-TRANSLATION STRATEGIES FOR LONG FORM, SIMULTANEOUS, SPOKEN LANGUAGE TRANSLATION
4531RETRIEVING VOCAL-TRACT RESONANCE AND ANTI-RESONANCE FROM HIGH-PITCHED VOWELS USING A RAHMONIC SUBTRACTION TECHNIQUE
2799REV-AE: A LEARNED FRAME SET FOR IMAGE RECONSTRUCTION
1981REVEALING BACKDOORS, POST-TRAINING, IN DNN CLASSIFIERS VIA NOVEL INFERENCE ON OPTIMIZED PERTURBATIONS INDUCING GROUP MISCLASSIFICATION
4204REVEALING HIDDEN DRAWINGS IN LEONARDO'S 'THE VIRGIN OF THE ROCKS' FROM MACRO X-RAY FLUORESCENCE SCANNING DATA THROUGH ELEMENT LINE LOCALISATION
3120REVERSAL NO LONGER MATTERS: ATTENTION-BASED ARRHYTHMIA DETECTION WITH LEAD-REVERSAL ECG DATA
3910REVISIT OF ESTIMATE SEQUENCE FOR ACCELERATED GRADIENT METHOD
5528REVISITING FAST SPECTRAL CLUSTERING WITH ANCHOR GRAPH
1869RGB-D BASED MULTI-MODAL DEEP LEARNING FOR FACE IDENTIFICATION
4042RIEMANNIAN FRAMEWORK FOR ROBUST COVARIANCE MATRIX ESTIMATION IN SPIKED MODELS
4046RIEMANNIAN GEOMETRY AND CRAMÉR-RAO BOUND FOR BLIND SEPARATION OF GAUSSIAN SOURCES
1336RISK CONVERGENCE OF CENTERED KERNEL RIDGE REGRESSION WITH LARGE DIMENSIONAL DATA
2703RNN-TRANSDUCER WITH STATELESS PREDICTION NETWORK
2670ROBUST AND COMPUTATIONALLY-EFFICIENT ANOMALY DETECTION USING POWERS-OF-TWO NETWORKS
1047ROBUST AND STEERABLE KRONECKER PRODUCT DIFFERENTIAL BEAMFORMING WITH RECTANGULAR MICROPHONE ARRAYS
5357ROBUST CFAR RADAR DETECTION USING A K-NEAREST NEIGHBORS RULE
3920ROBUST COVARIANCE MATRIX ESTIMATION AND PORTFOLIO ALLOCATION: THE CASE OF NON-HOMOGENEOUS ASSETS
2446Robust Frequency-Domain Recursive Least M-Estimate Adaptive Filter for Acoustic System Identification
1633ROBUST FULL-FOV DEPTH ESTIMATION IN TELE-WIDE CAMERA SYSTEM
3166ROBUST FUNDAMENTAL FREQUENCY ESTIMATION IN COLOURED NOISE
4952ROBUST GLOBAL OPTIMIZED AFFINE REGISTRATION METHOD FOR MICROSCOPIC IMAGES OF BIOLOGICAL TISSUE
2246Robust Hybrid Beamforming for Satellite-Terrestrial Integrated Networks
3500Robust Hybrid Precoding for Interference Exploitation in Massive MIMO Systems
6126Robust Joint Estimation of Multimicrophone Signal Model Parameters
4695ROBUST LIKELIHOOD RATIO TEST USING ALPHA-DIVERGENCE
3028ROBUST LOW RATE SPEECH CODING BASED ON CLONED NETWORKS AND WAVENET
1762ROBUST MARINE BUOY PLACEMENT FOR SHIP DETECTION USING DROPOUT K-MEANS
2958ROBUST MATRIX COMPLETION VIA LP-GREEDY PURSUITS
1218ROBUST MULTI-CHANNEL SPEECH RECOGNITION USING FREQUENCY ALIGNED NETWORK
3639ROBUST MUSIC ESTIMATION UNDER ARRAY RESPONSE UNCERTAINTY
1117ROBUST ONLINE MATRIX COMPLETION WITH GAUSSIAN MIXTURE MODEL
1902Robust Online Mirror Saddle-Point Method for Constrained Resource Allocation
4452ROBUST PARAMETER ESTIMATION OF CONTAMINATED DAMPED EXPONENTIALS
3547ROBUST PHASE RETRIEVAL WITH OUTLIERS
3342Robust Pricing Mechanism for Resource Sustainability under Privacy Constraint in Competitive Online Learning Multi-Agent Systems
4882ROBUST RANK CONSTRAINED SPARSE LEARNING: AN GRAPH-BASED METHOD FOR CLUSTERING
5301ROBUST SPEAKER RECOGNITION USING UNSUPERVISED ADVERSARIAL INVARIANCE
1778ROBUST SYMBOL-LEVEL PRECODING VIA AUTOENCODER-BASED DEEP LEARNING
2999ROBUST TDOA INDOOR TRACKING USING CONSTRAINED MEASUREMENT FILTERING AND GRID-BASED FILTERING
4007ROBUST TRANSMISSION OVER CHANNELS WITH CHANNEL UNCERTAINTY: AN ALGORITHMIC PERSPECTIVE
1683Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders
4737ROBUST VISUAL TRACKING WITH CONTEXT-BASED ACTIVE OCCLUSION RECOGNITION
3135ROBUSTNESS ASSESSMENT OF AUTOMATIC REINKE’S EDEMA DIAGNOSIS SYSTEMS
2725ROBUSTNESS OF SBL IN CORRELATED ENVIRONMENTS
1562ROIMIX: PROPOSAL-FUSION AMONG MULTIPLE IMAGES FOR UNDERWATER OBJECT DETECTION
3214SALIENCY-BASED IMAGE CONTRAST ENHANCEMENT WITH REVERSIBLE DATA HIDING
5727SALIENT OBJECT DETECTION BASED ON IMAGE BIT-MAP
4102SAMPLING CLASSES OF NON-BANDLIMITED SIGNALS USING INTEGRATE-AND-FIRE DEVICES: AVERAGE CASE ANALYSIS
5529SAMPLING OF SURFACES AND LEARNING FUNCTIONS IN HIGH DIMENSIONS
3458SAMPLING STRATEGIES FOR GAN SYNTHETIC DATA
3271SCALABLE DETECTION AND TRACKING OF EXTENDED OBJECTS
2573SCALABLE KERNEL LEARNING VIA THE DISCRIMINANT INFORMATION
1821SCALABLE LEARNING-BASED SAMPLING OPTIMIZATION FOR COMPRESSIVE DYNAMIC MRI
2112SCALABLE MULTILINGUAL FRONTEND FOR TTS
5540SCALPNET: DETECTION OF SPATIOTEMPORAL ABNORMAL INTERVALS IN EPILEPTIC EEG USING CONVOLUTIONAL NEURAL NETWORKS
1718Scene Text Recognition with Temporal Convolutional Encoder
4341SCENE-DEPENDENT ACOUSTIC EVENT DETECTION WITH SCENE CONDITIONING AND FAKE-SCENE-CONDITIONED LOSS
1033S-DOD-CNN: Doubly Injecting Spatially-Preserved Object Information for Event Recognition
4231SDTCN: SIMILARITY DRIVEN TRANSMISSION COMPUTING NETWORK FOR IMAGE DEHAZING
4330SECL-UMons Database for sound event classification and localization
5986SECOST: SEQUENTIAL CO-SUPERVISION FOR LARGE SCALE WEAKLY LABELED AUDIO EVENT DETECTION
1039SECURE FACE RECOGNITION IN EDGE AND CLOUD NETWORKS: FROM THE ENSEMBLE LEARNING PERSPECTIVE
3372SECURE IDENTIFICATION FOR GAUSSIAN CHANNELS
1434Secure Symbol-Level MISO Precoding
1380SED-MDD: TOWARDS SENTENCE DEPENDENT END-TO-END MISPRONUNCIATION DETECTION AND DIAGNOSIS
3922SELECTION-CHANNEL-AWARE REVERSE JPEG COMPATIBILITY FOR HIGHLY RELIABLE STEGANALYSIS OF JPEG IMAGES
4316SELECTIVE ATTENTION ENCODERS BY SYNTACTIC GRAPH CONVOLUTIONAL NETWORKS FOR DOCUMENT SUMMARIZATION
2744SELECTIVE CONVOLUTIONAL NETWORK: AN EFFICIENT OBJECT DETECTOR WITH IGNORING BACKGROUND
5444SELF-ADAPTIVE FEATURE FOOL
2878SELF-ATTENTION AND RETRIEVAL ENHANCED NEURAL NETWORKS FOR ESSAY GENERATION
3613SELF-ATTENTIVE SENTIMENTAL SENTENCE EMBEDDING FOR SENTIMENT ANALYSIS
3855SELF-DRIVEN GRAPH VOLTERRA MODELS FOR HIGHER-ORDER LINK PREDICTION
1695SELF-PACED PROBABILISTIC PRINCIPAL COMPONENT ANALYSIS FOR DATA WITH OUTLIERS
2594Self-supervised Adversarial Training
2104SELF-SUPERVISED DEEP LEARNING FOR FISHEYE IMAGE RECTIFICATION
5519SELF-SUPERVISED DENOISING AUTOENCODER WITH LINEAR REGRESSION DECODER FOR SPEECH ENHANCEMENT
5215SELF-SUPERVISED LEARNING FOR AUDIO-VISUAL SPEAKER DIARIZATION
1496SELF-SUPERVISED LEARNING FOR ECG-BASED EMOTION RECOGNITION
4523SELF-TRAINING FOR END-TO-END SPEECH RECOGNITION
6151SELF-TUNING ALGORITHMS FOR MULTISENSOR-MULTITARGET TRACKING USING BELIEF PROPAGATION
1206SEMANTIC AUGMENTATION HASHING FOR ZERO-SHOT IMAGE RETRIEVAL
2672SemanticGAN: Generative Adversarial Networks for Semantic Image to Photo-realistic Image Translation
3967SEMI-IMPLICIT STOCHASTIC RECURRENT NEURAL NETWORKS
4011Semi-Regular Geometric Kernel Encoding \& Reconstruction for Video Compression
5476SEMI-SUPERVISED LEARNING BASED ON HIERARCHICAL GENERATIVE MODELS FOR END-TO-END SPEECH SYNTHESIS
4119SEMI-SUPERVISED LEARNING FOR TEXT CLASSIFICATION BY LAYER PARTITIONING
4471Semi-supervised learning of processes over multi-relational graphs
4194Semi-supervised optimal transport methods for detecting anomalies
5113Semi-supervised sentence classification based on user polarity in the social scenarios
2779SEMI-SUPERVISED SPEAKER ADAPTATION FOR END-TO-END SPEECH SYNTHESIS WITH PRETRAINED MODELS
6130Sensitivity in tensor decomposition
4940SENSOR SELECTION FOR MODEL-FREE SOURCE LOCALIZATION: WHERE LESS IS MORE
2685SEPARABLE OPTIMIZATION FOR JOINT BLIND DECONVOLUTION AND DEMIXING
3031SEQUENCE-LEVEL CONSISTENCY TRAINING FOR SEMI-SUPERVISED END-TO-END AUTOMATIC SPEECH RECOGNITION
4851SEQUENCE-TO-SEQUENCE AUTOMATIC SPEECH RECOGNITION WITH WORD EMBEDDING REGULARIZATION AND FUSED DECODING
2617SEQUENCE-TO-SEQUENCE LABANOTATION GENERATION BASED ON MOTION CAPTURE DATA
5931Sequence-to-sequence Singing Synthesis Using the Feed-forward Transformer
5195SEQUENCE-TO-SUBSEQUENCE LEARNING WITH CONDITIONAL GAN FOR POWER DISAGGREGATION
3422SEQUENTIAL DEEP UNROLLING WITH FLOW PRIORS FOR ROBUST VIDEO DERAINING
5139SEQUENTIAL IOT DATA AUGMENTATION USING GENERATIVE ADVERSARIAL NETWORKS
3099SEQUENTIAL JOINT DETECTION AND ESTIMATION WITH AN APPLICATION TO JOINT SYMBOL DECODING AND NOISE POWER ESTIMATION
4017SEQUENTIAL METHODS FOR DETECTING A CHANGE IN THE DISTRIBUTION OF AN EPISODIC PROCESS
1780Sequential semi-orthogonal multi-level NMF with negative residual reduction for network embedding
3783SEQUENTIAL VESSEL TRAJECTORY IDENTIFICATION USING TRUNCATED VITERBI ALGORITHM
5896SHADOW REMOVAL OF TEXT DOCUMENT IMAGES BY ESTIMATING LOCAL AND GLOBAL BACKGROUND COLORS
2331Shape from Bandwidth: Central Projection Case
2972SHORT AND SQUEEZED: ACCELERATING THE COMPUTATION OF ANTISPARSE REPRESENTATIONS WITH SAFE SQUEEZING
5015SIGHT TO SOUND: AN END-TO-END APPROACH FOR VISUAL PIANO TRANSCRIPTION
3246SIGNAL CLUSTERING WITH CLASS-INDEPENDENT SEGMENTATION
4080SIGNAL SENSING AND RECONSTRUCTION PARADIGMS FOR A NOVEL MULTI-SOURCE STATIC COMPUTED TOMOGRAPHY SYSTEM
3839SIGNAL-AWARE BROADBAND DOA ESTIMATION USING ATTENTION MECHANISMS
2527SIMILARITY LEARNING FOR COVER SONG IDENTIFICATION USING CROSS-SIMILARITY MATRICES OF MULTI-LEVEL DEEP SEQUENCES
4180SIMPLE CACHING SCHEMES FOR NON-HOMOGENEOUS MISO CACHE-AIDED COMMUNICATION VIA CONVEXITY
1510SIMPLIFIED DYNAMIC SC-FLIP POLAR DECODING
2327SIMULTANEOUS SEPARATION AND TRANSCRIPTION OF MIXTURES WITH MULTIPLE POLYPHONIC AND PERCUSSIVE INSTRUMENTS
5246SINGING VOICE CONVERSION WITH DISENTANGLED REPRESENTATIONS OF SINGER AND VOCAL TECHNIQUE USING VARIATIONAL AUTOENCODERS
4630SINGLE FREQUENCY FILTER BANK BASED LONG-TERM AVERAGE SPECTRA FOR HYPERNASALITY DETECTION AND ASSESSMENT IN CLEFT LIP AND PALATE SPEECH
1840SINGLE-CHANNEL SPEECH SEPARATION INTEGRATING PITCH INFORMATION BASED ON A MULTI TASK LEARNING FRAMEWORK
1296SINGLE-SHOT REAL-TIME MULTIPLE-PATH TIME-OF-FLIGHT DEPTH IMAGING FOR MULTI-APERTURE AND MACRO-PIXEL SENSORS
4658SketchPPNet: A Joint Pixel and Point Convolutional Neural Network for Low Resolution Sketch Image Recognition
5726SKINAUGMENT: AUTO-ENCODING SPEAKER CONVERSIONS FOR AUTOMATIC SPEECH TRANSLATION
6068SLEPIAN-BANGS FORMULA AND CRAMER-RAO BOUND FOR CIRCULAR AND NON-CIRCULAR COMPLEX ELLIPTICAL SYMMETRIC DISTRIBUTIONS
1347SliceNet: Slice-Wise 3D Shapes Reconstruction from Single Image
5055SLOGD: SPEAKER LOCATION GUIDED DEFLATION APPROACH TO SPEECH SEPARATION
4280Slow-Time MIMO-FMCW Automotive Radar Detection with Imperfect Waveform Separation
3965SMALL ENERGY MASKING FOR IMPROVED NEURAL NETWORK TRAINING FOR END-TO-END SPEECH RECOGNITION
4059SMALL-FOOTPRINT KEYWORD SPOTTING ON RAW AUDIO DATA WITH SINC-CONVOLUTIONS
1714SMOOTHING GRAPH SIGNALS VIA RANDOM SPANNING FORESTS
1204SNDCNN: SELF-NORMALIZING DEEP CNNs WITH SCALED EXPONENTIAL LINEAR UNITS FOR SPEECH RECOGNITION
1572SNORER DIARISATION BASED ON DEEP NEURAL NETWORK EMBEDDINGS
5877SOCIAL DATA ASSISTED MULTI-MODAL VIDEO ANALYSIS FOR SALIENCY DETECTION
3122SOCIAL LEARNING WITH PARTIAL INFORMATION SHARING
4586SOFT-OUTPUT FINITE ALPHABET EQUALIZATION FOR MMWAVE MASSIVE MIMO
4192SOLVING MISSING-ANNOTATION OBJECT DETECTION WITH BACKGROUND RECALIBRATION LOSS
3970SOLVING NON-CONVEX NON-DIFFERENTIABLE MIN-MAX GAMES USING PROXIMAL GRADIENT METHOD
2348SOME ALTERNATING DIRECTION METHODS OF MULTIPLIERS REVISITED FOR CONSTRAINED TOTAL VARIATION MINIMIZATION
2552Sound Event Detection By Multitask Learning of Sound Events and Scenes with Soft Scene Labels
5266SOUND EVENT DETECTION IN SYNTHETIC DOMESTIC ENVIRONMENTS
4901SOUND EVENT DETECTION VIA DILATED CONVOLUTIONAL RECURRENT NEURAL NETWORKS
4972SOUND EVENT LOCALIZATION BASED ON SOUND INTENSITY VECTOR REFINED BY DNN-BASED DENOISING AND SOURCE SEPARATION
3691Sound texture synthesis using RI spectrograms
3588SOURCE CODING OF AUDIO SIGNALS WITH A GENERATIVE MODEL
3916SOURCE DOMAIN DATA SELECTION FOR IMPROVED TRANSFER LEARNING TARGETING DYSARTHRIC SPEECH RECOGNITION
3219SOURCE ENUMERATION VIA TOEPLITZ MATRIX COMPLETION
1996SOURCE SEPARATION WITH WEAKLY LABELLED DATA: A SOLUTION TO COMPUTATIONAL AUDITORY SCENE ANALYSIS
4707SPACE FILLING CURVES FOR MRI SAMPLING
2814SPARSE BEAMSPACE EQUALIZATION FOR MASSIVE MU-MIMO MMWAVE SYSTEMS
2238SPARSE BRANCH AND BOUND FOR EXACT OPTIMIZATION OF L0-NORM PENALIZED LEAST SQUARES
2551SPARSE CONVOLUTIONAL BEAMFORMING FOR WIRELESS ULTRASOUND
5575SPARSE CSP ALGORITHM VIA JOINT SPATIO-TEMPORAL FILTERING
6078SPARSE DATA INTERPOLATION USING THE GEODESIC DISTANCE AFFINITY SPACE
2069Sparse Directed Graph Learning for Head Movement Prediction in 360 Video Streaming
4198SPARSE LOW-REDUNDANCY LINEAR ARRAY WITH UNIFORM SUM CO-ARRAY
3328Sparse modeling on distributed encryption data
4189SPARSE RECOVERY WITH NON-LINEAR FOURIER FEATURES
2174SPATIAL ACTIVE NOISE CONTROL BASED ON KERNEL INTERPOLATION WITH DIRECTIONAL WEIGHTING
4097SPATIAL AND TEMPORAL SMOOTHING FOR COVARIANCE ESTIMATION IN SUPER-RESOLUTION ANGLE ESTIMATION IN AUTOMOTIVE RADARS
4164SPATIAL ATTENTION FOR FAR-FIELD SPEECH RECOGNITION WITH DEEP BEAMFORMING NEURAL NETWORKS
2170SPATIAL ATTENTIONAL BILINEAR 3D CONVOLUTIONAL NETWORK FOR VIDEO-BASED AUTISM SPECTRUM DISORDER DETECTION
4222SPATIAL GATING STRATEGIES FOR GRAPH RECURRENT NEURAL NETWORKS
3973SPATIALLY ADAPTIVE INTRA MODE PRE-SELECTION FOR ERP 360 VIDEO CODING
1486SPATIALLY GUIDED INDEPENDENT VECTOR ANALYSIS
6104SPATIAL-TEMPORAL CONTEXT-AWARE TRACKING
5093SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION
1989SPATIO-TEMPORAL AND GEOMETRY CONSTRAINED NETWORK FOR AUTOMOBILE VISUAL ODOMETRY
3680SPEAKER ADAPTATION OF A MULTILINGUAL ACOUSTIC MODEL FOR CROSS-LANGUAGE SYNTHESIS
5580SPEAKER AUGMENTATION FOR LOW RESOURCE SPEECH RECOGNITION
4542SPEAKER DIARIZATION USING LATENT SPACE CLUSTERING IN GENERATIVE ADVERSARIAL NETWORK
2097SPEAKER DIARIZATION WITH REGION PROPOSAL NETWORK
4660SPEAKER DIARIZATION WITH SESSION-LEVEL SPEAKER EMBEDDING REFINEMENT USING GRAPH NEURAL NETWORKS
1883SPEAKER EMBEDDINGS INCORPORATING ACOUSTIC CONDITIONS FOR DIARIZATION
4389Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement
4570SPEAKER-AWARE TARGET SPEAKER ENHANCEMENT BY JOINTLY LEARNING WITH SPEAKER EMBEDDING EXTRACTION
3853SPEAKER-AWARE TRAINING OF ATTENTION-BASED END-TO-END SPEECH RECOGNITION USING NEURAL SPEAKER EMBEDDINGS
3508SPEAKERFILTER: DEEP LEARNING-BASED TARGET SPEAKER EXTRACTION USING ANCHOR SPEECH
2270SPEAKER-INVARIANT AFFECTIVE REPRESENTATION LEARNING VIA ADVERSARIAL TRAINING
2079SpecAugment on Large Scale Datasets
1058SPECTROGRAM ANALYSIS VIA SELF-ATTENTION FOR REALIZING CROSS-MODEL VISUAL-AUDIO GENERATION
3378SPECTROGRAMS FUSION WITH MINIMUM DIFFERENCE MASKS ESTIMATION FOR MONAURAL SPEECH DEREVERBERATION
3269SPECTRUM ALLOCATION IN WIRELESS NETWORKS FOR CROWD LABELLING
2081SPEECH BREATHING ESTIMATION USING DEEP LEARNING METHODS
2215SPEECH EMOTION RECOGNITION WITH DUAL-SEQUENCE LSTM ARCHITECTURE
4565SPEECH EMOTION RECOGNITION WITH LOCAL-GLOBAL AWARE DEEP REPRESENTATION LEARNING
6136SPEECH ENHANCEMENT USING A TWO-STAGE NETWORK FOR AN EFFICIENT BOOSTING STRATEGY
1846SPEECH ENHANCEMENT USING SELF-ADAPTATION AND MULTI-HEAD SELF-ATTENTION
5059SPEECH INTELLIGIBILITY ENHANCEMENT BY EQUALIZATION FOR IN-CAR APPLICATIONS
4186SPEECH RECOGNITION MODEL COMPRESSION
2278SPEECH SENTIMENT ANALYSIS VIA PRE-TRAINED FEATURES FROM END-TO-END ASR MODELS
4494Speech Synthesis using EEG
4118Speech-Based Parameter Estimation of an Asymmetric Vocal Fold Oscillation Model and Its Application in Discriminating Vocal Fold Pathologies
4288SPEECH-DRIVEN FACIAL ANIMATION USING POLYNOMIAL FUSION OF FEATURES
4171SPEECH-TO-SINGING CONVERSION IN AN ENCODER-DECODER FRAMEWORK
1590Spherical Large Intelligent Surfaces
4025SPHERICAL VIDEO CODING WITH GEOMETRY AND REGION ADAPTIVE TRANSFORM DOMAIN TEMPORAL PREDICTION
1854SPIDERnet: ATTENTION NETWORK FOR ONE-SHOT ANOMALY DETECTION IN SOUNDS
4256SPIKING NEURAL NETWORKS TRAINED WITH BACKPROPAGATION FOR LOW POWER NEUROMORPHIC IMPLEMENTATION OF VOICE ACTIVITY DETECTION
2786SPOKEN DOCUMENT RETRIEVAL LEVERAGING BERT-BASED MODELING AND QUERY REFORMULATION
5250SPOKEN LANGUAGE ACQUISITION BASED ON REINFORCEMENT LEARNING AND WORD UNIT SEGMENTATION
2971SRZOO: AN INTEGRATED REPOSITORY FOR SUPER-RESOLUTION USING DEEP LEARNING
4044SSGD: SPARSITY-PROMOTING STOCHASTIC GRADIENT DESCENT ALGORITHM FOR UNBIASED DNN PRUNING
1618SSTNET: DETECTING MANIPULATED FACES THROUGH SPATIAL, STEGANALYSIS AND TEMPORAL FEATURES
3898STABILITY OF GRAPH NEURAL NETWORKS TO RELATIVE PERTURBATIONS
4261STABILIZING MULTI-AGENT DEEP REINFORCEMENT LEARNING BY IMPLICITLY ESTIMATING OTHER AGENTS’ BEHAVIORS
3085STABLE TRAINING OF DNN FOR SPEECH ENHANCEMENT BASED ON PERCEPTUALLY-MOTIVATED BLACK-BOX COST FUNCTION
3303STAGED TRAINING STRATEGY AND MULTI-ACTIVATION FOR AUDIO TAGGING WITH NOISY AND SPARSE MULTI-LABEL DATA
5223StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition
6105STATE-AWARE ANTI-DRIFT OBJECT TRACKING
4828STATE-BASED TRANSCRIPTION OF COMPONENTS OF CARNATIC MUSIC
4789State-space Gaussian Process for Drift Estimation in Stochastic Differential Equations
3142STATIC VISUAL SPATIAL PRIORS FOR DOA ESTIMATION
1304STATISTICAL SIGNAL PROCESSING APPROACH FOR RAIN ESTIMATION BASED ON MEASUREMENTS FROM NETWORK MANAGEMENT SYSTEMS
2937STATISTICS POOLING TIME DELAY NEURAL NETWORK BASED ON X-VECTOR FOR SPEAKER VERIFICATION
5132Steepening Squared Error Function Helps Online Adaptation of Gaussian Scales
1215Steganography and its Detection in JPEG Images Obtained with the “Trunc” Quantizer
4763STOCHASTIC ADMM FOR BYZANTINE-ROBUST DISTRIBUTED LEARNING
2663STOCHASTIC GEOMETRY PLANNING OF ELECTRIC VEHICLES CHARGING STATIONS
3022STOCHASTIC GRAPH NEURAL NETWORKS
5767STOCHASTIC ML ESTIMATION FOR HYPERSPECTRAL UNMIXING UNDER ENDMEMBER VARIABILITY AND NONLINEAR MODELS
4053STOCHASTIC MULTI-SCALE AGGREGATION NETWORK FOR CROWD COUNTING
4150STOCK MOVEMENT PREDICTION THAT INTEGRATES HETEROGENEOUS DATA SOURCES USING DILATED CAUSAL CONVOLUTION NETWORKS WITH ATTENTION
5345STORING DIGITAL DATA INTO DNA: A COMPARATIVE STUDY OF QUATERNARY CODE CONSTRUCTION
4297STRATEGIC ATTENTION LEARNING FOR MODALITY TRANSLATION
3551STREAMING AUTOMATIC SPEECH RECOGNITION WITH THE TRANSFORMER MODEL
5992STRUCTURAL SPARSIFICATION FOR FAR-FIELD SPEAKER RECOGNITION WITH GNA
6143STRUCTURED AND UNSTRUCTURED OUTLIER IDENTIFICATION FOR ROBUST PCA: A FAST PARAMETER FREE ALGORITHM
4849STRUCTURED CITATION TREND PREDICTION USING GRAPH NEURAL NETWORKS
2377STRUCTURED SPARSE ATTENTION FOR END-TO-END AUTOMATIC SPEECH RECOGNITION
5739STUDY OF CLOSED PHASE RESONANCE BANDWIDTHS FOR ORAL AND NASAL TRACTS USING ZERO TIME WINDOWING
5257STUDY OF FORMANT MODIFICATION FOR CHILDREN ASR
1983SUB-DIP: OPTIMIZATION ON A SUBSPACE WITH DEEP IMAGE PRIOR REGULARIZATION AND APPLICATION TO SUPERRESOLUTION
1444Subject Transfer Framework Based on Source Selection and Semi-Supervised Style Transfer Mapping for sEMG Pattern Recognition
3023SUBJECTIVE QUALITY ESTIMATION USING PESQ FOR HANDS-FREE TERMINALS
4398Submodular Rank Aggregation on Score-based Permutations for Distributed Automatic Speech Recognition
3879SUBSPACE-BASED SPEECH CORRELATION VECTOR ESTIMATION FOR SINGLE-MICROPHONE MULTI-FRAME MVDR FILTERING
3134Superpixel Segmentation via Convolutional Neural Networks with Regularized Information Maximization
4429SUPER-RESOLUTION OF 3D COLOR POINT CLOUDS VIA FAST GRAPH TOTAL VARIATION
6152SUPER-RESOLUTION VIA IMAGE-ADAPTED DENOISING CNNS: INCORPORATING EXTERNAL AND INTERNAL LEARNING
5547SUPER-RESOLUTION WITH NOISY MEASUREMENTS: RECONCILING UPPER AND LOWER BOUNDS
1209SUPERVISED CANONICAL CORRELATION ANALYSIS OF DATA ON SYMMETRIC POSITIVE DEFINITE MANIFOLDS BY RIEMANNIAN DIMENSIONALITY REDUCTION
4579SUPERVISED DEEP HASHING FOR EFFICIENT AUDIO EVENT RETRIEVAL
1372SUPERVISED ENCODING FOR DISCRETE REPRESENTATION LEARNING
3872SUPERVISED GRAPH REPRESENTATION LEARNING FOR MODELING THE RELATIONSHIP BETWEEN STRUCTURAL AND FUNCTIONAL BRAIN CONNECTIVITY
3306Supervised online diarization with sample mean loss for multi-domain data
6074SWIFT-LINK: A COMPRESSIVE BEAM ALIGNMENT ALGORITHM FOR PRACTICAL MMWAVE RADIOS
6119SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
4953SYNCHRONOUS TRANSFORMERS FOR END-TO-END SPEECH RECOGNITION
4765SYNTHESIZING ENGAGING MUSIC USING DYNAMIC MODELS OF STATISTICAL SURPRISAL
4815SYNTHETIC CROWD AND PEDESTRIAN GENERATOR FOR DEEP LEARNING PROBLEMS
3437SYNTHETIC DATA GENERATION THROUGH STATISTICAL EXPLOSION: IMPROVING CLASSIFICATION ACCURACY OF CORONARY ARTERY DISEASE USING PPG
3592SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT
3125TACKLING REAL NOISY REVERBERANT MEETINGS WITH ALL-NEURAL SOURCE SEPARATION, COUNTING, AND DIARIZATION SYSTEM
2637TALKER-INDEPENDENT SPEAKER SEPARATION IN REVERBERANT CONDITIONS
1835TARGET PARAMETER ESTIMATION VIA ONE-BIT PMCW RADAR
4985TASK-AWARE MEAN TEACHER METHOD FOR LARGE SCALE WEAKLY LABELED SEMI-SUPERVISED SOUND EVENT DETECTION
1610TDMF: TASK-DRIVEN MULTILEVEL FRAMEWORK FOR END-TO-END SPEAKER VERIFICATION
2415TEACHER-STUDENT TRAINING FOR ROBUST TACOTRON-BASED TTS
3181TEACHING SIGNALS AND SYSTEMS - A FIRST COURSE IN SIGNAL PROCESSING
3414TEMPORAL CODING IN SPIKING NEURAL NETWORKS WITH ALPHA SYNAPTIC FUNCTION
1757Tensor Decomposition-based Beamspace ESPRIT Algorithm for Multidimensional Harmonic Retrieval
4710TENSORFLOW AUDIO MODELS IN ESSENTIA
4370Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network
3997TEXCEPTION: A CHARACTER/WORD-LEVEL DEEP LEARNING MODEL FOR PHISHING URL DETECTION
5335TEXT ADAPTATION FOR SPEAKER VERIFICATION WITH SPEAKER-TEXT FACTORIZED EMBEDDINGS
5822TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES
1987Text-to-image synthesis method evaluation based on visual patterns
1373T-GSA: TRANSFORMER WITH GAUSSIAN-WEIGHTED SELF-ATTENTION FOR SPEECH ENHANCEMENT
3198THE COMPRESSED NESTED ARRAY FOR UNDERDETERMINED DOA ESTIMATION BY FOURTH-ORDER DIFFERENCE COARRAY
4892THE DISCRETE STOCKWELL TRANSFORMS FOR INFINITE-LENGTH SIGNALS AND THEIR REAL-TIME IMPLEMENTATIONS
4020THE EFFECT OF DATA AUGMENTATION ON CLASSIFICATION OF ATRIAL FIBRILLATION IN SHORT SINGLE-LEAD ECG SIGNALS USING DEEP NEURAL NETWORKS
3260THE EFFECT OF POWER ALLOCATION ON VISIBLE LIGHT COMMUNICATION USING COMMERCIAL PHOSPHOR-CONVERTED LED LAMP FOR INDIRECT ILLUMINATION
5779THE EMPIRICAL DUALITY GAP OF CONSTRAINED STATISTICAL LEARNING
5894The FifthNet Chroma Extractor
2020THE FRACTIONAL QUATERNION FOURIER NUMBER TRANSFORM
4179THE GRAPHON FOURIER TRANSFORM
2468THE MATCHED REASSIGNED CROSS-SPECTROGRAM FOR PHASE ESTIMATION
5105THE OPEN BRANDS DATASET: UNIFIED BRAND DETECTION AND RECOGNITION AT SCALE
5908The PICASSO algorithm for Bayesian localization via paired comparisons in a union of subspaces model
2841THE PROCESSING OF MANDARIN CHINESE TONAL ALTERNATIONS IN CONTEXTS: AN EYE-TRACKING STUDY
2602The Role of Annotation Fusion Methods in the Study of Human-Reported Emotion Experience During Music Listening
5515THE RWTH ASR SYSTEM FOR TED-LIUM RELEASE 2: IMPROVING HYBRID HMM WITH SPECAUGMENT
4611THE SOUND OF MY VOICE: SPEAKER REPRESENTATION LOSS FOR TARGET VOICE SEPARATION
3896The SWAX Benchmark: Attacking Biometric Systems with Wax Figures
1425THEORETICAL ANALYSIS OF MULTI-CARRIER AGILE PHASED ARRAY RADAR
3102Theoretical Performance Bound of Uplink Channel Estimation Accuracy in Massive MIMO
5574THIS DATASET DOES NOT EXIST: TRAINING MODELS FROM GENERATED IMAGES
1184THRESHOLD-ADJUSTED ORB STRATEGIES WITH GENETIC ALGORITHM AND PROTECTIVE CLOSING STRATEGY ON TAIWAN FUTURES MARKET
4005TIME DIFFERENCE OF ARRIVAL ESTIMATION FROM FREQUENCY-SLIDING GENERALIZED CROSS-CORRELATIONS USING CONVOLUTIONAL NEURAL NETWORKS
3949TIME DOMAIN VELOCITY VECTOR FOR RETRACING THE MULTIPATH PROPAGATION
3399TIME REVERSAL BASED ROBUST GESTURE RECOGNITION USING WIFI
1819TIME-DOMAIN AUDIO SOURCE SEPARATION BASED ON WAVE-U-NET COMBINED WITH DISCRETE WAVELET TRANSFORM
5046TIME-DOMAIN NEURAL NETWORK APPROACH FOR SPEECH BANDWIDTH EXTENSION
1501TIME-FREQUENCY ANALYSIS OF UNIMODAL SENSORY PROCESSING IN AUTISM SPECTRUM DISORDER
2221TIME-FREQUENCY FEATURE DECOMPOSITION BASED ON SOUND DURATION FOR ACOUSTIC SCENE CLASSIFICATION
4028TIME-FREQUENCY LOSS FOR CNN BASED SPEECH SUPER-RESOLUTION
3429TIME-PREDICTABLE SOFTWARE-DEFINED ARCHITECTURE WITH SDF-BASED COMPILER FLOW FOR 5G BASEBAND PROCESSING
2141TIME-SCALE SYNTHESIS FOR LOCALLY STATIONARY SIGNALS
6072TOA-BASED LOCALIZATION WITH NLOS MITIGATION VIA ROBUST MULTIDIMENSIONAL SIMILARITY ANALYSIS
5021TOSO: STUDENT'S-T DISTRIBUTION AIDED ONE-STAGE ORIENTATION TARGET DETECTION IN REMOTE SENSING IMAGES
4372TOWARD BETTER SPEAKER EMBEDDINGS: AUTOMATED COLLECTION OF SPEECH SAMPLES FROM UNKNOWN DISTINCT SPEAKERS
1749TOWARDS A NEW UNDERSTANDING OF THE TRAINING OF NEURAL NETWORKS WITH MISLABELED TRAINING DATA
3940TOWARDS AN EFFICIENT AND GENERAL FRAMEWORK OF ROBUST TRAINING FOR GRAPH NEURAL NETWORKS
4112TOWARDS AN INTELLIGENT MICROSCOPE: ADAPTIVELY LEARNED ILLUMINATION FOR OPTIMAL SAMPLE CLASSIFICATION
3002TOWARDS BLIND QUALITY ASSESSMENT OF CONCERT AUDIO RECORDINGS USING DEEP NEURAL NETWORKS
4906TOWARDS DATA-EFFICIENT MODELING FOR WAKE WORD SPOTTING
4982Towards Decoding Selective Attention from Single-Trial EEG Data in Cochlear Implant Users Based on Deep Neural Networks
2870TOWARDS FAST AND ACCURATE STREAMING END-TO-END ASR
4688TOWARDS HIGH-PERFORMANCE OBJECT DETECTION: TASK-SPECIFIC DESIGN CONSIDERING CLASSIFICATION AND LOCALIZATION SEPARATION
5841TOWARDS LINKING THE LAKH AND IMSLP DATASETS
5454Towards Multilingual Sign Language Recognition
3296Towards Pose-invariant Lip-Reading
4066TOWARDS REAL-TIME SINGLE-CHANNEL SINGING-VOICE SEPARATION WITH PRUNED MULTI-SCALED DENSENETS
4793Towards Real-time, Multi-view Video Stereopsis
5682TOWARDS UNSUPERVISED SPEECH RECOGNITION AND SYNTHESIS WITH QUANTIZED SPEECH REPRESENTATION LEARNING
1433TRACE NORM GENERATIVE ADVERSARIAL NETWORKS FOR SENSOR GENERATION AND FEATURE EXTRACTION
1925Tracing Network Evolution Using the PARAFAC2 Model
5921TRACK-BEFORE-DETECT FOR SUB-NYQUIST RADAR
4600TRACKING TO IMPROVE DETECTION QUALITY IN LIDAR FOR AUTONOMOUS DRIVING
2026TRAINING A CODE-SWITCHING LANGUAGE MODEL WITH MONOLINGUAL DATA
4084TRAINING ASR MODELS BY GENERATION OF CONTEXTUAL INFORMATION
4475TRAINING DEEP SPIKING NEURAL NETWORKS FOR ENERGY-EFFICIENT NEUROMORPHIC COMPUTING
4794Training Keyword Spotters with Limited and Synthesized Speech Data
3514TRAINING LSTM FOR UNSUPERVISED ANOMALY DETECTION WITHOUT A PRIORI KNOWLEDGE
2051TRAINING SPOKEN LANGUAGE UNDERSTANDING SYSTEMS WITH NON-PARALLEL SPEECH AND TEXT
5118TRANSFER LEARNING FROM YOUTUBE SOUNDTRACKS TO TAG ARCTIC ECOACOUSTIC RECORDINGS
4517TRANSFERABLE POLICIES FOR LARGE SCALE WIRELESS NETWORKS WITH GRAPH NEURAL NETWORKS
4588TRANSFERRING NEURAL SPEECH WAVEFORM SYNTHESIZERS TO MUSICAL INSTRUMENT SOUNDS GENERATION
4897Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
2702TRANSFORMER VAE: A HIERARCHICAL MODEL FOR STRUCTURE-AWARE AND INTERPRETABLE MUSIC REPRESENTATION LEARNING
1805Transformer-based Acoustic Modeling for Hybrid Speech Recognition
3518TRANSFORMER-BASED ONLINE CTC/ATTENTION END-TO-END SPEECH RECOGNITION ARCHITECTURE
2269TRANSFORMER-BASED TEXT-TO-SPEECH WITH WEIGHTED FORCED ATTENTION
5828TRANSFORMING SEISMOCARDIOGRAMS INTO ELECTROCARDIOGRAMS BY APPLYING CONVOLUTIONAL AUTOENCODERS
1736TRANSLATION OF A HIGHER ORDER AMBISONICS SOUND SCENE BASED ON PARAMETRIC DECOMPOSITION
1899TRANSMIT BEAMFORMING DESIGN WITH RECEIVED-INTERFERENCE POWER CONSTRAINTS: THE ZERO-FORCING RELAXATION
2688TRANSMIT BEAMPATTERN SHAPING VIA WAVEFORM DESIGN IN COGNITIVE MIMO RADAR
4848Trapezoidal Segment Sequencing: A Novel Approach for Fusion of Human-produced Continuous Annotations
4726TREE OF SHAPES CUT FOR MATERIAL SEGMENTATION GUIDED BY A DESIGN
2022Triggerless Random Interleaved Sampling
4497TRILINGUAL SEMANTIC EMBEDDINGS OF VISUALLY GROUNDED SPEECH WITH SELF-ATTENTION MECHANISMS
1748TRIPLET LOSS FEATURE AGGREGATION FOR SCALABLE HASH
4433TRUTH-TO-ESTIMATE RATIO MASK: A POST-PROCESSING METHOD FOR SPEECH ENHANCEMENT DIRECT AT LOW SIGNAL-TO-NOISE RATIOS
5826TS-FEN: PROBING FEATURE SELECTION STRATEGY FOR FACE ANTI-SPOOFING
6063TWO-DIMENSIONAL DOA ESTIMATION FOR COPRIME PLANAR ARRAY: A COARRAY TENSOR-BASED SOLUTION
4307TWO-ELEMENT BIOMIMETIC ANTENNA ARRAY DESIGN AND PERFORMANCE
5282TWO-STEP ACOUSTIC MODEL ADAPTATION FOR DYSARTHRIC SPEECH RECOGNITION
4400TWO-STEP SOUND SOURCE SEPARATION: TRAINING ON LEARNED LATENT TARGETS
1741UNCERTAINTIES IN SHORT COMMERCIAL MICROWAVE LINKS FADING DUE TO RAIN
5818Uncertainty Quantification for Remaining Useful Lifetime Prediction with Multi-channel Sensory Data
5479UNDERWATER TRACKING BASED ON THE SUM-PRODUCT ALGORITHM ENHANCED BY A NEURAL NETWORK DETECTIONS CLASSIFIER
3504UNet 3+: A full-scale connected unet for medical image segmentation
4405UNIFIED SIGNAL COMPRESSION USING GENERATIVE ADVERSARIAL NETWORKS
4264Universal Phone Recognition with a Multilingual Allophone System
3456Unresolved Radar Targets Separation with Direct Extraction of Local Frequencies
3080UNSEEN FACE PRESENTATION ATTACK DETECTION WITH HYPERSPHERE LOSS
1667UNSUPERVISED AUTO-ENCODING MULTIPLE-OBJECT TRACKER FOR CONSTRAINT-CONSISTENT COMBINATORIAL PROBLEM
4177UNSUPERVISED CHANGE DETECTION FOR MULTIMODAL REMOTE SENSING IMAGES VIA COUPLED DICTIONARY LEARNING AND SPARSE CODING
4428UNSUPERVISED CONTENT-PRESERVED ADAPTATION NETWORK FOR CLASSIFICATION OF PULMONARY TEXTURES FROM DIFFERENT CT SCANNERS
4631UNSUPERVISED DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION WITH SYMMETRIC ADAPTATION CONSISTENCY
6070UNSUPERVISED ENSEMBLE CLASSIFICATION WITH CORRELATED DECISION AGENTS
4669UNSUPERVISED FEATURE ENHANCEMENT FOR SPEAKER VERIFICATION
3785UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION VIA FAIR REPRESENTATION OF GENDER BIAS
5179UNSUPERVISED KEY HAND SHAPE DISCOVERY OF SIGN LANGUAGE VIDEOS WITH CORRESPONDENCE SPARSE AUTOENCODERS
2437Unsupervised Multiple Source Localization Using Relative Harmonic Coefficients
5313UNSUPERVISED NEURAL MASK ESTIMATOR FOR GENERALIZED EIGEN-VALUE BEAMFORMING BASED ASR
1234UNSUPERVISED PERSON RE-IDENTIFICATION USING MULTI-BRANCH FEATURE COMPENSATION NETWORK AND LINK-BASED CLUSTER DISSIMILARITY METRIC
2719UNSUPERVISED PRE-TRAINING OF BIDIRECTIONAL SPEECH ENCODERS VIA MASKED RECONSTRUCTION
3707Unsupervised pretraining transfers well across languages
2571UNSUPERVISED SPEAKER ADAPTATION USING ATTENTION-BASED SPEAKER MEMORY FOR END-TO-END ASR
3749Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis
1212UNSUPERVISED TRAINING FOR DEEP SPEECH SOURCE SEPARATION WITH KULLBACK-LEIBLER DIVERGENCE BASED PROBABILISTIC LOSS FUNCTION
5544UNSUPERVISED VARIATIONAL BAYESIAN KALMAN FILTERING FOR LARGE-DIMENSIONAL GAUSSIAN SYSTEMS
5645UPGRADE METHODS FOR STRATIFIED SENSOR NETWORK SELF-CALIBRATION
5231UPGRADING CRFS TO JRFS AND ITS BENEFITS TO SEQUENCE MODELING AND LABELING
3891UPSCALING VECTOR APPROXIMATE MESSAGE PASSING
3048URTIS: A SMALL 3D IMAGING SONAR SENSOR FOR ROBOTIC APPLICATIONS
2234USING AUTOMATIC SPEECH RECOGNITION AND SPEECH SYNTHESIS TO IMPROVE THE INTELLIGIBILITY OF COCHLEAR IMPLANT USERS IN REVERBERANT LISTENING ENVIRONMENTS
4253USING BLACK-BOX COMPRESSION ALGORITHMS FOR PHASE RETRIEVAL
3564USING INTELLIGENT REFLECTING SURFACES FOR RANK IMPROVEMENT IN MIMO COMMUNICATIONS
5530USING PANORAMIC VIDEOS FOR MULTI-PERSON LOCALIZATION AND TRACKING IN A 3D PANORAMIC COORDINATE
4041Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation
1411USING SEPARATE LOSSES FOR SPEECH AND NOISE IN MASK-BASED SPEECH ENHANCEMENT
2044USING SPEECH SYNTHESIS TO TRAIN END-TO-END SPOKEN LANGUAGE UNDERSTANDING MODELS
1484USING VAES AND NORMALIZING FLOWS FOR ONE-SHOT TEXT-TO-SPEECH SYNTHESIS OF EXPRESSIVE SPEECH
2832USING X-VECTORS TO AUTOMATICALLY DETECT PARKINSON'S DISEASE FROM SPEECH
5302UTTERANCE-LEVEL SEQUENTIAL MODELING FOR DEEP GAUSSIAN PROCESS BASED SPEECH SYNTHESIS USING SIMPLE RECURRENT UNIT
2029VAMP with Vector-Valued Diagonalization
3051VAPAR SYNTH - A VARIATIONAL PARAMETRIC MODEL FOR AUDIO SYNTHESIS
5450VARIABLE BITRATE IMAGE COMPRESSION WITH QUALITY SCALING FACTORS
2969VARIABLE METRIC PROXIMAL GRADIENT METHOD WITH DIAGONAL BARZILAI-BORWEIN STEPSIZE
3974VARIABLE PROJECTION FOR MULTIPLE FREQUENCY ESTIMATION
4759VARIATIONAL STUDENT: LEARNING COMPACT AND SPARSER NETWORKS IN KNOWLEDGE DISTILLATION FRAMEWORK
3038VERSATILE VIDEO CODING AND SUPER-RESOLUTION FOR EFFICIENT DELIVERY OF 8K VIDEO WITH 4K BACKWARD-COMPATIBILITY
2494VGGFOLEY: A LARGE-SCALE AUDIO-VISUAL DATASET
5334VIDEO DEBLURRING VIA 3D CNN AND FOURIER ACCUMULATION LEARNING
5588VIDEO FRAME INTERPOLATION VIA EXCEPTIONAL MOTION-AWARE SYNTHESIS
1777Video Frame Interpolation via Residue Refinement
2277Video Question Generation via Semantic Rich Cross-Modal Self-Attention Networks Learning
5209VIEW-ANGLE INVARIANT OBJECT MONITORING WITHOUT IMAGE REGISTRATION
2514VIMO: VITAL SIGN MONITORING USING COMMODITY MILLIMETER WAVE RADIO
1713VISUALLY GUIDED SELF SUPERVISED LEARNING OF SPEECH REPRESENTATIONS
4519VOCAL TRACT ARTICULATORY CONTOUR DETECTION IN REAL-TIME MAGNETIC RESONANCE IMAGES USING SPATIO-TEMPORAL CONTEXT
6096VOICE ACTIVITY DETECTION FOR TRANSIENT NOISY ENVIRONMENT BASED ON DIFFUSION NETS
4923VOICE BASED CLASSIFICATION OF PATIENTS WITH AMYOTROPIC LATERAL SCLEROSIS, PARKINSON'S DISEASE AND HEALTHY CONTROLS WITH CNN-LSTM USING TRANSFER LEARNING
5115VOICE CONVERSION WITH TRANSFORMER NETWORK
5337VOICEAI SYSTEMS TO NIST SRE19 EVALUATION: ROBUST SPEAKER RECOGNITION ON CONVERSATIONAL TELEPHONE SPEECH
3866VOLUME RECONSTRUCTION FOR LIGHT FIELD MICROSCOPY
4657WAVEFFJORD: FFJORD-BASED VOCODER FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS
4624WAWEnets: A No-Reference Convolutional Waveform-Based Approach to Estimating Narrowband and Wideband Speech Quality
2510WEAKLY LABELLED AUDIO TAGGING VIA CONVOLUTIONAL NETWORKS WITH SPATIAL AND CHANNEL-WISE ATTENTION
2148Weakly Supervised Crowd-Wise Attention for Robust Crowd Counting
5160WEAKLY SUPERVISED SEGMENTATION GUIDED HAND POSE ESTIMATION DURING INTERACTION WITH UNKNOWN OBJECTS
5150WEAKLY SUPERVISED SEMANTIC SEGMENTATION FOR REMOTE SENSING HYPERSPECTRAL IMAGING
5216WEAKLY-SUPERVISED SOUND EVENT DETECTION WITH SELF-ATTENTION
3068Weight Sharing and Deep Learning for Spectral data
4195WEIGHTED GRADIENT CODING WITH LEVERAGE SCORE SAMPLING
3515WEIGHTED KRYLOV-LEVENBERG-MARQUARDT METHOD FOR CANONICAL POLYADIC TENSOR DECOMPOSITION
2155Weighted Null Vector Initialization and its Application to Phase Retrieval
4228WHAMR!: NOISY AND REVERBERANT SINGLE-CHANNEL SPEECH SEPARATION
1841WHAT DID YOUR ADVERSARY BELIEVE? OPTIMAL FILTERING AND SMOOTHING IN COUNTER-ADVERSARIAL AUTONOMOUS SYSTEMS
5513WHAT DOES A NETWORK LAYER HEAR? ANALYZING HIDDEN REPRESENTATIONS OF END-TO-END ASR THROUGH SPEECH SYNTHESIS
3443WHAT IS BEST FOR SPOKEN LANGUAGE UNDERSTANDING: SMALL BUT TASK-DEPENDANT EMBEDDINGS OR HUGE BUT OUT-OF-DOMAIN EMBEDDINGS?
5164WHAT MAKES THE SOUND?: A DUAL-MODALITY INTERACTING NETWORK FOR AUDIO-VISUAL EVENT LOCALIZATION
2979WHOSECOUGH: IN-THE-WILD COUGHER VERIFICATION USING MULTITASK LEARNING
4866WIDEBAND CHANNEL TRACKING FOR MILLIMETER WAVE MASSIVE MIMO SYSTEMS WITH HYBRID BEAMFORMING RECEPTION
3579WIDEBAND DIRECTION OF ARRIVAL ESTIMATION WITH SPARSE LINEAR ARRAYS
5494WIND: WASSERSTEIN INCEPTION DISTANCE FOR EVALUATING GENERATIVE ADVERSARIAL NETWORK PERFORMANCE
3614WIRTINGER FLOW ALGORITHMS FOR PHASE RETRIEVAL FROM BINARY MEASUREMENTS
5881WITCHCRAFT: EFFICIENT PGD ATTACKS WITH RANDOM STEP SIZE
4070Within-sample variability-invariant loss for robust speaker recognition under noisy environments
1181XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture Classification
5756XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE
5311XPSNR: A LOW-COMPLEXITY EXTENSION OF THE PERCEPTUALLY WEIGHTED PEAK SIGNAL-TO-NOISE RATIO FOR HIGH-RESOLUTION VIDEO QUALITY ASSESSMENT
4167X-VECTORS MEET EMOTIONS: A STUDY ON DEPENDENCIES BETWEEN EMOTION AND SPEAKER RECOGNITION
3810ZERO-CROSSING PRECODING WITH MAXIMUM DISTANCE TO THE DECISION THRESHOLD FOR CHANNELS WITH 1-BIT QUANTIZATION AND OVERSAMPLING
2921ZERO-SHOT MULTI-SPEAKER TEXT-TO-SPEECH WITH STATE-OF-THE-ART NEURAL SPEAKER EMBEDDINGS