The following is the list of accepted ICASSP 2020 papers, sorted by paper title. You can use your web browser's search feature to find your paper number. Notifications have also been sent to all authors by email. If you have not received your notification by email, please contact us at icassp2020@cmsworkshops.com.
5666 | $\beta$-NMF AND SPARSITY PROMOTING REGULARIZATIONS FOR COMPLEX MIXTURE UNMIXING. APPLICATION TO 2D HSQC NMR. |
2035 | 1.5GBIT/S 4.9W HYPERSPECTRAL IMAGE ENCODERS ON A LOW-POWER PARALLEL HETEROGENEOUS PROCESSING PLATFORM |
4728 | 2D-to-2D Mask Estimation for Speech Enhancement based on Fully Convolutional Neural Network |
4833 | 3-D ACOUSTIC MODELING FOR FAR-FIELD MULTI-CHANNEL SPEECH RECOGNITION |
3065 | 3D DEFORMATION SIGNATURE FOR DYNAMIC FACE RECOGNITION |
5740 | 3D Unknown View Tomography via Rotation Invariants |
6108 | A BEAMFORMING ALGORITHM BASED ON MAXIMUM LIKELIHOOD OF A COMPLEX GAUSSIAN DISTRIBUTION WITH TIME-VARYING VARIANCES FOR ROBUST SPEECH RECOGNITION |
3486 | A BIDIRECTIONAL CONTEXT PROPAGATION NETWORK FOR URINE SEDIMENT PARTICLE DETECTION IN MICROSCOPIC IMAGES |
6000 | A Bi-model Approach for Handling Unknown Slot Values in Dialogue State Tracking |
1471 | A BIN ENCODING TRAINING OF A SPIKING NEURAL NETWORK BASED VOICE ACTIVITY DETECTION |
3859 | A COMPARATIVE STUDY OF ESTIMATING ARTICULATORY MOVEMENTS FROM PHONEME SEQUENCES AND ACOUSTIC FEATURES |
2890 | A COMPARATIVE STUDY OF WESTERN AND CHINESE CLASSICAL MUSIC BASED ON SOUNDSCAPE MODELS |
4022 | A COMPARISON OF POOLING METHODS ON LSTM MODELS FOR RARE ACOUSTIC EVENT CLASSIFICATION |
3557 | A Complexity Efficient DMT-Optimal Tree Pruning Based Sphere Decoding |
2310 | A COMPOSITE DNN ARCHITECTURE FOR SPEECH ENHANCEMENT |
6051 | A COMPREHENSIVE FRAMEWORK FOR 2D-JND EXTENSION TO 360-DEG IMAGES |
3595 | A COMPREHENSIVE STUDY OF RESIDUAL CNNS FOR ACOUSTIC MODELING IN ASR |
3252 | A COMPUTATIONALLY LIGHT ALGORITHM FOR BAYESIAN SPEECH ENHANCEMENT WITH SNR MARGINALIZATION |
5175 | A connected auto-encoders based approach for image separation with side information: with applications to art investigation |
1659 | A CONSTRAINED MAXIMUM LIKELIHOOD ESTIMATOR OF SPEECH AND NOISE SPECTRA WITH APPLICATION TO MULTI-MICROPHONE NOISE REDUCTION |
1972 | A CROSS-TASK TRANSFER LEARNING APPROACH TO ADAPTING DEEP SPEECH ENHANCEMENT MODELS TO UNSEEN BACKGROUND NOISE USING PAIRED SENONE CLASSIFIERS |
1929 | A DATA EFFICIENT END-TO-END SPOKEN LANGUAGE UNDERSTANDING ARCHITECTURE |
2706 | A DATASET FOR MEASURING READING LEVELS IN INDIA AT SCALE |
1261 | A DEEP GRADIENT BOOSTING NETWORK FOR OPTIC DISC AND CUP SEGMENTATION |
3971 | A DEEP LEARNING APPROACH TO OBJECT AFFORDANCE SEGMENTATION |
5815 | A DEEP LEARNING ARCHITECTURE FOR EPILEPTIC SEIZURE CLASSIFICATION BASED ON OBJECT AND ACTION RECOGNITION |
4857 | A DEEP MULTIMODAL APPROACH FOR MAP IMAGE CLASSIFICATION |
5952 | A Deep Neural Network-Driven Feature Learning Method for Polyphonic Acoustic Event Detection from Real-Life Recordings |
5238 | A DENSE U-NET WITH CROSS-LAYER INTERSECTION FOR DETECTION AND LOCALIZATION OF IMAGE FORGERY |
2606 | A DIALOGICAL EMOTION DECODER FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOG |
1505 | A DIFFERENTIAL APPROACH FOR RAIN FIELD TOMOGRAPHIC RECONSTRUCTION USING MICROWAVE SIGNALS FROM LEO SATELLITES |
1240 | A DISCRIMINATIVE CONDITION-AWARE BACKEND FOR SPEAKER VERIFICATION |
1880 | A DSP ACCELERATION FRAMEWORK FOR SOFTWARE-DEFINED RADIOS ON X86_64 |
4409 | A DUAL-STAGED CONTEXT AGGREGATION METHOD TOWARDS EFFICIENT END-TO-END SPEECH ENHANCEMENT |
2658 | A DYNAMIC STREAM WEIGHT BACKPROP KALMAN FILTER FOR AUDIOVISUAL SPEAKER TRACKING |
5258 | A FAST AND ACCURATE FREQUENT DIRECTIONS ALGORITHM FOR LOW RANK APPROXIMATION VIA BLOCK KRYLOV ITERATION |
1517 | A FAST AND ACCURATE SUPER-RESOLUTION NETWORK USING PROGRESSIVE RESIDUAL LEARNING |
5377 | A Fast Non-contact Vital Signs Detection Method Based on Regional Hidden Markov Model in a 77GHz LFMCW Radar System |
2875 | A FAST PROXIMAL POINT ALGORITHM FOR GENERALIZED GRAPH LAPLACIAN LEARNING |
3751 | A FAST REDUCED-RANK SOUND ZONE CONTROL ALGORITHM USING THE CONJUGATE GRADIENT METHOD |
5125 | A fast sparse covariance-based fitting method for DOA estimation via non-negative least squares |
3710 | A FIFO BASED ACCELERATOR FOR CONVOLUTIONAL NEURAL NETWORKS |
1895 | A FORWARD-BACKWARD ALGORITHM FOR REWEIGHTED PROCEDURES: APPLICATION TO RADIO-ASTRONOMICAL IMAGING |
3072 | A FRAMEWORK FOR PARAMETERS ESTIMATION OF IMAGE OPERATOR CHAIN |
1909 | A Framework for the Robust Evaluation of Sound Event Detection |
2771 | A FREQUENCY-DOMAIN BSS METHOD BASED ON L1 NORM, UNITARY CONSTRAINT, AND CAYLEY TRANSFORM |
2258 | A GATED HYPERNET DECODER FOR POLAR CODES |
2144 | A General Difficulty Control Algorithm for Proof-of-Work Based Blockchains |
3591 | A GENERAL TEST FOR THE LINEAR STRUCTURE OF COVARIANCE MATRICES OF GAUSSIAN POPULATIONS |
4486 | A GENERALIZATION OF PRINCIPAL COMPONENT ANALYSIS |
2919 | A GENERALIZED FRAMEWORK FOR DOMAIN ADAPTATION OF PLDA IN SPEAKER RECOGNITION |
1164 | A GEOMETRIC APPROACH FOR UNSUPERVISED SIMILARITY LEARNING |
1462 | A GRAPH NETWORK MODEL FOR DISTRIBUTED LEARNING WITH LIMITED BANDWIDTH LINKS AND PRIVACY CONSTRAINTS |
5783 | A GREEDY SPARSE APPROXIMATION ALGORITHM BASED ON L1-NORM SELECTION RULES |
5945 | A HARDWARE ARCHITECTURE FOR RECONFIGURABLE INTELLIGENT SURFACES WITH MINIMAL ACTIVE ELEMENTS FOR EXPLICIT PARTIAL CHANNEL ESTIMATION |
3281 | A HIERARCHICAL MODEL FOR DIALOG ACT RECOGNITION CONSIDERING ACOUSTIC AND LEXICAL CONTEXT INFORMATION |
4930 | A HIERARCHICAL TRACKER FOR MULTI-DOMAIN DIALOGUE STATE TRACKING |
4135 | A HYBRID APPROACH FOR THERMOGRAPHIC IMAGING WITH DEEP LEARNING |
4220 | A HYBRID MODEL FOR BIPOLAR DISORDER CLASSIFICATION FROM VISUAL INFORMATION |
1483 | A Hybrid Structural Sparse Error Model for Image Deblocking |
3282 | A HYBRID TEXT NORMALIZATION SYSTEM USING MULTI-HEAD SELF-ATTENTION FOR MANDARIN |
4331 | A Large-Scale Deep Architecture for Personalized Grocery Basket Recommendations |
4365 | A LEARNING APPROACH TO COOPERATIVE COMMUNICATION SYSTEM DESIGN |
2321 | A LIGHTWEIGHT MULTI-LABEL SEGMENTATION NETWORK FOR MOBILE IRIS BIOMETRICS |
4509 | A LINEAR TIME PARTITIONING ALGORITHM FOR FREQUENCY WEIGHTED IMPURITY FUNCTIONS |
3310 | A LOW-COMPLEXITY MAP DETECTOR FOR DISTRIBUTED NETWORKS |
3877 | A LOW-DIMENSIONALITY METHOD FOR DATA-DRIVEN GRAPH LEARNING |
2245 | A LOW-LATENCY SUCCESSIVE CANCELLATION HYBRID DECODER FOR CONVOLUTIONAL POLAR CODES |
3450 | A Low-Resolution ADC Proof-of-Concept Development for A Fully-Digital Millimeter-Wave Joint Communication-Radar |
2544 | A MAXIMUM LIKELIHOOD APPROACH TO MULTI-OBJECTIVE LEARNING USING GENERALIZED GAUSSIAN DISTRIBUTIONS FOR DNN-BASED SPEECH ENHANCEMENT |
2273 | A MEMORY AUGMENTED ARCHITECTURE FOR CONTINUOUS SPEAKER IDENTIFICATION IN MEETINGS |
5672 | A Method for Millimeter-Wave Imaging of Concealed Objects via De-Aliasing |
3413 | A MINIMAL PERSONALIZATION OF DYNAMIC BINAURAL SYNTHESIS WITH MIXED STRUCTURAL MODELING AND SCATTERING DELAY NETWORK |
4802 | A MODEL OF DOUBLE DESCENT FOR HIGH-DIMENSIONAL LOGISTIC REGRESSION |
5597 | A MODEL-BASED DEEP NETWORK FOR MRI RECONSTRUCTION USING APPROXIMATE MESSAGE PASSING ALGORITHM |
3852 | A MODEL-FREE APPROACH TO DISTRIBUTED TRANSMIT BEAMFORMING |
1861 | A MOMENT-BASED APPROACH FOR GUARANTEED TENSOR DECOMPOSITION |
3731 | A Monte Carlo Search-based Triplet Sampling Method for Learning Disentangled Representation of Impulsive Noise on Steering Gear |
1475 | A MULTICHANNEL KALMAN-BASED WIENER FILTER APPROACH FOR SPEAKER INTERFERENCE REDUCTION IN MEETINGS |
4841 | A MULTI-DILATION AND MULTI-RESOLUTION FULLY CONVOLUTIONAL NETWORK FOR SINGING MELODY EXTRACTION |
3234 | A MULTI-PHASE GAMMATONE FILTERBANK FOR SPEECH SEPARATION VIA TASNET |
5146 | A MULTI-SCALED RECEPTIVE FIELD LEARNING APPROACH FOR MEDICAL IMAGE SEGMENTATION |
2461 | A MULTITAPER REASSIGNED SPECTROGRAM FOR INCREASED TIME-FREQUENCY LOCALIZATION PRECISION |
2323 | A multi-view approach for Mandarin non-native mispronunciation verification |
3533 | A NEURAL DOCUMENT LANGUAGE MODELING FRAMEWORK FOR SPOKEN DOCUMENT RETRIEVAL |
3010 | A NEURAL NETWORK BASED ON FIRST PRINCIPLES |
1887 | A NEURAL NETWORK FOR MONAURAL INTRUSIVE SPEECH INTELLIGIBILITY PREDICTION |
3671 | A NEURAL NETWORK-BASED SPIKE SORTING FEATURE MAP THAT RESOLVES SPIKE OVERLAP IN THE FEATURE SPACE |
5668 | A NEW APPLICATION OF ULTRASOUND SIGNAL PROCESSING FOR ARCHAEOLOGICAL CERAMIC CLASSIFICATION |
3932 | A NEW MULTIHYPOTHESIS PREDICTION SCHEME FOR COMPRESSED VIDEO SENSING RECONSTRUCTION |
2525 | A NEW PERSPECTIVE FOR FLEXIBLE FEATURE GATHERING IN SCENE TEXT RECOGNITION VIA CHARACTER ANCHOR POOLING |
2499 | A NEW SAMPLING SCHEME FOR DISTRIBUTED BLIND SPECTRUM SENSING USING ENERGY DETECTORS |
4347 | A NEW VARIATIONAL METHOD FOR DEEP SUPERVISED SEMANTIC IMAGE HASHING |
1106 | A NONINVASIVE METHOD TO DETECT DIABETES MELLITUS AND LUNG CANCER USING THE STACKED SPARSE AUTOENCODER |
4896 | A NOVEL APPROACH FOR INTELLIGIBILITY ASSESSMENT IN DYSARTHRIC SUBJECTS |
2293 | A NOVEL METHOD FOR OBTAINING DIFFUSE FIELD MEASUREMENTS FOR MICROPHONE CALIBRATION |
3972 | A NOVEL MOVING SPARSE ARRAY GEOMETRY WITH INCREASED DEGREES OF FREEDOM |
4921 | A NOVEL PRUNING APPROACH FOR BAGGING ENSEMBLE REGRESSION BASED ON SPARSE REPRESENTATION |
4877 | A NOVEL RANK SELECTION SCHEME IN TENSOR RING DECOMPOSITION BASED ON REINFORCEMENT LEARNING FOR DEEP NEURAL NETWORKS |
4791 | A NOVEL SALIENCY-DRIVEN OIL TANK DETECTION METHOD FOR SYNTHETIC APERTURE RADAR IMAGES |
1549 | A NOVEL TWO-PATHWAY ENCODER-DECODER NETWORK FOR 3D FACE RECONSTRUCTION |
5274 | A PARTIAL RELAXATION DOA ESTIMATOR BASED ON ORTHOGONAL MATCHING PURSUIT |
5926 | A Particle Gibbs Sampling Approach to Topology Inference in Gene Regulatory Networks |
2349 | A PENALTY ALTERNATING DIRECTION METHOD OF MULTIPLIERS FOR DECENTRALIZED COMPOSITE OPTIMIZATION |
4649 | A PRACTICAL TWO-STAGE TRAINING STRATEGY FOR MULTI-STREAM END-TO-END SPEECH RECOGNITION |
4284 | A PRIORI ESTIMATES OF THE GENERALIZATION ERROR FOR AUTOENCODERS |
4346 | A PROBABILISTIC SCHEME FOR REPRESENTATION LEARNING WITH RADIAL TRANSFORM IMAGES |
3782 | A PROTOTYPICAL TRIPLET LOSS FOR COVER DETECTION |
2338 | A PROXIMAL DUAL CONSENSUS METHOD FOR LINEARLY COUPLED MULTI-AGENT NON-CONVEX OPTIMIZATION |
1097 | A RANDOM GOSSIP BMUF PROCESS FOR NEURAL LANGUAGE MODELING |
1395 | A real time implementation of a Bayer domain image deblurring core for optical blur compensation |
2844 | A REAL-TIME DEEP NETWORK FOR CROWD COUNTING |
3086 | A RECURRENT VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT |
2570 | A RECURSIVE BAYESIAN SOLUTION FOR THE EXCESS OVER THRESHOLD DISTRIBUTION WITH STOCHASTIC PARAMETERS |
5823 | A RECURSIVE EDGE DETECTOR FOR COLOR FILTER ARRAY IMAGE |
6112 | A REGULARIZATION FRAMEWORK FOR LEARNING OVER MULTITASK GRAPHS |
5027 | A Regularized Attention Mechanism for Graph Attention Networks |
4925 | A RETURN TO DEREVERBERATION IN THE FREQUENCY DOMAIN USING A JOINT LEARNING APPROACH |
5460 | A ROBUST AUDIO-VISUAL SPEECH ENHANCEMENT MODEL |
5784 | A ROBUST SPEAKER CLUSTERING METHOD BASED ON DISCRETE TIED VARIATIONAL AUTOENCODER |
1286 | A SEGMENTATION BASED ROBUST DEEP LEARNING FRAMEWORK FOR MULTIMODAL RETINAL IMAGE REGISTRATION |
3235 | A Self-Attentive Emotion Recognition Network |
2874 | A SEMI-SUPERVISED APPROACH FOR IDENTIFYING ABNORMAL HEART SOUNDS USING VARIATIONAL AUTOENCODER |
3160 | A SEMI-SUPERVISED RANK TRACKING ALGORITHM FOR ON-LINE UNMIXING OF HYPERSPECTRAL IMAGES |
3583 | A SEQUENCE MATCHING NETWORK FOR POLYPHONIC SOUND EVENT LOCALIZATION AND DETECTION |
2361 | A SIAMESE CONTENT-ATTENTIVE GRAPH CONVOLUTIONAL NETWORK FOR PERSONALITY RECOGNITION USING PHYSIOLOGY |
2535 | A SIMPLE AND EFFICIENT ITERATIVE METHOD FOR TOA LOCALIZATION |
4527 | A SIMPLE BUT EFFECTIVE BERT MODEL FOR DIALOG STATE TRACKING ON RESOURCE-LIMITED SYSTEMS |
3834 | A SIMPLE DERIVATION OF AMP AND ITS STATE EVOLUTION VIA FIRST-ORDER CANCELLATION |
4136 | A SINGLE-RF ARCHITECTURE FOR MULTIUSER MASSIVE MIMO VIA REFLECTING SURFACES |
3004 | A Sparse Linear Array Approach in Automotive Radars Using Matrix Completion |
3119 | A STACKED-AUTOENCODER BASED END-TO-END LEARNING FRAMEWORK FOR DECODE-AND-FORWARD RELAY NETWORKS |
2024 | A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency |
5020 | A STUDY OF CHILD SPEECH EXTRACTION USING JOINT SPEECH ENHANCEMENT AND SEPARATION IN REALISTIC CONDITIONS |
5512 | A STUDY OF GENERALIZATION OF STOCHASTIC MIRROR DESCENT ALGORITHMS ON OVERPARAMETERIZED NONLINEAR MODELS |
4529 | A STUDY ON THE TRANSFERABILITY OF ADVERSARIAL ATTACKS IN SOUND EVENT CLASSIFICATION |
2539 | A SWITCHING TRANSMISSION GAME WITH LATENCY AS THE USER'S COMMUNICATION UTILITY |
1202 | A THEORETICAL BASIS FOR PRACTITIONERS HEURISTIC 1/N AND LONG-ONLY QUINTILE PORTFOLIO |
3461 | A TIME-BASED SAMPLING FRAMEWORK FOR FINITE-RATE-OF-INNOVATION SIGNALS |
4747 | A TIME-FREQUENCY NETWORK WITH CHANNEL ATTENTION AND NON-LOCAL MODULES FOR ARTIFICIAL BANDWIDTH EXTENSION |
5144 | A UNIFIED SEQUENCE-TO-SEQUENCE FRONT-END MODEL FOR MANDARIN TEXT-TO-SPEECH SYNTHESIS |
2295 | A VARIATIONAL BAYESIAN APPROACH FOR MULTICHANNEL THROUGH-WALL RADAR IMAGING WITH LOW-RANK AND SPARSE PRIORS |
1422 | A VISUAL-PILOT DEEP FUSION FOR TARGET SPEECH SEPARATION IN MULTI-TALKER NOISY ENVIRONMENT |
1655 | A WHITENESS TEST BASED ON THE SPECTRAL MEASURE OF LARGE NON-HERMITIAN RANDOM MATRICES |
2821 | A WIFI-BASED PASSIVE FALL DETECTION SYSTEM |
2244 | A ZEROTH-ORDER LEARNING ALGORITHM FOR ERGODIC OPTIMIZATION OF WIRELESS SYSTEMS WITH NO MODELS AND NO GRADIENTS |
3542 | ACCELERATING DISTRIBUTED DEEP LEARNING BY ADAPTIVE GRADIENT QUANTIZATION |
4244 | ACCELERATING LINEAR ALGEBRA KERNELS ON A MASSIVELY PARALLEL RECONFIGURABLE ARCHITECTURE. |
5267 | ACCENT ESTIMATION OF JAPANESE WORDS FROM THEIR SURFACES AND ROMANIZATIONS FOR BUILDING LARGE VOCABULARY ACCENT DICTIONARIES |
3398 | Accounting for microprosody in modeling intonation |
3815 | ACCURACY-ROBUSTNESS TRADE-OFF FOR POSITIVELY WEIGHTED NEURAL NETWORKS |
4321 | Accurate 6D Object Pose Estimation by Pose Conditioned Mesh Reconstruction |
3718 | ACCURATE AND SCALABLE VERSION IDENTIFICATION USING MUSICALLY-MOTIVATED EMBEDDINGS |
2199 | ACCURATE LOCALIZATION OF AUV IN MOTION BY EXPLICIT SOLUTION USING TIME DELAYS |
4359 | Accurate Semidefinite Relaxation Method for 3-D Rigid Body Localization Using AOA |
4530 | ACHIEVING FULLY-DIGITAL PERFORMANCE BY HYBRID ANALOG/DIGITAL BEAMFORMING IN WIDE-BAND MASSIVE-MIMO SYSTEMS |
5656 | ACHIEVING THE CAPACITY OF THE DNA STORAGE CHANNEL |
5239 | ACOUSTIC MATCHING BY EMBEDDING IMPULSE RESPONSES |
4018 | Acoustic Model Adaptation for Lecture Transcription and Intelligent Meeting Assistant Systems |
5602 | ACOUSTIC SCENE CLASSIFICATION FOR MISMATCHED RECORDING DEVICES USING HEATED-UP SOFTMAX AND SPECTRUM CORRECTION |
4208 | ACOUSTIC SCENE CLASSIFICATION USING DEEP RESIDUAL NETWORKS WITH LATE FUSION OF SEPARATED HIGH AND LOW FREQUENCY PATHS |
1982 | A-CRNN: A DOMAIN ADAPTATION MODEL FOR SOUND EVENT DETECTION |
1792 | ACTION-MANIPULATION ATTACKS ON STOCHASTIC BANDITS |
2131 | ACTIVE CONTROL OF LINE SPECTRAL NOISE WITH SIMULTANEOUS SECONDARY PATH MODELING WITHOUT AUXILIARY NOISE |
3880 | ACTIVE LEARNING WITH UNSUPERVISED ENSEMBLES OF CLASSIFIERS |
2464 | ACTIVE NOISE CONTROL OVER MULTIPLE REGIONS: PERFORMANCE ANALYSIS |
3903 | ACTIVE SEMI-SUPERVISED LEARNING FOR DIFFUSIONS ON GRAPHS |
3129 | ACU-NET: A 3D ATTENTION CONTEXT U-NET FOR MULTIPLE SCLEROSIS LESION SEGMENTATION |
1416 | ADAPTATION AND LEARNING IN MULTI-TASK DECISION SYSTEMS |
4961 | ADAPTATION OF RNN TRANSDUCER WITH TEXT-TO-SPEECH TECHNOLOGY FOR KEYWORD SPOTTING |
1967 | Adaptive blind audio source extraction supervised by dominant speaker identification using x-vectors |
4580 | Adaptive Distributed Stochastic Gradient Descent for Minimizing Delay in the Presence of Stragglers |
2854 | ADAPTIVE ELASTIC LOSS BASED ON PROGRESSIVE INTER-CLASS ASSOCIATION FOR CERVICAL HISTOLOGY IMAGE SEGMENTATION |
1108 | ADAPTIVE FEATURE ENHANCEMENT FOR FASHION LANDMARK DETECTION |
1508 | ADAPTIVE KNOWLEDGE DISTILLATION BASED ON ENTROPY |
4751 | ADAPTIVE MATCHED FILTER USING NON-TARGET FREE TRAINING DATA |
2163 | ADAPTIVE NORMALIZATION FOR FORECASTING LIMIT ORDER BOOK DATA USING CONVOLUTIONAL NEURAL NETWORKS |
4304 | Adaptive prediction of financial time-series for decision-making using a tensorial aggregation approach |
4371 | ADAPTIVE REGION AGGREGATION NETWORK: UNSUPERVISED DOMAIN ADAPTATION WITH ADVERSARIAL TRAINING FOR ECG DELINEATION |
1268 | ADAPTIVE RESOLUTION CHANGE USING UNCODED AREAS AND DICTIONARY LEARNING-BASED SUPER-RESOLUTION IN VERSATILE VIDEO CODING |
2478 | ADAPTIVE SEQUENTIAL INTERPOLATOR USING ACTIVE LEARNING FOR EFFICIENT EMULATION OF COMPLEX SYSTEMS |
1947 | Adaptive Subspace Detectors for Off-Grid Mismatched Targets |
5548 | ADDRESSING ACCENT MISMATCH IN MANDARIN-ENGLISH CODE-SWITCHING SPEECH RECOGNITION |
4416 | ADDRESSING CHALLENGES IN BUILDING WEB-SCALE CONTENT CLASSIFICATION SYSTEMS |
5240 | ADDRESSING THE CONFOUNDS OF INSTRUMENTATION IN SINGER IDENTIFICATION |
1889 | ADDRESSING THE POLYSEMY PROBLEM IN LANGUAGE MODELING WITH ATTENTIONAL MULTI-SENSE EMBEDDINGS |
3477 | ADI17: A FINE-GRAINED ARABIC DIALECT IDENTIFICATION DATASET |
1528 | ADMM-BASED ONE-BIT QUANTIZED SIGNAL DETECTION FOR MASSIVE MIMO SYSTEMS WITH HARDWARE IMPAIRMENTS |
3019 | ADRN: Attention-based Deep Residual Network for Hyperspectral Image Denoising |
3627 | Adversarial Anomaly Detection for Marked Spatio-Temporal Streaming Data |
1686 | Adversarial Attack inspired by K-Anonymity principles |
2876 | ADVERSARIAL ATTACK ON GMM I-VECTOR BASED SPEAKER VERIFICATION SYSTEMS |
5836 | Adversarial Attacks on Deep Unfolded Networks for Sparse Coding |
1946 | ADVERSARIAL DETECTION OF COUNTERFEITED PRINTABLE GRAPHICAL CODES: TOWARDS “ADVERSARIAL GAMES” IN PHYSICAL WORLD |
5071 | ADVERSARIAL EXAMPLE DETECTION BY CLASSIFICATION FOR DEEP SPEECH RECOGNITION |
2339 | ADVERSARIAL MIXUP SYNTHESIS TRAINING FOR UNSUPERVISED DOMAIN ADAPTATION |
2405 | ADVERSARIAL MULTI-TASK LEARNING FOR SPEAKER NORMALIZATION IN REPLAY DETECTION |
4693 | Adversarial Networks for Secure Wireless Communications |
2505 | Adversarial Text Image Super-Resolution Using Sinkhorn Distance |
2660 | ADVERSARIAL VIDEO COMPRESSION GUIDED BY SOFT EDGE DETECTION |
2055 | ADVMS: A MULTI-SOURCE MULTI-COST DEFENSE AGAINST ADVERSARIAL ATTACKS |
1189 | Age of Information with Finite Horizon and Partial Updates |
4642 | AGE-BASED SCHEDULING POLICY FOR FEDERATED LEARNING IN MOBILE EDGE NETWORKS |
2075 | AIPNET: GENERATIVE ADVERSARIAL PRE-TRAINING OF ACCENT-INVARIANT NETWORKS FOR END-TO-END SPEECH RECOGNITION |
3143 | AL2: PROGRESSIVE ACTIVATION LOSS FOR LEARNING GENERAL REPRESENTATIONS IN CLASSIFICATION NEURAL NETWORKS |
2248 | Algorithmic exploration of American English dialects |
4006 | ALIGNMENT-LENGTH SYNCHRONOUS DECODING FOR RNN TRANSDUCER |
5120 | ALIGNTTS: EFFICIENT FEED-FORWARD TEXT-TO-SPEECH SYSTEM WITHOUT EXPLICIT ALIGNMENT |
5503 | ALL IN ONE NETWORK FOR DRIVER ATTENTION MONITORING |
2476 | ALL YOU NEED IS A SECOND LOOK: TOWARDS TIGHTER ARBITRARY SHAPE TEXT DETECTION |
1565 | ALLOCATION OF COMPUTING TASKS IN DISTRIBUTED MEC SERVERS CO-POWERED BY RENEWABLE SOURCES AND THE POWER GRID |
3073 | ALTERNATIVE HALF-SAMPLE INTERPOLATION FILTERS FOR VERSATILE VIDEO CODING |
6015 | AN ACOUSTIC MODELLING BASED REMOTE ERROR SENSING APPROACH FOR QUIET ZONE GENERATION IN A NOISY ENVIRONMENT |
4423 | AN ADAPTIVE LINEAR ESTIMATOR BASED APPROACH TO BI-DIRECTIONAL MOTION COMPENSATED PREDICTION |
6075 | AN ADMM-BASED APPROACH TO ROBUST ARRAY PATTERN SYNTHESIS |
4574 | AN ALTERNATIVE SIGNATURE DESIGN USING L1 PRINCIPAL COMPONENTS FOR SPREAD-SPECTRUM STEGANOGRAPHY |
3109 | AN ANALYSIS OF SPEECH ENHANCEMENT AND RECOGNITION LOSSES IN LIMITED RESOURCES MULTI-TALKER SINGLE CHANNEL AUDIO-VISUAL ASR |
2018 | AN ANALYTICAL SOLUTION TO JACOBSEN ESTIMATOR FOR WINDOWED SIGNALS |
1603 | An attention enhanced multi-task model for objective speech assessment in real-world environments |
2291 | An Attention-Based Joint Acoustic and Text On-Device End-to-End Model |
1662 | AN EARLY TERMINATION SCHEME FOR SUCCESSIVE CANCELLATION LIST DECODING OF POLAR CODES |
5919 | AN EASY-IMPLEMENTIVE FRAMEWORK OF FAST SUBSPACE CLUSTERING FOR BIG DATA SETS |
6120 | AN EFFECTIVE STYLE TOKEN WEIGHT CONTROL TECHNIQUE FOR END-TO-END EMOTIONAL SPEECH SYNTHESIS |
2638 | AN EFFICIENT ALTERNATIVE TO NETWORK PRUNING THROUGH ENSEMBLE LEARNING |
2358 | AN EFFICIENT AUGMENTED LAGRANGIAN-BASED METHOD FOR LINEAR EQUALITY-CONSTRAINED LASSO |
6122 | AN EFFICIENT COUPLED DICTIONARY LEARNING METHOD |
5399 | AN EFFICIENT EKF BASED TRAINING ALGORITHM FOR LSTM-BASED ONLINE LEARNING |
4910 | AN EFFICIENT METHODOLOGY TO DE-ANONYMIZE THE 5G-NEW RADIO PHYSICAL DOWNLINK CONTROL CHANNEL |
6103 | An Embedding Cost Learning Framework Using GAN |
2692 | An Empirical Bayes Approach to Partially Labeled and Shuffled Data Sets |
5319 | AN EMPIRICAL STUDY OF CONV-TASNET |
4812 | An Empirical Study of Transformer-based Neural Language Model Adaptation |
1078 | AN EMPIRICAL STUDY ON ACOUSTIC FEEDBACK PATH ACROSS HEARING AID USERS |
3978 | AN ENHANCED DECODING ALGORITHM FOR CODED COMPRESSED SENSING |
2715 | An ensemble Based Approach for Generalized Detection of Spoofing Attacks to Automatic Speaker Recognizers |
2512 | AN IMPROVED DEEP NEURAL NETWORK FOR MODELING SPEAKER CHARACTERISTICS AT DIFFERENT TEMPORAL SCALES |
1349 | AN IMPROVED FRAME-UNIT-SELECTION BASED VOICE CONVERSION SYSTEM WITHOUT PARALLEL TRAINING DATA |
4528 | AN IMPROVED SELECTIVE ACTIVE NOISE CONTROL ALGORITHM BASED ON EMPIRICAL WAVELET TRANSFORM |
1048 | AN IMPROVED SOLUTION TO THE FREQUENCY-INVARIANT BEAMFORMING WITH CONCENTRIC CIRCULAR MICROPHONE ARRAYS |
2275 | AN LSTM BASED ARCHITECTURE TO RELATE SPEECH STIMULUS TO EEG |
3991 | AN LSTM-BASED DYNAMIC CHORD PROGRESSION GENERATION SYSTEM FOR INTERACTIVE MUSIC PERFORMANCE |
5031 | AN ODORANT ENCODING MACHINE FOR SAMPLING, RECONSTRUCTION AND ROBUST REPRESENTATION OF ODORANT IDENTITY |
4421 | AN ONLINE KERNEL SCALAR QUANTIZATION SCHEME FOR SIGNAL CLASSIFICATION |
6158 | An Online Plug-and-Play Algorithm for Regularized Image Reconstruction |
2201 | AN ONLINE SPEAKER-AWARE SPEECH SEPARATION APPROACH BASED ON TIME-DOMAIN REPRESENTATION |
3805 | AN ONTOLOGY-AWARE FRAMEWORK FOR AUDIO EVENT CLASSIFICATION |
2618 | AN OPTIMAL CHANNEL ESTIMATION SCHEME FOR INTELLIGENT REFLECTING SURFACES BASED ON A MINIMUM VARIANCE UNBIASED ESTIMATOR |
3768 | An optimal symmetric threshold strategy for remote estimation over the collision channel |
4390 | AN UNSUPERVISED RETINAL VESSEL EXTRACTION AND SEGMENTATION METHOD BASED ON A TUBE MARKED POINT PROCESS MODEL |
5935 | ANALYSIS OF ACOUSTIC FEATURES FOR SPEECH SOUND BASED CLASSIFICATION OF ASTHMATIC AND HEALTHY SUBJECTS |
3865 | ANALYZING ASR PRETRAINING FOR LOW-RESOURCE SPEECH-TO-TEXT TRANSLATION |
5885 | ANGULAR DISCRIMINATIVE DEEP FEATURE LEARNING FOR FACE VERIFICATION |
6086 | ANISOTROPIC GUIDED FILTERING |
5950 | ANOMALOUS SOUND DETECTION BASED ON INTERPOLATION DEEP NEURAL NETWORK |
3774 | Anomaly Detection for Time Series Using VAE-LSTM Hybrid Model |
2185 | ANOMALY DETECTION IN MIXED TIME-SERIES USING A CONVOLUTIONAL SPARSE REPRESENTATION WITH APPLICATION TO SPACECRAFT HEALTH MONITORING |
2781 | Anomaly Detection With Training Data in Hyperspectral Imagery |
3278 | AnomalyDAE: Dual autoencoder for anomaly detection on attributed networks |
1440 | Anti-jamming Routing for Internet of Satellites: A Reinforcement Learning Approach |
3876 | ANYTIME MINIBATCH WITH DELAYED GRADIENTS: SYSTEM PERFORMANCE AND CONVERGENCE ANALYSIS |
2342 | APB2FACE: AUDIO-GUIDED FACE REENACTMENT WITH AUXILIARY POSE AND BLINK SIGNALS |
4266 | Application Informed Motion Signal processing for finger motion tracking using wearable sensors |
2093 | Approaching Optimal Embedding in Audio Steganography with GAN |
3373 | APPROXIMATE BAYESIAN COMPUTATION WITH THE SLICED-WASSERSTEIN DISTANCE |
3308 | APPROXIMATE INFERENCE BY KULLBACK-LEIBLER TENSOR BELIEF PROPAGATION |
6085 | ARBITRARY LENGTH PERFECT INTEGER SEQUENCES USING ALL-PASS POLYNOMIAL |
3312 | ARNET: ATTENTION-BASED REFINEMENT NETWORK FOR FEW-SHOT SEMANTIC SEGMENTATION |
3116 | ARRAY-GEOMETRY-AWARE SPATIAL ACTIVE NOISE CONTROL BASED ON DIRECTION-OF-ARRIVAL WEIGHTING |
4285 | ARSM GRADIENT ESTIMATOR FOR SUPERVISED LEARNING TO RANK |
5372 | ARTIFICIAL BANDWIDTH EXTENSION USING CONDITIONAL VARIATIONAL AUTO-ENCODERS AND ADVERSARIAL LEARNING |
6013 | ASR ERROR CORRECTION AND DOMAIN ADAPTATION USING MACHINE TRANSLATION |
3400 | ASR IS ALL YOU NEED: CROSS-MODAL DISTILLATION FOR LIP READING |
1829 | ASSESSING THE SCOPE OF GENERALIZED COUNTERMEASURES FOR ANTI-SPOOFING |
1756 | ASSIMILATION-BASED LEARNING OF CHAOTIC DYNAMICAL SYSTEMS FROM NOISY AND PARTIAL DATA |
1998 | Asymptotic Stochastic Analysis of Partially Relaxed DML |
1157 | ASYMPTOTICALLY OPTIMAL BLIND CALIBRATION OF ACOUSTIC VECTOR SENSOR UNIFORM LINEAR ARRAYS |
5589 | ASYNCHROUNOUS DECENTRALIZED LEARNING OF A NEURAL NETWORK |
4754 | ATOMIC NORM BASED LOCALIZATION OF FAR-FIELD AND NEAR-FIELD SIGNALS WITH GENERALIZED SYMMETRIC ARRAYS |
2477 | ATOMIC NORM DENOISING IN BLIND TWO-DIMENSIONAL SUPER-RESOLUTION |
2851 | ATTENTION DRIVEN FUSION FOR MULTI-MODAL EMOTION RECOGNITION |
4360 | ATTENTION GUIDED REGION DIVISION FOR CROWD COUNTING |
1803 | ATTENTION MECHANISM ENHANCED KERNEL PREDICTION NETWORKS FOR DENOISING OF BURST IMAGES |
2357 | ATTENTIONAL FUSED TEMPORAL TRANSFORMATION NETWORK FOR VIDEO ACTION RECOGNITION |
1863 | ATTENTION-BASED ASR WITH LIGHTWEIGHT AND DYNAMIC CONVOLUTIONS |
2474 | ATTENTION-BASED CURIOSITY-DRIVEN EXPLORATION IN DEEP REINFORCEMENT LEARNING |
4296 | ATTENTION-BASED GATED SCALING ADAPTIVE ACOUSTIC MODEL FOR CTC-BASED SPEECH RECOGNITION |
3148 | Attention-guided Deraining Network via Stage-wise Learning |
2632 | ATTENTION-MASK DENSE MERGER (ATTENDENSE) DEEP HDR FOR GHOST REMOVAL |
2288 | ATTENTIVE CUTMIX: AN ENHANCED DATA AUGMENTATION APPROACH FOR DEEP LEARNING BASED IMAGE CLASSIFICATION |
1086 | Attentive Item2vec: Neural Attentive User Representations |
2940 | ATTENTIVE MODALITY HOPPING MECHANISM FOR SPEECH EMOTION RECOGNITION |
1363 | AUDIO CODEC ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS |
1997 | AUDIO FEATURE EXTRACTION FOR VEHICLE ENGINE NOISE CLASSIFICATION |
4879 | Audio sound determination using feature space attention based convolution recurrent neural network |
6118 | AUDIO SOURCE SEPARATION USING VARIATIONAL AUTOENCODERS AND WEAK CLASS SUPERVISION |
5232 | AUDIO-ASSISTED IMAGE INPAINTING FOR TALKING FACES |
4225 | Audio-attention discriminative language model for ASR rescoring |
1569 | AUDIO-BASED AUTO-TAGGING WITH CONTEXTUAL TAGS FOR MUSIC |
3140 | AUDIO-BASED DETECTION OF EXPLICIT CONTENT IN MUSIC |
2647 | AUDIO-VISUAL CALIBRATION WITH POLYNOMIAL REGRESSION FOR 2-D PROJECTION USING SVD-PHAT |
2305 | AUDIO-VISUAL RECOGNITION OF OVERLAPPED SPEECH FOR THE LRS2 DATASET |
1119 | AUDITORY MODEL BASED SUBSETTING OF HEAD-RELATED TRANSFER FUNCTION DATASETS |
3540 | AUGLABEL: EXPLOITING WORD REPRESENTATIONS TO AUGMENT LABELS FOR FACE ATTRIBUTE CLASSIFICATION |
1263 | Augmentation Data Synthesis via GANs: Boosting Latent Fingerprint Reconstruction |
2908 | Augmented Grad-CAM: heat-maps super resolution through augmentation |
4599 | AUGMENTING MOLECULAR IMAGES WITH VECTOR REPRESENTATIONS AS A FEATURIZATION TECHNIQUE FOR DRUG CLASSIFICATION |
2194 | Auto-FAS: Searching Lightweight Networks for Face Anti-Spoofing |
2582 | AUTOMATIC AND SIMULTANEOUS ADJUSTMENT OF LEARNING RATE AND MOMENTUM FOR STOCHASTIC GRADIENT-BASED OPTIMIZATION METHODS |
5909 | AUTOMATIC CLASSIFICATION OF VOLUMES OF WATER USING SWALLOW SOUNDS FROM CERVICAL AUSCULTATION |
3511 | AUTOMATIC DATA AUGMENTATION VIA DEEP REINFORCEMENT LEARNING FOR EFFECTIVE KIDNEY TUMOR SEGMENTATION |
3131 | AUTOMATIC EPILEPTIC SEIZURE ONSET-OFFSET DETECTION BASED ON CNN IN SCALP EEG |
4960 | AUTOMATIC EVENT DETECTION OF REM SLEEP WITHOUT ATONIA FROM POLYSOMNOGRAPHY SIGNALS USING DEEP NEURAL NETWORKS |
4663 | AUTOMATIC FLUENCY EVALUATION OF SPONTANEOUS SPEECH USING DISFLUENCY-BASED FEATURES |
5692 | AUTOMATIC IDENTIFICATION OF SPEAKERS FROM HEAD GESTURES IN A NARRATION |
1109 | Automatic Lyrics Alignment and Transcription in Polyphonic Music: Does Background music help? |
2828 | AUTOMATIC PREDICTION OF SUICIDAL RISK IN MILITARY COUPLES USING MULTIMODAL INTERACTION CUES FROM COUPLES CONVERSATIONS |
4049 | AUTOMATIC VOCAL TRACT LANDMARK TRACKING IN RTMRI USING FULLY CONVOLUTIONAL NETWORKS AND KALMAN FILTER |
4587 | AUTOMOTIVE COLLISION RISK ESTIMATION UNDER COOPERATIVE SENSING |
5354 | AUTOMOTIVE RADAR SIGNAL INTERFERENCE MITIGATION USING RNN WITH SELF ATTENTION |
3924 | AUTOREGRESSIVE PARAMETER ESTIMATION WITH DNN-BASED PRE-PROCESSING |
3753 | AUXILIARY CAPSULES FOR NATURAL LANGUAGE UNDERSTANDING |
3041 | AV(SE)²: AUDIO-VISUAL SQUEEZE-EXCITE SPEECH ENHANCEMENT |
2059 | AVA Active Speaker: An Audio-Visual Dataset for Active Speaker Detection |
5069 | BACK-AND-FORTH PREDICTION FOR DEEP TENSOR COMPRESSION |
1581 | Back-to-Back Butterfly Network, an Adaptive Permutation Network for New Communication Standards |
5256 | BALANCED BINARY NEURAL NETWORKS WITH GATED RESIDUAL |
2586 | BALANCING RATES AND VARIANCE VIA ADAPTIVE BATCH-SIZES IN FIRST-ORDER STOCHASTIC OPTIMIZATION |
2486 | BANDIT SAMPLING FOR FASTER ACTIVITY AND DATA DETECTION IN MASSIVE RANDOM ACCESS |
2125 | Bandwidth extension of musical audio signals with no side information using dilated convolutional neural networks |
5699 | BANGLA VOICE COMMAND RECOGNITION IN END-TO-END SYSTEM USING TOPIC MODELING BASED CONTEXTUAL RESCORING |
2554 | BATMAN: BAYESIAN TARGET MODELLING FOR ACTIVE INFERENCE |
2053 | BAYESIAN ESTIMATION OF PLDA WITH NOISY TRAINING LABELS, WITH APPLICATIONS TO SPEAKER VERIFICATION |
3103 | BAYESIAN MULTIPLE CHANGE-POINT DETECTION WITH LIMITED COMMUNICATION |
2805 | BBAND INDEX: A NO-REFERENCE BANDING ARTIFACT PREDICTOR |
4008 | BBA-NET: A bi-branch attention network for crowd counting |
5502 | BEAM ELIMINATION BASED ON SEQUENTIALLY ESTIMATED A POSTERIORI PROBABILITIES OF WINNING |
2324 | BEAMFORMED FEATURE FOR LEARNING-BASED DUAL-CHANNEL SPEECH SEPARATION |
4743 | BEAMFORMING DESIGN FOR HIGH-RESOLUTION LOW-INTENSITY FOCUSED ULTRASOUND NEUROMODULATION |
3827 | BEAMFORMING IN INTELLIGENT ENVIRONMENTS BASED ON ULTRA-MASSIVE MIMO PLATFORMS IN MILLIMETER WAVE AND TERAHERTZ BANDS |
2363 | BEAM-TASNET: TIME-DOMAIN AUDIO SEPARATION NETWORK MEETS FREQUENCY-DOMAIN BEAMFORMER |
5770 | BERT IS NOT ALL YOU NEED FOR COMMONSENSE INFERENCE |
3825 | BETTER SAFE THAN SORRY: RISK-AWARE NONLINEAR BAYESIAN ESTIMATION |
1481 | BEYOND THE DCASE 2017 CHALLENGE ON RARE SOUND EVENT DETECTION: A PROPOSAL FOR A MORE REALISTIC TRAINING AND TEST FRAMEWORK |
1154 | BILATERAL RECURRENT NETWORK FOR SINGLE IMAGE DERAINING |
6146 | BILEVEL OPTIMIZATION USING STATIONARY POINT OF LOWER-LEVEL OBJECTIVE FUNCTION |
1415 | BINARY PROBABILITY MODEL FOR LEARNING BASED IMAGE COMPRESSION |
5315 | BINAURAL AUDIO SOURCE REMIXING WITH MICROPHONE ARRAY LISTENING DEVICES |
4569 | BIO-MIMETIC ATTENTIONAL FEEDBACK IN MUSIC SOURCE SEPARATION |
5791 | BIPARTITE BELIEF PROPAGATION POLAR DECODING WITH BIT FLIPPING |
4442 | BIT ALLOCATION FOR MULTI-TASK COLLABORATIVE INTELLIGENCE |
5390 | BLASTER: An Off-Grid Method for Blind and Regularized Acoustic Echoes Retrieval |
5800 | BLIND ADAPTIVE EQUALIZATION USING BIAS-COMPENSATED RLS METHOD |
3885 | BLIND BOUNDED SOURCE SEPARATION USING NEURAL NETWORKS WITH LOCAL LEARNING RULES |
6125 | Blind Constant Modulus Multiuser Detection via Low-Rank Approximation |
6069 | BLIND DETERMINATION OF THE NUMBER OF SOURCES USING DISTANCE CORRELATION |
1704 | Blind Hyperspectral Unmixing using Dual Branch Deep Autoencoder with Orthogonal Sparse Prior |
3757 | BLIND INFERENCE OF CENTRALITY RANKINGS FROM GRAPH SIGNALS |
4402 | BLIND MULTI-SPECTRAL IMAGE PAN-SHARPENING |
3096 | BLIND QUALITY ASSESSMENT OF CAMERA IMAGES BASED ON STRUCTURE, TEXTURE AND COLOR INFORMATION |
3388 | BLIND SOURCE SEPARATION OF GRAPH SIGNALS |
1366 | BLOOD PRESSURE ESTIMATION FROM PPG SIGNALS USING CONVOLUTIONAL NEURAL NETWORKS AND SIAMESE NETWORK |
3349 | Body movement generation for expressive violin performance applying neural networks |
1836 | BOFFIN TTS: FEW-SHOT SPEAKER ADAPTATION BY BAYESIAN OPTIMIZATION |
5840 | BOOSTED LOCALITY SENSITIVE HASHING: DISCRIMINATIVE BINARY CODES FOR SOURCE SEPARATION |
1944 | BP-VB-EP BASED STATIC AND DYNAMIC SPARSE BAYESIAN LEARNING WITH KRONECKER STRUCTURED DICTIONARIES |
4045 | BREATHING AND SPEECH PLANNING IN SPONTANEOUS SPEECH SYNTHESIS |
1131 | Bridging Mixture Density Networks with Meta-learning for Automatic Speaker Identification |
2812 | BRINGING IN THE OUTLIERS: A SPARSE SUBSPACE CLUSTERING APPROACH TO LEARN A DICTIONARY OF MOUSE ULTRASONIC VOCALIZATIONS |
3893 | BUILDING FIRMLY NONEXPANSIVE CONVOLUTIONAL NEURAL NETWORKS |
2731 | BUT System for the Second DIHARD Speech Diarization Challenge |
4729 | Byzantine-Robust Decentralized Stochastic Optimization |
1270 | C3DVQA: FULL-REFERENCE VIDEO QUALITY ASSESSMENT WITH 3D CONVOLUTIONAL NEURAL NETWORK |
1494 | CAD-AEC: CONTEXT-AWARE DEEP ACOUSTIC ECHO CANCELLATION |
1358 | CAMERA CONFIGURATION DESIGN IN COOPERATIVE ACTIVE VISUAL 3D RECONSTRUCTION: A STATISTICAL APPROACH |
2597 | CAN EVERY ANALOG SYSTEM BE SIMULATED ON A DIGITAL COMPUTER? |
4562 | Capacity of the Erasure Shuffling Channel |
3011 | CARTOON-TEXTURE DECOMPOSITION-BASED VARIATIONAL PANSHARPENING |
1337 | CELL-PHONE CLASSIFICATION: A CONVOLUTIONAL NEURAL NETWORK APPROACH EXPLOITING ELECTROMAGNETIC EMANATIONS |
5930 | CGCNN: COMPLEX GABOR CONVOLUTIONAL NEURAL NETWORK ON RAW SPEECH |
5556 | CHALLENGES AND PERSPECTIVES IN NEUROMORPHIC-BASED VISUAL IOT SYSTEMS AND NETWORKS |
2038 | CHANNEL ADVERSARIAL TRAINING FOR SPEAKER VERIFICATION AND DIARIZATION |
4456 | CHANNEL ATTENTION BASED GENERATIVE NETWORK FOR ROBUST VISUAL TRACKING |
4383 | Channel Charting: An Euclidean Distance Matrix Completion Perspective |
1138 | CHANNEL COVARIANCE ESTIMATION IN MULTIUSER MASSIVE MIMO SYSTEMS WITH AN APPROACH BASED ON INFINITE DIMENSIONAL HILBERT SPACES |
5711 | CHANNEL INVARIANT SPEAKER EMBEDDING LEARNING WITH JOINT MULTI-TASK AND ADVERSARIAL TRAINING |
5424 | CHANNEL SELECTION OVER RIEMANNIAN MANIFOLD WITH NON-STATIONARITY CONSIDERATION FOR BRAIN-COMPUTER INTERFACE APPLICATIONS |
2294 | CHANNEL-ATTENTION DENSE U-NET FOR MULTICHANNEL SPEECH ENHANCEMENT |
5307 | CHARACTERIZATION OF A SNAPSHOT FOURIER TRANSFORM IMAGING SPECTROMETER BASED ON AN ARRAY OF FABRY-PEROT INTERFEROMETERS |
2738 | Characterizing Adversarial Speech Examples Using Self-Attention U-Net Enhancement |
1760 | Character-Level Lexical Emotion Recognition |
4575 | CHIRPING UP THE RIGHT TREE: INCORPORATING BIOLOGICAL TAXONOMIES INTO DEEP BIOACOUSTIC CLASSIFIERS |
6139 | Chronological Age Estimation Under the Guidance of Age-Related Facial Attributes |
2177 | CIF: CONTINUOUS INTEGRATE-AND-FIRE FOR END-TO-END SPEECH RECOGNITION |
4338 | CLASSIFICATION OF DEPTH AND SURFACE EDGES WITH DEEP FEATURES |
4880 | CLASSIFICATION OF EPILEPTIC IEEG SIGNALS BY CNN AND DATA AUGMENTATION |
2928 | CLASSIFICATION OF HIGH-DIMENSIONAL MOTOR IMAGERY TASKS BASED ON AN END-TO-END ROLE ASSIGNED CONVOLUTIONAL NEURAL NETWORK |
4850 | Classify and explain: an interpretable convolutional neural network for lung cancer diagnosis |
2822 | CLASSIFYING ANOMALIES FOR NETWORK SECURITY |
5086 | CLASSIFYING PARTIALLY LABELED NETWORKED DATA VIA LOGISTIC NETWORK LASSO |
1702 | CLCNET: DEEP LEARNING-BASED NOISE REDUCTION FOR HEARING AIDS USING COMPLEX LINEAR CODING |
3267 | CLOCK SYNCHRONIZATION OVER NETWORKS USING SAWTOOTH MODELS |
3327 | CLOTHO: AN AUDIO CAPTIONING DATASET |
2183 | CLOUD-DRIVEN MULTI-WAY MULTIPLE-ANTENNA RELAY SYSTEMS: BEST-USER-LINK SELECTION AND JOINT MMSE DETECTION |
1072 | CLUSTERING OF NONNEGATIVE DATA AND AN APPLICATION TO MATRIX COMPLETION |
2740 | Clutter Identification Based on Sparse Recovery and L1-Type Probabilistic Distance Measures |
5500 | CN-CELEB: A CHALLENGING CHINESE SPEAKER RECOGNITION DATASET |
5369 | CNN-based Analog CSI Feedback in FDD MIMO-OFDM Systems |
4151 | COCHLEAR SIGNAL PROCESSING: A PLATFORM FOR LEARNING THE FUNDAMENTALS OF DIGITAL SIGNAL PROCESSING |
1727 | Coded Illumination and Multiplexing for Lensless Imaging |
5356 | CODE-SWITCHED SPEECH SYNTHESIS USING BILINGUAL PHONETIC POSTERIORGRAM WITH ONLY MONOLINGUAL CORPORA |
3549 | COGANS FOR UNSUPERVISED VISUAL SPEECH ADAPTATION TO NEW SPEAKERS |
1728 | COINCIDENCE, CATEGORIZATION, AND CONSOLIDATION: LEARNING TO RECOGNIZE SOUNDS WITH MINIMAL SUPERVISION |
1229 | COLOR AND ANGULAR RECONSTRUCTION OF LIGHT FIELDS FROM INCOMPLETE-COLOR CODED PROJECTIONS |
3453 | COLOR STABILIZATION FOR MULTI-CAMERA LIGHT-FIELD IMAGING |
1461 | COLOUR COMPRESSION OF PLENOPTIC POINT CLOUDS USING RAHT-KLT WITH PRIOR COLOUR CLUSTERING AND SPECULAR/DIFFUSE COMPONENT SEPARATION |
4336 | COMBINING ACOUSTICS, CONTENT AND INTERACTION FEATURES TO FIND HOT SPOTS IN MEETINGS |
2748 | COMBINING CGAN AND MIL FOR HOTSPOT SEGMENTATION IN BONE SCINTIGRAPHY |
5284 | COMBINING DEEP EMBEDDINGS OF ACOUSTIC AND ARTICULATORY FEATURES FOR SPEAKER IDENTIFICATION |
5312 | COMMUNICATION CONSTRAINED LEARNING WITH UNCERTAIN MODELS |
4724 | COMMUTING CONDITIONAL GANS FOR MULTI-MODAL FUSION |
2379 | COMPARE LEARNING: BI-ATTENTION NETWORK FOR FEW-SHOT LEARNING |
3770 | Comparison of Glottal Closure Instants Detection Algorithms for Emotional Speech |
3012 | COMPARISON OF USER MODELS BASED ON GMM-UBM AND I-VECTORS FOR SPEECH, HANDWRITING, AND GAIT ASSESSMENT OF PARKINSON'S DISEASE PATIENTS |
5348 | COMPLEX PAIRWISE ACTIVITY ANALYSIS VIA INSTANCE LEVEL EVOLUTION REASONING |
2366 | COMPLEX TRAINABLE ISTA FOR LINEAR AND NONLINEAR INVERSE PROBLEMS |
2912 | COMPLEX TRANSFORMER: A FRAMEWORK FOR MODELING COMPLEX-VALUED SEQUENCE |
1235 | COMPLEXITY REDUCTION METHODS FOR INDEX MODULATION BASED DUAL-FUNCTION RADAR COMMUNICATION SYSTEMS |
2196 | COMPOSITE DYNAMIC TEXTURE SYNTHESIS USING HIERARCHICAL LINEAR DYNAMICAL SYSTEM |
5628 | COMPRESSED SENSING BASED CHANNEL ESTIMATION AND OPEN-LOOP TRAINING DESIGN FOR HYBRID ANALOG-DIGITAL MASSIVE MIMO SYSTEMS |
1952 | COMPRESSING FLOW FIELDS WITH EDGE-AWARE HOMOGENEOUS DIFFUSION INPAINTING |
4337 | COMPRESSIVE 2-D OFF-GRID DOA ESTIMATION FOR PROPELLER CAVITATION LOCALIZATION |
3211 | COMPRESSIVE ADAPTIVE BILATERAL FILTERING |
5408 | COMPUTABILITY OF THE PEAK VALUE OF BANDLIMITED SIGNALS |
4115 | COMPUTATION OF "BEST" INTERPOLANTS IN THE Lp SENSE |
2605 | COMPUTING HILBERT TRANSFORM AND SPECTRAL FACTORIZATION FOR SIGNAL SPACES OF SMOOTH FUNCTIONS |
4546 | CONCENTRATION-BASED POLYNOMIAL CALCULATIONS ON NICKED DNA |
1303 | CONDITIONAL DENSITY DRIVEN GRID DESIGN IN POINT-MASS FILTER |
5172 | CONDITIONAL DOMAIN ADVERSARIAL TRANSFER FOR ROBUST CROSS-SITE ADHD CLASSIFICATION USING FUNCTIONAL MRI |
3288 | CONDITIONAL MUTUAL INFORMATION NEURAL ESTIMATOR |
2710 | CONFIDENCE ESTIMATION FOR BLACK BOX AUTOMATIC SPEECH RECOGNITION SYSTEMS USING LATTICE RECURRENT NEURAL NETWORKS |
5572 | CONFIRMNET: CONVOLUTIONAL FIRMNET AND APPLICATION TO IMAGE DENOISING AND INPAINTING |
6117 | CONNECTIONS BETWEEN SPECTRAL PROPERTIES OF ASYMPTOTIC MAPPINGS AND SOLUTIONS TO WIRELESS NETWORK PROBLEMS |
3420 | CONSENSUS-BASED DISTRIBUTED CLUSTERING FOR IOT |
1169 | CONSISTENCY-AWARE MULTI-CHANNEL SPEECH ENHANCEMENT USING DEEP NEURAL NETWORKS |
1919 | CONSTANT ENVELOPE MASSIVE MIMO-OFDM PRECODING: AN IMPROVED FORMULATION AND SOLUTION |
4199 | Constant-Envelope Precoding for Satellite Systems |
3643 | CONSTRAINED SPECTRAL CLUSTERING FOR DYNAMIC COMMUNITY DETECTION |
2640 | CONTENT BASED SINGING VOICE EXTRACTION FROM A MUSICAL MIXTURE |
5114 | Content VS Context: How about “Walking Hand-In-Hand” for Image Clustering? |
6039 | CONTEXT AND UNCERTAINTY MODELING FOR ONLINE SPEAKER CHANGE DETECTION |
5226 | CONTINUAL LEARNING FOR INFINITE HIERARCHICAL CHANGE-POINT DETECTION |
1652 | CONTINUAL LEARNING THROUGH ONE-CLASS CLASSIFICATION USING VAE |
4592 | CONTINUOUS SPEECH SEPARATION: DATASET AND ANALYSIS |
5182 | CONTROL OF LINEAR DYNAMICAL SYSTEMS USING SPARSE INPUTS |
2994 | CONTROLLABLE TIME-DELAY TRANSFORMER FOR REAL-TIME PUNCTUATION PREDICTION AND DISFLUENCY DETECTION |
3189 | CONTROLLING THE PERCEIVED SOUND QUALITY FOR DIALOGUE ENHANCEMENT WITH DEEP LEARNING |
3299 | CONVERGENCE-GUARANTEED INDEPENDENT POSITIVE SEMIDEFINITE TENSOR ANALYSIS BASED ON STUDENT'S T DISTRIBUTION |
3049 | CONVERTING WRITTEN LANGUAGE TO SPOKEN LANGUAGE WITH NEURAL MACHINE TRANSLATION FOR LANGUAGE MODELING |
1933 | CONVEX OPTIMISATION-BASED PRIVACY-PRESERVING DISTRIBUTED AVERAGE CONSENSUS IN WIRELESS SENSOR NETWORKS |
1032 | CONVOLUTIONAL BEAMSPACE FOR ARRAY SIGNAL PROCESSING |
2868 | COOPERATIVE LEARNING VIA FEDERATED DISTILLATION OVER FADING CHANNELS |
2334 | CORRDROP: CORRELATION BASED DROPOUT FOR CONVOLUTIONAL NEURAL NETWORKS |
4089 | CORRECTION OF AUTOMATIC SPEECH RECOGNITION WITH TRANSFORMER SEQUENCE-TO-SEQUENCE MODEL |
3485 | CORRELATED MULTI-ARMED BANDITS WITH A LATENT RANDOM SOURCE |
1991 | CORRGAN: SAMPLING REALISTIC FINANCIAL CORRELATION MATRICES USING GENERATIVE ADVERSARIAL NETWORKS |
4236 | COST AWARE ADVERSARIAL LEARNING |
4709 | COUNTING DENSE OBJECTS IN REMOTE SENSING IMAGES |
4577 | COUPLED TRAINING OF SEQUENCE-TO-SEQUENCE MODELS FOR ACCENTED SPEECH RECOGNITION |
5802 | CP-GAN: CONTEXT PYRAMID GENERATIVE ADVERSARIAL NETWORK FOR SPEECH ENHANCEMENT |
2498 | CPWC: CONTEXTUAL POINT WISE CONVOLUTION FOR OBJECT RECOGNITION |
4240 | CRA: A GENERIC COMPRESSION RATIO ADAPTER FOR END-TO-END DATA-DRIVEN IMAGE COMPRESSIVE SENSING RECONSTRUCTION FRAMEWORKS |
5752 | Cramer-Rao bound on DOA Estimation of finite bandwidth signals using a Moving Sensor |
6079 | CRAMÉR-RAO BOUND UNDER NORM CONSTRAINT |
5276 | CRAMÉR-RAO BOUNDS FOR FLAW LOCALIZATION IN SUBSAMPLED MULTISTATIC MULTICHANNEL ULTRASOUND NDT DATA |
2134 | CRNN-CTC BASED MANDARIN KEYWORDS SPOTTING |
1435 | Cross Image Cubic Interpolator for Spatially Varying Exposures |
3775 | CROSS LINGUAL TRANSFER LEARNING FOR ZERO-RESOURCE DOMAIN ADAPTATION |
2190 | Cross-Domain Adaptation for Biometric Identification Using Photoplethysmogram |
2677 | CROSS-DOMAIN JOINT DICTIONARY LEARNING FOR ECG RECONSTRUCTION FROM PPG |
1423 | CROSS-LINGUAL TOPIC PREDICTION FOR SPEECH USING TRANSLATIONS |
3066 | CROSS-SPEAKER SILENT-SPEECH COMMAND WORD RECOGNITION USING ELECTRO-OPTICAL STOMATOGRAPHY |
5653 | CROSS-STAINED SEGMENTATION FROM RENAL BIOPSY IMAGES USING MULTI-LEVEL ADVERSARIAL LEARNING |
4978 | Cross-VAE: Towards Disentangling Expression from Identity for Human Faces |
5341 | Cross-view Attention Network for Breast Cancer Screening from Multi-view Mammograms |
5060 | CROWDSOURCING-BASED RANKING AGGREGATION FOR PERSON RE-IDENTIFICATION |
3403 | CS-R-FCN: CROSS-SUPERVISED LEARNING FOR LARGE-SCALE OBJECT DETECTION |
2599 | Cumulant Slice Reconstruction from Compressive Measurements and Its Application to Line Spectrum Estimation |
6159 | CURRICULUM LEARNING FOR SPEECH EMOTION RECOGNITION FROM CROWDSOURCED LABELS |
3392 | D2NA: DAY-TO-NIGHT ADAPTATION FOR VISION BASED PARKING MANAGEMENT SYSTEM |
4654 | DAMAGE-SENSITIVE AND DOMAIN-INVARIANT FEATURE EXTRACTION FOR VEHICLE-VIBRATION-BASED BRIDGE HEALTH MONITORING |
2484 | DATA AUGMENTATION USING EMPIRICAL MODE DECOMPOSITION ON NEURAL NETWORKS TO CLASSIFY IMPACT NOISE IN VEHICLE |
3478 | DATA SELECTION KERNEL CONJUGATE GRADIENT ALGORITHM |
4243 | DATA-DRIVEN HARMONIC FILTERS FOR AUDIO REPRESENTATION LEARNING |
1177 | DATA-DRIVEN MODEL SET DESIGN FOR MODEL AVERAGED PARTICLE FILTER |
3664 | DATA-DRIVEN WIND SPEED ESTIMATION USING MULTIPLE MICROPHONES |
1369 | DEBLURRING AND SUPER-RESOLUTION USING DEEP GATED FUSION ATTENTION NETWORKS FOR FACE IMAGES |
1199 | DECENTRALIZED EXPECTED CONSISTENT SIGNAL RECOVERY FOR QUANTIZATION MEASUREMENTS |
4942 | DECENTRALIZED MIN-MAX OPTIMIZATION: FORMULATIONS, ALGORITHMS AND APPLICATIONS IN NETWORK POISONING ATTACK |
2630 | Decentralized optimization with non-identical sampling in presence of stragglers |
4004 | Decentralized Stochastic Non-convex Optimization over Weakly Connected Time-varying Digraphs |
1847 | DECIDABLE VARIABLE-RATE DATAFLOW FOR HETEROGENEOUS SIGNAL PROCESSING SYSTEMS |
3138 | Decoding 5G-NR Communications via Deep Learning |
2927 | DECODING MOVEMENT IMAGINATION AND EXECUTION FROM EEG SIGNALS USING BCI-TRANSFER LEARNING METHOD BASED ON RELATION NETWORK |
2536 | DECOMPOSED CYCLEGAN FOR SINGLE IMAGE DERAINING WITH UNPAIRED DATA |
5045 | DEEP AUDIO-VISUAL SPEECH SEPARATION WITH ATTENTION MECHANISM |
2834 | DEEP AUTOTUNER: A PITCH CORRECTING NETWORK FOR SINGING PERFORMANCES |
4230 | DEEP CASA FOR TALKER-INDEPENDENT MONAURAL SPEECH SEPARATION |
4160 | DEEP CLUSTERING FOR DOMAIN ADAPTATION |
4196 | DEEP CLUSTERING WITH CONCRETE K-MEANS |
4175 | DEEP CONTEXTUALIZED ACOUSTIC REPRESENTATIONS FOR SEMI-SUPERVISED SPEECH RECOGNITION |
5598 | DEEP ENCODED LINGUISTIC AND ACOUSTIC CUES FOR ATTENTION BASED END TO END SPEECH EMOTION RECOGNITION |
5326 | DEEP EXPOSURE FUSION WITH DEGHOSTING VIA HOMOGRAPHY ESTIMATION AND ATTENTION LEARNING |
5410 | DEEP FLOW COLLABORATIVE NETWORK FOR ONLINE VISUAL TRACKING |
5192 | Deep geometric knowledge distillation with graphs |
5087 | DEEP IMAGE DEBLURRING USING LOCAL CORRELATION BLOCK |
4158 | DEEP JAMES-STEIN NEURAL NETWORKS FOR BRAIN-COMPUTER INTERFACES |
4868 | DEEP JOINT SOURCE-CHANNEL CODING FOR WIRELESS IMAGE RETRIEVAL |
4354 | DEEP JOINT-SOURCE CHANNEL CODING OF IMAGES WITH FEEDBACK |
1557 | DEEP LEARNING ABILITIES TO CLASSIFY INTRICATE VARIATIONS IN TEMPORAL DYNAMICS OF MULTIVARIATE TIME SERIES |
5554 | Deep Learning Based Bearing Fault Diagnosis Using 1D Convolutional Neural Network with Modified Octave Convolution |
4247 | DEEP LEARNING BASED PREDICTION OF HYPERNASALITY FOR CLINICAL APPLICATIONS |
1586 | DEEP LEARNING FOR ROBUST POWER CONTROL FOR WIRELESS NETWORKS |
4197 | DEEP LEARNING-BASED BEAM ALIGNMENT IN MMWAVE VEHICULAR NETWORKS |
2427 | DEEP MATRIX COMPLETION ON GRAPHS: APPLICATION IN DRUG TARGET INTERACTION PREDICTION |
1558 | DEEP META-RELATION NETWORK FOR VISUAL FEW-SHOT LEARNING |
4483 | DEEP METRIC LEARNING BASED ON CENTER-RANKED LOSS FOR GAIT RECOGNITION |
1282 | Deep Monocular Video Depth Estimation Using Temporal Attention |
3722 | DEEP MULTI-SCALE GABOR WAVELET NETWORK FOR IMAGE RESTORATION |
2304 | DEEP NEURAL NETWORK BASED MATRIX COMPLETION FOR INTERNET OF THINGS NETWORK LOCALIZATION |
5686 | DEEP NEURAL NETWORKS BASED AUTOMATIC SPEECH RECOGNITION FOR FOUR ETHIOPIAN LANGUAGES |
4148 | DEEP PERCEPTUAL OPTIMIZER FOR VIDEO PRECODING |
3174 | DEEP PRODUCT QUANTIZATION MODULE FOR EFFICIENT IMAGE RETRIEVAL |
5116 | DEEP RAINRATE ESTIMATION FROM HIGHLY ATTENUATED DOWNLINK SIGNALS OF GROUND-BASED COMMUNICATIONS SATELLITE TERMINALS |
1764 | DEEP RESIDUAL NETWORK FOR MSFA RAW IMAGE DENOISING |
1237 | DEEP SOFT INTERFERENCE CANCELLATION FOR MIMO DETECTION |
1685 | DEEP SPEECH EXTRACTION WITH TIME-VARYING SPATIAL FILTERING GUIDED BY DESIRED DIRECTION ATTRACTOR |
5457 | DEEP MULTI-REGION HASHING |
2438 | Deep-Neural-Network based Fall-back Mechanism in Interference-Aware Receiver Design |
4944 | DEEP-SST-EDDIES: A DEEP LEARNING FRAMEWORK TO DETECT OCEANIC EDDIES IN SEA SURFACE TEMPERATURE IMAGES |
2726 | Defending Graph Convolutional Networks against Adversarial Attacks |
3693 | Defense against adversarial attacks on spoofing countermeasures of ASV |
3426 | Deja-vu: Double Feature Presentation in Deep Transformer Networks |
3532 | DELIBERATION MODEL BASED TWO-PASS END-TO-END SPEECH RECOGNITION |
2019 | DEMYSTIFYING TASNET: A DISSECTING APPROACH |
1717 | DENOISING OF EVENT-BASED SENSORS WITH SPATIO-TEMPORAL CORRELATION |
5028 | DENSE CROWD COUNTING WITH STACKED POOLING FOR BOOSTING SCALE |
5830 | DENSE MAPPING OF INTRACELLULAR DIFFUSION AND DRIFT FROM SINGLE-PARTICLE TRACKING DATA ANALYSIS |
1746 | DENSE RESIDUAL NETWORK FOR RETINAL VESSEL SEGMENTATION |
4246 | DENSELY CONNECTED NEURAL NETWORK WITH DILATED CONVOLUTIONS FOR REAL-TIME SPEECH ENHANCEMENT IN THE TIME DOMAIN |
1392 | DEPTH ESTIMATION FROM SINGLE IMAGE THROUGH MULTI-PATH-MULTI-RATE DIVERSE FEATURE EXTRACTOR |
3799 | DEPTH MAP FINGERPRINTING AND SPLICING DETECTION |
2634 | DEPTHWISE-STFT BASED SEPARABLE CONVOLUTIONAL NEURAL NETWORKS |
4956 | Deriving Compact Feature Representations Via Annealed Contraction |
3676 | DESIGN CONSIDERATIONS FOR HYPOTHESIS REJECTION MODULES IN SPOKEN LANGUAGE UNDERSTANDING SYSTEMS |
1856 | DESIGN OF A CONVERGENCE-AWARE BASED EXPECTATION PROPAGATION ALGORITHM FOR UPLINK MIMO SCMA SYSTEMS |
1351 | DESIGN-GAN: CROSS-CATEGORY FASHION TRANSLATION DRIVEN BY LANDMARK ATTENTION |
3410 | DETECT INSIDER ATTACKS USING CNN IN DECENTRALIZED OPTIMIZATION |
2696 | DETECTING ADVERSARIAL ATTACKS IN TIME-SERIES DATA |
1378 | DETECTING AUTISM SPECTRUM DISORDER USING TOPOLOGICAL DATA ANALYSIS |
3909 | DETECTING EMOTION PRIMITIVES FROM SPEECH AND THEIR USE IN DISCERNING CATEGORICAL EMOTIONS |
2923 | Detecting Mismatch between Text Script and Voice-over Using Utterance Verification Based on Phoneme Recognition Ranking |
1753 | DETECTING MULTIPLE SPEECH DISFLUENCIES USING A DEEP RESIDUAL NETWORK WITH BIDIRECTIONAL LONG SHORT-TERM MEMORY |
3007 | DETECTION AND ANALYSIS OF T/D DELETION IN LIBRISPEECH |
4467 | DETECTION OF ADVERSARIAL ATTACKS AND CHARACTERIZATION OF ADVERSARIAL SUBSPACE |
2004 | DETECTION OF MALICIOUS VBSCRIPT USING STATIC AND DYNAMIC ANALYSIS WITH RECURRENT DEEP LEARNING |
3290 | DETECTION OF MILD DYSPNEA FROM PAIRS OF SPEECH RECORDINGS |
3951 | DETECTION OF S1 AND S2 LOCATIONS IN PHONOCARDIOGRAM SIGNALS USING ZERO FREQUENCY FILTER |
4239 | DETECTION OF SPEECH EVENTS AND SPEAKER CHARACTERISTICS THROUGH PHOTO-PLETHYSMOGRAPHIC SIGNAL NEURAL PROCESSING |
6145 | Detection of Speech Smoothing on Very Short Clips |
2783 | DETERMINED SOURCE SEPARATION USING THE SPARSITY OF IMPULSE RESPONSES |
5691 | DETERMINISTIC FEATURE DECOUPLING BY SURFING INVARIANCE MANIFOLDS |
4472 | DFSMN-SAN WITH PERSISTENT MEMORY MODEL FOR AUTOMATIC SPEECH RECOGNITION |
1455 | DGAN: DISENTANGLED REPRESENTATION LEARNING FOR ANISOTROPIC BRDF RECONSTRUCTION |
2285 | DIACRITIC-LEVEL PRONUNCIATION ANALYSIS USING PHONOLOGICAL FEATURES |
1937 | DIAGONALIZABLE SHIFT AND FILTERS FOR DIRECTED GRAPHS BASED ON THE JORDAN-CHEVALLEY DECOMPOSITION |
5433 | DIALOGUE HISTORY INTEGRATION INTO END-TO-END SIGNAL-TO-CONCEPT SPOKEN LANGUAGE UNDERSTANDING SYSTEMS |
1402 | DIFFERENTIABLE BRANCHING IN DEEP NETWORKS FOR FAST INFERENCE |
6088 | DIFFERENTIALLY MODULATED SPECTRALLY EFFICIENT FREQUENCY-DIVISION MULTIPLEXING |
5633 | DIGITAL WATERMARKING FOR PROTECTING AUDIO CLASSIFICATION DATASETS |
2810 | DILATED CONVOLUTIONAL NEURAL NETWORKS FOR PANORAMIC IMAGE SALIENCY PREDICTION |
6134 | DIRECTION OF ARRIVAL ESTIMATION FOR REVERBERANT SPEECH BASED ON ENHANCED DECOMPOSITION OF THE DIRECT SOUND |
4708 | DISCOVERING CAUSALITIES FROM CARDIOTOCOGRAPHY SIGNALS USING IMPROVED CONVERGENT CROSS MAPPING WITH GAUSSIAN PROCESSES |
1629 | Discrete Wasserstein Autoencoders for Document Retrieval |
1467 | Discriminant and sparsity based least squares regression with l1 regularization for feature representation |
1426 | DISCRIMINANT GENERATIVE ADVERSARIAL NETWORKS WITH ITS APPLICATION TO EQUIPMENT HEALTH CLASSIFICATION |
2861 | DISENTANGLED MULTIDIMENSIONAL METRIC LEARNING FOR MUSIC SIMILARITY |
4364 | DISENTANGLED SPEECH EMBEDDINGS USING CROSS-MODAL SELF-SUPERVISION |
3191 | Disentangling Controllable Object through Video Prediction Improves Visual Reinforcement Learning |
5583 | DISENTANGLING TIMBRE AND SINGING STYLE WITH MULTI-SINGER SINGING SYNTHESIS SYSTEM |
1661 | DISPERSIVE GRID-FREE ORTHOGONAL MATCHING PURSUIT FOR MODAL ESTIMATION IN OCEAN ACOUSTICS |
2954 | DISTILLING ATTENTION WEIGHTS FOR CTC-BASED ASR SYSTEMS |
1236 | DISTRIBUTED DETECTION OF SPARSE SIGNALS WITH 1-BIT DATA IN TWO-LEVEL TWO-DEGREE TREE-STRUCTURED SENSOR NETWORKS |
5292 | DISTRIBUTED EQUALIZATION AND POWER ALLOCATION FOR MULTI-CARRIER BIDIRECTIONAL FILTER-AND-FORWARD RELAY NETWORKS |
6082 | Distributed Nesterov gradient methods over arbitrary graphs |
4867 | DISTRIBUTED NON-ORTHOGONAL PILOT DESIGN FOR MULTI-CELL MASSIVE MIMO SYSTEMS |
1579 | Distributed Quantization for Sparse Time Sequences |
3892 | DISTRIBUTED TENSOR COMPLETION OVER NETWORKS |
5188 | DISTRIBUTED TRACKING AND CIRCUMNAVIGATION USING BEARING MEASUREMENTS |
3070 | DISTRIBUTED VERIFICATION OF BELIEF PRECISIONS CONVERGENCE IN GAUSSIAN BELIEF PROPAGATION |
2585 | DISTRIBUTED WAVE-DOMAIN ACTIVE NOISE CONTROL BASED ON THE DIFFUSION STRATEGY |
5126 | DISTRIBUTION OF THE PRODUCT OF A COMPLEX GAUSSIAN MATRIX AND VECTOR AND ITS SUM WITH A COMPLEX GAUSSIAN VECTOR |
2596 | DIVERGENCE-BASED ADAPTIVE EXTREME VIDEO COMPLETION |
1247 | Diversity and Sparsity: A New Perspective on Index Tracking |
5977 | dMazeRunner: OPTIMIZING CONVOLUTIONS ON DATAFLOW ACCELERATORS |
3406 | DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS |
2917 | DNN-based Mask Estimation Integrating Spectral and Spatial Features for Robust Beamforming |
5811 | DNN-BASED SPEECH PRESENCE PROBABILITY ESTIMATION FOR MULTI-FRAME SINGLE-MICROPHONE SPEECH ENHANCEMENT |
5018 | DNN-BASED SPEECH RECOGNITION FOR GLOBALPHONE LANGUAGES |
1680 | DNN-CHIP PREDICTOR: A MULTI-GRAINED GRAPH-BASED PERFORMANCE SIMULATOR FOR DNN ACCELERATORS |
1813 | DNN-SUPPORTED MASK-BASED CONVOLUTIONAL BEAMFORMING FOR SIMULTANEOUS DENOISING, DEREVERBERATION, AND SOURCE SEPARATION |
2721 | DOA ESTIMATION IN SYSTEMS WITH NONLINEARITIES FOR MMWAVE COMMUNICATIONS |
1560 | DOA TRACKING VIA SIGNAL-SUBSPACE PROJECTOR UPDATE |
4947 | DOMAIN ADAPTATION FOR GENERALIZATION OF FACE PRESENTATION ATTACK DETECTION IN MOBILE SETTINGS WITH MINIMAL INFORMATION |
4058 | DOMAIN ROBUST, FAST, AND COMPACT NEURAL LANGUAGE MODELS |
1343 | DRIFT DETECTION AND CORRECTION POST-TRACKING |
3027 | DRSS-BASED LOCALISATION USING WEIGHTED INSTRUMENTAL VARIABLES AND SELECTIVE POWER MEASUREMENT |
4174 | D-SLAM: DIFFUSION SOURCE LOCALIZATION AND TRAJECTORY MAPPING |
1424 | DUAL-PATH RNN: EFFICIENT LONG SEQUENCE MODELING FOR TIME-DOMAIN SINGLE-CHANNEL SPEECH SEPARATION |
1613 | DURATION ROBUST WEAKLY SUPERVISED SOUND EVENT DETECTION |
5121 | DYNA-BOLT: DOMAIN ADAPTIVE BINARY FACTORIZATION OF CURRENT WAVEFORMS FOR ENERGY DISAGGREGATION |
4755 | DYNAMIC ATTACK SCORING USING DISTRIBUTED LOCAL DETECTORS |
3409 | Dynamic Channel Pruning for Correlation Filter based Object Tracking |
2508 | DYNAMIC METASURFACE ANTENNAS FOR BIT-CONSTRAINED MIMO-OFDM RECEIVERS |
1248 | DYNAMIC OVERSAMPLING IN 1-BIT QUANTIZED ASYNCHRONOUS LARGE-SCALE MULTIPLE-ANTENNA SYSTEMS FOR SUSTAINABLE IOT NETWORKS |
4063 | Dynamic Resource Allocation for Wireless Edge Machine Learning with Latency and Accuracy Guarantees |
5754 | DYNAMIC RESOURCE OPTIMIZATION AND ALTITUDE SELECTION IN UAV-BASED MULTI-ACCESS EDGE COMPUTING |
4835 | DYNAMIC TEMPORAL RESIDUAL LEARNING FOR SPEECH RECOGNITION |
1843 | DYNAMIC VARIATIONAL AUTOENCODERS FOR VISUAL PROCESS MODELING |
4324 | DYNAMICALLY MODULATED DEEP METRIC LEARNING FOR VISUAL SEARCH |
5632 | DYSARTHRIC SPEECH RECOGNITION WITH LATTICE-FREE MMI |
5204 | E2E-SINCNET: TOWARD FULLY END-TO-END SPEECH RECOGNITION |
3261 | ECG HEARTBEAT CLASSIFICATION BASED ON MULTI-SCALE WAVELET CONVOLUTIONAL NEURAL NETWORKS |
5234 | EDGEFOOL: AN ADVERSARIAL IMAGE ENHANCEMENT FILTER |
4748 | EDNFC-NET: CONVOLUTIONAL NEURAL NETWORK WITH NESTED FEATURE CONCATENATION FOR NUCLEI-INSTANCE SEGMENTATION |
3758 | EEG CONNECTIVITY-INFORMED COOPERATIVE ADAPTIVE LINE ENHANCER FOR RECOGNITION OF BRAIN STATE |
5317 | EEG FEATURE SELECTION USING ORTHOGONAL REGRESSION: APPLICATION TO EMOTION RECOGNITION |
3209 | EFFECT OF CHOICE OF PROBABILITY DISTRIBUTION, RANDOMNESS, AND SEARCH METHODS FOR ALIGNMENT MODELING IN SEQUENCE-TO-SEQUENCE TEXT-TO-SPEECH SYNTHESIS USING HARD ALIGNMENT |
5957 | EFFECT OF FRICATION DURATION AND FORMANT TRANSITIONS ON THE PERCEPTION OF FRICATIVES IN VCV UTTERANCES |
5762 | EFFECT OF UNDERSAMPLING ON NON-NEGATIVE BLIND DECONVOLUTION WITH AUTOREGRESSIVE FILTERS |
4034 | EFFECTIVE APPROXIMATE MAXIMUM LIKELIHOOD ESTIMATION OF ANGLES OF ARRIVAL FOR NON-COHERENT SUB-ARRAYS |
3642 | EFFECTIVE APPROXIMATION OF BANDLIMITED SIGNALS AND THEIR SAMPLES |
1025 | Effective Pipeline for Compressing Deep Object Detectors |
5449 | EFFECTIVE WAVENET ADAPTATION FOR VOICE CONVERSION WITH LIMITED DATA |
3729 | Effectiveness of Random Deep Feature Selection for securing image manipulation detectors against adversarial examples |
4038 | Effectiveness of self-supervised pre-training for ASR |
5859 | EFFECTS OF SPECTRAL TILT ON LISTENERS' PREFERENCES AND INTELLIGIBILITY |
1977 | EFFICIENT ALGORITHM TO IMPLEMENT SLIDING SINGULAR SPECTRUM ANALYSIS WITH APPLICATION TO BIOMEDICAL SIGNAL DENOISING |
4927 | EFFICIENT AND SCALABLE NEURAL RESIDUAL WAVEFORM CODING WITH COLLABORATIVE QUANTIZATION |
3824 | EFFICIENT BELIEF PROPAGATION FOR GRAPH MATCHING |
5062 | EFFICIENT BIRD SOUND DETECTION ON THE BELA EMBEDDED SYSTEM |
2222 | EFFICIENT CONSTRAINED ENCODERS CORRECTING A SINGLE NUCLEOTIDE EDIT IN DNA STORAGE |
1454 | Efficient Decoupled Neural Architecture Search by Structure and Operation Sampling |
2088 | EFFICIENT DEEP LEARNING-BASED LOSSY IMAGE COMPRESSION VIA ASYMMETRIC AUTOENCODER AND PRUNING |
3319 | EFFICIENT ESTIMATION OF MIXING MATRIX USING A TWO-SENSOR ARRAY |
2825 | EFFICIENT IMAGE SUPER RESOLUTION VIA CHANNEL DISCRIMINATIVE DEEP NEURAL NETWORK PRUNING |
2012 | Efficient Multichannel Nonlinear Acoustic Echo Cancellation Based on a Cooperative strategy |
6080 | EFFICIENT REPRESENTATION AND SPARSE SAMPLING OF HEAD-RELATED TRANSFER FUNCTIONS USING PHASE-CORRECTION BASED ON EAR ALIGNMENT |
1509 | EFFICIENT SCENE TEXT DETECTION WITH TEXTUAL ATTENTION TOWER |
5897 | EFFICIENT SHALLOW WAVENET VOCODER USING MULTIPLE SAMPLES OUTPUT BASED ON LAPLACIAN DISTRIBUTION AND LINEAR PREDICTION |
5367 | Efficient Super-Resolution Two-Dimensional Harmonic Retrieval via Enhanced Low-Rank Structured Covariance Reconstruction |
2167 | Efficient Techniques for In-Band System Information Broadcast in Multi-cell Massive MIMO |
4524 | EFFICIENT TRAINABLE FRONT-ENDS FOR NEURAL SPEECH ENHANCEMENT |
6090 | EIGENBEAM-ESPRIT FOR DOA-VECTOR ESTIMATION |
6149 | EIGENDECOMPOSITION-FREE SAMPLING SET SELECTION FOR GRAPH SIGNALS |
1368 | Electric Analog Circuit Design with Hypernetworks and a Differential Simulator |
1901 | Electro-Magnetic Side-Channel Attack Through Learned Denoising and Classification |
2453 | Eliminating Out-Of-Cell Interference in Cellular Massive MIMO With a Single Additional Transceiver |
1761 | EMBEDDED LARGE-SCALE HANDWRITTEN CHINESE CHARACTER RECOGNITION |
2900 | EMET: EMBEDDINGS FROM MULTILINGUAL-ENCODER TRANSFORMER FOR FAKE NEWS DETECTION |
5365 | EMOTIONAL SPEECH SYNTHESIS WITH RICH AND GRANULARIZED CONTROL |
4853 | EMOTIONAL VOICE CONVERSION USING MULTITASK LEARNING WITH TEXT-TO-SPEECH |
5309 | Empirical SURE-guided microscopy super-resolution image reconstruction from confocal multi-array detectors |
1223 | ENCODER-RECURRENT DECODER NETWORK FOR SINGLE IMAGE DEHAZING |
3346 | ENCODING AND DECODING MIXED BANDLIMITED SIGNALS USING SPIKING INTEGRATE-AND-FIRE NEURONS |
3738 | ENCODING TEMPORAL INFORMATION FOR AUTOMATIC DEPRESSION RECOGNITION FROM FACIAL ANALYSIS |
4249 | END TO END SPEECH RECOGNITION ERROR PREDICTION WITH SEQUENCE TO SEQUENCE LEARNING |
3034 | END-END SPEECH-TO-TEXT TRANSLATION WITH MODALITY AGNOSTIC META-LEARNING |
3428 | END-TO-END ACCENT CONVERSION WITHOUT USING NATIVE UTTERANCES |
3857 | END-TO-END ARCHITECTURES FOR ASR-FREE SPOKEN LANGUAGE UNDERSTANDING |
4334 | END-TO-END ARTICULATORY MODELING FOR DYSARTHRIA ARTICULATORY ATTRIBUTE DETECTION |
4142 | END-TO-END AUDITORY OBJECT RECOGNITION VIA INCEPTION NUCLEUS |
2882 | END-TO-END AUTOMATIC SPEECH RECOGNITION INTEGRATED WITH CTC-BASED VOICE ACTIVITY DETECTION |
4674 | END-TO-END CODE-SWITCHING TTS WITH CROSS-LINGUAL LANGUAGE MODEL |
4493 | END-TO-END GENERATION OF TALKING FACES FROM NOISY SPEECH |
4332 | END-TO-END MICROPHONE PERMUTATION AND NUMBER INVARIANT MULTI-CHANNEL SPEECH SEPARATION |
2763 | END-TO-END MULTI-PERSON AUDIO/VISUAL AUTOMATIC SPEECH RECOGNITION |
4758 | END-TO-END MULTI-SPEAKER SPEECH RECOGNITION WITH TRANSFORMER |
4367 | END-TO-END MULTI-TALKER OVERLAPPING SPEECH RECOGNITION |
4496 | End-to-end Non-Negative Autoencoders for Sound Source Separation |
3159 | END-TO-END SPEECH TRANSLATION WITH SELF-CONTAINED VOCABULARY MANIPULATION |
5939 | End-to-End Spoken Language Understanding Without Matched Language Speech Model Pretraining Data |
3307 | END-TO-END TRAINING OF TIME DOMAIN AUDIO SEPARATION AND RECOGNITION |
3618 | END-TO-END VOICE CONVERSION VIA CROSS-MODAL KNOWLEDGE DISTILLATION FOR DYSARTHRIC SPEECH RECONSTRUCTION |
3572 | EnerGAN: A GENERATIVE ADVERSARIAL NETWORK FOR ENERGY DISAGGREGATION |
2659 | ENERGY DISAGGREGATION FROM LOW SAMPLING FREQUENCY MEASUREMENTS USING MULTI-LAYER ZERO CROSSING RATE |
2656 | ENERGY DISAGGREGATION USING FRACTIONAL CALCULUS |
3178 | ENERGY EFFICIENT ACCELERATION OF FLOATING POINT APPLICATIONS ONTO CGRA |
5193 | Energy-efficient 3D UAV trajectory design for data collection in wireless sensor networks |
5681 | Energy-Efficient Bit Allocation for Resolution-Adaptive ADC in Multiuser Large-Scale MIMO Systems: Global Optimality |
2717 | ENERGY-EFFICIENT DISTRIBUTED LEARNING WITH COARSELY QUANTIZED SIGNALS |
4161 | ENHANCE FEATURE REPRESENTATION OF ELECTROENCEPHALOGRAM FOR SEIZURE DETECTION |
1070 | ENHANCE PART-BASED MODEL FOR PERSON RE-IDENTIFICATION WITH FUSED MULTI-SCALE FEATURES |
1466 | ENHANCED ACTION TUBELET DETECTOR FOR SPATIO-TEMPORAL VIDEO ACTION DETECTION |
1210 | Enhanced Adversarial Strategically-Timed Attacks against Deep Reinforcement Learning |
3015 | ENHANCED METHOD OF AUDIO CODING USING CNN-BASED SPECTRAL RECOVERY WITH ADAPTIVE STRUCTURE |
3686 | Enhanced Mixture Population Monte Carlo Via Stochastic Optimization and Markov Chain Monte Carlo Sampling |
5010 | ENHANCED NON-LOCAL CASCADING NETWORK WITH ATTENTION MECHANISM FOR HYPERSPECTRAL IMAGE DENOISING |
3354 | Enhanced Safety of Autonomous Driving by Incorporating Terrestrial Signals of Opportunity |
1897 | ENHANCEMENT OF CODED SPEECH USING A MASK-BASED POST-FILTER |
4750 | ENHANCING END-TO-END MULTI-CHANNEL SPEECH SEPARATION VIA SPATIAL FEATURE LEARNING |
1747 | ENHANCING THE LABELLING OF AUDIO SAMPLES FOR AUTOMATIC INSTRUMENT CLASSIFICATION BASED ON NEURAL NETWORKS |
1870 | ENSEMBLE NETWORK FOR RANKING IMAGES BASED ON VISUAL APPEAL |
2280 | ENVIRONMENT-AWARE RECONFIGURABLE NOISE SUPPRESSION |
4976 | EPIGRAPHICAL REFORMULATION FOR NON-PROXIMABLE MIXED NORMS |
3599 | EPI-NEIGHBORHOOD DISTRIBUTION BASED LIGHT FIELD DEPTH ESTIMATION |
5453 | EPOCH EXTRACTION FROM A SPEECH SIGNAL USING GAMMATONE WAVELETS IN A SCATTERING NETWORK |
1292 | EQUALIZATION OF OFDM WAVEFORMS WITH INSUFFICIENT CYCLIC PREFIX |
1178 | ERNET FAMILY: HARDWARE-ORIENTED CNN MODELS FOR COMPUTATIONAL IMAGING USING BLOCK-BASED INFERENCE |
5722 | ERROR ANALYSIS APPLIED TO END-TO-END SPOKEN LANGUAGE UNDERSTANDING |
6127 | Error Preserving Correction: A Method for CP Decomposition at a Target Error Bound |
2815 | ESPNET-TTS: UNIFIED, REPRODUCIBLE, AND INTEGRATABLE OPEN SOURCE END-TO-END TEXT-TO-SPEECH TOOLKIT |
1999 | ESRGAN+ : FURTHER IMPROVING ENHANCED SUPER-RESOLUTION GENERATIVE ADVERSARIAL NETWORK |
3541 | ESTIMATING CENTRALITY BLINDLY FROM LOW-PASS FILTERED GRAPH SIGNALS |
1658 | ESTIMATING STRUCTURAL MISSING VALUES VIA LOW-TUBAL-RANK TENSOR COMPLETION |
1720 | ESTIMATING THE DEGREE OF SLEEPINESS BY INTEGRATING ARTICULATORY FEATURE KNOWLEDGE IN RAW WAVEFORM BASED CNNS |
5152 | Estimation of Information in Parallel Gaussian Channels via Model Order Selection |
2995 | ESTIMATION OF POST-NONLINEAR CAUSAL MODELS USING AUTOENCODING STRUCTURE |
2015 | EUROPARL-ST: A MULTILINGUAL CORPUS FOR SPEECH TRANSLATION OF PARLIAMENTARY DEBATES |
5323 | EVALUATING VOICE CONVERSION-BASED PRIVACY PROTECTION AGAINST INFORMED ATTACKERS |
2595 | EVALUATION OF DEEP-LEARNING-BASED VOICE ACTIVITY DETECTORS AND ROOM IMPULSE RESPONSE MODELS IN REVERBERANT ENVIRONMENTS |
2802 | EVALUATION OF JOINT AUDITORY ATTENTION DECODING AND ADAPTIVE BINAURAL BEAMFORMING APPROACH FOR HEARING DEVICES WITH ATTENTION SWITCHING |
2704 | EVALUATION OF SENSOR SELF-NOISE IN BINAURAL RENDERING OF SPHERICAL MICROPHONE ARRAY SIGNALS |
4303 | EVENT-DRIVEN SIGNAL PROCESSING WITH NEUROMORPHIC COMPUTING SYSTEMS |
3018 | EXACT SPARSE NONNEGATIVE LEAST SQUARES |
5878 | EXEMPLAR TEACHING PRACTICES IN ENGINEERING COURSES IN U.S. UNIVERSITIES |
2936 | EXOCENTRIC TO EGOCENTRIC IMAGE GENERATION VIA PARALLEL GENERATIVE ADVERSARIAL NETWORK |
2914 | EXPERIMENTS IN CREATING ONLINE COURSE CONTENT FOR SIGNAL PROCESSING EDUCATION |
5402 | EXPLOITATION OF 3D CITY MAPS FOR HYBRID 5G RTT AND GNSS POSITIONING SIMULATIONS |
4182 | EXPLOITING CHANNEL LOCALITY FOR ADAPTIVE MASSIVE MIMO SIGNAL DETECTION |
4781 | EXPLOITING COMMUTATIVITY CONDITION FOR CP DECOMPOSITION VIA APPROXIMATE SIMULTANEOUS DIAGONALIZATION |
5194 | EXPLOITING PERIODICITY FEATURES FOR JOINT DETECTION AND DOA ESTIMATION OF SPEECH SOURCES USING CONVOLUTIONAL NEURAL NETWORKS |
1878 | EXPLOITING RAYS IN BLIND LOCALIZATION OF DISTRIBUTED SENSOR ARRAYS |
3126 | EXPLOITING SPARSITY FOR ROBUST SENSOR NETWORK LOCALIZATION IN MIXED LOS/NLOS ENVIRONMENTS |
5108 | Exploiting Two-dimensional Symmetry and Unimodality for Model-free Source Localization in Harsh Environment |
5799 | EXPLOITING VOCAL TRACT COORDINATION USING DILATED CNNS FOR DEPRESSION DETECTION IN NATURALISTIC ENVIRONMENTS |
3904 | EXPLORATION METHODOLOGY FOR BTI-INDUCED FAILURES ON RRAM-BASED EDGE AI SYSTEMS |
3700 | EXPLORING A ZERO-ORDER DIRECT HMM BASED ON LATENT ATTENTION FOR AUTOMATIC SPEECH RECOGNITION |
3098 | EXPLORING APPROPRIATE ACOUSTIC AND LANGUAGE MODELLING CHOICES FOR CONTINUOUS DYSARTHRIC SPEECH RECOGNITION |
2922 | EXPLORING BIO-BEHAVIORAL SIGNAL TRAJECTORIES OF STATE ANXIETY DURING PUBLIC SPEAKING |
4555 | Exploring Energy Efficient Quantum-resistant Signal Processing Using Array Processors |
1241 | Exploring Entity-level Spatial Relationships for Image-Text Matching |
4178 | Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition |
1147 | EXPOSURE INTERPOLATION VIA HYBRID LEARNING |
5658 | EXPRESSION-GUIDED EEG REPRESENTATION LEARNING FOR EMOTION RECOGNITION |
2139 | EXTENDED CYCLIC COORDINATE DESCENT FOR ROBUST ROW-SPARSE SIGNAL RECONSTRUCTION IN THE PRESENCE OF OUTLIERS |
4348 | Extended Object Tracking Using Hierarchical Truncated Gaussian Measurement Model |
2378 | EXTRACTING UNIT EMBEDDINGS USING SEQUENCE-TO-SEQUENCE ACOUSTIC MODELS FOR UNIT SELECTION SPEECH SYNTHESIS |
2615 | EXTRAPOLATED ALTERNATING ALGORITHMS FOR APPROXIMATE CANONICAL POLYADIC DECOMPOSITION |
4796 | F0-CONSISTENT MANY-TO-MANY NON-PARALLEL VOICE CONVERSION VIA CONDITIONAL AUTOENCODER |
3206 | FACE FEATURE RECOVERY VIA TEMPORAL FUSION FOR PERSON SEARCH |
3764 | FACIAL EMOTION RECOGNITION USING LIGHT FIELD IMAGES WITH DEEP ATTENTION-BASED BIDIRECTIONAL LSTM |
3244 | FACIAL FEATURE EMBEDDED CYCLEGAN FOR VIS-NIR TRANSLATION |
4100 | FAR-FIELD LOCATION GUIDED TARGET SPEECH EXTRACTION USING END-TO-END SPEECH RECOGNITION OBJECTIVES |
2631 | FAST ACOUSTIC SCATTERING USING CONVOLUTIONAL NEURAL NETWORKS |
2397 | FAST AND ACCURATE EMBEDDED DCNN FOR RGB-D BASED SIGN LANGUAGE RECOGNITION |
3660 | FAST AND HIGH-QUALITY SINGING VOICE SYNTHESIS SYSTEM BASED ON CONVOLUTIONAL NEURAL NETWORKS |
4227 | FAST AND STABLE BLIND SOURCE SEPARATION WITH RANK-1 UPDATES |
1151 | FAST BLOCK-SPARSE ESTIMATION FOR VECTOR NETWORKS |
2315 | FAST CLUSTERING WITH CO-CLUSTERING VIA DISCRETE NON-NEGATIVE MATRIX FACTORIZATION FOR IMAGE IDENTIFICATION |
4209 | FAST DIRECTION-OF-ARRIVAL ESTIMATION OF MULTIPLE TARGETS USING DEEP LEARNING AND SPARSE ARRAYS |
1132 | FAST DOMAIN ADAPTATION FOR GOAL-ORIENTED DIALOGUE USING A HYBRID GENERATIVE-RETRIEVAL TRANSFORMER |
2062 | Fast Graph Metric Learning via Gershgorin Disc Alignment |
6133 | Fast High-Dimensional Kernel Filtering |
1299 | FAST INDEPENDENT VECTOR EXTRACTION BY ITERATIVE SINR MAXIMIZATION |
4155 | FAST INTENT CLASSIFICATION FOR SPOKEN LANGUAGE UNDERSTANDING |
4302 | Fast Lattice-free Keyword Filtering for Accelerated Spoken Term Detection |
5765 | FAST OPTICAL SYSTEM IDENTIFICATION BY NUMERICAL INTERFEROMETRY |
1460 | FAST SINGLE-VIEW 3D OBJECT RECONSTRUCTION WITH FINE DETAILS THROUGH DILATED DOWNSAMPLE AND MULTI-PATH UPSAMPLE DEEP NEURAL NETWORK |
2690 | Fast Start-Up Algorithm for Adaptive Noise Cancellers with Novel SNR Estimation and Stepsize Control |
2169 | FAST TRAINING OF DEEP NEURAL NETWORKS FOR SPEECH RECOGNITION |
1468 | Faster-than-Nyquist Signaling via Spatiotemporal Symbol-Level Precoding for Multi-User MISO Redundant Transmissions |
3936 | Favorable Propagation and Linear Multiuser Detection for Distributed Antenna Systems |
1396 | FCEM: A Novel Fast Correlation Extract Model For Real Time Steganalysis of VoIP Stream via Multi-head Attention |
1905 | FDDWNET: A LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION |
2687 | FEATURE AFFINE PROJECTION ALGORITHMS |
3954 | Feature drift resilient tracking of the carotid artery wall using unscented Kalman filtering with data fusion |
3948 | FEATURE ENHANCEMENT WITH DEEP FEATURE LOSSES FOR SPEAKER VERIFICATION |
1306 | FEATURE SELECTION UNDER ORTHOGONAL REGRESSION WITH REDUNDANCY MINIMIZING |
4718 | FEDERATED CLASSIFICATION WITH LOW COMPLEXITY REPRODUCING KERNEL HILBERT SPACE REPRESENTATIONS |
1670 | FEDERATED LEARNING WITH MUTUALLY COOPERATING DEVICES: A CONSENSUS APPROACH TOWARDS SERVER-LESS MODEL OPTIMIZATION |
1676 | FEDERATED LEARNING WITH QUANTIZATION CONSTRAINTS |
3157 | FEDERATED NEUROMORPHIC LEARNING OF SPIKING NEURAL NETWORKS FOR LOW-POWER EDGE INTELLIGENCE |
5329 | FEDERATED TRUTH INFERENCE OVER DISTRIBUTED CROWDSOURCING PLATFORMS |
4381 | FEDERATING SOLAR, STORAGE AND COMMUNICATIONS IN THE ELECTRIC GRID AND INTERNET OF THINGS |
4216 | FEEDBACK RECURRENT AUTOENCODER |
4039 | FEEDBACK TURBO AUTOENCODER |
2862 | FEW-SHOT ACOUSTIC EVENT DETECTION VIA META LEARNING |
3586 | FEW-SHOT SOUND EVENT DETECTION |
4667 | FG2Seq: Effectively Encoding Knowledge for End-to-End Task-Oriented Dialog |
3446 | FILTERBANK DESIGN FOR END-TO-END SPEECH SEPARATION |
5705 | FILTERING OUT TIME-FREQUENCY AREAS USING GABOR MULTIPLIERS |
3040 | FINE-GRAINED ACTION RECOGNITION ON A NOVEL BASKETBALL DATASET |
1849 | FINE-GRAINED GIANT PANDA IDENTIFICATION |
2945 | FINITE SAMPLE DEVIATION AND VARIANCE BOUNDS FOR FIRST ORDER AUTOREGRESSIVE PROCESSES |
2762 | FIR FILTER DESIGN AND IMPLEMENTATION FOR PHASE-BASED PROCESSING |
5545 | FIR FILTERING OF DISCONTINUOUS SIGNALS: A RANDOM-STRATIFIED SAMPLING APPROACH |
5411 | FIXED SMOOTH CONVOLUTIONAL LAYER FOR AVOIDING CHECKERBOARD ARTIFACTS IN CNNS |
5647 | FIXED-POINT OPTIMIZATION OF TRANSFORMER NEURAL NETWORK |
2428 | FLEXIBLY-TUNABLE BITCUBE-BASED PERCEPTUAL ENCRYPTION WITHIN JPEG COMPRESSION |
3368 | FLOW-TTS: A NON-AUTOREGRESSIVE NETWORK FOR TEXT TO SPEECH BASED ON FLOW |
2770 | FOCUS ON SEMANTIC CONSISTENCY FOR CROSS-DOMAIN CROWD UNDERSTANDING |
2604 | FOCUSING ON ATTENTION: PROSODY TRANSFER AND ADAPTATIVE OPTIMIZATION STRATEGY FOR MULTI-SPEAKER END-TO-END SPEECH SYNTHESIS |
5448 | FORECASTING MULTI-DIMENSIONAL PROCESSES OVER GRAPHS |
2909 | FORECASTING SPARSE TRAFFIC CONGESTION PATTERNS USING MESSAGE-PASSING RNNS |
4883 | FOREGROUND SIGNATURE EXTRACTION FOR AN INTIMATE MIXING MODEL IN HYPERSPECTRAL IMAGE CLASSIFICATION |
6065 | FORENSIC SIMILARITY FOR DIGITAL IMAGES |
2541 | FORMULATING DIVERGENCE FRAMEWORK FOR MULTICLASS MOTOR IMAGERY EEG BRAIN COMPUTER INTERFACE |
5272 | FORWARD-BACKWARD SPLITTING FOR OPTIMAL TRANSPORT BASED PROBLEMS |
1732 | Fourier Phase Retrieval with Arbitrary Reference Signal |
1859 | FOURTH ORDER CUMULANT BASED ACTIVE DIRECTION OF ARRIVAL ESTIMATION USING COPRIME ARRAYS |
5203 | Fractional Fourier Transform Based QRS Complex detection in ECG Signal |
4733 | Frame-based overlapping speech detection using Convolutional Neural Networks |
3571 | FRAME-LEVEL MMI AS A SEQUENCE DISCRIMINATIVE TRAINING CRITERION FOR LVCSR |
4460 | FRAME-LEVEL PHONEME-INVARIANT SPEAKER EMBEDDING FOR TEXT-INDEPENDENT SPEAKER RECOGNITION ON EXTREMELY SHORT UTTERANCES |
1482 | FREQUENCY AND TEMPORAL CONVOLUTIONAL ATTENTION FOR TEXT-INDEPENDENT SPEAKER RECOGNITION |
4968 | Frequency Diverse Array Radar: A Closed-form Solution to Design Weights for Desired Beampattern |
3496 | Frequency-dependent Directional Feedback Delay Network |
4377 | FROM SYMBOLS TO SIGNALS: SYMBOLIC VARIATIONAL AUTOENCODERS |
4662 | FROM UNSUPERVISED MACHINE TRANSLATION TO ADVERSARIAL TEXT GENERATION |
5044 | FROM VIDEO GAME TO REAL ROBOT: THE TRANSFER BETWEEN ACTION SPACES |
1573 | FULL REFERENCE VIDEO QUALITY MEASURES IMPROVEMENT USING NEURAL NETWORKS |
5201 | Full-Reference Speech Quality Estimation with Attentional Siamese Neural Networks |
4013 | FULL-SUM DECODING FOR HYBRID HMM BASED SPEECH RECOGNITION USING LSTM LANGUAGE MODEL |
1418 | FULLY CONVOLUTIONAL RECURRENT NETWORKS FOR SPEECH ENHANCEMENT |
1338 | FULLY LEARNABLE FRONT-END FOR MULTI-CHANNEL ACOUSTIC MODELING USING SEMI-SUPERVISED LEARNING |
3732 | Fully Pipelined Iteration Unrolled Decoders: The Road to Tb/s Turbo Decoding |
4804 | FULLY QUANTIZING A SIMPLIFIED TRANSFORMER FOR END-TO-END SPEECH RECOGNITION |
4413 | Fully-hierarchical Fine-grained Prosody Modeling for Interpretable speech synthesis |
3137 | FULLY-NEURAL APPROACH TO HEAVY VEHICLE DETECTION ON BRIDGES USING A SINGLE STRAIN SENSOR |
3705 | Fusion approaches for emotion recognition from speech using acoustic and text-based features |
5706 | FUSIONNDVI: A NOVEL FUSION METHOD FOR NDVI IN REMOTE SENSING |
1766 | G2G: TTS-DRIVEN PRONUNCIATION LEARNING FOR GRAPHEMIC HYBRID ASR |
2959 | GAIT PHASE SEGMENTATION USING WEIGHTED DYNAMIC TIME WARPING AND K-NEAREST NEIGHBORS GRAPH EMBEDDING |
4891 | GATED ATTENTIVE CONVOLUTIONAL NETWORK DIALOGUE STATE TRACKER |
4761 | GATED MECHANISM FOR ATTENTION BASED MULTIMODAL SENTIMENT ANALYSIS |
3465 | Gated Multi-layer Convolutional Feature Extraction Network for Robust Pedestrian Detection |
3882 | GAUSSIAN LPCNET FOR MULTISAMPLE SPEECH SYNTHESIS |
2797 | GAUSSIAN PROCESS IMPUTATION OF MULTIPLE FINANCIAL SERIES |
5936 | GAUSSIAN PROCESSES OVER GRAPHS |
5595 | GCI DETECTION FROM RAW SPEECH USING A FULLY-CONVOLUTIONAL NETWORK |
2869 | GENDER DIFFERENCES ON THE PERCEPTION AND PRODUCTION OF UTTERANCES WITH WILLINGNESS AND RELUCTANCE IN CHINESE |
5472 | GENERALIZED COHERENCE-BASED SIGNAL ENHANCEMENT |
5176 | GENERALIZED GRAPH SPECTRAL SAMPLING WITH STOCHASTIC PRIORS |
1910 | Generalized Kernel-Based Dynamic Mode Decomposition |
4099 | Generalized Linear Bandits with Safety Constraints |
4032 | GENERALIZED SPATIAL MODULATION FOR WIRELESS TERABITS SYSTEMS UNDER SUB-THZ CHANNEL WITH RF IMPAIRMENTS |
5388 | GENERATING AND PROTECTING AGAINST ADVERSARIAL ATTACKS FOR DEEP SPEECH-BASED EMOTION RECOGNITION MODELS |
2089 | Generating diverse and natural text-to-speech samples using a quantized fine-grained VAE and autoregressive prosody prior |
3301 | GENERATING EMPATHETIC RESPONSES BY LOOKING AHEAD THE USER’S SENTIMENT |
2655 | GENERATING MULTILINGUAL VOICES USING SPEAKER SPACE TRANSLATION BASED ON BILINGUAL SPEAKER DATA |
3900 | GENERATING SYNTHETIC AUDIO DATA FOR ATTENTION-BASED SPEECH RECOGNITION SYSTEMS |
5923 | GENERATIVE ADVERSARIAL NETWORKS FOR GRAPH DATA IMPUTATION FROM SIGNED OBSERVATIONS |
2253 | GENERATIVE PRE-TRAINING FOR SPEECH WITH AUTOREGRESSIVE PREDICTIVE CODING |
6110 | Generative RNNs for OOV Keyword Search |
1252 | GENETIC ALGORITHM OPTIMIZED SUPPORT VECTOR MACHINE IN NOMA-BASED SATELLITE NETWORKS WITH IMPERFECT CSI |
2733 | GEOMETRICALLY CONSTRAINED INDEPENDENT VECTOR ANALYSIS FOR DIRECTIONAL SPEECH ENHANCEMENT |
2736 | GEOMETRY CONSTRAINED PROGRESSIVE LEARNING FOR LSTM-BASED SPEECH ENHANCEMENT |
2835 | GFCN: A NEW GRAPH CONVOLUTIONAL NETWORK BASED ON PARALLEL FLOWS |
1554 | GFNET: A LIGHTWEIGHT GROUP FRAME NETWORK FOR EFFICIENT HUMAN ACTION RECOGNITION |
3277 | GLOBAL AND LOCAL DISCRIMINATIVE PATCHES EXPLOITING FOR ACTION RECOGNITION |
5418 | GLOBAL STRUCTURE GRAPH GUIDED FINE-GRAINED VEHICLE RECOGNITION |
2572 | Global Traffic State Recovery via Local Observations with Generative Adversarial Networks |
4641 | GPU-ACCELERATED VITERBI EXACT LATTICE DECODER FOR BATCHED ONLINE AND OFFLINE SPEECH RECOGNITION |
4212 | GRADIENT DELAY ANALYSIS IN ASYNCHRONOUS DISTRIBUTED OPTIMIZATION |
4132 | GRADIENT-BASED ALGORITHM WITH SPATIAL REGULARIZATION FOR OPTIMAL SENSOR PLACEMENT |
5005 | GRAPH AUTO-ENCODER FOR GRAPH SIGNAL DENOISING |
4391 | GRAPH CONSTRUCTION FROM DATA BY NON-NEGATIVE KERNEL REGRESSION |
6004 | GRAPH CONVOLUTIONAL NEURAL NETWORKS TO CLASSIFY WHOLE SLIDE IMAGES |
2060 | Graph Neural Net using Analytical Graph Filters and Topology Optimization for Image Denoising |
2708 | Graph Regularized Tensor Train Decomposition |
4939 | GRAPH VERTEX SAMPLING WITH ARBITRARY GRAPH SIGNAL HILBERT SPACES |
5918 | GRAPHEM: EM ALGORITHM FOR BLIND KALMAN FILTERING UNDER GRAPHICAL SPARSITY CONSTRAINTS |
3621 | GRAPHICAL EVOLUTIONARY GAME THEORETIC ANALYSIS OF SUPER USERS IN INFORMATION DIFFUSION |
5704 | GRAPHTTS: GRAPH-TO-SEQUENCE MODELLING IN NEURAL TEXT-TO-SPEECH |
2235 | GRAY-SCALE IMAGE COLORIZATION USING CYCLE-CONSISTENT GENERATIVE ADVERSARIAL NETWORKS WITH RESIDUAL STRUCTURE ENHANCER |
2481 | GREEDY HYBRID RATE ADAPTATION IN DYNAMIC WIRELESS COMMUNICATION ENVIRONMENT |
2251 | GREEDY SPARSE ARRAY DESIGN FOR OPTIMAL LOCALIZATION UNDER SPATIALLY PRIORITIZED SOURCE DISTRIBUTION |
6106 | GRIFFIN–LIM LIKE PHASE RECOVERY VIA ALTERNATING DIRECTION METHOD OF MULTIPLIERS |
3569 | GROUP-UTILITY METRIC FOR EFFICIENT SENSOR SELECTION AND REMOVAL IN LCMV BEAMFORMERS |
2650 | Guided Learning for Weakly-labeled Semi-supervised Sound Event Detection |
3990 | GYROSCOPE AIDED VIDEO STABILIZATION USING NONLINEAR REGRESSION ON SPECIAL ORTHOGONAL GROUP |
3171 | Hand-3D-Studio: A New Multi-view System for 3D Hand Reconstruction |
5477 | HARMONIC/PERCUSSIVE SOUND SEPARATION AND SPECTRAL COMPLEXITY REDUCTION OF MUSIC SIGNALS FOR COCHLEAR IMPLANT LISTENERS |
4914 | HARMONICS BASED REPRESENTATION IN CLARINET TONE QUALITY EVALUATION |
1668 | HDMFH: HYPERGRAPH BASED DISCRETE MATRIX FACTORIZATION HASHING FOR MULTIMODAL RETRIEVAL |
5612 | Headless Horseman: Adversarial Attacks on Transfer Learning Models |
1973 | HEARING AID RESEARCH DATA SET FOR ACOUSTIC ENVIRONMENT RECOGNITION |
4875 | HEIGHT AND WEIGHT ESTIMATION FROM UNCONSTRAINED IMAGES |
4714 | HETEROGENEOUS DOMAIN GENERALIZATION VIA DOMAIN MIXUP |
4652 | HGFM: A HIERARCHICAL GRAINED AND FEATURE MODEL FOR ACOUSTIC EMOTION RECOGNITION |
1409 | HIDDEN MARKOV MODELS FOR SEPSIS DETECTION IN PRETERM INFANTS |
3534 | HIERARCHICAL ATTENTION TRANSFER NETWORKS FOR DEPRESSION ASSESSMENT FROM SPEECH |
2266 | Hierarchical Caching via Deep Reinforcement Learning |
4108 | HIERARCHICAL FEDERATED LEARNING ACROSS HETEROGENEOUS CELLULAR NETWORKS |
5787 | Hierarchical Sequence Representation with Graph Network |
4320 | HIGH DYNAMIC RANGE IMAGING USING DEEP IMAGE PRIORS |
4318 | HIGH-ACCURACY AND LOW-LATENCY SPEECH RECOGNITION WITH TWO-HEAD CONTEXTUAL LAYER TRAJECTORY LSTM MODEL |
1085 | HIGH-ACCURACY CLASSIFICATION OF ATTENTION DEFICIT HYPERACTIVITY DISORDER WITH L2,1-NORM LINEAR DISCRIMINANT ANALYSIS |
5569 | HIGH-DIMENSIONAL NEURAL FEATURE USING RECTIFIED LINEAR UNIT AND RANDOM MATRIX INSTANCE |
5839 | High-Resolution Attention Network With Acoustic Segment Model For Acoustic Scene Classification |
1851 | Hijacking Tracker: A Powerful Adversarial Attack on Visual Tracking |
5259 | HI-MIA: A FAR-FIELD TEXT-DEPENDENT SPEAKER VERIFICATION DATABASE AND THE BASELINES |
1709 | HKA: A HIERARCHICAL KNOWLEDGE ATTENTION MECHANISM FOR MULTI-TURN DIALOGUE SYSTEM |
3811 | HOW CONFIDENT ARE YOU? EXPLORING THE ROLE OF FILLERS IN THE AUTOMATIC PREDICTION OF A SPEAKER’S CONFIDENCE |
5260 | HOW MUCH SELF-ATTENTION DO WE NEED? TRADING ATTENTION FOR FEED-FORWARD LAYERS |
4831 | HPRNN: A HIERARCHICAL SEQUENCE PREDICTION MODEL FOR LONG-TERM WEATHER RADAR ECHO EXTRAPOLATION |
1122 | HUMANGAN: GENERATIVE ADVERSARIAL NETWORK WITH HUMAN-BASED DISCRIMINATOR AND ITS EVALUATION IN SPEECH PERCEPTION MODELING |
3617 | Human-Machine Collaboration for Medical Image Segmentation |
4088 | HUMBUG ZOONIVERSE: A CROWD-SOURCED ACOUSTIC MOSQUITO DATASET |
3778 | H-VECTORS: UTTERANCE-LEVEL SPEAKER EMBEDDING USING A HIERARCHICAL ATTENTION MODEL |
2768 | HYBRID ACTIVE CONTOUR DRIVEN BY DOUBLE-WEIGHTED SIGNED PRESSURE FORCE FOR IMAGE SEGMENTATION |
5643 | HYBRID AUTOREGRESSIVE TRANSDUCER (HAT) |
1487 | HYBRID DEEP-SEMANTIC MATRIX FACTORIZATION FOR TAG-AWARE PERSONALIZED RECOMMENDATION |
5864 | HYBRID NEURAL-PARAMETRIC F0 MODEL FOR SINGING SYNTHESIS |
4138 | HYBRID PRECODING FOR SECURE TRANSMISSION IN REFLECT-ARRAY-ASSISTED MASSIVE MIMO SYSTEMS |
5364 | HydraNet: A real-time waveform separation network |
2421 | IDENTIFICATION OF ESSENTIAL PROTEINS USING A NOVEL MULTI-OBJECTIVE OPTIMIZATION METHOD |
4505 | IDENTIFYING TRUTHFUL LANGUAGE IN CHILD INTERVIEWS |
2382 | IMAGE DE-RAINING VIA RDL: WHEN REWEIGHTED CONVOLUTIONAL SPARSE CODING MEETS DEEP LEARNING |
5812 | IMAGE FUSION USING JOINT SPARSE REPRESENTATIONS AND COUPLED DICTIONARY LEARNING |
4385 | IMAGE PROCESSING IN DNA |
4248 | Image recovery from rotational and translational invariants |
3683 | Image Restoration via Data-dependent Proximal Averaged Optimization |
5112 | IMAGE SEGMENTATION BASED PRIVACY-PRESERVING HUMAN ACTION RECOGNITION FOR ANOMALY DETECTION |
2212 | IMAGE SUPER-RESOLUTION USING RESIDUAL GLOBAL CONTEXT NETWORK |
1172 | IMPACT OF A SHIFT-INVARIANT HARMONIC PHASE MODEL IN FULLY PARAMETRIC HARMONIC VOICE REPRESENTATION AND TIME/FREQUENCY SYNTHESIS |
4270 | Improved End-to-End Spoken Utterance Classification with a Self-Attention Acoustic Classifier |
3241 | IMPROVED LARGE-MARGIN SOFTMAX LOSS FOR SPEAKER DIARISATION |
3697 | IMPROVED NEAREST NEIGHBOR DENSITY-BASED CLUSTERING TECHNIQUES WITH APPLICATION TO HYPERSPECTRAL IMAGES |
1842 | IMPROVED PROBABILITY MODELLING FOR EXCEPTION HANDLING IN LOSSLESS SCREEN CONTENT CODING |
5332 | IMPROVED REAL-TIME VISUAL TRACKING VIA ADVERSARIAL LEARNING |
5769 | IMPROVED SPEAKER INDEPENDENT DYSARTHRIA INTELLIGIBILITY CLASSIFICATION USING DEEPSPEECH POSTERIORS |
5708 | IMPROVING AUDITORY ATTENTION DECODING PERFORMANCE OF LINEAR AND NON-LINEAR METHODS USING STATE-SPACE MODEL |
3519 | IMPROVING AUTOMATED SEGMENTATION OF RADIO SHOWS WITH AUDIO EMBEDDINGS |
4525 | IMPROVING CONVERGENT CROSS MAPPING FOR CAUSAL DISCOVERY WITH GAUSSIAN PROCESSES |
5846 | IMPROVING CROSS-DATASET PERFORMANCE OF FACE PRESENTATION ATTACK DETECTION SYSTEMS USING FACE RECOGNITION DATASETS |
5082 | IMPROVING DEEP CNN NETWORKS WITH LONG TEMPORAL CONTEXT FOR TEXT-INDEPENDENT SPEAKER VERIFICATION |
2284 | IMPROVING DEEP LEARNING CLASSIFICATION OF JPEG2000 IMAGES OVER BANDLIMITED NETWORKS |
3734 | Improving Device Directedness Classification of Utterances with Semantic Lexical Features |
4538 | IMPROVING EFFICIENCY IN LARGE-SCALE DECENTRALIZED DISTRIBUTED TRAINING |
2910 | IMPROVING END-TO-END SPEECH SYNTHESIS WITH LOCAL RECURRENT NEURAL NETWORK ENHANCED TRANSFORMER |
1203 | IMPROVING FASHION ATTRIBUTE PREDICTION VIA GLOBAL SEMANTIC REASONING |
3563 | IMPROVING LANGUAGE IDENTIFICATION FOR MULTILINGUAL SPEAKERS |
5558 | Improving LPCNet-based Text-to-Speech with Linear Prediction-structured Mixture Density Network |
2076 | IMPROVING MUSIC TRANSCRIPTION BY PRE-STACKING A U-NET |
3567 | IMPROVING NOISE ROBUST AUTOMATIC SPEECH RECOGNITION WITH SINGLE-CHANNEL TIME-DOMAIN ENHANCEMENT NETWORK |
1742 | IMPROVING PROPER NOUN RECOGNITION IN END-TO-END ASR BY CUSTOMIZATION OF THE MWER LOSS CRITERION |
3430 | IMPROVING PROSODY WITH LINGUISTIC AND BERT DERIVED FEATURES IN MULTI-SPEAKER BASED MANDARIN CHINESE NEURAL TTS |
1243 | IMPROVING REVERBERANT SPEECH TRAINING USING DIFFUSE ACOUSTIC SIMULATION |
4137 | IMPROVING ROBUSTNESS OF DEEP LEARNING BASED MONAURAL SPEECH ENHANCEMENT AGAINST PROCESSING ARTIFACTS |
3699 | IMPROVING SAMPLE-EFFICIENCY IN REINFORCEMENT LEARNING FOR DIALOGUE SYSTEMS BY USING TRAINABLE-ACTION-MASK |
4021 | Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation |
4258 | IMPROVING SINGING VOICE SEPARATION WITH THE WAVE-U-NET USING MINIMUM HYPERSPHERICAL ENERGY |
2735 | Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam |
1874 | IMPROVING SPEAKER-ATTRIBUTE ESTIMATION BY VOTING BASED ON SPEAKER CLUSTER INFORMATION |
1730 | IMPROVING SPEECH RECOGNITION USING CONSISTENT PREDICTIONS ON SYNTHESIZED SPEECH |
5268 | Improving Spoken Question Answering using Contextualized Word Representation |
3634 | IMPROVING THE CHRONOLOGICAL SORTING OF IMAGES THROUGH OCCLUSION: A STUDY ON THE NOTRE-DAME CATHEDRAL FIRE |
3309 | IMPROVING THE PERFORMANCE OF TRANSFORMER BASED LOW RESOURCE SPEECH RECOGNITION FOR INDIAN LANGUAGES |
4461 | IMPROVING THE SCALABILITY OF DEEP REINFORCEMENT LEARNING-BASED ROUTING WITH CONTROL ON PARTIAL NODES |
2765 | IMPROVING UNIVERSAL SOUND SEPARATION USING SOUND CLASSIFICATION |
2911 | IMPROVING VOICE SEPARATION BY INCORPORATING END-TO-END SPEECH RECOGNITION |
1339 | IMPULSE RESPONSE DATA AUGMENTATION AND DEEP NEURAL NETWORKS FOR BLIND ROOM ACOUSTIC PARAMETER ESTIMATION |
4035 | INCORPORATING WRITTEN DOMAIN NUMERIC GRAMMARS INTO END-TO-END CONTEXTUAL SPEECH RECOGNITION SYSTEMS FOR IMPROVED RECOGNITION OF NUMERIC SEQUENCES |
3870 | INCREMENTAL SEMI-SUPERVISED LEARNING FOR MULTI-GENRE SPEECH RECOGNITION |
3376 | INDEPENDENT LANGUAGE MODELING ARCHITECTURE FOR END-TO-END ASR |
6076 | INDEPENDENT-VARIATION MATRIX FACTORIZATION WITH APPLICATION TO ENERGY DISAGGREGATION |
4260 | INDIVIDUAL DISTANCE-DEPENDENT HRTFS MODELING THROUGH A FEW ANTHROPOMETRIC MEASUREMENTS |
1364 | In-Domain and Out-of-Domain Data Augmentation to Improve Children's Speaker Verification System in Limited Data Scenario |
3807 | INDOOR ALTITUDE ESTIMATION OF UNMANNED AERIAL VEHICLES USING A BANK OF KALMAN FILTERS |
2667 | INDOOR HEADING DIRECTION ESTIMATION USING RF SIGNALS |
5742 | IndyLSTMs: Independently Recurrent LSTMs |
2271 | INFERRING DYNAMIC GROUP LEADERSHIP USING SEQUENTIAL BAYESIAN METHODS |
5173 | INFORMATION FLOW OPTIMIZATION IN INFERENCE NETWORKS |
5091 | INFORMATION MAXIMIZED VARIATIONAL DOMAIN ADVERSARIAL LEARNING FOR SPEAKER VERIFICATION |
5667 | INFORMATION THEORETIC APPROACH FOR WAVEFORM DESIGN IN COEXISTING MIMO RADAR AND MIMO COMMUNICATIONS |
2644 | In-network Caching For Hybrid Satellite-Terrestrial Networks Using Deep Reinforcement Learning |
5405 | INSIGHTS INTO NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT |
5342 | INSTANCE-BASED MODEL ADAPTATION FOR DIRECT SPEECH TRANSLATION |
3324 | INSTANT ADAPTIVE LEARNING: AN ADAPTIVE FILTER BASED FAST LEARNING MODEL CONSTRUCTION FOR SENSOR SIGNAL TIME SERIES CLASSIFICATION ON EDGE DEVICES |
2842 | INTEGRATING DISCRETE AND NEURAL FEATURES VIA MIXED-FEATURE TRANS-DIMENSIONAL RANDOM FIELD LANGUAGE MODELS |
4229 | INTEGRATION OF MULTI-LOOK BEAMFORMERS FOR MULTI-CHANNEL KEYWORD SPOTTING |
2320 | INTELLIGENT REFLECTING SURFACE FOR MASSIVE DEVICE CONNECTIVITY: JOINT ACTIVITY DETECTION AND CHANNEL ESTIMATION |
4784 | INTELLIGENT STUDENT BEHAVIOR ANALYSIS SYSTEM FOR REAL CLASSROOMS |
5439 | INTENSITY-IMAGE RECONSTRUCTION FOR EVENT CAMERAS USING CONVOLUTIONAL NEURAL NETWORK |
1935 | Interpolation and Range Extrapolation of Sound Source Directivity Based on a Spherical Wave Propagation Model |
5147 | INTERPRETABILITY-GUIDED CONVOLUTIONAL NEURAL NETWORKS FOR SEISMIC FAULT SEGMENTATION |
2506 | Interpretable Machine Learning in Sustainable Edge Computing: A Case Study of Short-term Photovoltaic Power Output Prediction |
4790 | INTERPRETABLE SELF-ATTENTION TEMPORAL REASONING FOR DRIVING BEHAVIOR UNDERSTANDING |
3353 | INTERRUPTED AND CASCADED PERMUTATION INVARIANT TRAINING FOR SPEECH SEPARATION |
4369 | INTRA FRAME RATE CONTROL FOR VERSATILE VIDEO CODING WITH QUADRATIC RATE-DISTORTION MODELLING |
3779 | INVERSE MULTIPLE SCATTERING WITH PHASELESS MEASUREMENTS |
5358 | INVERTIBLE DNN-BASED NONLINEAR TIME-FREQUENCY TRANSFORM FOR SPEECH ENHANCEMENT |
5892 | INVESTIGATING GENERALIZATION IN NEURAL NETWORKS UNDER OPTIMALLY EVOLVED TRAINING PERTURBATIONS |
5299 | INVESTIGATION OF METHODS TO IMPROVE THE RECOGNITION PERFORMANCE OF TAMIL-ENGLISH CODE-SWITCHED DATA IN TRANSFORMER FRAMEWORK |
5884 | INVESTIGATION OF SPECAUGMENT FOR DEEP SPEAKER EMBEDDING LEARNING |
5110 | IQ-STAN: IMAGE QUALITY GUIDED SPATIO-TEMPORAL ATTENTION NETWORK FOR LICENSE PLATE RECOGNITION |
6089 | Irregular Array Manifold Aided Channel Estimation in Massive MIMO Communications |
3766 | I-VECTOR TRANSFORMATION USING K-NEAREST NEIGHBORS FOR SPEAKER VERIFICATION |
3942 | JHU-HLTCOE SYSTEM FOR THE VOXSRC SPEAKER RECOGNITION CHALLENGE |
4792 | JOINT BEAMFORMING AND REVERBERATION CANCELLATION USING A CONSTRAINED KALMAN FILTER WITH MULTICHANNEL LINEAR PREDICTION |
4376 | JOINT BLIND CALIBRATION AND TIME-DELAY ESTIMATION FOR MULTIBAND RANGING |
3800 | JOINT CODING AND MODULATION IN THE ULTRA-SHORT BLOCKLENGTH REGIME FOR BERNOULLI-GAUSSIAN IMPULSIVE NOISE CHANNELS USING AUTOENCODERS |
4559 | JOINT CONTEXTUAL MODELING FOR ASR CORRECTION AND LANGUAGE UNDERSTANDING |
3203 | Joint Enhancement and Denoising of Low Light Images Via JND Transform |
5531 | Joint estimation of acoustic parameters from single-microphone speech observations |
1758 | JOINT FREQUENCY DOMAIN CHANNEL ESTIMATION AND EQUALIZATION BASED ON EXPECTATION PROPAGATION FOR SINGLE CARRIER TRANSMISSIONS |
5586 | Joint learning of assignment and representation for biometric group membership |
3798 | JOINT LEARNING OF CARTESIAN UNDERSAMPLING AND RECONSTRUCTION FOR ACCELERATED MRI |
5166 | JOINT MULTITARGET TRACKING AND DYNAMIC NETWORK LOCALIZATION IN THE UNDERWATER DOMAIN |
4388 | Joint Optimization of Sampling Patterns and Deep Priors for Improved Parallel MRI |
1938 | JOINT PHONEME ALIGNMENT AND TEXT-INFORMED SPEECH SEPARATION ON HIGHLY CORRUPTED SPEECH |
2102 | JOINT PHONEME-GRAPHEME MODEL FOR END-TO-END SPEECH RECOGNITION |
3809 | JOINT SCHEDULING AND BEAMFORMING FOR DELAY SENSITIVE TRAFFIC WITH PRIORITIES AND DEADLINES |
2873 | JOINT SEMI-SUPERVISED FEATURE AUTO-WEIGHTING AND CLASSIFICATION MODEL FOR EEG-BASED CROSS-SUBJECT SLEEP QUALITY EVALUATION |
1725 | Joint Software Defined Resource Allocation and Routing for Service Function Chaining with In-Subnetwork Processing |
5349 | JOINT SOURCE-CHANNEL CODING AND BAYESIAN MESSAGE PASSING DETECTION FOR GRANT-FREE RADIO ACCESS IN IOT |
3543 | Joint Sparse Recovery using Deep Unfolding With Application to Massive Random Access |
1365 | JOINT TRAINING OF DEEP NEURAL NETWORKS FOR MULTI-CHANNEL DEREVERBERATION AND SPEECH SOURCE SEPARATION |
1552 | JOINTLY OPTIMAL DEREVERBERATION AND BEAMFORMING |
2989 | JPEG STEGANOGRAPHY WITH SIDE INFORMATION FROM THE PROCESSING PIPELINE |
2855 | JUST NOTICEABLE DISTORTION BASED PERCEPTUALLY LOSSLESS INTRA CODING |
3713 | KALM: KEY AREA LOCALIZATION MECHANISM FOR ABNORMALITY DETECTION IN MUSCULOSKELETAL RADIOGRAPHS |
2351 | K-Autoencoders deep clustering |
4935 | KERNEL COMPUTATIONS FROM LARGE-SCALE RANDOM FEATURES OBTAINED BY OPTICAL PROCESSING UNITS |
1289 | KERNEL RIDGE REGRESSION WITH AUTOCORRELATION PRIOR: OPTIMAL MODEL AND CROSS-VALIDATION |
5717 | KEY ACTION AND JOINT CTC-ATTENTION BASED SIGN LANGUAGE RECOGNITION |
5420 | KEYWORD SEARCH FOR SIGN LANGUAGE |
4110 | KNOWLEDGE DISTILLATION AND RANDOM ERASING DATA AUGMENTATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION |
2180 | KNOWLEDGE ENHANCED LATENT RELEVANCE MINING FOR QUESTION ANSWERING |
4703 | KOREAN SINGING VOICE SYNTHESIS BASED ON AUTO-REGRESSIVE BOUNDARY EQUILIBRIUM GAN |
4741 | K-SPACE TRAJECTORY DESIGN FOR REDUCED MRI SCAN TIME |
5398 | L1-NORM HIGHER-ORDER ORTHOGONAL ITERATIONS FOR ROBUST TENSOR ANALYSIS |
5338 | LABEL PROPAGATION ADAPTIVE RESONANCE THEORY FOR SEMI-SUPERVISED CONTINUOUS LEARNING |
4454 | Label Reuse for Efficient Semi-supervised Learning |
4716 | LAI-NET: LOCAL-ANCESTRY INFERENCE WITH NEURAL NETWORKS |
2793 | LANCE: EFFICIENT LOW-PRECISION QUANTIZED WINOGRAD CONVOLUTION FOR NEURAL NETWORKS BASED ON GRAPHICS PROCESSING UNITS |
4678 | LANGUAGE INDEPENDENT GENDER IDENTIFICATION FROM RAW WAVEFORM USING MULTI-SCALE CONVOLUTIONAL NEURAL NETWORKS |
2086 | LANGUAGE-AGNOSTIC MULTILINGUAL MODELING |
3601 | LAPLACE STATE SPACE FILTER WITH EXACT INFERENCE AND MOMENT MATCHING |
4764 | LARGE DIMENSIONAL ASYMPTOTICS OF MULTI-TASK LEARNING |
3057 | LARGE-CONTEXT POINTER-GENERATOR NETWORKS FOR SPOKEN-TO-WRITTEN STYLE CONVERSION |
2980 | LARGE-SCALE FADING PRECODING FOR MAXIMIZING THE PRODUCT OF SINRS |
1772 | LARGE-SCALE TIME SERIES CLUSTERING WITH k-ARs |
2459 | Large-Scale Unsupervised Pre-training for End-to-End Spoken Language Understanding |
1744 | LARGE-SCALE WEAKLY-SUPERVISED CONTENT EMBEDDINGS FOR MUSIC RECOMMENDATION AND TAGGING |
2730 | LATENCY-MINIMIZED DESIGN OF SECURE TRANSMISSIONS IN UAV-AIDED COMMUNICATIONS |
4083 | LATENT ATRIAL FIBRILLATION RISK PREDICTION FROM ELECTROCARDIOGRAM AND DEMOGRAPHIC DATA WITH CONVOLUTIONAL NEURAL NETWORK |
4462 | LATENT FUSED LASSO |
4165 | LATTICE-BASED IMPROVEMENTS FOR VOICE TRIGGERING USING GRAPH NEURAL NETWORKS |
3544 | LAYER-NORMALIZED LSTM FOR HYBRID-HMM AND END-TO-END ASR |
4114 | LEARN-BY-CALIBRATING: USING CALIBRATION AS A TRAINING OBJECTIVE |
4443 | Learned Lossless Image Compression with a HyperPrior and Discretized Gaussian Mixture Likelihoods |
4572 | LEARNING A COMMON GRANGER CAUSALITY NETWORK USING A NON-CONVEX REGULARIZATION |
1335 | LEARNING A GENERIC ADAPTIVE WAVELET SHRINKAGE FUNCTION FOR DENOISING |
4627 | LEARNING A REPRESENTATION FOR COVER SONG IDENTIFICATION USING CONVOLUTIONAL NEURAL NETWORK |
4964 | LEARNING A SUBWORD INVENTORY JOINTLY WITH END-TO-END AUTOMATIC SPEECH RECOGNITION |
5640 | LEARNING ASR-ROBUST CONTEXTUALIZED EMBEDDINGS FOR SPOKEN LANGUAGE UNDERSTANDING |
5928 | LEARNING BASED RECONFIGURABLE SUB-NYQUIST SAMPLING FRAMEWORK FOR ULTRA-WIDEBAND ANGULAR SENSING |
2519 | Learning Blind Denoising Network for Noisy Image Deblurring |
4305 | LEARNING CONNECTIVITY AND HIGHER-ORDER INTERACTIONS IN RADIAL DISTRIBUTION GRIDS |
5361 | LEARNING DATA REPRESENTATION AND EMOTION ASSESSMENT FROM PHYSIOLOGICAL DATA |
4168 | LEARNING DIFFERENTIABLE SPARSE AND LOW RANK NETWORKS FOR AUDIO-VISUAL OBJECT LOCALIZATION |
1608 | Learning diverse sub-policies via a task-agnostic regularization on action distributions. |
3817 | LEARNING DOMAIN INVARIANT REPRESENTATIONS FOR CHILD-ADULT CLASSIFICATION FROM SPEECH |
3884 | LEARNING EATING ENVIRONMENTS THROUGH SCENE CLUSTERING |
3649 | LEARNING ENDMEMBER DYNAMICS IN MULTITEMPORAL HYPERSPECTRAL DATA USING A STATE-SPACE MODEL FORMULATION |
5872 | LEARNING FRACTIONAL ORTHOGONAL LATENT CONSISTENT FEATURES FOR FACE HALLUCINATION AND RECOGNITION |
4638 | LEARNING FROM DANCES: POSE-INVARIANT RE-IDENTIFICATION FOR MULTI-PERSON TRACKING |
1133 | LEARNING GEOMETRIC FEATURES WITH DUAL-STREAM CNN FOR 3D ACTION RECOGNITION |
3106 | LEARNING GRAPH INFLUENCE FROM SOCIAL INTERACTIONS |
4979 | LEARNING LOCAL STRUCTURE OF REPRESENTATIVE POINTS FOR POINT CLOUD CLASSIFICATION AND SEMANTIC SEGMENTATION |
4449 | LEARNING MULTI-SCALE ATTENTIVE FEATURES FOR SERIES PHOTO SELECTION |
5162 | LEARNING NETWORK REPRESENTATION THROUGH REINFORCEMENT LEARNING |
5426 | LEARNING NOISE INVARIANT FEATURES THROUGH TRANSFER LEARNING FOR ROBUST END-TO-END SPEECH RECOGNITION |
4308 | LEARNING PARTIAL DIFFERENTIAL EQUATIONS FROM DATA USING NEURAL NETWORKS |
1924 | Learning Perception and Planning with Deep Active Inference |
3725 | LEARNING PLUG-AND-PLAY PROXIMAL QUASI-NEWTON DENOISERS |
4735 | Learning Product Graphs from Multidomain Signals |
4366 | LEARNING RECURRENT NEURAL NETWORK LANGUAGE MODELS WITH CONTEXT-SENSITIVE LABEL SMOOTHING FOR AUTOMATIC SPEECH RECOGNITION |
5866 | LEARNING SAMPLING AND MODEL-BASED SIGNAL RECOVERY FOR COMPRESSED SENSING MRI |
3111 | LEARNING SEMI-SUPERVISED ANONYMIZED REPRESENTATIONS BY MUTUAL INFORMATION |
3418 | LEARNING SIGNED GRAPHS FROM DATA |
3524 | LEARNING SPATIO-TEMPORAL CONVOLUTIONAL NETWORK FOR REAL-TIME OBJECT TRACKING |
3658 | Learning Spatio-Temporal Representations with Temporal Squeeze Pooling |
3006 | LEARNING SPECTRAL-SPATIAL PRIOR VIA 3DDNCNN FOR HYPERSPECTRAL IMAGE DECONVOLUTION |
1673 | LEARNING TASK-BASED ANALOG-TO-DIGITAL CONVERSION FOR MIMO RECEIVERS |
2318 | LEARNING THE HELIX TOPOLOGY OF MUSICAL PITCH |
5775 | Learning the Spatio-Temporal Dynamics of Physical Processes from Partial Observations |
2346 | LEARNING TO CHARACTERIZE ADVERSARIAL SUBSPACES |
3769 | LEARNING TO DETECT KEYWORD PARTS AND WHOLE BY SMOOTHED MAX POOLING |
3208 | Learning to Estimate Driver Drowsiness from Car Acceleration Sensors using Weakly Labeled Data |
3146 | learning to fool the speaker recognition |
4920 | LEARNING TO GENERATE DIVERSE QUESTIONS FROM KEYWORDS |
3432 | LEARNING TO RANK MUSIC TRACKS USING TRIPLET LOSS |
2847 | LEARNING TO SEPARATE SOUNDS FROM WEAKLY LABELED SCENES |
2713 | LEARNING WITH OUT-OF-DISTRIBUTION DATA FOR AUDIO CLASSIFICATION |
5366 | LEARNING-AIDED CONTENT PLACEMENT IN CACHING-ENABLED FOG COMPUTING SYSTEMS USING THOMPSON SAMPLING |
5190 | LEARNING-BASED CONTENT CACHING AND USER CLUSTERING: A DEEP DETERMINISTIC POLICY GRADIENT APPROACH |
5771 | LEAST-SQUARES DOA ESTIMATION WITH AN INFORMED PHASE UNWRAPPING AND FULL BANDWIDTH ROBUSTNESS |
2010 | LEt-SNE: A HYBRID APPROACH TO DATA EMBEDDING AND VISUALIZATION OF HYPERSPECTRAL IMAGERY |
2501 | LEVENBERG-MARQUARDT AND LINE-SEARCH EXTENDED KALMAN SMOOTHERS |
4739 | LEVERAGING CUBOIDS FOR BETTER MOTION MODELING IN HIGH EFFICIENCY VIDEO CODING |
1763 | LEVERAGING GANS TO IMPROVE CONTINUOUS PATH KEYBOARD INPUT MODELS |
5344 | LEVERAGING ORDINAL REGRESSION WITH SOFT LABELS FOR 3D HEAD POSE ESTIMATION FROM POINT SETS |
3844 | LEVERAGING UNPAIRED TEXT DATA FOR TRAINING END-TO-END SPEECH-TO-INTENT SYSTEMS |
4422 | LIBRI-ADAPT: A NEW SPEECH DATASET FOR UNSUPERVISED DOMAIN ADAPTATION |
1414 | Libri-Light: A (Large) Dataset for ASR with Limited or No Supervision |
1519 | LIE GROUP STATE ESTIMATION VIA OPTIMAL TRANSPORT |
3636 | LIFTER TRAINING AND SUB-BAND MODELING FOR COMPUTATIONALLY EFFICIENT AND HIGH-QUALITY VOICE CONVERSION USING SPECTRAL DIFFERENTIALS |
3716 | LIGHTDET: A LIGHTWEIGHT AND ACCURATE OBJECT DETECTION NETWORK |
3786 | LIGHT-FIELD RECONSTRUCTION AND DEPTH ESTIMATION FROM FOCAL STACK IMAGES USING CONVOLUTIONAL NEURAL NETWORKS |
1739 | LIGHTWEIGHT AND EFFICIENT END-TO-END SPEECH RECOGNITION USING LOW-RANK TRANSFORMER |
1328 | LIGHTWEIGHT HARDWARE IMPLEMENTATION OF VVC TRANSFORM BLOCK FOR ASIC DECODER |
1356 | Lightweight V-Net for Liver segmentation |
5865 | LIMITATIONS OF WEAK LABELS FOR EMBEDDING AND TAGGING |
2675 | LINE SPECTRAL ESTIMATION WITH PALINDROMIC KERNELS |
1213 | LINEAR MODEL-BASED INTRA PREDICTION IN VVC TEST MODEL |
3462 | LINEAR SPEEDUP IN SADDLE-POINT ESCAPE FOR DECENTRALIZED NON-CONVEX OPTIMIZATION |
5198 | LINEAR THOMPSON SAMPLING UNDER UNKNOWN LINEAR CONSTRAINTS |
5805 | Lipreading using Temporal Convolutional Networks |
2978 | Load Management with Predictions of Solar Energy Production for Cloud Data Centers |
1699 | LOCAL KEY ESTIMATION IN CLASSICAL MUSIC RECORDINGS: A CROSS-VERSION STUDY ON SCHUBERT’S WINTERREISE |
4343 | LOCAL-GLOBAL FEATURE FOR VIDEO-BASED ONE-SHOT PERSON RE-IDENTIFICATION |
6073 | Localized Linear Regression in Networked Data |
3860 | Location-Relative Attention Mechanisms For Robust Long-Form Speech Synthesis |
5430 | Look globally, age locally: Face aging with An Attention Mechanism |
4172 | LOOKAHEAD CONVERGES TO STATIONARY POINTS OF SMOOTH NON-CONVEX FUNCTIONS |
3233 | LOOKING ENHANCES LISTENING: RECOVERING MISSING SPEECH USING IMAGES |
1360 | LOW COMPLEXITY NLMS FOR MULTIPLE LOUDSPEAKER ACOUSTIC ECHO CANCELLER USING RELATIVE LOUDSPEAKER TRANSFER FUNCTIONS |
2136 | LOW COMPLEXITY SINGLE IMAGE SUPER-RESOLUTION WITH CHANNEL SPLITTING AND FUSION NETWORK |
1866 | LOW MUTUAL AND AVERAGE COHERENCE DICTIONARY LEARNING USING CONVEX APPROXIMATION |
4111 | Low Rank Activations for Tensor-based Convolutional Sparse Coding |
6111 | LOW RESOURCE KEYWORD SEARCH WITH SYNTHESIZED CROSSLINGUAL EXEMPLARS |
3199 | Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers |
1279 | Low-complexity 5G SLAM with CKF-PHD Filter |
5895 | LOW-COMPLEXITY ACCURATE MMWAVE POSITIONING FOR SINGLE-ANTENNA USERS BASED ON ANGLE-OF-DEPARTURE AND ADAPTIVE BEAMFORMING |
1191 | LOW-COMPLEXITY AND RELIABLE TRANSFORMS FOR PHYSICAL UNCLONABLE FUNCTIONS |
5710 | LOW-COMPLEXITY COMPRESSED ALIGNMENT-AIDED COMPRESSIVE ANALYSIS FOR REAL-TIME ELECTROCARDIOGRAPHY TELEMONITORING |
4419 | LOW-COMPLEXITY FIXED-POINT CONVOLUTIONAL NEURAL NETWORKS FOR AUTOMATIC TARGET RECOGNITION |
4852 | Low-Complexity Levenberg-Marquardt Algorithm for Tensor Canonical Polyadic Decomposition |
2993 | LOW-COMPLEXITY LSTM-ASSISTED BIT-FLIPPING ALGORITHM FOR SUCCESSIVE CANCELLATION LIST POLAR DECODER |
4352 | LOW-FREQUENCY COMPENSATED SYNTHETIC IMPULSE RESPONSES FOR IMPROVED FAR-FIELD SPEECH RECOGNITION |
5738 | LOW-LATENCY LIGHTWEIGHT STREAMING SPEECH RECOGNITION WITH 8-BIT QUANTIZED SIMPLE GATED CONVOLUTIONAL NEURAL NETWORKS |
4448 | LOW-LATENCY SINGLE CHANNEL SPEECH ENHANCEMENT USING U-NET CONVOLUTIONAL NEURAL NETWORKS |
2609 | LOW-RANK APPROXIMATION OF MATRICES VIA A RANK-REVEALING FACTORIZATION WITH RANDOMIZATION |
2341 | LOW-RANK GRADIENT APPROXIMATION FOR MEMORY-EFFICIENT ON-DEVICE TRAINING OF DEEP NEURAL NETWORK |
2830 | Low-rank mmWave MIMO channel estimation in one-bit receivers |
1651 | LOW-RANK TENSOR RING MODEL FOR COMPLETING MISSING VISUAL DATA |
4554 | LOW-RANK TOEPLITZ MATRIX ESTIMATION VIA RANDOM ULTRA-SPARSE RULERS |
2803 | LOW-TUBAL-RANK TENSOR RECOVERY FROM ONE-BIT MEASUREMENTS |
2496 | LQAID: Localized Quality Aware Image Denoising using Deep Convolutional Neural Networks |
4077 | LSTM-BASED ONE-PASS DECODER FOR LOW-LATENCY STREAMING |
5379 | LUPULUS: A FLEXIBLE HARDWARE ACCELERATOR FOR NEURAL NETWORKS |
3123 | L-Vector: Neural Label Embedding for Domain Adaptation |
5006 | MAHALANOBIS DISTANCE BASED ADVERSARIAL NETWORK FOR ANOMALY DETECTION |
1538 | MANet: Multi-scale aggregated network for light field depth estimation |
3829 | Mango: A Python Library for Parallel Hyperparameter Tuning |
2240 | MANIFOLD GRADIENT DESCENT SOLVES MULTI-CHANNEL SPARSE BLIND DECONVOLUTION PROVABLY AND EFFICIENTLY |
5488 | MANY-TO-MANY VOICE CONVERSION USING CONDITIONAL CYCLE-CONSISTENT ADVERSARIAL NETWORKS |
4916 | MASK-DEPENDENT PHASE ESTIMATION FOR MONAURAL SPEAKER SEPARATION |
5151 | MASKING AND INPAINTING: A TWO-STAGE SPEECH ENHANCEMENT APPROACH FOR LOW SNR AND NON-STATIONARY NOISE |
4477 | MATCHING PURSUIT BASED DYNAMIC PHASE-AMPLITUDE COUPLING MEASURE |
5339 | MAXIMALLY ENERGY-CONCENTRATED DIFFERENTIAL WINDOW FOR PHASE-AWARE SIGNAL PROCESSING USING INSTANTANEOUS FREQUENCY |
6129 | Maximum Likelihood Estimation of a Low-Rank Probability Mass Tensor From Partial Observations |
1647 | MAXIMUM LIKELIHOOD ESTIMATION OF THE INTERFERENCE-PLUS-NOISE CROSS POWER SPECTRAL DENSITY MATRIX FOR OWN VOICE RETRIEVAL |
3911 | MAXIMUM LIKELIHOOD MULTI-SPEAKER DIRECTION OF ARRIVAL ESTIMATION UTILIZING A WEIGHTED HISTOGRAM |
4085 | MAXPOLYNOMIAL DIVISION WITH APPLICATION TO NEURAL NETWORK SIMPLIFICATION |
6128 | M-Channel Graph Filter Banks: Polyphase Analysis and Structures |
1499 | MDR-SURV: a Multi-scale Deep Learning-based Radiomics for SURVival Prediction in Pulmonary Malignancies |
2036 | MEDIA CLASSIFICATION WITH BAYESIAN OPTIMIZATION AND VAPNIK-CHERVONENKIS (VC) BOUNDS |
3682 | MELLOTRON: MULTISPEAKER EXPRESSIVE VOICE SYNTHESIS BY CONDITIONING ON RHYTHM, PITCH AND GLOBAL STYLE TOKENS |
4069 | Mental Fatigue Prediction from Multi-Channel ECoG Signal |
3145 | MESSAGE TRANSMISSION THROUGH UNDERSPREAD TIME-VARYING LINEAR CHANNELS |
5013 | M-estimators of scatter with eigenvalue shrinkage |
2575 | META LEARNING FOR END-TO-END LOW-RESOURCE SPEECH RECOGNITION |
3407 | META METRIC LEARNING FOR HIGHLY IMBALANCED AERIAL SCENE CLASSIFICATION |
5491 | Meta-learning Extractors for Music Source Separation |
3791 | META-LEARNING FOR ROBUST CHILD-ADULT CLASSIFICATION FROM SPEECH |
5755 | META-LEARNING TO COMMUNICATE: FAST END-TO-END TRAINING FOR FADING CHANNELS |
1692 | METRIC LEARNING WITH BACKGROUND NOISE CLASS FOR FEW-SHOT DETECTION OF RARE SOUND EVENTS |
4031 | Metric Representations of Networks: A Uniqueness Result |
5221 | Minimal Adversarial Perturbation in Mobile Health Applications: The Epileptic Brain Activity Case Study |
2447 | Minimum latency training strategies for streaming sequence-to-sequence ASR |
1771 | MINING EFFECTIVE NEGATIVE TRAINING SAMPLES FOR KEYWORD SPOTTING |
2816 | MIRRORED ARRAYS FOR DIRECTION-OF-ARRIVAL ESTIMATION |
1308 | MISSPECIFIED CRAMER-RAO BOUND FOR DELAY ESTIMATION WITH A MISMATCHED WAVEFORM: A CASE STUDY |
3869 | MIXTURE FACTORIZED AUTO-ENCODER FOR UNSUPERVISED HIERARCHICAL DEEP FACTORIZATION OF SPEECH SIGNAL |
4009 | MIXUP MULTI-ATTENTION MULTI-TASKING MODEL FOR EARLY STAGE LEUKEMIA IDENTIFICATION |
1589 | MIXUP-BREAKDOWN: A CONSISTENCY TRAINING METHOD FOR IMPROVING GENERALIZATION OF SPEECH SEPARATION MODELS |
4945 | ML AND EM ESTIMATION OF SAMPLING INTERVALS OF SENSOR DEVICES |
4446 | MMSE-BASED CHANNEL ESTIMATION FOR HYBRID BEAMFORMING MASSIVE MIMO WITH CORRELATED CHANNELS |
2127 | Mobility-aware Beam Steering in Metasurface-based Programmable Wireless Environments |
1894 | MOCKINGJAY: UNSUPERVISED SPEECH REPRESENTATION LEARNING WITH DEEP BIDIRECTIONAL TRANSFORMER ENCODERS |
6154 | MODAL DECOMPOSITION OF FEEDBACK DELAY NETWORKS |
3294 | MODEL ORDER SELECTION IN DOA SCENARIOS VIA CROSS-ENTROPY BASED MACHINE LEARNING TECHNIQUES |
4949 | Modeling Behavior as Mutual Dependency Between Physiological Signals and Indoor Location In Large-Scale Wearable Sensor Study |
4238 | Modeling Behavioral Consistency In Large-Scale Wearable Recordings of Human Bio-behavioral Signals |
5041 | MODELING PIECE-WISE STATIONARY TIME SERIES |
2006 | MODELING PLATE AND SPRING REVERBERATION USING A DSP-INFORMED DEEP NEURAL NETWORK |
4002 | MODELING THE ENVIRONMENT IN DEEP REINFORCEMENT LEARNING: THE CASE OF ENERGY HARVESTING BASE STATIONS |
4206 | Modeling Uncertainty in Predicting Emotional Attributes from Spontaneous Speech |
1384 | MODELLING SEA CLUTTER IN SAR IMAGES USING LAPLACE-RICIAN DISTRIBUTION |
1094 | MoGA: Searching Beyond MobileNetV3 |
3538 | MONAURAL SPEECH ENHANCEMENT USING INTRA-SPECTRAL RECURRENT LAYERS IN THE MAGNITUDE AND PHASE RESPONSES |
4996 | MOTION DYNAMICS IMPROVE SPEAKER-INDEPENDENT LIPREADING |
3228 | MOTION FEEDBACK DESIGN FOR VIDEO FRAME INTERPOLATION |
3980 | MSPEC-NET: MULTI-DOMAIN SPEECH CONVERSION NETWORK |
3416 | MSPNET: MULTI-SUPERVISED PARALLEL NETWORK FOR CROWD COUNTING |
4026 | MT-GCN FOR MULTI-LABEL AUDIO TAGGING WITH NOISY LABELS |
3648 | MULTI IMAGE DEPTH FROM DEFOCUS NETWORK WITH BOUNDARY CUE FOR DUAL APERTURE CAMERA |
1976 | MULTI-AGENT DEEP REINFORCEMENT LEARNING FOR DISTRIBUTED HANDOVER MANAGEMENT IN DENSE MMWAVE NETWORKS |
3744 | Multi-Branch Learning for Weakly-Labeled Sound Event Detection |
4608 | Multichannel Active Noise Control with Spatial Derivative Constraints to Enlarge the quiet zone |
5721 | MULTICHANNEL SIGNAL CLASSIFICATION USING VECTOR AUTOREGRESSION |
5507 | MULTICHANNEL SIGNAL PROCESSING FOR ROAD SURFACE IDENTIFICATION |
1284 | MULTI-CHANNEL SPEECH SOURCE SEPARATION AND DEREVERBERATION WITH SEQUENTIAL INTEGRATION OF DETERMINED AND UNDERDETERMINED MODELS |
5701 | MULTI-CONDITIONING AND DATA AUGMENTATION USING GENERATIVE NOISE MODEL FOR SPEECH EMOTION RECOGNITION IN NOISY CONDITIONS |
5002 | MULTI-CONSTRAINT SPECTRAL CO-DESIGN FOR COLOCATED MIMO RADAR AND MIMO COMMUNICATIONS |
5696 | MULTI-DEPTH COMPUTATIONAL PERISCOPY WITH AN ORDINARY CAMERA |
3008 | MULTIGRAPH SPECTRAL CLUSTERING FOR JOINT CONTENT DELIVERY AND SCHEDULING IN BEAM-FREE SATELLITE COMMUNICATIONS |
5142 | MULTI-HEAD ATTENTION FOR SPEECH EMOTION RECOGNITION WITH AUXILIARY LEARNING OF GENDER RECOGNITION |
2373 | MULTI-LABEL CONSISTENT CONVOLUTIONAL TRANSFORM LEARNING: APPLICATION TO NON-INTRUSIVE LOAD MONITORING |
3739 | MULTI-LABEL SOUND EVENT RETRIEVAL USING A DEEP LEARNING-BASED SIAMESE STRUCTURE WITH A PAIRWISE PRESENCE MATRIX |
5942 | Multi-layer Content Interaction through Quaternion Product for Visual Question Answering |
3043 | Multi-level deep neural network adaptation for speaker verification using MMD and consistency regularization |
5452 | MULTILINEAR GENERALIZED SINGULAR VALUE DECOMPOSITION (ML-GSVD) WITH APPLICATION TO COORDINATED BEAMFORMING IN MULTI-USER MIMO SYSTEMS |
1023 | MULTILINGUAL ACOUSTIC WORD EMBEDDING MODELS FOR PROCESSING ZERO-RESOURCE LANGUAGES |
2063 | MULTILINGUAL GRAPHEME-TO-PHONEME CONVERSION WITH BYTE REPRESENTATION |
4412 | MULTI-MICROPHONE COMPLEX SPECTRAL MAPPING FOR SPEECH DEREVERBERATION |
5035 | MULTIMODAL ACTIVE SPEAKER DETECTION AND VIRTUAL CINEMATOGRAPHY FOR VIDEO CONFERENCING |
4327 | MULTIMODAL LEARNING FOR CLASSROOM ACTIVITY DETECTION |
5468 | MULTI-MODAL SELF-SUPERVISED PRE-TRAINING FOR JOINT OPTIC DISC AND CUP SEGMENTATION IN EYE FUNDUS IMAGES |
4640 | MULTIMODAL SPEAKER DIARIZATION OF REAL-WORLD MEETINGS USING D-VECTORS WITH SPATIAL FEATURES |
3276 | MULTIMODAL TRANSFORMER FUSION FOR CONTINUOUS EMOTION RECOGNITION |
3864 | Multimodal Violence Detection in Videos |
5463 | MULTI-MOTIFGAN (MMGAN): MOTIF-TARGETED GRAPH GENERATION AND PREDICTION |
5236 | MULTI-PATCH AGGREGATION MODELS FOR RESAMPLING DETECTION |
5659 | MULTIPLE POINTS INPUT FOR CONVOLUTIONAL NEURAL NETWORKS IN REPLAY ATTACK DETECTION |
4368 | Multi-polarization information fusion for object contour display in passive millimeter-wave and terahertz security imaging |
1825 | MULTI-RESOLUTION MULTI-HEAD ATTENTION IN DEEP SPEAKER EMBEDDING |
4265 | MULTI-RESOLUTION OVERLAPPING STRIPES NETWORK FOR PERSON RE-IDENTIFICATION |
1898 | MULTI-SCALE DEEP FEATURE FUSION FOR VEHICLE RE-IDENTIFICATION |
1375 | MULTI-SCALE FEATURE AGGREGATION NETWORK WITH WAVELET STRUCTURE SIMILARITY LOSS FUNCTION FOR SINGLE IMAGE DEHAZING |
4990 | MULTI-SCALE OCTAVE CONVOLUTIONS FOR ROBUST SPEECH RECOGNITION |
3297 | MULTI-SCALE RESIDUAL NETWORK FOR IMAGE CLASSIFICATION |
5083 | MULTI-SPEAKER AND MULTI-DOMAIN EMOTIONAL VOICE CONVERSION USING FACTORIZED HIERARCHICAL VARIATIONAL AUTOENCODER |
1100 | Multispectral Fusion of RGB and NIR Images Using Weighted Least Squares and Alternating Guidance |
3993 | MULTI-STAGE RESIDUAL HIDING FOR IMAGE-INTO-AUDIO STEGANOGRAPHY |
4576 | MULTISTATE ENCODING WITH END-TO-END SPEECH RNN TRANSDUCER NETWORK |
2042 | MULTI-STEP ONLINE UNSUPERVISED DOMAIN ADAPTATION |
4874 | MULTITAPER SPECTRAL GRANGER CAUSALITY WITH APPLICATION TO SSVEP |
2681 | MULTI-TASK CENTER-OF-PRESSURE METRICS ESTIMATION FROM SKELETON USING GRAPH CONVOLUTIONAL NETWORK |
1682 | MULTITASK LEARNING AND MULTISTAGE FUSION FOR DIMENSIONAL AUDIOVISUAL EMOTION RECOGNITION |
2831 | MULTITASK LEARNING FOR DARPA LORELEI’S SITUATION FRAME EXTRACTION TASK |
3039 | Multi-task Learning for Speaker Verification and Voice Trigger Detection |
3064 | Multi-task Learning for Voice Trigger Detection |
1406 | MULTI-TASK LEARNING IN AUTONOMOUS DRIVING SCENARIOS VIA ADAPTIVE FEATURE REFINEMENT NETWORKS |
3776 | Multi-Task Learning via SA-FPN and EJ-Head |
1865 | MULTITASK LEARNING WITH CAPSULE NETWORKS FOR SPEECH-TO-INTENT APPLICATIONS |
2611 | Multi-task self-supervised learning for robust speech recognition |
3999 | MULTI-TIME-SCALE CONVOLUTION FOR EMOTION RECOGNITION FROM SPEECH AUDIO SIGNALS |
5725 | MULTIUSER MASSIVE MIMO DOWNLINK PRECODING USING SECOND-ORDER SPATIAL SIGMA-DELTA MODULATION |
5552 | MULTIVARIATE TROPICAL REGRESSION AND PIECEWISE-LINEAR SURFACE FITTING |
1271 | MULTI-VIEW BAYESIAN GENERATIVE MODEL FOR MULTI-SUBJECT FMRI DATA ON BRAIN DECODING OF VIEWED IMAGE CATEGORIES |
5409 | Multi-View Clustering via Mixed Embedding Approximation |
4131 | MULTI-VIEW SHAPE ESTIMATION OF TRANSPARENT CONTAINERS |
1602 | Multi-view Wasserstein discriminant analysis with entropic regularized Wasserstein distance |
2123 | MULTI-WAY MULTI-VIEW DEEP AUTOENCODER FOR IMAGE FEATURE LEARNING WITH MULTI-LEVEL GRAPH REGULARIZATION |
1798 | MUTUAL-INFORMATION-BASED SENSOR PLACEMENT FOR SPATIAL SOUND FIELD RECORDING |
3926 | NASIL: NEURAL ARCHITECTURE SEARCH WITH IMITATION LEARNING |
3091 | Near capacity RCQD constellations for PAPR reduction of OFDM systems |
3332 | NEAREST KRONECKER PRODUCT DECOMPOSITION BASED NORMALIZED LEAST MEAN SQUARE ALGORITHM |
6153 | NEAR-FIELD ACOUSTIC SOURCE LOCALIZATION USING SPHERICAL HARMONIC FEATURES |
4278 | Near-optimal Bayes Error Based Feature Selection |
1389 | NEAR-OPTIMAL INTERFERENCE EXPLOITATION 1-BIT MASSIVE MIMO PRECODING VIA PARTIAL BRANCH-AND-BOUND |
1942 | NEURAL ATTENTIVE MULTIVIEW MACHINES |
1689 | NEURAL CODING STRATEGIES FOR EVENT-BASED VISION DATA |
2333 | NEURAL LATTICE SEARCH FOR SPEECH RECOGNITION |
4836 | NEURAL NETWORK TRAINING WITH APPROXIMATE LOGARITHMIC COMPUTATIONS |
3488 | Neural Network Wiretap Code Design for Multi-Mode Fiber Optical Channels |
4079 | NEURAL ORACLE SEARCH ON N-BEST HYPOTHESES |
2689 | NEURAL PERCUSSIVE SYNTHESIS PARAMETERISED BY HIGH-LEVEL TIMBRAL FEATURES |
1671 | NEURAL TIME WARPING FOR MULTIPLE SEQUENCE ALIGNMENT |
4683 | NEUTRAL TO LOMBARD SPEECH CONVERSION WITH DEEP LEARNING |
2306 | NEW METRICS FOR EVALUATING THE ACCURACY OF FUNDAMENTAL FREQUENCY ESTIMATION APPROACHES IN MUSICAL SIGNALS |
2742 | NODE-ASYNCHRONOUS SPECTRAL CLUSTERING ON DIRECTED GRAPHS |
6144 | NOISE STATISTICS OBLIVIOUS GARD FOR ROBUST REGRESSION WITH SPARSE OUTLIERS |
2823 | NOISE-ROBUST KEY-PHRASE DETECTORS FOR AUTOMATED CLASSROOM FEEDBACK |
5850 | Non Local Multi-Fiber Network for Action Anticipation in Videos |
3558 | NONCOHERENT MAXIMUM-LIKELIHOOD DETECTION FOR AMBIENT BACKSCATTERING COMMUNICATIONS OVER AMBIENT OFDM SIGNALS |
2837 | NON-EXPERTS OR EXPERTS? STATISTICAL ANALYSES OF MOS USING DSIS METHOD |
2330 | Non-Gaussian BLE-based Indoor Localization via Gaussian Sum Filtering Coupled with Wasserstein Distance |
2840 | NON-GRIFFIN–LIM TYPE SIGNAL RECOVERY FROM MAGNITUDE SPECTROGRAM |
6121 | NON-ITERATIVE SUBSPACE-BASED DOA ESTIMATION IN THE PRESENCE OF NONUNIFORM NOISE |
5615 | NONLINEAR SPATIAL FILTERING FOR MULTICHANNEL SPEECH ENHANCEMENT IN INHOMOGENEOUS NOISE FIELDS |
2442 | Non-local Nested Residual Attention Network for Stereo Image Super-resolution |
3359 | Non-parametric Community Change-points Detection in Streaming Graph Signals |
1639 | NON-UNIFORM VIDEO TIME-LAPSE METHOD BASED ON MOTION SCENARIO AND STABILIZATION CONSTRAINT |
4498 | NORMALIZED LEAST-MEAN-SQUARE ALGORITHMS WITH MINIMAX CONCAVE PENALTY |
5128 | OBJECT DETECTION AND 3D ESTIMATION VIA AN FMCW RADAR USING A FULLY CONVOLUTIONAL NETWORK |
1410 | OBJECT DETECTION WITH COLOR AND DEPTH IMAGES WITH MULTI-REDUCED REGION PROPOSAL NETWORK AND MULTI-POOLING |
3745 | OBJECT SURFACE ESTIMATION FROM RADAR IMAGES |
1721 | OBJECTIVE BAYESIAN DETECTION UNDER SPATIALLY CORRELATED GAUSSIAN OBSERVATIONS FOR MULTI-ANTENNA COGNITIVE RADIO NETWORK |
5654 | OH, JEEZ! OR UH-HUH? A LISTENER-AWARE BACKCHANNEL PREDICTOR ON ASR TRANSCRIPTIONS |
1480 | On Binary Sequence Set Design with Applications to Automotive Radar |
2938 | ON CRAMÉR-RAO LOWER BOUNDS WITH RANDOM EQUALITY CONSTRAINTS |
3665 | ON DESIGN OF OPTIMAL SMART METER PRIVACY CONTROL STRATEGY AGAINST ADVERSARIAL MAP DETECTION |
3736 | ON DISTRIBUTED STOCHASTIC GRADIENT ALGORITHMS FOR GLOBAL OPTIMIZATION |
4965 | On Distributed Stochastic Gradient Descent for Nonconvex Functions in the Presence of Byzantines |
2485 | ON DIVERGENCE APPROXIMATIONS FOR UNSUPERVISED TRAINING OF DEEP DENOISERS BASED ON STEIN’S UNBIASED RISK ESTIMATOR |
5184 | ON END-TO-END MULTI-CHANNEL TIME DOMAIN SPEECH SEPARATION IN REVERBERANT ENVIRONMENTS |
2517 | ON EXPONENTIALLY CONSISTENCY OF LINKAGE-BASED HIERARCHICAL CLUSTERING ALGORITHM USING KOLMOGOROV-SMIRNOV DISTANCE |
1672 | ON HARMONIC APPROXIMATIONS OF INHARMONIC SIGNALS |
4226 | ON MEASURING DOPPLER SHIFTS BETWEEN TAGS IN A BACKSCATTERING TAG-TO-TAG NETWORK WITH APPLICATIONS IN TRACKING |
1205 | ON MODELING ASR WORD CONFIDENCE |
2236 | On Network Science and Mutual Information for Explaining Deep Neural Networks |
4464 | ON POLAR CODING FOR FINITE BLOCKLENGTH SECRET KEY GENERATION OVER WIRELESS CHANNELS |
3479 | ON REGULARIZATION PARAMETER FOR L0-SPARSE COVARIANCE FITTING BASED DOA ESTIMATION |
5855 | ON ROBUST VARIANCE FILTERING AND CHANGE OF VARIANCE DETECTION |
3355 | ON THE BYZANTINE ROBUSTNESS OF CLUSTERED FEDERATED LEARNING |
6066 | On the choice of graph neural network architectures |
2345 | ON THE DEGREES OF FREEDOM IN TOTAL VARIATION MINIMIZATION |
1174 | On the Determination of Window Length in the Short-Time Fourier Transform with Rényi Entropy |
3781 | On the effect of BRDFs on Phasor Field NLOS imaging |
3311 | ON THE FREQUENCY DOMAIN DETECTION OF HIGH DIMENSIONAL TIME SERIES |
5501 | On The Impact of Language Familiarity In Talker Change Detection |
1281 | ON THE IMPORTANCE OF VOCAL TRACT CONSTRICTION FOR SPEAKER CHARACTERIZATION: THE WHISPERED SPEECH STUDY |
3033 | ON THE LIMIT DISTRIBUTION OF THE CANONICAL CORRELATION COEFFICIENTS BETWEEN THE PAST AND THE FUTURE OF A HIGH-DIMENSIONAL WHITE NOISE |
2048 | ON THE OPPORTUNISTIC USE OF COMMERCIAL KU AND KA BAND SATCOM NETWORKS FOR RAIN RATE ESTIMATION: POTENTIALS AND CRITICAL ISSUES |
5625 | ON THE STABILITY OF POLYNOMIAL SPECTRAL GRAPH FILTERS |
1497 | On Throughput of Millimeter Wave MIMO Systems with Low Resolution ADCs |
4262 | One-bit Compressed Sensing using Generative Models |
5713 | One-bit DoA estimation via Sparse Linear Arrays |
2583 | ONE-BIT NORMALIZED SCATTER MATRIX ESTIMATION FOR COMPLEX ELLIPTICALLY SYMMETRIC DISTRIBUTIONS |
4323 | ONE-BIT SAMPLING IN FRACTIONAL FOURIER DOMAIN |
4855 | One-shot Parametric Audio Production Style Transfer With Application to Frequency Equalization |
3723 | ONE-SHOT VOICE CONVERSION BY VECTOR QUANTIZATION |
5581 | ONE-SHOT VOICE CONVERSION USING STAR-GAN |
1729 | ONLINE CHANNEL ESTIMATION FOR HYBRID BEAMFORMING ARCHITECTURES |
2259 | Online Community Detection by Spectral CUSUM |
4862 | ONLINE GRAPH TOPOLOGY INFERENCE WITH KERNELS FOR BRAIN CONNECTIVITY ESTIMATION |
2838 | ONLINE META-LEARNING ON NON-CONVEX SETTING |
5396 | ONLINE POSITRON EMISSION TOMOGRAPHY BY ONLINE PORTFOLIO SELECTION |
3905 | ONLINE TENSOR COMPLETION AND FREE SUBMODULE TRACKING WITH THE T-SVD |
1751 | ON-THE-FLY FEATURE SELECTION AND CLASSIFICATION WITH APPLICATION TO CIVIC ENGAGEMENT PLATFORMS |
2276 | OOV RECOVERY WITH EFFICIENT 2ND PASS DECODING AND OPEN-VOCABULARY WORD-LEVEL RNNLM RESCORING FOR HYBRID ASR |
3919 | OPEN SET VIDEO CAMERA MODEL VERIFICATION |
1911 | OpenDenoising: an Extensible Benchmark for Building Comparative Studies of Image Denoisers |
3574 | OPPORTUNISTIC USE OF GNSS SIGNALS TO CHARACTERIZE THE ENVIRONMENT BY MEANS OF MACHINE LEARNING BASED PROCESSING |
3663 | OPTIMAL DESIGN OF ENERGY-EFFICIENT CELL-FREE MASSIVE MIMO: JOINT POWER ALLOCATION AND LOAD BALANCING |
3957 | Optimal Joint Channel Estimation and Data Detection by L1-norm PCA for Streetscape IoT |
2610 | OPTIMAL LAPLACIAN REGULARIZATION FOR SPARSE SPECTRAL COMMUNITY DETECTION |
6150 | Optimal Leak Factor Selection for the Output-Constrained Leaky Filtered-Input Least Mean Square Algorithm |
3989 | OPTIMAL POWER FLOW USING GRAPH NEURAL NETWORKS |
3530 | OPTIMAL TRANSPORT BASED CHANGE POINT DETECTION AND TIME SERIES SEGMENT CLUSTERING |
3314 | Optimal transport structure of cycleGAN for unsupervised learning for inverse problems |
5373 | OPTIMAL WINDOW DESIGN FOR JOINT SPATIAL-SPECTRAL DOMAIN FILTERING OF SIGNALS ON THE SPHERE |
5376 | OPTIMAL WINDOW DESIGN FOR W-OFDM |
2009 | OPTIMIZED SENSOR SELECTION FOR JOINT RADAR-COMMUNICATION SYSTEMS |
1965 | OPTIMIZED SINGLE CARRIER TRANSCEIVER FOR FUTURE SUB-TERAHERTZ APPLICATIONS |
4694 | OPTIMIZING BACKSCATTERING COEFFICIENT DESIGN FOR MINIMIZING BER AT MONOSTATIC MIMO READER |
2895 | OPTIMIZING BAYESIAN HMM BASED X-VECTOR CLUSTERING FOR THE SECOND DIHARD SPEECH DIARIZATION CHALLENGE |
2031 | OPTIMUM KERNEL PARTICLE FILTER FOR ASYMMETRIC LAPLACE NOISE |
3188 | ORDINAL LEARNING FOR EMOTION RECOGNITION IN CUSTOMER SERVICE CALLS |
5901 | ORTHOGONAL TRAINING FOR TEXT-INDEPENDENT SPEAKER VERIFICATION |
2027 | OVERCOMING HIGH NANOPORE BASECALLER ERROR RATES FOR DNA STORAGE VIA BASECALLER-DECODER INTEGRATION AND CONVOLUTIONAL CODES |
1642 | OVERDETERMINED INDEPENDENT VECTOR ANALYSIS |
4424 | OVERLAP LOCAL-SGD: AN ALGORITHMIC APPROACH TO HIDE COMMUNICATION DELAYS IN DISTRIBUTED SGD |
4104 | OVERLAP-AWARE DIARIZATION: RESEGMENTATION USING NEURAL END-TO-END OVERLAPPED SPEECH DETECTION |
3231 | Overlapped State Hidden Semi-Markov Model for Grouped Multiple Sequences |
1731 | PACO AND PACO-DCT: PATCH CONSENSUS AND ITS APPLICATION TO INPAINTING |
4510 | PAGAN: A PHASE-ADAPTED GENERATIVE ADVERSARIAL NETWORKS FOR SPEECH ENHANCEMENT |
2402 | PAN: PHONEME-AWARE NETWORK FOR MONAURAL SPEECH ENHANCEMENT |
5461 | PARALLEL WAVEGAN: A FAST WAVEFORM GENERATION MODEL BASED ON GENERATIVE ADVERSARIAL NETWORKS WITH MULTI-RESOLUTION SPECTROGRAM |
5442 | PARALLELIZING ADAM OPTIMIZER WITH BLOCKWISE MODEL-UPDATE FILTERING |
3285 | PARAMETER ESTIMATION OF IN-CITY FRONTAL RAINFALL PROPAGATION |
4929 | PARSING MAP GUIDED MULTI-SCALE ATTENTION NETWORK FOR FACE HALLUCINATION |
2545 | Partial AUC optimization based deep speaker embeddings with class-center learning for text-independent speaker verification |
3793 | PARTICLE FILTER WITH REJECTION CONTROL AND UNBIASED ESTIMATOR OF THE MARGINAL LIKELIHOOD |
1598 | PARTICLE FILTERING ON THE COMPLEX STIEFEL MANIFOLD WITH APPLICATION TO SUBSPACE TRACKING |
4023 | PARTICLE GROUP METROPOLIS METHODS FOR TRACKING THE LEAF AREA INDEX |
4635 | PASSIVE INTELLIGENT SURFACE ASSISTED MIMO POWERED SUSTAINABLE IOT |
6084 | PASSIVE JOINT LOCALIZATION AND SYNCHRONIZATION OF DISTRIBUTED MICROPHONE ARRAYS |
3441 | Patch-Level Selection and Breadth-First Prediction Strategy for Reversible Data Hiding |
3113 | Pathloss Prediction using Deep Learning with Applications to Cellular Optimization and Efficient D2D Link Scheduling |
2337 | Peer to Peer offloading with delayed feedback: An adversary bandit approach |
2759 | Perception-Distortion Trade-Off with Restricted Boltzmann Machines |
3362 | PERCEPTUAL LOSS FUNCTION FOR NEURAL MODELLING OF AUDIO SYSTEMS |
6087 | PERFORMANCE ANALYSIS AND CONSTELLATION OPTIMIZATION OF STAR-QAM-AIDED DIFFERENTIAL FASTER-THAN-NYQUIST SIGNALING |
3438 | PERFORMANCE ANALYSIS FOR PATH ATTENUATION ESTIMATION OF MICROWAVE SIGNALS DUE TO RAINFALL AND BEYOND |
3823 | PERFORMANCE BOUNDS FOR DISPLACED SENSOR AUTOMOTIVE RADAR IMAGING |
5965 | Performance Comparison of Lossless Compression Strategies for Dynamic Vision Sensor Data |
1949 | PERFORMANCE STUDY OF A CONVOLUTIONAL TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME SPEECH DENOISING |
6092 | PERMUTATIONS UNLABELED BEYOND SAMPLING UNKNOWN |
4287 | Person Identification using Deep Convolutional Neural Networks on Short-Term Signals from Wearable Sensors |
4537 | PEVD-BASED SPEECH ENHANCEMENT IN REVERBERANT ENVIRONMENTS |
1979 | PHASE RECONSTRUCTION BASED ON RECURRENT PHASE UNWRAPPING WITH DEEP NEURAL NETWORKS |
2511 | PHONEME BOUNDARY DETECTION USING LEARNABLE SEGMENTAL FEATURES |
4064 | PHONETIC FEEDBACK FOR SPEECH ENHANCEMENT WITH AND WITHOUT PARALLEL SPEECH DATA |
4116 | Phylogenetic Minimum Spanning Tree Reconstruction Using Autoencoders |
2929 | PITCH ESTIMATION VIA SELF-SUPERVISION |
2963 | PITCHNET: UNSUPERVISED SINGING VOICE CONVERSION WITH PITCH ADVERSARIAL NETWORK |
3155 | PIXEL-LEVEL SELF-PACED LEARNING FOR SUPER-RESOLUTION |
2425 | PIXEL-WISE LINEAR/NONLINEAR NONNEGATIVE MATRIX FACTORIZATION FOR UNMIXING OF HYPERSPECTRAL DATA |
3670 | PLAYING TECHNIQUE RECOGNITION BY JOINT TIME–FREQUENCY SCATTERING |
3168 | POLARIZATION PARAMETERS ESTIMATION WITH SCALAR SENSOR ARRAYS |
4846 | POLARIZING FRONT ENDS FOR ROBUST CNNS |
5946 | POLYPHONIC SOUND EVENT DETECTION USING TRANSPOSED CONVOLUTIONAL RECURRENT NEURAL NETWORK |
5222 | PORTFOLIO CUTS: A GRAPH-THEORETIC FRAMEWORK TO DIVERSIFICATION |
2121 | Pose Refinement: bridging the gap between Unsupervised Learning and Geometric Methods for Visual Odometry |
1472 | POSITION CONSTRAINT LOSS FOR FASHION LANDMARK ESTIMATION |
2776 | POSITIVE SEMIDEFINITE MATRIX FACTORIZATION: A LINK TO PHASE RETRIEVAL AND A BLOCK GRADIENT ALGORITHM |
3042 | POSITIVE SOLUTIONS FOR LARGE RANDOM LINEAR SYSTEMS |
2723 | POWER OPTIMIZATION USING EMBEDDED AUTOMATIC GAIN CONTROL ALGORITHM WITH PHOTOPLETHYSMOGRAPHY SIGNAL QUALITY CLASSIFICATION |
2208 | POWER SPECTRUM OPTIMIZATION FOR CAPACITY OF THE EXTENDED SPECTRUM HYBRID FIBER COAX NETWORK |
6071 | Precise Performance Analysis of the Box-Elastic Net Under Matrix Uncertainties |
5157 | Preconditioned Ghost Imaging via Sparsity Constraint |
4237 | Preconditioning ADMM for Fast Decentralized Optimization |
2389 | PREDICTING PERFORMANCE OUTCOME WITH A CONVERSATIONAL GRAPH CONVOLUTIONAL NETWORK FOR SMALL GROUP INTERACTIONS |
2336 | Predicting word error rate for reverberant speech |
2077 | PREDICTION OF INDIVIDUAL PROGRESSION RATE IN PARKINSON’S DISEASE USING CLINICAL MEASURES AND BIOMECHANICAL MEASURES OF GAIT AND POSTURAL STABILITY |
5464 | PREDICTION OF VESSEL TRAJECTORIES FROM AIS DATA VIA SEQUENCE-TO-SEQUENCE RECURRENT NEURAL NETWORKS |
3059 | PREDICTION OF VOICING AND THE F0 CONTOUR FROM ELECTROMAGNETIC ARTICULOGRAPHY DATA FOR ARTICULATION-TO-SPEECH SYNTHESIS |
2352 | Preference-aware Mask for Session-based Recommendation with Bidirectional Transformer |
1808 | PRESERVATION OF ANOMALOUS SUBGROUPS ON VARIATIONAL AUTOENCODER TRANSFORMED DATA |
5249 | PRE-TRAINING FOR QUERY REWRITING A SPOKEN LANGUAGE UNDERSTANDING SYSTEM |
3238 | Primal-Dual Stochastic Subgradient Method for Log-determinant Optimization |
1553 | Primary path estimator based on individual secondary path for ANC headphones |
5482 | PRINCIPAL ANGLE DETECTOR FOR SUBSPACE SIGNAL WITH STRUCTURED UNKNOWN INTERFERENCE |
2205 | PRINCIPLE-INSPIRED MULTI-SCALE AGGREGATION NETWORK FOR EXTREMELY LOW-LIGHT IMAGE ENHANCEMENT |
2128 | Privacy aware acoustic scene synthesis using deep spectral feature inversion |
2041 | PRIVACY-AWARE QUICKEST CHANGE DETECTION |
4152 | PRIVACY-PRESERVING IMAGE SHARING VIA SPARSIFYING LAYERS ON CONVOLUTIONAL GROUPS |
5304 | PRIVACY-PRESERVING PATTERN RECOGNITION USING ENCRYPTED SPARSE REPRESENTATIONS IN L0 NORM MINIMIZATION |
3923 | PRIVACY-PRESERVING PHISHING WEB PAGE CLASSIFICATION VIA FULLY HOMOMORPHIC ENCRYPTION |
2808 | PRIVATE FL-GAN: DIFFERENTIAL PRIVACY SYNTHETIC DATA GENERATION BASED ON FEDERATED LEARNING |
3480 | PROBABILISTIC FILTER AND SMOOTHER FOR VARIATIONAL INFERENCE OF BAYESIAN LINEAR DYNAMICAL SYSTEMS |
2479 | PROCESSING CONVOLUTIONAL NEURAL NETWORKS ON CACHE |
3217 | Programmable Dataflow Accelerators: A 5G OFDM Modulation/Demodulation Case Study |
3078 | PROGRESSIVE MULTI-TARGET NETWORK BASED SPEECH ENHANCEMENT WITH SNR-PRESELECTION FOR ROBUST SPEAKER DIARIZATION |
5933 | PROJECTED WEIGHT REGULARIZATION TO IMPROVE NEURAL NETWORK GENERALIZATION |
4689 | Projection Free Dynamic Online Learning |
3318 | Propeller Noise Detection with Deep Learning |
4685 | PROTOTYPICAL NETWORKS FOR SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION |
4847 | PROXIMAL DISTANCE ALGORITHM FOR NONCONVEX QCQP WITH BEAMFORMING APPLICATIONS |
2613 | PROXIMAL MULTITASK LEARNING OVER DISTRIBUTED NETWORKS WITH JOINTLY SPARSE STRUCTURE |
2976 | Pseudo Labeling and Negative Feedback Learning for Large-scale Multi-label Domain Classification |
5290 | PSEUDO LIKELIHOOD CORRECTION TECHNIQUE FOR LOW RESOURCE ACCENTED ASR |
1575 | PYANNOTE.AUDIO: NEURAL BUILDING BLOCKS FOR SPEAKER DIARIZATION |
4501 | Q-GADMM: Quantized Group ADMM for Communication Efficient Decentralized Machine Learning |
4769 | Q-LEARNING BASED PREDICTIVE RELAY SELECTION FOR OPTIMAL RELAY BEAMFORMING |
4430 | QOS-AWARE FLOW CONTROL FOR POWER-EFFICIENT DATA CENTER NETWORKS WITH DEEP REINFORCEMENT LEARNING |
5905 | QUALITY-OF-SERVICE PREDICTION FOR PHYSICAL-LAYER SECURITY VIA SECRECY MAPS |
2645 | QUANTIZED TENSOR ROBUST PRINCIPAL COMPONENT ANALYSIS |
4407 | Quantum State Discrimination with Local Operations and Classical Communications |
2249 | QUARTZNET: DEEP AUTOMATIC SPEECH RECOGNITION WITH 1D TIME-CHANNEL SEPARABLE CONVOLUTIONS. |
3666 | QUICKEST CHANGE DETECTION IN ANONYMOUS HETEROGENEOUS SENSOR NETWORKS |
4003 | Quickest Detection of Growing Dynamic Anomalies in Networks |
5009 | RATE ASSIGNMENT IN 360-DEGREE VIDEO TILED STREAMING USING RANDOM FOREST REGRESSION |
6155 | Rate-Constrained Noise Reduction in Wireless Acoustic Sensor Networks |
4431 | RATE-INVARIANT AUTOENCODING OF TIME-SERIES |
5054 | RAW WAVEFORM BASED END-TO-END DEEP CONVOLUTIONAL NETWORK FOR SPATIAL LOCALIZATION OF MULTIPLE ACOUSTIC SOURCES |
1049 | RAY SEPARATION AND SOURCE DEPTH ESTIMATION BASED ON SOUND PRESSURE FIELD TRANSFORMATION |
4809 | RDE-MOGA: AUTOMATIC SELECTION OF RATE-DISTORTION-ENERGY CONTROL POINTS FOR VIDEO ENCODERS USING MULTI-OBJECTIVE GENETIC ALGORITHM |
3577 | REALIZABILITY OF PLANAR POINT EMBEDDINGS FROM ANGLE MEASUREMENTS |
5123 | REAL-TIME BINAURAL SPEECH SEPARATION THAT PRESERVES SPATIAL CUES |
3440 | REAL-TIME HAND GESTURE RECOGNITION USING TEMPORAL MUSCLE ACTIVATION MAPS OF MULTI-CHANNEL SEMG SIGNALS |
5551 | REAL-TIME IMPLEMENTATION ASPECTS OF LARGE INTELLIGENT SURFACES |
3637 | REAL-TIME SPEECH ENHANCEMENT USING EQUILIBRIATED RNN |
2290 | Real-Time Task Offloading for Large-Scale Mobile Edge Computing |
4447 | REAL-TIME, UNIVERSAL, AND ROBUST ADVERSARIAL ATTACKS AGAINST SPEAKER RECOGNITION SYSTEMS |
4122 | RECEIVER DESIGN AND AGC OPTIMIZATION WITH SELF INTERFERENCE INDUCED SATURATION |
2697 | RECEPTIVE FIELD PYRAMID NETWORK FOR OBJECT DETECTION |
3763 | RECONSTRUCTION OF FRI SIGNALS USING DEEP NEURAL NETWORK APPROACHES |
6083 | Recovery of Binary Sparse Signals From Compressed Linear Measurements via Polynomial Optimization |
2001 | RECURRENT NEURAL AUDIOVISUAL WORD EMBEDDINGS FOR SYNCHRONIZED SPEECH AND REAL-TIME MRI |
5185 | RECURSIVE PREDICTION OF GRAPH SIGNALS WITH INCOMING NODES |
4665 | REDUCED-COMPLEXITY SINGULAR VALUE DECOMPOSITION FOR TUCKER DECOMPOSITION: ALGORITHM AND HARDWARE |
2299 | REDUNDANT CONVOLUTIONAL NETWORK WITH ATTENTION MECHANISM FOR MONAURAL SPEECH ENHANCEMENT |
1316 | REFLECTANCE-GUIDED, CONTRAST-ACCUMULATED HISTOGRAM EQUALIZATION |
5008 | REGRESSION BEFORE CLASSIFICATION FOR TEMPORAL ACTION DETECTION |
2975 | REGULARIZED BEAMFORMER FOR THE SPHERICAL MICROPHONE ARRAY TO COPE WITH THE WHITE NOISE AMPLIFICATION |
3565 | REGULARIZED FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION WITH ILRMA-BASED PRIOR DISTRIBUTION OF JOINT-DIAGONALIZATION PROCESS |
1990 | Regularized partial phase synchrony index applied to dynamical functional connectivity estimation |
3283 | REINFORCED DEPTH-AWARE DEEP LEARNING FOR SINGLE IMAGE DEHAZING |
6131 | Relative Acoustic Transfer Function Estimation in Wireless Acoustic Sensor Networks |
3631 | RELATIVE COST BASED MODEL SELECTION FOR SPARSE HIGH-DIMENSIONAL LINEAR REGRESSION MODELS |
2641 | RELIABLE AND SECURE TRANSMISSION FOR FUTURE NETWORKS |
3828 | Residual Attention Network for Wavelet Domain Super-Resolution |
4564 | RESIDUAL RECURRENT NEURAL NETWORK FOR SPEECH ENHANCEMENT |
2728 | Resilient Distributed Recovery of Large Fields |
2846 | Resilient to Byzantine Attacks Finite-Sum Optimization over Networks |
5053 | RESOURCE MANAGEMENT IN THE MULTIBEAM NOMA-BASED SATELLITE DOWNLINK |
4444 | RESTING-STATE EEG-BASED BIOMETRICS WITH SIGNALS FEATURES EXTRACTED BY MULTIVARIATE EMPIRICAL MODE DECOMPOSITION |
4974 | RETHINKING RETINAL LANDMARK LOCALIZATION AS POSE ESTIMATION: NAIVE SINGLE STACKED NETWORK FOR OPTIC DISK AND FOVEA DETECTION |
2392 | Rethinking Temporal-related Sample for Human Action Recognition |
3597 | Retinal Vessel Segmentation via A Semantics and Multi-Scale Aggregation Network |
5838 | RE-TRANSLATION STRATEGIES FOR LONG FORM, SIMULTANEOUS, SPOKEN LANGUAGE TRANSLATION |
4531 | RETRIEVING VOCAL-TRACT RESONANCE AND ANTI-RESONANCE FROM HIGH-PITCHED VOWELS USING A RAHMONIC SUBTRACTION TECHNIQUE |
2799 | REV-AE: A LEARNED FRAME SET FOR IMAGE RECONSTRUCTION |
1981 | REVEALING BACKDOORS, POST-TRAINING, IN DNN CLASSIFIERS VIA NOVEL INFERENCE ON OPTIMIZED PERTURBATIONS INDUCING GROUP MISCLASSIFICATION |
4204 | REVEALING HIDDEN DRAWINGS IN LEONARDO'S 'THE VIRGIN OF THE ROCKS' FROM MACRO X-RAY FLUORESCENCE SCANNING DATA THROUGH ELEMENT LINE LOCALISATION |
3120 | REVERSAL NO LONGER MATTERS: ATTENTION-BASED ARRHYTHMIA DETECTION WITH LEAD-REVERSAL ECG DATA |
3910 | REVISIT OF ESTIMATE SEQUENCE FOR ACCELERATED GRADIENT METHOD |
5528 | REVISITING FAST SPECTRAL CLUSTERING WITH ANCHOR GRAPH |
1869 | RGB-D BASED MULTI-MODAL DEEP LEARNING FOR FACE IDENTIFICATION |
4042 | RIEMANNIAN FRAMEWORK FOR ROBUST COVARIANCE MATRIX ESTIMATION IN SPIKED MODELS |
4046 | RIEMANNIAN GEOMETRY AND CRAMÉR-RAO BOUND FOR BLIND SEPARATION OF GAUSSIAN SOURCES |
1336 | RISK CONVERGENCE OF CENTERED KERNEL RIDGE REGRESSION WITH LARGE DIMENSIONAL DATA |
2703 | RNN-TRANSDUCER WITH STATELESS PREDICTION NETWORK |
2670 | ROBUST AND COMPUTATIONALLY-EFFICIENT ANOMALY DETECTION USING POWERS-OF-TWO NETWORKS |
1047 | ROBUST AND STEERABLE KRONECKER PRODUCT DIFFERENTIAL BEAMFORMING WITH RECTANGULAR MICROPHONE ARRAYS |
5357 | ROBUST CFAR RADAR DETECTION USING A K-NEAREST NEIGHBORS RULE |
3920 | ROBUST COVARIANCE MATRIX ESTIMATION AND PORTFOLIO ALLOCATION: THE CASE OF NON-HOMOGENEOUS ASSETS |
2446 | Robust Frequency-Domain Recursive Least M-Estimate Adaptive Filter for Acoustic System Identification |
1633 | ROBUST FULL-FOV DEPTH ESTIMATION IN TELE-WIDE CAMERA SYSTEM |
3166 | ROBUST FUNDAMENTAL FREQUENCY ESTIMATION IN COLOURED NOISE |
4952 | ROBUST GLOBAL OPTIMIZED AFFINE REGISTRATION METHOD FOR MICROSCOPIC IMAGES OF BIOLOGICAL TISSUE |
2246 | Robust Hybrid Beamforming for Satellite-Terrestrial Integrated Networks |
3500 | Robust Hybrid Precoding for Interference Exploitation in Massive MIMO Systems |
6126 | Robust Joint Estimation of Multimicrophone Signal Model Parameters |
4695 | ROBUST LIKELIHOOD RATIO TEST USING ALPHA-DIVERGENCE |
3028 | ROBUST LOW RATE SPEECH CODING BASED ON CLONED NETWORKS AND WAVENET |
1762 | ROBUST MARINE BUOY PLACEMENT FOR SHIP DETECTION USING DROPOUT K-MEANS |
2958 | ROBUST MATRIX COMPLETION VIA LP-GREEDY PURSUITS |
1218 | ROBUST MULTI-CHANNEL SPEECH RECOGNITION USING FREQUENCY ALIGNED NETWORK |
3639 | ROBUST MUSIC ESTIMATION UNDER ARRAY RESPONSE UNCERTAINTY |
1117 | ROBUST ONLINE MATRIX COMPLETION WITH GAUSSIAN MIXTURE MODEL |
1902 | Robust Online Mirror Saddle-Point Method for Constrained Resource Allocation |
4452 | ROBUST PARAMETER ESTIMATION OF CONTAMINATED DAMPED EXPONENTIALS |
3547 | ROBUST PHASE RETRIEVAL WITH OUTLIERS |
3342 | Robust Pricing Mechanism for Resource Sustainability under Privacy Constraint in Competitive Online Learning Multi-Agent Systems |
4882 | ROBUST RANK CONSTRAINED SPARSE LEARNING: A GRAPH-BASED METHOD FOR CLUSTERING |
5301 | ROBUST SPEAKER RECOGNITION USING UNSUPERVISED ADVERSARIAL INVARIANCE |
1778 | ROBUST SYMBOL-LEVEL PRECODING VIA AUTOENCODER-BASED DEEP LEARNING |
2999 | ROBUST TDOA INDOOR TRACKING USING CONSTRAINED MEASUREMENT FILTERING AND GRID-BASED FILTERING |
4007 | ROBUST TRANSMISSION OVER CHANNELS WITH CHANNEL UNCERTAINTY: AN ALGORITHMIC PERSPECTIVE |
1683 | Robust Unsupervised Audio-visual Speech Enhancement Using a Mixture of Variational Autoencoders |
4737 | ROBUST VISUAL TRACKING WITH CONTEXT-BASED ACTIVE OCCLUSION RECOGNITION |
3135 | ROBUSTNESS ASSESSMENT OF AUTOMATIC REINKE’S EDEMA DIAGNOSIS SYSTEMS |
2725 | ROBUSTNESS OF SBL IN CORRELATED ENVIRONMENTS |
1562 | ROIMIX: PROPOSAL-FUSION AMONG MULTIPLE IMAGES FOR UNDERWATER OBJECT DETECTION |
3214 | SALIENCY-BASED IMAGE CONTRAST ENHANCEMENT WITH REVERSIBLE DATA HIDING |
5727 | SALIENT OBJECT DETECTION BASED ON IMAGE BIT-MAP |
4102 | SAMPLING CLASSES OF NON-BANDLIMITED SIGNALS USING INTEGRATE-AND-FIRE DEVICES: AVERAGE CASE ANALYSIS |
5529 | SAMPLING OF SURFACES AND LEARNING FUNCTIONS IN HIGH DIMENSIONS |
3458 | SAMPLING STRATEGIES FOR GAN SYNTHETIC DATA |
3271 | SCALABLE DETECTION AND TRACKING OF EXTENDED OBJECTS |
2573 | SCALABLE KERNEL LEARNING VIA THE DISCRIMINANT INFORMATION |
1821 | SCALABLE LEARNING-BASED SAMPLING OPTIMIZATION FOR COMPRESSIVE DYNAMIC MRI |
2112 | SCALABLE MULTILINGUAL FRONTEND FOR TTS |
5540 | SCALPNET: DETECTION OF SPATIOTEMPORAL ABNORMAL INTERVALS IN EPILEPTIC EEG USING CONVOLUTIONAL NEURAL NETWORKS |
1718 | Scene Text Recognition with Temporal Convolutional Encoder |
4341 | SCENE-DEPENDENT ACOUSTIC EVENT DETECTION WITH SCENE CONDITIONING AND FAKE-SCENE-CONDITIONED LOSS |
1033 | S-DOD-CNN: Doubly Injecting Spatially-Preserved Object Information for Event Recognition |
4231 | SDTCN: SIMILARITY DRIVEN TRANSMISSION COMPUTING NETWORK FOR IMAGE DEHAZING |
4330 | SECL-UMons Database for sound event classification and localization |
5986 | SECOST: SEQUENTIAL CO-SUPERVISION FOR LARGE SCALE WEAKLY LABELED AUDIO EVENT DETECTION |
1039 | SECURE FACE RECOGNITION IN EDGE AND CLOUD NETWORKS: FROM THE ENSEMBLE LEARNING PERSPECTIVE |
3372 | SECURE IDENTIFICATION FOR GAUSSIAN CHANNELS |
1434 | Secure Symbol-Level MISO Precoding |
1380 | SED-MDD: TOWARDS SENTENCE DEPENDENT END-TO-END MISPRONUNCIATION DETECTION AND DIAGNOSIS |
3922 | SELECTION-CHANNEL-AWARE REVERSE JPEG COMPATIBILITY FOR HIGHLY RELIABLE STEGANALYSIS OF JPEG IMAGES |
4316 | SELECTIVE ATTENTION ENCODERS BY SYNTACTIC GRAPH CONVOLUTIONAL NETWORKS FOR DOCUMENT SUMMARIZATION |
2744 | SELECTIVE CONVOLUTIONAL NETWORK: AN EFFICIENT OBJECT DETECTOR WITH IGNORING BACKGROUND |
5444 | SELF-ADAPTIVE FEATURE FOOL |
2878 | SELF-ATTENTION AND RETRIEVAL ENHANCED NEURAL NETWORKS FOR ESSAY GENERATION |
3613 | SELF-ATTENTIVE SENTIMENTAL SENTENCE EMBEDDING FOR SENTIMENT ANALYSIS |
3855 | SELF-DRIVEN GRAPH VOLTERRA MODELS FOR HIGHER-ORDER LINK PREDICTION |
1695 | SELF-PACED PROBABILISTIC PRINCIPAL COMPONENT ANALYSIS FOR DATA WITH OUTLIERS |
2594 | Self-supervised Adversarial Training |
2104 | SELF-SUPERVISED DEEP LEARNING FOR FISHEYE IMAGE RECTIFICATION |
5519 | SELF-SUPERVISED DENOISING AUTOENCODER WITH LINEAR REGRESSION DECODER FOR SPEECH ENHANCEMENT |
5215 | SELF-SUPERVISED LEARNING FOR AUDIO-VISUAL SPEAKER DIARIZATION |
1496 | SELF-SUPERVISED LEARNING FOR ECG-BASED EMOTION RECOGNITION |
4523 | SELF-TRAINING FOR END-TO-END SPEECH RECOGNITION |
6151 | SELF-TUNING ALGORITHMS FOR MULTISENSOR-MULTITARGET TRACKING USING BELIEF PROPAGATION |
1206 | SEMANTIC AUGMENTATION HASHING FOR ZERO-SHOT IMAGE RETRIEVAL |
2672 | SemanticGAN: Generative Adversarial Networks for Semantic Image to Photo-realistic Image Translation |
3967 | SEMI-IMPLICIT STOCHASTIC RECURRENT NEURAL NETWORKS |
4011 | Semi-Regular Geometric Kernel Encoding & Reconstruction for Video Compression |
5476 | SEMI-SUPERVISED LEARNING BASED ON HIERARCHICAL GENERATIVE MODELS FOR END-TO-END SPEECH SYNTHESIS |
4119 | SEMI-SUPERVISED LEARNING FOR TEXT CLASSIFICATION BY LAYER PARTITIONING |
4471 | Semi-supervised learning of processes over multi-relational graphs |
4194 | Semi-supervised optimal transport methods for detecting anomalies |
5113 | Semi-supervised sentence classification based on user polarity in the social scenarios |
2779 | SEMI-SUPERVISED SPEAKER ADAPTATION FOR END-TO-END SPEECH SYNTHESIS WITH PRETRAINED MODELS |
6130 | Sensitivity in tensor decomposition |
4940 | SENSOR SELECTION FOR MODEL-FREE SOURCE LOCALIZATION: WHERE LESS IS MORE |
2685 | SEPARABLE OPTIMIZATION FOR JOINT BLIND DECONVOLUTION AND DEMIXING |
3031 | SEQUENCE-LEVEL CONSISTENCY TRAINING FOR SEMI-SUPERVISED END-TO-END AUTOMATIC SPEECH RECOGNITION |
4851 | SEQUENCE-TO-SEQUENCE AUTOMATIC SPEECH RECOGNITION WITH WORD EMBEDDING REGULARIZATION AND FUSED DECODING |
2617 | SEQUENCE-TO-SEQUENCE LABANOTATION GENERATION BASED ON MOTION CAPTURE DATA |
5931 | Sequence-to-sequence Singing Synthesis Using the Feed-forward Transformer |
5195 | SEQUENCE-TO-SUBSEQUENCE LEARNING WITH CONDITIONAL GAN FOR POWER DISAGGREGATION |
3422 | SEQUENTIAL DEEP UNROLLING WITH FLOW PRIORS FOR ROBUST VIDEO DERAINING |
5139 | SEQUENTIAL IOT DATA AUGMENTATION USING GENERATIVE ADVERSARIAL NETWORKS |
3099 | SEQUENTIAL JOINT DETECTION AND ESTIMATION WITH AN APPLICATION TO JOINT SYMBOL DECODING AND NOISE POWER ESTIMATION |
4017 | SEQUENTIAL METHODS FOR DETECTING A CHANGE IN THE DISTRIBUTION OF AN EPISODIC PROCESS |
1780 | Sequential semi-orthogonal multi-level NMF with negative residual reduction for network embedding |
3783 | SEQUENTIAL VESSEL TRAJECTORY IDENTIFICATION USING TRUNCATED VITERBI ALGORITHM |
5896 | SHADOW REMOVAL OF TEXT DOCUMENT IMAGES BY ESTIMATING LOCAL AND GLOBAL BACKGROUND COLORS |
2331 | Shape from Bandwidth: Central Projection Case |
2972 | SHORT AND SQUEEZED: ACCELERATING THE COMPUTATION OF ANTISPARSE REPRESENTATIONS WITH SAFE SQUEEZING |
5015 | SIGHT TO SOUND: AN END-TO-END APPROACH FOR VISUAL PIANO TRANSCRIPTION |
3246 | SIGNAL CLUSTERING WITH CLASS-INDEPENDENT SEGMENTATION |
4080 | SIGNAL SENSING AND RECONSTRUCTION PARADIGMS FOR A NOVEL MULTI-SOURCE STATIC COMPUTED TOMOGRAPHY SYSTEM |
3839 | SIGNAL-AWARE BROADBAND DOA ESTIMATION USING ATTENTION MECHANISMS |
2527 | SIMILARITY LEARNING FOR COVER SONG IDENTIFICATION USING CROSS-SIMILARITY MATRICES OF MULTI-LEVEL DEEP SEQUENCES |
4180 | SIMPLE CACHING SCHEMES FOR NON-HOMOGENEOUS MISO CACHE-AIDED COMMUNICATION VIA CONVEXITY |
1510 | SIMPLIFIED DYNAMIC SC-FLIP POLAR DECODING |
2327 | SIMULTANEOUS SEPARATION AND TRANSCRIPTION OF MIXTURES WITH MULTIPLE POLYPHONIC AND PERCUSSIVE INSTRUMENTS |
5246 | SINGING VOICE CONVERSION WITH DISENTANGLED REPRESENTATIONS OF SINGER AND VOCAL TECHNIQUE USING VARIATIONAL AUTOENCODERS |
4630 | SINGLE FREQUENCY FILTER BANK BASED LONG-TERM AVERAGE SPECTRA FOR HYPERNASALITY DETECTION AND ASSESSMENT IN CLEFT LIP AND PALATE SPEECH |
1840 | SINGLE-CHANNEL SPEECH SEPARATION INTEGRATING PITCH INFORMATION BASED ON A MULTI TASK LEARNING FRAMEWORK |
1296 | SINGLE-SHOT REAL-TIME MULTIPLE-PATH TIME-OF-FLIGHT DEPTH IMAGING FOR MULTI-APERTURE AND MACRO-PIXEL SENSORS |
4658 | SketchPPNet: A Joint Pixel and Point Convolutional Neural Network for Low Resolution Sketch Image Recognition |
5726 | SKINAUGMENT: AUTO-ENCODING SPEAKER CONVERSIONS FOR AUTOMATIC SPEECH TRANSLATION |
6068 | SLEPIAN-BANGS FORMULA AND CRAMER-RAO BOUND FOR CIRCULAR AND NON-CIRCULAR COMPLEX ELLIPTICAL SYMMETRIC DISTRIBUTIONS |
1347 | SliceNet: Slice-Wise 3D Shapes Reconstruction from Single Image |
5055 | SLOGD: SPEAKER LOCATION GUIDED DEFLATION APPROACH TO SPEECH SEPARATION |
4280 | Slow-Time MIMO-FMCW Automotive Radar Detection with Imperfect Waveform Separation |
3965 | SMALL ENERGY MASKING FOR IMPROVED NEURAL NETWORK TRAINING FOR END-TO-END SPEECH RECOGNITION |
4059 | SMALL-FOOTPRINT KEYWORD SPOTTING ON RAW AUDIO DATA WITH SINC-CONVOLUTIONS |
1714 | SMOOTHING GRAPH SIGNALS VIA RANDOM SPANNING FORESTS |
1204 | SNDCNN: SELF-NORMALIZING DEEP CNNs WITH SCALED EXPONENTIAL LINEAR UNITS FOR SPEECH RECOGNITION |
1572 | SNORER DIARISATION BASED ON DEEP NEURAL NETWORK EMBEDDINGS |
5877 | SOCIAL DATA ASSISTED MULTI-MODAL VIDEO ANALYSIS FOR SALIENCY DETECTION |
3122 | SOCIAL LEARNING WITH PARTIAL INFORMATION SHARING |
4586 | SOFT-OUTPUT FINITE ALPHABET EQUALIZATION FOR MMWAVE MASSIVE MIMO |
4192 | SOLVING MISSING-ANNOTATION OBJECT DETECTION WITH BACKGROUND RECALIBRATION LOSS |
3970 | SOLVING NON-CONVEX NON-DIFFERENTIABLE MIN-MAX GAMES USING PROXIMAL GRADIENT METHOD |
2348 | SOME ALTERNATING DIRECTION METHODS OF MULTIPLIERS REVISITED FOR CONSTRAINED TOTAL VARIATION MINIMIZATION |
2552 | Sound Event Detection By Multitask Learning of Sound Events and Scenes with Soft Scene Labels |
5266 | SOUND EVENT DETECTION IN SYNTHETIC DOMESTIC ENVIRONMENTS |
4901 | SOUND EVENT DETECTION VIA DILATED CONVOLUTIONAL RECURRENT NEURAL NETWORKS |
4972 | SOUND EVENT LOCALIZATION BASED ON SOUND INTENSITY VECTOR REFINED BY DNN-BASED DENOISING AND SOURCE SEPARATION |
3691 | Sound texture synthesis using RI spectrograms |
3588 | SOURCE CODING OF AUDIO SIGNALS WITH A GENERATIVE MODEL |
3916 | SOURCE DOMAIN DATA SELECTION FOR IMPROVED TRANSFER LEARNING TARGETING DYSARTHRIC SPEECH RECOGNITION |
3219 | SOURCE ENUMERATION VIA TOEPLITZ MATRIX COMPLETION |
1996 | SOURCE SEPARATION WITH WEAKLY LABELLED DATA: A SOLUTION TO COMPUTATIONAL AUDITORY SCENE ANALYSIS |
4707 | SPACE FILLING CURVES FOR MRI SAMPLING |
2814 | SPARSE BEAMSPACE EQUALIZATION FOR MASSIVE MU-MIMO MMWAVE SYSTEMS |
2238 | SPARSE BRANCH AND BOUND FOR EXACT OPTIMIZATION OF L0-NORM PENALIZED LEAST SQUARES |
2551 | SPARSE CONVOLUTIONAL BEAMFORMING FOR WIRELESS ULTRASOUND |
5575 | SPARSE CSP ALGORITHM VIA JOINT SPATIO-TEMPORAL FILTERING |
6078 | SPARSE DATA INTERPOLATION USING THE GEODESIC DISTANCE AFFINITY SPACE |
2069 | Sparse Directed Graph Learning for Head Movement Prediction in 360 Video Streaming |
4198 | SPARSE LOW-REDUNDANCY LINEAR ARRAY WITH UNIFORM SUM CO-ARRAY |
3328 | Sparse modeling on distributed encryption data |
4189 | SPARSE RECOVERY WITH NON-LINEAR FOURIER FEATURES |
2174 | SPATIAL ACTIVE NOISE CONTROL BASED ON KERNEL INTERPOLATION WITH DIRECTIONAL WEIGHTING |
4097 | SPATIAL AND TEMPORAL SMOOTHING FOR COVARIANCE ESTIMATION IN SUPER-RESOLUTION ANGLE ESTIMATION IN AUTOMOTIVE RADARS |
4164 | SPATIAL ATTENTION FOR FAR-FIELD SPEECH RECOGNITION WITH DEEP BEAMFORMING NEURAL NETWORKS |
2170 | SPATIAL ATTENTIONAL BILINEAR 3D CONVOLUTIONAL NETWORK FOR VIDEO-BASED AUTISM SPECTRUM DISORDER DETECTION |
4222 | SPATIAL GATING STRATEGIES FOR GRAPH RECURRENT NEURAL NETWORKS |
3973 | SPATIALLY ADAPTIVE INTRA MODE PRE-SELECTION FOR ERP 360 VIDEO CODING |
1486 | SPATIALLY GUIDED INDEPENDENT VECTOR ANALYSIS |
6104 | SPATIAL-TEMPORAL CONTEXT-AWARE TRACKING |
5093 | SPATIAL-TEMPORAL FEATURE AGGREGATION NETWORK FOR VIDEO OBJECT DETECTION |
1989 | SPATIO-TEMPORAL AND GEOMETRY CONSTRAINED NETWORK FOR AUTOMOBILE VISUAL ODOMETRY |
3680 | SPEAKER ADAPTATION OF A MULTILINGUAL ACOUSTIC MODEL FOR CROSS-LANGUAGE SYNTHESIS |
5580 | SPEAKER AUGMENTATION FOR LOW RESOURCE SPEECH RECOGNITION |
4542 | SPEAKER DIARIZATION USING LATENT SPACE CLUSTERING IN GENERATIVE ADVERSARIAL NETWORK |
2097 | SPEAKER DIARIZATION WITH REGION PROPOSAL NETWORK |
4660 | SPEAKER DIARIZATION WITH SESSION-LEVEL SPEAKER EMBEDDING REFINEMENT USING GRAPH NEURAL NETWORKS |
1883 | SPEAKER EMBEDDINGS INCORPORATING ACOUSTIC CONDITIONS FOR DIARIZATION |
4389 | Speaker independence of neural vocoders and their effect on parametric resynthesis speech enhancement |
4570 | SPEAKER-AWARE TARGET SPEAKER ENHANCEMENT BY JOINTLY LEARNING WITH SPEAKER EMBEDDING EXTRACTION |
3853 | SPEAKER-AWARE TRAINING OF ATTENTION-BASED END-TO-END SPEECH RECOGNITION USING NEURAL SPEAKER EMBEDDINGS |
3508 | SPEAKERFILTER: DEEP LEARNING-BASED TARGET SPEAKER EXTRACTION USING ANCHOR SPEECH |
2270 | SPEAKER-INVARIANT AFFECTIVE REPRESENTATION LEARNING VIA ADVERSARIAL TRAINING |
2079 | SpecAugment on Large Scale Datasets |
1058 | SPECTROGRAM ANALYSIS VIA SELF-ATTENTION FOR REALIZING CROSS-MODEL VISUAL-AUDIO GENERATION |
3378 | SPECTROGRAMS FUSION WITH MINIMUM DIFFERENCE MASKS ESTIMATION FOR MONAURAL SPEECH DEREVERBERATION |
3269 | SPECTRUM ALLOCATION IN WIRELESS NETWORKS FOR CROWD LABELLING |
2081 | SPEECH BREATHING ESTIMATION USING DEEP LEARNING METHODS |
2215 | SPEECH EMOTION RECOGNITION WITH DUAL-SEQUENCE LSTM ARCHITECTURE |
4565 | SPEECH EMOTION RECOGNITION WITH LOCAL-GLOBAL AWARE DEEP REPRESENTATION LEARNING |
6136 | SPEECH ENHANCEMENT USING A TWO-STAGE NETWORK FOR AN EFFICIENT BOOSTING STRATEGY |
1846 | SPEECH ENHANCEMENT USING SELF-ADAPTATION AND MULTI-HEAD SELF-ATTENTION |
5059 | SPEECH INTELLIGIBILITY ENHANCEMENT BY EQUALIZATION FOR IN-CAR APPLICATIONS |
4186 | SPEECH RECOGNITION MODEL COMPRESSION |
2278 | SPEECH SENTIMENT ANALYSIS VIA PRE-TRAINED FEATURES FROM END-TO-END ASR MODELS |
4494 | Speech Synthesis using EEG |
4118 | Speech-Based Parameter Estimation of an Asymmetric Vocal Fold Oscillation Model and Its Application in Discriminating Vocal Fold Pathologies |
4288 | SPEECH-DRIVEN FACIAL ANIMATION USING POLYNOMIAL FUSION OF FEATURES |
4171 | SPEECH-TO-SINGING CONVERSION IN AN ENCODER-DECODER FRAMEWORK |
1590 | Spherical Large Intelligent Surfaces |
4025 | SPHERICAL VIDEO CODING WITH GEOMETRY AND REGION ADAPTIVE TRANSFORM DOMAIN TEMPORAL PREDICTION |
1854 | SPIDERnet: ATTENTION NETWORK FOR ONE-SHOT ANOMALY DETECTION IN SOUNDS |
4256 | SPIKING NEURAL NETWORKS TRAINED WITH BACKPROPAGATION FOR LOW POWER NEUROMORPHIC IMPLEMENTATION OF VOICE ACTIVITY DETECTION |
2786 | SPOKEN DOCUMENT RETRIEVAL LEVERAGING BERT-BASED MODELING AND QUERY REFORMULATION |
5250 | SPOKEN LANGUAGE ACQUISITION BASED ON REINFORCEMENT LEARNING AND WORD UNIT SEGMENTATION |
2971 | SRZOO: AN INTEGRATED REPOSITORY FOR SUPER-RESOLUTION USING DEEP LEARNING |
4044 | SSGD: SPARSITY-PROMOTING STOCHASTIC GRADIENT DESCENT ALGORITHM FOR UNBIASED DNN PRUNING |
1618 | SSTNET: DETECTING MANIPULATED FACES THROUGH SPATIAL, STEGANALYSIS AND TEMPORAL FEATURES |
3898 | STABILITY OF GRAPH NEURAL NETWORKS TO RELATIVE PERTURBATIONS |
4261 | STABILIZING MULTI-AGENT DEEP REINFORCEMENT LEARNING BY IMPLICITLY ESTIMATING OTHER AGENTS’ BEHAVIORS |
3085 | STABLE TRAINING OF DNN FOR SPEECH ENHANCEMENT BASED ON PERCEPTUALLY-MOTIVATED BLACK-BOX COST FUNCTION |
3303 | STAGED TRAINING STRATEGY AND MULTI-ACTIVATION FOR AUDIO TAGGING WITH NOISY AND SPARSE MULTI-LABEL DATA |
5223 | StarGAN for Emotional Speech Conversion: Validated by Data Augmentation of End-to-End Emotion Recognition |
6105 | STATE-AWARE ANTI-DRIFT OBJECT TRACKING |
4828 | STATE-BASED TRANSCRIPTION OF COMPONENTS OF CARNATIC MUSIC |
4789 | State-space Gaussian Process for Drift Estimation in Stochastic Differential Equations |
3142 | STATIC VISUAL SPATIAL PRIORS FOR DOA ESTIMATION |
1304 | STATISTICAL SIGNAL PROCESSING APPROACH FOR RAIN ESTIMATION BASED ON MEASUREMENTS FROM NETWORK MANAGEMENT SYSTEMS |
2937 | STATISTICS POOLING TIME DELAY NEURAL NETWORK BASED ON X-VECTOR FOR SPEAKER VERIFICATION |
5132 | Steepening Squared Error Function Helps Online Adaptation of Gaussian Scales |
1215 | Steganography and its Detection in JPEG Images Obtained with the “Trunc” Quantizer |
4763 | STOCHASTIC ADMM FOR BYZANTINE-ROBUST DISTRIBUTED LEARNING |
2663 | STOCHASTIC GEOMETRY PLANNING OF ELECTRIC VEHICLES CHARGING STATIONS |
3022 | STOCHASTIC GRAPH NEURAL NETWORKS |
5767 | STOCHASTIC ML ESTIMATION FOR HYPERSPECTRAL UNMIXING UNDER ENDMEMBER VARIABILITY AND NONLINEAR MODELS |
4053 | STOCHASTIC MULTI-SCALE AGGREGATION NETWORK FOR CROWD COUNTING |
4150 | STOCK MOVEMENT PREDICTION THAT INTEGRATES HETEROGENEOUS DATA SOURCES USING DILATED CAUSAL CONVOLUTION NETWORKS WITH ATTENTION |
5345 | STORING DIGITAL DATA INTO DNA: A COMPARATIVE STUDY OF QUATERNARY CODE CONSTRUCTION |
4297 | STRATEGIC ATTENTION LEARNING FOR MODALITY TRANSLATION |
3551 | STREAMING AUTOMATIC SPEECH RECOGNITION WITH THE TRANSFORMER MODEL |
5992 | STRUCTURAL SPARSIFICATION FOR FAR-FIELD SPEAKER RECOGNITION WITH GNA |
6143 | STRUCTURED AND UNSTRUCTURED OUTLIER IDENTIFICATION FOR ROBUST PCA: A FAST PARAMETER FREE ALGORITHM |
4849 | STRUCTURED CITATION TREND PREDICTION USING GRAPH NEURAL NETWORKS |
2377 | STRUCTURED SPARSE ATTENTION FOR END-TO-END AUTOMATIC SPEECH RECOGNITION |
5739 | STUDY OF CLOSED PHASE RESONANCE BANDWIDTHS FOR ORAL AND NASAL TRACTS USING ZERO TIME WINDOWING |
5257 | STUDY OF FORMANT MODIFICATION FOR CHILDREN ASR |
1983 | SUB-DIP: OPTIMIZATION ON A SUBSPACE WITH DEEP IMAGE PRIOR REGULARIZATION AND APPLICATION TO SUPERRESOLUTION |
1444 | Subject Transfer Framework Based on Source Selection and Semi-Supervised Style Transfer Mapping for sEMG Pattern Recognition |
3023 | SUBJECTIVE QUALITY ESTIMATION USING PESQ FOR HANDS-FREE TERMINALS |
4398 | Submodular Rank Aggregation on Score-based Permutations for Distributed Automatic Speech Recognition |
3879 | SUBSPACE-BASED SPEECH CORRELATION VECTOR ESTIMATION FOR SINGLE-MICROPHONE MULTI-FRAME MVDR FILTERING |
3134 | Superpixel Segmentation via Convolutional Neural Networks with Regularized Information Maximization |
4429 | SUPER-RESOLUTION OF 3D COLOR POINT CLOUDS VIA FAST GRAPH TOTAL VARIATION |
6152 | SUPER-RESOLUTION VIA IMAGE-ADAPTED DENOISING CNNS: INCORPORATING EXTERNAL AND INTERNAL LEARNING |
5547 | SUPER-RESOLUTION WITH NOISY MEASUREMENTS: RECONCILING UPPER AND LOWER BOUNDS |
1209 | SUPERVISED CANONICAL CORRELATION ANALYSIS OF DATA ON SYMMETRIC POSITIVE DEFINITE MANIFOLDS BY RIEMANNIAN DIMENSIONALITY REDUCTION |
4579 | SUPERVISED DEEP HASHING FOR EFFICIENT AUDIO EVENT RETRIEVAL |
1372 | SUPERVISED ENCODING FOR DISCRETE REPRESENTATION LEARNING |
3872 | SUPERVISED GRAPH REPRESENTATION LEARNING FOR MODELING THE RELATIONSHIP BETWEEN STRUCTURAL AND FUNCTIONAL BRAIN CONNECTIVITY |
3306 | Supervised online diarization with sample mean loss for multi-domain data |
6074 | SWIFT-LINK: A COMPRESSIVE BEAM ALIGNMENT ALGORITHM FOR PRACTICAL MMWAVE RADIOS |
6119 | SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech |
4953 | SYNCHRONOUS TRANSFORMERS FOR END-TO-END SPEECH RECOGNITION |
4765 | SYNTHESIZING ENGAGING MUSIC USING DYNAMIC MODELS OF STATISTICAL SURPRISAL |
4815 | SYNTHETIC CROWD AND PEDESTRIAN GENERATOR FOR DEEP LEARNING PROBLEMS |
3437 | SYNTHETIC DATA GENERATION THROUGH STATISTICAL EXPLOSION: IMPROVING CLASSIFICATION ACCURACY OF CORONARY ARTERY DISEASE USING PPG |
3592 | SYNTHETIC SPEECH REFERENCES FOR AUTOMATIC PATHOLOGICAL SPEECH INTELLIGIBILITY ASSESSMENT |
3125 | TACKLING REAL NOISY REVERBERANT MEETINGS WITH ALL-NEURAL SOURCE SEPARATION, COUNTING, AND DIARIZATION SYSTEM |
2637 | TALKER-INDEPENDENT SPEAKER SEPARATION IN REVERBERANT CONDITIONS |
1835 | TARGET PARAMETER ESTIMATION VIA ONE-BIT PMCW RADAR |
4985 | TASK-AWARE MEAN TEACHER METHOD FOR LARGE SCALE WEAKLY LABELED SEMI-SUPERVISED SOUND EVENT DETECTION |
1610 | TDMF: TASK-DRIVEN MULTILEVEL FRAMEWORK FOR END-TO-END SPEAKER VERIFICATION |
2415 | TEACHER-STUDENT TRAINING FOR ROBUST TACOTRON-BASED TTS |
3181 | TEACHING SIGNALS AND SYSTEMS - A FIRST COURSE IN SIGNAL PROCESSING |
3414 | TEMPORAL CODING IN SPIKING NEURAL NETWORKS WITH ALPHA SYNAPTIC FUNCTION |
1757 | Tensor Decomposition-based Beamspace ESPRIT Algorithm for Multidimensional Harmonic Retrieval |
4710 | TENSORFLOW AUDIO MODELS IN ESSENTIA |
4370 | Tensor-to-Vector Regression for Multi-channel Speech Enhancement based on Tensor-Train Network |
3997 | TEXCEPTION: A CHARACTER/WORD-LEVEL DEEP LEARNING MODEL FOR PHISHING URL DETECTION |
5335 | TEXT ADAPTATION FOR SPEAKER VERIFICATION WITH SPEAKER-TEXT FACTORIZED EMBEDDINGS |
5822 | TEXT-INDEPENDENT SPEAKER VERIFICATION WITH ADVERSARIAL LEARNING ON SHORT UTTERANCES |
1987 | Text-to-image synthesis method evaluation based on visual patterns |
1373 | T-GSA: TRANSFORMER WITH GAUSSIAN-WEIGHTED SELF-ATTENTION FOR SPEECH ENHANCEMENT |
3198 | THE COMPRESSED NESTED ARRAY FOR UNDERDETERMINED DOA ESTIMATION BY FOURTH-ORDER DIFFERENCE COARRAY |
4892 | THE DISCRETE STOCKWELL TRANSFORMS FOR INFINITE-LENGTH SIGNALS AND THEIR REAL-TIME IMPLEMENTATIONS |
4020 | THE EFFECT OF DATA AUGMENTATION ON CLASSIFICATION OF ATRIAL FIBRILLATION IN SHORT SINGLE-LEAD ECG SIGNALS USING DEEP NEURAL NETWORKS |
3260 | THE EFFECT OF POWER ALLOCATION ON VISIBLE LIGHT COMMUNICATION USING COMMERCIAL PHOSPHOR-CONVERTED LED LAMP FOR INDIRECT ILLUMINATION |
5779 | THE EMPIRICAL DUALITY GAP OF CONSTRAINED STATISTICAL LEARNING |
5894 | The FifthNet Chroma Extractor |
2020 | THE FRACTIONAL QUATERNION FOURIER NUMBER TRANSFORM |
4179 | THE GRAPHON FOURIER TRANSFORM |
2468 | THE MATCHED REASSIGNED CROSS-SPECTROGRAM FOR PHASE ESTIMATION |
5105 | THE OPEN BRANDS DATASET: UNIFIED BRAND DETECTION AND RECOGNITION AT SCALE |
5908 | The PICASSO algorithm for Bayesian localization via paired comparisons in a union of subspaces model |
2841 | THE PROCESSING OF MANDARIN CHINESE TONAL ALTERNATIONS IN CONTEXTS: AN EYE-TRACKING STUDY |
2602 | The Role of Annotation Fusion Methods in the Study of Human-Reported Emotion Experience During Music Listening |
5515 | THE RWTH ASR SYSTEM FOR TED-LIUM RELEASE 2: IMPROVING HYBRID HMM WITH SPECAUGMENT |
4611 | THE SOUND OF MY VOICE: SPEAKER REPRESENTATION LOSS FOR TARGET VOICE SEPARATION |
3896 | The SWAX Benchmark: Attacking Biometric Systems with Wax Figures |
1425 | THEORETICAL ANALYSIS OF MULTI-CARRIER AGILE PHASED ARRAY RADAR |
3102 | Theoretical Performance Bound of Uplink Channel Estimation Accuracy in Massive MIMO |
5574 | THIS DATASET DOES NOT EXIST: TRAINING MODELS FROM GENERATED IMAGES |
1184 | THRESHOLD-ADJUSTED ORB STRATEGIES WITH GENETIC ALGORITHM AND PROTECTIVE CLOSING STRATEGY ON TAIWAN FUTURES MARKET |
4005 | TIME DIFFERENCE OF ARRIVAL ESTIMATION FROM FREQUENCY-SLIDING GENERALIZED CROSS-CORRELATIONS USING CONVOLUTIONAL NEURAL NETWORKS |
3949 | TIME DOMAIN VELOCITY VECTOR FOR RETRACING THE MULTIPATH PROPAGATION |
3399 | TIME REVERSAL BASED ROBUST GESTURE RECOGNITION USING WIFI |
1819 | TIME-DOMAIN AUDIO SOURCE SEPARATION BASED ON WAVE-U-NET COMBINED WITH DISCRETE WAVELET TRANSFORM |
5046 | TIME-DOMAIN NEURAL NETWORK APPROACH FOR SPEECH BANDWIDTH EXTENSION |
1501 | TIME-FREQUENCY ANALYSIS OF UNIMODAL SENSORY PROCESSING IN AUTISM SPECTRUM DISORDER |
2221 | TIME-FREQUENCY FEATURE DECOMPOSITION BASED ON SOUND DURATION FOR ACOUSTIC SCENE CLASSIFICATION |
4028 | TIME-FREQUENCY LOSS FOR CNN BASED SPEECH SUPER-RESOLUTION |
3429 | TIME-PREDICTABLE SOFTWARE-DEFINED ARCHITECTURE WITH SDF-BASED COMPILER FLOW FOR 5G BASEBAND PROCESSING |
2141 | TIME-SCALE SYNTHESIS FOR LOCALLY STATIONARY SIGNALS |
6072 | TOA-BASED LOCALIZATION WITH NLOS MITIGATION VIA ROBUST MULTIDIMENSIONAL SIMILARITY ANALYSIS |
5021 | TOSO: STUDENT'S-T DISTRIBUTION AIDED ONE-STAGE ORIENTATION TARGET DETECTION IN REMOTE SENSING IMAGES |
4372 | TOWARD BETTER SPEAKER EMBEDDINGS: AUTOMATED COLLECTION OF SPEECH SAMPLES FROM UNKNOWN DISTINCT SPEAKERS |
1749 | TOWARDS A NEW UNDERSTANDING OF THE TRAINING OF NEURAL NETWORKS WITH MISLABELED TRAINING DATA |
3940 | TOWARDS AN EFFICIENT AND GENERAL FRAMEWORK OF ROBUST TRAINING FOR GRAPH NEURAL NETWORKS |
4112 | TOWARDS AN INTELLIGENT MICROSCOPE: ADAPTIVELY LEARNED ILLUMINATION FOR OPTIMAL SAMPLE CLASSIFICATION |
3002 | TOWARDS BLIND QUALITY ASSESSMENT OF CONCERT AUDIO RECORDINGS USING DEEP NEURAL NETWORKS |
4906 | TOWARDS DATA-EFFICIENT MODELING FOR WAKE WORD SPOTTING |
4982 | Towards Decoding Selective Attention from Single-Trial EEG Data in Cochlear Implant Users Based on Deep Neural Networks |
2870 | TOWARDS FAST AND ACCURATE STREAMING END-TO-END ASR |
4688 | TOWARDS HIGH-PERFORMANCE OBJECT DETECTION: TASK-SPECIFIC DESIGN CONSIDERING CLASSIFICATION AND LOCALIZATION SEPARATION |
5841 | TOWARDS LINKING THE LAKH AND IMSLP DATASETS |
5454 | Towards Multilingual Sign Language Recognition |
3296 | Towards Pose-invariant Lip-Reading |
4066 | TOWARDS REAL-TIME SINGLE-CHANNEL SINGING-VOICE SEPARATION WITH PRUNED MULTI-SCALED DENSENETS |
4793 | Towards Real-time, Multi-view Video Stereopsis |
5682 | TOWARDS UNSUPERVISED SPEECH RECOGNITION AND SYNTHESIS WITH QUANTIZED SPEECH REPRESENTATION LEARNING |
1433 | TRACE NORM GENERATIVE ADVERSARIAL NETWORKS FOR SENSOR GENERATION AND FEATURE EXTRACTION |
1925 | Tracing Network Evolution Using the PARAFAC2 Model |
5921 | TRACK-BEFORE-DETECT FOR SUB-NYQUIST RADAR |
4600 | TRACKING TO IMPROVE DETECTION QUALITY IN LIDAR FOR AUTONOMOUS DRIVING |
2026 | TRAINING A CODE-SWITCHING LANGUAGE MODEL WITH MONOLINGUAL DATA |
4084 | TRAINING ASR MODELS BY GENERATION OF CONTEXTUAL INFORMATION |
4475 | TRAINING DEEP SPIKING NEURAL NETWORKS FOR ENERGY-EFFICIENT NEUROMORPHIC COMPUTING |
4794 | Training Keyword Spotters with Limited and Synthesized Speech Data |
3514 | TRAINING LSTM FOR UNSUPERVISED ANOMALY DETECTION WITHOUT A PRIORI KNOWLEDGE |
2051 | TRAINING SPOKEN LANGUAGE UNDERSTANDING SYSTEMS WITH NON-PARALLEL SPEECH AND TEXT |
5118 | TRANSFER LEARNING FROM YOUTUBE SOUNDTRACKS TO TAG ARCTIC ECOACOUSTIC RECORDINGS |
4517 | TRANSFERABLE POLICIES FOR LARGE SCALE WIRELESS NETWORKS WITH GRAPH NEURAL NETWORKS |
4588 | TRANSFERRING NEURAL SPEECH WAVEFORM SYNTHESIZERS TO MUSICAL INSTRUMENT SOUNDS GENERATION |
4897 | Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss |
2702 | TRANSFORMER VAE: A HIERARCHICAL MODEL FOR STRUCTURE-AWARE AND INTERPRETABLE MUSIC REPRESENTATION LEARNING |
1805 | Transformer-based Acoustic Modeling for Hybrid Speech Recognition |
3518 | TRANSFORMER-BASED ONLINE CTC/ATTENTION END-TO-END SPEECH RECOGNITION ARCHITECTURE |
2269 | TRANSFORMER-BASED TEXT-TO-SPEECH WITH WEIGHTED FORCED ATTENTION |
5828 | TRANSFORMING SEISMOCARDIOGRAMS INTO ELECTROCARDIOGRAMS BY APPLYING CONVOLUTIONAL AUTOENCODERS |
1736 | TRANSLATION OF A HIGHER ORDER AMBISONICS SOUND SCENE BASED ON PARAMETRIC DECOMPOSITION |
1899 | TRANSMIT BEAMFORMING DESIGN WITH RECEIVED-INTERFERENCE POWER CONSTRAINTS: THE ZERO-FORCING RELAXATION |
2688 | TRANSMIT BEAMPATTERN SHAPING VIA WAVEFORM DESIGN IN COGNITIVE MIMO RADAR |
4848 | Trapezoidal Segment Sequencing: A Novel Approach for Fusion of Human-produced Continuous Annotations |
4726 | TREE OF SHAPES CUT FOR MATERIAL SEGMENTATION GUIDED BY A DESIGN |
2022 | Triggerless Random Interleaved Sampling |
4497 | TRILINGUAL SEMANTIC EMBEDDINGS OF VISUALLY GROUNDED SPEECH WITH SELF-ATTENTION MECHANISMS |
1748 | TRIPLET LOSS FEATURE AGGREGATION FOR SCALABLE HASH |
4433 | TRUTH-TO-ESTIMATE RATIO MASK: A POST-PROCESSING METHOD FOR SPEECH ENHANCEMENT DIRECT AT LOW SIGNAL-TO-NOISE RATIOS |
5826 | TS-FEN: PROBING FEATURE SELECTION STRATEGY FOR FACE ANTI-SPOOFING |
6063 | TWO-DIMENSIONAL DOA ESTIMATION FOR COPRIME PLANAR ARRAY: A COARRAY TENSOR-BASED SOLUTION |
4307 | TWO-ELEMENT BIOMIMETIC ANTENNA ARRAY DESIGN AND PERFORMANCE |
5282 | TWO-STEP ACOUSTIC MODEL ADAPTATION FOR DYSARTHRIC SPEECH RECOGNITION |
4400 | TWO-STEP SOUND SOURCE SEPARATION: TRAINING ON LEARNED LATENT TARGETS |
1741 | UNCERTAINTIES IN SHORT COMMERCIAL MICROWAVE LINKS FADING DUE TO RAIN |
5818 | Uncertainty Quantification for Remaining Useful Lifetime Prediction with Multi-channel Sensory Data |
5479 | UNDERWATER TRACKING BASED ON THE SUM-PRODUCT ALGORITHM ENHANCED BY A NEURAL NETWORK DETECTIONS CLASSIFIER |
3504 | UNet 3+: A full-scale connected unet for medical image segmentation |
4405 | UNIFIED SIGNAL COMPRESSION USING GENERATIVE ADVERSARIAL NETWORKS |
4264 | Universal Phone Recognition with a Multilingual Allophone System |
3456 | Unresolved Radar Targets Separation with Direct Extraction of Local Frequencies |
3080 | UNSEEN FACE PRESENTATION ATTACK DETECTION WITH HYPERSPHERE LOSS |
1667 | UNSUPERVISED AUTO-ENCODING MULTIPLE-OBJECT TRACKER FOR CONSTRAINT-CONSISTENT COMBINATORIAL PROBLEM |
4177 | UNSUPERVISED CHANGE DETECTION FOR MULTIMODAL REMOTE SENSING IMAGES VIA COUPLED DICTIONARY LEARNING AND SPARSE CODING |
4428 | UNSUPERVISED CONTENT-PRESERVED ADAPTATION NETWORK FOR CLASSIFICATION OF PULMONARY TEXTURES FROM DIFFERENT CT SCANNERS |
4631 | UNSUPERVISED DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION WITH SYMMETRIC ADAPTATION CONSISTENCY |
6070 | UNSUPERVISED ENSEMBLE CLASSIFICATION WITH CORRELATED DECISION AGENTS |
4669 | UNSUPERVISED FEATURE ENHANCEMENT FOR SPEAKER VERIFICATION |
3785 | UNSUPERVISED IMAGE-TO-IMAGE TRANSLATION VIA FAIR REPRESENTATION OF GENDER BIAS |
5179 | UNSUPERVISED KEY HAND SHAPE DISCOVERY OF SIGN LANGUAGE VIDEOS WITH CORRESPONDENCE SPARSE AUTOENCODERS |
2437 | Unsupervised Multiple Source Localization Using Relative Harmonic Coefficients |
5313 | UNSUPERVISED NEURAL MASK ESTIMATOR FOR GENERALIZED EIGEN-VALUE BEAMFORMING BASED ASR |
1234 | UNSUPERVISED PERSON RE-IDENTIFICATION USING MULTI-BRANCH FEATURE COMPENSATION NETWORK AND LINK-BASED CLUSTER DISSIMILARITY METRIC |
2719 | UNSUPERVISED PRE-TRAINING OF BIDIRECTIONAL SPEECH ENCODERS VIA MASKED RECONSTRUCTION |
3707 | Unsupervised pretraining transfers well across languages |
2571 | UNSUPERVISED SPEAKER ADAPTATION USING ATTENTION-BASED SPEAKER MEMORY FOR END-TO-END ASR |
3749 | Unsupervised Style and Content Separation by Minimizing Mutual Information for Speech Synthesis |
1212 | UNSUPERVISED TRAINING FOR DEEP SPEECH SOURCE SEPARATION WITH KULLBACK-LEIBLER DIVERGENCE BASED PROBABILISTIC LOSS FUNCTION |
5544 | UNSUPERVISED VARIATIONAL BAYESIAN KALMAN FILTERING FOR LARGE-DIMENSIONAL GAUSSIAN SYSTEMS |
5645 | UPGRADE METHODS FOR STRATIFIED SENSOR NETWORK SELF-CALIBRATION |
5231 | UPGRADING CRFS TO JRFS AND ITS BENEFITS TO SEQUENCE MODELING AND LABELING |
3891 | UPSCALING VECTOR APPROXIMATE MESSAGE PASSING |
3048 | URTIS: A SMALL 3D IMAGING SONAR SENSOR FOR ROBOTIC APPLICATIONS |
2234 | USING AUTOMATIC SPEECH RECOGNITION AND SPEECH SYNTHESIS TO IMPROVE THE INTELLIGIBILITY OF COCHLEAR IMPLANT USERS IN REVERBERANT LISTENING ENVIRONMENTS |
4253 | USING BLACK-BOX COMPRESSION ALGORITHMS FOR PHASE RETRIEVAL |
3564 | USING INTELLIGENT REFLECTING SURFACES FOR RANK IMPROVEMENT IN MIMO COMMUNICATIONS |
5530 | USING PANORAMIC VIDEOS FOR MULTI-PERSON LOCALIZATION AND TRACKING IN A 3D PANORAMIC COORDINATE |
4041 | Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation |
1411 | USING SEPARATE LOSSES FOR SPEECH AND NOISE IN MASK-BASED SPEECH ENHANCEMENT |
2044 | USING SPEECH SYNTHESIS TO TRAIN END-TO-END SPOKEN LANGUAGE UNDERSTANDING MODELS |
1484 | USING VAES AND NORMALIZING FLOWS FOR ONE-SHOT TEXT-TO-SPEECH SYNTHESIS OF EXPRESSIVE SPEECH |
2832 | USING X-VECTORS TO AUTOMATICALLY DETECT PARKINSON'S DISEASE FROM SPEECH |
5302 | UTTERANCE-LEVEL SEQUENTIAL MODELING FOR DEEP GAUSSIAN PROCESS BASED SPEECH SYNTHESIS USING SIMPLE RECURRENT UNIT |
2029 | VAMP with Vector-Valued Diagonalization |
3051 | VAPAR SYNTH - A VARIATIONAL PARAMETRIC MODEL FOR AUDIO SYNTHESIS |
5450 | VARIABLE BITRATE IMAGE COMPRESSION WITH QUALITY SCALING FACTORS |
2969 | VARIABLE METRIC PROXIMAL GRADIENT METHOD WITH DIAGONAL BARZILAI-BORWEIN STEPSIZE |
3974 | VARIABLE PROJECTION FOR MULTIPLE FREQUENCY ESTIMATION |
4759 | VARIATIONAL STUDENT: LEARNING COMPACT AND SPARSER NETWORKS IN KNOWLEDGE DISTILLATION FRAMEWORK |
3038 | VERSATILE VIDEO CODING AND SUPER-RESOLUTION FOR EFFICIENT DELIVERY OF 8K VIDEO WITH 4K BACKWARD-COMPATIBILITY |
2494 | VGGFOLEY: A LARGE-SCALE AUDIO-VISUAL DATASET |
5334 | VIDEO DEBLURRING VIA 3D CNN AND FOURIER ACCUMULATION LEARNING |
5588 | VIDEO FRAME INTERPOLATION VIA EXCEPTIONAL MOTION-AWARE SYNTHESIS |
1777 | Video Frame Interpolation via Residue Refinement |
2277 | Video Question Generation via Semantic Rich Cross-Modal Self-Attention Networks Learning |
5209 | VIEW-ANGLE INVARIANT OBJECT MONITORING WITHOUT IMAGE REGISTRATION |
2514 | VIMO: VITAL SIGN MONITORING USING COMMODITY MILLIMETER WAVE RADIO |
1713 | VISUALLY GUIDED SELF SUPERVISED LEARNING OF SPEECH REPRESENTATIONS |
4519 | VOCAL TRACT ARTICULATORY CONTOUR DETECTION IN REAL-TIME MAGNETIC RESONANCE IMAGES USING SPATIO-TEMPORAL CONTEXT |
6096 | VOICE ACTIVITY DETECTION FOR TRANSIENT NOISY ENVIRONMENT BASED ON DIFFUSION NETS |
4923 | VOICE BASED CLASSIFICATION OF PATIENTS WITH AMYOTROPHIC LATERAL SCLEROSIS, PARKINSON'S DISEASE AND HEALTHY CONTROLS WITH CNN-LSTM USING TRANSFER LEARNING |
5115 | VOICE CONVERSION WITH TRANSFORMER NETWORK |
5337 | VOICEAI SYSTEMS TO NIST SRE19 EVALUATION: ROBUST SPEAKER RECOGNITION ON CONVERSATIONAL TELEPHONE SPEECH |
3866 | VOLUME RECONSTRUCTION FOR LIGHT FIELD MICROSCOPY |
4657 | WAVEFFJORD: FFJORD-BASED VOCODER FOR STATISTICAL PARAMETRIC SPEECH SYNTHESIS |
4624 | WAWEnets: A No-Reference Convolutional Waveform-Based Approach to Estimating Narrowband and Wideband Speech Quality |
2510 | WEAKLY LABELLED AUDIO TAGGING VIA CONVOLUTIONAL NETWORKS WITH SPATIAL AND CHANNEL-WISE ATTENTION |
2148 | Weakly Supervised Crowd-Wise Attention for Robust Crowd Counting |
5160 | WEAKLY SUPERVISED SEGMENTATION GUIDED HAND POSE ESTIMATION DURING INTERACTION WITH UNKNOWN OBJECTS |
5150 | WEAKLY SUPERVISED SEMANTIC SEGMENTATION FOR REMOTE SENSING HYPERSPECTRAL IMAGING |
5216 | WEAKLY-SUPERVISED SOUND EVENT DETECTION WITH SELF-ATTENTION |
3068 | Weight Sharing and Deep Learning for Spectral data |
4195 | WEIGHTED GRADIENT CODING WITH LEVERAGE SCORE SAMPLING |
3515 | WEIGHTED KRYLOV-LEVENBERG-MARQUARDT METHOD FOR CANONICAL POLYADIC TENSOR DECOMPOSITION |
2155 | Weighted Null Vector Initialization and its Application to Phase Retrieval |
4228 | WHAMR!: NOISY AND REVERBERANT SINGLE-CHANNEL SPEECH SEPARATION |
1841 | WHAT DID YOUR ADVERSARY BELIEVE? OPTIMAL FILTERING AND SMOOTHING IN COUNTER-ADVERSARIAL AUTONOMOUS SYSTEMS |
5513 | WHAT DOES A NETWORK LAYER HEAR? ANALYZING HIDDEN REPRESENTATIONS OF END-TO-END ASR THROUGH SPEECH SYNTHESIS |
3443 | WHAT IS BEST FOR SPOKEN LANGUAGE UNDERSTANDING: SMALL BUT TASK-DEPENDANT EMBEDDINGS OR HUGE BUT OUT-OF-DOMAIN EMBEDDINGS? |
5164 | WHAT MAKES THE SOUND?: A DUAL-MODALITY INTERACTING NETWORK FOR AUDIO-VISUAL EVENT LOCALIZATION |
2979 | WHOSECOUGH: IN-THE-WILD COUGHER VERIFICATION USING MULTITASK LEARNING |
4866 | WIDEBAND CHANNEL TRACKING FOR MILLIMETER WAVE MASSIVE MIMO SYSTEMS WITH HYBRID BEAMFORMING RECEPTION |
3579 | WIDEBAND DIRECTION OF ARRIVAL ESTIMATION WITH SPARSE LINEAR ARRAYS |
5494 | WIND: WASSERSTEIN INCEPTION DISTANCE FOR EVALUATING GENERATIVE ADVERSARIAL NETWORK PERFORMANCE |
3614 | WIRTINGER FLOW ALGORITHMS FOR PHASE RETRIEVAL FROM BINARY MEASUREMENTS |
5881 | WITCHCRAFT: EFFICIENT PGD ATTACKS WITH RANDOM STEP SIZE |
4070 | Within-sample variability-invariant loss for robust speaker recognition under noisy environments |
1181 | XceptionTime: A Novel Deep Architecture based on Depthwise Separable Convolutions for Hand Gesture Classification |
5756 | XMU-TS SYSTEMS FOR NIST SRE19 CTS CHALLENGE |
5311 | XPSNR: A LOW-COMPLEXITY EXTENSION OF THE PERCEPTUALLY WEIGHTED PEAK SIGNAL-TO-NOISE RATIO FOR HIGH-RESOLUTION VIDEO QUALITY ASSESSMENT |
4167 | X-VECTORS MEET EMOTIONS: A STUDY ON DEPENDENCIES BETWEEN EMOTION AND SPEAKER RECOGNITION |
3810 | ZERO-CROSSING PRECODING WITH MAXIMUM DISTANCE TO THE DECISION THRESHOLD FOR CHANNELS WITH 1-BIT QUANTIZATION AND OVERSAMPLING |
2921 | ZERO-SHOT MULTI-SPEAKER TEXT-TO-SPEECH WITH STATE-OF-THE-ART NEURAL SPEAKER EMBEDDINGS |