1. <code id="md54j"></code>
      <big id="md54j"><em id="md54j"></em></big>

        <code id="md54j"><nobr id="md54j"><samp id="md54j"></samp></nobr></code>
        <dfn id="md54j"><option id="md54j"><sub id="md54j"></sub></option></dfn>
        1. <th id="md54j"></th>

          Mirror operated in collaboration with local support

          Sound

          Authors and titles for recent submissions

          [ total of 113 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-113 ]
          [ showing 25 entries per page: fewer | more | all ]

          Thu, 21 May 2020

          [1]  arXiv:2005.10228 [pdf, other]
          Title: Sparsity-based audio declipping methods: overview, new algorithms, and large-scale evaluation
          Authors: Clément Gaultier (PANAMA), Srđan Kitić, Rémi Gribonval (PANAMA, DANTE), Nancy Bertin (PANAMA)
          Comments: arXiv admin note: text overlap with arXiv:1711.11259
          Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS); Signal Processing (eess.SP)
          [2]  arXiv:2005.09966 [pdf, other]
          Title: SADDEL: Joint Speech Separation and Denoising Model based on Multitask Learning
          Comments: The two first authors made equal contributions
          Subjects: Sound (cs.SD); Audio and Speech Processing (eess.AS)
          [3]  arXiv:2005.10113 (cross-list from eess.AS) [pdf, other]
          Title: A Comparison of Label-Synchronous and Frame-Synchronous End-to-End Models for Speech Recognition
          Comments: 4 pages, 2 figures
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
          [4]  arXiv:2005.10089 (cross-list from eess.AS) [pdf, other]
          Title: Investigation of Large-Margin Softmax in Neural Language Modeling
          Comments: submitted to INTERSPEECH2020
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
          [5]  arXiv:2005.10049 (cross-list from eess.AS) [pdf, ps, other]
          Title: Early Stage LM Integration Using Local and Global Log-Linear Combination
          Comments: Submitted to Interspeech 2020
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
          [6]  arXiv:2005.09986 (cross-list from eess.AS) [pdf, other]
          Title: Evaluating Features and Metrics for High-Quality Simulation of Early Vocal Learning of Vowels
          Comments: Submitted to INTERSPEECH 2020
          Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
          [7]  arXiv:2005.09940 (cross-list from eess.AS) [pdf, other]
          Title: Relative Positional Encoding for Speech Recognition and Direct Translation
          Comments: Submitted to Interspeech 2020
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
          [8]  arXiv:2005.09921 (cross-list from eess.AS) [pdf, other]
          Title: End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
          [9]  arXiv:2005.09913 (cross-list from eess.AS) [pdf, other]
          Title: Statistical and Neural Network Based Speech Activity Detection in Non-Stationary Acoustic Environments
          Comments: Submitted to Interspeech 2020
          Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
          [10]  arXiv:2005.09873 (cross-list from eess.AS) [pdf, other]
          Title: Consistent ICA: Determined BSS meets spectrogram consistency
          Authors: Kohei Yatabe
          Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD); Signal Processing (eess.SP)
          [11]  arXiv:2005.09862 (cross-list from eess.AS) [pdf, other]
          Title: A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
          [12]  arXiv:2005.09843 (cross-list from eess.AS) [pdf, ps, other]
          Title: Jointly optimal denoising, dereverberation, and source separation
          Comments: Submitted to IEEE/ACM Trans. Audio, Speech, and Language Processing on 12 Feb 2020
          Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
          [13]  arXiv:2005.09834 (cross-list from cs.HC) [pdf, other]
          Title: Exploring Recurrent, Memory and Attention Based Architectures for Scoring Interactional Aspects of Human-Machine Text Dialog
          Subjects: Human-Computer Interaction (cs.HC); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
          [14]  arXiv:2005.09824 (cross-list from eess.AS) [pdf, other]
          Title: PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR
          Comments: Submtted to Interspeech 2020
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
          [15]  arXiv:2005.09768 (cross-list from eess.AS) [pdf, ps, other]
          Title: Perceptual similarity between piano notes: Simulations with a template-based perception model
          Subjects: Audio and Speech Processing (eess.AS); Sound (cs.SD)
          [16]  arXiv:2005.09756 (cross-list from eess.AS) [pdf, other]
          Title: Improving Proper Noun Recognition in End-to-End ASR By Customization of the MWER Loss Criterion
          Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD)
          [17]  arXiv:2005.09684 (cross-list from eess.AS) [pdf, other]
          Title: Exploring Transformers for Large-Scale Speech Recognition
          Comments: 5 pages, 1 figure
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)

          Wed, 20 May 2020 (showing first 8 of 12 entries)

          [18]  arXiv:2005.09242 [pdf, other]
          Title: Competitive Wakeup Scheme for Distributed Devices
          Comments: sumbitted to INTERSPEECH2020
          Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
          [19]  arXiv:2005.09238 [pdf, other]
          Title: A Lite Microphone Array Beamforming Scheme with Maximum Signal-to-Noise Ratio Filter
          Comments: submitted to INTERSPEECH2020
          Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
          [20]  arXiv:2005.09237 [pdf, other]
          Title: Acoustic Echo Cancellation by Combining Adaptive Digital Filter and Recurrent Neural Network
          Comments: submitted to INTERSPEECH2020
          Subjects: Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
          [21]  arXiv:2005.08944 [src]
          Title: Saving the Sonorine: Audio Recovery Using Image Processing and Computer Vision
          Authors: Kai Ji (Kevin) Feng, Adam Finkelstein
          Comments: Removing a co-author. The co-author did not contribute to the preparation of the manuscript, only background information and advice
          Subjects: Sound (cs.SD); Computer Vision and Pattern Recognition (cs.CV); Graphics (cs.GR); Audio and Speech Processing (eess.AS)
          [22]  arXiv:2005.09525 (cross-list from cs.CV) [pdf, other]
          Title: Toward Automated Classroom Observation: Multimodal Machine Learning to Estimate CLASS Positive Climate and Negative Climate
          Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS)
          [23]  arXiv:2005.09463 (cross-list from eess.AS) [pdf, other]
          Title: Learning Joint Articulatory-Acoustic Representations with Normalizing Flows
          Comments: 5 pages, 4 figures
          Subjects: Audio and Speech Processing (eess.AS); Machine Learning (cs.LG); Sound (cs.SD); Machine Learning (stat.ML)
          [24]  arXiv:2005.09394 (cross-list from eess.AS) [pdf, other]
          Title: Enhancing Monotonic Multihead Attention for Streaming ASR
          Subjects: Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Machine Learning (cs.LG); Sound (cs.SD)
          [25]  arXiv:2005.09310 (cross-list from cs.LG) [pdf, other]
          Title: Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition
          Subjects: Machine Learning (cs.LG); Sound (cs.SD); Audio and Speech Processing (eess.AS); Machine Learning (stat.ML)
          [ total of 113 entries: 1-25 | 26-50 | 51-75 | 76-100 | 101-113 ]
          [ showing 25 entries per page: fewer | more | all ]
          Ϸ