Jande, 2003 - Google Patents
Evaluating rules for phonological reduction in SwedishJande, 2003
View PDF- Document ID
- 17218694198626514
- Author
- Jande P
- Publication year
- Publication venue
- Proceedings of Fonetik
External Links
Snippet
4. Results In a recently completed experiment (Jande, 2003), fifteen subjects listened to pairs of stimuli, where both stimuli were synthetic readings of the same sentence, one in canonical form and one in reduced form. Each pair was presented with three different …
- 230000015572 biosynthetic process 0 abstract description 10
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
- G10L13/10—Prosody rules derived from text; Stress or intonation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/06—Elementary speech units used in speech synthesisers; Concatenation rules
- G10L13/07—Concatenation rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Aijun | Chinese prosody and prosodic labeling of spontaneous speech | |
| Levow | Context in multi-lingual tone and pitch accent recognition. | |
| Kuwabara | Acoustic properties of phonemes in continuous speech for different speaking rate | |
| Tan et al. | A Malay dialect translation and synthesis system: Proposal and preliminary system | |
| Jande | Evaluating rules for phonological reduction in Swedish | |
| Chung | Duration models and the perceptual evaluation of spoken Korean | |
| Youssef et al. | An Arabic TTS system based on the IBM trainable speech synthesizer | |
| Jande | Phonological reduction in Swedish | |
| Fackrell et al. | Prosodic variation with text type. | |
| Williams | A Welsh speech database: preliminary results. | |
| Hu et al. | Discourse prosody and its application to speech synthesis | |
| Olaszy et al. | Prosody generation for German CTS/TTS systems (from theoretical intonation patterns to practical realisation) | |
| Shahid et al. | Subjective testing of urdu text-to-speech (tts) system | |
| Kim et al. | Nn-kog2p: A novel grapheme-to-phoneme model for korean language | |
| Chung | Segment duration in spoken korean. | |
| Yokomizo et al. | Evaluation of prosodic contextual factors for HMM-based speech synthesis. | |
| Post | Sex differences in vocalic duration production in L1 and in L2 | |
| Iyanda et al. | Development of a yorúbà texttospeech system using festival | |
| Adeyemo et al. | Development and Integration of Text to Speech Usability Interface for Visually Impaired Users in Yoruba Language | |
| Mumtaz et al. | Stress annotated Urdu speech corpus to build female voice for TTS | |
| Uliniansyah et al. | Utilizing Indonesian Allophones and Intraword Short Pauses Handling to Improve Performance of Indonesian Text-To-Speech | |
| Gu et al. | A system framework for integrated synthesis of Mandarin, Min-nan, and Hakka speech | |
| Duan et al. | Comparison of syllable/phone hmm based mandarin tts | |
| Dessai et al. | Development of Konkani TTS system using concatenative synthesis | |
| Hansakunbuntheung et al. | Mongolian speech corpus for text-to-speech development |