CN109979257A - A method of partition operation is carried out based on reading English auto-scoring and is precisely corrected - Google Patents
A method of partition operation is carried out based on reading English auto-scoring and is precisely corrected Download PDFInfo
- Publication number
- CN109979257A CN109979257A CN201910346958.9A CN201910346958A CN109979257A CN 109979257 A CN109979257 A CN 109979257A CN 201910346958 A CN201910346958 A CN 201910346958A CN 109979257 A CN109979257 A CN 109979257A
- Authority
- CN
- China
- Prior art keywords
- score
- word
- record
- english
- letter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/04—Electrically-operated educational appliances with audible presentation of the material to be studied
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B7/00—Electrically-operated teaching apparatus or devices working with questions and answers
- G09B7/02—Electrically-operated teaching apparatus or devices working with questions and answers of the type wherein the student is expected to construct an answer to the question which is presented or wherein the machine gives an answer to the question presented by a student
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- General Physics & Mathematics (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
一种基于英语朗读自动打分进行分拆运算精准矫正的方法,本发明以HMM后验概率算法为打分基础,对英语朗读语音进行整句、单词、音节音素三个层次级别运算分析,从英语朗读的整句语音获得语音特征,和标准参考模型比对打分,对分值不合格的整句语音进行分拆运算分析,获得朗读错误的单词,进而对单词发音进行音素、音节层面的分拆,精准定位发音错误的位置,同时检索出标准发音及相关知识点供用户矫正学习,让用户快速掌握英语朗读的要领。
A method for accurate correction of split operation based on automatic scoring of English reading aloud. The present invention uses HMM posterior probability algorithm as the scoring basis, and carries out three-level operation analysis on English reading speech at three levels: whole sentence, word, and syllable and phoneme, and reads aloud from English. The whole sentence speech obtained by the system obtains the phonetic features, compares and scores with the standard reference model, conducts split operation analysis on the speech of the whole sentence with unqualified scores, obtains the words that are read aloud, and then separates the pronunciation of the word at the phoneme and syllable levels. Accurately locate the location of pronunciation errors, and at the same time retrieve the standard pronunciation and related knowledge points for users to correct and learn, allowing users to quickly master the essentials of English reading.
Description
技术领域technical field
本发明涉及语音识朗读打分技术领域,尤其涉及一种基于英语朗读自动打分进行分拆运算精准矫正学习的方法。The invention relates to the technical field of speech recognition, reading, and scoring, in particular to a method for accurate correction and learning of split operation based on automatic scoring of English reading.
背景技术Background technique
我国语音识别技术的研究近年来发展迅猛,语音识别研究的水平基本和国际同步,在某些细分领域具有了我国的优势,达到国际先进水平,但在英语朗读自动化打分测评这一语音识别细分领域的研究就显得相对滞后,通过国知局的专利检索和分析官网,分别检索“语音and打分”及“英语朗读”、“口语”、“英语”等词的各种组合,所得的结果寥寥无几,其中具有代表性、语音朗读打分方面具有代表性的技术方案有:CN200810226674一种用于口语测试的文本朗读水平自动评估诊断方法、CN201711200048 一种文本相关的英语口语发音错误检测与质量评分方法、CN201811030689一种研究生英语口语教学语音自动评估平台的实现方法等等,这些技术方案提供了不少切实有效自动评估和打分的技术方案,但现实中,人们在英语练习时候,不但需要了解现有的朗读水平,更需要了解英语朗读中的错误所在,学习掌握相关的知识点,通过学习高质量表述的知识点进行高效的学习,迅速纠正英语朗读中错误的习惯,提高自身的英语口语的朗读水平。而现有技术鲜见在这方面高效、便捷、系统、实用的技术方案。The research on speech recognition technology in China has developed rapidly in recent years. The level of speech recognition research is basically synchronized with the international level. In some sub-fields, it has the advantages of China and has reached the international advanced level. The research in different fields seems to be relatively lagging behind. Through the official website of the State Intellectual Property Office for patent search and analysis, various combinations of words such as "voice and scoring" and "English reading", "spoken language" and "English" were searched, and the results obtained were obtained. There are very few, and the representative technical solutions in terms of speech reading and scoring include: CN200810226674 A text-reading level automatic evaluation and diagnosis method for oral language testing, CN201711200048 A text-related English oral pronunciation error detection and quality scoring Methods, CN201811030689 A realization method of an automatic evaluation platform for postgraduate oral English teaching, etc. These technical solutions provide many practical and effective technical solutions for automatic evaluation and scoring, but in reality, when people practice English, not only need to understand the current situation. For some reading levels, it is more necessary to understand the mistakes in English reading, learn to master the relevant knowledge points, learn efficiently by learning the knowledge points expressed in high quality, quickly correct the wrong habits in English reading, and improve their spoken English. Reading level. However, in the prior art, there are few efficient, convenient, systematic and practical technical solutions in this regard.
发明内容SUMMARY OF THE INVENTION
鉴于背景技术所述的问题,本发明以HMM后验概率算法为打分基础,对英语朗读语音进行整句、单词、音节音素三个层次运算分析,先从英语朗读的整句语音获得语音特征,和标准参考模型比对打分,对分值不合格的整句语音进行分拆运算分析,获得朗读错误的单词,进而对单词发音进行音素、音节的分拆,精准定位发音错误的位置, 并分别建立音素特定组合与发音规律相关的语音特征基准库。将英语发音专业本身的知识要素融合到英语朗读打分测试练习中,检索出标准发音及相关知识点供用户矫正学习,快速掌握朗读知识。In view of the problems described in the background art, the present invention takes the HMM posterior probability algorithm as the scoring basis, and carries out three-level operation analysis on the whole sentence, word, and syllable phoneme of the English reading speech, and first obtains the phonetic features from the whole sentence pronunciation of the English reading, Compare and score with the standard reference model, split and analyze the speech of the whole sentence with unqualified scores, obtain words that are pronounced incorrectly, and then divide the pronunciation of words into phonemes and syllables to accurately locate the wrong pronunciation. Establish a speech feature benchmark library related to specific phoneme combinations and pronunciation laws. The knowledge elements of the English pronunciation major are integrated into the English reading and scoring test exercises, and the standard pronunciation and related knowledge points are retrieved for users to correct and learn, and quickly master the reading knowledge.
英语基于文本朗读句子打分,学员对英语文本朗读语音的一组观察序列y=(,, ,…..) ,标准参考模型中多组状态序列s = ( , , ,…..),那么模型s产生观察序列y 的概率为 ,解码过程中运用Viterbi算法,将音素对齐后,选择最可能与观察序列y 对应的状态序列S ,由此计算得到基于隐马尔可夫统计模型的对数后验概率的算法:音素 在第 i段语音每一帧下的后验概率计算法公式1:English is scored based on text-speaking sentences, and a set of observation sequences y=( , , ,… ) , in the standard reference model, multiple sets of state sequences s = ( , , ,… ), then the probability that the model s produces the observation sequence y is , the Viterbi algorithm is used in the decoding process, after aligning the phonemes, the state sequence S that is most likely to correspond to the observation sequence y is selected, and the algorithm of the logarithmic posterior probability based on the hidden Markov statistical model is calculated: phoneme The posterior probability calculation method formula 1 under each frame of the i-th speech:
取对数 然后累计叠加 就可以得到音素在第 i段时间点对应的语音段的对数后验概率打分计算公式2:Take the logarithm and accumulate it to get the phoneme The logarithmic posterior probability score calculation formula 2 of the speech segment corresponding to the i-th time point:
其中表示音素所对应的第i 段语音的起始时间、Z代表语音中因素总个数、 为给定音素 q下观察矢量的概率分布音素总数,这样包含所有音素段语音的对数后验概率的分数均值为公式3:in Represents a phoneme The corresponding start time of the i-th speech, Z represents the total number of factors in the speech, Observe the vector for a given phoneme q The probability distribution of the total number of phonemes, such that the fractional mean of the log-posterior probability of speech including all phoneme segments is Equation 3:
其中为第k个音素持续的帧数;通过上述计算分值和系统设定的一个达标标准分值比较大小确定朗读语音分值是否达标。in is the number of frames that the kth phoneme lasts; by comparing the above calculated score with a standard score set by the system to determine whether the reading voice score meets the standard.
本发明将语音朗读打分分为测试模式和练习测评模式,测试模式只对朗读语音进行测试打分,而练习测评模式时,朗读英语句子语音打分达标时,证明学员已经学会对该文本内容的朗读,因此无需做进一步的处理,直接进入下一条文本内容的朗读。The invention divides the speech reading and scoring into a test mode and a practice evaluation mode. The test mode only tests and scores the reading voice, and in the practice evaluation mode, when the pronunciation of reading an English sentence reaches the standard, it proves that the student has learned to read the text content. Therefore, no further processing is required, and the reading of the next text content is directly entered.
分值不达标情况下,必须运算分析发音错误精准单词或音节,让学员知道具体的错误所在,方能精准的学习纠正错误且信服语音朗读打分测评的结果,因此需要对整句语音进行分拆到音节、单词,因此需要对语音进行分拆识别单词语音段,音节语音段等运算。If the score does not meet the standard, it is necessary to calculate and analyze the pronunciation error and precise words or syllables, so that students can know the specific error, so that they can accurately learn to correct the error and be convinced of the results of the pronunciation reading score. Therefore, it is necessary to split the whole sentence. To syllables and words, it is necessary to split the speech to identify word speech segments, syllable speech segments and other operations.
整句语音分拆成单词,现有技术2016年Herman Kamper,Aren Jansen和SharonGoldwater提出的无监督贝叶斯模型,能将未标记的语音进行分割然后聚类成虚拟词组。该模型的错误率大约20%,,但对于英语朗读打分来说,还无法满足要求,英语朗读打分是一般是基于文本的,文本的范围局限在很小的范围,因此给句子语音分拆成单词音节提高了更为详实的基础,本发明创造了循环递推打分分拆识别法,首先将文本分拆成单词组,获得单词的标准语音及其声学特征等作为标准参考模型,先假设单词被朗读的时长为标准时长,在被测语音上依次分拆该时长的语音段,进行比对获得最高分值的语音段,然后进行向前向后的加减时长的修正,获得单词较为理想匹配的语音段。理想匹配的语音段。具体实施步骤:The whole sentence is split into words. The unsupervised Bayesian model proposed by Herman Kamper, Aren Jansen and Sharon Goldwater in the prior art in 2016 can segment unlabeled speech and then cluster it into virtual phrases. The error rate of this model is about 20%, but for English reading aloud scoring, it cannot meet the requirements. English reading aloud scoring is generally based on text, and the range of text is limited to a small range, so the sentence is divided into two parts. The word syllable improves a more detailed basis. The invention creates a cyclic recursive scoring and splitting recognition method. First, the text is split into word groups, and the standard pronunciation of the word and its acoustic characteristics are obtained as a standard reference model. The duration of the reading is the standard duration, and the speech segments of this duration are divided in turn on the tested speech, and the speech segment with the highest score is compared, and then the forward and backward addition and subtraction duration correction is performed to obtain words. matching speech segment. Ideally matched speech segments. Specific implementation steps:
步骤1、英语文本和汉字文本不同,英语文本text通过空格来分隔单词,因此通过split等函数,使用空格为识别子字符串界限的字符,将英语文本变成由各个单词组成的单词数组a,即a=Split(text);含有缩写符号'的连续字母组合看成一个单词。Step 1. English text and Chinese text are different. English text text is separated by spaces. Therefore, through functions such as split, spaces are used as characters that identify the boundaries of substrings, and English text is turned into a word array a composed of individual words. That is, a=Split(text); the continuous letter combination containing the abbreviation ' is regarded as a word.
步骤2、通过第三方的语音接口获得指定英语单词的语音,实施例:将英语文本post提交给百度语音开发平台的网址,获得返回mp3等格式的语音文件;(或通过文本转语音的引擎等获得特定文本单词的语音)。Step 2, obtain the voice of the specified English word through the voice interface of the third party, embodiment: submit the English text post to the website of the Baidu voice development platform, obtain the voice file of the format such as returning mp3; (or through the engine of text-to-speech etc. to get the speech of a specific text word).
步骤3、并通过预分析获得语音特征,转换成新的标准参考模型M,同时记录单词语音的时长S,并预先假设单词文本被测试朗读的时长S。In step 3, the speech features are obtained through pre-analysis, converted into a new standard reference model M, and the duration S of the word speech is recorded at the same time, and the duration S of the word text is assumed to be read aloud by test in advance.
步骤4、取被测试朗读语音中,起始时间1、结束时间为S这一区间为新的被测朗读语音,和步骤3中的M进行比对运算,通过公式1、公式2、公式3计算分值J,Step 4. Take the tested reading voice, the starting time 1 and the ending time are S. This interval is the new tested reading voice, and compare the operation with M in step 3, through formula 1, formula 2, formula 3 Calculate the score J,
步骤5、取被测试朗读语音中,依次将起始时间加1、结束时间加1这一区间为新的被测朗读语音组,直至结束时间等于原始被测语音的时长、分别和步骤3中的M进行比对运算,通过公式1、公式2、公式3计算分值。Step 5. Take the tested reading voice, add 1 to the starting time and add 1 to the ending time as a new tested reading voice group, until the ending time is equal to the duration of the original tested voice, respectively and step 3. The M of the comparison operation is performed, and the score is calculated by formula 1, formula 2, and formula 3.
步骤6、将步骤4和步骤5计算的分值进行比对,获得最大值的数值A,及和最大值相对应的起始时间T1和结束时间T2等参数。Step 6: Compare the scores calculated in Step 4 and Step 5 to obtain the maximum value A, and parameters such as the start time T1 and the end time T2 corresponding to the maximum value.
以上步骤基于单词朗读时长和标准时长相等,因此需要对结果进行优化矫正,本发明采用上述步骤6的起始时间和结束时间分别向上向下扩展,通过对比打分获得最优值来矫正单词语音的时间段及时长。具体实施如下紧接着的步骤:The above steps are based on the fact that the reading time of the word is equal to the standard time, so the results need to be optimized and corrected. The present invention uses the start time and end time of the above step 6 to expand upward and downward respectively, and obtain the optimal value by comparing and scoring to correct the pronunciation of the word. time period and length. The specific implementation is as follows:
步骤7、取被测试朗读语音中,依次将起始时间加T1循环减1、结束时间T2这一区间为新的被测朗读语音组,直到递减1的起始时间等于1,循环运算中所得语音段的声学特征和步骤3中的M进行比对运算获得分值,分值和步骤6中的A分值比对,如分值大于A,则将A的值设定为当前分值并将T1设定为当前分值对应的起始时间,分值小于A则跳出起始时间递减1的循环。Step 7. Take the tested reading voice, add T1 to the starting time and cyclically decrease 1, and the end time T2 is the new tested reading voice group, until the starting time decremented by 1 is equal to 1, and the cycle operation is obtained. The acoustic features of the speech segment are compared with the M in step 3 to obtain a score, and the score is compared with the A score in step 6. If the score is greater than A, the value of A is set as the current score and Set T1 as the starting time corresponding to the current score. If the score is less than A, the cycle of decreasing the starting time by 1 will jump out.
步骤8、依次将结束时间T2循环减1、起始时间T1这一区间为新的被测朗读语音组,直到递减1的结束时间等于T1,循环运算中所得语音段的声学特征和步骤3中的M进行比对运算获得分值,分值和步骤7中的A分值比对,如分值大于A,则将A的值设定为当前分值并将T2设定为当前分值对应的结束时间,分值小于A则跳出起始时间递减1的循环。Step 8. Decrease the end time T2 cyclically by 1 and start time T1 as the new tested speech group until the end time decremented by 1 is equal to T1. The acoustic characteristics of the obtained speech segment in the cyclic operation are the same as those in step 3. The M is compared to obtain the score, and the score is compared with the A score in step 7. If the score is greater than A, the value of A is set as the current score and T2 is set as the corresponding value of the current score. If the score is less than A, it will jump out of the cycle where the start time is decremented by 1.
步骤9、取被测试朗读语音中,依次将起始时间加T1循环加1、结束时间T2这一区间为新的被测朗读语音组,直到递增加1的起始时间等于T2,循环运算中所得语音段的声学特征和步骤3中的M进行比对运算获得分值,分值和步骤8中的A分值比对,如分值大于A,则将A的值设定为当前分值并将T1设定为当前分值对应的起始时间,分值小于A则跳出起始时间递减1的循环。Step 9. Take the tested reading voice, add 1 to the starting time and T1 cyclically, and add 1 to the ending time T2 as a new tested reading voice group, until the starting time that increases by 1 is equal to T2. The acoustic features of the obtained speech segment are compared with the M in step 3 to obtain a score, and the score is compared with the A score in step 8. If the score is greater than A, the value of A is set as the current score. And set T1 as the starting time corresponding to the current score. If the score is less than A, it will jump out of the cycle of decreasing the starting time by 1.
步骤10、依次将结束时间T2循环加1、起始时间T1这一区间为新的被测朗读语音组,直到递加1的结束时间等于原始被测语音整体的时长,循环运算中所得语音段的声学特征和步骤3中的M进行比对运算获得分值,分值和步骤9中的A分值比对,如分值大于A,则将A的值设定为当前分值并将T2设定为当前分值对应的结束时间,分值小于A则跳出起始时间递加1的循环。Step 10. Cycle the end time T2 by 1 and the start time T1 as the new tested speech group, until the end time incremented by 1 is equal to the overall duration of the original tested speech, and the speech segment obtained in the cyclic operation The acoustic features of the A are compared with the M in step 3 to obtain the score, and the score is compared with the A score in step 9. If the score is greater than A, the value of A is set as the current score and T2 Set as the end time corresponding to the current score. If the score is less than A, it will jump out of the cycle of incrementing the start time by 1.
步骤11、记录单词和通过上述步骤所得的在被朗读语音上对应的起始、结束时间及分值等数据,重复步骤2—10,获得步骤1分拆的所有单词在被朗读语音上对应的起始时间和结束时间,以及相应的分值,其中i下标为单词在文本句子中的序号。Step 11, record the word and the data such as the corresponding start, end time and score on the read voice obtained by the above-mentioned steps, repeat steps 2-10, obtain all the words split in step 1 corresponding to the read voice. start time and end time , and the corresponding score , where the i subscript is the sequence number of the word in the text sentence.
步骤12、单词分值低于系统设置的错误阀值,即定性为朗读发音不合格,则调出上述步骤中当前单词映射的文本,显示到特定的用户界面,提醒用户该单词发音错误,并设置播发点击功能标识链接到步骤2形成的单词语音位置,配置相应的程式让学员点击所述播发标记就可听到标准的单词语音。并对单词进行音素、音节级别的分拆分析。Step 12, the word score is lower than the error threshold set by the system, that is, it is characterized as unqualified reading and pronunciation, then call out the text of the current word mapping in the above steps, display it to a specific user interface, remind the user that the word is pronounced incorrectly, and Set the advertisement click function mark to link to the word pronunciation position formed in step 2, and configure the corresponding program so that students can click the advertisement mark to hear the standard word pronunciation. And split the words at the phoneme and syllable levels.
音素是构成音节的能被人们认知有区别特征的最小的语音片段,是从音质上划分出来的最小的线性的语音单位。依据音节里的发音动作分析,一个发音动作构成一个音素;英语国际音标共有48个音素,其中元音音素20个、辅音音素28个。英语字母共有26个,其中有元音字母5个、半元音字母2个、辅音字母19个;因此,将英语语音分拆到音节音素、单词的层面,能够找到英语语音最原始的核心标准,帮助用户特别是初学英语的用户精准快速掌握英语朗读的要素,解决学习英语朗读存在的问题。为了进一步帮助用户精准定位错误,纠正朗读发音,分拆到音素单词的语音片段进行运算分析,更能有效精准地帮助用户学习英语朗读,而现有技术根据对语音进行分帧加窗、解码、离散傅里变换等技术手段所得的音素只是一个基于概率的观察运算的结果,无法作为教材标准来应用,更无法匹配英语发音各种规则,本发明针对上述步骤12所述发音评分不合格单词word进一步进行拆分到音节、音素层面,帮组用户进行分析。具体实施如下:A phoneme is the smallest speech segment that can be recognized by people and has distinctive features that constitute a syllable, and it is the smallest linear unit of speech divided from the sound quality. According to the analysis of pronunciation actions in syllables, one pronunciation action constitutes a phoneme; there are 48 phonemes in the English International Phonetic Alphabet, including 20 vowel phonemes and 28 consonant phonemes. There are 26 English letters in total, including 5 vowels, 2 semi-vowels, and 19 consonants; therefore, the most primitive core standard of English pronunciation can be found by dividing English pronunciation into syllables, phonemes and words. , to help users, especially those who are new to English, accurately and quickly master the elements of English reading aloud, and solve the problems existing in learning English reading aloud. In order to further help users to accurately locate errors and correct the pronunciation of reading aloud, the speech fragments of phoneme words are divided for operation analysis, which can help users learn English reading aloud more effectively and accurately. The phoneme obtained by technical means such as discrete Fourier transform is only the result of a probability-based observation operation, which cannot be used as a textbook standard, and can not match various rules of English pronunciation. It is further divided into syllables and phonemes to help group users analyze. The specific implementation is as follows:
S1、单词word中根据字节分拆出字母,实施例:使用MID函数,MID字符串函数,作用是从一个字符串中截取出指定数量的字符, MID(text, start_num, num_chars)text为需要被拆分的单词 start_num从左起第1位开始截取 num_chars从左起向右截取1个字符长度(用数字表达),通过从1到单词字符长度递增的循环,获得单词的字符数组X(len-1),len为单词字节长度。S1. The word word is split into letters according to the bytes. Example: use the MID function, the MID string function, the function is to cut out a specified number of characters from a string, and MID(text, start_num, num_chars) text is required The split word start_num is intercepted from the 1st position from the left, num_chars is intercepted by 1 character length from the left to the right (expressed in numbers), and the character array X(len -1), len is the word byte length.
S2、创建英语音素音标的知识库,其中包含国际音标48个音素,前元音/iː/、/ɪ/、/e/、/æ/;中元音.....;后元音.....;开合双元音:/eɪ/、/aɪ/、/ɔɪ/、/aʊ/、/əʊ/;......清辅音.......; 鼻音: /m/、/n/、/ŋ/;........。为每个音素记录设置相应的类别、朗读知识点、匹配的标准语音储存路径及其语音声学特征等数据库表格栏,记录知识点例如:/e/是单元音前元音,这个音标在英式音标中的符号是/e/,美式音标与之对应的发音符号是[ɛ],其发音具体的技巧有:1)嘴唇向两侧微微分开,上下齿之间大约可容纳一个小指头尖的距离;2)舌前部在发音过程中抬起,舌尖稍微接触下齿背;3)发音时下巴逐渐向下移动,震动声带,发出/e/音。注意:/e/个短元音,注意与/ɜː/、/ə/的区别,诸如此类知识点,其他的音素同样记录其相关的知识。S2. Create a knowledge base of English phoneme phonetic symbols, which contains 48 phonemes of the International Phonetic Alphabet, front vowels /iː/, /ɪ/, /e/, /æ/; middle vowels...; back vowels. ....; opening and closing diphthongs: /eɪ/, /aɪ/, /ɔɪ/, /aʊ/, /əʊ/;...voiceless consonants....; nasal: / m/, /n/, /ŋ/;......... For each phoneme record, set the corresponding category, reading knowledge point, matching standard voice storage path and its voice acoustic characteristics and other database table columns, record knowledge points such as: /e/ is the vowel before the unit sound, this phonetic symbol is in the British style The symbol in the phonetic symbol is /e/, and the corresponding pronunciation symbol of the American phonetic symbol is [ɛ]. The specific techniques for its pronunciation are: 1) The lips are slightly separated to the sides, and the upper and lower teeth can accommodate about a little finger tip. distance; 2) the front part of the tongue is raised during pronunciation, and the tip of the tongue touches the back of the lower teeth slightly; 3) the jaw gradually moves down during pronunciation, vibrates the vocal cords, and makes the /e/ sound. Note: /e/ is a short vowel, pay attention to the difference from /ɜː/, /ə/, and other knowledge points, other phonemes also record their related knowledge.
S3、创建关于音标、字母、字母组合对应关系的对照库,首先将英语发音规则里26个字母及常用的字母组合及其对应的音标增加到规则库所对应的数据库相应的表格,并增加诸如两个相同元音排列在一起,如oo、ee在foot、meet单词里只读一个音标等等特殊的字母组合,同时创建字母分类等栏,记载字母诸如元音、辅音等类别所属等信息,也就是说将英语常规知识英语音标和字母及字母组合对照的信息输入对照库。 S3. Create a comparison library about the correspondence between phonetic symbols, letters, and letter combinations. First, add the 26 letters and commonly used letter combinations and their corresponding phonetic symbols in the English pronunciation rules to the corresponding table of the database corresponding to the rule library, and add such as Two identical vowels are arranged together. For example, oo and ee can only read a special letter combination such as a phonetic symbol in the words foot and meet. At the same time, columns such as letter classification are created to record information such as the categories of letters such as vowels and consonants. That is to say, the information comparing English phonetic symbols and letters and letter combinations of English conventional knowledge is input into the comparison database.
S4、创建英语发音规则库,将各种英语发音规则整理出方便程式可以逻辑运算的表达形式,并按照可进行逻辑运算的方式进行分类:1、字符特性特征:含有这些特征关键词的即此记录的音标为对应字母或字母字母组合的默认发音,而这种特征关键词,通过以特征符号例如&符号相间隔组合在同一记录,例:开音节&重音;2、列举:将含有当前记录的字母或字母组合发音为当前记录的音标的单词或句子,列举到本记录,不同的单词或句子用特征符号隔开。S4. Create a library of English pronunciation rules, sort out various English pronunciation rules into expressions that are convenient for programs that can be logically operated, and classify them according to the way that logical operations can be performed: 1. Character characteristics: those that contain these characteristic keywords are here The recorded phonetic symbol is the default pronunciation of the corresponding letter or letter-letter combination, and this characteristic keyword is combined in the same record with characteristic symbols such as & symbols at intervals, for example: open syllable &accent; 2. Enumeration: will contain the current record The letter or combination of letters is pronounced as the word or sentence of the phonetic symbol of the current record, listed in this record, and different words or sentences are separated by characteristic symbols.
S5、基于单词末尾e不发音的英语发音规则,判定S1步骤得出的X(len-1)字符数组最后一位是否是e,如是则强制字符数组减去最后一位成员:len=len-1。S5. Based on the English pronunciation rule that the e at the end of the word is not pronounced, determine whether the last digit of the X(len-1) character array obtained in step S1 is e, and if so, force the character array to subtract the last member: len=len- 1.
S6、创建A、Z两个变量,并赋初始值A=0 。S6. Create two variables, A and Z, and assign the initial value A=0.
S7、如len减A的值大于或等于4时(常规字母组合最多是4个字母的组合tion,发音的音标为[ʃ])则Z=4,否则Z=len-A。将A重新赋值:A=A+1,在X(len-1)字符数组成员中取第A个至第Z个字符组合,组合起来到常用发音字母组合的对照库中检索,根据检索结果分别处理。S7. If the value of len minus A is greater than or equal to 4 (the conventional letter combination is a combination of 4 letters at most, and the phonetic symbol for pronunciation is [ʃ]), then Z=4, otherwise Z=len-A. Re-assign A: A=A+1, take the A-th to Z-th character combinations in the X(len-1) character array members, and combine them to search in the comparison library of commonly used pronunciation letter combinations. According to the search results, respectively deal with.
1、 当检索到多个记录则将当前组合的字符提交给S9步骤的自定义函数guizefunction(当前组合的字符,word,A,Z)嵌入运算;当A+Z>len直接执行S10步骤否则将A赋值:A=A+Z-1并重新开始本步骤。1. When multiple records are retrieved, submit the currently combined characters to the custom function guizefunction (currently combined characters, word, A, Z) of step S9 for embedded operation; when A+Z>len, execute step S10 directly, otherwise A assignment: A=A+Z-1 and restart this step.
2、 只检索到唯一记录则将这记录的音标、和A、Z值一起记录,当A+Z>len直接执行S10步骤否则对A重新赋值:A=A+Z-1并重新开始本步骤。2. If only the unique record is retrieved, record the phonetic symbol of this record together with the A and Z values. When A+Z>len, execute step S10 directly, otherwise reassign A: A=A+Z-1 and start this step again .
3、 没有检索到记录则进入下一步骤。3. If no record is retrieved, go to the next step.
S8、如当Z=1则跳转到上面S 7步骤,否则将Z赋值为Z-1,取其中第A至Z个字符,组合起来到常用发音字母组合的规则库中检索,,根据检索结果分别处理。S8, if Z=1, jump to the above S7 step, otherwise assign Z to Z-1, take the A to Z characters, and combine them to search in the rule base of commonly used pronunciation letter combinations, according to the retrieval The results are processed separately.
1、只检索到唯一记录则将这记录的音标、和A、Z值一起记录,将A重新赋值:A=A+Z-1并直接跳转执行S7步骤。1. If only the unique record is retrieved, record the phonetic symbol of this record together with the A and Z values, and reassign A: A=A+Z-1 and directly jump to step S7.
2、检索到多个记录则将当前组合的字符提交给S9步骤的自定义函数guizefunction(当前组合的字符,word,A,Z)嵌入运算;A=A+Z-1并直接跳转执行S7步骤。2. If multiple records are retrieved, submit the current combination of characters to the custom function guizefunction (currently combined characters, word, A, Z) in step S9 for embedded operation; A=A+Z-1 and directly jump to execute S7 step.
3、没有记录则重复开始本步骤,进行循环分析,直至Z值为1。3. If there is no record, start this step repeatedly and perform cyclic analysis until the Z value is 1.
S9、规则运算的自定义函数guizefunction(str,str1,Index1,index2),S9, the custom function of rule operation guizefunction (str, str1, Index1, index2),
a、在英语发音规则库中检索str字符串,并对检索中的记录中“列举”栏记录的内容是否包含当前的单词,如果包含则返回当前记录里音标作为本函数的结果,将音标、和Index1,index2值一起记录,并终止本函数的运算,没有则进行下一个记录检验;a. Retrieve the str string in the English pronunciation rule base, and check whether the content recorded in the "List" column of the retrieved record contains the current word. If it does, return the phonetic symbol in the current record as the result of this function. Record with the values of Index1 and index2, and terminate the operation of this function, if not, perform the next record check;
b、判定str在str1中其后一个字节位置字母是否是辅音字母,先设置两个字符变量tex、texx,如index1+index2+1>len(word)则tex=right(word,1) 否则tex=MID(str1,index1+index2+1, 1),如tex为“r”或“w”或“y”则texx赋值为“开音节”否则在规则库内检索字母或字母组合为tex的记录,将记录中“字母分类栏”的记录如包含“元音”则texx赋值为“开音节”,否则texx赋值为“闭音节”。b. Determine whether the letter in the next byte position of str in str1 is a consonant letter, first set two character variables tex, texx, such as index1+index2+1>len(word) then tex=right(word, 1) otherwise tex=MID(str1,index1+index2+1, 1), if tex is "r" or "w" or "y", then texx is assigned as "open syllable", otherwise it searches the rule base for letters or letter combinations that are tex Record, if the record in the "letter classification column" in the record contains "vowel", then texx is assigned as "open syllable", otherwise texx is assigned as "closed syllable".
C、在规则库中检索str字符,并逐条验证记录中特征关键词栏的内容是否存在texx的内容,如果有则将本记录对应的音标返回给本函数,将音标、和Index1,index2值一起记录,否则验证符合条件的下一条记录。C. Retrieve the str character in the rule base, and verify one by one whether the content of the characteristic keyword column in the record has the content of texx, if so, return the phonetic symbol corresponding to this record to this function, and combine the phonetic symbol with the values of Index1 and index2 record, otherwise verify the next record that meets the condition.
S10、将分拆出与英语文本单词、字母及字母组合及其对应的音素音标、检索到相关的知识点,显示到用户界面,让用户学习掌握。S10, splitting the English text words, letters and letter combinations and their corresponding phonemes and phonetic symbols, retrieving relevant knowledge points, and displaying them on the user interface, so that users can learn and master.
在实际英语朗读中,音素往往会因上下的音素而改变原有标准的发音,更多的以音节作为学习英语朗读发音的基本单位,因而将单词拆分成音节能够有效帮助用户掌握正确的朗读。In actual English reading, phonemes often change the original standard pronunciation due to the upper and lower phonemes, and more syllables are used as the basic unit for learning English reading pronunciation. Therefore, splitting words into syllables can effectively help users master the correct reading. .
根据英语单词发音规律:单词拆分成音节遵守的原则:1)一归后,指在两个音节之间,若有辅音字母,这个字母划归后一个音节,2)二分开,指在两个音节之间,若有两个辅音字母,则分别划归前后两个音节;易错点:元音和元音字母搞混.划分音节是按元音来划的,如果元音字母不发音,那就不能构成音节了.如果有两个元音字母在一起,但只发一个元音,仍然算一个音节。音节划分界限规律:元音音素是构成音节的主体,辅音是音节的分界线。每个元音音素都可以构成一个音节,,因此更为分拆到音节能更为精准的帮助用户学习英语朗读,英语单词音节分拆的实施:According to the pronunciation rules of English words: the principles to be followed by dividing words into syllables: 1) After the first return, it refers to between two syllables. If there is a consonant letter, the letter is assigned to the next syllable; Between two syllables, if there are two consonants, they will be assigned to the two syllables before and after respectively; error prone: vowels and vowels are confused. Syllables are divided according to vowels, if vowels are not pronounced , it cannot form a syllable. If there are two vowels together, but only one vowel is pronounced, it still counts as a syllable. The rules of syllable division: vowel phonemes are the main body of syllables, and consonants are the boundaries of syllables. Each vowel phoneme can form a syllable, so it can be divided into syllables to help users learn English reading aloud more accurately. The implementation of syllable separation of English words:
步骤1、通过以上的单词拆分成音素的技术方案记录的构成与英语单词文本字母及字母组合对应的一组音标: , , ,…..,及每个音标对应字母的起始位置: , ,,…..,结束位置: , , ,…..。Step 1, a group of phonetic symbols corresponding to English word text letters and letter combinations that are recorded by the technical scheme that the above words are split into phonemes: , , ,… , and the starting position of each phonetic symbol corresponding to the letter: , , ,… , ending at: , , ,… .
步骤2、分别在知识库内检索音标,获得一个或一组音标分类为元音的音素。Step 2. Search phonemes in the knowledge base respectively to obtain one or a group of phonemes classified as vowels by phonemes.
步骤3、根据步骤1音素对应的位置数值,依次使用mid等函数获得元音对应字母或字母组之间的字母,如只有一个字母,则将后面一个元音所对应的起始位置值减1,如两个字母则将前一个元音对应的结束位置值加1。Step 3. According to the position value corresponding to the phoneme in step 1, use functions such as mid in turn to obtain the letters between the letters corresponding to the vowels or the letter groups. If there is only one letter, then subtract 1 from the starting position value corresponding to the next vowel. , such as two letters, add 1 to the end position value corresponding to the previous vowel.
步骤4、通过步骤3获得一组音素及新的起始位置和结束位置的值,按照这组音素对应的起始位置和结束位置计算获得相对应的字母或字母组,输出到用户界面,作为单词分拆音节的结果。Step 4, obtain a group of phonemes and the values of the new starting position and ending position through step 3, calculate and obtain the corresponding letter or letter group according to the starting position and ending position corresponding to this group of phonemes, and output to the user interface as The result of word splitting syllables.
特别申明:在本说明书中所述的 “实施例”等,指的是结合该实施例描述的具体特征、要素或者特点包括在本申请概括性描述的实施例中。在说明书中多个地方出现同种表述并非限定特指的是同一个实施例。也就是说,结合任一实施例描述一个具体特征、要素或者特点时,所要主张的是结合其他实施例来实现这种特征、要素或者特点被包含于本发明申请保护的权利要求范围中; 实施例是参照本发明逻辑架构及思路的多个解释性实施例对本发明进行了描述,但本发明的保护范围并不局限于此,本领域技术人员在本发明技术方案框架下可以设计出很多其他的修改和实施方式,可以对技术方案的要点变换组合/或布局进行多种非本质性变型和改进,对于本领域技术人员来说,其他的用途也将是明显的,可轻易想到实施的非实质性变化或替换,这些修改和实施方式将落在本申请公开的原则范围和精神之内。It is specially stated that "embodiments" and the like described in this specification refer to the specific features, elements or characteristics described in conjunction with the embodiments being included in the embodiments generally described in this application. The appearance of the same expression in various places in the specification is not intended to be limiting in particular to the same embodiment. That is to say, when a specific feature, element or characteristic is described in conjunction with any embodiment, what is claimed is to realize such feature, element or characteristic in conjunction with other embodiments, and it is included in the scope of the claims protected by the present application; The present invention is described with reference to multiple explanatory embodiments of the present invention's logical architecture and ideas, but the protection scope of the present invention is not limited to this, and those skilled in the art can design many other Modifications and implementations, various non-essential modifications and improvements can be made to the key point transformation combination/or layout of the technical solution, and other uses will also be obvious to those skilled in the art, and it is easy to think of the implementation of non-essential modifications and implementations. Substantial changes or substitutions, such modifications and embodiments will fall within the scope and spirit of the principles disclosed herein.
附图说明Description of drawings
图1 为一种基于英语朗读自动打分进行分拆运算精准矫正学习的方法整体逻辑框架图。Figure 1 is the overall logical frame diagram of a method for accurate correction and learning based on automatic scoring of English reading aloud.
Claims (4)
Priority Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910346958.9A CN109979257B (en) | 2019-04-27 | 2019-04-27 | Method for performing accurate splitting operation correction based on English reading automatic scoring |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201910346958.9A CN109979257B (en) | 2019-04-27 | 2019-04-27 | Method for performing accurate splitting operation correction based on English reading automatic scoring |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN109979257A true CN109979257A (en) | 2019-07-05 |
| CN109979257B CN109979257B (en) | 2021-01-08 |
Family
ID=67086631
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201910346958.9A Active CN109979257B (en) | 2019-04-27 | 2019-04-27 | Method for performing accurate splitting operation correction based on English reading automatic scoring |
Country Status (1)
| Country | Link |
|---|---|
| CN (1) | CN109979257B (en) |
Cited By (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110889987A (en) * | 2019-12-16 | 2020-03-17 | 安徽必果科技有限公司 | Intelligent comment method for correcting spoken English |
| CN111402646A (en) * | 2020-03-27 | 2020-07-10 | 深圳小茜智能科技有限公司 | A kind of teaching equipment and its use method |
| CN113422825A (en) * | 2021-06-22 | 2021-09-21 | 读书郎教育科技有限公司 | System and method for assisting in culturing reading interests |
| CN113707178A (en) * | 2020-05-22 | 2021-11-26 | 苏州声通信息科技有限公司 | Audio evaluation method and device and non-transient storage medium |
| CN114120963A (en) * | 2021-11-25 | 2022-03-01 | 中国银行股份有限公司 | Method and device for synthesizing English dubbing, storage medium and electronic device |
| CN114245194A (en) * | 2021-12-23 | 2022-03-25 | 深圳市优必选科技股份有限公司 | Video teaching interaction method and device and electronic equipment |
| CN114495986A (en) * | 2021-12-31 | 2022-05-13 | 科大讯飞股份有限公司 | Pronunciation error detection method and device and voice scoring method and device |
| WO2022148176A1 (en) * | 2021-01-08 | 2022-07-14 | Ping An Technology (Shenzhen) Co., Ltd. | Method, device, and computer program product for english pronunciation assessment |
Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101739867A (en) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Method for scoring interpretation quality by using computer |
| CN101739868A (en) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Automatic evaluation and diagnosis method of text reading level for oral test |
| CN101739869A (en) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Priori knowledge-based pronunciation evaluation and diagnosis system |
| CN107103915A (en) * | 2016-02-18 | 2017-08-29 | 广州酷狗计算机科技有限公司 | A kind of audio data processing method and device |
| CN107958673A (en) * | 2017-11-28 | 2018-04-24 | 北京先声教育科技有限公司 | A kind of spoken language methods of marking and device |
| CN108428382A (en) * | 2018-02-14 | 2018-08-21 | 广东外语外贸大学 | It is a kind of spoken to repeat methods of marking and system |
| US20180277098A1 (en) * | 2017-03-23 | 2018-09-27 | Samsung Electronics Co., Ltd. | Training method and apparatus for speech recognition |
| CN109300339A (en) * | 2018-11-19 | 2019-02-01 | 王泓懿 | A kind of exercising method and system of Oral English Practice |
-
2019
- 2019-04-27 CN CN201910346958.9A patent/CN109979257B/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101739867A (en) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Method for scoring interpretation quality by using computer |
| CN101739868A (en) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Automatic evaluation and diagnosis method of text reading level for oral test |
| CN101739869A (en) * | 2008-11-19 | 2010-06-16 | 中国科学院自动化研究所 | Priori knowledge-based pronunciation evaluation and diagnosis system |
| CN107103915A (en) * | 2016-02-18 | 2017-08-29 | 广州酷狗计算机科技有限公司 | A kind of audio data processing method and device |
| US20180277098A1 (en) * | 2017-03-23 | 2018-09-27 | Samsung Electronics Co., Ltd. | Training method and apparatus for speech recognition |
| CN107958673A (en) * | 2017-11-28 | 2018-04-24 | 北京先声教育科技有限公司 | A kind of spoken language methods of marking and device |
| CN108428382A (en) * | 2018-02-14 | 2018-08-21 | 广东外语外贸大学 | It is a kind of spoken to repeat methods of marking and system |
| CN109300339A (en) * | 2018-11-19 | 2019-02-01 | 王泓懿 | A kind of exercising method and system of Oral English Practice |
Non-Patent Citations (2)
| Title |
|---|
| 吕军: "基于语音识别的汉语发音自动评分系统的设计与实现", 《数字化汉语教学的研究与应用》 * |
| 王娜: "汉语语音评分系统的设计与实现", 《绵阳师范学院学报》 * |
Cited By (10)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN110889987A (en) * | 2019-12-16 | 2020-03-17 | 安徽必果科技有限公司 | Intelligent comment method for correcting spoken English |
| CN111402646A (en) * | 2020-03-27 | 2020-07-10 | 深圳小茜智能科技有限公司 | A kind of teaching equipment and its use method |
| CN113707178A (en) * | 2020-05-22 | 2021-11-26 | 苏州声通信息科技有限公司 | Audio evaluation method and device and non-transient storage medium |
| CN113707178B (en) * | 2020-05-22 | 2024-02-06 | 苏州声通信息科技有限公司 | Audio evaluation method and device and non-transient storage medium |
| WO2022148176A1 (en) * | 2021-01-08 | 2022-07-14 | Ping An Technology (Shenzhen) Co., Ltd. | Method, device, and computer program product for english pronunciation assessment |
| CN113422825A (en) * | 2021-06-22 | 2021-09-21 | 读书郎教育科技有限公司 | System and method for assisting in culturing reading interests |
| CN113422825B (en) * | 2021-06-22 | 2022-11-08 | 读书郎教育科技有限公司 | System and method for assisting in culturing reading interests |
| CN114120963A (en) * | 2021-11-25 | 2022-03-01 | 中国银行股份有限公司 | Method and device for synthesizing English dubbing, storage medium and electronic device |
| CN114245194A (en) * | 2021-12-23 | 2022-03-25 | 深圳市优必选科技股份有限公司 | Video teaching interaction method and device and electronic equipment |
| CN114495986A (en) * | 2021-12-31 | 2022-05-13 | 科大讯飞股份有限公司 | Pronunciation error detection method and device and voice scoring method and device |
Also Published As
| Publication number | Publication date |
|---|---|
| CN109979257B (en) | 2021-01-08 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN109979257B (en) | Method for performing accurate splitting operation correction based on English reading automatic scoring | |
| US9711139B2 (en) | Method for building language model, speech recognition method and electronic apparatus | |
| US10235991B2 (en) | Hybrid phoneme, diphone, morpheme, and word-level deep neural networks | |
| CN106297800B (en) | A method and device for adaptive speech recognition | |
| US20090258333A1 (en) | Spoken language learning systems | |
| JPS61177493A (en) | Voice recognition | |
| CN106448288A (en) | Interactive English learning system and method | |
| CN105609107A (en) | Text processing method and device based on voice identification | |
| EP2891147B1 (en) | Method and system for predicting speech recognition performance using accuracy scores | |
| EP3613046A1 (en) | System and method for automatic speech analysis | |
| KR102794379B1 (en) | Learning data correction method and apparatus thereof using ensemble score | |
| CN111128181B (en) | Recitation question evaluating method, recitation question evaluating device and recitation question evaluating equipment | |
| Shokouhi et al. | Did you say U2 or YouTube? Inferring implicit transcripts from voice search logs | |
| CN106548787B (en) | Optimize the evaluating method and evaluating system of new word | |
| San et al. | Automated speech tools for helping communities process restricted-access corpora for language revival efforts | |
| CN111899581A (en) | Word spelling and reading exercise device and method for English teaching | |
| JP2004094257A (en) | Method and apparatus for generating question of decision tree for speech processing | |
| Bear et al. | Finding phonemes: improving machine lip-reading | |
| CN111429886B (en) | Voice recognition method and system | |
| JP5004863B2 (en) | Voice search apparatus and voice search method | |
| Azim et al. | Large vocabulary Arabic continuous speech recognition using tied states acoustic models | |
| CN119273512A (en) | Practice effect evaluation method of English learning simulation system based on artificial intelligence | |
| Díez et al. | Non-native speech corpora for the development of computer assisted pronunciation training systems | |
| CN115440193A (en) | Pronunciation evaluation scoring method based on deep learning | |
| Thaiprayoon et al. | Thaiqcor 2.0: Thai query correction via soundex and word approximation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PB01 | Publication | ||
| PB01 | Publication | ||
| SE01 | Entry into force of request for substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant | ||
| TR01 | Transfer of patent right |
Effective date of registration: 20251211 Address after: 361000 room 4101, No. 131, xinjingdongli, Haicang District, Xiamen City, Fujian Province Patentee after: Xiamen Trustworthy Technology Co.,Ltd. Country or region after: China Address before: 518052 Guangdong city of Shenzhen province Qianhai Shenzhen Hong Kong cooperation zone before Bay Road No. 1 building 201 room A Patentee before: SHENZHEN DIGITAL GALAXY TECHNOLOGY Co.,Ltd. Country or region before: China |