A COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PERSIAN EXTRACTIVE SPEECH TO SPEECH SUMMARIZATION WITHOUT TRANSCRIPT


JAFARI HODA SADAT; HOMAYOUNPOUR MOHAMMAD MEHDI


Journal Paper

Paper Information

Journal: SIGNAL AND DATA PROCESSING | Year: 2018 | Volume: 14 | Issue: 4 (SERIAL 34) | Page(s): 143-157


Title

A COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PERSIAN EXTRACTIVE SPEECH TO SPEECH SUMMARIZATION WITHOUT TRANSCRIPT

Author(s)

JAFARI HODA SADAT | HOMAYOUNPOUR MOHAMMAD MEHDI

Keywords

EXTRACTIVE SPEECH SUMMARIZATION

SPEECH SIGNAL

KEY PATTERNS

S-DTW ALGORITHM

MACHINE LEARNING

Abstract

In this paper, extractive speech summarization using different machine learning algorithms is investigated. Speech summarization extracts the important and salient segments of a recording so that speech files can be accessed, searched and browsed more easily and at lower cost. A new method for speech summarization that does not use an automatic speech recognition (ASR) system is proposed. ASR systems usually have high error rates, especially in adverse acoustic environments and for low-resource languages. Our goal was to answer the question: is it possible to summarize Persian speech without ASR, using little or no training data? We propose a method that discovers salient parts directly from the speech signal by using a semi-supervised algorithm. The proposed algorithm consists of three main stages: feature extraction, identification of key patterns, and selection of important sentences. First, the speech was segmented manually into sentences to eliminate sentence segmentation errors, which allows a fairer comparison between the different summarization methods. Then features were extracted from each sentence, such as the sentence duration and whether the sentence is the first or last one in the speech. In addition, repetitive patterns between each pair of sentences are discovered directly from the speech signal with the S-DTW algorithm, which finds repetitive patterns between two speech signals using MFCC features. From the repetitive patterns found for each pair of sentences, a similarity matrix is built.
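To make the idea of a sentence-level similarity matrix concrete, the following is a minimal sketch and not the authors' implementation. It uses plain dynamic time warping over whole sentences as a simplified stand-in for S-DTW, which instead searches for matching sub-segments. The function names, the length normalization and the mapping 1 / (1 + distance) from distance to similarity are illustrative assumptions; each sentence is assumed to be a non-empty list of per-frame feature vectors such as MFCCs, computed elsewhere.

```python
import math


def frame_distance(u, v):
    """Euclidean distance between two feature vectors (e.g. MFCC frames)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(u, v)))


def dtw_distance(a, b):
    """Length-normalized DTW alignment cost between two frame sequences.

    Plain whole-sequence DTW, used here as a simplified stand-in for S-DTW.
    Both sequences are assumed to be non-empty.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # acc[i][j] = cheapest cost of aligning a[:i] with b[:j].
    acc = [[inf] * (m + 1) for _ in range(n + 1)]
    acc[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            cost = frame_distance(a[i - 1], b[j - 1])
            acc[i][j] = cost + min(acc[i - 1][j], acc[i][j - 1], acc[i - 1][j - 1])
    return acc[n][m] / (n + m)


def similarity_matrix(sentences):
    """Symmetric matrix of pairwise similarities in (0, 1].

    The mapping 1 / (1 + distance) is an illustrative choice, not taken
    from the paper.
    """
    k = len(sentences)
    sim = [[1.0] * k for _ in range(k)]
    for i in range(k):
        for j in range(i + 1, k):
            s = 1.0 / (1.0 + dtw_distance(sentences[i], sentences[j]))
            sim[i][j] = sim[j][i] = s
    return sim
```

In this sketch a sentence and a time-stretched copy of it get similarity 1.0, because DTW absorbs differences in speaking rate, while acoustically unrelated sentences score lower.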
Using this similarity matrix, the similarity between each pair of sentences can be measured and redundant sentences can be eliminated from the summary without the need for an ASR system. After finding the similarity between each pair of speech segments and extracting features from each segment, various machine learning algorithms, including unsupervised (MMR, TextRank), supervised (SVM, Naïve Bayes) and semi-supervised (self-training, co-training) algorithms, are used to extract the salient parts. Experiments were conducted on read Persian news. The results show that, with the semi-supervised co-training method and appropriate features, the performance of the speech summarization system on the read Persian news corpus improves by about 3% compared to selecting the first sentences and by 5% compared to selecting the longest sentences, when ROUGE-3 is used as the evaluation measure.
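As an illustration of how a similarity matrix can be used to drop redundant sentences, here is a minimal sketch of greedy MMR (maximal marginal relevance) selection, one of the unsupervised methods named in the abstract. It is not the authors' implementation: the function name, the per-sentence relevance scores, the trade-off weight lam and the example values are all hypothetical.

```python
def mmr_select(relevance, sim, k, lam=0.7):
    """Greedily pick up to k sentence indices by maximal marginal relevance.

    relevance: per-sentence importance scores (hypothetical; how they are
               computed is outside this sketch).
    sim:       square matrix of pairwise sentence similarities.
    lam:       weight between relevance (lam) and redundancy (1 - lam).
    """
    selected = []
    candidates = list(range(len(relevance)))
    while candidates and len(selected) < k:

        def score(i):
            # Redundancy is the highest similarity to any sentence already chosen.
            redundancy = max((sim[i][j] for j in selected), default=0.0)
            return lam * relevance[i] - (1.0 - lam) * redundancy

        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


# Sentences 0 and 1 are near-duplicates; sentence 2 is less relevant but distinct.
relevance = [0.9, 0.85, 0.3]
sim = [
    [1.0, 0.95, 0.1],
    [0.95, 1.0, 0.1],
    [0.1, 0.1, 1.0],
]
summary = mmr_select(relevance, sim, k=2, lam=0.5)
```

With lam=0.5 the second pick skips the near-duplicate sentence 1 in favor of sentence 2, while lam=1.0 ignores redundancy and ranks by relevance alone.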


Cite

APA:

JAFARI, HODA SADAT, & HOMAYOUNPOUR, MOHAMMAD MEHDI. (2018). A COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PERSIAN EXTRACTIVE SPEECH TO SPEECH SUMMARIZATION WITHOUT TRANSCRIPT. SIGNAL AND DATA PROCESSING, 14(4 (SERIAL 34)), 143-157. SID. https://sid.ir/paper/160838/en

Vancouver:

JAFARI HODA SADAT, HOMAYOUNPOUR MOHAMMAD MEHDI. A COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PERSIAN EXTRACTIVE SPEECH TO SPEECH SUMMARIZATION WITHOUT TRANSCRIPT. SIGNAL AND DATA PROCESSING [Internet]. 2018;14(4 (SERIAL 34)):143-157. Available from: https://sid.ir/paper/160838/en

IEEE:

HODA SADAT JAFARI and MOHAMMAD MEHDI HOMAYOUNPOUR, “A COMPARISON OF MACHINE LEARNING TECHNIQUES FOR PERSIAN EXTRACTIVE SPEECH TO SPEECH SUMMARIZATION WITHOUT TRANSCRIPT,” SIGNAL AND DATA PROCESSING, vol. 14, no. 4 (SERIAL 34), pp. 143–157, 2018, [Online]. Available: https://sid.ir/paper/160838/en
