Video text detection plays an important role in applications such as semantic-based video analysis, text information retrieval, archiving and so on. In this paper, a Farsi/Arabic text detection approach is proposed. First, a three-level resolution pyramid of input image is created. Then, with an appropriate edge detector, edges are extracted and then by using edges cross points, artificial corners are extracted. Artificial corner histogram analysis is done for rejecting non-text corners. The discrete cosine transform (DCT) coefficients of input picture are extracted and texture intensity picture is created by combining appropriate coefficients. With combining artificial corners image and texture intensity image, a features vector is extracted and fed into support vector machine (SVM) classifier for detecting text regions. Finally, with drawing normalized texture intensity profiles, final verification is done and text lines are separated from each other’s.