FR2990786A1

FR2990786A1 - Method for determining orientation of digitized page of document to analyze and improve images, involves deciding orientation of digitized page of document as result of vote based on calculated sign function per text line

Info

Publication number: FR2990786A1
Application number: FR1254522A
Authority: FR
Inventors: Jerome Berger; Frederic Petit; Philippe Carre
Original assignee: Sagemcom Documents SAS
Current assignee: Sagemcom Documents SAS
Priority date: 2012-05-16
Filing date: 2012-05-16
Publication date: 2013-11-22

Abstract

The method involves extracting text lines from each block of extracted characters (1), and calculating a sign function per text line (2) from moment of odd nature and greater than 1 of pixel values corresponding to characters of the text line. Orientation of the digitized page is decided (3) as a result of a vote based on the calculated sign function, where the vote considers a zero integer value to which an integer value of 1 is added if the calculated sign function is positive and subtracted if the sign function is negative. Independent claims are also included for the following: (1) a device for determining orientation of a digitized page comprising a block of characters (2) a computer program product comprising a set of instructions to implement a method for determining orientation of a digitized page (3) a computer readable storage unit for storing a computer program comprising a set of instructions to implement a method for determining orientation of a digitized page.

Description

La présente invention concerne un procédé de détermination de l'orientation d'une page numérisée comprenant au moins une ligne de texte formée de caractères. La présente invention trouve son application dans le domaine du traitement d'images et, notamment, dans celui de l'analyse et de l'amélioration des images issues de la numérisation de pages de document. En effet, lors de l'acquisition numérique d'une page d'un document par un scanner, il est fréquent que cette page soit numérisée en biais. Cette inclinaison a pour effet de réduire la qualité visuelle de cette page et limite ainsi les performances des algorithmes d'analyse, notamment ceux de reconnaissance de caractères. Il est classique d'utiliser alors une méthode de redressement de l'image numérisée pour retrouver une orientation correcte de la page. Cette méthode de redressement consiste à estimer le biais et à appliquer ce biais pour que la page numérisée apparaisse redressée. Ces méthodes de redressement, bien qu'elles proposent de corriger l'inclinaison d'une page, ne permettent pas de déterminer l'orientation de la page c'est-à-dire de savoir si la page a été numérisée à l'endroit ou si la page a été numérisée à l'envers. Autrement dit, ces méthodes ne permettent pas de déterminer si la page originale a été placée correctement sur le scanner ou si elle a été placée à l'envers c'est-à-dire que le haut de la page originale correspond au bas de la page numérisée. Ceci a son importance pour corriger l'inclinaison de la page car il convient de détecter l'orientation de la page numérisée afin de savoir si il faut l'inverser, c'est-à-dire la tourner de 180° selon un axe perpendiculaire au plan de cette page, en complément de la correction de l'inclinaison par application du biais. Il est connu deux approches dans l'état de la technique pour déterminer l'orientation d'une page numérisée. La première approche consiste à appliquer une méthode de reconnaissance de caractères sur la page numérisée, à inverser cette page numérisée (rotation de 180° selon un axe perpendiculaire au plan de cette page), et à appliquer une nouvelle fois la méthode de reconnaissance de caractères. La décision de l'orientation est alors basée sur les résultats de reconnaissance de caractères obtenus. Cette approche présente l'inconvénient de devoir appliquer deux fois une méthode de reconnaissance de caractères ce qui est lourd en termes de temps de calcul et coûteux en termes de ressources. De plus, cette approche ne peut être mise en oeuvre lorsqu'elle est embarquée dans un appareil mobile du fait des ressources limitées de ce type d'appareil tant en termes de mémoires que de puissance de calcul. Une deuxième approche est basée sur le fait que les caractères latins comportent généralement plus de lettres « montantes » (b, d, f, h, k, 1...) que descendantes (g, j, p...). En effet, cette deuxième approche consiste, pour une ligne de texte, à déterminer une ligne haute et une ligne basse et à compter les pixels relatifs aux caractères qui se trouvent au-dessus de la ligne haute et ceux qui se trouvent en-dessous de la ligne basse. Un exemple de ce type d'approche est donné dans l'article de R.S. Caprari, intitulé "Algorithm for text page up/down orientation determination" (Pattern Recognition Letter, N° 21, 2000, pp. 311-317). Cette approche requiert l'application d'opérations morphologiques sur une image, application qui est largement connue de l'état de la technique pour être coûteuse en termes de ressources. Cette approche ne peut donc être mise en oeuvre lorsqu'elle est embarquée dans un appareil mobile du fait des ressources limitées de ce type d'appareil tant en termes de mémoires que de puissance de calcul. Le problème résolu par la présente invention est de définir un procédé de détermination de l'orientation d'une page numérisée qui ne requiert pas d'importantes ressources pour être mis en oeuvre. A cet effet, la présente invention concerne un procédé de détermination de l'orientation d'une page numérisée comprenant au moins une ligne de texte. Le procédé est caractérisé en ce qu'il comporte : - une étape de calcul d'une fonction de signe par ligne de texte, et - une étape de décision de l'orientation de la page numérisée selon le résultat d'un vote basé sur la ou les fonction(s) de signe ainsi calculée(s). The present invention relates to a method for determining the orientation of a digitized page comprising at least one line of text formed of characters. The present invention finds application in the field of image processing and, in particular, in the analysis and improvement of images resulting from the scanning of document pages. Indeed, when digitally acquiring a page of a document by a scanner, it is common that this page is digitized in bias. This inclination has the effect of reducing the visual quality of this page and thus limits the performance of the analysis algorithms, particularly those of character recognition. It is conventional to then use a method of straightening the scanned image to find a correct orientation of the page. This rectification method consists of estimating the bias and applying this bias so that the digitized page appears rectified. These methods of correction, although they propose to correct the inclination of a page, do not make it possible to determine the orientation of the page that is to say to know if the page was scanned at the place or if the page has been scanned upside down. In other words, these methods do not make it possible to determine if the original page was correctly placed on the scanner or if it was placed upside down, that is to say that the top of the original page corresponds to the bottom of the page. scanned page. This is important to correct the inclination of the page because it is necessary to detect the orientation of the scanned page in order to know if it should be reversed, that is to say turn it 180 ° along a perpendicular axis to the plane of this page, in addition to the correction of the inclination by application of the bias. Two prior art approaches are known for determining the orientation of a scanned page. The first approach is to apply a method of character recognition on the scanned page, to reverse this scanned page (rotation of 180 ° along an axis perpendicular to the plane of this page), and to apply again the method of recognition of characters . The decision of the orientation is then based on the character recognition results obtained. This approach has the disadvantage of having to apply twice a method of character recognition which is heavy in terms of computing time and expensive in terms of resources. In addition, this approach can not be implemented when it is embedded in a mobile device because of the limited resources of this type of device both in terms of memory and computing power. A second approach is based on the fact that Latin characters usually have more "rising" letters (b, d, f, h, k, 1 ...) than descendants (g, j, p ...). Indeed, this second approach consists, for a line of text, to determine a high line and a low line and to count the pixels relative to the characters which are above the high line and those which are below the low line. An example of this type of approach is given in the article by R.S. Caprari, entitled "Algorithm for text page up / down orientation determination" (Pattern Recognition Letter, No. 21, 2000, pp. 311-317). This approach requires the application of morphological operations on an image, an application that is widely known in the state of the art to be expensive in terms of resources. This approach can not be implemented when it is embedded in a mobile device because of the limited resources of this type of device both in terms of memory and computing power. The problem solved by the present invention is to define a method for determining the orientation of a digitized page that does not require significant resources to be implemented. For this purpose, the present invention relates to a method for determining the orientation of a digitized page comprising at least one line of text. The method is characterized in that it comprises: a step of calculating a sign function per text line, and a decision step of the orientation of the digitized page according to the result of a vote based on the function (s) of sign thus calculated (s).

L'utilisation d'une fonction de signe par ligne de texte permet d'obtenir une indication de l'orientation de cette ligne de texte sans pour autant consommer des ressources de calcul importantes. Ce procédé peut ainsi être embarqué dans des appareils à faibles ressources mémoire, de calcul et énergétiques. Selon un mode de réalisation, la fonction de signe pour une ligne de texte est calculée à partir d'un moment d'ordre impair et supérieur à 1 des valeurs des pixels qui correspondent à des caractères de cette ligne de texte. Selon un mode de réalisation, la fonction de signe pour une ligne de texte est calculée en considérant une ligne médiane de cette ligne de texte et en considérant une valeur entière nulle à laquelle il est ajouté la valeur entière 1 si un pixel qui correspond à un caractère de cette ligne de texte se situe au-dessus de cette ligne médiane et à lui retrancher la valeur entière 1 si un pixel qui correspond à un caractère de cette ligne de texte se situe au-dessous de cette ligne médiane. Selon un mode de réalisation, le vote consiste à considérer une valeur entière nulle à laquelle il est ajouté la valeur entière 1 si la fonction de signe calculée est positive et à lui retrancher la valeur entière 1 si cette fonction de signe est négative. Selon un mode de réalisation, au moins un bloc de caractère étant extrait à partir de la page numérisée, le procédé comporte alors une étape d'extraction de lignes de texte à partir de chaque bloc de caractère extrait. The use of a sign function per line of text makes it possible to obtain an indication of the orientation of this line of text without consuming significant computing resources. This method can thus be embedded in devices with low memory, calculation and energy resources. According to one embodiment, the sign function for a text line is calculated from an odd order moment and greater than 1 of the values of the pixels that correspond to characters of this line of text. According to one embodiment, the sign function for a text line is calculated by considering a center line of this line of text and considering a zero integer value to which the integer value 1 is added if a pixel that corresponds to a The character of this line of text is above this midline and subtract it from the integer value 1 if a pixel that matches a character in that line of text is below that midline. According to one embodiment, the vote consists of considering a null integer value to which the integer value 1 is added if the calculated sign function is positive and to subtract the integer value 1 if this sign function is negative. According to one embodiment, at least one block of character being extracted from the scanned page, the method then comprises a step of extracting lines of text from each extracted character block.

Selon un mode de réalisation, au cours de l'étape d'extraction de ligne de texte, une image binaire est obtenue à partir de la page numérisée et pour chaque bloc de caractère, les lignes de pixels de cette image binaire qui sont consécutives et qui contiennent chacune au moins un pixel qui correspond à un caractère sont regroupées entre elles pour former une ligne de texte. According to one embodiment, during the text line extraction step, a binary image is obtained from the digitized page and for each character block, the pixel lines of this binary image which are consecutive and each containing at least one pixel that corresponds to a character are grouped together to form a line of text.

Selon un mode de réalisation, au cours de l'étape de décision, le vôtre est tel qu'il est considéré que la page numérisée est orientée à l'envers si, parmi les lignes de texte de l'ensemble des blocs de caractère extraits de la page numérisée, le nombre de lignes de texte ayant une fonction de signe positive est supérieur au nombre de lignes de texte qui ont une fonction de signe négative, et sinon il est considéré que cette page numérisée est orientée à l'endroit. Ce mode de réalisation est particulièrement avantageux car la prise en compte des lignes de texte de tous les blocs de caractère de la page numérisée augmente significativement la fiabilité de la détermination de l'orientation de la page numérisée comparée au cas où seule une partie de ces lignes de texte est considérée. According to one embodiment, during the decision step, yours is such that it is considered that the scanned page is turned upside down if, among the lines of text of the set of character blocks extracted on the scanned page, the number of lines of text having a positive sign function is greater than the number of lines of text that have a negative sign function, and otherwise it is considered that this scanned page is oriented in the right place. This embodiment is particularly advantageous because the taking into account of the text lines of all the character blocks of the scanned page significantly increases the reliability of the determination of the orientation of the digitized page compared to the case where only a part of these lines of text is considered.

Selon l'un de ses aspects matériels, la présente invention concerne également un dispositif comportant des moyens pour mettre en oeuvre le procédé ci-dessus. Elle concerne également un programme d'ordinateur dont les instructions mettent en oeuvre au moins partiellement le procédé ci-dessus. Les caractéristiques de l'invention mentionnées ci-dessus, ainsi que d'autres, apparaîtront plus clairement à la lecture de la description suivante d'un exemple de réalisation, ladite description étant faite en relation avec les dessins joints, parmi lesquels: La Fig. 1 représente un diagramme des étapes d'un procédé de détermination de l'orientation d'une page numérisée selon la présente invention ; La Fig. 2 représente une illustration d'un exemple d'un bloc de caractères ; La Fig. 3 représente un exemple d'une ligne de texte ; La Fig. 4 représente une illustration d'un mode de réalisation pour calculer la fonction de signe ; La Fig. 5 représente un exemple d'une architecture interne d'un dispositif prévu pour mettre en oeuvre le procédé de la Fig. 1. De manière générale, tel que illustré à la Fig. 1, la présente invention concerne un procédé de détermination de l'orientation d'une page numérisée comprenant au moins une ligne de texte, caractérisé en ce qu'il comporte : - une étape 2 de calcul d'une fonction de signe par ligne de texte, et - une étape 3 de décision de l'orientation de la page numérisée selon le résultat d'un vote basé sur la ou les fonction(s) de signe ainsi calculée(s). Selon un mode de réalisation, la fonction de signe pour une ligne de texte Lj est calculée à partir d'un moment d'ordre impair et supérieur à 1 des valeurs des pixels relatifs aux caractères de cette ligne de texte. En termes mathématiques, la fonction de signe d'une ligne de texte, ici notée 1 un avec n impair et supérieur à 1, est donnée par : - mir dans laquelle I i=1 est l'ensemble des pixels de la ligne de texte qui correspondent à un caractère de cette 1 ligne de texte, et i=1 La Fig. 2 représente un exemple d'une ligne de texte Lj qui comporte trois caractères (a,b et g). Cette ligne de texte Lj est formée de Yj lignes de pixels de l'image binaire, chaque ligne de pixels comportant un nombre de pixels (xi,yi) égal au nombre de colonnes d'un bloc de caractères. Selon cet exemple, les caractères sont écrits en blanc sur fond noir pour indiquer que l'image dont cette ligne de texte est issue est une image binaire. Certains de ces pixels correspondent à un caractère, tel que les pixels (xl,y1) et (x2, y2), et certains autres correspondent au fond. Selon un mode de réalisation, illustré à la Fig. 2, la fonction de signe pour une ligne de texte est calculée en considérant une ligne médiane AX de cette ligne de texte. Cette ligne médiane AX est positionnée, par exemple, à une ordonnée Yj/2, c'est-à-dire au milieu de la ligne de texte. Une valeur entière N initialement nulle est alors considérée et il est ajouté la valeur entière 1 si un pixel (xl, yl) qui correspond à un caractère se situe au-dessus de cette ligne médiane et à lui retrancher la valeur entière 1 si un pixel (x2, y2) qui correspond à un caractère se situe au-dessus de cette ligne médiane. Selon un mode de réalisation, le vote consiste à considérer une valeur entière nulle à laquelle il est ajouté la valeur entière 1 si la fonction de signe calculée est positive et à lui retrancher la valeur entière 1 si cette fonction de signe est négative. Selon un mode de réalisation, au moins un bloc de caractère étant extrait à partir de la page numérisée, le procédé comporte alors une étape 1 d'extraction de lignes de texte à partir de chaque bloc de caractère extrait. Dans le domaine du traitement d'images, il est connu des méthodes de segmentation d'une page numérisée qui sont prévues pour extraire de cette page des blocs de caractères. Chacun de ces blocs de caractères correspond à une partie d'un texte de la page numérisée. La portée du procédé revendiqué n'est pas limitée à l'une de ces méthodes de segmentation de la page numérisée en blocs de caractères. On peut citer, à titre d'exemple, la méthode de segmentation d'une page numérisée en blocs de caractères qui est décrite dans l'article de Weijuan W. et al. Intitulé "An Efficient Method for Text Location and Segmentation", Proceeding WCSE '09 Proceedings of the 2009 WRI World Congress on Software Engineering - Volume 03 IEEE Computer Society. En résumé, cette méthode commence par déterminer une image de niveaux de gris à partir de la page numérisée. Une transformée en ondelettes de cette image est alors calculée (par exemple selon l'article de Mallat S., "A theory for multi-resolution signal decomposition: the wavelet representation", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 11, No. 7, 1993, pp. 674-693), suivie d'une sommation de trois des quatre sous-bandes caractérisant les discontinuités de cette image. Une binarisation de l'image est alors appliquée par un seuillage simple et un opérateur morphologique de fermeture est alors appliqué. Un algorithme connu sous l'abréviation CRLA est alors utilisé pour extraire les blocs de caractères de la page numérisée (F.M. Wahl, K.Y. Wong, and R.G. Casey, "Block segmentation and text extraction in mixed text/image documents", Computer Graphics and Image Processing, Vol.20, 1982, pp, 375-390). According to one of its material aspects, the present invention also relates to a device comprising means for implementing the method above. It also relates to a computer program whose instructions implement at least partially the above method. The characteristics of the invention mentioned above, as well as others, will emerge more clearly on reading the following description of an exemplary embodiment, said description being given in relation to the attached drawings, among which: FIG. . 1 is a diagram of the steps of a method for determining the orientation of a digitized page according to the present invention; Fig. 2 is an illustration of an example of a block of characters; Fig. 3 represents an example of a line of text; Fig. 4 is an illustration of an embodiment for calculating the sign function; Fig. 5 shows an example of an internal architecture of a device provided for implementing the method of FIG. 1. Generally speaking, as illustrated in FIG. 1, the present invention relates to a method for determining the orientation of a digitized page comprising at least one line of text, characterized in that it comprises: a step 2 of calculating a sign function per line of text, and - a step 3 of decision of the orientation of the digitized page according to the result of a vote based on the sign function (s) thus calculated (s). According to one embodiment, the sign function for a text line Lj is calculated from an odd order moment and greater than 1 of the values of the pixels relative to the characters of this line of text. In mathematical terms, the sign function of a line of text, here denoted 1 with odd n and greater than 1, is given by: - mir in which I i = 1 is the set of pixels in the text line which correspond to a character of this 1 line of text, and i = 1 FIG. 2 represents an example of a line of text Lj which comprises three characters (a, b and g). This line of text Lj is formed of Yj lines of pixels of the binary image, each row of pixels comprising a number of pixels (xi, yi) equal to the number of columns of a block of characters. In this example, the characters are written in white on a black background to indicate that the image from which this line of text originates is a binary image. Some of these pixels correspond to a character, such as pixels (xl, y1) and (x2, y2), and some others correspond to the background. According to one embodiment, illustrated in FIG. 2, the sign function for a text line is calculated by considering a centerline AX of this line of text. This central line AX is positioned, for example, at an ordinate Yj / 2, that is to say in the middle of the text line. An initially zero integer value N is then considered and the integer value 1 is added if a pixel (xl, yl) corresponding to a character is above this median line and to subtract the integer value 1 if a pixel (x2, y2) which corresponds to a character is above this median line. According to one embodiment, the vote consists of considering a null integer value to which the integer value 1 is added if the calculated sign function is positive and to subtract the integer value 1 if this sign function is negative. According to one embodiment, at least one block of character being extracted from the digitized page, the method then comprises a step 1 of extracting lines of text from each extracted character block. In the field of image processing, there are known methods of segmenting a scanned page that are intended to extract blocks of characters from this page. Each of these blocks of characters corresponds to a part of a text of the scanned page. The scope of the claimed method is not limited to any of these methods of segmenting the page scanned into blocks of characters. One example is the method of segmenting a page scanned into blocks of characters which is described in the article by Weijuan W. et al. Entitled "An Efficient Method for Text Location and Segmentation", Proceeding WCSE '09 Proceedings of the 2009 WRI World Congress on Software Engineering - Volume 03 IEEE Computer Society. In summary, this method begins by determining a grayscale image from the scanned page. A wavelet transform of this image is then calculated (for example, according to the article by Mallat S., "A theory for multi-resolution signal decomposition: the representation wavelet", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. No. 7, 1993, pp. 674-693), followed by a summation of three of the four subbands characterizing the discontinuities of this image. A binarization of the image is then applied by a simple thresholding and a closing morphological operator is then applied. An algorithm known by the abbreviation CRLA is then used to extract the blocks of characters from the digitized page (FM Wahl, KY Wong, and RG Casey, "Block segmentation and text extraction in mixed text / image documents", Computer Graphics and Image Processing, Vol. 20, 1982, pp. 375-390).

Selon un mode de réalisation, au cours de l'étape 1 d'extraction de lignes de texte, une image binaire est obtenue à partir de la page numérisée et pour chaque bloc de caractères, les lignes de pixels qui sont consécutives et qui contiennent chacune au moins un pixel qui correspond à un caractère de ce bloc de caractères sont regroupées entre elles pour former une ligne de texte. According to one embodiment, during step 1 of extracting lines of text, a binary image is obtained from the scanned page and for each block of characters, the rows of pixels that are consecutive and which contain each at least one pixel that corresponds to a character of this block of characters is grouped together to form a line of text.

La Fig. 4 représente une illustration d'un exemple d'un bloc de caractère B à partir duquel plusieurs lignes de texte Lj ont été formées. Chaque ligne de texte Lj comporte plusieurs lignes de pixels consécutives. Selon un mode de réalisation, au cours de l'étape 3 de décision, le vote est tel qu'il est considéré que la page numérisée est orientée à l'envers si, parmi les lignes de texte de l'ensemble des blocs de caractères extraits de la page numérisée, le nombre de lignes de texte ayant une fonction de signe positive est supérieur au nombre de lignes de texte qui ont une fonction de signe négative, et sinon il est considéré que cette page numérisée est orientée à l'endroit. Fig. 4 is an illustration of an example of a block of character B from which several lines of text Lj have been formed. Each line of text Lj has several rows of consecutive pixels. According to one embodiment, during the decision step 3, the vote is such that it is considered that the digitized page is turned upside down if, among the lines of text of all the blocks of characters When extracted from the scanned page, the number of lines of text having a positive sign function is greater than the number of lines of text that have a negative sign function, and otherwise it is considered that this digitized page is oriented in the right place.

A la Fig. 5 est représentée l'architecture d'un dispositif 500 mettant en oeuvre le procédé de la Fig. 1. Le dispositif 500 comporte, reliés par un bus de communication 501, des ressources matérielles qui sont: - un processeur, micro-processeur ou microcontrôleur 502; - une mémoire vive (Random Access Memory en anglais ou Mémoire à Accès Aléatoire en français) 503 ; - une mémoire 504 (Read Only Memory en anglais ou Mémoire à Lecture Seule en français) ou encore flash ou réinscriptible mais rémanente (telles que dans une clé USB) ; Le microcontrôleur 502 est capable d'exécuter des instructions chargées dans la mémoire 503 à partir de la mémoire 504. Lorsque le dispositif 500 est mis sous tension, le microcontrôleur 502 est capable de lire de la mémoire 503 des instructions et de les exécuter. Ces instructions forment un programme d'ordinateur qui cause la mise en oeuvre, par le microcontrôleur 502, de tout ou partie des procédés décrits ci- dessus en relation avec la Fig. 1. Tout ou partie des procédés décrits ci-dessus en relation avec la Fig. 1 peut être implémenté sous forme logicielle par exécution d'un ensemble d'instructions par une machine programmable, tel qu'un DSP (Digital Signal Processor en anglais ou Unité de Traitement de Signal Numérique en français) ou un microcontrôleur, tel que le microcontrôleur 502, ou être implémenté sous forme matérielle par une machine ou un composant dédié, tel qu'un FPGA (Field-Programmable Gate Array en anglais ou Matrice de Portes Programmable sur Champ en français) ou un ASIC (ApplicationSpecific Integrated Circuit en anglais ou Circuit Intégré Spécifique à une Application en français).35 In FIG. 5 is the architecture of a device 500 implementing the method of FIG. 1. The device 500 comprises, connected by a communication bus 501, hardware resources which are: a processor, microprocessor or microcontroller 502; - a random access memory (Random Access Memory in French or Random Access Memory in French) 503; a memory 504 (Read Only Memory in English or Memory to Read Only in French) or flash or rewritable but persistent (such as in a USB key); The microcontroller 502 is capable of executing instructions loaded into the memory 503 from the memory 504. When the device 500 is turned on, the microcontroller 502 is able to read from the memory 503 instructions and execute them. These instructions form a computer program which causes the microcontroller 502 to implement all or some of the methods described above in connection with FIG. 1. All or some of the methods described above in connection with FIG. 1 can be implemented in software form by executing a set of instructions by a programmable machine, such as a DSP (Digital Signal Processor in English or a Digital Signal Processing Unit in French) or a microcontroller, such as the microcontroller 502, or be implemented in hardware form by a machine or a dedicated component, such as an FPGA (Field Programmable Gate Array) or an ASIC (ApplicationSpecific Integrated Circuit). Integrated Specific to an Application in French) .35

Claims

CLAIMS1) A method for determining the orientation of a digitized page comprising at least one line of text, characterized in that it comprises: - a step (2) of calculating a sign function per line of text, and a step (3) for deciding the orientation of the scanned page according to the result of a vote based on the sign function (s) thus calculated.

2) The method of claim 1, wherein the sign function for a text line is calculated from an odd order moment and greater than 1 pixel values that correspond to characters of that line of text.

3) Method according to claim 1, wherein the sign function for a text line is calculated by considering a center line (AX) of this line of text and considering a null integer value to which is added the integer value 1 if a pixel that matches a character in that line of text is above that midline and subtract from it the integer value 1 if a pixel that matches a character in that line of text is below that line median line.

4) Method according to one of claims 1 to 3, wherein the vote consists in considering a null integer value to which the integer value 1 is added if the computed sign function is positive and to subtract from it the integer value 1 if this sign function is negative.

5) Method according to one of claims 1 to 4, wherein at least one block of character being extracted from the scanned page, the method then comprises a step (1) for extracting lines of text from each block of characters extracted.

6) Method according to claim 5, wherein during the step of extracting lines of text, a binary image is obtained from the scanned page and for each block of characters, the rows of pixels that are consecutive and each containing at least one pixel that corresponds to a character of that block of characters are grouped together to form a line of text.

7) Method according to claim 5 or 6, wherein during the decision step, the vote is such that it is considered that the digitized page is turned upside down if, among the lines of text of the set of character blocks extracted from the scanned page, the number of lines of text having a positive sign function is greater than the number of lines of text that have a negative sign function, and otherwise it is considered that this scanned page is oriented at the location.

8) A device for determining the orientation of a digitized page comprising at least one block of characters, characterized in that it comprises means for: extracting a line of text from a block of characters; a function of sign by extracted line of text, and - decide on the orientation of the digitized page according to the result of a vote based on the sign function (s) thus calculated (s).

9) Computer program product, characterized in that it comprises instructions for implementing, by a device according to claim 8, the method according to one of claims 1 to 7 when said program is executed by a processor of the device

10) Computer-readable storage means, characterized in that they store a computer program including instructions for implementing, by a suitable device, the method according to one of claims 1 to 7, when said program is executed by a processor of the device and according to claim 8.