What can and cannot be recognized?

Text

MyScript Cloud supports 65 languages.

The Text recognition service:

  • recognizes also digits, common punctuation and symbols, in the usual style for the given language.
  • can only recognize ink samples in the order in which they are sent. Any “reassembly” of parts of a document has to be done in pre-processing.
  • can recognize many common items such as many proper names, phone numbers, email addresses, Web addresses.
  • is case-sensitive during recognition. It does not “adjust” the case, so for example if you write in all capitals, the result is likely to be in capitals too.
  • recognizes text in one language. You cannot mix for example Russian and Spanish in a single request. However, most languages allow to recognize common “International” English terms. The East Asian languages support some English words appearing mixed in with native Chinese/Korean/Japanese ideograms.
  • cannot recognize super- or sub-script.

Math

The Math recognizer recognizes subsets of elements, within a spatial layout. The elements given below are examples of these math components: The list is almost exhaustive. This means that it can also recognize any type of scientific expression, including measurement units, chemical symbols and so on.

Type  Example
Letters (Latin alphabet) a b c d e f g h i j k l m n o p q r s t u v w x y z A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Digits 0 1 2 3 4 5 6 7 8 9
Mathematical symbols (including operators)  € $ £ ¥ ₩ ¢ ( ) < > [ ] { }  ! # % & ? @ / \ | ∥ © ∂∅ ∇ ∞ ℂ ℕ ℚ ℝ ℤ + - ± × ÷ * ∘ · = ‘ , .  : ; _ ← ↑ → ↓ ↔ ↕ ↖ ↗ ↘ ↙⇐ ⇑ ⇒ ⇓ ⇔ ⇕ ∀ ∃ ∄ ∈ ∉ ∋ ∌ ∪ ∩ ⊂ ⊃ ⊄ ⊅ ∼ ≃ ≈ ≠ ≡ ≢ ≤ ≥ ≪ ≫ ∝ ∠ ∏ ∑ ∫ ∫∫ ∫∫∫ ∮ ∯ ∰ ∧ √l ⊕ ⊗ ⊖ ⊙ ¬Ψℑℜℓ↼⇀↽⇁⇋⇌∐∦≄≅≆≇≉≮≯≰≱⋉⋊⋮⌈⌉ ⌊⌋□ △ ⟦ ⟧ ⦞ 𝔻 ⫻
Greek symbols frequently used in mathematics Γ Δ Ω α β γ δ ε η θ λ ν π ρ σ τ φ χ ψ ω ϕ µ
Mathematical terms sin cos tan sinh cosh tanh arcsin arccos arctan cot coth min max arg argmin argmax csc deg det dim exp gcd hom inf sup lg ker lim liminf limsup ln log dx dy dz dt rad mol angle bessel mean median mod norm ceil conv cov diag erf fft floor Im Re inv perms pinv rand round sign size sort sqrt svd var
Chemical elements He Li Be Ne Na Mg Al Si Cl Ar Ca Sc Ti Cr Mn Fe Co Ni Cu Zn Ga Ge As Se Br Kr Rb Sr Zr Nb Mo Tc Ru Rh Pd Ag Cd In Sn Sb Te Xe Cs Ba Hf Ta Re Os Ir Pt Au Hg Tl Pb Bi Po At Rn Fr Ra Rf Db Sg Bh Hs Mt Ds Rg Cn Uut Uuq Uup Uuh Uus Uuo La Ce Pr Nd Pm Sm Eu Gd Tb Dy Ho Er Tm Yb Lu Ac Th Pa Np Pu Am Cm Bk Cf Es Fm Md No Lr
International convention units (weight, length, frequency, luminosity, dosage, pressure, etc.) km hm dam m dm cm mm µm nm pm fm ha in ft mi yd yds oz lb lbs dl cl ml µl kg hg dag dg cg mg µg ng pg ds cs ms µs gon tex Pr ua ha rad tr Hg sec amp div rot abs ns ps fs Pt Tt Gt Mt kt PHz THz GHz MHz kHz hHz daHz Hz dHz cHz mHz µHz nHz pHz fHz PN TN GN MN kN hN daN dN cN mN µN nN pN fN PNm TNm GNm MNm kNm hNm daNm dNm cNm mNm µNm nNm pNm fNm PJ TJ GJ MJ kJ hJ daJ dJ cJ mJ µJ nJ pJ fJ PW TW GW MW kW hW daW dW cW mW µW nW pW fW PWh TWh GWh MWh kWh hWh daWh Wh dWh cWh mWh µWh nWh pWh fWh PeV TeV GeV MeV keV heV daeV eV deV ceV meV µeV neV peV feV PPa TPa GPa MPa kPa hPa daPa Pa dPa cPa mPa µPa nPa pPa fPa Pbar Tbar Gbar Mbar kbar hbar dabar bar dbar cbar mbar µbar nbar pbar fbar PA TA GA MA kA mA µA nA pA fA PV TV GV MV kV mV µV nV pV fV PS TS GS MS kS mS µS nS pS fS PC TC GC MC kC mC µC nC pC fC PAh TAh GAh MAh kAh mAh µAh nAh pAh fAh PF TF GF MF kF mF µF nF pF fF PH TH GH MH kH mH µH nH pH fH PWb TWb GWb MWb kWb mWb µWb nWb pWb fWb PT TT GT MT kT mT µT nT pT fT PBq TBq GBq MBq kBq hBq daBq Bq dBq cBq mBq µBq nBq pBq fBq PGy TGy GGy MGy kGy hGy daGy Gy dGy cGy mGy µGy nGy pGy fGy PSv TSv GSv MSv kSv hSv daSv Sv dSv cSv mSv µSv nSv pSv fSv Pcd Tcd Gcd Mcd kcd hcd dacd cd dcd ccd mcd µcd ncd pcd fcd Plm Tlm Glm Mlm klm hlm dalm lm dlm clm mlm µlm nlm plm flm Plx Tlx Glx Mlx klx hlx dalx lx dlx clx mlx µlx nlx plx flx parsec
Any text items (for example, cos, sin, median, etc.) can be written in cursive text or in a handprinted style.

The Math recognition service:

  • can recognize equations that are not true equations, or that are improbable,
  • can recognize equation elements with almost limitless levels of recursiveness (for example, square roots with subscript),
  • can only recognize single equations - but not systems of equations,
  • cannot recognize matrices or complex arithmetic layouts like long divisions,
  • cannot recognize other text or characters or shapes: it assumes that its input is always an equation,
  • cannot support corrections or ink incidents.

There is one specific case that is not supported:

Unsupported case Description Supported alternative
This is not currently supported. The second elements are too close to the first and will not give a correct result (the segmentation of the elements is likely to be incorrect). This type of equation can be recognized but care must be taken to provide adequate spacing.

Recognition overview

Text recognition results

MyScript is based on artificial intelligence and so works mainly on probability. It gives the most probable result it finds, in accordance with the criteria it has to work with.

For recognition of text, the result is delivered as a kind of tree: It is hierarchical. The result is considered on 3 levels: character, word and text (the whole ink sample). At each level, the recognizer can keep several candidates, or likely alternatives that it also found during its deduction. You can get several recognition candidates, at any level, if you ask for them in the initial request. For example, this could allow your application to provide alternative recognition choices. You can ask in your request to see the different levels of results and ask for a certain number of candidates for each level.

A score is provided, from 0 to 1. The resemblance score reflects the level of confidence in the recognizer’s raw output. The normalized score reflects how confident the recognizer is, taking into account all the resources it has used to recognize (dictionaries, lexicons, etc.). It is probably the latter score that you should refer to.

Potential errors

Besides standard HTTP errors, some API-related exceptions may occur.

Exception Description
TextApiParsingException ShapeApiParsingException MathApiParsingException AnalyzerApiParsingException IllegalInputParameterException There is incorrect input in the JSON request.
NotImplementedMethodException You have used bad syntax or used parameters etc that do not exist.
BadIpAddressApplicationKeyException BadRefererApplicationKeyException The filtering value is unknown or incorrect.
InvalidApplicationKeyException You have used an application key that does not exist or is incorrect.
RevokedApplicationKeyException Your application key is no longer valid or does not have appropriate permissions.
ApplicationDeactivatedException This application is not currently active. Visit the account console to fix this if necessary.
ContentTypeResourceNotFoundException A content type resource (used to restrict recognition to a specific type of linguistic input, such as city names) has been requested but is not present on the server. You may have misspelled it or asked for a resource that doesn’t exist in that particular language.
SKResourceNotFoundException A subset knowledge resource (used to restrict recognition to a specific set of characters) has been requested but is not present on the server. You may have misspelled it or asked for a resource that doesn’t exist in that particular language.
TextLanguageException You have asked for a language that is not supported by MyScript or the language code is incorrectly entered.
RecognitionApiTimedOutException The processing of your request leads to a timeout in the recognition engine.
IOException RuntimeException There is an internal server error.