CuGenDBv2

Spg037387 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037387
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: scaffold8:1010349..1020363
RNA-Seq Expression: Spg037387
Synteny: Spg037387
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
GAU21788.1 hypothetical protein TSUD_329120, partial [Trifolium subterraneum] | 5.7e-45 | 25.8
Query:  EQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGN-----ISISSLSF--RKY-----------AEALGNAIGAFEMVEVNEQGNITGETLRV
        + + K   L KF T+R+ + V K GPWSF  NLL+ +   GN     +++ S+SF  R Y           A+ LGN +G FE +++ E  N  G+ LR+
Subjt:  EQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGN-----ISISSLSF--RKY-----------AEALGNAIGAFEMVEVNEQGNITGETLRV

Query:  KIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGR--RRG
        ++ V++  P+KRG+ +         W+   YE+LP+FC+  GRIGH + DCE+  +  E   +   EL E Q      +++ P  R  P        ++ 
Subjt:  KIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGR--RRG

Query:  ADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQS---QNNSCCRK
        + S     +   +N N  G NSG         +E  E+VE+ +   +  S +         +  + KD      +G   K+Q + +      ++      
Subjt:  ADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQS---QNNSCCRK

Query:  NTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHV
        +T+ + G+          +K       K K   + +   K  + + +D    +    +LR   + +         D +   C         R +  L  V
Subjt:  NTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHV

Query:  IDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNN----VRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--IGFYGNLEVDKRKDSW
           E PQ+VF MET+    + E I+  L F +   V  N     R GGL L+W   + + I S+S  HI   C  ++  G +   G YG  E   ++ +W
Subjt:  IDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNN----VRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--IGFYGNLEVDKRKDSW

Query:  EMLERLTENDDEQWIIGGDFNKIWD---------------KVGYLILVTLWIRPVLYYDTNAMIQRLHVVDMRVG---VSCAMSLHKTGPRNTKEKVLVA
        +++  L   +   W+  GDFN I D                 G   +       + + +  + I+  H+   R G    +  +SL    PR+T+ +  + 
Subjt:  EMLERLTENDDEQWIIGGDFNKIWD---------------KVGYLILVTLWIRPVLYYDTNAMIQRLHVVDMRVG---VSCAMSLHKTGPRNTKEKVLVA

Query:  VFPKD--QDQRFGE---------TFKTSKP-----------------KKEDEILNLSER--DDSQNAEILE------KAKAELDDLLEEEEYFWRSRSRE
         F +   +D +  E         T   SK                  K +  I  + ER  D S  +E +E      + + E  +LL+ +E  WR RS+ 
Subjt:  VFPKD--QDQRFGE---------TFKTSKP-----------------KKEDEILNLSER--DDSQNAEILE------KAKAELDDLLEEEEYFWRSRSRE

Query:  VWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFI----SSILDQQDISK------------------EVEEALKSLSP
        +WL++GDRNTK+F  KASQR + N I+ + D +G W   E+ I +V  NYF ELF     S+I++  ++ K                  EV+EA+  + P
Subjt:  VWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFI----SSILDQQDISK------------------EVEEALKSLSP

Query:  SKAPGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNTNNS
         KAPG DG  A F+Q YW +VG+  +L     +  +  P + N +
Subjt:  SKAPGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNTNNS

KAA3477524.1 reverse transcriptase [Gossypium australe] | 1.8e-43 | 26.82
Query:  GEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHID
        G G   AP   MK LCWN RG+GNP ++R L+ ++    P +VF  ETKC  +    ++   + D    V    RSG L LMW   +++T++++S+ HID
Subjt:  GEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHID

Query:  TYCRSKDWHG-RFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKIW----------------DKVGYLI----LVTL--------------WI
        +    +D    RF GFYG  +   +K SW++L+R+     E WI+GGDFN I                 D+ G ++    L  +              W+
Subjt:  TYCRSKDWHG-RFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKIW----------------DKVGYLI----LVTL--------------WI

Query:  --------RPVLYYD-------TNAMIQR----------LHVVDMRVGVSC-----AMSLHKTGPRNTKEKVLVAVFPKDQDQRF---------------
                R ++  D        NAM+ R          +  +  + G SC              ++ + K ++      +D+ F               
Subjt:  --------RPVLYYD-------TNAMIQR----------LHVVDMRVGVSC-----AMSLHKTGPRNTKEKVLVAVFPKDQDQRF---------------

Query:  GETFKTSKPKKEDEILNLSERDDSQNAE----ILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQ
         + +K  K K  D    +S   DS N+E    +L+ A+  L  L E EE +W  ++R  WL  GDRNT++F  +A+ RR++N I+ + D +G W  ++ +
Subjt:  GETFKTSKPKKEDEILNLSERDDSQNAE----ILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQ

Query:  IGKVATNYFKELFISSILDQQD----------------------ISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNT
        I  +A NYF +LF S+ +D +                       I  E+  A   +   KAPGIDG    F++ +W  VG++   TC  T+         
Subjt:  IGKVATNYFKELFISSILDQQD----------------------ISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNT

Query:  NNSSNENHHQGAPTILWITEVE
         N++N+        ++ I++VE
Subjt:  NNSSNENHHQGAPTILWITEVE

KAG4138640.1 hypothetical protein ERO13_D07G146566v2, partial [Gossypium hirsutum] | 8.2e-44 | 28.92
Query:  MKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHG-
        MK L WN RG+G+P  IR L+ ++   +P ++F  ETK +  + ++++   R  +   V ++ RSGGL LMW  ++++TIKS+S+ HID+  +       
Subjt:  MKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHG-

Query:  RFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWDKVGYLILVTL--------WI-----------RPVLYYDTNAMIQRLHVVDMRVGVSC
        R +G+YG+    +R  SW ML  +     E+W++G   N   D +  L LV +        W+           R   +  + +M+++   +   V +  
Subjt:  RFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWDKVGYLILVTL--------WI-----------RPVLYYDTNAMIQRLHVVDMRVGVSC

Query:  AMSLH------------KTGPRN-----------TKEKVLVAVFPKDQDQRFGE---------------------TFKTSKPKKED---EILNLSERDDS
        + S H            K  PR+           T EK +  +  K+  Q  G                        K    + ED   +I++ + RD+S
Subjt:  AMSLH------------KTGPRN-----------TKEKVLVAVFPKDQDQRFGE---------------------TFKTSKPKKED---EILNLSERDDS

Query:  QNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFIS--SILDQQDIS--
           +++ + +  L+ L + EE +W  RSR  WL+ GDRNTK+F AKA++R ++N IE +   NG W+T  + I K A NYF  LF S  + ++Q D+S  
Subjt:  QNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFIS--SILDQQDIS--

Query:  ------------------KEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGE
                          +E+  A+K ++PSKAPGIDG   +FY+ +W +VG+
Subjt:  ------------------KEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGE

KAG8493180.1 hypothetical protein CXB51_010593 [Gossypium anomalum] | 1.1e-43 | 25.46
Query:  EEETQTKELQEQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVF----DEPKGNISISSLSF---------RKYAEALGNAIGAFEMVEVNEQGNITG
        +EE     L+E +    ++ KFG+  DK+++    PW F N L+ +    D  + N+S   L           R+ A  +GNAIG    ++  ++     
Subjt:  EEETQTKELQEQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVF----DEPKGNISISSLSF---------RKYAEALGNAIGAFEMVEVNEQGNITG

Query:  ETLRVKIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGR
        E LR+KIK+NI++P++R   + +G +       + Y+ LP FCY+ GRIGHT+  C+                     S+  Y S+  +           
Subjt:  ETLRVKIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGR

Query:  RRGADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRK
                +Y +W + N       S  R +G   +E   ++   ++DK+E  +                                               
Subjt:  RRGADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRK

Query:  NTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHV
          E +KG  +   SN P +K ++                 +  L R  +   E +    R TR+  G G  TAP   MK LCW+ RG+GN   IR LR +
Subjt:  NTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHV

Query:  IDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCR-SKDWHGRFIGFYGNLEVDKRKDSWEMLER
             P ++F  ETK   +K   I    R D    V    +SGGL +MW+  I++ IK++S  HID++ +   D   RF GFYGN   +KR+ SW+ML +
Subjt:  IDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCR-SKDWHGRFIGFYGNLEVDKRKDSWEMLER

Query:  LTENDDEQWIIGGDF---NKIWDKVGYLILVTLWIRPVLYYDTNAMIQRLHVVDMRVGV---------SCAMSLHKTGPRNTKEKVLVAVFPKDQDQRFG
        + +   E+WII GDF   N+  D      L    I   +Y D      R  + D R+           + A ++ K   +N    ++        D R  
Subjt:  LTENDDEQWIIGGDF---NKIWDKVGYLILVTLWIRPVLYYDTNAMIQRLHVVDMRVGV---------SCAMSLHKTGPRNTKEKVLVAVFPKDQDQRFG

Query:  ETFKTSKPKKEDEILN------LSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEED
        +  K  + +K+   L       + E+ +  +   L+  + +L  LL++EE +W  RSR  WL+ GDRNT++F  +A   R+++ IE +    G W     
Subjt:  ETFKTSKPKKEDEILN------LSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEED

Query:  QIGKVATNYFKELFISSILDQQDI----------------------SKEVEEALKSLSPSKAPGID
         I + A  YF+ LF SS+ + + +                      + E+ EA   +   KAP ID
Subjt:  QIGKVATNYFKELFISSILDQQDI----------------------SKEVEEALKSLSPSKAPGID

MCH80348.1 hypothetical protein [Trifolium medium] | 1.0e-46 | 25.54
Query:  EQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGN-----ISISSLSF--RKY-----------AEALGNAIGAFEMVEVNEQGNITGETLRV
        + + K   L KF T+R+ + V + GPWSF  NLL+ +   GN     +++  +SF  R Y           A+ LGN +G FE ++  E  N  G+ LR+
Subjt:  EQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGN-----ISISSLSF--RKY-----------AEALGNAIGAFEMVEVNEQGNITGETLRV

Query:  KIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGR--RRG
        ++ V++  P+KRGT +         W+   YE+LP+FC+  GRIGH + DCE+  +  E   +   EL E Q      +++ P  R  P        ++ 
Subjt:  KIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGR--RRG

Query:  ADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRKNTE
        + S     +   +  N  G NSG         +E  E+VE+ +   +  S           Q  I K   +    G  SK+Q + +            T 
Subjt:  ADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRKNTE

Query:  PEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLR-RTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVID
            +TK G      K+       ++K T  + Q K  K + ++  G++ +   ++   T   +  G      D +   C  V   G+PR +R L  +  
Subjt:  PEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLR-RTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVID

Query:  GEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNN----VRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKD----WHGRFIGFYGNLEVDKRKDSW
         E PQ+VF MET+  V + E I+  L F +   V  N     R+GGL LMW   + + I S+S  HI   C  ++    W     G YG  E   ++ +W
Subjt:  GEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNN----VRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKD----WHGRFIGFYGNLEVDKRKDSW

Query:  EMLERLTENDDEQWIIGGDFNKIWDK--------------------VGYLILVTLWIRPVLYYDTNAMIQRLHV---VDMRVG-----------------
         ++  L   +  +W+  GDFN I D                     V    L+ L      +  +N   +  +V   +D  +G                 
Subjt:  EMLERLTENDDEQWIIGGDFNKIWDK--------------------VGYLILVTLWIRPVLYYDTNAMIQRLHV---VDMRVG-----------------

Query:  ------VSCAMSLHKTGPRNTKEKVLVAVFPKD--QDQRFGETFKT-------SKPKK-------------------EDEILNLSER--DDSQNAEILE-
               +  + L    P  T+ +  +  F +   ++ +  E  ++       S PKK                   + E++ + +R  D S  +E +E 
Subjt:  ------VSCAMSLHKTGPRNTKEKVLVAVFPKD--QDQRFGETFKT-------SKPKK-------------------EDEILNLSER--DDSQNAEILE-

Query:  -----KAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFI----SSILDQQDISK-
             + + E ++LL+ +E  WR RSR +WL++GDRNTK+F  KASQR + N I+ + D +G W   ++ + +V  +YF ELF     S+I +  ++ K 
Subjt:  -----KAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFI----SSILDQQDISK-

Query:  -----------------EVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNTNNS
                         EV+EA+  + P KAPG DG  A F+Q YW +VG+  +L     +  +  P + N +
Subjt:  -----------------EVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNTNNS

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EEA9 CCHC-type domain-containing protein | 4.4e-59 | 25.61
Query:  KFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPI
        +F   +++ RV  G PW F N LL+ +   G I  S +                    R+  E +G+  G    V+V + G   G  LR+++ V+I+ P 
Subjt:  KFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPI

Query:  KRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDC----EEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQGRYD
         RG  V   S   + W+   YE+LP  C+H G IGH   DC    +   + G S K YG  LR  + S   ++ ++   R   FR  GR   A    + +
Subjt:  KRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDC----EEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQGRYD

Query:  NWRKTNL-------NGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMG----------------PTPHDGKQLAIDKDEETTLVNGKGSKKQGI-
         W            +G+   S  + V  +       + E S  K      EMG                P P  G    +       +    G   Q + 
Subjt:  NWRKTNL-------NGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMG----------------PTPHDGKQLAIDKDEETTLVNGKGSKKQGI-

Query:  SQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRG
         +   QN+    +  EP   +T     +         + +K+   S ++    WK++ R       +    L+R           +P   M+ L WN RG
Subjt:  SQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRG

Query:  LGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--IGFYGNL
        LGNP+ +R L  ++    P ++F  ET+ D    E ++VS +F++++CVP     GGL ++W+  +++ + S+S+ HID     KD    F   GFYGN 
Subjt:  LGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--IGFYGNL

Query:  EVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWD----------------------------KVGYLILVTLWIR----PVLYYD-TNAMIQRLHVVDMR
        E  KRK++W +L+ L+ +    W+  GDFN++ D                             +G+      WI+    P    +  + ++  +  ++M 
Subjt:  EVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWD----------------------------KVGYLILVTLWIR----PVLYYD-TNAMIQRLHVVDMR

Query:  VGVS-----------CAMSLH--KTGPRNTKEKV--LVAVFPKDQD--------------------------------------QRFGETFKTSKPKKE-
         G             C + L   +   R  + +V    A++ KD+                                       +RFG+     K K++ 
Subjt:  VGVS-----------CAMSLH--KTGPRNTKEKV--LVAVFPKDQD--------------------------------------QRFGETFKTSKPKKE-

Query:  -DEILNLSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFIS
          E+ N++  D S  AEIL   + E++DLLE+EE FWR RSR  W+++GD+NT++F A+ +QRR+ N I G+ D +G W T++ ++  +A  YF+ +F S
Subjt:  -DEILNLSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFIS

Query:  S-----ILD-------------------QQDISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVG
        S     ++D                   Q+   +EV +AL  + P+KAPG DG  A FYQ+YW +VG
Subjt:  S-----ILD-------------------QQDISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVG

A0A2N9G497 Reverse transcriptase domain-containing protein | 4.1e-57 | 27.35
Query:  FGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPIK
        F    D  RV   GPWSF  +L++      +I  S  SF                  ++ +EA+G  +G  E  E  ++G   G  +R+++ ++I  P+ 
Subjt:  FGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPIK

Query:  RGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEE----GESTKNYGVELR---ETQGSKCIY--------------------KSW---
        RG  + +G N + +W+   YE+L +FCY  G I H   DCE +         + + YG  +R   E  G K  +                    + W   
Subjt:  RGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEE----GESTKNYGVELR---ETQGSKCIY--------------------KSW---

Query:  -----------------KPMFRDGPFRGRGRRRGADSQG----------------RYDNWRKTNLNGNGV---NS--GNRAVGYIQLEETKE---KVEES
                         K     GP +     R   +                  RYD+    ++NG  +   NS  G+ A G   LEET E    V   
Subjt:  -----------------KPMFRDGPFRGRGRRRGADSQG----------------RYDNWRKTNLNGNGV---NS--GNRAVGYIQLEETKE---KVEES

Query:  KDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLA
           +E S  + G   H G  +  +  E    VNG G+  +  S+ Q+   S            T SG  ND    D+  T + +  + ++  +  WKR A
Subjt:  KDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLA

Query:  -RMDYGQQEMKTNLLRRTRRD-----IGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNN
         R      ++    +  ++R+     +GE  G   S   K      +GLGNP  +R L H++  + P+V+F METK D  + E I+V L FD+++ VP+ 
Subjt:  -RMDYGQQEMKTNLLRRTRRD-----IGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNN

Query:  VRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKD---WHGRFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWDKVGYLILVTLWIRPVLY
         RSGGL L+W +D E+ I+++S+ HID +  SK    W  R  GFYG  E  +R++SW +L+ L+  D   W   GDFN+I        L T    P+  
Subjt:  VRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKD---WHGRFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWDKVGYLILVTLWIRPVLY

Query:  YDTNAMIQRLHVVDM--RVGVSCAMSLHKTGPRNTKEKVLVAVFP-------KDQDQRFGETFKTSKPKKEDEILNLSERDDSQNAEILEK---------
           +  +     +D+  R  VS     H  G  +    ++V++         K   +RF E + T+ P  E  I     +  +   E LE          
Subjt:  YDTNAMIQRLHVVDM--RVGVSCAMSLHKTGPRNTKEKVLVAVFP-------KDQDQRFGETFKTSKPKKEDEILNLSERDDSQNAEILEK---------

Query:  -------AKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFISSILDQ---------
                + E++ LL ++E  WR RSREVWL  GD+NT++F  KA QRR +N ++G+ D NG W  EE ++G +   YF+++F +S + +         
Subjt:  -------AKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFISSILDQ---------

Query:  -------------QDISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGE
                     Q  + E+++A   + PSKAPG DG  + F+Q YW +VG+
Subjt:  -------------QDISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVGE

A0A2N9G8I6 Reverse transcriptase domain-containing protein | 1.7e-58 | 25.69
Query:  FGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPIK
        F    D  RV   GPWSF  +L++      +I  S  SF                  ++ +EA+G  +G  E  E  ++G   G  +R+++ ++I  P+ 
Subjt:  FGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPIK

Query:  RGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEE----GESTKNYGVELRETQGSKC-------------------------IYKSWK
        RG  + +G N + +W+   YE+L +FCY  G I H   DCE +         + + YG  +R  Q +                           + +  K
Subjt:  RGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEE----GESTKNYGVELRETQGSKC-------------------------IYKSWK

Query:  PMFRDGPFRGRGRRRGADSQG----------------RYDNWRKTNLNGNGV---NS--GNRAVGYIQLEETKE---KVEESKDKKEGSSAEMGPTPHDG
             GP +     R   +                  RYD+    ++NG  +   NS  G+ A G   LEET E    V      +E S  + G   H G
Subjt:  PMFRDGPFRGRGRRRGADSQG----------------RYDNWRKTNLNGNGV---NS--GNRAVGYIQLEETKE---KVEESKDKKEGSSAEMGPTPHDG

Query:  KQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLA-------------RMDYG
          +  +  E    VNG G+  +  S+ Q+   S            T SG  ND    D+  T + +  + ++  +  WKR A              +   
Subjt:  KQLAIDKDEETTLVNGKGSKKQGISQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLA-------------RMDYG

Query:  QQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWN
        ++  +  LL     + G     AP +TM  L WN +GLGNP  +R L H++  + P+V+F METK D  + E I+V L FD+++ VP+  RSGGL L+W 
Subjt:  QQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWN

Query:  SDIEMTIKSWSEGHIDTYCRSKD---WHGRFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKI--------------------WDKVGYLILV
        +D E+ I+++S+ HID +  SK    W  R  GFYG  E  +R++SW +L+ L+  D   W   GDFN+I                     + V     V
Subjt:  SDIEMTIKSWSEGHIDTYCRSKD---WHGRFIGFYGNLEVDKRKDSWEMLERLTENDDEQWIIGGDFNKI--------------------WDKVGYLILV

Query:  TLWIRPVLYYDTNAMIQRLHV---VDMRVGVSCAMSL-------HKTGPRNTKEKVLVAVFP-------KDQDQRFGETFKTSKPKKE------------
         L  +   Y  TN      ++   +D  +  +  + L       H  G  +    ++V++         K   +RF E + T+   ++            
Subjt:  TLWIRPVLYYDTNAMIQRLHV---VDMRVGVSCAMSL-------HKTGPRNTKEKVLVAVFP-------KDQDQRFGETFKTSKPKKE------------

Query:  ----------------------DEILNLSERDDSQNAEILEK----------------AKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQR
                               E+  ++    +   E LE                  + E++ LL ++E  WR RSREVWL  GD+NT++F  KA QR
Subjt:  ----------------------DEILNLSERDDSQNAEILEK----------------AKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQR

Query:  RRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELF------------------ISSILDQQDISK----EVEEALKSLSPSKAPGIDGAHASFYQSYWGV
        R +N ++G+ D NG W  EE ++G +   YF+++F                  +++++++Q +S+    E+++A   + PSKAPG DG  + F+Q YW +
Subjt:  RRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELF------------------ISSILDQQDISK----EVEEALKSLSPSKAPGIDGAHASFYQSYWGV

Query:  VGE
        VG+
Subjt:  VGE

A0A2N9GF83 CCHC-type domain-containing protein | 3.7e-66 | 29.51
Query:  LCKFGTQRDKNRVTKGGPWSFGNNLLVFDE-----PKGNISISSLSF-------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNIND
        L +F  + D  RV KG PW F NNLLV +E     P   IS +S  F             ++  E +G AIG  E V+V+E G   G  LRV+I V+I  
Subjt:  LCKFGTQRDKNRVTKGGPWSFGNNLLVFDE-----PKGNISISSLSF-------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNIND

Query:  PIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGEST----KNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQ--
        PI+RG  V  GS   + WI   YE+LP FC+H G++GH   +C      G       K YGV LR  + S          FR     G  RRRGA S   
Subjt:  PIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGEST----KNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQ--

Query:  ----GRYDNWRKTNLNGNGVNSGNRAVGYIQLEETK--EKVEESK-------DKKEGSSAEM-GPT----PHDGKQ---LAIDKDEETTLVNGKGSKKQG
            G+  +      +     SG+   GY    E +  +KV  S        D+     A +  PT    P+ G+Q   +A+       L  G G     
Subjt:  ----GRYDNWRKTNLNGNGVNSGNRAVGYIQLEETK--EKVEESK-------DKKEGSSAEM-GPT----PHDGKQ---LAIDKDEETTLVNGKGSKKQG

Query:  ISQSQ--SQNNSCCRKNTEPEKGKTKSGASNDP----RKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKT
        I+QS   S            E       AS  P      K   G+    K     +    WKRLAR        K  +          GC  AP DTM  
Subjt:  ISQSQ--SQNNSCCRKNTEPEKGKTKSGASNDP----RKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKT

Query:  LCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--
        L WN +GLGNP  +R L  ++  + P ++F  ET+ D    E+++V L+F +++CVP     GGL L+W + +E+ I+S+S  HID   +    H RF  
Subjt:  LCWNVRGLGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--

Query:  IGFYGNLEVDKRKDSWEMLERLTE-NDDEQWIIGGDFNKIWDK----------------------------VGYLILVTLW-------------------
         GFYGNLE  KRK+SW +L+ L++      W+  GDFN++ D                             +G++     W                   
Subjt:  IGFYGNLEVDKRKDSWEMLERLTE-NDDEQWIIGGDFNKIWDK----------------------------VGYLILVTLW-------------------

Query:  --IRPVLYYDTNAM-IQRLHVVDMRVGVS--CAMSLH-KTGPRNTKEKVLV---AVFPKDQD--------------------------------------
          +   L  D+  +  Q L V  + V  S  C + +H   G + ++ K +    A++ KD+                                       
Subjt:  --IRPVLYYDTNAM-IQRLHVVDMRVGVS--CAMSLH-KTGPRNTKEKVLV---AVFPKDQD--------------------------------------

Query:  QRFGETFKTSKPKKEDEILNLSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQI
         RFG    + K K+      L++     +  ILE  + +L+ LLE+EE +W+ RSR  W++ GD+NTK+F A+ SQRR  NK++G+ D  G W T++ ++
Subjt:  QRFGETFKTSKPKKEDEILNLSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQI

Query:  GKVATNYFKELFISS-----------------ILDQQD-------ISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVG
          +A +YFK +F SS                 + D+ +        +KE+ EALK + P+KAPG DG  A FYQ+YW +VG
Subjt:  GKVATNYFKELFISS-----------------ILDQQD-------ISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVG

A0A2N9HND5 CCHC-type domain-containing protein | 1.7e-58 | 25.49
Query:  KFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPI
        +F   +++ RV  G PW F N LL+ +   G I  S +                    R+  E +G+  G    V+V + G   G  LR+++ V+I+ P 
Subjt:  KFGTQRDKNRVTKGGPWSFGNNLLVFDEPKGNISISSLSF------------------RKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPI

Query:  KRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDC----EEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQGRYD
         RG  V   S   + W+   YE+LP  C+H G IGH   DC    +   + G S K YG  LR  + S   ++ ++   R   FR  GR   A    + +
Subjt:  KRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDC----EEYTEEGESTKNYGVELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQGRYD

Query:  NWRKTNL-------NGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMG----------------PTPHDGKQLAIDKDEETTLVNGKGSKKQGI-
         W            +G+   S  + V  +       + E S  K      EMG                P P  G    +       +    G   Q + 
Subjt:  NWRKTNL-------NGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMG----------------PTPHDGKQLAIDKDEETTLVNGKGSKKQGI-

Query:  SQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRG
         +   QN+    +  EP   +T     +         + +K+   S ++    WK++ R       +    L+R           +P   M+ L WN RG
Subjt:  SQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRG

Query:  LGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--IGFYGNL
        LG+P+ +R L  ++    P ++F  ET+ D    E ++VS +F++++CVP     GGL ++W+  +++ + S+S+ HID     KD    F   GFYGN 
Subjt:  LGNPRMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRF--IGFYGNL

Query:  EVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWD----------------------------KVGYLILVTLWIR----PVLYYD-TNAMIQRLHVVDMR
        E  KRK++W +L+ L+ +    W+  GDFN++ D                             +G+      WI+    P    +  + ++  +  ++M 
Subjt:  EVDKRKDSWEMLERLTENDDEQWIIGGDFNKIWD----------------------------KVGYLILVTLWIR----PVLYYD-TNAMIQRLHVVDMR

Query:  VGVS-----------CAMSLH--KTGPRNTKEKV--LVAVFPKDQD--------------------------------------QRFGETFKTSKPKKE-
         G             C + L   +   R  + +V    A++ KD+                                       +RFG+     K K++ 
Subjt:  VGVS-----------CAMSLH--KTGPRNTKEKV--LVAVFPKDQD--------------------------------------QRFGETFKTSKPKKE-

Query:  -DEILNLSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFIS
          E+ N++  D S  AEIL   + E++DLLE+EE FWR RSR  W+++GD+NT++F A+ +QRR+ N I G+ D +G W T++ ++  +A  YF+ +F S
Subjt:  -DEILNLSERDDSQNAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFIS

Query:  S-----ILD-------------------QQDISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVG
        S     ++D                   Q+   +EV +AL  + P+KAPG DG  A FYQ+YW +VG
Subjt:  S-----ILD-------------------QQDISKEVEEALKSLSPSKAPGIDGAHASFYQSYWGVVG

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 3.6e-05 | 23.66
Query:  ILWITEVEI-------VGSRLPWKREGNFSINLAFDPKPWKGSGLFLLSEKEHCTGWTLSRGKSRVLQSNTQEEL-----GEWMVLESDGDWRESWLDWL
        + W+  +E+       +G+ +PW         L +  + WK     +   KE+     L R      + +T+ EL     G  +       W+     W 
Subjt:  ILWITEVEI-------VGSRLPWKREGNFSINLAFDPKPWKGSGLFLLSEKEHCTGWTLSRGKSRVLQSNTQEEL-----GEWMVLESDGDWRESWLDWL

Query:  WDCSCKFCPGFVKINVDASWNEEQNRGGIGWIARDSQRSPVGMGCSKINRRWSICVLESSAIKEGLAAYLNARGEAQTRILIESDS
                   VK N DA+W  E  R GIGWI R+     + MG   + R  ++   E  A++    A L        RI+ ESD+
Subjt:  WDCSCKFCPGFVKINVDASWNEEQNRGGIGWIARDSQRSPVGMGCSKINRRWSICVLESSAIKEGLAAYLNARGEAQTRILIESDS

AT4G29090.1 Ribonuclease H-like superfamily protein | 5.2e-04 | 31.58
Query:  FVKINVDASWNEEQNRGGIGWIARDSQRSPVGMGCSKINRRWSICVLESSAIKEGLAAYLNARGEAQTRILIESDS
        +VK N DA+WN +  R GIGW+ R+ +     MG   + +  S+   E  A++    A L+        ++ ESDS
Subjt:  FVKINVDASWNEEQNRGGIGWIARDSQRSPVGMGCSKINRRWSICVLESSAIKEGLAAYLNARGEAQTRILIESDS


Sequences
CDS sequence
ATGCACGCCCTAATTTTGCCGGAAGTGCCGCAGCGTCGAGACGCTGTGGAGTTAGAGGAGGGTTTTGGCACTTTTTCTCTCTGGAAGGAAAAGGCCATGGACGAGGAAGG
GCAACACTCGGTTTGCTCGAAGGGATCCAACGGTGATCAGAGTATGGAGATTGAGAATGGAAACGCCAAGAGATTGGAAGAGGAGACACAAACAAAGGAGTTACAGGAGC
AGATAGAGAAACTTAGTCTGCTCTGTAAATTTGGGACTCAAAGAGACAAAAACCGAGTCACAAAAGGGGGCCCCTGGAGTTTTGGTAACAACCTTCTTGTCTTCGATGAG
CCAAAAGGAAACATCAGCATAAGCTCTCTGAGCTTCAGGAAATATGCAGAGGCACTTGGGAACGCCATTGGAGCATTTGAGATGGTGGAAGTGAATGAGCAAGGGAATAT
CACTGGTGAAACATTAAGAGTCAAAATAAAAGTTAATATCAATGATCCCATTAAAAGAGGAACGAATGTTGAGATAGGCTCGAATAGCAACTTGAATTGGATTCCAATCA
CCTATGAGAAACTCCCGGATTTTTGCTATCACAGTGGGAGAATTGGTCATACTCTGGACGATTGTGAAGAATATACCGAGGAAGGAGAATCCACTAAAAATTATGGGGTT
GAACTCAGAGAAACTCAGGGTAGTAAATGTATATACAAGAGTTGGAAACCAATGTTTAGGGATGGCCCCTTCAGAGGCAGAGGCAGGAGAAGAGGAGCTGATTCGCAGGG
GAGGTACGATAACTGGAGGAAAACCAATCTCAATGGAAATGGTGTTAACTCTGGGAATAGAGCCGTTGGGTACATTCAACTTGAAGAAACCAAGGAAAAAGTGGAAGAAA
GTAAAGACAAAAAAGAAGGAAGCAGCGCTGAGATGGGACCCACACCCCATGATGGTAAACAGCTGGCAATAGACAAGGATGAGGAAACTACATTAGTTAATGGGAAAGGG
AGTAAAAAGCAAGGTATTAGTCAAAGTCAAAGCCAAAATAACAGTTGCTGTAGAAAAAATACAGAACCGGAAAAGGGAAAAACAAAATCAGGAGCAAGCAATGACCCTCG
AAAAAAGGACAACCACGGGACTGGAAACAAGAGAAAAACTACTTCTCATGAGAGACAAACAAAGAAGTGGAAACGGTTAGCTAGAATGGATTATGGGCAACAAGAAATGA
AGACCAACTTACTTCGTAGAACTCGAAGGGATATCGGCGAAGGCTGTGGAACAGCCCCGTCGGACACCATGAAAACCTTATGCTGGAACGTTCGAGGGTTGGGGAACCCT
CGAATGATCCGAAACCTCCGTCATGTGATCGATGGCGAAAAACCCCAAGTTGTTTTCTTTATGGAAACTAAATGCGATGTTCATAAAAGTGAGAGGATCAAGGTCAGTTT
GAGGTTTGATCATAGTTGGTGTGTGCCTAATAATGTTAGGAGTGGTGGGCTCCTATTGATGTGGAATTCTGACATAGAGATGACGATTAAATCCTGGTCCGAAGGTCATA
TAGACACGTACTGTAGAAGCAAAGATTGGCATGGTAGGTTTATAGGTTTCTATGGCAATCTGGAAGTGGATAAAAGAAAGGATTCCTGGGAGATGTTGGAAAGGTTAACG
GAGAATGATGATGAGCAGTGGATCATTGGTGGCGATTTTAACAAGATTTGGGATAAGGTTGGGTACCTTATCCTGGTGACACTATGGATACGACCCGTTTTGTATTATGA
TACAAACGCAATGATCCAACGCCTACATGTAGTGGACATGCGAGTGGGGGTGTCCTGTGCAATGAGTTTGCATAAGACCGGACCGCGAAATACAAAGGAAAAAGTTCTTG
TTGCTGTTTTTCCAAAGGATCAAGATCAAAGATTTGGTGAAACGTTCAAGACTTCAAAGCCAAAAAAAGAGGATGAGATCTTGAACCTTAGCGAGAGAGATGATAGTCAG
AATGCTGAAATTCTGGAGAAAGCAAAAGCTGAATTAGATGATTTGCTTGAAGAAGAAGAGTACTTTTGGAGAAGTAGATCAAGAGAGGTTTGGCTTGAAAATGGTGATAG
AAACACTAAATGGTTTCTTGCGAAGGCGTCTCAAAGGAGAAGAAGGAATAAGATTGAAGGAGTTTATGATCACAATGGCTTTTGGGTCACTGAAGAAGACCAAATAGGTA
AAGTGGCAACAAATTACTTCAAAGAGTTGTTCATTTCTTCCATCCTAGATCAACAGGATATTTCCAAGGAAGTAGAAGAAGCCTTGAAGAGTTTGAGTCCGAGTAAGGCC
CCAGGAATTGATGGAGCTCATGCCTCTTTTTATCAATCCTACTGGGGGGTTGTGGGAGAATTTGAGGAACTTACTTGTTGTAGAACGATTCCACAGCACTATGCTCCGAA
AAACACGAACAATTCGTCCAACGAAAACCACCATCAAGGAGCACCCACTATCCTCTGGATTACTGAAGTAGAGATTGTGGGATCTCGTTTGCCTTGGAAAAGGGAAGGAA
ATTTCAGCATCAACCTTGCTTTTGATCCTAAACCGTGGAAAGGATCGGGTTTGTTCTTGCTCTCGGAAAAAGAACATTGTACCGGTTGGACTCTAAGTCGTGGAAAGAGT
CGTGTTCTTCAATCGAATACCCAAGAGGAGCTTGGGGAGTGGATGGTACTTGAGAGCGATGGCGATTGGCGAGAGTCGTGGCTTGACTGGCTATGGGACTGCAGCTGCAA
ATTTTGTCCGGGTTTTGTGAAAATCAACGTGGATGCTTCTTGGAATGAAGAGCAGAATCGAGGTGGAATTGGGTGGATTGCTCGTGATTCTCAAAGATCTCCGGTAGGAA
TGGGCTGTTCGAAAATTAATAGAAGATGGTCGATTTGTGTGTTAGAATCTTCTGCTATCAAAGAAGGCTTAGCAGCGTACCTGAATGCTAGGGGTGAAGCTCAGACTAGA
ATCTTGATTGAATCAGATTCGATATAA
mRNA sequence
ATGCACGCCCTAATTTTGCCGGAAGTGCCGCAGCGTCGAGACGCTGTGGAGTTAGAGGAGGGTTTTGGCACTTTTTCTCTCTGGAAGGAAAAGGCCATGGACGAGGAAGG
GCAACACTCGGTTTGCTCGAAGGGATCCAACGGTGATCAGAGTATGGAGATTGAGAATGGAAACGCCAAGAGATTGGAAGAGGAGACACAAACAAAGGAGTTACAGGAGC
AGATAGAGAAACTTAGTCTGCTCTGTAAATTTGGGACTCAAAGAGACAAAAACCGAGTCACAAAAGGGGGCCCCTGGAGTTTTGGTAACAACCTTCTTGTCTTCGATGAG
CCAAAAGGAAACATCAGCATAAGCTCTCTGAGCTTCAGGAAATATGCAGAGGCACTTGGGAACGCCATTGGAGCATTTGAGATGGTGGAAGTGAATGAGCAAGGGAATAT
CACTGGTGAAACATTAAGAGTCAAAATAAAAGTTAATATCAATGATCCCATTAAAAGAGGAACGAATGTTGAGATAGGCTCGAATAGCAACTTGAATTGGATTCCAATCA
CCTATGAGAAACTCCCGGATTTTTGCTATCACAGTGGGAGAATTGGTCATACTCTGGACGATTGTGAAGAATATACCGAGGAAGGAGAATCCACTAAAAATTATGGGGTT
GAACTCAGAGAAACTCAGGGTAGTAAATGTATATACAAGAGTTGGAAACCAATGTTTAGGGATGGCCCCTTCAGAGGCAGAGGCAGGAGAAGAGGAGCTGATTCGCAGGG
GAGGTACGATAACTGGAGGAAAACCAATCTCAATGGAAATGGTGTTAACTCTGGGAATAGAGCCGTTGGGTACATTCAACTTGAAGAAACCAAGGAAAAAGTGGAAGAAA
GTAAAGACAAAAAAGAAGGAAGCAGCGCTGAGATGGGACCCACACCCCATGATGGTAAACAGCTGGCAATAGACAAGGATGAGGAAACTACATTAGTTAATGGGAAAGGG
AGTAAAAAGCAAGGTATTAGTCAAAGTCAAAGCCAAAATAACAGTTGCTGTAGAAAAAATACAGAACCGGAAAAGGGAAAAACAAAATCAGGAGCAAGCAATGACCCTCG
AAAAAAGGACAACCACGGGACTGGAAACAAGAGAAAAACTACTTCTCATGAGAGACAAACAAAGAAGTGGAAACGGTTAGCTAGAATGGATTATGGGCAACAAGAAATGA
AGACCAACTTACTTCGTAGAACTCGAAGGGATATCGGCGAAGGCTGTGGAACAGCCCCGTCGGACACCATGAAAACCTTATGCTGGAACGTTCGAGGGTTGGGGAACCCT
CGAATGATCCGAAACCTCCGTCATGTGATCGATGGCGAAAAACCCCAAGTTGTTTTCTTTATGGAAACTAAATGCGATGTTCATAAAAGTGAGAGGATCAAGGTCAGTTT
GAGGTTTGATCATAGTTGGTGTGTGCCTAATAATGTTAGGAGTGGTGGGCTCCTATTGATGTGGAATTCTGACATAGAGATGACGATTAAATCCTGGTCCGAAGGTCATA
TAGACACGTACTGTAGAAGCAAAGATTGGCATGGTAGGTTTATAGGTTTCTATGGCAATCTGGAAGTGGATAAAAGAAAGGATTCCTGGGAGATGTTGGAAAGGTTAACG
GAGAATGATGATGAGCAGTGGATCATTGGTGGCGATTTTAACAAGATTTGGGATAAGGTTGGGTACCTTATCCTGGTGACACTATGGATACGACCCGTTTTGTATTATGA
TACAAACGCAATGATCCAACGCCTACATGTAGTGGACATGCGAGTGGGGGTGTCCTGTGCAATGAGTTTGCATAAGACCGGACCGCGAAATACAAAGGAAAAAGTTCTTG
TTGCTGTTTTTCCAAAGGATCAAGATCAAAGATTTGGTGAAACGTTCAAGACTTCAAAGCCAAAAAAAGAGGATGAGATCTTGAACCTTAGCGAGAGAGATGATAGTCAG
AATGCTGAAATTCTGGAGAAAGCAAAAGCTGAATTAGATGATTTGCTTGAAGAAGAAGAGTACTTTTGGAGAAGTAGATCAAGAGAGGTTTGGCTTGAAAATGGTGATAG
AAACACTAAATGGTTTCTTGCGAAGGCGTCTCAAAGGAGAAGAAGGAATAAGATTGAAGGAGTTTATGATCACAATGGCTTTTGGGTCACTGAAGAAGACCAAATAGGTA
AAGTGGCAACAAATTACTTCAAAGAGTTGTTCATTTCTTCCATCCTAGATCAACAGGATATTTCCAAGGAAGTAGAAGAAGCCTTGAAGAGTTTGAGTCCGAGTAAGGCC
CCAGGAATTGATGGAGCTCATGCCTCTTTTTATCAATCCTACTGGGGGGTTGTGGGAGAATTTGAGGAACTTACTTGTTGTAGAACGATTCCACAGCACTATGCTCCGAA
AAACACGAACAATTCGTCCAACGAAAACCACCATCAAGGAGCACCCACTATCCTCTGGATTACTGAAGTAGAGATTGTGGGATCTCGTTTGCCTTGGAAAAGGGAAGGAA
ATTTCAGCATCAACCTTGCTTTTGATCCTAAACCGTGGAAAGGATCGGGTTTGTTCTTGCTCTCGGAAAAAGAACATTGTACCGGTTGGACTCTAAGTCGTGGAAAGAGT
CGTGTTCTTCAATCGAATACCCAAGAGGAGCTTGGGGAGTGGATGGTACTTGAGAGCGATGGCGATTGGCGAGAGTCGTGGCTTGACTGGCTATGGGACTGCAGCTGCAA
ATTTTGTCCGGGTTTTGTGAAAATCAACGTGGATGCTTCTTGGAATGAAGAGCAGAATCGAGGTGGAATTGGGTGGATTGCTCGTGATTCTCAAAGATCTCCGGTAGGAA
TGGGCTGTTCGAAAATTAATAGAAGATGGTCGATTTGTGTGTTAGAATCTTCTGCTATCAAAGAAGGCTTAGCAGCGTACCTGAATGCTAGGGGTGAAGCTCAGACTAGA
ATCTTGATTGAATCAGATTCGATATAA
Protein sequence
MHALILPEVPQRRDAVELEEGFGTFSLWKEKAMDEEGQHSVCSKGSNGDQSMEIENGNAKRLEEETQTKELQEQIEKLSLLCKFGTQRDKNRVTKGGPWSFGNNLLVFDE
PKGNISISSLSFRKYAEALGNAIGAFEMVEVNEQGNITGETLRVKIKVNINDPIKRGTNVEIGSNSNLNWIPITYEKLPDFCYHSGRIGHTLDDCEEYTEEGESTKNYGV
ELRETQGSKCIYKSWKPMFRDGPFRGRGRRRGADSQGRYDNWRKTNLNGNGVNSGNRAVGYIQLEETKEKVEESKDKKEGSSAEMGPTPHDGKQLAIDKDEETTLVNGKG
SKKQGISQSQSQNNSCCRKNTEPEKGKTKSGASNDPRKKDNHGTGNKRKTTSHERQTKKWKRLARMDYGQQEMKTNLLRRTRRDIGEGCGTAPSDTMKTLCWNVRGLGNP
RMIRNLRHVIDGEKPQVVFFMETKCDVHKSERIKVSLRFDHSWCVPNNVRSGGLLLMWNSDIEMTIKSWSEGHIDTYCRSKDWHGRFIGFYGNLEVDKRKDSWEMLERLT
ENDDEQWIIGGDFNKIWDKVGYLILVTLWIRPVLYYDTNAMIQRLHVVDMRVGVSCAMSLHKTGPRNTKEKVLVAVFPKDQDQRFGETFKTSKPKKEDEILNLSERDDSQ
NAEILEKAKAELDDLLEEEEYFWRSRSREVWLENGDRNTKWFLAKASQRRRRNKIEGVYDHNGFWVTEEDQIGKVATNYFKELFISSILDQQDISKEVEEALKSLSPSKA
PGIDGAHASFYQSYWGVVGEFEELTCCRTIPQHYAPKNTNNSSNENHHQGAPTILWITEVEIVGSRLPWKREGNFSINLAFDPKPWKGSGLFLLSEKEHCTGWTLSRGKS
RVLQSNTQEELGEWMVLESDGDWRESWLDWLWDCSCKFCPGFVKINVDASWNEEQNRGGIGWIARDSQRSPVGMGCSKINRRWSICVLESSAIKEGLAAYLNARGEAQTR
ILIESDSI