; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031964 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031964
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:21062648..21065437
RNA-Seq ExpressionLag0031964
SyntenyLag0031964
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AFP55557.1 non-ltr retroelement reverse transcriptase [Rosa rugosa]3.7e-14835.88Show/hide
Query:  GAKGGLCILWADKDMVSIQSFSDNHIDCEVLW-DGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPMRERRYIDDFR
        GA+GG+C+ W +K +V   S S   I+  V W D  K RFTG YG P   Q+ L+WDL+RSL    ++PWL  GD NEIL  +EK+G   R +R ID FR
Subjt:  GAKGGLCILWADKDMVSIQSFSDNHIDCEVLW-DGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPMRERRYIDDFR

Query:  QCLDDCLLRDINPEGELFTWIGNRRGTI-IKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQR-RQKYQFRFEELWTNYEECKEL
          ++DC L +    G  +TW   R+G   +KERLDR   N      +  +   +L  + SDH P+  E     SR    R+K +F FE++W  +E C+ +
Subjt:  QCLDDCLLRDINPEGELFTWIGNRRGTI-IKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQR-RQKYQFRFEELWTNYEECKEL

Query:  IEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFE-SIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNS
        +E+   W   ++  +S+   L   +  L +W +        ++   +  L        + N     + +E  LD +LE +E+ W+QR+R  W K GDRN+
Subjt:  IEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFE-SIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNS

Query:  KWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDG
        ++FH+ A  R + N I GIL  +  W  D   I   F+SYF+NLF +     S   TI   +T +V       L   + + EIE A+K M P+K+PG DG
Subjt:  KWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDG

Query:  YPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNI
         PA F+Q++WN +GN  V+ CL  LN  G+I+ +N + I LIPK  NP+ V EYRPISLCNV YK+V+KV+ANR+K +L E+I+++QSAF+  R+I DNI
Subjt:  YPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNI

Query:  IIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF------------------------------------------------LAEG
        I   E I+ +K    +  +  ALKLD++KAYDRVEW FL+ +M  +GF                                                +AEG
Subjt:  IIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF------------------------------------------------LAEG

Query:  LSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKY
        LS  IRKA    +I                              + M +KNI   YE ASG+ IN  KSAI FS K  +  K+  S++L +  V    +Y
Subjt:  LSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKY

Query:  LGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF-------------------------
        LG+P+V  ++K K    + DRVW  V GW+    S AGKEVLIK++ QAIP+Y MSVF+LP    + I++  ARF                         
Subjt:  LGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF-------------------------

Query:  ---------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPICI
                                         LK+ Y+   + MEA+LG  PSYLW+S LWGRELL KG+R RIG+GK   +F DPW+P   +FRPI  
Subjt:  ---------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPICI

Query:  NSDMYNTKVSDYISDSG
               +VSD + ++G
Subjt:  NSDMYNTKVSDYISDSG

KAA3477308.1 reverse transcriptase [Gossypium australe]7.8e-14636.19Show/hide
Query:  SCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDG---SKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEK
        SC F G   + + G++GGLC+ W D  +V+++SFS  HID  +LW+G    +WRFTG YG P    +   W L++ L+   N PWL+ GD NEIL   EK
Subjt:  SCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDG---SKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEK

Query:  SGGPMRERRYIDDFRQCLDDCLLRDINPEGELFTW-IGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFR
         GG  R+++ ++ FR  L+DC L DI   G  + W  GN   T I+ERLDR + N+++  +F      +L +  SDH PI L           R    FR
Subjt:  SGGPMRERRYIDDFRQCLDDCLLRDINPEGELFTW-IGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFR

Query:  FEELWTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSIN-FESIHAIEFELDRLLEEDEIYWRQ
        FE  WT  +  + +I K   W     P   L   L    +   K    I   +N         L+   G  +  +    I   +  L+  +++DE YW Q
Subjt:  FEELWTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSIN-FESIHAIEFELDRLLEEDEIYWRQ

Query:  RSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKA
        R+R  WLK GD+N+ +FHK AS R++IN I  ++  +G    D   I      YFK+LF          R +L G+   V+ ++N KL+ PF + EI+  
Subjt:  RSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKA

Query:  IKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDS
        +K M PTKAPG +G+PALF+Q+YW+ VG   VE  LNILN    + + N T +VLIPK +NP ++  +RPISLC+V YKIV K IANRM+ ++   I + 
Subjt:  IKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDS

Query:  QSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAE----------GLSYQIR-KANNSGRI---------
        QSAF+ GRLI+DN+++   +++  +  R       A KLD+SKAYDRVEWDF+K +M+++GF  E           +SY +   AN +G +         
Subjt:  QSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAE----------GLSYQIR-KANNSGRI---------

Query:  -----------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVW
                           E    N+KNIL++YE  SG+ +N  KS IF+SS   +D K+ +S+LLG+R  S+L KYLG+P++  R K +    ILD++ 
Subjt:  -----------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVW

Query:  KAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF---------------------------LKSLYYKDSNIMEADLGRQPS
          + GW     S  G+EV IKS+ QAIP+Y MS F LPK + E I   FARF                            K+ Y+ D +  E++LG   S
Subjt:  KAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF---------------------------LKSLYYKDSNIMEADLGRQPS

Query:  YLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPICINSDMYNTKVSDYISDS
        Y W+S+   +  L KGL  ++G G++  IF+D W+P  +  R +    ++   KV+D I  +
Subjt:  YLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPICINSDMYNTKVSDYISDS

PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis]2.0e-14633.62Show/hide
Query:  MSCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDGSK--WRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEK
        ++C F      + V   GGLC+LW D   VS+QS+S+NHID  +  +G    WRFTGVYGFP  G++  TW+L++ L+  GN PW++GGD NEI    +K
Subjt:  MSCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDGSK--WRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEK

Query:  SGGPMRERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFRF
         GG +R  R ++D ++ L  C L DI   G  FTW G R G  ++ RLDRF C+  +  +F + R  +LD   SDH PI LE+  Q+ R+++R+K +F+F
Subjt:  SGGPMRERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFRF

Query:  EELWTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIH-AIEFELDRLLEEDEIYWRQR
        EE W   E CKE+++ +      + P   L + + +   AL  W +         I + +N L   Y +  S   E    A++ +L+ LL +++++WRQR
Subjt:  EELWTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIH-AIEFELDRLLEEDEIYWRQR

Query:  SREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAI
        ++  WLK GD N+K+FH++   RKK N + G+ +N+G W+ + + +E+  + YF +LF SS P+    + IL G+   V+   N  L    +K E+  AI
Subjt:  SREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAI

Query:  KQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQ
        K M P+K+PGPDG+   F+Q +W  VG+  V        +K ++   N T + LIPK   P+ + + RPISLCNV YKI +KV+ANR+K +L+ +IS  Q
Subjt:  KQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQ

Query:  SAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGL----------SYQI--------------------
        SAF+ GRLISDN ++  E  + +K  R       ALKLD+SKAYDRVEW FL+ +M ++GF  E +          SY                      
Subjt:  SAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGL----------SYQI--------------------

Query:  ---------------RKANNSGRITEEY-----------------------------ELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSS
                       R   N+ RI   Y                             +   +K++L  YE ASG+ +N  KS I FS  +    ++ +++
Subjt:  ---------------RKANNSGRITEEY-----------------------------ELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSS

Query:  LLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF------------
        +LG+  V    KYLG+P   S +K +   F+ +++    QGW+    S AGKEVLIK++ QAIPSYVMS F++P++L + + R  A+F            
Subjt:  LLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF------------

Query:  -----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKD
                                                       LK+ Y+   + +EA+L    SY W+S+L GR++L KG+R ++G+G+S  ++ D
Subjt:  -----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKD

Query:  PWLPKETTFRPIC-INSDMYNTKVSDYI
        PW+P   +FRP   +   +   +V+D I
Subjt:  PWLPKETTFRPIC-INSDMYNTKVSDYI

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]6.6e-14531.9Show/hide
Query:  SCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVL-WDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSG
        S +FE  F V  +G  GGL + W+    V+I+SFS +HID  V    G  WR TG+YG     QK  TW L++ L    +  W   GD NEIL + EK G
Subjt:  SCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVL-WDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSG

Query:  ---------GPMRE---------------------RRYID------------------------------------------------------------
                    RE                     RRY D                                                            
Subjt:  ---------GPMRE---------------------RRYID------------------------------------------------------------

Query:  ---------------------------DFRQCLDDCLLRDINPEGELFTWIGNRRG-TIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPI--ELE
                                   +F++ +  C L D+  +G  FTW   R G   I+ERLDR LC+  + S F ++ AI+L    SDH PI  E++
Subjt:  ---------------------------DFRQCLDDCLLRDINPEGELFTWIGNRRG-TIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPI--ELE

Query:  LVTQRSRKQRRQKYQFRFEELWTNYEECKELIEK------NGAWSGDIYPFHSLSS-NLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINF
        +  ++   ++    +  +E++W++YE C  ++          +W   +  F  ++  +LA+  +   +  +G    +N  I   K   +E     Q+I+ 
Subjt:  LVTQRSRKQRRQKYQFRFEELWTNYEECKELIEK------NGAWSGDIYPFHSLSS-NLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINF

Query:  ESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLT
        E I  +E ++  +L ++E+YW+QRSR +WLK GD+N+K+FH KAS R++ N+I G+ D+ G+W +DP+ IE  F  +F+ LF SSNP  + I   L GL 
Subjt:  ESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLT

Query:  PKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVN
        PKV+ +MN  L  PF   +I +A+ +M PTKAPGPDG PA F+Q++W  VG    + CL+ILN +G +   N T I LIPK   PR V E+RPISLCNV 
Subjt:  PKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVN

Query:  YKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGLSY--------
        Y+IV K IANR+K ILN IIS +QSAFI  RLI+DN+IIG+E ++ I++++     + ALKLD+SKAYDRVEW+FL++ M  LGF A+ +S         
Subjt:  YKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGLSY--------

Query:  ---QIRKANNSGRITEE---------------------YELMN-----------------------------------------IKNILKDYELASGESI
            +   N  G I  E                       L+N                                         +K I   Y  ASG+  
Subjt:  ---QIRKANNSGRITEE---------------------YELMN-----------------------------------------IKNILKDYELASGESI

Query:  NLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHL
        N  KS++FFS K  S++   + S+  ++ V    KYLG+P +  RNK      +  +V   +  W +  FS  GKE+LIK++ QA+P+Y MSVFKLPK L
Subjt:  NLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHL

Query:  HEGISRNFARF-----------------------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWG
         E I +  ARF                                                           +K+ YYK+S    A +G  PS++W+S+LWG
Subjt:  HEGISRNFARF-----------------------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWG

Query:  RELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPICINSDMYNTKVSDYI
         +++ KG+R RIG+GK   ++KD W+P+  TF+PI   +  + T V+D I
Subjt:  RELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPICINSDMYNTKVSDYI

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]2.3e-14534.26Show/hide
Query:  CHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDC--EVLWDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSG
        C  +   TV S G+KGGL +LW +   V I +++ +HID   E  WDG  W FTG YG P   Q+  +W  ++SL G  + PWL  GD NEI    EK G
Subjt:  CHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDC--EVLWDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSG

Query:  GPMRERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRR-GTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFRFE
        G +R RR +++F   ++ C  R+++  G  +TW  +R  G  I+ERLDR L N ++  +F + +  +L    SDH P+ L LV +R +K+ R+   FRFE
Subjt:  GPMRERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRR-GTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFRFE

Query:  ELWTNYEECKELIEKNGAWS-GDIYPFHS-----LSSNLANCSVALSKWGKGIHVHRNNRIFECKNILK--EAYGNFQSINFESIHAIEFELDRLLEEDE
         +W     C+E+++   AW  G+    HS     L S L +C   L KW K    H   +I E +  L+  E   +   I  E +      L++ LE+++
Subjt:  ELWTNYEECKELIEKNGAWS-GDIYPFHS-----LSSNLANCSVALSKWGKGIHVHRNNRIFECKNILK--EAYGNFQSINFESIHAIEFELDRLLEEDE

Query:  IYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKT
          WRQRSR  W + GDRN+ +FH KAS R + N I GI+D  G W ED   IEE  ++YF+ LF SS P+      IL  + PKVT DMN +L   +   
Subjt:  IYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKT

Query:  EIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNE
        E+  A+KQM+P KAPGPDG P LF+Q +WNT G       L+ LN+  +   +N+T+IVLIPK + P+ V +YRPISLCNV YKI +K IANR+K+ L  
Subjt:  EIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNE

Query:  IISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF---------------------------------
        IISD+QSAF+ GRLI+DN+++  E+++ I   +       A+KLD+SKAYDRVEW F+++IM +LGF                                 
Subjt:  IISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF---------------------------------

Query:  ---------------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDR
                        AEGLS  I+ +  +G +                              E   ++ +L  YE ASG+ +N AK+++FFSS    + 
Subjt:  ---------------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDR

Query:  KDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF------
        ++ +    G + +    KYLG+PS+  +NK    + I +++ K + GWK    S AGKE+LIK++  A+P+Y MS FKLP +L + ++    +F      
Subjt:  KDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF------

Query:  -----------------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKS
                                                             LK+ Y+     + A LG  PSY W+S++  + L+ +GL+ R+GNG S
Subjt:  -----------------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKS

Query:  TFIFKDPWLPKETTFRPICINSDMY-NTKVSDYI
          +++D WLP   + + I     ++ +T+V+D +
Subjt:  TFIFKDPWLPKETTFRPICINSDMY-NTKVSDYI

TrEMBL top hitse value%identityAlignment
A0A803P3X8 Uncharacterized protein2.1e-15234.09Show/hide
Query:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWD-GSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM
        ++GCF V + G  GGL +LW D   VSI+S++ +HID  V    G  WRFTG YG P  G +  +W L+  L    N  W+ GGD NEI+   EK GG  
Subjt:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWD-GSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM

Query:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIEL--ELVTQRSRKQRRQKYQFRFEEL
        ++   +  FR+ +  C  ++I  EG  FTW   R+  ++ E+LDR L N ++ + F    A  L W  SDH+P+ L   +   ++ K+ R + +F +E+ 
Subjt:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIEL--ELVTQRSRKQRRQKYQFRFEEL

Query:  WTNYEECKELIEK---NGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRS
        W   EEC ++IE    +G+  GD    H L S + +C   L  W K        R  + K  LK    +    +++    +E +L+ + E++EI WRQRS
Subjt:  WTNYEECKELIEK---NGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRS

Query:  REEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIK
        R  WL  GDRN+K+FH KAS RKK N IKG+ D    W    + IE+  I Y+ +LF S+ P  +    ++  +  ++    N+ L+  F + E+++AI 
Subjt:  REEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIK

Query:  QMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQS
        Q+ P KAPG DG P LFYQ +WN VG + ++ CL +LNN  + S  NDT + LIPK  NP  V +YRP+SLCNV+YK ++K +ANRMK  ++++IS++QS
Subjt:  QMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQS

Query:  AFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF----------------------------------------
        AFI+GR I DN I+G ES++ +K  RF  G+  ALKLD+SKAYDRVEWDF++E+M +LG+                                        
Subjt:  AFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF----------------------------------------

Query:  --------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSL
                 +EGLS  + +A+   +I                              E   +  IL+ Y   SG+ IN  KS +    KIH      L++ 
Subjt:  --------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSL

Query:  LGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF-------------
        LG+  + +  KYLG+P+   RNK +    + ++VW+ +QGWK   FS AG+EVLIKS+ Q IP Y+MS F++ K L   I    ARF             
Subjt:  LGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF-------------

Query:  ----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDP
                                                      LK+LYY +++ + A  G   S +W+ +LWGR+LL KG+R R+ +G    I +D 
Subjt:  ----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDP

Query:  WLPKETTFRPICINSDMYNTKVSDYISDSG
        WLP+   F    +     NT V+  ++ +G
Subjt:  WLPKETTFRPICINSDMYNTKVSDYISDSG

A0A803P4U9 Uncharacterized protein1.4e-15636.42Show/hide
Query:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWD-GSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM
        FEGCF+V + G  GGL +LWA+   V ++SF+ +HID  V  D G  WRFTG YG P  G +  +W L++ L       W+ GGD NEI  N EK GG  
Subjt:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWD-GSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM

Query:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRR--QKYQFRFEEL
        +    + +FR+ + +C LR+++ EG +FTW   R   +I E+LDR LCN  +   F       LDW  SDH+P+ L        K+ R  +  +F FE+ 
Subjt:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRR--QKYQFRFEEL

Query:  WTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREE
        W   EEC+E+I+K  +         +L   L  C   L KW K      N RI E K+ +     ++   ++ ++  +E +L+ + E+ E+YW+QRSR  
Subjt:  WTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREE

Query:  WLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTP-KVTLDMNNKLMHPFNKTEIEKAIKQM
        WLK GDRN+K+FH KAS RK+ N I+G+ D+   W      I E  I+YF+ LF  SN     IR IL G  P +++ + N  L+ PF++ E+  A+ Q+
Subjt:  WLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTP-KVTLDMNNKLMHPFNKTEIEKAIKQM

Query:  FPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAF
         P KAPG DG P LF+Q+ W  VG +    CL++LNN+ + S  N+T I LIPK  +P  + E+RPISLCNV YK+V+K +ANRMK  LN  IS +QSAF
Subjt:  FPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAF

Query:  IQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF------------------------------------------
        I GR+I DN I+G ES++ ++  RF  GR  ALKLD+SKAYDRVEWDFL+ +M  LG+                                          
Subjt:  IQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF------------------------------------------

Query:  ------LAEGLSYQIRKANNSGRI------TEEYELMN--------------------IKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLG
               +EGL+  + ++  +G+I        E+ L +                    +K +L  Y   SG++INL KS +    KI+ +  + L++ LG
Subjt:  ------LAEGLSYQIRKANNSGRI------TEEYELMN--------------------IKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLG

Query:  IRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF---------------
        +R V    KYLG+P+   +NK +    I DRV   +QGWK   FS AGKE+LIK++ QA+P YVMS F++ K +   I    ARF               
Subjt:  IRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF---------------

Query:  --------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWL
                                                    LK+LY+ ++N +EA LG   S +W+ +LWGRELL KG R  IGNG++  I +DPW+
Subjt:  --------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWL

Query:  PKETTF
        P+   F
Subjt:  PKETTF

A0A803PV25 Uncharacterized protein6.0e-16035.83Show/hide
Query:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDG-SKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM
        +E  +TV  +G  GGL ++W     V + S S  HI   V   G   W FTG YG P  GQ+  +W L+R L  +   PWL  GD NEI+S  EK GG  
Subjt:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDG-SKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM

Query:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRK---QRRQKYQFRFEE
        R    +D F++ LDDC   D +      TW      + I ERLDR LC +++   F       LDW  SDH+ + +++  +       + ++K +F FEE
Subjt:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRK---QRRQKYQFRFEE

Query:  LWTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSRE
         W   EEC E++++  +         S    +  C  AL  W +      N+ + + K  L E     Q   +E+I  +E +L+ LLE+DE YWRQRSR 
Subjt:  LWTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSRE

Query:  EWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQM
         WL+WGDRN+K+FH KAS R+K NEIKG+ D+ G W +D   +      Y++ LF SS+ + S +  +L+ + PKV+  MNN L+  F + E+ +A+K+M
Subjt:  EWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQM

Query:  FPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAF
         PTKAPG DG PALFYQ++W+ +    V   LN+LNN  ++   NDT + LIPK   P+ +EE+RPISLCNV YKIV+K +ANRM+  L  ++SDSQSAF
Subjt:  FPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAF

Query:  IQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF------------------------------------------
        ++GRLI DN I+G+ES++ ++ +RF  G   ALKLD++KAYDRVEW FL+ +M++LG+                                          
Subjt:  IQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF------------------------------------------

Query:  ------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLG
               AE  S  I+ A   GR+                            E E    + +L+ Y +ASG+ +N  KS + F   + +  + +L++ +G
Subjt:  ------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLG

Query:  IRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF---------------
        ++ V + GKYLG+PS   R K +   FI ++VW  ++GWK SFFS AGKEVLIK+I QAIP+Y MS F+LPK     I    ARF               
Subjt:  IRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF---------------

Query:  --------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWL
                                                    LK+ YY +  ++EA  G   S++W+SL+WG++++  G R RIGNG S  +  DPWL
Subjt:  --------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWL

Query:  PKETTFR
        P+  TF+
Subjt:  PKETTFR

A0A803PWX1 Uncharacterized protein4.7e-16536.7Show/hide
Query:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDG-SKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM
        F+  +TV  VG  GGL ++W  +  + +   S  HI   V  +G S W  TG YG P    +  +W L+R+L  +   PWL  GD NEI+S  EK GG  
Subjt:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDG-SKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM

Query:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELEL---VTQRSRKQRRQKYQFRFEE
        R    +D F++ +DDC   D        TW      + I ERLDR LCN+++   F       LDW  SDH+ + + +   V      + ++K +F FEE
Subjt:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELEL---VTQRSRKQRRQKYQFRFEE

Query:  LWTNYEECKELIEKNGAW---SGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQR
         W   EEC E+I+    W    G   P  S    +  C  AL  W K      NN I + K IL E     Q   +E+I  +E +L+ LLE+DE YWRQR
Subjt:  LWTNYEECKELIEKNGAW---SGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQR

Query:  SREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAI
        SR  WL+WGDRN+K+FH KAS R+K NEIKG+ D  G W +D   + +    Y+K LF  S+ D   ++ +L  + PKV++ MN +LM  F+  E+ +A+
Subjt:  SREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAI

Query:  KQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQ
        K M PTKAPG DG PALFYQ++W+ + ++ +  CLN+LNN  ++S  NDT + LIPK   P+ +EE+RPISLCNV YKIV+K +ANR++  L++++SDSQ
Subjt:  KQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQ

Query:  SAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF---------------------------------------
        SAF++GRLI DN I+G+E ++ ++ NRF  G   ALKLD++KAYDRVEW FL+ +ML+LG+                                       
Subjt:  SAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF---------------------------------------

Query:  ---------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSS
                  AE  S  I+ A +SG++                            E E    K +L+ Y  ASG+ +N  KS + F  K+    + +L++
Subjt:  ---------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSS

Query:  LLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF------------
        ++G++ V + GKYLG+PS   R K +   FI ++VW  ++GWK SFFS AGKEVLIK++ QAIP+Y MS F+LPK     I    ARF            
Subjt:  LLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF------------

Query:  -----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKD
                                                       LK+ YY +  ++EA  G   S++W+SL+WG++++ KG R RIGNG S  +  D
Subjt:  -----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKD

Query:  PWLPKETTFR
        PWLP+  TF+
Subjt:  PWLPKETTFR

A0A803Q9W0 Uncharacterized protein2.3e-15134.58Show/hide
Query:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEV-LWDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM
        FEGCF V + G  GGL +LW +   V + SF+D HID  +   +   WRFTG YG P   Q+  +W L++ +    N PWL GGD NEI    EK GG  
Subjt:  FEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEV-LWDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPM

Query:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQK--YQFRFEEL
        +    + +F   +D C LR+I+ EG  FTW   R   +I ERLDR + N+ +  I++  +  +L    SDH P+ L   T     Q++Q+  Y+F +E+ 
Subjt:  RERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQK--YQFRFEEL

Query:  WTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEA--YGNFQSIN-FESIHAIEFELDRLLEEDEIYWRQRS
        W + EEC+++IE        I     L   L NC   L +W K   + R   + + K++ KE   Y N  S + F  +  IE +L+  L ++E++W+QRS
Subjt:  WTNYEECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEA--YGNFQSIN-FESIHAIEFELDRLLEEDEIYWRQRS

Query:  REEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIK
        R  WL  GDRN+++FH+KA+ R+K N I G+ D N  W    + IE T   +F++LF +++   +   T+   +  +++   N +L+  F   +I+ A+ 
Subjt:  REEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIK

Query:  QMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQS
        Q+   KAPG DG P LFY+++W  +G    + CL+ILNN  +    N T + LIPK   P+ V +YRPISLCNV+YKI+ K +ANRMK+ L E+IS++QS
Subjt:  QMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQS

Query:  AFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF----------------------------------------
        AFI+GRLI DN I+G ES++ +K  RF  GR  ALKLD+SKAYDRVEW FL+ +M+ LG+                                        
Subjt:  AFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGF----------------------------------------

Query:  --------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSL
                 +EGLS  I++A  + R+                              +  ++K+IL +Y L SG+ IN  KS +    +I+      L+++
Subjt:  --------LAEGLSYQIRKANNSGRI--------------------------TEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSL

Query:  LGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF-------------
        LG++ V    KYLG+P+   + K +    I  ++   +QGWK S FS AG+E+L+K+I QAIP+Y+MS F+LPK L + I    ARF             
Subjt:  LGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWKAVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARF-------------

Query:  ----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDP
                                                      LK+ YY +SN +EA +G   SY+W+S+LWGR+++ KG+R R+  G+   I +D 
Subjt:  ----------------------------------------------LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDP

Query:  WLPKETTF
        WLP+ +TF
Subjt:  WLPKETTF

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.7e-2124.88Show/hide
Query:  SDHKPIELELVTQRSRKQRRQKYQFR---FEELWTNYE---ECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNR----------IF
        SDH  I+LEL  +   + R   ++       + W + E   E K   E N           +   NL +   A+ + GK I ++   R            
Subjt:  SDHKPIELELVTQRSRKQRRQKYQFR---FEELWTNYE---ECKELIEKNGAWSGDIYPFHSLSSNLANCSVALSKWGKGIHVHRNNR----------IF

Query:  ECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLF
        + K + K+   + ++   + I  I  EL  +  +  +     SR  + +  ++  +   +    +++ N+I  I ++ G    DP  I+ T   Y+K+L+
Subjt:  ECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLF

Query:  RSSNPDTSHIRTILAGLT-PKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPK
         +   +   + T L   T P++  +    L  P   +EI   I  +   K+PGPDG+ A FYQRY   +    ++   +I       + + + +I+LIPK
Subjt:  RSSNPDTSHIRTILAGLT-PKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDTNIVLIPK

Query:  ASNPRAVEE-YRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIK-INRFNWGRMAALKLDLSKAYDRVEWDFLKEI
               +E +RPISL N++ KI+ K++ANR+++ + ++I   Q  FI G     NI    +SIN I+ INR        + +D  KA+D+++  F+ + 
Subjt:  ASNPRAVEE-YRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIK-INRFNWGRMAALKLDLSKAYDRVEWDFLKEI

Query:  MLQLGFLAEGLSYQIRKA
        + +LG   +G+  +I +A
Subjt:  MLQLGFLAEGLSYQIRKA

P08548 LINE-1 reverse transcriptase homolog2.2e-2625.69Show/hide
Query:  LIGGDLNEILSNDEKSGGPMRERRYID--DFRQCLD-DCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELE
        ++ GD N  L+  ++S      +  +D     Q LD   + R  +P    +T+  +  GT  K  +D  L +    S F  +  I    +FSDH  I++E
Subjt:  LIGGDLNEILSNDEKSGGPMRERRYID--DFRQCLD-DCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELE

Query:  LVTQRSRKQRRQKYQFR---FEELWTNYEECKEL---IEKNGAWSGDIYP-FHSLSSNLANCSVALSKWGKGIHVHR-NNRIFECKNILKEAYGNFQSIN
        L   R+     + ++      ++ W   E  KE+   +E+N     +    + +  + L    +AL  + K       NN +   K + KE + N +   
Subjt:  LVTQRSRKQRRQKYQFR---FEELWTNYEECKEL---IEKNGAWSGDIYP-FHSLSSNLANCSVALSKWGKGIHVHR-NNRIFECKNILKEAYGNFQSIN

Query:  FESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTIL-AG
         + I  I  EL+ +  +  I    +S+  + +  ++  K        ++  + I  I + N     DP  I++    Y+K L+     +   I   L A 
Subjt:  FESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTIL-AG

Query:  LTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDT----NIVLIPK-ASNPRAVEEYRP
          P+++      L  P + +EI   I+ +   K+PGPDG+ + FYQ    T   + V   LN+  N     I  +T    NI LIPK   +P   E YRP
Subjt:  LTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISIWNDT----NIVLIPK-ASNPRAVEEYRP

Query:  ISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIK-INRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGLSY
        ISL N++ KI+ K++ NR+++ + +II   Q  FI G     NI    +SIN I+ IN+        L +D  KA+D ++  F+   + ++G   EG   
Subjt:  ISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIK-INRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGLSY

Query:  QIRKANNSGRITEEYELMNIKNILKDYELASGESINLAKSAIFFS
        ++ +A  S + T    L  +K  LK + L SG       S + F+
Subjt:  QIRKANNSGRITEEYELMNIKNILKDYELASGESINLAKSAIFFS

P11369 LINE-1 retrotransposable element ORF2 protein2.8e-2130.51Show/hide
Query:  IKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGL-TPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVG
        I  I +  G    DP+ I+ T  S++K L+ +   +   +   L     PK+  D  + L  P +  EIE  I  +   K+PGPDG+ A FYQ +   + 
Subjt:  IKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGL-TPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVG

Query:  NKTVEECLNILNNKGNI-SIWNDTNIVLIPK-ASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIK-
           + +  + +  +G + + + +  I LIPK   +P  +E +RPISL N++ KI+ K++ANR++E +  II   Q  FI G     NI    +SIN I  
Subjt:  NKTVEECLNILNNKGNI-SIWNDTNIVLIPK-ASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIK-

Query:  INRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLG
        IN+        + LD  KA+D+++  F+ +++ + G
Subjt:  INRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLG

P14381 Transposon TX1 uncharacterized 149 kDa protein3.7e-2626.45Show/hide
Query:  LIGGDLNEILSNDEKSGGPMRE------RRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPI
        +IGGD N  L   +++    R+      R  I  F   L D + R+ NPE   FT++  R G + + R+DR   +    S   S   I L   FSDH  +
Subjt:  LIGGDLNEILSNDEKSGGPMRE------RRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPI

Query:  ELELVTQRSRKQRRQKYQFRFEELWTNYEECKELIEKNGAWSGDIYPFHSLSS----NLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINF
         L +    S    +  Y      L  +    K + +    W      F +L+        +  +   ++ K +   RN    E + +  E     Q ++ 
Subjt:  ELELVTQRSRKQRRQKYQFRFEELWTNYEECKELIEKNGAWSGDIYPFHSLSS----NLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINF

Query:  ESIHAIEFE-LDRLLEEDEIYWRQ------RSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIR
            A++ E L+R      +  RQ      RSR + L   DR S++F+     +    +I  +   +G   EDP+AI +   S+++NLF           
Subjt:  ESIHAIEFE-LDRLLEEDEIYWRQ------RSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIR

Query:  TILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISI-WNDTNIVLIPKASNPRAVEEYR
         +  GL P V+     +L  P    E+ +A++ M   K+PG DG    F+Q +W+T+G       L     KG + +      + L+PK  + R ++ +R
Subjt:  TILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGNISI-WNDTNIVLIPKASNPRAVEEYR

Query:  PISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFL
        P+SL + +YKIV K I+ R+K +L E+I   QS  + GR I DN+ +  + ++  +    +   +A L LD  KA+DRV+  +L
Subjt:  PISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFL

P93295 Uncharacterized mitochondrial protein AtMg003105.8e-1144.12Show/hide
Query:  ARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPI
        +R L+S Y+  S++ME  +G +PSY W+S++ GRELLS+GL   IG+G  T ++ D W+  ET   P+
Subjt:  ARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.0e-2726.06Show/hide
Query:  WRFTGVYGFPVKGQKMLTWDLIRS--LNGQGNKPWLIGGDLNEILSNDE-----KSGGPMRERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIK
        WR    Y     G+  + WD   S  +  + ++  ++ GD ++I +  +     ++  PMR    +++F+ CL D  L DI   G  +TW  ++    I 
Subjt:  WRFTGVYGFPVKGQKMLTWDLIRS--LNGQGNKPWLIGGDLNEILSNDE-----KSGGPMRERRYIDDFRQCLDDCLLRDINPEGELFTWIGNRRGTIIK

Query:  ERLDRFLCN-DKFDSIFSSVRAINLDWLFSDHKP--IELELVTQRSRKQRRQKYQFRFEELWTNYEECKELIEKNGAWSGDI---YPFHSLSSNL---AN
         +LDR + N D F S  S++    L  + SDH P  I LE + +RS+K       FR+    + +     L+    AW   I       SL  +L     
Subjt:  ERLDRFLCN-DKFDSIFSSVRAINLDWLFSDHKP--IELELVTQRSRKQRRQKYQFRFEELWTNYEECKELIEKNGAWSGDI---YPFHSLSSNL---AN

Query:  CSVALSKWGKGIHVHRNNRIFE-CKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNN
        C   L++ G G   H+     +  ++I  +   N     F   H    + +      E ++RQ+SR +WL+ GD N+++FHK     +  N IK +  ++
Subjt:  CSVALSKWGKGIHVHRNNRIFE-CKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGILDNN

Query:  GHWNEDPDAIEETFISYFKNLFRSSN----PDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVE
            E+   ++E  ++Y+ +L  S +    PD+  ++ I      +    + ++L    +  EI  A+  M   KAPGPD + A F+   W  V + T+ 
Subjt:  GHWNEDPDAIEETFISYFKNLFRSSN----PDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVE

Query:  ECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVT
                   +  +N T I LIPK +    +  +RP+S C V YKI+T
Subjt:  ECLNILNNKGNISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVT

AT3G09510.1 Ribonuclease H-like superfamily protein1.5e-0636.67Show/hide
Query:  LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPW--------LPKETTFRPICINSDMYNTKVSDYISD
        +K+ Y+KD +I++A + +Q SY W SLL G  LL KG R+ IG+G++  I  D          L  E T++ + IN +++  K S Y  D
Subjt:  LKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPW--------LPKETTFRPICINSDMYNTKVSDYISD

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.3e-1033.33Show/hide
Query:  IANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGLSYQIRKANNSGRITEE
        +  R+K ++  +I  +Q++FI GR+ +DNI+   E++++++  +   G M  LKLDL KAYDR+ WD+L++ ++  GF    L    R    + R+  E
Subjt:  IANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLKEIMLQLGFLAEGLSYQIRKANNSGRITEE

AT4G29090.1 Ribonuclease H-like superfamily protein5.6e-0934.74Show/hide
Query:  ARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWL---PKETTFRPICINSDMYNT-----KVSDYISDSG
        A+  KS Y+  S+ + A LG +PS++WKS+   +E+L +G R  +GNG+   I++  WL   P     R   +    Y +     KVSD I +SG
Subjt:  ARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWL---PKETTFRPICINSDMYNT-----KVSDYISDSG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.1e-1244.12Show/hide
Query:  ARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPI
        +R L+S Y+  S++ME  +G +PSY W+S++ GRELLS+GL   IG+G  T ++ D W+  ET   P+
Subjt:  ARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKETTFRPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATGCCATTTCGAAGGATGCTTCACGGTAAGGAGTGTGGGCGCAAAAGGTGGTTTATGTATTCTCTGGGCAGATAAAGACATGGTCTCGATTCAGTCCTTCTCAGA
TAATCACATTGATTGCGAGGTCCTTTGGGATGGGTCAAAATGGAGATTTACAGGAGTCTACGGTTTTCCAGTAAAGGGTCAAAAGATGTTAACTTGGGATTTGATTCGAA
GTCTGAATGGGCAGGGAAATAAGCCATGGCTGATTGGAGGGGATTTAAATGAGATTTTAAGCAATGATGAGAAAAGTGGGGGCCCTATGAGGGAAAGACGATACATTGAT
GATTTTAGGCAATGCTTGGATGACTGCTTACTCAGAGATATCAATCCAGAGGGTGAATTATTCACATGGATAGGAAACAGAAGAGGGACTATCATTAAAGAGAGGCTTGA
CAGATTTCTATGCAATGACAAATTTGACTCTATTTTCAGTTCGGTGAGGGCCATCAACCTGGATTGGCTGTTCTCAGACCACAAACCTATTGAGTTGGAATTGGTTACAC
AAAGAAGTCGTAAGCAAAGAAGACAAAAGTACCAATTCAGATTTGAGGAGCTGTGGACCAACTATGAGGAATGCAAGGAACTAATAGAGAAGAATGGTGCTTGGTCAGGT
GATATCTATCCTTTTCACTCTCTGTCCTCTAATTTAGCTAATTGTTCTGTGGCTTTGTCTAAATGGGGGAAAGGCATTCATGTTCATAGGAATAATAGAATTTTTGAATG
TAAAAACATTTTAAAAGAAGCCTATGGAAATTTTCAATCCATTAATTTTGAGTCTATCCATGCTATTGAATTTGAGTTAGATAGGCTATTAGAAGAGGATGAGATTTATT
GGAGGCAAAGATCTAGGGAAGAATGGCTTAAATGGGGAGATAGGAACTCCAAATGGTTCCATAAGAAAGCTTCTATAAGGAAGAAAATTAATGAAATCAAAGGAATCTTG
GATAATAATGGCCATTGGAATGAAGACCCAGATGCTATTGAAGAAACTTTCATCTCCTATTTTAAAAACCTCTTCAGATCATCCAATCCTGATACTTCCCACATTAGAAC
TATCTTAGCCGGCTTAACTCCTAAAGTAACCCTGGATATGAATAACAAGCTTATGCATCCTTTCAATAAAACTGAAATTGAAAAAGCCATTAAACAAATGTTTCCAACCA
AGGCCCCTGGTCCGGATGGATACCCTGCCCTATTCTATCAAAGATATTGGAACACTGTTGGTAATAAAACAGTCGAAGAATGTCTAAATATCCTTAATAATAAGGGCAAC
ATCTCAATCTGGAACGATACTAACATTGTTTTAATTCCTAAAGCCTCAAATCCTAGAGCTGTTGAAGAATATAGACCAATAAGCTTGTGTAATGTTAATTACAAGATTGT
TACTAAAGTAATTGCAAATAGAATGAAGGAAATTCTTAATGAGATTATCTCAGATTCTCAATCAGCTTTTATCCAAGGGAGGCTTATTTCTGATAATATAATAATAGGCC
ATGAGAGTATTAATGCTATTAAGATCAATAGGTTTAACTGGGGCAGAATGGCAGCTCTCAAACTGGATCTTAGTAAAGCATATGATCGAGTTGAGTGGGATTTCCTTAAA
GAAATTATGCTTCAACTAGGTTTCTTAGCTGAAGGGCTTTCTTACCAAATTAGGAAAGCTAATAATTCTGGGAGGATTACAGAGGAATATGAGCTCATGAATATAAAAAA
CATCCTTAAAGACTATGAGCTAGCATCTGGAGAATCTATTAATTTGGCTAAATCTGCAATCTTTTTTTCTTCTAAAATTCATTCGGACAGGAAAGATTACTTGAGCTCTC
TGTTGGGTATTAGGCATGTGAGTGACCTTGGTAAGTATTTAGGGGTTCCCTCTGTGTTCTCCCGTAATAAGTCAAAGGATCTTAGCTTTATTTTAGATAGGGTGTGGAAA
GCTGTCCAAGGGTGGAAGAACTCTTTTTTCTCTATTGCTGGAAAAGAAGTTTTGATTAAAAGTATTGGTCAAGCTATCCCATCCTATGTAATGAGTGTTTTTAAATTACC
TAAACACTTACATGAGGGTATATCTAGGAACTTTGCAAGGTTCCTGAAAAGCCTCTATTATAAGGATTCTAATATTATGGAAGCTGATCTGGGGAGACAGCCTTCTTATC
TGTGGAAGAGCTTATTATGGGGTAGAGAATTACTTAGCAAGGGCCTTCGGAATAGGATAGGGAATGGAAAGAGCACATTTATTTTTAAGGACCCTTGGCTTCCCAAAGAG
ACTACTTTCAGACCAATTTGTATAAACAGTGATATGTACAACACAAAAGTGTCTGATTACATCTCTGATTCAGGGATTGGGATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCATGCCATTTCGAAGGATGCTTCACGGTAAGGAGTGTGGGCGCAAAAGGTGGTTTATGTATTCTCTGGGCAGATAAAGACATGGTCTCGATTCAGTCCTTCTCAGA
TAATCACATTGATTGCGAGGTCCTTTGGGATGGGTCAAAATGGAGATTTACAGGAGTCTACGGTTTTCCAGTAAAGGGTCAAAAGATGTTAACTTGGGATTTGATTCGAA
GTCTGAATGGGCAGGGAAATAAGCCATGGCTGATTGGAGGGGATTTAAATGAGATTTTAAGCAATGATGAGAAAAGTGGGGGCCCTATGAGGGAAAGACGATACATTGAT
GATTTTAGGCAATGCTTGGATGACTGCTTACTCAGAGATATCAATCCAGAGGGTGAATTATTCACATGGATAGGAAACAGAAGAGGGACTATCATTAAAGAGAGGCTTGA
CAGATTTCTATGCAATGACAAATTTGACTCTATTTTCAGTTCGGTGAGGGCCATCAACCTGGATTGGCTGTTCTCAGACCACAAACCTATTGAGTTGGAATTGGTTACAC
AAAGAAGTCGTAAGCAAAGAAGACAAAAGTACCAATTCAGATTTGAGGAGCTGTGGACCAACTATGAGGAATGCAAGGAACTAATAGAGAAGAATGGTGCTTGGTCAGGT
GATATCTATCCTTTTCACTCTCTGTCCTCTAATTTAGCTAATTGTTCTGTGGCTTTGTCTAAATGGGGGAAAGGCATTCATGTTCATAGGAATAATAGAATTTTTGAATG
TAAAAACATTTTAAAAGAAGCCTATGGAAATTTTCAATCCATTAATTTTGAGTCTATCCATGCTATTGAATTTGAGTTAGATAGGCTATTAGAAGAGGATGAGATTTATT
GGAGGCAAAGATCTAGGGAAGAATGGCTTAAATGGGGAGATAGGAACTCCAAATGGTTCCATAAGAAAGCTTCTATAAGGAAGAAAATTAATGAAATCAAAGGAATCTTG
GATAATAATGGCCATTGGAATGAAGACCCAGATGCTATTGAAGAAACTTTCATCTCCTATTTTAAAAACCTCTTCAGATCATCCAATCCTGATACTTCCCACATTAGAAC
TATCTTAGCCGGCTTAACTCCTAAAGTAACCCTGGATATGAATAACAAGCTTATGCATCCTTTCAATAAAACTGAAATTGAAAAAGCCATTAAACAAATGTTTCCAACCA
AGGCCCCTGGTCCGGATGGATACCCTGCCCTATTCTATCAAAGATATTGGAACACTGTTGGTAATAAAACAGTCGAAGAATGTCTAAATATCCTTAATAATAAGGGCAAC
ATCTCAATCTGGAACGATACTAACATTGTTTTAATTCCTAAAGCCTCAAATCCTAGAGCTGTTGAAGAATATAGACCAATAAGCTTGTGTAATGTTAATTACAAGATTGT
TACTAAAGTAATTGCAAATAGAATGAAGGAAATTCTTAATGAGATTATCTCAGATTCTCAATCAGCTTTTATCCAAGGGAGGCTTATTTCTGATAATATAATAATAGGCC
ATGAGAGTATTAATGCTATTAAGATCAATAGGTTTAACTGGGGCAGAATGGCAGCTCTCAAACTGGATCTTAGTAAAGCATATGATCGAGTTGAGTGGGATTTCCTTAAA
GAAATTATGCTTCAACTAGGTTTCTTAGCTGAAGGGCTTTCTTACCAAATTAGGAAAGCTAATAATTCTGGGAGGATTACAGAGGAATATGAGCTCATGAATATAAAAAA
CATCCTTAAAGACTATGAGCTAGCATCTGGAGAATCTATTAATTTGGCTAAATCTGCAATCTTTTTTTCTTCTAAAATTCATTCGGACAGGAAAGATTACTTGAGCTCTC
TGTTGGGTATTAGGCATGTGAGTGACCTTGGTAAGTATTTAGGGGTTCCCTCTGTGTTCTCCCGTAATAAGTCAAAGGATCTTAGCTTTATTTTAGATAGGGTGTGGAAA
GCTGTCCAAGGGTGGAAGAACTCTTTTTTCTCTATTGCTGGAAAAGAAGTTTTGATTAAAAGTATTGGTCAAGCTATCCCATCCTATGTAATGAGTGTTTTTAAATTACC
TAAACACTTACATGAGGGTATATCTAGGAACTTTGCAAGGTTCCTGAAAAGCCTCTATTATAAGGATTCTAATATTATGGAAGCTGATCTGGGGAGACAGCCTTCTTATC
TGTGGAAGAGCTTATTATGGGGTAGAGAATTACTTAGCAAGGGCCTTCGGAATAGGATAGGGAATGGAAAGAGCACATTTATTTTTAAGGACCCTTGGCTTCCCAAAGAG
ACTACTTTCAGACCAATTTGTATAAACAGTGATATGTACAACACAAAAGTGTCTGATTACATCTCTGATTCAGGGATTGGGATTTAG
Protein sequenceShow/hide protein sequence
MSCHFEGCFTVRSVGAKGGLCILWADKDMVSIQSFSDNHIDCEVLWDGSKWRFTGVYGFPVKGQKMLTWDLIRSLNGQGNKPWLIGGDLNEILSNDEKSGGPMRERRYID
DFRQCLDDCLLRDINPEGELFTWIGNRRGTIIKERLDRFLCNDKFDSIFSSVRAINLDWLFSDHKPIELELVTQRSRKQRRQKYQFRFEELWTNYEECKELIEKNGAWSG
DIYPFHSLSSNLANCSVALSKWGKGIHVHRNNRIFECKNILKEAYGNFQSINFESIHAIEFELDRLLEEDEIYWRQRSREEWLKWGDRNSKWFHKKASIRKKINEIKGIL
DNNGHWNEDPDAIEETFISYFKNLFRSSNPDTSHIRTILAGLTPKVTLDMNNKLMHPFNKTEIEKAIKQMFPTKAPGPDGYPALFYQRYWNTVGNKTVEECLNILNNKGN
ISIWNDTNIVLIPKASNPRAVEEYRPISLCNVNYKIVTKVIANRMKEILNEIISDSQSAFIQGRLISDNIIIGHESINAIKINRFNWGRMAALKLDLSKAYDRVEWDFLK
EIMLQLGFLAEGLSYQIRKANNSGRITEEYELMNIKNILKDYELASGESINLAKSAIFFSSKIHSDRKDYLSSLLGIRHVSDLGKYLGVPSVFSRNKSKDLSFILDRVWK
AVQGWKNSFFSIAGKEVLIKSIGQAIPSYVMSVFKLPKHLHEGISRNFARFLKSLYYKDSNIMEADLGRQPSYLWKSLLWGRELLSKGLRNRIGNGKSTFIFKDPWLPKE
TTFRPICINSDMYNTKVSDYISDSGIGI