; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019689 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019689
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold5:35712647..35718042
RNA-Seq ExpressionSpg019689
SyntenySpg019689
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.2e-13924.93Show/hide
Query:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---
        +TE  ++ S S+ +T  +L W+ + F  L     T  FF+E R ++  + V K  NK       EI ++ N G + SIL+P   +  GW SF +LI+   
Subjt:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---

Query:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL
                                         Y K  +  +     + YK        + S  +   P  L    +   +++ R    DDW  I  S+ 
Subjt:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL

Query:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS
               S  PFQ +KA+L +      L LC    +  WS +G +++KF    S  +   +   SYGGW+  R +    W+ + F+ IG +CGG+L  + 
Subjt:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS

Query:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH
         T +M  L++A++KVR N  GF+P+SI L     GE   V     T +        ER+    GS+  +      +  +L E       Q I  E +  H
Subjt:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH

Query:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----
                 +       Q   N  S S    F       +  KGK  LI+ +         S  + N     +SP       +N +I  KG +  +    
Subjt:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----

Query:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM
               SP+       T      P      HK++L         + +        PL  M     N G    NN                +TP  N   
Subjt:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM

Query:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD
            E  +++   + E               R T     K  K   EL+        D+     L   ++E +   +    N +  S +    +   ++D
Subjt:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD

Query:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT
              +G  GGIL++W + +F + +   G +S+S+++   +G ++WLT++YGP +   R   W EL  L  L    W++ GDFN+ RW  E +   L  
Subjt:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT

Query:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL
        R+M  FN  I+  +L+D P  N  FTWS+   NP  S +DRFL+SK   N FG      L+R  SDH+P+ L    I WGP PF+  NS L+   F++  
Subjt:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL

Query:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG
         NWWN +   G+PG+AF+  L  L   ++ W  +  +L  +   +L+ ++  +D LE +G   +    +RI L+  +  +   +   W QR + +W   G
Subjt:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG

Query:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------
        DEN  +FHRI    +RKN I  +    G SL S +DI + F+  ++ ++TK++    L  N+ W PI                                 
Subjt:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP
           S  +SKG                       I ++D E                                      +++ + FG +  F P  YLG+P
Subjt:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP

Query:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG
        LGGNP++  FW   IE I  KL  WKY+ ISKGGRLTL++A+LSS+PTY LS FK P  V K ++K  RDF W GS+     H +NW+    PK LGGLG
Subjt:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG

Query:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR
        I   +  N ALL KW+WR+ +E  SLW+  I AKY  +      P+  +  S  SPW  I    D   S+I  +  +G S  FW   W N + LS   PR
Subjt:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR

Query:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN
        LY L+   +A V E+W+     WN+ PRR LNE E   W ++   L  I         +W    SK ++V S        S    +T        +W+ +
Subjt:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN

Query:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD
         P+K K F+W +    +NT D +Q+R P  SL+PSWC+ C S+ E+  HLF+ C FA   WN+     G   PM N N+ D
Subjt:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]2.2e-13828.7Show/hide
Query:  VQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGD
        +QETK    D   + S+W    + W++L + GASGGI+I+W        E + G FSV++     +  SFWLT++YGP    +R DFW EL DL GL   
Subjt:  VQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD
        RW +GGDFNV R   EK     +T +MR F++ I    L+D PL+N  FTWS+   +P    +DRFL S +    F  S    L R TSDH P+ L    
Subjt:  RWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLALSQLPSLI-TQLKRLDNLEDKG-LTMSQKESRRILREQ
        + WGP PF+FEN WL    F+E    WW   + +GW GH FM KLK +K++L+ WN      L +   LI T L R+D +E +G L       R + R +
Subjt:  INWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLALSQLPSLI-TQLKRLDNLEDKG-LTMSQKESRRILREQ

Query:  IEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPIS--------
        +ED+  +E +QW+Q+ ++KW+KEGD N+KFFHR+   R+ +  I  ++S  G +L +  DI +E V+F+  L++K     +    +DW PIS        
Subjt:  IEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPIS--------

Query:  -------------------------------------------------------------------EAQSKGIHIDD-------TELEEMVAK------
                                                                             +S+ + I D       T L +++AK      
Subjt:  -------------------------------------------------------------------EAQSKGIHIDD-------TELEEMVAK------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------FGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAG
                 F C+   WP +YLGLPLGGNPK   FW PV+E+I  +L  WK A++S GGR+TLIQ+ LS +P+YFLSLFK+P+ +A  ++K+ R+F W+G
Subjt:  ---------FGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAG

Query:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRII
        +      H V W+    PK LGGLG G   LRNIALL KW+WRF  E   LW   +I   Y +         + R S + PW+ I     + +  +  ++
Subjt:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRII

Query:  GNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIR-LRDMNDFWSWPLEASKLFSVKSL
        GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS   AWNL+ RR+L + EI     L   LS +R    + D  +W L +S LF+VKS 
Subjt:  GNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIR-LRDMNDFWSWPLEASKLFSVKSL

Query:  MVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNN
         + L   S+ IL      +W    P K+K   W ++ G +NT D+LQ R P+ SL P WC++C  N E+  HLF+ C      WN + +  G       +
Subjt:  MVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNN

Query:  IHDFLASIFVGHPFQGTKRILW
          D L   F G       + LW
Subjt:  IHDFLASIFVGHPFQGTKRILW

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.0e-15226.3Show/hide
Query:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK
        PS +  +N P         RS  ++RK F +  D+YS+ +   +TE G + + S+ ++ + L+W+  T  +L   P T +FF E R  E  + + K  N 
Subjt:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK

Query:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH
         G   EI ++     +  IL+P   +K GW SF S+I+  PK      ++P                    RSY + + + +P               SH
Subjt:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH

Query:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV
          + S       DL+   +++V+RF+  DDW  I  ++        + N F   KAL+H      A  LC++  WS +GK+ ++F   +   +       
Subjt:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV

Query:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKG
        SYGGW   R +    W+   F+ IG +C G +  +  T    NL+EAR+KVR N +GF+P+++ +     G +  VQ+        G+    ER+    G
Subjt:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKG

Query:  SYEVEEDKSESKDLNLEEK----ENTQVIARE--SSPIHERESPIMEQPPFNEDSI-----SVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNIS-----
        +++  +  +   D N E +    E ++ I+ +  S+    R+S   +QP   +  I     +   P+  +    +         +  +E+ + IS     
Subjt:  SYEVEEDKSESKDLNLEEK----ENTQVIARE--SSPIHERESPIMEQPPFNEDSI-----SVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNIS-----

Query:  -----------LPVSPTNL-KIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHK------------------------INLDPPPTHDFDLTIFNPD
                    P S  NL K  +K S +  S K  I   D+     SPS +NSP K                         N     T    +   + D
Subjt:  -----------LPVSPTNL-KIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHK------------------------INLDPPPTHDFDLTIFNPD

Query:  QHHLPLAI------MPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQR
             L++      +P    N      +N  NA   DI      PETP +  P+               +H             K K   +   K +L  
Subjt:  QHHLPLAI------MPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQR

Query:  EL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLT
         L  + LK +   D + +   T  +     S +   +  +IKSLW S+ I W + ++ G+SGGILI+W   + ++    +GLFS+S +  + +  S+WLT
Subjt:  EL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLT

Query:  NIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCL
         +YGP +R  RI FW ELH+L  L    WILGGD NV R   E +     + + R+ N  I+   L+D PL N  FTWS+  + P  S IDRFL +    
Subjt:  NIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCL

Query:  NKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRS--HSLALSQLPSL
        N F       L R TSDH+PL     +  ++WGP PF+  +  L    F+  +  WW ++   G+PG +F+ +LK L N ++ W +   HSL  ++  ++
Subjt:  NKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRS--HSLALSQLPSL

Query:  ITQLKRLDNLE-DKGLTMSQKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYR
        I ++  +D  E D  LT  +   R  L+  + +L+ +E   W QR K  W++EGDEN+ FFHRI ++R++++ I E+    G    + N I   F+ F+ 
Subjt:  ITQLKRLDNLE-DKGLTMSQKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYR

Query:  LLFTKDNRT-RFLPTNVDWCPIS------------EAQSKGI----------------------------------------------------------
         ++    ++      N+DW PI+            E + KG+                                                          
Subjt:  LLFTKDNRT-RFLPTNVDWCPIS------------EAQSKGI----------------------------------------------------------

Query:  -----------------------HID--DTELEE-------------------------------MVAKFGCKRG-------------------------
                               ++D  D  LE+                               + A  G ++G                         
Subjt:  -----------------------HID--DTELEE-------------------------------MVAKFGCKRG-------------------------

Query:  --------------------------------------------------------------------------FW-------PSTYLGLPLGGNPKNFV
                                                                                  FW       P +YLG+PLGGNPK+ +
Subjt:  --------------------------------------------------------------------------FW-------PSTYLGLPLGGNPKNFV

Query:  FWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNI
        FW+ V +KIQ KL +WKYA ISKGGRLTLI++TLSS+P Y LS+F+ PS   K+++KL R F W G+ G  G H +NW K    K  GGLGI    + N 
Subjt:  FWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNI

Query:  ALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPN
        ALL+KW+WR+L E  +LWR LI  K Y  +     P  I   + K+PWR I  + D   S     + NG    FW  +W     LS  +PRL+ LT +  
Subjt:  ALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPN

Query:  ARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFL
          V + WN+ ++ WN+  RR LN+ E   W  +  +L   R    +   +W  +++  FS+ S  V +    D    +    L  +IWK + P KIK F+
Subjt:  ARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFL

Query:  WELSLGAINTADRLQRRMPHFSLSPS
        W L    INT + +Q++MP+  L P+
Subjt:  WELSLGAINTADRLQRRMPHFSLSPS

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.4e-14625.68Show/hide
Query:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI
        RS  I+RK F +  D+Y++ +   +TE G + + S+ ++ + L+W+ ST  +L + P + +FF E R  E+ + + K  N  G   EI ++ +   +  I
Subjt:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI

Query:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI
        L+P    K  W SF S+I+  PK      ++P                    RSY + + + +  +S                   P ++  P L+   +
Subjt:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI

Query:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED
        ++V+RF+  DDW  I  ++        + N F   K L+H      A  LC++  W+ +GK+ ++F      ++       SYGGW   R +    W+  
Subjt:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED

Query:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK
         F+ IG +CGG +  +  T    NL+EA+LK+R N +GF+P+ + +     G +  VQ+      + G+    ER+    G+++  +  +   D N + +
Subjt:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK

Query:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGK---EPLIMEEPSVELKNNISLPVSPTNLK----IGQKGSTSGLSPKLVIVGS
        +         SP       ++     +  SIS + P+   S      K    P  + E  V   N++    + + LK    I   GS      K+ I   
Subjt:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGK---EPLIMEEPSVELKNNISLPVSPTNLK----IGQKGSTSGLSPKLVIVGS

Query:  DTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILS
         T A++          K++ + P       T FNPD    P    P +      + ++  K +   P +  N  Q +   I QP+  +   L       S
Subjt:  DTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILS

Query:  EHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGVQETKTSNIDNHLIKSL
        + GL +   +   P L P +                   + V  T +      E  N    V+Y K               +  ++T +    N L+  L
Subjt:  EHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGVQETKTSNIDNHLIKSL

Query:  WSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWI
          +        DS GA              +GGILI+W     ++    +G FS+S + F +   S+WLT +YGP +R  R++ W++LH+L  L    WI
Subjt:  WSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWI

Query:  LGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSL--TFGDI
        +GGD NV R   E +     + S  + N  I+   L+D PL N  +TWS+  + P  S +DRFL +      F       L R TSDH+PL    +   +
Subjt:  LGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSL--TFGDI

Query:  NWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLAL-SQLPSLITQLKRLDNLE-DKGLTMSQKESRRILREQI
         WGP PF+  +  L    F+  ++ WW  +   G PG  F+ +LK L N ++ W +    +L S   ++I ++  +D  E D  L++ +   R  L+ ++
Subjt:  NWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLAL-SQLPSLITQLKRLDNLE-DKGLTMSQKESRRILREQI

Query:  EDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVD-FYRLLFTKDNRTRFLPTNVDWCPI---------
         DL+ +E   W QR K  W+KEGDEN+ FFHRI ++R+++N I E+    G    + N+I   FV+ F R+      +      N++W PI         
Subjt:  EDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVD-FYRLLFTKDNRTRFLPTNVDWCPI---------

Query:  ---SEAQSKGI---------------------------------------------------------------HIDD-------TELEEMVAKF-----
           SE + KG+                                                               H  D       T + + +AK      
Subjt:  ---SEAQSKGI---------------------------------------------------------------HIDD-------TELEEMVAKF-----

Query:  -------------------------------------------------------------------------------GC-------------------
                                                                                       GC                   
Subjt:  -------------------------------------------------------------------------------GC-------------------

Query:  -KRGF---------------------------------------WPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMP
          RG                                         P TYLG+PLGGNPK+ +FW+ + ++IQ KL +WKYA ISKGGRLTLI++TLSS+P
Subjt:  -KRGF---------------------------------------WPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMP

Query:  TYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIP
         Y LS+F+ PS   K+++KL R+F W GS G  G H +NW     PK  GGLGI   Q+ N ALL+KW+WR+  E  SLWR LI  K Y  +     P  
Subjt:  TYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIP

Query:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLS
        I   S K+PWR I   ID   S     + NG    FW  +W     LS  +PRL+ L+ +  + + +VWNS  + W ++ RR LN+ E+  W  +   L 
Subjt:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLS

Query:  PIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENS
         +R        +W  ++ K FS+ S    +    D  + N    L ++IWK + P KIK F+W L    +NT +          L P+WCV+C    E  
Subjt:  PIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENS

Query:  GHLFVTCSFATKYWNLMLEAFGWHLPMSNN---IHDFLASIFVGHP-----FQGTKRILW
         HLF+ C      W+L+  +  +  P+S++   +  F  S+    P     F G   ILW
Subjt:  GHLFVTCSFATKYWNLMLEAFGWHLPMSNN---IHDFLASIFVGHP-----FQGTKRILW

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.2e-13924.93Show/hide
Query:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---
        +TE  ++ S S+ +T  +L W+ + F  L     T  FF+E R ++  + V K  NK       EI ++ N G + SIL+P   +  GW SF +LI+   
Subjt:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---

Query:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL
                                         Y K  +  +     + YK        + S  +   P  L    +   +++ R    DDW  I  S+ 
Subjt:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL

Query:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS
               S  PFQ +KA+L +      L LC    +  WS +G +++KF    S  +   +   SYGGW+  R +    W+ + F+ IG +CGG+L  + 
Subjt:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS

Query:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH
         T +M  L++A++KVR N  GF+P+SI L     GE   V     T +        ER+    GS+  +      +  +L E       Q I  E +  H
Subjt:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH

Query:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----
                 +       Q   N  S S    F       +  KGK  LI+ +         S  + N     +SP       +N +I  KG +  +    
Subjt:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----

Query:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM
               SP+       T      P      HK++L         + +        PL  M     N G    NN                +TP  N   
Subjt:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM

Query:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD
            E  +++   + E               R T     K  K   EL+        D+     L   ++E +   +    N +  S +    +   ++D
Subjt:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD

Query:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT
              +G  GGIL++W + +F + +   G +S+S+++   +G ++WLT++YGP +   R   W EL  L  L    W++ GDFN+ RW  E +   L  
Subjt:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT

Query:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL
        R+M  FN  I+  +L+D P  N  FTWS+   NP  S +DRFL+SK   N FG      L+R  SDH+P+ L    I WGP PF+  NS L+   F++  
Subjt:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL

Query:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG
         NWWN +   G+PG+AF+  L  L   ++ W  +  +L  +   +L+ ++  +D LE +G   +    +RI L+  +  +   +   W QR + +W   G
Subjt:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG

Query:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------
        DEN  +FHRI    +RKN I  +    G SL S +DI + F+  ++ ++TK++    L  N+ W PI                                 
Subjt:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP
           S  +SKG                       I ++D E                                      +++ + FG +  F P  YLG+P
Subjt:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP

Query:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG
        LGGNP++  FW   IE I  KL  WKY+ ISKGGRLTL++A+LSS+PTY LS FK P  V K ++K  RDF W GS+     H +NW+    PK LGGLG
Subjt:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG

Query:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR
        I   +  N ALL KW+WR+ +E  SLW+  I AKY  +      P+  +  S  SPW  I    D   S+I  +  +G S  FW   W N + LS   PR
Subjt:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR

Query:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN
        LY L+   +A V E+W+     WN+ PRR LNE E   W ++   L  I         +W    SK ++V S        S    +T        +W+ +
Subjt:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN

Query:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD
         P+K K F+W +    +NT D +Q+R P  SL+PSWC+ C S+ E+  HLF+ C FA   WN+     G   PM N N+ D
Subjt:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD

TrEMBL top hitse value%identityAlignment
A0A5A7US62 LINE-1 retrotransposable element ORF2 protein5.6e-14024.93Show/hide
Query:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---
        +TE  ++ S S+ +T  +L W+ + F  L     T  FF+E R ++  + V K  NK       EI ++ N G + SIL+P   +  GW SF +LI+   
Subjt:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---

Query:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL
                                         Y K  +  +     + YK        + S  +   P  L    +   +++ R    DDW  I  S+ 
Subjt:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL

Query:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS
               S  PFQ +KA+L +      L LC    +  WS +G +++KF    S  +   +   SYGGW+  R +    W+ + F+ IG +CGG+L  + 
Subjt:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS

Query:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH
         T +M  L++A++KVR N  GF+P+SI L     GE   V     T +        ER+    GS+  +      +  +L E       Q I  E +  H
Subjt:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH

Query:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----
                 +       Q   N  S S    F       +  KGK  LI+ +         S  + N     +SP       +N +I  KG +  +    
Subjt:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----

Query:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM
               SP+       T      P      HK++L         + +        PL  M     N G    NN                +TP  N   
Subjt:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM

Query:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD
            E  +++   + E               R T     K  K   EL+        D+     L   ++E +   +    N +  S +    +   ++D
Subjt:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD

Query:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT
              +G  GGIL++W + +F + +   G +S+S+++   +G ++WLT++YGP +   R   W EL  L  L    W++ GDFN+ RW  E +   L  
Subjt:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT

Query:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL
        R+M  FN  I+  +L+D P  N  FTWS+   NP  S +DRFL+SK   N FG      L+R  SDH+P+ L    I WGP PF+  NS L+   F++  
Subjt:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL

Query:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG
         NWWN +   G+PG+AF+  L  L   ++ W  +  +L  +   +L+ ++  +D LE +G   +    +RI L+  +  +   +   W QR + +W   G
Subjt:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG

Query:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------
        DEN  +FHRI    +RKN I  +    G SL S +DI + F+  ++ ++TK++    L  N+ W PI                                 
Subjt:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP
           S  +SKG                       I ++D E                                      +++ + FG +  F P  YLG+P
Subjt:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP

Query:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG
        LGGNP++  FW   IE I  KL  WKY+ ISKGGRLTL++A+LSS+PTY LS FK P  V K ++K  RDF W GS+     H +NW+    PK LGGLG
Subjt:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG

Query:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR
        I   +  N ALL KW+WR+ +E  SLW+  I AKY  +      P+  +  S  SPW  I    D   S+I  +  +G S  FW   W N + LS   PR
Subjt:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR

Query:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN
        LY L+   +A V E+W+     WN+ PRR LNE E   W ++   L  I         +W    SK ++V S        S    +T        +W+ +
Subjt:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN

Query:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD
         P+K K F+W +    +NT D +Q+R P  SL+PSWC+ C S+ E+  HLF+ C FA   WN+     G   PM N N+ D
Subjt:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein3.1e-14625.68Show/hide
Query:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI
        RS  I+RK F +  D+Y++ +   +TE G + + S+ ++ + L+W+ ST  +L + P + +FF E R  E+ + + K  N  G   EI ++ +   +  I
Subjt:  RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGSRISI

Query:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI
        L+P    K  W SF S+I+  PK      ++P                    RSY + + + +  +S                   P ++  P L+   +
Subjt:  LIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQPKVS------------------HPLNTSPPDLVWTDI

Query:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED
        ++V+RF+  DDW  I  ++        + N F   K L+H      A  LC++  W+ +GK+ ++F      ++       SYGGW   R +    W+  
Subjt:  IVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSED

Query:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK
         F+ IG +CGG +  +  T    NL+EA+LK+R N +GF+P+ + +     G +  VQ+      + G+    ER+    G+++  +  +   D N + +
Subjt:  VFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK

Query:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGK---EPLIMEEPSVELKNNISLPVSPTNLK----IGQKGSTSGLSPKLVIVGS
        +         SP       ++     +  SIS + P+   S      K    P  + E  V   N++    + + LK    I   GS      K+ I   
Subjt:  ENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGK---EPLIMEEPSVELKNNISLPVSPTNLK----IGQKGSTSGLSPKLVIVGS

Query:  DTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILS
         T A++          K++ + P       T FNPD    P    P +      + ++  K +   P +  N  Q +   I QP+  +   L       S
Subjt:  DTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAG-QSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILS

Query:  EHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGVQETKTSNIDNHLIKSL
        + GL +   +   P L P +                   + V  T +      E  N    V+Y K               +  ++T +    N L+  L
Subjt:  EHGLCI---MAIPPFLPPKR-------------------KTVTTTGKKTKLQRELDNLKTTVHYDKTAS--------LALTEGVQETKTSNIDNHLIKSL

Query:  WSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWI
          +        DS GA              +GGILI+W     ++    +G FS+S + F +   S+WLT +YGP +R  R++ W++LH+L  L    WI
Subjt:  WSSSHIGWSSLDSVGA--------------SGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWI

Query:  LGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSL--TFGDI
        +GGD NV R   E +     + S  + N  I+   L+D PL N  +TWS+  + P  S +DRFL +      F       L R TSDH+PL    +   +
Subjt:  LGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSL--TFGDI

Query:  NWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLAL-SQLPSLITQLKRLDNLE-DKGLTMSQKESRRILREQI
         WGP PF+  +  L    F+  ++ WW  +   G PG  F+ +LK L N ++ W +    +L S   ++I ++  +D  E D  L++ +   R  L+ ++
Subjt:  NWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLAL-SQLPSLITQLKRLDNLE-DKGLTMSQKESRRILREQI

Query:  EDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVD-FYRLLFTKDNRTRFLPTNVDWCPI---------
         DL+ +E   W QR K  W+KEGDEN+ FFHRI ++R+++N I E+    G    + N+I   FV+ F R+      +      N++W PI         
Subjt:  EDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVD-FYRLLFTKDNRTRFLPTNVDWCPI---------

Query:  ---SEAQSKGI---------------------------------------------------------------HIDD-------TELEEMVAKF-----
           SE + KG+                                                               H  D       T + + +AK      
Subjt:  ---SEAQSKGI---------------------------------------------------------------HIDD-------TELEEMVAKF-----

Query:  -------------------------------------------------------------------------------GC-------------------
                                                                                       GC                   
Subjt:  -------------------------------------------------------------------------------GC-------------------

Query:  -KRGF---------------------------------------WPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMP
          RG                                         P TYLG+PLGGNPK+ +FW+ + ++IQ KL +WKYA ISKGGRLTLI++TLSS+P
Subjt:  -KRGF---------------------------------------WPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMP

Query:  TYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIP
         Y LS+F+ PS   K+++KL R+F W GS G  G H +NW     PK  GGLGI   Q+ N ALL+KW+WR+  E  SLWR LI  K Y  +     P  
Subjt:  TYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIP

Query:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLS
        I   S K+PWR I   ID   S     + NG    FW  +W     LS  +PRL+ L+ +  + + +VWNS  + W ++ RR LN+ E+  W  +   L 
Subjt:  IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLS

Query:  PIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENS
         +R        +W  ++ K FS+ S    +    D  + N    L ++IWK + P KIK F+W L    +NT +          L P+WCV+C    E  
Subjt:  PIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENS

Query:  GHLFVTCSFATKYWNLMLEAFGWHLPMSNN---IHDFLASIFVGHP-----FQGTKRILW
         HLF+ C      W+L+  +  +  P+S++   +  F  S+    P     F G   ILW
Subjt:  GHLFVTCSFATKYWNLMLEAFGWHLPMSNN---IHDFLASIFVGHP-----FQGTKRILW

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein2.9e-15226.3Show/hide
Query:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK
        PS +  +N P         RS  ++RK F +  D+YS+ +   +TE G + + S+ ++ + L+W+  T  +L   P T +FF E R  E  + + K  N 
Subjt:  PSASLPWNHPT--------RSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNK

Query:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH
         G   EI ++     +  IL+P   +K GW SF S+I+  PK      ++P                    RSY + + + +P               SH
Subjt:  HGYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKP------------------QSRSYKEILQQKQP-------------KVSH

Query:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV
          + S       DL+   +++V+RF+  DDW  I  ++        + N F   KAL+H      A  LC++  WS +GK+ ++F   +   +       
Subjt:  PLNTS-----PPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVYDRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTV

Query:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKG
        SYGGW   R +    W+   F+ IG +C G +  +  T    NL+EAR+KVR N +GF+P+++ +     G +  VQ+        G+    ER+    G
Subjt:  SYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKG

Query:  SYEVEEDKSESKDLNLEEK----ENTQVIARE--SSPIHERESPIMEQPPFNEDSI-----SVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNIS-----
        +++  +  +   D N E +    E ++ I+ +  S+    R+S   +QP   +  I     +   P+  +    +         +  +E+ + IS     
Subjt:  SYEVEEDKSESKDLNLEEK----ENTQVIARE--SSPIHERESPIMEQPPFNEDSI-----SVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNIS-----

Query:  -----------LPVSPTNL-KIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHK------------------------INLDPPPTHDFDLTIFNPD
                    P S  NL K  +K S +  S K  I   D+     SPS +NSP K                         N     T    +   + D
Subjt:  -----------LPVSPTNL-KIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHK------------------------INLDPPPTHDFDLTIFNPD

Query:  QHHLPLAI------MPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQR
             L++      +P    N      +N  NA   DI      PETP +  P+               +H             K K   +   K +L  
Subjt:  QHHLPLAI------MPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQR

Query:  EL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLT
         L  + LK +   D + +   T  +     S +   +  +IKSLW S+ I W + ++ G+SGGILI+W   + ++    +GLFS+S +  + +  S+WLT
Subjt:  EL--DNLKTTVHYDKTASLALTEGVQETKTSNI---DNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLT

Query:  NIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCL
         +YGP +R  RI FW ELH+L  L    WILGGD NV R   E +     + + R+ N  I+   L+D PL N  FTWS+  + P  S IDRFL +    
Subjt:  NIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCL

Query:  NKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRS--HSLALSQLPSL
        N F       L R TSDH+PL     +  ++WGP PF+  +  L    F+  +  WW ++   G+PG +F+ +LK L N ++ W +   HSL  ++  ++
Subjt:  NKFGSSHLLRLDRITSDHYPLSLTFGD--INWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRS--HSLALSQLPSL

Query:  ITQLKRLDNLE-DKGLTMSQKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYR
        I ++  +D  E D  LT  +   R  L+  + +L+ +E   W QR K  W++EGDEN+ FFHRI ++R++++ I E+    G    + N I   F+ F+ 
Subjt:  ITQLKRLDNLE-DKGLTMSQKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYR

Query:  LLFTKDNRT-RFLPTNVDWCPIS------------EAQSKGI----------------------------------------------------------
         ++    ++      N+DW PI+            E + KG+                                                          
Subjt:  LLFTKDNRT-RFLPTNVDWCPIS------------EAQSKGI----------------------------------------------------------

Query:  -----------------------HID--DTELEE-------------------------------MVAKFGCKRG-------------------------
                               ++D  D  LE+                               + A  G ++G                         
Subjt:  -----------------------HID--DTELEE-------------------------------MVAKFGCKRG-------------------------

Query:  --------------------------------------------------------------------------FW-------PSTYLGLPLGGNPKNFV
                                                                                  FW       P +YLG+PLGGNPK+ +
Subjt:  --------------------------------------------------------------------------FW-------PSTYLGLPLGGNPKNFV

Query:  FWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNI
        FW+ V +KIQ KL +WKYA ISKGGRLTLI++TLSS+P Y LS+F+ PS   K+++KL R F W G+ G  G H +NW K    K  GGLGI    + N 
Subjt:  FWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNI

Query:  ALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPN
        ALL+KW+WR+L E  +LWR LI  K Y  +     P  I   + K+PWR I  + D   S     + NG    FW  +W     LS  +PRL+ LT +  
Subjt:  ALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPN

Query:  ARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFL
          V + WN+ ++ WN+  RR LN+ E   W  +  +L   R    +   +W  +++  FS+ S  V +    D    +    L  +IWK + P KIK F+
Subjt:  ARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGSDTILNN----LYSVIWKDNYPKKIKIFL

Query:  WELSLGAINTADRLQRRMPHFSLSPS
        W L    INT + +Q++MP+  L P+
Subjt:  WELSLGAINTADRLQRRMPHFSLSPS

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein5.6e-14024.93Show/hide
Query:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---
        +TE  ++ S S+ +T  +L W+ + F  L     T  FF+E R ++  + V K  NK       EI ++ N G + SIL+P   +  GW SF +LI+   
Subjt:  ITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKH--GYFVEINQLQNSGSRISILIPSESNKQGWFSFFSLIS---

Query:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL
                                         Y K  +  +     + YK        + S  +   P  L    +   +++ R    DDW  I  S+ 
Subjt:  --------------------------------DYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDL---VWTDIIVVQRFYQRDDWPSIRSSIL

Query:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS
               S  PFQ +KA+L +      L LC    +  WS +G +++KF    S  +   +   SYGGW+  R +    W+ + F+ IG +CGG+L  + 
Subjt:  SSISNRCSINPFQDNKALLHVYDRKTALELCK---STEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSS

Query:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH
         T +M  L++A++KVR N  GF+P+SI L     GE   V     T +        ER+    GS+  +      +  +L E       Q I  E +  H
Subjt:  HTDRMINLMEARLKVRQNSTGFIPSSIALPIALVGEEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEK---ENTQVIARESSPIH

Query:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----
                 +       Q   N  S S    F       +  KGK  LI+ +         S  + N     +SP       +N +I  KG +  +    
Subjt:  ---------ERESPIMEQPPFNEDSISV--DFPNLTDSTKSSKGKEPLIMEEP--------SVELKNNISLPVSP-------TNLKIGQKGSTSGL----

Query:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM
               SP+       T      P      HK++L         + +        PL  M     N G    NN                +TP  N   
Subjt:  -------SPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPPINQPM

Query:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD
            E  +++   + E               R T     K  K   EL+        D+     L   ++E +   +    N +  S +    +   ++D
Subjt:  FALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETK---TSNIDNHLIKSLWSSSHIGWSSLD

Query:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT
              +G  GGIL++W + +F + +   G +S+S+++   +G ++WLT++YGP +   R   W EL  L  L    W++ GDFN+ RW  E +   L  
Subjt:  -----SVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLIT

Query:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL
        R+M  FN  I+  +L+D P  N  FTWS+   NP  S +DRFL+SK   N FG      L+R  SDH+P+ L    I WGP PF+  NS L+   F++  
Subjt:  RSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVL

Query:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG
         NWWN +   G+PG+AF+  L  L   ++ W  +  +L  +   +L+ ++  +D LE +G   +    +RI L+  +  +   +   W QR + +W   G
Subjt:  DNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSH-SLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRI-LREQIEDLTAQEHIQWKQRCKLKWVKEG

Query:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------
        DEN  +FHRI    +RKN I  +    G SL S +DI + F+  ++ ++TK++    L  N+ W PI                                 
Subjt:  DENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPI---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP
           S  +SKG                       I ++D E                                      +++ + FG +  F P  YLG+P
Subjt:  ---SEAQSKG-----------------------IHIDDTE-------------------------------------LEEMVAKFGCKRGFWPSTYLGLP

Query:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG
        LGGNP++  FW   IE I  KL  WKY+ ISKGGRLTL++A+LSS+PTY LS FK P  V K ++K  RDF W GS+     H +NW+    PK LGGLG
Subjt:  LGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLG

Query:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR
        I   +  N ALL KW+WR+ +E  SLW+  I AKY  +      P+  +  S  SPW  I    D   S+I  +  +G S  FW   W N + LS   PR
Subjt:  IGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPR

Query:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN
        LY L+   +A V E+W+     WN+ PRR LNE E   W ++   L  I         +W    SK ++V S        S    +T        +W+ +
Subjt:  LYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVKSLMVDLLGGS----DTILNNLYSVIWKDN

Query:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD
         P+K K F+W +    +NT D +Q+R P  SL+PSWC+ C S+ E+  HLF+ C FA   WN+     G   PM N N+ D
Subjt:  YPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSN-NIHD

M5WPQ5 Reverse transcriptase domain-containing protein4.8e-13930.74Show/hide
Query:  ETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRW
        ETK   +D  L+  +W S    W    S+G SGGI ++W+    ++ +++ G FSVSI +    G  +WL+ IYGP R+  R  FW+EL DL G  GD+W
Subjt:  ETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRW

Query:  ILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDIN
         LGGDFNV R+S EKS+   +T+SMR FN  I    L D  L N  FTWS+  +N     +DRFL+S      F       L RITSDH P+ L    + 
Subjt:  ILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDIN

Query:  WGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLALSQ-LPSLITQLKRLDNLE-DKGLTMSQKESRRILREQIE
        WGP PF+FEN WL    F+  +  WW  + + GW G+ FM +LK LK++L+ W++     + + L     +L  LD  E  +GL    +  R  L  +I 
Subjt:  WGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNRSHSLALSQ-LPSLITQLKRLDNLE-DKGLTMSQKESRRILREQIE

Query:  DLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEV-LSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPISE--------
        DL  +E ++W+QR K+KW +EGD NTKFFHR+    +++N I ++ +   GV  V AN IE+E + F++ L++++    +    ++WCPIS+        
Subjt:  DLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEV-LSRGGVSLVSANDIEKEFVDFYRLLFTKDNRTRFLPTNVDWCPISE--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------AQSKGIHI-------------------------------------------------------------------------------------
               +QS+G  +                                                                                     
Subjt:  -------AQSKGIHI-------------------------------------------------------------------------------------

Query:  ---------------------------DDT--------------------------------------------ELEEMVAKFGCKRGFWPSTYLGLPLG
                                   DDT                                             L  M   +GC+ G WP  YLGLPLG
Subjt:  ---------------------------DDT--------------------------------------------ELEEMVAKFGCKRGFWPSTYLGLPLG

Query:  GNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIG
        GNP+   FW PV+EK++ +L+ WK A +SKGGRLTLIQA LSS+P+Y++SLFK+P  VA  +++L+R+F W G +     H V W++    K  GGLGIG
Subjt:  GNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIG

Query:  NFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIP-IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRL
        + + RN AL AKW+WRF  E  SLW  +I +KY    D   W    I + S ++PWR I    +         +GNG    FW+D WL   IL +LFPRL
Subjt:  NFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIP-IQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRL

Query:  YRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMN-DFWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYP
          L+   N  +A   N+  +   W+   RR+L+E E+ E   L  +L  +RL     D  SW +E    FS KS    LL  +  +    +S IWK   P
Subjt:  YRLTTNPNARVAEVWNS--IESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMN-DFWSWPLEASKLFSVKSLMVDLLGGSDTILNNLYSVIWKDNYP

Query:  KKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG
         KI+ F+W  + G INT D +QRR P   LSPSWCV+C  N EN  HLF+ CS++ + W  ML A G
Subjt:  KKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657506.2e-3529.94Show/hide
Query:  VIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLA
        ++E++  ++  W+   +S  GRLTL +A LSSMP + +S   LP  +   LD+L R F W  +      H V W K   PK  GGLG+   +  N AL++
Subjt:  VIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLA

Query:  KWIWRFLHEEGSLWRNLIIAKYYNSEDVR--HWPIPIQRGSFKSPWRFICTTI-DKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNA
        K  WR L E+ SLW  L++ K Y+  ++R   W IP  +GS+ S WR I   + D V+  +  I G+G    FW D W++G  L  L     R T     
Subjt:  KWIWRFLHEEGSLWRNLIIAKYYNSEDVR--HWPIPIQRGSFKSPWRFICTTI-DKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNA

Query:  RVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RDMNDFWSWPLEASKLFSVKS----LMVDLLGGSDTILNNLYSVIWKDNYPKKIKI
           ++W      W+ +      + +     N    L  + L       D  SW       FSV+S    L VD +   +  + + ++ +WK   P+++K 
Subjt:  RVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RDMNDFWSWPLEASKLFSVKS----LMVDLLGGSDTILNNLYSVIWKDNYPKKIKI

Query:  FLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYW
        FLW +   A+ T +   RR  H S S + C +C   +E+  H+   C      W
Subjt:  FLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYW

P93295 Uncharacterized mitochondrial protein AtMg003104.5e-0928.08Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRH
        ++P Y +S F+L   + K L   + +F+W+  +    +  V W K    K   GGLG  +    N ALLAK  +R +H+  +L   L+ ++Y+    +  
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRH

Query:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL
          +   R S+   WR I    + ++  + R IG+G  T  W D W+
Subjt:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein4.9e-1124.65Show/hide
Query:  ILGGDFN---VTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSF-GDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTF
        IL GDF+    T   +      +  R +  F   +    L+DIP +   +TWS+   DNP +  +DR + + D  + F S+  +      SDH P  +  
Subjt:  ILGGDFN---VTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNGCFTWSSF-GDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTF

Query:  GDI-NWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMM--KLKGLKNELRNWNR------SHSL--ALSQLPSLITQLKRLDNLEDKGLTMS
         ++       F++ +      +F   L   W      G   H F +   LK  K   +  NR       H    AL  L S+ +QL  L N  D     S
Subjt:  GDI-NWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMM--KLKGLKNELRNWNR------SHSL--ALSQLPSLITQLKRLDNLEDKGLTMS

Query:  QKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDN
              + R++     A     ++Q+ ++KW+++GD NT+FFH+++ A + KN I  +     V + +   +++  V +Y  L   D+
Subjt:  QKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDN

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-2827.08Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHW
        ++PTY ++ F LP  V K +  ++ DF+W   Q   GMH   WD     K  GG+G  + +  N+ALL K +WR L    SL   +  ++Y++  D  + 
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHW

Query:  PIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSY
        P+   R SF   W+ I  + + +      ++GNG     W+  WL+    S    R+ R+     A V+ +    +           +  E++       
Subjt:  PIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSY

Query:  LLSPIRL--RDMNDFWSWPLEASKLFSVKS---LMVDLLGG-------SDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSW
        L+  +R   R + D ++W   +S  ++VKS   ++  ++         S+  LN +Y  IWK     KI+ FLW+    ++  A  L  R  H S   S 
Subjt:  LLSPIRL--RDMNDFWSWPLEASKLFSVKS---LMVDLLGG-------SDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSW

Query:  CVMCSSNMENSGHLFVTCSFATKYW
        C+ C S  E   HL   C+FA   W
Subjt:  CVMCSSNMENSGHLFVTCSFATKYW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.2e-1028.08Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRH
        ++P Y +S F+L   + K L   + +F+W+  +    +  V W K    K   GGLG  +    N ALLAK  +R +H+  +L   L+ ++Y+    +  
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRH

Query:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL
          +   R S+   WR I    + ++  + R IG+G  T  W D W+
Subjt:  WPIPIQRGSFKSPWRFICTTIDKVTSRIHRIIGNGCSTFFWKDSWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTCTCCTTCAGCCTCACTCCCATGGAACCATCCTACCCGATCCATTTCCATAGATCGAAAAACCTTCTCTATAGCTTTTGATGAATACTCACGTGGAAGTAAAGC
AAAAATCACTGAAAAAGGTAGAAATTTCTCCAAGTCTCTGTCTCTCACTTGGAAATCCCTCAACTGGTTAGCCTCCACTTTCAACACCCTTGCCAAAGAACCTTGTACGT
ACAAATTCTTCTCAGAATTTCGTGGGGATGAATATATTCTTTGTGTTGAAAAACTCAACAATAAACATGGATACTTTGTGGAGATCAACCAACTGCAGAACTCTGGGAGC
AGAATCAGCATTCTCATCCCTTCCGAAAGCAACAAACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCA
ATCACGATCGTATAAGGAGATCCTCCAACAGAAGCAACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGT
TCTATCAGCGTGATGATTGGCCATCCATTCGTTCATCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTAT
GATCGCAAGACAGCCCTTGAGTTATGCAAATCGACTGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTT
CACTGTTTCTTATGGTGGTTGGATAGAAGTTCGAAACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCATTGGTGATAGTTGTGGAGGCTACCTAACAACCT
CAAGTCACACCGATAGGATGATCAATCTCATGGAAGCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGC
GAAGAAATTACAGTCCAAATTTGGGGACTTACCGGAGAGACAATCGGACGGGAACAATTTAATGAGCGGCACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAA
ATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAGAGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCAT
TTAATGAAGATTCAATATCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATCGGTGGAATTAAAAAAT
AACATATCTCTTCCTGTGAGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACATCGGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTGATACTGAAGCTTATTT
ATCCAGCCCATCTCCAATCAATTCACCTCATAAGATTAATTTGGACCCACCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAACATCATCTTCCTTTAG
CTATTATGCCGCCAGAATCAACAAATGCTGGCCAATCAGCCATAAATAACAAAGTGAATGCTCCAGCTCCAGATATTTTAACCAACCATCCCCAACCAGAAACACCTCCC
ATCAATCAGCCAATGTTTGCCCTCCCAGAATATCTCCGTCATATAGCTCCAATTCTTAGTGAGCATGGGTTGTGTATCATGGCTATCCCTCCATTTCTACCACCTAAAAG
GAAGACAGTTACTACTACCGGGAAGAAAACAAAACTCCAGAGAGAGCTTGATAACCTAAAAACTACAGTGCATTATGATAAAACTGCTTCTTTGGCCTTAACGGAGGGAG
TACAGGAAACAAAAACGTCTAATATAGACAATCATCTGATTAAATCCTTATGGAGTTCATCTCATATTGGCTGGTCTTCTCTCGATTCCGTTGGAGCATCAGGGGGCATC
CTTATTATGTGGAGTGAACCAGACTTCACTATCAAAGAGACAATTCAAGGTCTTTTCTCTGTCTCTATTCATGTTTTTATGGCTGATGGTTTTTCTTTTTGGCTTACAAA
TATTTATGGCCCCTCTCGACGAGAATTTCGTATTGACTTTTGGAAAGAATTACATGATCTGGCGGGTCTGGGAGGTGATCGTTGGATCCTTGGAGGAGACTTTAATGTTA
CCCGATGGTCATGGGAGAAATCTCATGGTCGTCTCATCACTCGGAGTATGCGTATTTTCAACCAATTGATTGCAACTTACAAGCTTCTGGATATCCCATTACAAAATGGT
TGTTTCACCTGGTCCAGTTTTGGTGACAATCCGTATCTCTCCTTAATAGACAGATTTTTGATTTCCAAAGATTGTCTGAATAAATTCGGGTCTTCTCATCTTCTTCGGCT
TGACAGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTTCAATTTGAAAATTCCTGGCTGCAAATTGCATCATTCC
GTGAGGTGTTGGATAATTGGTGGAATCACAATTCTCTTCAAGGTTGGCCGGGTCATGCTTTTATGATGAAATTAAAAGGGCTGAAAAATGAGCTTCGAAATTGGAATAGA
TCCCACAGTTTGGCATTATCACAGCTTCCATCTCTGATTACTCAGTTAAAACGATTAGATAACCTTGAGGACAAGGGTCTAACTATGTCTCAAAAGGAATCTAGGCGTAT
TCTTCGAGAACAGATTGAAGATTTAACTGCCCAGGAGCATATCCAATGGAAACAACGTTGTAAACTTAAATGGGTTAAAGAGGGTGATGAAAATACTAAATTCTTCCATC
GCATCTTGGCAGCCCGTAAAAGGAAGAATTCGATTACTGAGGTGTTATCTCGAGGTGGGGTCAGTTTGGTTTCAGCCAATGATATTGAAAAAGAATTCGTTGATTTTTAT
CGGCTTTTGTTCACCAAAGATAATCGTACTCGTTTTCTGCCAACCAATGTTGATTGGTGCCCAATTAGTGAAGCTCAATCGAAAGGGATCCATATTGATGATACAGAGTT
GGAAGAAATGGTTGCTAAATTTGGTTGTAAGCGTGGTTTTTGGCCTAGCACATATCTTGGGCTTCCTTTGGGTGGTAACCCGAAAAACTTTGTTTTCTGGCAACCAGTTA
TTGAGAAGATTCAACATAAATTACGTAGTTGGAAATATGCCTTTATATCCAAAGGAGGAAGGCTTACCCTCATCCAAGCTACTCTTTCGAGTATGCCGACGTATTTTCTG
TCTTTGTTTAAACTTCCAAGTAAGGTTGCTAAATCTCTTGACAAGCTAGTGCGAGATTTTTTCTGGGCAGGTTCTCAAGGGGATGGTGGTATGCATAATGTTAATTGGGA
TAAGACTCAGCTTCCGAAATTACTGGGAGGTCTTGGCATTGGCAATTTTCAGCTTCGAAATATAGCCTTGTTAGCTAAATGGATTTGGAGATTTTTGCACGAAGAAGGAT
CTCTTTGGCGTAATCTCATTATTGCTAAATATTACAACTCGGAGGATGTTAGACATTGGCCTATTCCCATTCAAAGGGGATCTTTCAAATCTCCTTGGCGCTTTATTTGT
ACTACTATCGATAAGGTTACTAGTCGTATTCATCGAATTATTGGTAATGGTTGTAGCACATTTTTTTGGAAGGATTCCTGGCTGAATGGAGTGATTCTCTCAAATCTCTT
CCCTCGCCTTTATCGGTTAACTACCAATCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTATAGAATCAGCATGGAATCTGAGTCCTCGTCGTCACCTTAATGAGTTTG
AGATTATTGAATGGGCAAATTTATCCTATCTTCTGTCTCCTATTCGACTTCGGGATATGAATGACTTTTGGTCTTGGCCCCTTGAAGCATCCAAATTATTCTCTGTTAAA
TCCTTGATGGTTGACCTTTTGGGTGGTTCTGATACTATTTTGAATAATTTATATTCGGTGATATGGAAAGATAATTATCCTAAAAAGATAAAAATCTTTCTATGGGAGCT
TAGCCTTGGGGCTATCAATACAGCGGATCGTCTTCAACGTAGAATGCCTCATTTTTCACTTTCGCCATCTTGGTGTGTTATGTGCTCATCAAATATGGAGAATTCGGGCC
ATCTATTTGTCACTTGTTCTTTTGCTACCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGTTGGCATTTACCGATGTCGAATAACATTCATGACTTTCTGGCGTCTATT
TTTGTTGGTCATCCTTTTCAAGGAACAAAGAGGATTTTATGGCTGGCTTTTAATAGTCTTCTTTTGGTTTCTTTGGAACGAAAGAAATGGAAGAATTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGACTTCTCCTTCAGCCTCACTCCCATGGAACCATCCTACCCGATCCATTTCCATAGATCGAAAAACCTTCTCTATAGCTTTTGATGAATACTCACGTGGAAGTAAAGC
AAAAATCACTGAAAAAGGTAGAAATTTCTCCAAGTCTCTGTCTCTCACTTGGAAATCCCTCAACTGGTTAGCCTCCACTTTCAACACCCTTGCCAAAGAACCTTGTACGT
ACAAATTCTTCTCAGAATTTCGTGGGGATGAATATATTCTTTGTGTTGAAAAACTCAACAATAAACATGGATACTTTGTGGAGATCAACCAACTGCAGAACTCTGGGAGC
AGAATCAGCATTCTCATCCCTTCCGAAAGCAACAAACAAGGTTGGTTTTCTTTTTTCTCCTTAATTTCAGATTATCCAAAGGAGTTCAACCACCAGACATCAAAGCCGCA
ATCACGATCGTATAAGGAGATCCTCCAACAGAAGCAACCAAAGGTTTCCCATCCACTGAATACTTCTCCTCCAGATTTGGTTTGGACAGATATAATTGTTGTGCAGAGGT
TCTATCAGCGTGATGATTGGCCATCCATTCGTTCATCAATTCTCTCCTCCATATCCAATCGATGCTCTATTAATCCCTTCCAAGACAACAAAGCTTTGCTCCATGTCTAT
GATCGCAAGACAGCCCTTGAGTTATGCAAATCGACTGAATGGTCTCAAATTGGAAAACATCGGCTGAAATTCTACCCTTTGACATCGAAAGCATATAAGCAAGATAATTT
CACTGTTTCTTATGGTGGTTGGATAGAAGTTCGAAACCTCTCCCCTGTCTATTGGTCTGAAGATGTATTTCGATTCATTGGTGATAGTTGTGGAGGCTACCTAACAACCT
CAAGTCACACCGATAGGATGATCAATCTCATGGAAGCTCGTTTGAAGGTCCGACAGAATTCCACAGGCTTCATTCCATCATCGATTGCCCTCCCTATTGCCCTAGTCGGC
GAAGAAATTACAGTCCAAATTTGGGGACTTACCGGAGAGACAATCGGACGGGAACAATTTAATGAGCGGCACCAATTACGGAAGGGAAGTTATGAAGTAGAGGAGGATAA
ATCGGAATCAAAAGATTTGAATTTAGAGGAAAAAGAGAATACACAAGTGATTGCCCGAGAATCTTCACCGATTCATGAGAGAGAATCACCGATTATGGAGCAGCCACCAT
TTAATGAAGATTCAATATCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATCGGTGGAATTAAAAAAT
AACATATCTCTTCCTGTGAGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACATCGGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTGATACTGAAGCTTATTT
ATCCAGCCCATCTCCAATCAATTCACCTCATAAGATTAATTTGGACCCACCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAACATCATCTTCCTTTAG
CTATTATGCCGCCAGAATCAACAAATGCTGGCCAATCAGCCATAAATAACAAAGTGAATGCTCCAGCTCCAGATATTTTAACCAACCATCCCCAACCAGAAACACCTCCC
ATCAATCAGCCAATGTTTGCCCTCCCAGAATATCTCCGTCATATAGCTCCAATTCTTAGTGAGCATGGGTTGTGTATCATGGCTATCCCTCCATTTCTACCACCTAAAAG
GAAGACAGTTACTACTACCGGGAAGAAAACAAAACTCCAGAGAGAGCTTGATAACCTAAAAACTACAGTGCATTATGATAAAACTGCTTCTTTGGCCTTAACGGAGGGAG
TACAGGAAACAAAAACGTCTAATATAGACAATCATCTGATTAAATCCTTATGGAGTTCATCTCATATTGGCTGGTCTTCTCTCGATTCCGTTGGAGCATCAGGGGGCATC
CTTATTATGTGGAGTGAACCAGACTTCACTATCAAAGAGACAATTCAAGGTCTTTTCTCTGTCTCTATTCATGTTTTTATGGCTGATGGTTTTTCTTTTTGGCTTACAAA
TATTTATGGCCCCTCTCGACGAGAATTTCGTATTGACTTTTGGAAAGAATTACATGATCTGGCGGGTCTGGGAGGTGATCGTTGGATCCTTGGAGGAGACTTTAATGTTA
CCCGATGGTCATGGGAGAAATCTCATGGTCGTCTCATCACTCGGAGTATGCGTATTTTCAACCAATTGATTGCAACTTACAAGCTTCTGGATATCCCATTACAAAATGGT
TGTTTCACCTGGTCCAGTTTTGGTGACAATCCGTATCTCTCCTTAATAGACAGATTTTTGATTTCCAAAGATTGTCTGAATAAATTCGGGTCTTCTCATCTTCTTCGGCT
TGACAGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTTCAATTTGAAAATTCCTGGCTGCAAATTGCATCATTCC
GTGAGGTGTTGGATAATTGGTGGAATCACAATTCTCTTCAAGGTTGGCCGGGTCATGCTTTTATGATGAAATTAAAAGGGCTGAAAAATGAGCTTCGAAATTGGAATAGA
TCCCACAGTTTGGCATTATCACAGCTTCCATCTCTGATTACTCAGTTAAAACGATTAGATAACCTTGAGGACAAGGGTCTAACTATGTCTCAAAAGGAATCTAGGCGTAT
TCTTCGAGAACAGATTGAAGATTTAACTGCCCAGGAGCATATCCAATGGAAACAACGTTGTAAACTTAAATGGGTTAAAGAGGGTGATGAAAATACTAAATTCTTCCATC
GCATCTTGGCAGCCCGTAAAAGGAAGAATTCGATTACTGAGGTGTTATCTCGAGGTGGGGTCAGTTTGGTTTCAGCCAATGATATTGAAAAAGAATTCGTTGATTTTTAT
CGGCTTTTGTTCACCAAAGATAATCGTACTCGTTTTCTGCCAACCAATGTTGATTGGTGCCCAATTAGTGAAGCTCAATCGAAAGGGATCCATATTGATGATACAGAGTT
GGAAGAAATGGTTGCTAAATTTGGTTGTAAGCGTGGTTTTTGGCCTAGCACATATCTTGGGCTTCCTTTGGGTGGTAACCCGAAAAACTTTGTTTTCTGGCAACCAGTTA
TTGAGAAGATTCAACATAAATTACGTAGTTGGAAATATGCCTTTATATCCAAAGGAGGAAGGCTTACCCTCATCCAAGCTACTCTTTCGAGTATGCCGACGTATTTTCTG
TCTTTGTTTAAACTTCCAAGTAAGGTTGCTAAATCTCTTGACAAGCTAGTGCGAGATTTTTTCTGGGCAGGTTCTCAAGGGGATGGTGGTATGCATAATGTTAATTGGGA
TAAGACTCAGCTTCCGAAATTACTGGGAGGTCTTGGCATTGGCAATTTTCAGCTTCGAAATATAGCCTTGTTAGCTAAATGGATTTGGAGATTTTTGCACGAAGAAGGAT
CTCTTTGGCGTAATCTCATTATTGCTAAATATTACAACTCGGAGGATGTTAGACATTGGCCTATTCCCATTCAAAGGGGATCTTTCAAATCTCCTTGGCGCTTTATTTGT
ACTACTATCGATAAGGTTACTAGTCGTATTCATCGAATTATTGGTAATGGTTGTAGCACATTTTTTTGGAAGGATTCCTGGCTGAATGGAGTGATTCTCTCAAATCTCTT
CCCTCGCCTTTATCGGTTAACTACCAATCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTATAGAATCAGCATGGAATCTGAGTCCTCGTCGTCACCTTAATGAGTTTG
AGATTATTGAATGGGCAAATTTATCCTATCTTCTGTCTCCTATTCGACTTCGGGATATGAATGACTTTTGGTCTTGGCCCCTTGAAGCATCCAAATTATTCTCTGTTAAA
TCCTTGATGGTTGACCTTTTGGGTGGTTCTGATACTATTTTGAATAATTTATATTCGGTGATATGGAAAGATAATTATCCTAAAAAGATAAAAATCTTTCTATGGGAGCT
TAGCCTTGGGGCTATCAATACAGCGGATCGTCTTCAACGTAGAATGCCTCATTTTTCACTTTCGCCATCTTGGTGTGTTATGTGCTCATCAAATATGGAGAATTCGGGCC
ATCTATTTGTCACTTGTTCTTTTGCTACCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGTTGGCATTTACCGATGTCGAATAACATTCATGACTTTCTGGCGTCTATT
TTTGTTGGTCATCCTTTTCAAGGAACAAAGAGGATTTTATGGCTGGCTTTTAATAGTCTTCTTTTGGTTTCTTTGGAACGAAAGAAATGGAAGAATTTTTAG
Protein sequenceShow/hide protein sequence
MTSPSASLPWNHPTRSISIDRKTFSIAFDEYSRGSKAKITEKGRNFSKSLSLTWKSLNWLASTFNTLAKEPCTYKFFSEFRGDEYILCVEKLNNKHGYFVEINQLQNSGS
RISILIPSESNKQGWFSFFSLISDYPKEFNHQTSKPQSRSYKEILQQKQPKVSHPLNTSPPDLVWTDIIVVQRFYQRDDWPSIRSSILSSISNRCSINPFQDNKALLHVY
DRKTALELCKSTEWSQIGKHRLKFYPLTSKAYKQDNFTVSYGGWIEVRNLSPVYWSEDVFRFIGDSCGGYLTTSSHTDRMINLMEARLKVRQNSTGFIPSSIALPIALVG
EEITVQIWGLTGETIGREQFNERHQLRKGSYEVEEDKSESKDLNLEEKENTQVIARESSPIHERESPIMEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKN
NISLPVSPTNLKIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPINSPHKINLDPPPTHDFDLTIFNPDQHHLPLAIMPPESTNAGQSAINNKVNAPAPDILTNHPQPETPP
INQPMFALPEYLRHIAPILSEHGLCIMAIPPFLPPKRKTVTTTGKKTKLQRELDNLKTTVHYDKTASLALTEGVQETKTSNIDNHLIKSLWSSSHIGWSSLDSVGASGGI
LIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRIDFWKELHDLAGLGGDRWILGGDFNVTRWSWEKSHGRLITRSMRIFNQLIATYKLLDIPLQNG
CFTWSSFGDNPYLSLIDRFLISKDCLNKFGSSHLLRLDRITSDHYPLSLTFGDINWGPGPFQFENSWLQIASFREVLDNWWNHNSLQGWPGHAFMMKLKGLKNELRNWNR
SHSLALSQLPSLITQLKRLDNLEDKGLTMSQKESRRILREQIEDLTAQEHIQWKQRCKLKWVKEGDENTKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFY
RLLFTKDNRTRFLPTNVDWCPISEAQSKGIHIDDTELEEMVAKFGCKRGFWPSTYLGLPLGGNPKNFVFWQPVIEKIQHKLRSWKYAFISKGGRLTLIQATLSSMPTYFL
SLFKLPSKVAKSLDKLVRDFFWAGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNIALLAKWIWRFLHEEGSLWRNLIIAKYYNSEDVRHWPIPIQRGSFKSPWRFIC
TTIDKVTSRIHRIIGNGCSTFFWKDSWLNGVILSNLFPRLYRLTTNPNARVAEVWNSIESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRDMNDFWSWPLEASKLFSVK
SLMVDLLGGSDTILNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENSGHLFVTCSFATKYWNLMLEAFGWHLPMSNNIHDFLASI
FVGHPFQGTKRILWLAFNSLLLVSLERKKWKNF