CuGenDBv2

Lag0030702 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030702
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr11:543967..553516
RNA-Seq Expression: Lag0030702
Synteny: Lag0030702
Gene Ontology terms: NA
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 8.0e-191; % identity: 39.78)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        ETK  ++D+  + S+W  K V W  + + G  GG++ILWD SKL+  E + G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNV R I E++  +R T  MR F++ I E GL++ PL N  FTWS    D     +DRFL S EWD  F  S      R  SDH P+ LE     
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP+PFR  N WL   +  +   +   +    GW G     KL+ +K  +K W  +   + K+++K +L +++  D   ++  L+S+ +  RT  R EL 
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
        D+ L EE    QK ++ W+K GD N+ FFHR    ++ +  I  L S  G +L    +I  E++ FF  LY     +       DW  +S      L  P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EE+   +  L K KAPGP+GFT   + + W+ +K D +R+F E H NG +N      FI L+ K+   + I D+RPISL +S+YKI+AKVL+ RL+K
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   IS  Q AFVEGR ILD +LIANE V+  R   ++G + K+D EKA+D VDW FLD VL  KGF +KW  WI+GC+ +  F++ +NG  +G + AS
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF L+++V S ++ +    G  EGF VG+D+  VS+LQFADDTI F K        L   + +F   SGLKIN EKS + GIN     
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        +   AS  +C+V   P +YLGLPLGG+PK   FW PV++++ +++D WK+  LS GGR+TL  S LS IP YFLSLF + +SI+  ++++ R+F W G  
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
          K +HLV W +VS  ++ GGLG G ++ RN+ALL KW WRF  E    W +VI +IYGT   GW++      S R PW +IA+++Q F       +GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
         +IRF
Subjt:  MKIRF

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 2.0e-186; % identity: 38.01)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        ETK    D+ L+ S+WS ++  W  + + G  GG+LI+WD  K++  E + G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNV R   E++  SR T  M+ F++ I +  L++ PL +  +TWS   ++     +DRFL S EW+ +F  S      R  SDH+P++LE   F 
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP+PFR  N WL  +   +      S+ +  GW G     KL+ +K  +K W      E  +K+K +L  +A FD+  ++  LS E +  R F + EL 
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
        +L L EE +  QK ++ W+K GD N+ FFH+    ++ +  I  L +  G  L     I+ E+L +F KLY     +       DWS +     S L +P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EEI+  +  + ++KAPGP+GFT   F   W+ +K D +R+F+E HR+G +N     +FI L+ K+ +   I DFRPISL +S+YKI+AKVLA RL+ 
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   I   Q AFV+GRQILD +LIANE V+  R   ++G + K+D EKA+D V WDFLD VL  KGF  +W +W++GC+ +  ++V +NG  +G + AS
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF ++++V S ++ K   +   EGF VG+++  VS LQFADDTI F    +    TL   + +F   SGLK+N +KS + GIN++   
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        +   A  ++CK    P  YLGLPLGG+PK   FW PV++++ +++D W++  LS GGR+TL  S L+ +P YFLSLF + +S++  ++R+ R F W G  
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
          K +HLV W +V   +  GGLG G ++ RN+ALL KW WR+  E  + W +VI++IYG+   GW+  N    S R PW +IA ++Q F     F +G+G
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
         +IRF
Subjt:  MKIRF

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 1.0e-185; % identity: 38.01)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        ETK    D+ L+ S+WS ++  W  + + G  GG+LI+WD  KL+  E + G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNV R   E++  SR +  M+ F++ I +  L++ PL +  +TWS   ++     +DRFL S EW+ +F  S      R  SDH+P++LE   F 
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP+PF+  N WL  +   +      S+ +  GW G     KL+ +K  +K W      E  +K+K +L  +A FD+  ++  LS E +  R F + EL 
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
        +L L EE +  QK ++ W+K GD N+NFFH+    ++ +  I  L +  G  L     I+ E+L +F KLY +   +       DWS +     S L +P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EEI+  +  + ++KAPGP+ FT   F   W+ +K D +R+F+E HR+G +N     +FI LI K+ +   I DFRPISL +S+Y+I+AKVLA RL+ 
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   I   Q AFV+GRQILD +LIANE V+  R   ++G + K+D EKA+D V WDFLD VL  KGF  +W +W++GC+ +  ++V +NG  +G + AS
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF ++++V S ++ K   +   EGF VG+++  VS LQFADDTI F    +    TL   + +F   SGLK+N +KS + GINL+   
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        +   A  ++CK    P  YLGLPLGG+PK   FW PV++++ +++D W++  LS GGR+TL  S L+ +P YFLSLF + +S++  ++R+ R F W G  
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
          K +HLV W +V   +  GGLG G ++ RN+ALL KW WR+  E  + W +VI++IYG+   GW+  N    S R PW +IA ++Q F     F +G+G
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
         +IRF
Subjt:  MKIRF

RVX13544.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] (e value: 8.5e-185; % identity: 37.57)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        ETK    D+  + S+W++++  W  + + G  GG+LI+WD  KL   E + G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNV R   E++  SR T  M+ F+  I++  L++LPL +  FTWS    +     +DRFL S EW+  F  S      R  SDH+P++LE   F 
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP+PFR  N WL      +       + +  GW G     KL+ +K  +K W      E  ++++ +L  +  FD+  ++  LS E ++ R   + EL 
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
        +L L EE +  QK ++ W+K GD N+ FFH+    ++ +  I  L + +G  +  +  I+ E+L +F KLY +   +       DWS +S      L +P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EEI   +  + ++KAPGP+GFT   F   WE +K D +++F+E HR+G +N     +FI L+ K+ +   I DFRPISL +S+YKI+AKVLA R+++
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   I   Q AFV+GRQILD +LIANE V+  R   ++G + K+D EKA+D V WDFLD V+  KGF  +W +W++GC+ +  F+V +NG  +G + AS
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF ++++V S ++ K   +   EGF VG+++  VS LQFADDTI F    +    TL   + +F   SGLK+N +KS + GINL+   
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        +   A  ++CK    P  YLGLPLGG+PK   FW PV++++ +++D W++  LS GGR+TL  S L+ +P YFLSLF + +S++  ++R+ R F W G  
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
          K +HLV W +V   +  GGLG G ++ RN+ALL KW WR+  E  + W +VI++IYG+   GW+  N    S R PW +IA ++Q F     F +GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
         +IRF
Subjt:  MKIRF

RVX23556.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] (e value: 8.5e-185; % identity: 39.29)
Query:  IQLKEIQPFLPHETKMVSFDQCLIKSIWSSKDVGWVNVKSWG-----------RLGGLLILWDESKLKIVEFLQGGGDFNVTRWIHERIPVSRATRGMRQ
        ++L++    +  ETK    D+ L+ S+W+ ++  W  + + G           R    + L+D   L    +   GGDFNV R   E++  SR T  MR 
Subjt:  IQLKEIQPFLPHETKMVSFDQCLIKSIWSSKDVGWVNVKSWG-----------RLGGLLILWDESKLKIVEFLQGGGDFNVTRWIHERIPVSRATRGMRQ

Query:  FNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFVWGPSPFRVFNSWLNMADCIKIVELTLS
        F+  I+E  LL+ PL N  FTWS   +      +DRFL S EW  +F        +R  SDH+P+ L+   F WGP+PFR  N WL      +       
Subjt:  FNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFVWGPSPFRVFNSWLNMADCIKIVELTLS

Query:  QDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTN
          +  GW G     +L+ +K   K W  +      +K+KS+L ++A  DA  +D  L+SE +  R   + EL DL L EE +  QK ++ W+K GD N+ 
Subjt:  QDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTN

Query:  FFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSE
        FFH+    ++ +  I  L +  G  L  A  I  E+L +F KLY     +       DWS +S+    +L+APF  EEI   +  + ++KAPGP+GFT  
Subjt:  FFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSE

Query:  FFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIAN
         F   W+ +K D +R+F+E HR+G +N     +FI L+ K+     I DFRPISL +S+YKI+AKVL+ RL+ V+   I   Q AFV+GRQI+D +LIAN
Subjt:  FFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIAN

Query:  EAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDK
        E V+  R   ++G + K+D EKA+D V WDFLD+VL  KGF  KW +W+ GC+ +  ++V +NG  +G + ASRGLRQGDPLSPFLF L+++V S ++ +
Subjt:  EAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDK

Query:  IHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGH
           +   EGF VG+++  VS LQFADDTI F    +    TL   +  F   SGLK+N +KS + GINLD A +   A  + CK    P  YLGLPLGG+
Subjt:  IHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGH

Query:  PKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGAL
        P+   FW PV++++ +++D W++  LS GGR+TL  S L+ +P Y+LSLF L +S++  ++R+ R F W G    K +HLV W +V N ++ GGLG G +
Subjt:  PKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGAL

Query:  NQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNGMKIRF
        + RN+ALL KW WR+  E  + W +VI++IYG+   GW++      S R PW +I++++Q F S   F +GNG +IRF
Subjt:  NQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNGMKIRF

TrEMBL top hits (e value, % identity, alignment)
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein (e value: 3.9e-191; % identity: 39.78)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        ETK  ++D+  + S+W  K V W  + + G  GG++ILWD SKL+  E + G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNV R I E++  +R T  MR F++ I E GL++ PL N  FTWS    D     +DRFL S EWD  F  S      R  SDH P+ LE     
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP+PFR  N WL   +  +   +   +    GW G     KL+ +K  +K W  +   + K+++K +L +++  D   ++  L+S+ +  RT  R EL 
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
        D+ L EE    QK ++ W+K GD N+ FFHR    ++ +  I  L S  G +L    +I  E++ FF  LY     +       DW  +S      L  P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EE+   +  L K KAPGP+GFT   + + W+ +K D +R+F E H NG +N      FI L+ K+   + I D+RPISL +S+YKI+AKVL+ RL+K
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   IS  Q AFVEGR ILD +LIANE V+  R   ++G + K+D EKA+D VDW FLD VL  KGF +KW  WI+GC+ +  F++ +NG  +G + AS
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF L+++V S ++ +    G  EGF VG+D+  VS+LQFADDTI F K        L   + +F   SGLKIN EKS + GIN     
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        +   AS  +C+V   P +YLGLPLGG+PK   FW PV++++ +++D WK+  LS GGR+TL  S LS IP YFLSLF + +SI+  ++++ R+F W G  
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
          K +HLV W +VS  ++ GGLG G ++ RN+ALL KW WRF  E    W +VI +IYGT   GW++      S R PW +IA+++Q F       +GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
         +IRF
Subjt:  MKIRF

A0A803P8A0 Uncharacterized protein (e value: 2.5e-190; % identity: 39.89)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        E K  + D+  I SIW S+   W+ + + GR GG L++WD   + +++ L G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNVTR + E++  S +TR M+ F+ LI EL L++  L NG FTWS        S +DRFL    W+V+F   R    VR +SDH P+++++    
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP PFR  N WL      K  E    ++   GW G     KL+ L+   K W      + K  + +L   +   D +      +      R  ++ E  
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
         L   EER+   K K  W K GD N+ FFH  L A+K +  I+ +   +G  + +  EI  E++ FF+KLY +           +W  +++     L  P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  +E+   + +   +KAPGP+GF+   F   WE +K + + +F   H  G +   + + FICLI K      +KDFRPISL +SVYKI+AK LA RL+ 
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   IS  QSAFVEGRQILD +L+ANEAVE YR + KKG++LK+D EKA+D VDW FLD VL  KGF ++W +WI+GCV +  FS+F+NGR RG+   S
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF L+++V   +VDK     AF GF +G+D + +S LQFADDT+ F KD+D +   L++ +E F   SGLK+N  KS L GI L D  
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        V   A+ I C+V   P  YLG+PLGG P+K +FW+PVLDK  K++D WK   LSRGGRLTL  SVLSS+P+Y+LSLF +   +   L++++R FFWEG +
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
         +  +HLV W  V   +  GGL IG L  RN  LL KW WRF +E +S W +VI + YG +   W+++     S R PW+ IA ++  +  +  FK+GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
          IRF
Subjt:  MKIRF

A0A803QEA6 Uncharacterized protein (e value: 5.2e-188; % identity: 39.34)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        E K  + D+  I SIW S+   W+ + + GR GG L++WD   + +++ L G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
           GDFNVTR + E++  S  TR M+ F+ LI EL L++  L NG FTWS        S +DRFL +  W+++F   R    VR +SDH P+++++    
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP PFR  N WL+     K  E    ++ + GW G     KL+ L+  +K W      + + K+ +L   +   D     +  +   +  R  ++ E  
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
         L   EER +  K K  W + GD N+ FFH  L A+K +  I+ +   +G  +    EI  E++ FF+KLY +           +W  ++++    L  P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EE+   + +   NKAPGP+GF+       WE +K D + +F+  HR G +   + + FICLI K      +KDFRPISL +SVYKI+AK LA RL+ 
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   IS  QSAFVEGRQILD +L+ANEAVE YR + +KG++LK+D EKA+D VDW FLD VL  KGF ++W +WI+GCV +  FS+FINGR RG+   S
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF +I++V   +VDK     +  GF +G+D + +S LQFADDT+ F KD+  +   L++ ++ F   SGLK+N  KS L GI +++  
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        V   A  I C+V   P  YLG+ LGG P+K SFW+PVLDK  K++D WK   LSRGGRLTL  SVLSS+P+Y+LSLF     +   L++++R FFWEG +
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
         +  +HLV W  V   +  GGL IG L+ RN  LL KW WRF +EP+S W +VI + YG +   W+++     S R PW  I+ ++  +  L  FK+GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
         +IRF
Subjt:  MKIRF

A0A803QI00 Uncharacterized protein (e value: 8.6e-191; % identity: 40.11)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        E K  S D+  I SIW S+   W+ + + GR GG L++WD   + +++ L G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNVTR   E++  S  TR M+ F+ LI EL L++  L NG+FTWS        S +DRFL +  W+V++   R    VR +SDH P+++++    
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP PFR  N WL      K       +  S GW G    SKL+K +  +K W +    + K  +++L   +   D     N      +  R  ++ E  
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
         L   EER+   K K  W K GD N+ FFH  L A+K +  I+ +   DGS +    EI  E++GFF+KLY +   +     + +W  ++ +    L + 
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EE+   + +   +KAPGP+GF+   F   WE +K D + +F    + G +   + E FICLI K      +KDFRPISL +SVYKI+AK LA RL+ 
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   IS  QSAFVEGRQILD +LIANE VE +R + KKG++ K+D+EKA+D VDWDFLD VL  KGF + W +WI+GCV +  FS+ INGR RG+   S
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQGDPLSPFLF L+ +V   LVDK      F GF VG+D + +S LQFADDT+ F KD+  +   L++ +E F   SGLK+N  KS L GI+L++  
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        V   A  I C+V   P  YLG+PLGG P+K +FW+PVLDK  K++D WK   LSRGGRL L  SVLSS+P+Y+LSLF     +   +++++R FFWEG +
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
         +  +HLV W  V   +  GGL IG L  RN  LL KW WR+ +EP+S W +VI + YG +   W+++     S R PW  I+  +  +  L  FK+GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
          IRF
Subjt:  MKIRF

A0A803QQM3 Uncharacterized protein (e value: 8.9e-188; % identity: 39.56)
Query:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------
        E K  + D+  I SIW S+   W+ + + GR GG L++WD   + +++ L G                                                
Subjt:  ETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWDESKLKIVEFLQG------------------------------------------------

Query:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
          GGDFNVTR + E++  S  TR M+ F+ LI EL L++  L NG FTWS        S +DRFL S  W+V++   R    VR +SDH P+++++    
Subjt:  --GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV

Query:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL
        WGP PFR  N WL      K  E    ++ + GW G     KL+ L+  +K W      + K  + +L   +   D     +  +   +  R  ++ E  
Subjt:  WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELL

Query:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP
         L+  EER    K K  W + GD N+  FH  L A+K K  I+ +   +G  +    EI  E++ FF+KLY +           +W  + ++    L  P
Subjt:  DLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAP

Query:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK
        F  EE+   + +   NKAPGP+GF+       WE +K D + +F   HR G +   + + FICLI K      +KD+RPISL +SVYKI+AK LA RL+ 
Subjt:  FFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKK

Query:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS
        V+   IS  QSAFVEGRQILD +L+ANEAVE YR + KKG +LK+D EKA+D VDW FLD V+  KGF ++W +WI+GCV    FS+FINGR RG+   S
Subjt:  VIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVAS

Query:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK
        RGLRQ DPLSPFLF LI++V   +VDK     +  GF +G+D + +S LQFADDT+ F KD+  +   L++ +E F   SGLK+N  KS L G+ +D+  
Subjt:  RGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAK

Query:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE
        V   A +I C+V   P  YLG+PLGG P+K SFW+PVLDK   ++D WK   LSRGGRLTL  SVLSS+P+YFLSLF     +   L++++R FFWEG +
Subjt:  VCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNE

Query:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG
         +  +HLV W  V   +  GGL IG L  RN  LL KW WRF +E +S W +VI + YG +   W++++    S R PW  I+ ++  +  L  FK+GNG
Subjt:  GSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNG

Query:  MKIRF
          IRF
Subjt:  MKIRF

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 3.0e-47; % identity: 23.83)
Query:  GDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLEL-----PLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLE---
        GDFN    I +R    +  +  ++ N  +++  L+++     P S     +S P    + S ID  + SK   ++    R       +SDH  + LE   
Subjt:  GDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLEL-----PLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLE---

Query:  -----AGNFVWGPSPFRVFNSWLN---MADCIKIVELTLSQDKSYG--WVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLS
             + +  W  +   + + W++    A+     E   ++D +Y   W  F         K   +  F      ++++E+S +D +     + E  + +
Subjt:  -----AGNFVWGPSPFRVFNSWLN---MADCIKIVELTLSQDKSYG--WVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLS

Query:  SEEISLR---TFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLY----QALPEKR
          + S R   T +R+EL ++   +    I + +  + +  ++      R +  K+ K  I  + +  G       EI+  +  ++  LY    + L E  
Subjt:  SEEISLR---TFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLY----QALPEKR

Query:  VFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKD-F
         F   +    ++Q +  +L  P    EI   + +L   K+PGP+GFT+EF+ ++ E L    ++LF  + + G L +   E  I LI K     T K+ F
Subjt:  VFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKD-F

Query:  RPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYY-RVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWI
        RPISL +   KIL K+LA R+++ I  +I   Q  F+ G Q    I  +   +++  R K+K   I+ +D EKAFD +   F+ K L   G +  +++ I
Subjt:  RPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYY-RVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWI

Query:  QGCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELF
        +     P  ++ +NG+         G RQG PLSP LF ++ EV   L   I  +   +G  +G+++V +S+  FADD I++ ++       L++ I  F
Subjt:  QGCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELF

Query:  EWCSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKY--SFWQPVLDKVQKKIDRWKRINLSRGGRLTLCS-SVLSSIPLYF
           SG KIN +KS     N +          +   +      YLG+ L    K      ++P+L ++++  ++WK I  S  GR+ +   ++L  +   F
Subjt:  EWCSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKY--SFWQPVLDKVQKKIDRWKRINLSRGGRLTLCS-SVLSSIPLYF

Query:  LSL-FLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGW
         ++   L  +    L++    F W           +  S++S   K GG+ +        A + K  W
Subjt:  LSL-FLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGW

P08548 LINE-1 reverse transcriptase homolog | e value: 9.5e-46 | identity: 24.07%
Query:  GDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLEL----PLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGN-
        GDFN    + +R    + ++ +   N  I  L L ++      +  ++T+       + S ID  L  K     F    +   +   SDH  + +E  N 
Subjt:  GDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLEL----PLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGN-

Query:  -------FVWGPSPFRVFNSWL---NMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEE
                 W  +   + ++W+      +  K +E   +QD +Y  +     + LR   I ++   A  ++  +++  +L+  +   + +   N   S  
Subjt:  -------FVWGPSPFRVFNSWL---NMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEE

Query:  ISLRTFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKL----YQALPEKRVFPFNF
          + T +R+EL ++        I K K  + +  ++           K+ K LI+ + + +        EI+  +  ++ KL    Y+ L E   +    
Subjt:  ISLRTFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKL----YQALPEKRVFPFNF

Query:  DWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKD-FRPISLT
            +SQ +   L  P    EI   ++NL K K+PGP+GFTSEF+  F E L    + LF  + + G L +   E  I LI K     T K+ +RPISL 
Subjt:  DWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKD-FRPISLT

Query:  SSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYY-RVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRN
        +   KIL K+L  R+++ I  II   Q  F+ G Q    I  +   +++  ++KNK   IL +D EKAFD +   F+ + L   G E  +++ I+     
Subjt:  SSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYY-RVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRN

Query:  PKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGL
        P  ++ +NG          G RQG PLSP LF ++ EV   L   I  + A +G  +G +++ +S+  FADD I++ ++       L++ I+ +   SG 
Subjt:  PKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGL

Query:  KINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKY--SFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLS
        KIN  KS       ++         I   V      YLG+ L    K      ++ +  ++ + +++WK I  S  GR+ +    +    +Y  +   + 
Subjt:  KINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKY--SFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLS

Query:  SSISI--NLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALN--QRNMALLAKWGWRFMMEPHSFWRRV
        + +S   +L++I+  F W   +       +  +L+SN  K GG+ +  L    +++ +   W W    E    W R+
Subjt:  SSISI--NLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALN--QRNMALLAKWGWRFMMEPHSFWRRV

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 1.4e-20 | identity: 37.06%
Query:  VLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLA
        +L++V  ++  W+   LS  GRLTL  +VLSS+P++ +S  LL  SI   LD++ R+F W      K  HLV WS V + +K GGLG+ A    N AL++
Subjt:  VLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLA

Query:  KWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSL-AHFKLGNGMKIRF
        K GWR + E +S W  V+   Y   +   +       S  S W SIA   +  VS    +  G+G +IRF
Subjt:  KWGWRFMMEPHSFWRRVIVNIYGTSKFGWNSENRTCCSLRSPWLSIAKIWQRFVSL-AHFKLGNGMKIRF

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 6.8e-44 | identity: 23.72%
Query:  GDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLEL-----PLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGN
        GDFN      +R    +  R   +  +++ ++ L ++     P + G   +S P    + S ID  +  K     + N  +   +  +SDH  L L   N
Subjt:  GDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLEL-----PLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGN

Query:  FVWGPSP---FRVFNSWLN---MADCIK-----IVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSE
         +    P   +++ N+ LN   + + IK      +E   ++  +Y        +    +K  ++         +K++E +    +       E  + +S 
Subjt:  FVWGPSP---FRVFNSWLN---MADCIK-----IVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSE

Query:  EISLRTFVRSELLDLYLVEERNSIQK---CKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLY----QALPEKRVF
        + S R  +     ++  VE R +IQ+    +  + +  ++      R     + K+LI  + +  G       EI+  +  F+ +LY    + L E   F
Subjt:  EISLRTFVRSELLDLYLVEERNSIQK---CKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLY----QALPEKRVF

Query:  PFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLT-IKDFRP
           +    ++Q+Q   L +P   +EI   + +L   K+PGP+GF++EF+  F E L     +LF ++   G L +   E  I LI K +   T I++FRP
Subjt:  PFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLT-IKDFRP

Query:  ISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYY-RVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQG
        ISL +   KIL K+LA R+++ I +II P Q  F+ G Q    I  +   + Y  ++K+K   I+ LD EKAFD +   F+ KVL   G +  ++  I+ 
Subjt:  ISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYY-RVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQG

Query:  CVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEW
            P  ++ +NG     I    G RQG PLSP+LF ++ EV   L   I  +   +G  +G+++V +S+L  ADD I++  D       L+  I  F  
Subjt:  CVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEW

Query:  CSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKY--SFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSL
          G KIN  KS       +              +      YLG+ L    K      ++ +  ++++ + RWK +  S  GR+ +    +    +Y  + 
Subjt:  CSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKY--SFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSL

Query:  --FLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPH-SFWRRV
            + +     L+  +  F W   +       +  SL+ + + +GG+ +  L     A++ K  W +  +     W R+
Subjt:  --FLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPH-SFWRRV

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 3.1e-44 | identity: 25.8%
Query:  GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNG----KFTWSRPGDDS-SQSLIDRFLISKEWDVMFDNSRVSKQVRTISDH--FPLLLE
        GGDFN T    +R    +         +LI    L+++          FT+ R  D   SQS IDR  IS        +S +  ++   SDH    L + 
Subjt:  GGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNG----KFTWSRPGDDS-SQSLIDRFLISKEWDVMFDNSRVSKQVRTISDH--FPLLLE

Query:  AGNFVWGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFV----------IASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAK---AEDN
            +   + +   NS L      K V     +D   GW  F              K+  LK+  + +      +R  + ++L  E+   + +   +ED 
Subjt:  AGNFVWGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFV----------IASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAK---AEDN

Query:  QLSSEEISLRTFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPF
         L  E +  +  +R    ++   + R +  + ++  L   D  + FF+     K  +  IT L + DG+ L     I      F+  L+   P   + P 
Subjt:  QLSSEEISLRTFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPF

Query:  NFD--WS---MVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDF
          +  W    +VS+ +   L  P  ++E+   L+ +  NK+PG +G T EFF  FW+ L  DF R+ +E  + G L    +   + L+ K+  +  IK++
Subjt:  NFD--WS---MVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDF

Query:  RPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQ
        RP+SL S+ YKI+AK ++ RLK V+  +I P QS  V GR I D + +  + + + R        L LD EKAFD VD  +L   L    F  +++ +++
Subjt:  RPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAVEYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQ

Query:  GCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFE
            + +  V IN      +   RG+RQG PLS  L+ L  E F  L     L+    G ++ +  + V +  +ADD IL  +D   +     +  E++ 
Subjt:  GCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFE

Query:  WCSGLKINWEKSALCGINLDDAKVCHFASRI-NCKVEVLPFNYLGLPLGG--HPKKYSFWQPVLDKVQKKIDRWKRIN--LSRGGRLTLCSSVLSSIPLY
          S  +INW KS+  G+     KV        +   E     YLG+ L    +P   +F + + + V  ++ +WK     LS  GR  + + +++S   Y
Subjt:  WCSGLKINWEKSALCGINLDDAKVCHFASRI-NCKVEVLPFNYLGLPLGG--HPKKYSFWQPVLDKVQKKIDRWKRIN--LSRGGRLTLCSSVLSSIPLY

Query:  FLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGI
         L     +      + R L  F W G       H V   + S   K GG G+
Subjt:  FLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGI

Arabidopsis top hits | e value | % identity | Alignment
AT1G43760.1 DNAse I-like superfamily protein | e value: 3.0e-26 | identity: 25.23%
Query:  LGGLLILWDESKLKIV-----EFLQGGGDFN---VTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSS-QSLIDRFLISKEWDV
        LG + I+WD S   +V     + +   GDF+    T   +  +  S   RG+ +F   + +  L+++P     +TWS   DD+     +DR + + +W  
Subjt:  LGGLLILWDESKLKIV-----EFLQGGGDFN---VTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSS-QSLIDRFLISKEWDV

Query:  MFDNSRVSKQVRTISDHFPLLLEAGNFVWGPSPFRVFNSWLNMADCIKIVELTLSQDKS--YGWVGFVIASKLRK----LKINIKNWFAVFERERKQKEK
         F ++    ++  +SDH P ++   N          + S+L+      +V LT++ ++    G   F +   L+      K+  +  F   + + K+   
Subjt:  MFDNSRVSKQVRTISDHFPLLLEAGNFVWGPSPFRVFNSWLNMADCIKIVELTLSQDKS--YGWVGFVIASKLRK----LKINIKNWFAVFERERKQKEK

Query:  SLLDEIAWFDAKAEDNQLSSEEISLR--TFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVL
        SL    +       D+    E ++ +   F  + L   Y        QK ++ WL+ GD NT FFH+ + A + K LI  L   D   +    +++  ++
Subjt:  SLLDEIAWFDAKAEDNQLSSEEISLR--TFVRSELLDLYLVEERNSIQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVL

Query:  GFFTKLYQA-----LPE-----KRVFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLN
         ++T L  +      P+     K + PF  + ++ S  + SAL +    +EI   +  + +NKAPGP+ FT+EFF + W  +K   I    E  R GHL 
Subjt:  GFFTKLYQA-----LPE-----KRVFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPGPEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLN

Query:  SCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKIL
               I LI K   V  +  FRP+S  + VYKI+
Subjt:  SCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKIL

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.6e-14 | identity: 34.29%
Query:  LPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVS
        LP  YLGLPL       S + P+++K++ +I +W   +LS  GRL L SSV+ S+  +++S F L S+    +D I  SF W G E +     V WS V 
Subjt:  LPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKRINLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVS

Query:  NSQKNGGLGIGALNQRNM---------ALLAKWGWRFMME
          +  GGLGI +L + N            L  W W+ +++
Subjt:  NSQKNGGLGIGALNQRNM---------ALLAKWGWRFMME

AT4G20520.1 RNA binding; RNA-directed DNA polymerases | e value: 3.5e-11 | identity: 41.46%
Query:  LAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAV-EYYRVKNKKGW-ILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWI
        + +RLK ++ ++I P Q++F+ GR   D I+   EAV    R K  KGW +LKLD+EKA+D + WD+L+  L   GF + W+
Subjt:  LAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAV-EYYRVKNKKGW-ILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWI

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 2.6e-06 | identity: 28.57%
Query:  SIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIY
        ++P Y ++ FLL  ++   +  +L  F+W   + +K  H   W  +S  +  GG+G   +   N+ALL K  WR +  P S   +V  + Y
Subjt:  SIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIY

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 1.6e-11 | identity: 49.25%
Query:  INGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDT
        ING P+G +  SRGLRQGDPLSP+LF+L +EV S L  +   +G   G  V  +   ++ L FADDT
Subjt:  INGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVGQDKVHVSILQFADDT


Sequences
CDS sequence
ATGACAGGAGAGTTGGAAAGCCCTTTCACAGAAGAAGAGATCTTTAAAGTTGTTATGAGCCACGACAAACTCAAATCTTCAGGTTCGGATGACATGACTAGCACTGGCCG
GAGGTTGAAAAAGGCCCTTTCGCTAACCATTAGTAACTGCCAAGCGGCCTTCGTTCAGGGAAGACAAATTCTCGATGCTATTTTAGTAGCAACTAAAGCGATGTCTTCAG
TGGAGAAGATGGATCAGTGGCTGCCTAAGAAACACCAACTTCTCGATTATGATCAAGGTGGCTTGGAAATGCTTGCAGAAAGTCACCTCCCCATGATGAAAACCAAGGCC
ACTCCTGGCAAGCAAAATGTGCCTGCATCACCACCTCATCCTGAAGGTTCCTCTAACCCCTCCCTTTCTAAACATAAAGTAGAGCTCGAGCATTTGAAGGAGCAAAGTAG
AGCCTATTGGGCCTATGCAAGAGAGAGAGACAAAGCCATCCGTGGAATGAACGACCAACAAAGATCGTATTACATAGATAACAACTTCTCCCCTTCCCTTCCCTTAAGGC
AAAACCGTATGGCTAGCTATAAGCTTGGTGAAAGCGAAGTGCTTACCAAGTGCAAAGGAAAGGGCCTTCGCCACTTGAGTCAGGAAACCAAAAGACGCGAAGCATTTCAT
TCCACTGAATCTGGATTCAAGGAAGCATTAATTAAAGAGGTTAATTGTAATTTAATTGGGCCGGTTGAGATATCAAAAGAGAAGAGAGTTTTGTTGCAAGAAAGAGATTT
TAATGCCAACGGAAAGGGTATTAATGCCATCGGTTCAGATATTCAAGGAGCATTAACTGATGGAGCTTTGAATGAGTCACCGGATTTATTATTCACACCTATTCATGACC
CACCTTCGGATTTGAAGAGTTGTAATGCAGCTGGATTGAAAGAAAACAAACAGAATGTTTCTAAGGCTTTAAAGAAGAAATATGAATCATTTCCTCTTCATTATTCTCGA
AGGAAATGTGAAAAGTCGGATATTTTGGACTCAATTCCCATTAATTCCAATTATAACCCTGATGTTATTGAAGAATCTTGTTCTCAATTTTTGCTCTCTATTTTGAATCA
GCCTAGGTGCTGTCAAAATAATCTTAATGAGTTATCAAATTCCATTTCATCCAATCAGTACATTCTTTCAAACATTCAATCCAACCCTTCTTTAACAAAGGGGGTTTTTA
TTCCTTCATCCAAAGTTGAAATTAAAGTTGATCAATCTTATTCATCTCCTATTGATTCTGATGATGATTCAGTAGTGAGTATTAGTAGTGTTGAGGCTGAAAATCAGTAT
TTGAATGATGAAAACAATGAATTATTGGAGGAAGACTCTTTTGCAATGGCTTTTAATCGGATTTTCCAGAATGATGATGATGTTTCTGAAGTTCAGTTGAATGCTTGTGA
TGTTTTGGCAACACCCTTAGTCTCGGTTCCAAGCAAATTTTCATCCCTTTTGGAAGATTGTGACATTCAGTTAAAGGAAATTCAGCCCTTTTTACCCCATGAGACTAAGA
TGGTATCTTTTGATCAATGTTTGATAAAATCCATATGGAGCTCTAAAGATGTTGGTTGGGTTAATGTGAAATCATGGGGAAGATTGGGAGGGTTACTGATTTTGTGGGAT
GAGAGCAAATTGAAAATTGTGGAATTTCTTCAAGGCGGTGGAGATTTTAATGTGACCCGTTGGATTCATGAAAGAATTCCAGTTAGTAGAGCAACAAGAGGGATGAGACA
ATTTAATAAGCTTATTAATGAATTAGGCTTATTAGAGTTGCCTTTATCCAATGGTAAATTTACATGGTCAAGACCAGGGGATGATTCTTCTCAATCTCTTATCGACAGAT
TTCTGATTTCTAAGGAATGGGATGTGATGTTTGATAATTCTAGAGTCTCCAAACAGGTCCGTACTATTTCTGATCATTTCCCTCTCCTTCTTGAAGCTGGTAATTTTGTA
TGGGGTCCATCCCCATTTCGGGTTTTTAATAGTTGGTTGAATATGGCTGATTGTATCAAAATTGTGGAGCTTACTTTATCTCAAGATAAATCCTATGGCTGGGTTGGTTT
TGTTATTGCTTCTAAGCTCAGAAAATTGAAAATCAACATTAAAAATTGGTTTGCTGTGTTTGAAAGAGAGAGGAAGCAGAAAGAAAAAAGTCTTTTAGATGAAATTGCTT
GGTTTGATGCAAAAGCCGAGGATAATCAATTATCTTCAGAGGAAATTAGTCTTCGAACTTTTGTAAGAAGTGAGCTTTTAGATCTATACTTAGTTGAAGAAAGAAATTCA
ATTCAAAAATGTAAATTGCTTTGGCTAAAAGCAGGGGATGAAAATACCAATTTCTTTCACAGATTCTTGGCTGCAAAGAAGAGAAAATTATTGATTACTGGTCTGAATTC
TATTGATGGTAGTTCTCTGTTGACGGCTGGGGAGATTGAATTTGAAGTTCTAGGGTTTTTCACCAAACTCTATCAAGCATTACCAGAAAAAAGAGTTTTTCCTTTTAACT
TTGATTGGTCTATGGTTTCACAAAATCAAAATTCTGCACTGATTGCTCCTTTTTTTGTTGAGGAAATTTGGTTGCCATTGAAGAATCTTGGTAAAAATAAAGCGCCTGGG
CCCGAGGGATTCACTTCAGAATTCTTTATCAAGTTTTGGGAATTTTTGAAAGCTGATTTTATTAGGCTTTTTTCGGAACTTCATCGAAATGGTCATCTCAATTCATGTTT
GAAGGAGAATTTTATTTGCTTGATTCAAAAGGAGGAGGTGGTTTTAACCATAAAGGATTTTAGGCCAATAAGTTTGACTTCTTCGGTGTACAAGATCCTTGCTAAAGTGC
TTGCTAAGCGTTTGAAAAAGGTAATACCTTCGATTATTTCTCCTTATCAAAGTGCTTTTGTTGAAGGAAGACAGATTTTAGACCCTATTCTTATTGCCAATGAAGCTGTG
GAATATTATAGGGTAAAAAATAAGAAAGGTTGGATTTTAAAGCTTGATATTGAAAAGGCTTTTGATTGTGTTGATTGGGATTTTCTGGATAAAGTGCTCTGTTTTAAGGG
TTTTGAAAAAAAATGGATTCAATGGATCCAAGGTTGTGTTAGAAATCCTAAATTTTCAGTTTTCATAAATGGCCGACCTCGTGGAAGAATTGTTGCATCTCGTGGGTTAA
GACAAGGAGATCCACTTTCTCCTTTCTTATTTCTTTTAATCAGTGAGGTTTTCAGTGCTTTGGTTGACAAAATTCATCTAAAGGGAGCTTTCGAAGGTTTTCTAGTTGGT
CAAGACAAGGTACATGTTTCTATTCTTCAATTTGCAGATGATACTATCTTATTTTGCAAGGATGATGATGGTATGTTTAATACCTTAATTCAAACCATTGAACTTTTCGA
ATGGTGCTCGGGTTTGAAGATTAATTGGGAAAAATCTGCATTATGTGGTATCAATTTGGATGATGCAAAGGTTTGTCATTTTGCCTCGCGTATTAATTGTAAGGTTGAAG
TTTTGCCTTTTAATTACTTGGGGCTTCCATTGGGAGGTCATCCGAAAAAATACTCTTTTTGGCAACCGGTGCTTGATAAAGTTCAAAAGAAGATTGATAGATGGAAAAGA
ATTAATTTATCTCGTGGAGGGCGACTAACTCTTTGTTCTTCTGTTTTATCAAGTATCCCATTATATTTCTTGTCATTATTCTTATTGTCATCTTCCATTAGCATAAACCT
TGACAGGATCTTACGATCATTCTTCTGGGAAGGCAATGAAGGAAGCAAAGTTAATCATTTGGTCGGATGGAGTCTTGTATCAAATTCTCAAAAAAATGGTGGCCTTGGAA
TTGGAGCTTTGAACCAAAGGAATATGGCTTTATTAGCCAAATGGGGTTGGCGGTTTATGATGGAACCTCACTCTTTTTGGAGAAGAGTTATAGTCAATATTTATGGTACT
AGCAAGTTTGGTTGGAATTCTGAAAATAGGACATGTTGCAGCCTCCGTAGTCCTTGGTTGTCCATTGCTAAAATTTGGCAGCGTTTTGTTTCTCTTGCACACTTCAAATT
GGGTAATGGAATGAAAATCAGATTTGGGAAGATCCTTGGCTGA
mRNA sequence
ATGACAGGAGAGTTGGAAAGCCCTTTCACAGAAGAAGAGATCTTTAAAGTTGTTATGAGCCACGACAAACTCAAATCTTCAGGTTCGGATGACATGACTAGCACTGGCCG
GAGGTTGAAAAAGGCCCTTTCGCTAACCATTAGTAACTGCCAAGCGGCCTTCGTTCAGGGAAGACAAATTCTCGATGCTATTTTAGTAGCAACTAAAGCGATGTCTTCAG
TGGAGAAGATGGATCAGTGGCTGCCTAAGAAACACCAACTTCTCGATTATGATCAAGGTGGCTTGGAAATGCTTGCAGAAAGTCACCTCCCCATGATGAAAACCAAGGCC
ACTCCTGGCAAGCAAAATGTGCCTGCATCACCACCTCATCCTGAAGGTTCCTCTAACCCCTCCCTTTCTAAACATAAAGTAGAGCTCGAGCATTTGAAGGAGCAAAGTAG
AGCCTATTGGGCCTATGCAAGAGAGAGAGACAAAGCCATCCGTGGAATGAACGACCAACAAAGATCGTATTACATAGATAACAACTTCTCCCCTTCCCTTCCCTTAAGGC
AAAACCGTATGGCTAGCTATAAGCTTGGTGAAAGCGAAGTGCTTACCAAGTGCAAAGGAAAGGGCCTTCGCCACTTGAGTCAGGAAACCAAAAGACGCGAAGCATTTCAT
TCCACTGAATCTGGATTCAAGGAAGCATTAATTAAAGAGGTTAATTGTAATTTAATTGGGCCGGTTGAGATATCAAAAGAGAAGAGAGTTTTGTTGCAAGAAAGAGATTT
TAATGCCAACGGAAAGGGTATTAATGCCATCGGTTCAGATATTCAAGGAGCATTAACTGATGGAGCTTTGAATGAGTCACCGGATTTATTATTCACACCTATTCATGACC
CACCTTCGGATTTGAAGAGTTGTAATGCAGCTGGATTGAAAGAAAACAAACAGAATGTTTCTAAGGCTTTAAAGAAGAAATATGAATCATTTCCTCTTCATTATTCTCGA
AGGAAATGTGAAAAGTCGGATATTTTGGACTCAATTCCCATTAATTCCAATTATAACCCTGATGTTATTGAAGAATCTTGTTCTCAATTTTTGCTCTCTATTTTGAATCA
GCCTAGGTGCTGTCAAAATAATCTTAATGAGTTATCAAATTCCATTTCATCCAATCAGTACATTCTTTCAAACATTCAATCCAACCCTTCTTTAACAAAGGGGGTTTTTA
TTCCTTCATCCAAAGTTGAAATTAAAGTTGATCAATCTTATTCATCTCCTATTGATTCTGATGATGATTCAGTAGTGAGTATTAGTAGTGTTGAGGCTGAAAATCAGTAT
TTGAATGATGAAAACAATGAATTATTGGAGGAAGACTCTTTTGCAATGGCTTTTAATCGGATTTTCCAGAATGATGATGATGTTTCTGAAGTTCAGTTGAATGCTTGTGA
TGTTTTGGCAACACCCTTAGTCTCGGTTCCAAGCAAATTTTCATCCCTTTTGGAAGATTGTGACATTCAGTTAAAGGAAATTCAGCCCTTTTTACCCCATGAGACTAAGA
TGGTATCTTTTGATCAATGTTTGATAAAATCCATATGGAGCTCTAAAGATGTTGGTTGGGTTAATGTGAAATCATGGGGAAGATTGGGAGGGTTACTGATTTTGTGGGAT
GAGAGCAAATTGAAAATTGTGGAATTTCTTCAAGGCGGTGGAGATTTTAATGTGACCCGTTGGATTCATGAAAGAATTCCAGTTAGTAGAGCAACAAGAGGGATGAGACA
ATTTAATAAGCTTATTAATGAATTAGGCTTATTAGAGTTGCCTTTATCCAATGGTAAATTTACATGGTCAAGACCAGGGGATGATTCTTCTCAATCTCTTATCGACAGAT
TTCTGATTTCTAAGGAATGGGATGTGATGTTTGATAATTCTAGAGTCTCCAAACAGGTCCGTACTATTTCTGATCATTTCCCTCTCCTTCTTGAAGCTGGTAATTTTGTA
TGGGGTCCATCCCCATTTCGGGTTTTTAATAGTTGGTTGAATATGGCTGATTGTATCAAAATTGTGGAGCTTACTTTATCTCAAGATAAATCCTATGGCTGGGTTGGTTT
TGTTATTGCTTCTAAGCTCAGAAAATTGAAAATCAACATTAAAAATTGGTTTGCTGTGTTTGAAAGAGAGAGGAAGCAGAAAGAAAAAAGTCTTTTAGATGAAATTGCTT
GGTTTGATGCAAAAGCCGAGGATAATCAATTATCTTCAGAGGAAATTAGTCTTCGAACTTTTGTAAGAAGTGAGCTTTTAGATCTATACTTAGTTGAAGAAAGAAATTCA
ATTCAAAAATGTAAATTGCTTTGGCTAAAAGCAGGGGATGAAAATACCAATTTCTTTCACAGATTCTTGGCTGCAAAGAAGAGAAAATTATTGATTACTGGTCTGAATTC
TATTGATGGTAGTTCTCTGTTGACGGCTGGGGAGATTGAATTTGAAGTTCTAGGGTTTTTCACCAAACTCTATCAAGCATTACCAGAAAAAAGAGTTTTTCCTTTTAACT
TTGATTGGTCTATGGTTTCACAAAATCAAAATTCTGCACTGATTGCTCCTTTTTTTGTTGAGGAAATTTGGTTGCCATTGAAGAATCTTGGTAAAAATAAAGCGCCTGGG
CCCGAGGGATTCACTTCAGAATTCTTTATCAAGTTTTGGGAATTTTTGAAAGCTGATTTTATTAGGCTTTTTTCGGAACTTCATCGAAATGGTCATCTCAATTCATGTTT
GAAGGAGAATTTTATTTGCTTGATTCAAAAGGAGGAGGTGGTTTTAACCATAAAGGATTTTAGGCCAATAAGTTTGACTTCTTCGGTGTACAAGATCCTTGCTAAAGTGC
TTGCTAAGCGTTTGAAAAAGGTAATACCTTCGATTATTTCTCCTTATCAAAGTGCTTTTGTTGAAGGAAGACAGATTTTAGACCCTATTCTTATTGCCAATGAAGCTGTG
GAATATTATAGGGTAAAAAATAAGAAAGGTTGGATTTTAAAGCTTGATATTGAAAAGGCTTTTGATTGTGTTGATTGGGATTTTCTGGATAAAGTGCTCTGTTTTAAGGG
TTTTGAAAAAAAATGGATTCAATGGATCCAAGGTTGTGTTAGAAATCCTAAATTTTCAGTTTTCATAAATGGCCGACCTCGTGGAAGAATTGTTGCATCTCGTGGGTTAA
GACAAGGAGATCCACTTTCTCCTTTCTTATTTCTTTTAATCAGTGAGGTTTTCAGTGCTTTGGTTGACAAAATTCATCTAAAGGGAGCTTTCGAAGGTTTTCTAGTTGGT
CAAGACAAGGTACATGTTTCTATTCTTCAATTTGCAGATGATACTATCTTATTTTGCAAGGATGATGATGGTATGTTTAATACCTTAATTCAAACCATTGAACTTTTCGA
ATGGTGCTCGGGTTTGAAGATTAATTGGGAAAAATCTGCATTATGTGGTATCAATTTGGATGATGCAAAGGTTTGTCATTTTGCCTCGCGTATTAATTGTAAGGTTGAAG
TTTTGCCTTTTAATTACTTGGGGCTTCCATTGGGAGGTCATCCGAAAAAATACTCTTTTTGGCAACCGGTGCTTGATAAAGTTCAAAAGAAGATTGATAGATGGAAAAGA
ATTAATTTATCTCGTGGAGGGCGACTAACTCTTTGTTCTTCTGTTTTATCAAGTATCCCATTATATTTCTTGTCATTATTCTTATTGTCATCTTCCATTAGCATAAACCT
TGACAGGATCTTACGATCATTCTTCTGGGAAGGCAATGAAGGAAGCAAAGTTAATCATTTGGTCGGATGGAGTCTTGTATCAAATTCTCAAAAAAATGGTGGCCTTGGAA
TTGGAGCTTTGAACCAAAGGAATATGGCTTTATTAGCCAAATGGGGTTGGCGGTTTATGATGGAACCTCACTCTTTTTGGAGAAGAGTTATAGTCAATATTTATGGTACT
AGCAAGTTTGGTTGGAATTCTGAAAATAGGACATGTTGCAGCCTCCGTAGTCCTTGGTTGTCCATTGCTAAAATTTGGCAGCGTTTTGTTTCTCTTGCACACTTCAAATT
GGGTAATGGAATGAAAATCAGATTTGGGAAGATCCTTGGCTGA
Protein sequence
MTGELESPFTEEEIFKVVMSHDKLKSSGSDDMTSTGRRLKKALSLTISNCQAAFVQGRQILDAILVATKAMSSVEKMDQWLPKKHQLLDYDQGGLEMLAESHLPMMKTKA
TPGKQNVPASPPHPEGSSNPSLSKHKVELEHLKEQSRAYWAYARERDKAIRGMNDQQRSYYIDNNFSPSLPLRQNRMASYKLGESEVLTKCKGKGLRHLSQETKRREAFH
STESGFKEALIKEVNCNLIGPVEISKEKRVLLQERDFNANGKGINAIGSDIQGALTDGALNESPDLLFTPIHDPPSDLKSCNAAGLKENKQNVSKALKKKYESFPLHYSR
RKCEKSDILDSIPINSNYNPDVIEESCSQFLLSILNQPRCCQNNLNELSNSISSNQYILSNIQSNPSLTKGVFIPSSKVEIKVDQSYSSPIDSDDDSVVSISSVEAENQY
LNDENNELLEEDSFAMAFNRIFQNDDDVSEVQLNACDVLATPLVSVPSKFSSLLEDCDIQLKEIQPFLPHETKMVSFDQCLIKSIWSSKDVGWVNVKSWGRLGGLLILWD
ESKLKIVEFLQGGGDFNVTRWIHERIPVSRATRGMRQFNKLINELGLLELPLSNGKFTWSRPGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLEAGNFV
WGPSPFRVFNSWLNMADCIKIVELTLSQDKSYGWVGFVIASKLRKLKINIKNWFAVFERERKQKEKSLLDEIAWFDAKAEDNQLSSEEISLRTFVRSELLDLYLVEERNS
IQKCKLLWLKAGDENTNFFHRFLAAKKRKLLITGLNSIDGSSLLTAGEIEFEVLGFFTKLYQALPEKRVFPFNFDWSMVSQNQNSALIAPFFVEEIWLPLKNLGKNKAPG
PEGFTSEFFIKFWEFLKADFIRLFSELHRNGHLNSCLKENFICLIQKEEVVLTIKDFRPISLTSSVYKILAKVLAKRLKKVIPSIISPYQSAFVEGRQILDPILIANEAV
EYYRVKNKKGWILKLDIEKAFDCVDWDFLDKVLCFKGFEKKWIQWIQGCVRNPKFSVFINGRPRGRIVASRGLRQGDPLSPFLFLLISEVFSALVDKIHLKGAFEGFLVG
QDKVHVSILQFADDTILFCKDDDGMFNTLIQTIELFEWCSGLKINWEKSALCGINLDDAKVCHFASRINCKVEVLPFNYLGLPLGGHPKKYSFWQPVLDKVQKKIDRWKR
INLSRGGRLTLCSSVLSSIPLYFLSLFLLSSSISINLDRILRSFFWEGNEGSKVNHLVGWSLVSNSQKNGGLGIGALNQRNMALLAKWGWRFMMEPHSFWRRVIVNIYGT
SKFGWNSENRTCCSLRSPWLSIAKIWQRFVSLAHFKLGNGMKIRFGKILG