; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy04g003230 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy04g003230
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr04:24271619..24274833
RNA-Seq ExpressionLcy04g003230
SyntenyLcy04g003230
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]2.9e-24139Show/hide
Query:  FQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQ-GNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGD
        F+N FVV+  G+ GGLALFW S V++ I S+S  HID  ++ Q G  +R T +YG  +   +H +W LL+ L+++    W   GD N I   +EK G  D
Subjt:  FQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQ-GNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGD

Query:  --------------------------------------------------------FERLQSQHFVNTLDD-----------------------------
                                                                F+ L +   VN + D                             
Subjt:  --------------------------------------------------------FERLQSQHFVNTLDD-----------------------------

Query:  --------------------------------CALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVI--EAVLQE
                                        C L D+GFKG++FTW N + G  +I +RLDR + + D  + F ++    L    SDH  I  E  +  
Subjt:  --------------------------------CALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVI--EAVLQE

Query:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGD--ESPSSLLTRI-RSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKR
           H + +   P   +E+ W+ YE C+ I+++ WE   G+  ESP     R+ +     LK W + +    K + +E   R+++  +  L+      +++
Subjt:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGD--ESPSSLLTRI-RSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKR

Query:  EEKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTE
         E  + N+L++EE+YWKQRSR +WLK GD+NT++FHSKA+ RR+KNKI  + D +G+  ++ E IE  F  +F +LF SS  +   + E ++    +V++
Subjt:  EEKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTE

Query:  EMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIA
        EMN HL+  F   ++  AL +M PTKAPGPDGLPA F+Q++W IVGE +  TCL  LNE    D LN T I LIPK + P  V ++RPISLCNV Y+I+A
Subjt:  EMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIA

Query:  KVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSIL
        K +ANRLK IL+ IIS  QSAF+P RLI DNVI+GYECLH I+ SK ++N   ++KLD+SKAYDRVEW FL + M+ +GFS+ WI  I  C+ +  FS+L
Subjt:  KVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSIL

Query:  INGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDK
        ING      KP RGLRQG PLSPYLF++CAE  S+LL +AE  Q I GL+ A+    I+HL FADDSL+F KA   +   ++ I   Y  A+GQ+ NF+K
Subjt:  INGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDK

Query:  SEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDL
        S + F    ++E    + +I ++K+V  + KYLG P  L RNK +    +  KV   +  W  KLFS  GKE+LIKAVAQAVP Y +S FKLP+G C D+
Subjt:  SEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDL

Query:  NRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELA
         + +A FWWG+ +    IHW +W++M+K K RGG+GFRD+  FNQAL+AK  WRL+  P SLM +++KARYY    F  A +GS PS++WR I+WG ++ 
Subjt:  NRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELA

Query:  MTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPLTDVQLIV-----------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGY
          G RWR+G+GK + +Y D+WIPR  TF+ +SP + L    ++                  F ++D+E I  I +      D   WH+DK G+Y+V+SGY
Subjt:  MTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPLTDVQLIV-----------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGY

Query:  KLAIDGCHEASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
        +LA++        +++ S   WK  WML++P K+KIF WRA  NILPT   L  RR   +P C +C    ET  H L
Subjt:  KLAIDGCHEASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]1.2e-24242.56Show/hide
Query:  MNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNF-FRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAIT
        M  ++  + F N  +V C G SGG+AL W   +NL + SY++ HID  I E   ++ +R T  YG P+   R+ SW LL  L+      W+  GD N I 
Subjt:  MNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNF-FRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAIT

Query:  QVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRK
         +NEK GG +  + Q   F + ++ C   DLG+ G  +TW N Q G   I  RLDR +   D    F  + ++HL   + DH  +  V      H   R 
Subjt:  QVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRK

Query:  KTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEE
        +   F FE  WT  EDC  II+A W       +P  +   +R    EL +W     G+   +I + + R+  L    L+  +   + R  + ++ +L +E
Subjt:  KTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEE

Query:  EIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVA
        E YW QR++  WLK GDRNT++FH++A+ RR++N I  I D +G   +N E I +  + YF+ ++ SS  +   ++E+ +A   +VTEEMN  L R F  
Subjt:  EIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVA

Query:  MEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILH
         EV  ALKQ+HP KAPGPDG+ A+F+Q+YWSIVG  V +  L  LN      +LN+T I LIPK  NP  + D+RPISLCNV YK+I+K+LANRLK +L 
Subjt:  MEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILH

Query:  SIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPS
         IIS+ QSAF   RLI DNV+V +E +HY+    + K  F +IKLDMSKA+DRVEW F+ K+M +MGF + W   +  C+ SVS+SILING    +  PS
Subjt:  SIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPS

Query:  RGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANE
        RGLRQGDPLSP LFL+CAEGLS+L+ +A  ++ I+G+ I +  P ++HLFFADDS+LF KA  EE   ++ IL  YE A+GQ +N DKS I F    A E
Subjt:  RGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANE

Query:  VKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSS
         +  + NIL       H KYLG PS + R+K      +  KV   L GWK KL S  GKE+LIKAVAQA+PTYT+SCF LPQG C D+ R+M NFWWG  
Subjt:  VKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSS

Query:  EQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGK
         Q  K+ W+ W+ M   K  GG+GFR+++ FN A+LAK  WR+L+ P SL+ ++LKARY+P  D L A LGS PSY WR I    E+   G RWRVGNGK
Subjt:  EQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGK

Query:  SIDIYDDQWIPRMKTFRLLSPQRPLTDVQLI------------------VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLA---IDGCH
         I I++D+W+P   T++++SPQ    +  L+                  +F   +VE I  IP++ +   D   W  +K G+++V+S Y +A   ID   
Subjt:  SIDIYDDQWIPRMKTFRLLSPQRPLTDVQLI------------------VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLA---IDGCH

Query:  EASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
            S        WK LW+LN+P K+KIFAWRA ++ LPT   +  R +     CP CG + E   HAL
Subjt:  EASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]8.7e-24643.67Show/hide
Query:  GGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHFVNTL
        GGLA  W + V L +I+++  H+   + E  G  +  T  YG P  + +  SW LL+ L       W+V GD NA    +EK      +  Q + F   L
Subjt:  GGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHFVNTL

Query:  DDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKT---PHFKFEETWTLYEDCAPI
          C L DLGFKG  +TW N +PG    + RLDR V N +  + F    + HL   +SDH  +   +Q     SFS+ +      FKFEE+W L ++CA +
Subjt:  DDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKT---PHFKFEETWTLYEDCAPI

Query:  IQAGWEKERGD-ESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRL-EGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDR
        IQ  W    G+ +  +++  +I++   EL  WG + T      I E +++++ LNE+ L E S    L   +K +D++L ++EIYW QRSR  WL+ GDR
Subjt:  IQAGWEKERGD-ESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRL-EGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDR

Query:  NTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGP
        NT++FH+KA+ RR+KN I+ I + +G   EN E++ +    YF  LF++   +   M+E + A  T+VTE+M   L   F A EV+ AL QM PTKAPGP
Subjt:  NTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGP

Query:  DGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIAD
        DG+ ALFYQ++W IVG++V++  L  LN G M  ++N T IVLIPK +NP  ++++RPISLCNV YKII+KVLANRLK +L  IIS TQSAFVPGRLI D
Subjt:  DGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIAD

Query:  NVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCA
        NV+V YE LH +   K  K    ++KLD+SKAYDRVEW FL+ IM KMGF + WI+++  CV + SFSIL+NG   E  +PSRG+RQGDP+SPYLFL+CA
Subjt:  NVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCA

Query:  EGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHG
        EGL++LL KAE++  I+G+ I + AP I++L FADDSLLF +A + E + +  IL++YE A+GQ +N +KS   F    +   KG +  IL VK V    
Subjt:  EGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHG

Query:  KYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPK
        KYLG P+ + R K  + + +  +V K LQGWK  L S AGKE+LIKAVAQA+PTYT+S F++P   C +L  L A FWWG     RKIHW  W+ +T PK
Subjt:  KYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPK

Query:  DRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRL
          GGMGFRD+R FN A+LAK  WRL+    SL+ +  KARY+P+  FL+A      S+VWR ++  + +   G  WRVGNG SI+   D+W+P   T ++
Subjt:  DRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRL

Query:  L-SPQRPLTDV---QLI--------------VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLA---IDGCHEASSSANSFSCSWWKTLW
        L S QR  +++   +LI              +F +D+ E I  IP++   +PD+ FW Y   G ++V+S Y +A   +   +   +S    + + W  +W
Subjt:  L-SPQRPLTDV---QLI--------------VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLA---IDGCHEASSSANSFSCSWWKTLW

Query:  MLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
         L +PNK+K+FAWRA   ILPT + L TR++ +D +C  C + +E+TVHAL
Subjt:  MLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]1.7e-24141.37Show/hide
Query:  METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIE--FQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWM
        METKS +  +  +++    ++   V+  G  GGLAL W   + + I +Y++ HID  IE  + G  + FT  YG P    R  SW  L+ L       W+
Subjt:  METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIE--FQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWM

Query:  VAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQE
          GD N IT + EK+GG    R Q ++FV+ ++ C  R++ F G ++TW  ++     IR+RLDR + N + ++LF +  + HL   +SDH  +   L  
Subjt:  VAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQE

Query:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEK
               RK    F+FE  W     C  I++A WE      +   L + +     +L++W + + G    +ISE ++++E L        +L  L+    
Subjt:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEK

Query:  CLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMN
         L+  L +E+  W+QRSR  W + GDRNT +FH+KA+ R QKN I  I D +G   E+   IE   + YF +LF SS    +   +I+ A   +VT +MN
Subjt:  CLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMN

Query:  LHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVL
        + L R + A EV  ALKQM+P KAPGPDG+P LF+Q +W+  GE V +T L  LN G  P   NET IVLIPK   P  V+DYRPISLCNV YKI +K +
Subjt:  LHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVL

Query:  ANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILING
        ANRLK  L SIISDTQSAFV GRLI DNV+V +E +H+I R K  K    +IKLDMSKAYDRVEW+F+ KIM K+GF       I  C+ +VS++I ING
Subjt:  ANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILING

Query:  TCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEI
               PSRG+RQGDPLSPYLFL+CAEGLS+L+K +  + ++ G+ I +  P +SHLFFADDSL+F KA   E D +QR+L VYE A+GQ +N  K+ +
Subjt:  TCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEI

Query:  AFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRL
         F +    E++  +      ++++ H KYLG PS + +NK+++ N I  K+ K L GWK KL S AGKE+LIKAVA AVPTYT+SCFKLP   C +L  +
Subjt:  AFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRL

Query:  MANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTG
        +  FWWG  +   +I WL W+ M + K  GGMGF++++LFN ALLAK  WRL     SL+ ++LKA+Y+P+ +F+ A+LG+ PSY WR I+  + L   G
Subjt:  MANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTG

Query:  GRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPL------------------TDVQLIVFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKL
         +WRVGNG SI +++D+W+P   + ++++P+  L                  T+V   VF   + + I+ IPI+    PD   W     G +TVRS YKL
Subjt:  GRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPL------------------TDVQLIVFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKL

Query:  AIDGCHEASSSANSFSC---SWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
        A++     +  A S +    S+W+ +W + +P+K++ F WRA  N LPT   L+ R++  D  C  C +  E+  H L
Subjt:  AIDGCHEASSSANSFSC---SWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

XP_042965942.1 uncharacterized protein LOC122299620 [Carya illinoinensis]1.3e-24442Show/hide
Query:  ETKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI---EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWM
        ET+   S M+  + +L  +N   V+C G  GGLALFW   V+L I+ YSK HI   I   E +   +  T +YG P+  +RHL+WTL+R L ++ +  W+
Subjt:  ETKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI---EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWM

Query:  VAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQE
        + GD N I    EK GG D    Q ++F + +DDC ++DLG++G  +TW N +  +  I  RLDR + N          ++ H     SDH  +      
Subjt:  VAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQE

Query:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEG--SMLTLLKRE
        Y T   SR+    F+FE  W+  E C+ +I+  W +    +   +L+   + V E+LK W +   G  K +++E K  +  L +    G  S   +  RE
Subjt:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEG--SMLTLLKRE

Query:  EKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEE
        E  ++  L  EE  W QRS+  W++ GD+N+R+FHSKA+ R++KN I+++ D +    E R D+E   ++YF +LF SS    +   E      ++VT  
Subjt:  EKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEE

Query:  MNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAK
        MN  L + F A EV  AL QMHPTKAPGPDG+P LFYQ+YWS +G +V    L  LN G  P +LN + I LIPK++NP+ V D+RPISLCNV YK+++K
Subjt:  MNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAK

Query:  VLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILI
         +ANRLK +L  +IS +QSAFVPGRLI DNV+V YE +H+++  +  K  + SIKLDMSKAYDRVEW FL +IM +MGF + WI  I  CV SV FS+++
Subjt:  VLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILI

Query:  NGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKS
        NG  T   KP+RGLRQGDPLSPYLFL+C EGL S+LK+A ++  I G+RI + AP I+HL FADDS++F KA  +   E+QRILK YE A+GQ +N DK+
Subjt:  NGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKS

Query:  EIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLN
         + F + V++ ++  + N+     +Q + KYLG P  ++R K  + + I S+V K LQ WK K  S  GKE+L+KAVA A+PTY +SCFKLP     +L 
Subjt:  EIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLN

Query:  RLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAM
         LMA FWWG +E  ++IHW+ W+ + K K +GG+GF++++LFN ALLAK  WRL+ + +SL+ Q+ KA+Y+P+ +FL + LG  PSY WRGI   R    
Subjt:  RLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAM

Query:  TGGRWRVGNGKSIDIYDDQWIPRMKTF------------------RLLSPQRPLTDVQLI--VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRS
         G + RVG+GK+I I+ D WIP  +                     L++ +    D+  +  +F     EKI  I I  +   D + W  +++G+++VRS
Subjt:  TGGRWRVGNGKSIDIYDDQWIPRMKTF------------------RLLSPQRPLTDVQLI--VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRS

Query:  GYKLAIDGCHEA-SSSANSFSC-SWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
         YKL   G ++A   S+NS S  S WK +W + +P K++IFAW+   + LPT + L+ R V  DP+C  C + NE   HA+
Subjt:  GYKLAIDGCHEA-SSSANSFSC-SWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

TrEMBL top hitse value%identityAlignment
A0A2N9E9A1 Reverse transcriptase domain-containing protein4.7e-24541.52Show/hide
Query:  MNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQ
        +  LR  L F N FV       GGL LFW ++VNL + S+S  HID  + E Q + +R T  YG P+   R  SWTLLRRLS +    W   GD N +T+
Subjt:  MNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQ

Query:  VNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKK
          EK G       Q Q F + +DDC   DLG+ G  FTW NN+ G     +RLDR +   + L LF +  ++HL    SDH+ I  ++      + +RK 
Subjt:  VNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKK

Query:  TPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEE
           F+FEE WT    C   + A W+  +       +  +I +  + L+RW  +  G  K +I E + R++    + + G      K  +  L  +L +EE
Subjt:  TPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEE

Query:  IYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAM
          W+QRSR EWL+ GD+NTR+FH +AT RR++N+I ++ D  G    ++  + + F+ ++++LF S+  N   ++++++     VT EMN HL + F+ +
Subjt:  IYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAM

Query:  EVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHS
        EV EA+KQM P K+PGPDG P +FYQ+YW I+GE V    L CLN G++   +N T I LIPK KNP +V D+RPISLCNV YKII+KVL NRLK+IL  
Subjt:  EVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHS

Query:  IISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSR
        I+S++QSAFVPGRLI DN++V +E LH++ + +  K+   ++KLDMSKAYDRVEW +L ++M +MGF   W+K +  C+ +VS+SILING      KPSR
Subjt:  IISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSR

Query:  GLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEV
        GLRQGDPLSPYLFL CAEGL SLL++A+   N+ G+ I++  P ++HLFFADDSLLF KA   E   +Q IL  YE A+GQ +N  K+ + F       +
Subjt:  GLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEV

Query:  KGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSE
        +  +  +L V ++  + KYLG PS + R K +S   I  +V   L+GWK KL S AGKE+LIK+VAQA+PTY +SCF+LPQ    ++  L+  FWWG   
Subjt:  KGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSE

Query:  QGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKS
        +  K+HW+ W+++ + K +GG+GFR++  FN+ALLAK  WRL+H   SL  ++ KA+Y+PQ   L A L +R SY W+ I+  R++   G  WR+G+GK+
Subjt:  QGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKS

Query:  IDIYDDQWIPRMKTFRLLSPQRPLTDVQLIV------------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEASS
        I I+ D+W+P      ++SP+ P   +  +V                  F   + E I  IP+    + D   W   ++G Y VRSGY L +D     + 
Subjt:  IDIYDDQWIPRMKTFRLLSPQRPLTDVQLIV------------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEASS

Query:  SANSFSCSW--WKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
          +  +     WK++W L +P K++ F WRA  + LPT   L  R V NDP CP C    E+T+HAL
Subjt:  SANSFSCSW--WKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

A0A2N9F6L9 Reverse transcriptase domain-containing protein2.1e-25342.8Show/hide
Query:  METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMV
        MET SN+  +  LR  L F +  VV+     GGLALFW++ ++++I SYS  HID  I + + + +R T +YG P+  NRH +W L+RRL  +   +W  
Subjt:  METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMV

Query:  AGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQ--PGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQ
         GD N I ++ E  G       Q Q F N LDDC L DLGF G  FTW NN+  P TT++  RLDR V N++ L  F    ++HL  + SDH+ +    +
Subjt:  AGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQ--PGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQ

Query:  EYGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREE
             S SR++   F+FEE W     C   I+  W  ++   +   +  ++R    +L  W R   G    +I  TK  +       ++G     L+   
Subjt:  EYGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREE

Query:  KCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKS-SPFNVQAMDEIIQATTTRVTEE
          L+++  +EE  W+QRSR  WL  GDRNT++FHS+AT R ++N+I  + D  G   +  E +   F+ Y++ LF +  P  ++A   ++      VTE+
Subjt:  KCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKS-SPFNVQAMDEIIQATTTRVTEE

Query:  MNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAK
        MN  L R F A EVE ALKQM PTKAPGPDG+P +FYQ++W +VG  V    L CLN G +   +N T I LIPK KNP  V ++RPISLCNV YK+I+K
Subjt:  MNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAK

Query:  VLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILI
        VLANRLK IL  I+SD+QSAFVPGRLI DNV+V +E LH++  +K  ++   ++KLDMSKAYDRVEW+FL KIMAK+GF   WI  +  C+ +VS+SIL+
Subjt:  VLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILI

Query:  NGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKS
        NG      KPSRGLRQGDPLSPYLFL+CAEGL SL++KA +  +I G+ + +  P I+HLFFADDSLLF KA  +   ++Q IL  YE A+GQ VN DK+
Subjt:  NGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKS

Query:  EIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLN
         I F  G     +  +   L+V I++ + KYLG PS + RN+  S + I  +V + L+GWK KL S AG+E+LIKAVAQA+PTY++SCF+LP   C+DL 
Subjt:  EIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLN

Query:  RLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAM
         ++  FWW ++ + RKIHW+ W  + + K +GG+GFRD+R FN ALLAK  WRL+H   SL  ++ KA+++P    +     +R SY W+ I+  RE+  
Subjt:  RLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAM

Query:  TGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPL----TDVQLIV--------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGY
         G  WRVGNG+SI+I+D +W+      ++L+P   +    T  QLI+              F   D E I+ IP++     D   W  +  G Y+VRSGY
Subjt:  TGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPL----TDVQLIV--------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGY

Query:  KLAID-------GCHEASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
        +L +D       GC   +   N+     W+++W LNIP K ++FAW+AS   LPT + L  R +  DP C  CG+  E  +HAL
Subjt:  KLAID-------GCHEASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

A0A2N9GIC4 Reverse transcriptase domain-containing protein1.1e-24643.42Show/hide
Query:  GGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHFVNTL
        GGLALFW + +N+ I SYS  HID  I E   + +R T +YG P+  NR  +W L+RRL  +    W   GD N + ++ E  G       Q Q F N L
Subjt:  GGLALFWHSSVNLNIISYSKVHIDTNI-EFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHFVNTL

Query:  DDCALRDLGFKGNRFTWKNNQ--PGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAPII
        DDC L DLGF G  FTW NN+  P TT++  RLDR V   D L  +    ++H+  + SDH+ +  V +    +S + K+ P F+FEE W     C   I
Subjt:  DDCALRDLGFKGNRFTWKNNQ--PGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAPII

Query:  QAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRNTR
           W   R   +   +  +++    +L  W R K G    +I ETK  ++    S L G     L+   K L+++  +EE  W+QRSR  WL  GDRNT+
Subjt:  QAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRNTR

Query:  WFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGL
        +FHS+AT R ++N I  + D      +  + +    + Y++ LF +S  +   + E++      VTE+MN  L R F A EVE ALKQM PTKAPGPDG+
Subjt:  WFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGL

Query:  PALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVI
        P +FYQ++W +VG  V    L CLN G +   +N T I LIPK KNP  V ++RPISLCNV YK+I+KVLANRLK IL  ++SD+QSAFVPGRLI DNV+
Subjt:  PALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVI

Query:  VGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGL
        V +E LHY+  +K  ++   ++KLDMSKAYDRVEW FL KIM++MGF   WI  +  C+ +VS+SIL+NG      +PSRGLRQGDPLSPYLFL+CAEGL
Subjt:  VGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGL

Query:  SSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYL
         SL++KA +  +I G+ + +  P ISHLFFADDSLLF KA     +++Q IL  YE A+GQ VN DK+ I F        +  +   L+V I++ + KYL
Subjt:  SSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYL

Query:  GCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRG
        G PS + RN+ AS + I  +V + L+GWK KL S AG+E+LIKAVAQA+PTY++SCFKLP   C++L  ++  FWW ++ + RKIHW+ W  + +PK +G
Subjt:  GCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRG

Query:  GMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSP
        GMGFRDIR FN ALLAK  WRLLH   SL  ++ KA+++P    L     +R SY W+ I+  R++ + G  WRVGNG+SI+I+D +W       ++++P
Subjt:  GMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSP

Query:  Q----RPLTDVQLIV--------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEA---SSSANSFSCSWWKTLWMLN
             R  T  QLI+              FC  D E I+ IP++     D   W  +  G+YTVRSGY+  ++   ++   SS  N      WK++W L 
Subjt:  Q----RPLTDVQLIV--------------FCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEA---SSSANSFSCSWWKTLWMLN

Query:  IPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
        IP K ++FAW+AS   LPT + L  R++     C  CG+ +E  +HAL
Subjt:  IPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

A0A7N2LIH6 Uncharacterized protein7.4e-25142.7Show/hide
Query:  METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQGNF--FRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWM
        +ETK++  +M   +  L F    +V   G SGGLAL W    ++   S S  HID  +   G+   +R T  YG P    R+ SW LL  L+      W+
Subjt:  METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQGNF--FRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWM

Query:  VAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQE
        V GD N I   +EK G  D +  Q   F   L  C L DLGF G RFTW N + G      RLDR V N     +F    ++H+   +SDH ++   L +
Subjt:  VAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQE

Query:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEK
               +K+   F FEE WT  E+C  I++  W+  R ++S   +  R+    + L++W +N  G     I + K R++ L    L       ++  +K
Subjt:  YGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEK

Query:  CLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMN
         ++ +   EE+ WKQRSR  WL++GD+N+++FH+ A+ RRQKN+I  + D  G   E++E  E+  L YF +++ S+     + D  ++A   RVT EMN
Subjt:  CLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMN

Query:  LHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVL
          L + F A+EV +AL+QMHPTKAPGPDG+  +FYQ+YW IVG +V N  L+ LN G MP  +N+T I LIPK KNP  + ++RPISLCNV YKII+KVL
Subjt:  LHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVL

Query:  ANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILING
        ANRLK +LH +I + QSAFVPGR+I DNVIV +E +H I + +  K    +IKLDMSKAYDRVEW +L  +M KMGF   WI  I MCV SVSFS+LING
Subjt:  ANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILING

Query:  TCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEI
            SF PSRGLRQGDP+SPYLFL+C EGLS+++KK E    I G+  A+ AP ISHLFFADDS++F +A  +E +++ ++L+VYE  +GQ +N DK+ +
Subjt:  TCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEI

Query:  AFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRL
         F     +E+K F   I   +I+Q H KYLG P  + R KK + N I  +V + + GWK KL S AG+EVLIKAVAQA PTYT++ FKLP   C +LN +
Subjt:  AFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRL

Query:  MANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTG
        M +FWWG   + +K+ W+ W+ + KPK  GGMGF+D++ FN ALLAK  WRL   P SL  ++LKA+Y+    F++A LG +PSY+WR I+  + +   G
Subjt:  MANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTG

Query:  GRWRVGNGKSIDIYDDQWIP-----RMKTFR-----------LLSPQRPLTDVQLI--VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKL
         RW VG+G+SI+I+D +W+P     ++ T R           L+S +R      L+   F   + E+I  IP++  +L D+  W     G +TV+S Y+ 
Subjt:  GRWRVGNGKSIDIYDDQWIP-----RMKTFR-----------LLSPQRPLTDVQLI--VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKL

Query:  AI-------DGCHEASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
        A        +G      S  S   + WKT+W L  PNK+K F WRA   ILPT   LV R++  D  C  CG+ +ET+ H L
Subjt:  AI-------DGCHEASSSANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

M5VU98 Reverse transcriptase domain-containing protein3.6e-24542.91Show/hide
Query:  GLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQ--GNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHF
        G SGGLAL W   V++++ ++S   ID  I     G+ +R T  YG P  ++R  SW LL +L       W+  GD N I   +EK+GG      Q Q F
Subjt:  GLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQ--GNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHF

Query:  VNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAP
         N +D    RDLGF G +FTWK  + G  F+R RLDR +      NLF   ++ HL    SDH  I  V   + T   SR +   F FE  WT + DC  
Subjt:  VNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAP

Query:  IIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRN
         I+  WE     +    L  +I+ +   L+RW ++  G  K      + ++  L ++     +    +  +K LD +L + E+YW QRSRE WLK GD+N
Subjt:  IIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRN

Query:  TRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPD
        T +FH KAT RR++N I+ + D  G    +R+ I    + YF +LF+SS  ++  M+EI+ A   +VT +M   L   F   E+++A+ QM P+KAPGPD
Subjt:  TRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPD

Query:  GLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADN
        GLP LFYQ+YW IVG+ V+      L   EM  +LN T + LIPK K P T+   RPISLCNV Y+I AK LANR+K ++ S+IS++QSAFVPGRLI DN
Subjt:  GLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADN

Query:  VIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAE
         IV +E  H++K+ +  +    ++KLDMSKAYDRVEW FL K+M  MGF   W++ +  CV +VS+S L+NG  T    P+RGLRQGDPLSPYLFL+CAE
Subjt:  VIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAE

Query:  GLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGK
        G ++LL KAE    + G+ I + AP +SHLFFADDS +F KA       ++ I +VYE A+GQ +N  KS +AF A +  + +  LA++L V  V  H  
Subjt:  GLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGK

Query:  YLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKD
        YLG P  L RNK     Y+  +V K LQGW+ +  S AGKEVL+K VAQ++P Y +SCF LPQG CH++ ++MA FWWG   + RKIHW++WE + K K 
Subjt:  YLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKD

Query:  RGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLL
         GGMGFR ++ FN A+LAK  WRL+H P SL  ++LKA+Y+PQ +F +ATLGSRPS VW+ I   R++   G R+++G+GKS+ I+ D+W+PR  TF ++
Subjt:  RGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLL

Query:  SPQ-----------------RPLTDVQLI--VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAI---DGCHEASSSANSFSCSWWKTLW
        +                    P  D+Q +  +F   DV  I  IP++  + PD   W+YDK G +TV+S Y++A+    G  + SSS+NS +   W+ +W
Subjt:  SPQ-----------------RPLTDVQLI--VFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAI---DGCHEASSSANSFSCSWWKTLW

Query:  MLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL
           +P KLKIFAWR + +ILPT   L+ + V     C  CG + E+ +H L
Subjt:  MLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.6e-4626.06Show/hide
Query:  RRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQA-TTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQR
        +R+KN+I  I + +G    +  +I+     Y+  L+ +   N++ MD  +   T  R+ +E    L+R     E+   +  +   K+PGPDG  A FYQR
Subjt:  RRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQA-TTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQR

Query:  YWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVND-YRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECL
        Y   +   ++        EG +P+   E  I+LIPK    +T  + +RPISL N+  KI+ K+LANR++  +  +I   Q  F+PG     N+      +
Subjt:  YWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVND-YRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECL

Query:  HYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKK
         +I R+K K +   SI  D  KA+D+++  F+ K + K+G    ++K I    +  + +I++NG   E+F    G RQG PLSP LF +  E L+  +++
Subjt:  HYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKK

Query:  AEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSL
            + I G+++ K    +S   FADD +++L+        + +++  +   +G  +N  KS+ AF      + +  +   L   I     KYLG   + 
Subjt:  AEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSL

Query:  ARNKKASLNY--ITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSC--FKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGM
                NY  +  ++++    WK    S  G+  ++K        Y  +    KLP     +L +    F W      +K   +    +++    GG+
Subjt:  ARNKKASLNY--ITSKVQKVLQGWKAKLFSTAGKEVLIKAVAQAVPTYTLSC--FKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGM

Query:  GFRDIRLFNQALLAKHTW
           D +L+ +A + K  W
Subjt:  GFRDIRLFNQALLAKHTW

P08548 LINE-1 reverse transcriptase homolog7.3e-4622.48Show/hide
Query:  LRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDL--GFKGNR--FTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINH
        L  +S++  +  +V GD N    V ++       + +     +T+    L D+   F  N+  +T+ ++  GT     ++D  +G+   L+ F  + I  
Subjt:  LRRLSDIPHTAWMVAGDLNAITQVNEKDGGGDFERLQSQHFVNTLDDCALRDL--GFKGNR--FTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINH

Query:  LGFLSSDHRVIEAVL-QEYGTHSFSRK-KTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEE--------LKRWGRNKTGRFKTRIS
        +  + SDH  I+  L      H+ ++  K  +   ++TW + E    I +   +    D +  +L    ++V           LK+  R +       + 
Subjt:  LGFLSSDHRVIEAVL-QEYGTHSFSRK-KTPHFKFEETWTLYEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEE--------LKRWGRNKTGRFKTRIS

Query:  ETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSEL
        + ++  E  N        +T ++ E   ++N  + ++I    +S+  + +  ++  +   +    +R K+ I  I +       +  +I++    Y+ +L
Subjt:  ETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSEL

Query:  FKSSPFNVQAMDEIIQAT-TTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIP
        +     N++ +D+ ++A    R++++    L+R   + E+   ++ +   K+PGPDG  + FYQ +   +   ++N       EG +P+   E  I LIP
Subjt:  FKSSPFNVQAMDEIIQAT-TTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIP

Query:  K-RKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKI
        K  K+P+   +YRPISL N+  KI+ K+L NR++  +  II   Q  F+PG     N+      + +I + K+K +   SI  D  KA+D ++  F+ + 
Subjt:  K-RKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKI

Query:  MAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKAR
        + K+G    ++K I       + +I++NG   +SF    G RQG PLSP LF +  E L+  +++    + I G+ I      +S   FADD +++L+  
Subjt:  MAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKAR

Query:  KEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYIT--SKVQKVLQGWKAKLFSTAGKE
        ++   ++  ++K Y   +G  +N  KS +AF     N+ +  + + +   +V    KYLG   +         NY T   ++ + +  WK    S  G+ 
Subjt:  KEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYIT--SKVQKVLQGWKAKLFSTAGKE

Query:  VLIK--AVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTW
         ++K   + +A+  +     K P     DL +++ +F W      +K   +    ++     GG+   D+RL+ ++++ K  W
Subjt:  VLIK--AVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTW

P11369 LINE-1 retrotransposable element ORF2 protein3.5e-4824.31Show/hide
Query:  PGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGD--ESPSSLLTRIR
        P  TF   ++D  +G+   LN + ++ I  +  + SDH  +  +      ++ +  K P F ++   TL  D   +++ G +KE  D  E   +  T   
Subjt:  PGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKERGD--ESPSSLLTRIR

Query:  SVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWK----QRSREEWLKWGDRNTRWFHSKATL---------
        ++++ +K + R K       +S +K++ E  + S L   +  L K+E         +E I  +    Q      ++  ++   WF  K            
Subjt:  SVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWK----QRSREEWLKWGDRNTRWFHSKATL---------

Query:  --RRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEII-QATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFY
           R K  I KI + +G    + E+I+     ++  L+ +   N+  MD+ + +    ++ ++   HL+      E+E  +  +   K+PGPDG  A FY
Subjt:  --RRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEII-QATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFY

Query:  QRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPK-RKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYE
        Q +   +   +     +   EG +P+   E  I LIPK +K+P+ + ++RPISL N+  KI+ K+LANR++  + +II   Q  F+PG     N+     
Subjt:  QRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPK-RKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYE

Query:  CLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLL
         +HYI + K K +    I LD  KA+D+++  F+ K++ + G    ++  I         +I +NG   E+     G RQG PLSPYLF +  E L+  +
Subjt:  CLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLL

Query:  KKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPS
        ++    + I G++I K    IS L  ADD ++++   K    E+  ++  +    G  +N +KS +AF      + +  +       IV ++ KYLG   
Subjt:  KKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPS

Query:  SLARNKKASLNY--ITSKVQKVLQGWKAKLFSTAGKEVLIK--AVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRG
        +         N+  +  ++++ L+ WK    S  G+  ++K   + +A+  +     K+P    ++L   +  F W + +       LK       +  G
Subjt:  SLARNKKASLNY--ITSKVQKVLQGWKAKLFSTAGKEVLIK--AVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRG

Query:  GMGFRDIRLFNQALLAKHTW
        G+   D++L+ +A++ K  W
Subjt:  GMGFRDIRLFNQALLAKHTW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.4e-4425.56Show/hide
Query:  WKNNQPGT---TFIR--------KRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKE
        W+   P T   T++R         R+DR   +  L++   S TI    F  SDH  +   +    + + S  K  ++ F  +    E  A  ++  W   
Subjt:  WKNNQPGT---TFIR--------KRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYEDCAPIIQAGWEKE

Query:  RGDESPSSLLTRIRSVFE-ELKRWGRNKTGRFKTRISETKRRIEVLN------ESRLEGSMLTLLKRE----EKCLDNILMEEEIYWKQRSREEWLKWGD
        R  +   + L +   V +  LK   +  T   K+   +    IE LN      E RL GS    L+ E    ++ L N+   +      RSR + L   D
Subjt:  RGDESPSSLLTRIRSVFE-ELKRWGRNKTGRFKTRISETKRRIEVLN------ESRLEGSMLTLLKRE----EKCLDNILMEEEIYWKQRSREEWLKWGD

Query:  RNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPG
        R +R+F++    +  + +I  +   +G   E+ E I      ++  LF   P +  A +E+       V+E     L+      E+ +AL+ M   K+PG
Subjt:  RNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPG

Query:  PDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIA
         DGL   F+Q +W  +G            +GE+P      ++ L+PK+ +   + ++RP+SL +  YKI+AK ++ RLK++L  +I   QS  VPGR I 
Subjt:  PDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIA

Query:  DNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMC
        DNV +  + LH+ +R+       A + LD  KA+DRV+  +L   +    F   ++  +     S    + IN + T      RG+RQG PLS  L+ + 
Subjt:  DNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMC

Query:  AEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKV------
         E    LL+K      ++GL + +    +    +ADD +L +     + +  Q   +VY AA+   +N+ KS             G L   LKV      
Subjt:  AEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKV------

Query:  --------KIVQDHGKYLGCPSSLARNKKASLNYITSK--VQKVLQGWK--AKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGS
                KI++  G YL      A     S N+I  +  V   L  WK  AK+ S  G+ ++I  +  +   Y L C    Q     + R + +F W  
Subjt:  --------KIVQDHGKYLGCPSSLARNKKASLNYITSK--VQKVLQGWK--AKLFSTAGKEVLIKAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGS

Query:  SEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQ
               HW+     + P   GG G   IR        +   R L+   S     L + +Y Q
Subjt:  SEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQ

P93295 Uncharacterized mitochondrial protein AtMg003102.2e-3444.76Show/hide
Query:  AVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPK-DRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLK
        A+P Y +SCF+L +  C  L   M  FWW S E  RKI W+ W+ + K K D GG+GFRD+  FNQALLAK ++R++H+P +L+ ++L++RY+P    ++
Subjt:  AVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPK-DRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLK

Query:  ATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWI
         ++G+RPSY WR II GREL   G    +G+G    ++ D+WI
Subjt:  ATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.6e-2527.03Show/hide
Query:  QHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHS-VTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYE
        + F N L D  L D+  +G  +TW N+Q     IRK LDR + N D  + F S + +  L  + SDH     +L+       S+K   +F F  T   + 
Subjt:  QHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHS-VTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTLYE

Query:  DCAPIIQAGWEKERGDESPS-SLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKRE---EKCLDNILMEEEIYWKQRSREE
             +   WE++    S   SL   +++  +  K   R   G  + +  E    +E +    L     +L + E    K  +      E +++Q+SR +
Subjt:  DCAPIIQAGWEKERGDESPS-SLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKRE---EKCLDNILMEEEIYWKQRSREE

Query:  WLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKS-----SPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEA
        WL+ GD NTR+FH      + KN I+ +   +    EN   ++   + Y++ L  S     +P +VQ + +I      R  + +   L       E+  A
Subjt:  WLKWGDRNTRWFHSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKS-----SPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEA

Query:  LKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKII
        +  M   KAPGPD   A F+   W +V ++ I         G +  + N T I LIPK      ++ +RP+S C V YKII
Subjt:  LKQMHPTKAPGPDGLPALFYQRYWSIVGEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.4e-1234.88Show/hide
Query:  LANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKI
        +  RLK ++ ++I   Q++F+PGR+  DN++   E +H ++R K  K  +  +KLD+ KAYDR+ W +L   +   GF   W+ +I
Subjt:  LANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASIKLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKI

AT4G29090.1 Ribonuclease H-like superfamily protein4.0e-4732.14Show/hide
Query:  AVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKA
        A+PTYT++CF LP+  C  +  ++A+FWW + ++ + +HW  W+ ++  K  GG+GF+DI  FN ALL K  WR+L RP+SLM ++ K+RY+ + D L A
Subjt:  AVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKA

Query:  TLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRP-------------------------LTDVQLIVFCQDDVEKIQD
         LGSRPS+VW+ I   +E+   G R  VGNG+ I I+  +W+        L  QR                            DV  ++F + + + I +
Subjt:  TLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRP-------------------------LTDVQLIVFCQDDVEKIQD

Query:  IPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEASSSANSFSCSW---WKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKM
        +   G  + D++ W Y  +G YTV+SGY +     ++ SS       S    ++ +W      K++ F W+   N LP    L  R +  +  C +C   
Subjt:  IPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEASSSANSFSCSW---WKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKM

Query:  NETTVHAL
         ET  H L
Subjt:  NETTVHAL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-3544.76Show/hide
Query:  AVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPK-DRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLK
        A+P Y +SCF+L +  C  L   M  FWW S E  RKI W+ W+ + K K D GG+GFRD+  FNQALLAK ++R++H+P +L+ ++L++RY+P    ++
Subjt:  AVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPK-DRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLK

Query:  ATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWI
         ++G+RPSY WR II GREL   G    +G+G    ++ D+WI
Subjt:  ATLGSRPSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)9.0e-1551.47Show/hide
Query:  LINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDS
        +ING       PSRGLRQGDPLSPYLF++C E LS L ++A+    + G+R++  +P I+HL FADD+
Subjt:  LINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGACGAAATCTAATCAATCTCGAATGAACTCTCTAAGGGAGACGCTTATGTTTCAAAATGTGTTTGTAGTGAATTGTACTGGACTAAGTGGTGGTTTAGCCTTGTT
TTGGCATTCTTCTGTTAATCTAAATATTATTTCTTATTCTAAGGTGCATATTGATACTAACATTGAATTTCAAGGAAATTTTTTTAGATTTACATGTATGTATGGCGAAC
CAAAGCAAGAAAACCGACACTTATCTTGGACGTTACTACGTCGCCTTTCTGACATACCCCATACGGCTTGGATGGTAGCTGGAGATCTAAATGCAATAACCCAAGTAAAT
GAAAAGGATGGAGGAGGAGACTTCGAGAGATTACAAAGCCAACATTTTGTAAATACATTAGATGACTGCGCTCTTCGAGATCTTGGTTTCAAGGGCAACCGATTCACATG
GAAGAACAATCAGCCTGGAACAACCTTTATTAGGAAAAGGCTAGACCGGTGTGTCGGTAATGTTGATCTCTTGAATCTTTTTCATTCTGTTACTATTAATCATCTTGGTT
TTTTGTCTTCAGATCATAGAGTTATCGAGGCCGTTTTACAAGAATATGGAACTCATTCCTTCTCAAGGAAAAAAACCCCCCATTTCAAATTTGAAGAAACATGGACTCTT
TATGAGGATTGTGCCCCAATAATTCAGGCTGGTTGGGAAAAGGAAAGAGGCGATGAATCGCCTAGCAGTTTACTCACCCGAATTCGGAGCGTATTTGAGGAGCTAAAGCG
GTGGGGAAGAAACAAAACAGGAAGATTCAAAACACGAATTTCTGAAACTAAAAGGAGAATTGAGGTGTTAAATGAGAGTCGATTAGAAGGAAGCATGCTAACATTGTTAA
AGCGGGAAGAGAAATGCCTAGATAATATTCTTATGGAGGAGGAAATTTACTGGAAACAACGCTCTCGAGAAGAATGGCTCAAATGGGGGGACAGAAATACAAGATGGTTC
CACTCCAAAGCCACTCTTAGACGACAGAAGAACAAAATCCAGAAAATAACTGATATCGAGGGGCATTGTTTTGAAAACAGGGAAGATATTGAACGAAATTTCCTTCATTA
TTTTTCTGAACTTTTTAAATCCTCTCCTTTTAATGTGCAGGCAATGGATGAAATTATTCAAGCTACCACAACAAGAGTTACAGAAGAGATGAATCTTCATCTAGATCGTG
CTTTTGTAGCTATGGAGGTGGAGGAAGCTTTGAAACAAATGCACCCAACCAAAGCCCCTGGACCAGACGGATTACCAGCCCTCTTCTATCAAAGATACTGGAGTATAGTA
GGAGAGGCAGTGATCAACACATGTCTCCGATGCCTCAACGAGGGGGAAATGCCAGATAAGCTTAATGAAACAATGATTGTACTCATACCGAAGAGGAAAAACCCGAGCAC
GGTGAACGATTATCGCCCTATTAGTTTATGCAATGTGTGTTATAAAATTATTGCAAAAGTGCTTGCCAACCGTCTGAAAACTATTCTACATTCTATCATATCTGATACTC
AAAGTGCGTTTGTCCCTGGTAGATTAATAGCTGATAATGTTATTGTTGGTTATGAATGTCTGCATTACATTAAAAGATCTAAGTCCAAGAAAAATTGTTTTGCATCTATA
AAATTGGACATGAGTAAAGCATATGATAGAGTAGAGTGGATCTTTCTTAGGAAAATAATGGCAAAGATGGGTTTTAGTTCAGCTTGGATAAAGAAGATTGGCATGTGTGT
TGAATCTGTATCATTTTCGATTCTTATTAATGGTACCTGTACTGAATCTTTTAAGCCCTCACGAGGGCTTAGACAGGGCGATCCGCTATCTCCTTATTTATTTCTCATGT
GTGCTGAGGGACTATCTAGCCTACTAAAAAAGGCCGAGATATCTCAAAATATTTCAGGTCTTAGAATTGCCAAAACGGCACCACCTATCTCTCACCTCTTTTTTGCAGAT
GATAGTTTACTTTTTCTCAAAGCAAGAAAGGAGGAATTTGATGAGATGCAACGAATCCTGAAAGTATATGAAGCAGCCACTGGGCAGGTCGTTAATTTTGATAAATCAGA
GATAGCTTTTGGGGCAGGAGTGGCAAATGAAGTAAAAGGCTTTTTAGCCAACATTTTAAAAGTCAAGATAGTCCAAGACCATGGAAAGTATCTTGGGTGTCCGTCTTCCC
TAGCTAGAAACAAGAAAGCTTCTTTGAACTACATAACATCTAAAGTTCAGAAGGTGCTGCAGGGTTGGAAAGCAAAATTATTTTCAACGGCTGGAAAGGAGGTGTTGATA
AAGGCTGTAGCTCAGGCAGTTCCCACTTATACTCTTTCATGTTTCAAACTTCCTCAGGGGTGCTGCCATGACTTGAACAGGCTAATGGCTAATTTTTGGTGGGGTAGTTC
TGAACAGGGGAGGAAAATTCATTGGCTGAAATGGGAAGCAATGACAAAACCAAAGGATAGAGGAGGGATGGGTTTCAGAGATATAAGACTGTTCAACCAGGCCTTGCTGG
CCAAACATACCTGGAGACTATTACATAGACCGCAATCTCTAATGGTCCAAATCCTGAAAGCCAGATACTATCCTCAAGAAGATTTCCTAAAAGCAACTCTGGGAAGTAGA
CCATCTTATGTTTGGAGGGGGATCATTTGGGGACGAGAACTAGCCATGACAGGAGGACGGTGGAGAGTTGGAAACGGAAAATCTATTGACATCTATGACGATCAGTGGAT
CCCGAGAATGAAAACTTTCAGGCTGCTATCTCCTCAACGACCGTTAACTGACGTACAGTTAATAGTCTTTTGTCAGGATGATGTGGAAAAAATTCAAGATATCCCAATTG
CAGGTGATAGTCTTCCAGATACTTTCTTTTGGCATTATGACAAAACAGGAAAGTATACAGTCCGGAGCGGCTATAAACTTGCAATAGATGGATGTCATGAGGCATCTTCA
TCAGCGAATTCCTTTAGCTGCTCTTGGTGGAAAACATTATGGATGCTGAACATCCCTAATAAACTCAAGATCTTTGCATGGCGAGCCAGCCTGAATATCCTCCCAACCAA
TATGATTCTTGTCACCCGTCGAGTCCATAATGACCCTCGATGTCCAAAGTGTGGGAAAATGAATGAAACAACTGTACATGCTTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGACGAAATCTAATCAATCTCGAATGAACTCTCTAAGGGAGACGCTTATGTTTCAAAATGTGTTTGTAGTGAATTGTACTGGACTAAGTGGTGGTTTAGCCTTGTT
TTGGCATTCTTCTGTTAATCTAAATATTATTTCTTATTCTAAGGTGCATATTGATACTAACATTGAATTTCAAGGAAATTTTTTTAGATTTACATGTATGTATGGCGAAC
CAAAGCAAGAAAACCGACACTTATCTTGGACGTTACTACGTCGCCTTTCTGACATACCCCATACGGCTTGGATGGTAGCTGGAGATCTAAATGCAATAACCCAAGTAAAT
GAAAAGGATGGAGGAGGAGACTTCGAGAGATTACAAAGCCAACATTTTGTAAATACATTAGATGACTGCGCTCTTCGAGATCTTGGTTTCAAGGGCAACCGATTCACATG
GAAGAACAATCAGCCTGGAACAACCTTTATTAGGAAAAGGCTAGACCGGTGTGTCGGTAATGTTGATCTCTTGAATCTTTTTCATTCTGTTACTATTAATCATCTTGGTT
TTTTGTCTTCAGATCATAGAGTTATCGAGGCCGTTTTACAAGAATATGGAACTCATTCCTTCTCAAGGAAAAAAACCCCCCATTTCAAATTTGAAGAAACATGGACTCTT
TATGAGGATTGTGCCCCAATAATTCAGGCTGGTTGGGAAAAGGAAAGAGGCGATGAATCGCCTAGCAGTTTACTCACCCGAATTCGGAGCGTATTTGAGGAGCTAAAGCG
GTGGGGAAGAAACAAAACAGGAAGATTCAAAACACGAATTTCTGAAACTAAAAGGAGAATTGAGGTGTTAAATGAGAGTCGATTAGAAGGAAGCATGCTAACATTGTTAA
AGCGGGAAGAGAAATGCCTAGATAATATTCTTATGGAGGAGGAAATTTACTGGAAACAACGCTCTCGAGAAGAATGGCTCAAATGGGGGGACAGAAATACAAGATGGTTC
CACTCCAAAGCCACTCTTAGACGACAGAAGAACAAAATCCAGAAAATAACTGATATCGAGGGGCATTGTTTTGAAAACAGGGAAGATATTGAACGAAATTTCCTTCATTA
TTTTTCTGAACTTTTTAAATCCTCTCCTTTTAATGTGCAGGCAATGGATGAAATTATTCAAGCTACCACAACAAGAGTTACAGAAGAGATGAATCTTCATCTAGATCGTG
CTTTTGTAGCTATGGAGGTGGAGGAAGCTTTGAAACAAATGCACCCAACCAAAGCCCCTGGACCAGACGGATTACCAGCCCTCTTCTATCAAAGATACTGGAGTATAGTA
GGAGAGGCAGTGATCAACACATGTCTCCGATGCCTCAACGAGGGGGAAATGCCAGATAAGCTTAATGAAACAATGATTGTACTCATACCGAAGAGGAAAAACCCGAGCAC
GGTGAACGATTATCGCCCTATTAGTTTATGCAATGTGTGTTATAAAATTATTGCAAAAGTGCTTGCCAACCGTCTGAAAACTATTCTACATTCTATCATATCTGATACTC
AAAGTGCGTTTGTCCCTGGTAGATTAATAGCTGATAATGTTATTGTTGGTTATGAATGTCTGCATTACATTAAAAGATCTAAGTCCAAGAAAAATTGTTTTGCATCTATA
AAATTGGACATGAGTAAAGCATATGATAGAGTAGAGTGGATCTTTCTTAGGAAAATAATGGCAAAGATGGGTTTTAGTTCAGCTTGGATAAAGAAGATTGGCATGTGTGT
TGAATCTGTATCATTTTCGATTCTTATTAATGGTACCTGTACTGAATCTTTTAAGCCCTCACGAGGGCTTAGACAGGGCGATCCGCTATCTCCTTATTTATTTCTCATGT
GTGCTGAGGGACTATCTAGCCTACTAAAAAAGGCCGAGATATCTCAAAATATTTCAGGTCTTAGAATTGCCAAAACGGCACCACCTATCTCTCACCTCTTTTTTGCAGAT
GATAGTTTACTTTTTCTCAAAGCAAGAAAGGAGGAATTTGATGAGATGCAACGAATCCTGAAAGTATATGAAGCAGCCACTGGGCAGGTCGTTAATTTTGATAAATCAGA
GATAGCTTTTGGGGCAGGAGTGGCAAATGAAGTAAAAGGCTTTTTAGCCAACATTTTAAAAGTCAAGATAGTCCAAGACCATGGAAAGTATCTTGGGTGTCCGTCTTCCC
TAGCTAGAAACAAGAAAGCTTCTTTGAACTACATAACATCTAAAGTTCAGAAGGTGCTGCAGGGTTGGAAAGCAAAATTATTTTCAACGGCTGGAAAGGAGGTGTTGATA
AAGGCTGTAGCTCAGGCAGTTCCCACTTATACTCTTTCATGTTTCAAACTTCCTCAGGGGTGCTGCCATGACTTGAACAGGCTAATGGCTAATTTTTGGTGGGGTAGTTC
TGAACAGGGGAGGAAAATTCATTGGCTGAAATGGGAAGCAATGACAAAACCAAAGGATAGAGGAGGGATGGGTTTCAGAGATATAAGACTGTTCAACCAGGCCTTGCTGG
CCAAACATACCTGGAGACTATTACATAGACCGCAATCTCTAATGGTCCAAATCCTGAAAGCCAGATACTATCCTCAAGAAGATTTCCTAAAAGCAACTCTGGGAAGTAGA
CCATCTTATGTTTGGAGGGGGATCATTTGGGGACGAGAACTAGCCATGACAGGAGGACGGTGGAGAGTTGGAAACGGAAAATCTATTGACATCTATGACGATCAGTGGAT
CCCGAGAATGAAAACTTTCAGGCTGCTATCTCCTCAACGACCGTTAACTGACGTACAGTTAATAGTCTTTTGTCAGGATGATGTGGAAAAAATTCAAGATATCCCAATTG
CAGGTGATAGTCTTCCAGATACTTTCTTTTGGCATTATGACAAAACAGGAAAGTATACAGTCCGGAGCGGCTATAAACTTGCAATAGATGGATGTCATGAGGCATCTTCA
TCAGCGAATTCCTTTAGCTGCTCTTGGTGGAAAACATTATGGATGCTGAACATCCCTAATAAACTCAAGATCTTTGCATGGCGAGCCAGCCTGAATATCCTCCCAACCAA
TATGATTCTTGTCACCCGTCGAGTCCATAATGACCCTCGATGTCCAAAGTGTGGGAAAATGAATGAAACAACTGTACATGCTTTGTAG
Protein sequenceShow/hide protein sequence
METKSNQSRMNSLRETLMFQNVFVVNCTGLSGGLALFWHSSVNLNIISYSKVHIDTNIEFQGNFFRFTCMYGEPKQENRHLSWTLLRRLSDIPHTAWMVAGDLNAITQVN
EKDGGGDFERLQSQHFVNTLDDCALRDLGFKGNRFTWKNNQPGTTFIRKRLDRCVGNVDLLNLFHSVTINHLGFLSSDHRVIEAVLQEYGTHSFSRKKTPHFKFEETWTL
YEDCAPIIQAGWEKERGDESPSSLLTRIRSVFEELKRWGRNKTGRFKTRISETKRRIEVLNESRLEGSMLTLLKREEKCLDNILMEEEIYWKQRSREEWLKWGDRNTRWF
HSKATLRRQKNKIQKITDIEGHCFENREDIERNFLHYFSELFKSSPFNVQAMDEIIQATTTRVTEEMNLHLDRAFVAMEVEEALKQMHPTKAPGPDGLPALFYQRYWSIV
GEAVINTCLRCLNEGEMPDKLNETMIVLIPKRKNPSTVNDYRPISLCNVCYKIIAKVLANRLKTILHSIISDTQSAFVPGRLIADNVIVGYECLHYIKRSKSKKNCFASI
KLDMSKAYDRVEWIFLRKIMAKMGFSSAWIKKIGMCVESVSFSILINGTCTESFKPSRGLRQGDPLSPYLFLMCAEGLSSLLKKAEISQNISGLRIAKTAPPISHLFFAD
DSLLFLKARKEEFDEMQRILKVYEAATGQVVNFDKSEIAFGAGVANEVKGFLANILKVKIVQDHGKYLGCPSSLARNKKASLNYITSKVQKVLQGWKAKLFSTAGKEVLI
KAVAQAVPTYTLSCFKLPQGCCHDLNRLMANFWWGSSEQGRKIHWLKWEAMTKPKDRGGMGFRDIRLFNQALLAKHTWRLLHRPQSLMVQILKARYYPQEDFLKATLGSR
PSYVWRGIIWGRELAMTGGRWRVGNGKSIDIYDDQWIPRMKTFRLLSPQRPLTDVQLIVFCQDDVEKIQDIPIAGDSLPDTFFWHYDKTGKYTVRSGYKLAIDGCHEASS
SANSFSCSWWKTLWMLNIPNKLKIFAWRASLNILPTNMILVTRRVHNDPRCPKCGKMNETTVHAL