CuGenDBv2

Lcy08g007780 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy08g007780
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr08:10786587..10790883
RNA-Seq Expression: Lcy08g007780
Synteny: Lcy08g007780
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily
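
The genome location field can be split into a chromosome name and a coordinate pair. The following is a minimal Python sketch, assuming the `Chr:start..end` notation uses 1-based inclusive positions (the record itself does not state the convention); the names `parse_location` and `span_length` are hypothetical helpers, not part of any database API.

```python
import re

# Assumption: the string follows the "Chr:start..end" pattern shown in the
# Genome location field, with 1-based inclusive positions.
LOCATION = "Chr08:10786587..10790883"


def parse_location(location):
    """Split a 'Chr08:10786587..10790883' style string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start, end):
    """Length in bases of an inclusive interval."""
    return end - start + 1


chrom, start, end = parse_location(LOCATION)
print(chrom, span_length(start, end))  # Chr08 4297
```

Under that assumption the gene spans 4297 bases on Chr08.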


Homology
GenBank top hits | e value | %identity
XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] | 1.9e-122 | 45.89
Query:  KEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKVSVQ
        ++E+  LL+ EE  W+ RSR  WL  GD+NTK+FH KAS R+RRN I GI   +G W++    I ++A  YF+ ++ SS    T I + L+ I   V+ +
Subjt:  KEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKVSVQ

Query:  QRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKIIAK
            L + +TREEIE  +   +P+KA G DG+ A F+Q YW+IVG + V + L +LN ++ +  +NKT ITL+ K+ NP K+++F PISLCNV YK+I+K
Subjt:  QRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKIIAK

Query:  TIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLI
         +ANR K +L  +IS  Q+ F+ G+LI+DNVLV FE +H +  KK+GK+   AIKLDMSKAY+RVEW F+K+++ K+GF  KW   +M C+  VS+S+L+
Subjt:  TIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLI

Query:  NGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKS
        NG       P+RG+RQGDP+SPY+FL+CA+GFS LL        ++G  I   CP +THLFFADDSL+FC+++ +ECQT+ ++ + YE+ASGQ IN+DKS
Subjt:  NGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKS

Query:  MFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
            S+N  D +   +  +LG  Q      YL + S   +SK  +F ++K ++E  L GW E+L S GG+EIL+K
Subjt:  MFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber] | 4.3e-122 | 45.09
Query:  EKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLF
        ++R  +SI T ++     A   Q ++EL  LL+ EE  WR RS+  W + GD+NTK+FHA+AS+R+++N I  + + DG W + +  I   A  YF+N++
Subjt:  EKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLF

Query:  QSSRAKDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKV
         SS    T I + +  I  +V+ +   EL K +T EE+   ++  +P+KA G DG+ A+F+  YW IVG     + L +LN ++ +  +NKT I+LI K 
Subjt:  QSSRAKDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKV

Query:  SNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLK
        + P ++TEF PISLCN  YKII+K +ANRFK +L ++IS  Q+ F P +LI+DNVLV FE +H ++ K +GK+  ++IKLDMSKA++RVEW F+K ++ K
Subjt:  SNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLK

Query:  LGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKE
        LGF  KW H IM CV  VS+SVLING       PSRGIRQGDPLSP LFL+CAEG S L+     N+ + G  I   CP +THLFFADDSL+FC++  +E
Subjt:  LGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKE

Query:  CQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
        C  +  +  RYEEASGQ IN DKS    S N      +SI  +LG  Q      YL + S   +SKA++F ++K ++   L GW  +L S GG+EIL+K
Subjt:  CQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

XP_023913142.1 uncharacterized protein LOC112024740 [Quercus suber] | 2.1e-124 | 47.29
Query:  ETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITP
        E L A ++L+ LL ++E FW   SR  WLK GD+NTK+FH+KASQR++RN I GI++  G W ED  ++ E+A +YF+ +F S   +   +++ L  +  
Subjt:  ETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITP

Query:  KVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGY
        +++   + EL KPYT EE++A +    P+KA G DG++A FYQ +W IVG +     L  LN    +  +N T I LI KV +P+K+T+F PISLCNV Y
Subjt:  KVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGY

Query:  KIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVS
        KII+K +ANR K +L  +ISPTQ+ F+PG+LI+DNVL+ +E +HA+  +KKGK R +A+KLD+SKAY+RVEW FLK M+++LGF  +W + +M CV   S
Subjt:  KIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVS

Query:  FSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAI
        FSV INGR      PSRG+RQGDPLSPYLFL+CAEGF+ LL    S   L G +I    P +++L FADDSLIFCR++++E Q I    + Y EASGQ I
Subjt:  FSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAI

Query:  NLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
        N +KS    SSN  + + Q I + LG+++     +YL + +   RSK + F  +K ++   LQGW  RL S  GKE+L+K
Subjt:  NLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] | 3.2e-125 | 46.75
Query:  LTENEV-DANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKD
        L E E+ +A+ AE L   ++++ LL+++E +W  RSR  WL+ GD+NTK+FHAKASQR+R+N I GIR++ G W E+  ++G++A DYF NLFQ+     
Subjt:  LTENEV-DANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKD

Query:  TLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKIT
          +++ L+ +  KV+   R  L   +T EE++A +    P+KA G DG++A FYQ +W IVG+  V   L  LN+   +  +N T I LI KV NP++++
Subjt:  TLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKIT

Query:  EFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKW
        EF PISLCNV YKII+K +ANR K+VL  +IS TQ+ F+PG+LI+DNVLV +E +H + ++KKGK   VA+KLD+SKAY+RVEW FL+ ++ K+GF   W
Subjt:  EFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKW

Query:  THNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKEL
           +M CV   SFS+L+NG+P E  +PSRGIRQGDP+SPYLFL+CAEG + LL     N  + G  I    P +T+L FADDSL+FC+++R E +TI E+
Subjt:  THNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKEL

Query:  FRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
         + YE ASGQ+INL+KS    S+N  + +   I E+LG+K+      YL + +   R+K   F ++K ++   LQGW   L S  GKEIL+K
Subjt:  FRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata] | 8.2e-121 | 45.34
Query:  DSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRA
        +SI T ++     AE  Q +EE+  LL+ EE  WR RS+  W + GD+NTK+FHA+AS+RKR+N I G+ ++DG W E +  I   A  YFK ++ +S  
Subjt:  DSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRA

Query:  KDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQK
          T I + +  I  +V+ +   EL K +T EE+   ++  +P+KA G DG+ A F+  YW IVG   + + L +LN ++ +  +NKT I+LI K + P K
Subjt:  KDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQK

Query:  ITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSN
        +TEF PISLCN  YKII+K +ANR K +L ++IS  Q+ F P +LI+DNVLV FE +H ++ K +GK+  ++IKLDMSKA++RVEW F+K +++KLGF +
Subjt:  ITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSN

Query:  KWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIK
        KW   +M CV  VS+SVLING       PSRGIRQGDPLSP LFL+CAEGFS L+     N+ + G  I   CP +TH FFADDSL+FC++  +EC  + 
Subjt:  KWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIK

Query:  ELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
         +  +YEEASGQ IN DKS    S N      +SI ++LG  Q      YL + S   +SK ++F ++K ++   L GW  +L S GG+EIL+K
Subjt:  ELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

TrEMBL top hits | e value | %identity
A0A2N9G5I8 Reverse transcriptase domain-containing protein | 8.0e-122 | 45.7
Query:  QAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKVS
        + K+E++ LL +EE  W  R+R  WLK GD+NT++FH +ASQ +RRN I  I  N G     +  IG +  +YF +LF++S   D      LE I+P V+
Subjt:  QAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKVS

Query:  VQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKII
        V+    L KP+ R+E++  ++   P KA G DG+   FYQ +W  VG +     L  LN    ++ +N T ITLI K  NP K+T+F PISLCNV YKI+
Subjt:  VQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKII

Query:  AKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSV
        +K + NR K +L  +IS TQ+ F+PG+LI+DN+LV FE +H + ++  GKD  +A+KLDMSKAY+RVEW+FLK ++LK+GF+ KW   +M+C+  VS+S+
Subjt:  AKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSV

Query:  LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLD
        LING PQ   KP+RG+RQGDPLSPYLFL+CAEG  GLL    +N+ + G  ++   P LTHLFFADDSL+FCR+ R+EC+T+  +  +YE  SGQ IN D
Subjt:  LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLD

Query:  KSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
        K+    S +    +   I ++LG+   +    YL + S   RS+   F +IK KI   LQGW +++ S  G+EIL+K
Subjt:  KSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

A0A2N9GPZ7 Reverse transcriptase domain-containing protein | 2.3e-121 | 46.23
Query:  LQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKV
        L+ +++L  LLE+EE FWR RSR  W+  GDKNTK+FHA+ ++R+R NHI G+R  DG W+ ++ KI EIA DYF+ +F SS      I   L+ +   V
Subjt:  LQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKV

Query:  SVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKI
        +     +L   +T++E+   ++   P+KA G DG+ A FYQ YW IVG E  +  L IL+    +R +N T I LI KV NP+ IT+F PISLCNV YKI
Subjt:  SVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKI

Query:  IAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFS
        ++K +ANR KKVL  VIS  Q+ F+PG+LI+DNVLV FE +H++S K+KGK  Q+A+KLDMSKAY+RVEW+FL+ ++  +GF+ +W   +M C+  VS+S
Subjt:  IAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFS

Query:  VLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINL
        VLING     F  SRGIRQGD LSPYLFL+CAEG S LL+    +K L G   +   P LTHLFFADDSL+FC+++   C+ +  + ++YE ASGQ +N 
Subjt:  VLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINL

Query:  DKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
         K+    + +      + I +   + + KS   YL + S   RSK+  FG+IK ++   + GW E+  S  G+E+L+K
Subjt:  DKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

A0A2N9H0J9 Uncharacterized protein | 2.8e-119 | 42.6
Query:  RELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIA
        R+ EE +   RG D +L+              ++EL  LL +EE  W+ RSR +WLK GD+NTK+FH++A+ RKRRN +  +R   G   ED  +IG   
Subjt:  RELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIA

Query:  TDYFKNLFQSSRAKDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKT
          Y+++LFQ++  ++  +   L  I P V+ +   +L +PYT  E+   ++   P KA G DG+  +FYQ YW +VG+E V+  L  +N       +N T
Subjt:  TDYFKNLFQSSRAKDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKT

Query:  IITLILKVSNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWI
         + LI KV NP+ +TE+ PISLCNV YK+I+K +ANR K+VL +VI+ TQ+ F+PG+LI+DNVL+ FE +H + ++++G+   +A+KLDMSKAY+RVEW 
Subjt:  IITLILKVSNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWI

Query:  FLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLI
        FL++++LK+GF ++W   +M+C+  VS+S+LING P+    PSRG+RQGDP+SPYLFL+CAEG +GLL    +   + G  +    P LTHLFFADDSL+
Subjt:  FLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLI

Query:  FCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTG
        FCR+++ EC  I++L   YE+ASGQ +N  K+    S N        I  +LG+   +    YL + S   + K   F +IK ++ S ++GW E+L S  
Subjt:  FCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTG

Query:  GKEILLK
        G+EIL+K
Subjt:  GKEILLK

A0A2N9I335 Reverse transcriptase domain-containing protein | 7.5e-120 | 45.62
Query:  ETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITP
        E  + + EL  LLE EE +WR RSR  W++ GDKNTK+FHA  + R+  N I+G+R N+G  + D++K+  IA DYF+++F SS   D  I   L+ +  
Subjt:  ETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITP

Query:  KVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGY
         V+ +    L + +  EE+   ++   P+KA G DG+ A FYQ YW IVG E  +  L IL+    +  +N T I LI KV NP++IT+F PISLCNV Y
Subjt:  KVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGY

Query:  KIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVS
        KI++K +ANR KKVL  VIS +Q+ F+PG+LI+DNVLV FE +H++S K+ G+  Q+A+KLDMSKAY+RVEW+F++ ++ +LGF+  W + IM C++ VS
Subjt:  KIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVS

Query:  FSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAI
        +SVLING     FK SRGIRQGD LSPYLFL+CAEG S LL+  V  K ++G   +   P LTHLFFADDSL+FC+++   C  +  + ++YE  SGQ +
Subjt:  FSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAI

Query:  NLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
        N  K+    + N        I EV  + + KS   YL + S   RSK   FG++K ++   + GW E+  S GG+E+L+K
Subjt:  NLDKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

A0A2N9IPS8 Reverse transcriptase domain-containing protein | 2.3e-121 | 46.23
Query:  LQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKV
        L+ +++L  LLE+EE FWR RSR  W+  GDKNTK+FHA+ ++R+R NHI G+R  DG W+ ++ KI EIA DYF+ +F SS      I   L+ +   V
Subjt:  LQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECITPKV

Query:  SVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKI
        +     +L   +T++E+   ++   P+KA G DG+ A FYQ YW IVG E  +  L IL+    +R +N T I LI KV NP+ IT+F PISLCNV YKI
Subjt:  SVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKI

Query:  IAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFS
        ++K +ANR KKVL  VIS  Q+ F+PG+LI+DNVLV FE +H++S K+KGK  Q+A+KLDMSKAY+RVEW+FL+ ++  +GF+ +W   +M C+  VS+S
Subjt:  IAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFS

Query:  VLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINL
        VLING     F  SRGIRQGD LSPYLFL+CAEG S LL+    +K L G   +   P LTHLFFADDSL+FC+++   C+ +  + ++YE ASGQ +N 
Subjt:  VLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINL

Query:  DKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK
         K+    + +      + I +   + + KS   YL + S   RSK+  FG+IK ++   + GW E+  S  G+E+L+K
Subjt:  DKSMFMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLK

SwissProt top hits | e value | %identity
O00370 LINE-1 retrotransposable element ORF2 protein | 2.4e-35 | 28.22
Query:  QRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECIT-PKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQ
        +++ +N I+ I+++ G    D  +I     +Y+K+L+ +       +   L+  T P+++ ++   L++P T  EI A + S    K+ G DG  A FYQ
Subjt:  QRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALECIT-PKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQ

Query:  GYWSIVGEETVKVCLKILN--DDVDIRPLNKTIITLILKVSNPQKIT----EFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVL
         Y     EE V   LK+    +   I P N      I+ +  P + T     F PISL N+  KI+ K +ANR ++ +  +I   Q  FIPG     N+ 
Subjt:  GYWSIVGEETVKVCLKILN--DDVDIRPLNKTIITLILKVSNPQKIT----EFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVL

Query:  VGFECIHAISSKKKGKDR-QVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEG
           + I+ I    + KD+  V I +D  KA+++++  F+ + L KLG    +   I    +  + ++++NG+  E F    G RQG PLSP LF +  E 
Subjt:  VGFECIHAISSKKKGKDR-QVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEG

Query:  FSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEV---LGIKQTKSL
         +  ++ E   K + G ++      L+   FADD +++  +     Q + +L   + + SG  IN+ KS   + +N +  E+Q +GE+   +  K+ K L
Subjt:  FSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEV---LGIKQTKSL

Query:  GTYL
        G  L
Subjt:  GTYL

P08548 LINE-1 reverse transcriptase homolog | 6.7e-33 | 28
Query:  DKNTKWFHAKASQ---------RKRR--NHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALE-CITPKVSVQQRRELDKPYTREEIE
        +K+  WF  K ++         RK+R  + I  IR+ +     D  +I +I  +Y+K L+         I   LE C  P++S ++   L++P +  EI 
Subjt:  DKNTKWFHAKASQ---------RKRR--NHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAKDTLIKDALE-CITPKVSVQQRRELDKPYTREEIE

Query:  ATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTI----ITLILKV-SNPQKITEFLPISLCNVGYKIIAKTIANRFKKVL
        +T+++    K+ G DG  + FYQ +     EE V + L +  +      L  T     ITLI K   +P +   + PISL N+  KI+ K + NR ++ +
Subjt:  ATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTI----ITLILKV-SNPQKITEFLPISLCNVGYKIIAKTIANRFKKVL

Query:  DSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKP
          +I   Q  FIPG     N+      I  I +K K KD  + + +D  KA++ ++  F+   L K+G    +   I       + ++++NG   + F  
Subjt:  DSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKP

Query:  SRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKD
          G RQG PLSP LF +  E  +  ++ E   K + G  I +    L+   FADD +++  ++R     + E+ + Y   SG  IN  KS+  + +N   
Subjt:  SRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKD

Query:  RE---AQSIGEVLGIKQTKSLGTYL
         E     SI   +  K+ K LG YL
Subjt:  RE---AQSIGEVLGIKQTKSLGTYL

P11369 LINE-1 retrotransposable element ORF2 protein | 2.1e-31 | 25.88
Query:  ARELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEI
        A E +E N  KR     + +   + N  ET +  + +            +++R  + +  +K  K         + +  I  IR+  G    D  +I   
Subjt:  ARELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEI

Query:  ATDYFKNLFQSSRAK-DTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLN
           ++K L+ +     D + K       PK++  Q   L+ P + +EIEA + S    K+ G DG  A FYQ +   +     K+  KI  +        
Subjt:  ATDYFKNLFQSSRAK-DTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLN

Query:  KTIITLILK-VSNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRV
        +  ITLI K   +P KI  F PISL N+  KI+ K +ANR ++ + ++I P Q  FIPG     N+      IH I +K K K+  + I LD  KA++++
Subjt:  KTIITLILK-VSNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRV

Query:  EWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADD
        +  F+ ++L + G    + + I         ++ +NG   E      G RQG PLSPYLF +  E  +  ++ +   K + G +I      ++ L  ADD
Subjt:  EWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADD

Query:  SLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEV----LGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWS
         +++    +   + +  L   + E  G  IN +KSM  + +  K  E + I E     +     K LG  L    +    K   F  +K +I+  L+ W 
Subjt:  SLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVKDREAQSIGEV----LGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWS

Query:  ERLFSTGGKEILLK
        +   S  G+  ++K
Subjt:  ERLFSTGGKEILLK

P14381 Transposon TX1 uncharacterized 149 kDa protein | 5.1e-33 | 27.41
Query:  MSGFRSARELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDE
        +SG R+A E+E  N E    +  L+ +E  A   E L+ KE L  + + +      RSR   L   D+ +++F+A   ++  R  I  + + DG   ED 
Subjt:  MSGFRSARELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDE

Query:  LKIGEIATDYFKNLFQSSRAKDTLIKDALECI---TPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILND
          I + A  +++NLF    + D +  DA E +    P VS +++  L+ P T +E+   +R    +K+ G DG+   F+Q +W  +G +  +V  +    
Subjt:  LKIGEIATDYFKNLFQSSRAKDTLIKDALECI---TPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILND

Query:  DVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDM
                + +++L+ K  + + I  + P+SL +  YKI+AK I+ R K VL  VI P Q+  +PG+ I DNV +  + +H   +++ G      + LD 
Subjt:  DVDIRPLNKTIITLILKVSNPQKITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDM

Query:  SKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLT
         KA++RV+  +L   L    F  ++   +          V IN          RG+RQG PLS  L+ +  E F  LL+     K L G  +      + 
Subjt:  SKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCVEIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLT

Query:  HLFFADDSLIFCR-----SSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVK
           +ADD ++  +        +ECQ +      Y  AS   IN  KS  ++  ++K
Subjt:  HLFFADDSLIFCR-----SSRKECQTIKELFRRYEEASGQAINLDKSMFMVSSNVK

P92555 Uncharacterized mitochondrial protein AtMg01250 | 3.7e-15 | 52.94
Query:  LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDS
        +ING PQ    PSRG+RQGDPLSPYLF++C E  SGL +       L G R++N+ P + HL FADD+
Subjt:  LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDS

Arabidopsis top hits | e value | %identity
AT1G43760.1 DNAse I-like superfamily protein | 1.7e-15 | 28.18
Query:  SILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAK
        S L  N  D+ +     A+++        E F+R +SR  WL+ GD NT++FH      + +N I+ +R +D    E+  ++ E+   Y+ +L  S    
Subjt:  SILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDYFKNLFQSSRAK

Query:  DTLIKDALECITP----KVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSN
        D L  D+++ I      + +      L    + +EI A V +   +KA G D   A F+   W +V + T+    +       ++  N T ITLI KV+ 
Subjt:  DTLIKDALECITP----KVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSN

Query:  PQKITEFLPISLCNVGYKII
          +++ F P+S C V YKII
Subjt:  PQKITEFLPISLCNVGYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 3.0e-12 | 35.23
Query:  IANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMK
        +  R K ++ ++I P QA+FIPG++ +DN++   E +H++  +KKG    + +KLD+ KAY+R+ W +L++ L+  GF   W   I +
Subjt:  IANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 2.6e-16 | 52.94
Query:  LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDS
        +ING PQ    PSRG+RQGDPLSPYLF++C E  SGL +       L G R++N+ P + HL FADD+
Subjt:  LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDS
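
The %identity values can be related to the alignment blocks through the middle line of each block, where a letter marks an identical residue, `+` marks a similar one, and a blank marks a mismatch. The Python sketch below assumes the BLAST-style convention of identical columns divided by alignment length; the record does not say how the database computed its figures, so this is an illustration only, and `percent_identity` is a hypothetical helper.

```python
# One alignment block copied from the AtMg01250 hit above.
QUERY = "LINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDS"
MIDLINE = "+ING PQ    PSRG+RQGDPLSPYLF++C E  SGL +       L G R++N+ P + HL FADD+"


def percent_identity(query, midline):
    """Share of alignment columns whose middle-line character is a letter.

    Assumption: the aligned query line (gaps included) gives the alignment
    length, and only letters in the middle line count as identities.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


print(round(percent_identity(QUERY, MIDLINE), 2))  # 52.94
```

For this block the sketch counts 36 identical columns out of 68, which agrees with the 52.94 listed for the hit. The denominator is taken from the query line rather than the middle line so that trailing whitespace lost in copying does not change the result.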


Sequences
CDS sequence
ATGAGCGGATTCAGGTCAGCAAGAGAGCTTGAGGAATGCAATTGTGAGAAAAGAGGAGGAGATTCAATACTAACCGAGAATGAGGTGGATGCTAACTGGGCCGAAACCTT
ACAGGCTAAAGAAGAGCTCGAGATTTTGCTTGAGGAAGAAGAAGATTTTTGGAGGAGTCGATCTAGAGAAGTGTGGCTCAAAAGCGGGGACAAAAATACCAAGTGGTTTC
ATGCGAAAGCCTCCCAAAGGAAAAGGAGAAATCATATTGAAGGGATACGGTCGAATGATGGCTTTTGGGAGGAAGACGAGCTGAAGATTGGGGAAATTGCCACTGACTAC
TTCAAAAACCTATTTCAATCCTCAAGGGCTAAAGATACTCTGATAAAAGATGCACTTGAATGTATCACCCCTAAAGTTTCAGTGCAACAAAGGAGGGAGTTGGATAAACC
TTACACTAGAGAAGAGATTGAGGCCACGGTGAGAAGCTTCAATCCAAGTAAAGCGCTCGGAAAAGATGGAGTCCACGCCTCATTCTACCAAGGGTACTGGAGCATTGTGG
GGGAGGAGACAGTGAAAGTTTGCTTGAAAATTCTAAACGATGATGTTGATATTAGACCACTAAACAAGACTATCATTACGCTAATCCTGAAGGTCTCCAACCCTCAAAAG
ATTACCGAATTTCTGCCAATTAGTCTCTGCAATGTGGGGTATAAAATTATTGCCAAGACGATTGCAAACAGATTCAAAAAAGTGCTAGACTCGGTCATATCCCCTACTCA
AGCGACTTTTATTCCGGGAAAACTCATATCGGATAATGTGTTAGTTGGTTTTGAATGTATCCACGCGATAAGTAGCAAGAAAAAAGGGAAAGACAGGCAGGTTGCTATCA
AGCTAGATATGTCCAAGGCCTACAATCGTGTAGAATGGATATTTCTAAAGGAGATGCTCCTAAAGCTAGGTTTTAGCAACAAATGGACTCATAACATCATGAAATGTGTG
GAGATAGTGTCATTCTCGGTGCTAATCAACGGAAGACCTCAAGAGGAATTCAAGCCTAGCCGAGGAATCAGACAGGGAGATCCCTTATCACCTTACCTATTCCTGGTGTG
TGCAGAAGGCTTCTCAGGGCTCCTAAAAATGGAAGTTTCCAATAAAAACTTAGCTGGTTTTAGAATTAACAATCATTGCCCGCCTCTAACTCACTTATTCTTCGCTGATG
ATAGCCTTATTTTTTGCAGGTCAAGTAGAAAGGAGTGTCAAACTATTAAGGAGTTGTTTAGAAGATATGAGGAAGCCTCCGGGCAAGCTATTAACCTGGATAAATCTATG
TTTATGGTGAGCAGTAATGTGAAAGATAGGGAGGCACAAAGCATTGGCGAGGTGCTAGGAATTAAACAAACAAAGTCATTGGGCACATACCTCGAAATGTCATCTCAAAG
CGCAAGAAGCAAGGCCAGATTGTTTGGAAAGATCAAATCCAAAATCGAGAGCATTTTACAAGGGTGGAGCGAGAGGCTGTTTTCCACGGGGGGCAAGGAAATACTTCTAA
AAGGACGACAGGACTTCTATGGAGTAGTGGTGGGAATGGGTTTTGAGCAAGCTGAACACAGAGGAAAGCGAGAATGCAGTGCTTCTATGCTGGACCTTATGGAGTTTCAG
GAACAAGATAACAATCGAGAATGCAAGAGCAGATCACAACAAACTCATCCGATTAATCAAGATGAATTTCTCAGAACAAAGGAGTAG
mRNA sequence
ATGAGCGGATTCAGGTCAGCAAGAGAGCTTGAGGAATGCAATTGTGAGAAAAGAGGAGGAGATTCAATACTAACCGAGAATGAGGTGGATGCTAACTGGGCCGAAACCTT
ACAGGCTAAAGAAGAGCTCGAGATTTTGCTTGAGGAAGAAGAAGATTTTTGGAGGAGTCGATCTAGAGAAGTGTGGCTCAAAAGCGGGGACAAAAATACCAAGTGGTTTC
ATGCGAAAGCCTCCCAAAGGAAAAGGAGAAATCATATTGAAGGGATACGGTCGAATGATGGCTTTTGGGAGGAAGACGAGCTGAAGATTGGGGAAATTGCCACTGACTAC
TTCAAAAACCTATTTCAATCCTCAAGGGCTAAAGATACTCTGATAAAAGATGCACTTGAATGTATCACCCCTAAAGTTTCAGTGCAACAAAGGAGGGAGTTGGATAAACC
TTACACTAGAGAAGAGATTGAGGCCACGGTGAGAAGCTTCAATCCAAGTAAAGCGCTCGGAAAAGATGGAGTCCACGCCTCATTCTACCAAGGGTACTGGAGCATTGTGG
GGGAGGAGACAGTGAAAGTTTGCTTGAAAATTCTAAACGATGATGTTGATATTAGACCACTAAACAAGACTATCATTACGCTAATCCTGAAGGTCTCCAACCCTCAAAAG
ATTACCGAATTTCTGCCAATTAGTCTCTGCAATGTGGGGTATAAAATTATTGCCAAGACGATTGCAAACAGATTCAAAAAAGTGCTAGACTCGGTCATATCCCCTACTCA
AGCGACTTTTATTCCGGGAAAACTCATATCGGATAATGTGTTAGTTGGTTTTGAATGTATCCACGCGATAAGTAGCAAGAAAAAAGGGAAAGACAGGCAGGTTGCTATCA
AGCTAGATATGTCCAAGGCCTACAATCGTGTAGAATGGATATTTCTAAAGGAGATGCTCCTAAAGCTAGGTTTTAGCAACAAATGGACTCATAACATCATGAAATGTGTG
GAGATAGTGTCATTCTCGGTGCTAATCAACGGAAGACCTCAAGAGGAATTCAAGCCTAGCCGAGGAATCAGACAGGGAGATCCCTTATCACCTTACCTATTCCTGGTGTG
TGCAGAAGGCTTCTCAGGGCTCCTAAAAATGGAAGTTTCCAATAAAAACTTAGCTGGTTTTAGAATTAACAATCATTGCCCGCCTCTAACTCACTTATTCTTCGCTGATG
ATAGCCTTATTTTTTGCAGGTCAAGTAGAAAGGAGTGTCAAACTATTAAGGAGTTGTTTAGAAGATATGAGGAAGCCTCCGGGCAAGCTATTAACCTGGATAAATCTATG
TTTATGGTGAGCAGTAATGTGAAAGATAGGGAGGCACAAAGCATTGGCGAGGTGCTAGGAATTAAACAAACAAAGTCATTGGGCACATACCTCGAAATGTCATCTCAAAG
CGCAAGAAGCAAGGCCAGATTGTTTGGAAAGATCAAATCCAAAATCGAGAGCATTTTACAAGGGTGGAGCGAGAGGCTGTTTTCCACGGGGGGCAAGGAAATACTTCTAA
AAGGACGACAGGACTTCTATGGAGTAGTGGTGGGAATGGGTTTTGAGCAAGCTGAACACAGAGGAAAGCGAGAATGCAGTGCTTCTATGCTGGACCTTATGGAGTTTCAG
GAACAAGATAACAATCGAGAATGCAAGAGCAGATCACAACAAACTCATCCGATTAATCAAGATGAATTTCTCAGAACAAAGGAGTAG
Protein sequence
MSGFRSARELEECNCEKRGGDSILTENEVDANWAETLQAKEELEILLEEEEDFWRSRSREVWLKSGDKNTKWFHAKASQRKRRNHIEGIRSNDGFWEEDELKIGEIATDY
FKNLFQSSRAKDTLIKDALECITPKVSVQQRRELDKPYTREEIEATVRSFNPSKALGKDGVHASFYQGYWSIVGEETVKVCLKILNDDVDIRPLNKTIITLILKVSNPQK
ITEFLPISLCNVGYKIIAKTIANRFKKVLDSVISPTQATFIPGKLISDNVLVGFECIHAISSKKKGKDRQVAIKLDMSKAYNRVEWIFLKEMLLKLGFSNKWTHNIMKCV
EIVSFSVLINGRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKMEVSNKNLAGFRINNHCPPLTHLFFADDSLIFCRSSRKECQTIKELFRRYEEASGQAINLDKSM
FMVSSNVKDREAQSIGEVLGIKQTKSLGTYLEMSSQSARSKARLFGKIKSKIESILQGWSERLFSTGGKEILLKGRQDFYGVVVGMGFEQAEHRGKRECSASMLDLMEFQ
EQDNNRECKSRSQQTHPINQDEFLRTKE
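
The CDS and protein sequences above are related by codon translation. The Python sketch below assumes the standard genetic code (the record does not name a translation table) and uses a hypothetical helper, `translate`; it is checked here only against the first and last few codons of the CDS rather than the full sequence.

```python
# Standard genetic code, ordered by first, second and third base in T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3].upper()]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGAGCGGATTCAGGTCAGCA"))  # MSGFRSA
print(translate("ACAAAGGAGTAG"))           # TKE
```

Both fragments match the corresponding ends of the protein sequence (`MSGFRSA...` and `...TKE`), with the terminal `TAG` read as a stop codon.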