; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy06g017880 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy06g017880
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr06:41212674..41215440
RNA-Seq ExpressionLcy06g017880
SyntenyLcy06g017880
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis]2.5e-11238.84Show/hide
Query:  SETENMDEQVVAMGKNDQRSNPIVEETKEKWGPGEEIKKEDLDVSIQYFSEGHIDAMISRKN-DGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLR
        SET+ M +Q+ A  +  +  N  V +     G    +   D+ + ++ +S+ HIDA+I  +N   WR +  YG+PE+E++ H+W+LL RL        L 
Subjt:  SETENMDEQVVAMGKNDQRSNPIVEETKEKWGPGEEIKKEDLDVSIQYFSEGHIDAMISRKN-DGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLR

Query:  IIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLIDAGYRG--TTWA-------------------------------------------------
           GDFNEI + +EK GG +RNP ++  FR  +  C+L+D G +G   TW+                                                 
Subjt:  IIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLIDAGYRG--TTWA-------------------------------------------------

Query:  --------ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIK
                E  K E +++ +L++EE FWK RSR  WL+ GDKNTK+FH KAS R+++N I GI  +   W ED  ++ +I  ++F  LF ++ P    + 
Subjt:  --------ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIK

Query:  DMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCP
           +  + KV++    +LD P+  EEI   +    P+KAPG DG+ A+F+Q +W  V E  +  CL ILND  ++ PLN T I+LIPK + P+ +++F P
Subjt:  DMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCP

Query:  ISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNI
        ISLCNV Y+IIAK+++N  K +LD ++SP Q+AFI  RLI+DN+++G+E ++ I   K  K+G +A+KLD+SKAYDRVEW FL+  + KLGFS+ W    
Subjt:  ISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNI

Query:  MKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSSK
        M C+ T SFSVLIN  P+ + +P RG+RQG PLSPYLFL+CAE FS +L +   N  + G R N+    ++HL FADDSL+F R+++
Subjt:  MKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSSK

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]4.6e-11136.41Show/hide
Query:  DLDVSIQYFSEGHIDAMISRKNDG--WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLI
        ++++ ++ ++  HIDA+IS  +    WR +GFYG+PET KR  SW+LL  L        L +  GDFNEI+S +EK GGA R+  QMD FR+ ++YC   
Subjt:  DLDVSIQYFSEGHIDAMISRKNDG--WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLI

Query:  DAGYRGT--TWA----------------------------------------------------------------------------------------
        D GY G   TW+                                                                                        
Subjt:  DAGYRGT--TWA----------------------------------------------------------------------------------------

Query:  ---------------------------------------------------ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRR
                                                           E ++  EE+  LL++EE +W  R++  WL+ GD+NTK+FH +AS+R+++
Subjt:  ---------------------------------------------------ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRR

Query:  NHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIV
        N I GI  +   W ++E  I + A  YF N++ SS P  S I+++ E I  KV++     L R +T+EE+   ++  HP+KAPG DG+ A F+Q YW+IV
Subjt:  NHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIV

Query:  GEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWK
        G    ++ L +LN +  I  LNKT ISLIPK ++P++M DF PISLCNV YK+I+K ++NR K +L  +IS  Q+AF   RLI+DNVLV FE +H ++ K
Subjt:  GEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWK

Query:  KKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNN
          GK G +AIKLDMSKA+DRVEW F+ +++ ++GF N+W   +M+C+ +VS+S+LIN        PSRG+RQGDPLSP LFL+CAEG S L+ +   N  
Subjt:  KKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNN

Query:  LAGFRINNHCPPLTHLFFADDSLIFCRSS
        + G  IN  CP +THLFFADDS++FC+++
Subjt:  LAGFRINNHCPPLTHLFFADDSLIFCRSS

XP_030931246.1 uncharacterized protein LOC115957168 [Quercus lobata]1.0e-11038.94Show/hide
Query:  NPIVEETKEKWGPGEEIKKEDLDVSIQYFSEGHIDAMISRKNDG--WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGA
        N I+ +     G    I K  +DV +  F++ H  A + R+ DG  W  +GFYG P++ ++  SW LL  L   V EG    I GDFN I+   EK    
Subjt:  NPIVEETKEKWGPGEEIKKEDLDVSIQYFSEGHIDAMISRKNDG--WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGA

Query:  KRNPIQMDLFRNNIDYCKLIDAGYRG--TTW---------------------------------------------------------------------
             QMD F+  ++ C L D G+ G   TW                                                                     
Subjt:  KRNPIQMDLFRNNIDYCKLIDAGYRG--TTW---------------------------------------------------------------------

Query:  ----------------AETSKAE-----EELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQ
                         + SK E     +EL+ LL ++E +W   SR  WL++GDKNTK+FH KAS R+RRN I+GIR++D+ W ED   IG++AT YF+
Subjt:  ----------------AETSKAE-----EELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQ

Query:  NLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLI
         +F +   +   +++ L  +  K+++  +  L R Y+ +EI+A +    P+KAPG DG++A FYQ +WNIVG++ V   L  LN       +N T I LI
Subjt:  NLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLI

Query:  PKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEM
        PK+ SP+KM+D+ PISLCNV YKII+K ++N+ K++L  +IS TQ+AF+P RLI+DN+LV +EC+HA++++KKGK G +A+KLD+SKAYDRVEW FLK +
Subjt:  PKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEM

Query:  LLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSS
        + K+GF   W   +M CV T SFSV IN +P     PSRGIRQGDPLSPYLFL+CAEGF+ LL +      + G  I    P +++L FADDSL+FC+++
Subjt:  LLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSS

Query:  K
        +
Subjt:  K

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]5.1e-11037.82Show/hide
Query:  KEDLDVSIQYFSEGHIDAMISRKNDG--WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCK
        KE + V I  +++ HIDA I    DG  W F+GFYGNP+T +R  SW  L+ L  +     L I  GDFNEI    EK GG  R   QM+ F + I+YC 
Subjt:  KEDLDVSIQYFSEGHIDAMISRKNDG--WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCK

Query:  LIDAGYRG----------------------------------------------------------------------TTWAETSKAEE-----------
          +  + G                                                                      + W + S+ EE           
Subjt:  LIDAGYRG----------------------------------------------------------------------TTWAETSKAEE-----------

Query:  --------------------------------------ELEIL-----------------------LEEEEDFWKSRSREVWLENGDKNTKWFHVKASQR
                                              +LE L                       LE+E++ W+ RSR  W + GD+NT +FH KAS R
Subjt:  --------------------------------------ELEIL-----------------------LEEEEDFWKSRSREVWLENGDKNTKWFHVKASQR

Query:  KRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYW
         ++N+I+GI  +   W+EDELKI ++A  YF+ LF SS+PE+    D+L  + PKV+     EL R YT +E+   ++  +P KAPG DG+   F+Q +W
Subjt:  KRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYW

Query:  NIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAI
        N  GE   +  L  LN        N+T I LIPK++ P+ ++D+ PISLCNV YKI +K I+NR K+ L S+IS TQ+AF+ GRLI+DNVLV FE +H I
Subjt:  NIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAI

Query:  NWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVS
        + KK GK G++AIKLDMSKAYDRVEW+F+++++ KLGF       IM+C+ TVS+++ IN RP+    PSRGIRQGDPLSPYLFL+CAEG S L+K  V 
Subjt:  NWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVS

Query:  NNNLAGFRINNHCPPLTHLFFADDSLIFCRSS
        N ++ G  I    P L+HLFFADDSLIFC+++
Subjt:  NNNLAGFRINNHCPPLTHLFFADDSLIFCRSS

XP_030939696.1 uncharacterized protein LOC115964548 [Quercus lobata]6.7e-11041.95Show/hide
Query:  KKEDLDVSIQYFSEGHIDAMISRKNDG-WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCK
        K+ED+ V +  FS+ HIDA++   +D  W  +GFYG P+T +R   W++L  L          +  GDFNE++   EK GGA+R+   M  FR+     +
Subjt:  KKEDLDVSIQYFSEGHIDAMISRKNDG-WRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCK

Query:  LIDAGYRGTTWAE----------TSKAEEELEILLE----------EEEDFW------------KSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIR
         +D G     W                 +   +LL           + + FW             +RSR  W+++GDKNT++FH  A+ RKRRN I+G+R
Subjt:  LIDAGYRGTTWAE----------TSKAEEELEILLE----------EEEDFW------------KSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIR

Query:  SKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNV
          + +W  +E    KI  D++  LF +S P++  ++ +LE I   VS     +L  PY  EE+E  ++   P KAPG DG+   FYQ YW  VG +    
Subjt:  SKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNV

Query:  CLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQ
         L  LN  + +K +N T I+LIPKV +P+++++F PISLCNV YKI++K I+NR K +L+S+IS TQ+AFI  RLI+DNVL+ FE +H +      K+G 
Subjt:  CLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQ

Query:  VAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRIN
        +A+KLDMSKAYDRVEW FL+++LLK+GF N W   IM+C+ TVS+S+LIN  PQ    P++G+ QGDPLSPYLFL CAEG + LL+R   +  + GF I+
Subjt:  VAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRIN

Query:  NHCPPLTHLFFADDSLIFCRSS
           P LT LFFADD L+FCRS+
Subjt:  NHCPPLTHLFFADDSLIFCRSS

TrEMBL top hitse value%identityAlignment
A0A2N9FFZ2 Reverse transcriptase domain-containing protein1.7e-11135.97Show/hide
Query:  KEDLDVSIQYFSEGHIDAMIS-RKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRI---IGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDY
        K+D+ +S+Q FS  HIDA+++  + D WRF+GFYG PET KR  SW+LL RL        L++     GDFNE+V  +EK G   R+  QM LFR+ +D 
Subjt:  KEDLDVSIQYFSEGHIDAMIS-RKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRI---IGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDY

Query:  CKLIDAGYRG------------------------------------------------------------------------------------------
        C  +D G+ G                                                                                          
Subjt:  CKLIDAGYRG------------------------------------------------------------------------------------------

Query:  ----TTWAE-------------------TSKAEE--------------------------ELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRK
            T W +                   TS+ +E                          EL  LL +EE  W+ RSR  WL  GD+NT++FH +A+QRK
Subjt:  ----TTWAE-------------------TSKAEE--------------------------ELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRK

Query:  RRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWN
        R+NH+  +R++D  W   + ++  +  +Y+++LFQ++ P+   ++ ++E I   V+     +L   +T  E+E  ++   P KAPG D +   FYQ YW+
Subjt:  RRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWN

Query:  IVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAIN
        ++G +     L  LN    +K +N T I+LIPKV +P+++ +F PISLCNV YK+I+K ++NR K +L S++  +Q+AFIPGRLI+DN+LV FE +H + 
Subjt:  IVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAIN

Query:  WKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSN
         +K GK+G +A+KLDMSKAYDRVEW +LK ++ K+GF NKW   +M+C+ TVS+S+L+N  P    KPSRG+RQGDPLSPYLFL+CAEG   L+++E  +
Subjt:  WKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSN

Query:  NNLAGFRINNHCPPLTHLFFADDSLIFCRSS
          L G  I+   P +THLFFADDSL+FC+++
Subjt:  NNLAGFRINNHCPPLTHLFFADDSLIFCRSS

A0A2N9GDB5 Reverse transcriptase domain-containing protein1.4e-11334.73Show/hide
Query:  NRSPATLQS----KAEARDEKDSIQLMHQLVVSEATGKRVINGDKKETSETENMDEQVVAMGKNDQRSNPI-------------VEETKEKWGPGEEIKK
        N  P+TL+S     A ++      Q + ++ +S+ T + VI   K+ T   +   +++     + +   P+             V  ++ K G    + K
Subjt:  NRSPATLQS----KAEARDEKDSIQLMHQLVVSEATGKRVINGDKKETSETENMDEQVVAMGKNDQRSNPI-------------VEETKEKWGPGEEIKK

Query:  EDLDVSIQYFSEGHIDAMIS-RKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLI
        +++ + +  +S  HIDA+++  + D WRF+GFYG PET KR  SWNLL RL   +      +  GDFNE+V  +EK G   R+  QM LFR+ +D C L+
Subjt:  EDLDVSIQYFSEGHIDAMIS-RKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLI

Query:  DAGYRGT--TWA----------------------------------------------------------------------------------------
        D G+ G   TW                                                                                         
Subjt:  DAGYRGT--TWA----------------------------------------------------------------------------------------

Query:  ----------------ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSS
                          S+ ++EL  LL +EE  W+ RSR  WL+ GDKNT++FH +A+QR+RRN+I  +R+    W  D  ++  +  D++ +LFQ+ 
Subjt:  ----------------ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSS

Query:  RPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSP
         P+   ++ ++E I   V+     +L + +T  E+E  ++   P KAPG DG+   FYQ YW+++G++  +  L  LN    +K +N T I+LIPKV +P
Subjt:  RPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSP

Query:  QKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGF
        +++ +F PISLCNV YKII+K ++NR K +L  ++S +Q+AFIPGRLI+DN+LV FE +H +  +K GK+G +A+KLDMSKAYDRVEW FL+ ++ K+GF
Subjt:  QKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGF

Query:  SNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSS
          KW   +M+C+ TVS+S+L+N  P    KPSRG+RQGDPLSPYLFL+CAEG   L+++E +   L G  I+   P +THLFFADDSL+FC+++
Subjt:  SNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSS

A0A2N9HWM9 Reverse transcriptase domain-containing protein1.0e-11136.48Show/hide
Query:  KEKWGPGEEIKKED-LDVSIQYFSEGHIDAMI-SRKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQM
        +E++G G  +  +D +D+ IQ +S+ HID  + +   + WRF+GFYG+P+T  R HSW LL RL        + ++ GDFNEI S DEK G   R P QM
Subjt:  KEKWGPGEEIKKED-LDVSIQYFSEGHIDAMI-SRKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQM

Query:  DLFRNNIDYCKLIDAGYRG-------------------------TTWAET--------------------------------------------------
          FR  ++ C L D G+ G                         T W +                                                   
Subjt:  DLFRNNIDYCKLIDAGYRG-------------------------TTWAET--------------------------------------------------

Query:  ----------------------------------SKAE-----------------------------------EELEILLEEEEDFWKSRSREVWLENGD
                                          SK++                                    +L  LL +EE +W+ RSR  WL  GD
Subjt:  ----------------------------------SKAE-----------------------------------EELEILLEEEEDFWKSRSREVWLENGD

Query:  KNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPE--DSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKA
        +NT +FH  A+QRK+ N I GIR  +++W  D++ I ++  +YF  ++ SS P   D+V +++   ++P +++    +L  P+TREE+   +    PSKA
Subjt:  KNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPE--DSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKA

Query:  PGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRL
        PG DG+ A F+Q +W+IVG +  +  L  LN+   +K LN T I+LIPKV SP+ M  F PISLCNV YKII+K + NR K +L  V+S +Q+AF+PGR+
Subjt:  PGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRL

Query:  ISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFL
        ISDN+++ FE +H +  K+ GK  Q+A+KLDMSKAYDRVEW +LK+M+LKLGF+ +W   IM+CV +VS+S+L+N  P+   KPSRG+RQGDPLSPYLFL
Subjt:  ISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFL

Query:  VCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSS
        +CAEG + LL++    + + G  I    P ++HLFFADDSLIFCR++
Subjt:  VCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSS

A0A2N9HYE3 Reverse transcriptase domain-containing protein1.7e-11135.97Show/hide
Query:  KEDLDVSIQYFSEGHIDAMIS-RKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRI---IGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDY
        K+D+ +S+Q FS  HIDA+++  + D WRF+GFYG PET KR  SW+LL RL        L++     GDFNE+V  +EK G   R+  QM LFR+ +D 
Subjt:  KEDLDVSIQYFSEGHIDAMIS-RKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRI---IGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDY

Query:  CKLIDAGYRG------------------------------------------------------------------------------------------
        C  +D G+ G                                                                                          
Subjt:  CKLIDAGYRG------------------------------------------------------------------------------------------

Query:  ----TTWAE-------------------TSKAEE--------------------------ELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRK
            T W +                   TS+ +E                          EL  LL +EE  W+ RSR  WL  GD+NT++FH +A+QRK
Subjt:  ----TTWAE-------------------TSKAEE--------------------------ELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRK

Query:  RRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWN
        R+NH+  +R++D  W   + ++  +  +Y+++LFQ++ P+   ++ ++E I   V+     +L   +T  E+E  ++   P KAPG D +   FYQ YW+
Subjt:  RRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWN

Query:  IVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAIN
        ++G +     L  LN    +K +N T I+LIPKV +P+++ +F PISLCNV YK+I+K ++NR K +L S++  +Q+AFIPGRLI+DN+LV FE +H + 
Subjt:  IVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAIN

Query:  WKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSN
         +K GK+G +A+KLDMSKAYDRVEW +LK ++ K+GF NKW   +M+C+ TVS+S+L+N  P    KPSRG+RQGDPLSPYLFL+CAEG   L+++E  +
Subjt:  WKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSN

Query:  NNLAGFRINNHCPPLTHLFFADDSLIFCRSS
          L G  I+   P +THLFFADDSL+FC+++
Subjt:  NNLAGFRINNHCPPLTHLFFADDSLIFCRSS

A0A2N9IPS8 Reverse transcriptase domain-containing protein1.4e-11337.52Show/hide
Query:  LDVSIQYFSEGHIDAMI--SRKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLID
        LDV +  +S  HIDA I    K  G+R +GFYGNPET KR  SW LL+ L        L +  GDFNEI+ ++E+ G   R   Q+  FR  + +C L D
Subjt:  LDVSIQYFSEGHIDAMI--SRKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLID

Query:  AGYRGT--TW------------------------------------------------------------------------------------------
         GY G   TW                                                                                          
Subjt:  AGYRGT--TW------------------------------------------------------------------------------------------

Query:  -----------AETS---------------------------------------KAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRR
                     TS                                       + +++L  LLE+EE FW+ RSR  W+  GDKNTK+FH + ++R+R 
Subjt:  -----------AETS---------------------------------------KAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRR

Query:  NHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIV
        NHI G+R +D +W+ ++ KI +IA DYFQ +F SS P    I  +L+ +   V++    +L   +T++E+   ++  +P+KAPG DG+ A FYQ YW+IV
Subjt:  NHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIV

Query:  GEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWK
        G E     L IL+    ++ +N T I+LIPKV +P+ + DF PISLCNV YKI++K ++NR K+VL  VIS  Q+AF+PGRLI+DNVLV FE +H+++ K
Subjt:  GEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWK

Query:  KKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNN
        +KGK GQ+A+KLDMSKAYDRVEW+FL+ ++  +GF+ +W   +M C+ +VS+SVLIN      F  SRGIRQGD LSPYLFL+CAEG S LL++   +  
Subjt:  KKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNN

Query:  LAGFRINNHCPPLTHLFFADDSLIFCRSS
        L G   +   P LTHLFFADDSL+FC+++
Subjt:  LAGFRINNHCPPLTHLFFADDSLIFCRSS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein6.0e-2927.37Show/hide
Query:  ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECIT-
        E +K   EL+ +  ++     + SR  + E  +K  +       +++ +N I+ I++       D  +I     +Y+++L+ +       +   L+  T 
Subjt:  ETSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECIT-

Query:  PKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILN--DDADIKP--LNKTIISLIPKVS-SPQKMADFCPIS
        P+++  +   L+RP T  EI A + S    K+PG DG  A FYQ Y     EE V   LK+    +   I P    +  I LIPK      K  +F PIS
Subjt:  PKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILN--DDADIKP--LNKTIISLIPKVS-SPQKMADFCPIS

Query:  LCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMK
        L N+  KI+ K ++NR ++ +  +I   Q  FIPG     N+      I  IN  K      V I +D  KA+D+++  F+ + L KLG    +   I  
Subjt:  LCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMK

Query:  CVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIF
          +  + ++++N +  E F    G RQG PLSP LF +  E  +  +++E     + G ++      L+   FADD +++
Subjt:  CVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIF

P11369 LINE-1 retrotransposable element ORF2 protein1.0e-2828.18Show/hide
Query:  IEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPE-DSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVG
        I  IR++      D  +I      +++ L+ +     D + K +     PK++  Q   L+ P + +EIEA + S    K+PG DG  A FYQ +   + 
Subjt:  IEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPE-DSVIKDMLECITPKVSDHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVG

Query:  EETVNVCLKILNDDADIKPLNKTIISLIPK-VSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWK
             +  KI  +        +  I+LIPK    P K+ +F PISL N+  KI+ K ++NR +  + ++I P Q  FIPG     N+      IH IN K
Subjt:  EETVNVCLKILNDDADIKPLNKTIISLIPK-VSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWK

Query:  KKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNN
         K K+  + I LD  KA+D+++  F+ ++L + G    + + I         ++ +N    E      G RQG PLSPYLF +  E  +  ++++     
Subjt:  KKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNN

Query:  LAGFRINNHCPPLTHLFFADDSLIFCRSSK
        + G +I      ++ L  ADD +++    K
Subjt:  LAGFRINNHCPPLTHLFFADDSLIFCRSSK

P14381 Transposon TX1 uncharacterized 149 kDa protein1.5e-3228.09Show/hide
Query:  RSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEAT
        RSR   L + D+ +++F+    ++  R  I  + ++D    ED   I   A  ++QNLF          +++ + + P VS+ ++  L+ P T +E+   
Subjt:  RSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELDRPYTREEIEAT

Query:  VRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPT
        +R    +K+PG DG+   F+Q +W+ +G +   V  +            + ++SL+PK    + + ++ P+SL +  YKI+AK IS R K VL  VI P 
Subjt:  VRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISPT

Query:  QAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQG
        Q+  +PGR I DNV +  + +H   + ++       + LD  KA+DRV+  +L   L    F  ++   +     +    V IN          RG+RQG
Subjt:  QAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQG

Query:  DPLSPYLFLVCAEGFSGLLKREVS
         PLS  L+ +  E F  LL++ ++
Subjt:  DPLSPYLFLVCAEGFSGLLKREVS

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM6.7e-1226.55Show/hide
Query:  DHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKII
        DH    +    T +++ A+  S   S +PG DGI     +   + +    +N+ L   N    I+ L +T+   IPK  + ++  DF PIS+ +V  + +
Subjt:  DHQRRELDRPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKII

Query:  AKTISNRFKRVLDSVISPTQAAFIPGRLISDN-VLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFS
           ++ R    ++    P Q  F+P    +DN  +V     H+    K  +S  +A  LD+SKA+D +    + + L   G    +   +    E    S
Subjt:  AKTISNRFKRVLDSVISPTQAAFIPGRLISDN-VLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFS

Query:  VLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRS
        +  +    EEF P+RG++QGDPLSP LF +  +     L  E+      G  I N         FADD ++F  +
Subjt:  VLINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRS

P92555 Uncharacterized mitochondrial protein AtMg012505.0e-1552.94Show/hide
Query:  LINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDS
        +IN  PQ    PSRG+RQGDPLSPYLF++C E  SGL +R      L G R++N+ P + HL FADD+
Subjt:  LINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein9.9e-1930.65Show/hide
Query:  EDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITP----KVSDHQRRELDR
        E F++ +SR  WL++GD NT++FH      + +N I+ +R  DD+  E+  ++ ++   Y+ +L  S    D +  D ++ I      + +D     L  
Subjt:  EDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITP----KVSDHQRRELDR

Query:  PYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRF
          + +EI A V +   +KAPG D   A F+   W +V + T+    +       +K  N T I+LIPKV+   +++ F P+S C V YKII  T + RF
Subjt:  PYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRF

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.1e-1440Show/hide
Query:  RFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMK
        R K ++ ++I P QA+FIPGR+ +DN++   E +H++  +KKG  G + +KLD+ KAYDR+ W +L++ L+  GF   W   I +
Subjt:  RFKRVLDSVISPTQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMK

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.5e-1652.94Show/hide
Query:  LINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDS
        +IN  PQ    PSRG+RQGDPLSPYLF++C E  SGL +R      L G R++N+ P + HL FADD+
Subjt:  LINRRPQEEFKPSRGIRQGDPLSPYLFLVCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGGGAGAGAACAACAACGAGGCCCCAACGGTCAGTGCTACAGAGAAGAATGAAGAAAGCACACCAGAGAGGAGGAATCGGTCGCCAGCGACACTTCAGAGCAA
AGCAGAGGCGAGGGATGAGAAGGACAGTATACAATTAATGCATCAGCTAGTGGTGTCAGAGGCTACAGGAAAAAGAGTCATTAATGGGGACAAAAAAGAAACATCGGAAA
CAGAGAACATGGATGAGCAAGTGGTAGCTATGGGAAAGAATGATCAGAGAAGCAACCCTATTGTAGAAGAGACAAAAGAGAAGTGGGGCCCAGGGGAAGAAATAAAGAAA
GAGGATCTAGATGTTTCGATCCAGTATTTTTCAGAGGGGCATATTGATGCGATGATTAGTAGGAAAAACGATGGGTGGAGATTTTCGGGCTTCTACGGAAACCCGGAAAC
AGAGAAAAGGATTCATTCGTGGAATTTGTTGGAGAGGCTTTTTGAGAGTGTGGATGAAGGTACCCTTCGGATCATTGGAGGGGACTTCAATGAGATTGTCTCGGATGACG
AGAAAAGCGGGGGGGCCAAAAGAAATCCAATCCAAATGGATTTGTTCAGAAACAATATAGATTACTGCAAGCTCATTGATGCGGGCTACAGAGGCACCACCTGGGCCGAA
ACCTCAAAGGCTGAAGAAGAGCTCGAGATTCTGCTAGAGGAAGAAGAAGATTTTTGGAAGAGTCGGTCTAGAGAAGTGTGGCTTGAAAACGGGGACAAAAATACCAAATG
GTTCCACGTGAAAGCGTCCCAAAGGAAAAGAAGAAATCATATTGAAGGCATAAGGTCGAAGGATGACCTTTGGGAGGAGGATGAGCTAAAGATTGGGAAAATAGCCACGG
ATTACTTTCAAAACCTTTTCCAGTCCTCTAGGCCCGAAGATAGTGTGATAAAAGATATGCTTGAGTGTATCACCCCTAAAGTTTCGGATCACCAAAGGAGGGAGCTGGAT
AGACCTTACACCAGAGAAGAGATTGAAGCCACAGTGAGAAGTTTTCATCCAAGTAAAGCACCAGGGAAAGATGGGATCCACGCCTCATTCTATCAAGGGTACTGGAACAT
TGTGGGGGAGGAAACTGTGAATGTTTGCTTGAAAATCCTAAACGATGATGCGGATATAAAACCTCTAAACAAGACTATTATCTCGCTAATCCCAAAGGTATCCAGCCCCC
AAAAGATGGCAGATTTTTGCCCAATCAGTCTCTGCAATGTGGGATATAAAATTATTGCCAAAACGATTTCTAATAGGTTCAAAAGAGTGCTAGACTCGGTCATATCCCCT
ACGCAAGCGGCCTTTATTCCGGGAAGACTCATATCGGACAATGTGCTTGTGGGCTTTGAATGTATCCACGCGATAAACTGGAAGAAAAAAGGAAAAAGTGGTCAGGTTGC
TATCAAGCTCGATATGTCCAAGGCATACGATCGGGTTGAATGGATTTTCCTAAAGGAGATGCTCCTAAAGCTGGGATTTAGTAACAAATGGACTCATAACATCATGAAAT
GTGTGGAAACGGTGTCATTCTCGGTGCTAATCAACAGAAGACCTCAAGAAGAATTCAAGCCCAGTCGAGGAATCAGACAGGGAGATCCTTTATCACCTTACCTATTCCTG
GTGTGTGCGGAAGGCTTCTCAGGGCTCCTAAAAAGGGAAGTTTCAAATAACAACCTTGCCGGCTTTAGAATTAATAATCATTGCCCACCCCTAACTCACCTATTCTTCGC
TGATGATAGTCTTATTTTTTGCAGGTCAAGCAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGGGAGAGAACAACAACGAGGCCCCAACGGTCAGTGCTACAGAGAAGAATGAAGAAAGCACACCAGAGAGGAGGAATCGGTCGCCAGCGACACTTCAGAGCAA
AGCAGAGGCGAGGGATGAGAAGGACAGTATACAATTAATGCATCAGCTAGTGGTGTCAGAGGCTACAGGAAAAAGAGTCATTAATGGGGACAAAAAAGAAACATCGGAAA
CAGAGAACATGGATGAGCAAGTGGTAGCTATGGGAAAGAATGATCAGAGAAGCAACCCTATTGTAGAAGAGACAAAAGAGAAGTGGGGCCCAGGGGAAGAAATAAAGAAA
GAGGATCTAGATGTTTCGATCCAGTATTTTTCAGAGGGGCATATTGATGCGATGATTAGTAGGAAAAACGATGGGTGGAGATTTTCGGGCTTCTACGGAAACCCGGAAAC
AGAGAAAAGGATTCATTCGTGGAATTTGTTGGAGAGGCTTTTTGAGAGTGTGGATGAAGGTACCCTTCGGATCATTGGAGGGGACTTCAATGAGATTGTCTCGGATGACG
AGAAAAGCGGGGGGGCCAAAAGAAATCCAATCCAAATGGATTTGTTCAGAAACAATATAGATTACTGCAAGCTCATTGATGCGGGCTACAGAGGCACCACCTGGGCCGAA
ACCTCAAAGGCTGAAGAAGAGCTCGAGATTCTGCTAGAGGAAGAAGAAGATTTTTGGAAGAGTCGGTCTAGAGAAGTGTGGCTTGAAAACGGGGACAAAAATACCAAATG
GTTCCACGTGAAAGCGTCCCAAAGGAAAAGAAGAAATCATATTGAAGGCATAAGGTCGAAGGATGACCTTTGGGAGGAGGATGAGCTAAAGATTGGGAAAATAGCCACGG
ATTACTTTCAAAACCTTTTCCAGTCCTCTAGGCCCGAAGATAGTGTGATAAAAGATATGCTTGAGTGTATCACCCCTAAAGTTTCGGATCACCAAAGGAGGGAGCTGGAT
AGACCTTACACCAGAGAAGAGATTGAAGCCACAGTGAGAAGTTTTCATCCAAGTAAAGCACCAGGGAAAGATGGGATCCACGCCTCATTCTATCAAGGGTACTGGAACAT
TGTGGGGGAGGAAACTGTGAATGTTTGCTTGAAAATCCTAAACGATGATGCGGATATAAAACCTCTAAACAAGACTATTATCTCGCTAATCCCAAAGGTATCCAGCCCCC
AAAAGATGGCAGATTTTTGCCCAATCAGTCTCTGCAATGTGGGATATAAAATTATTGCCAAAACGATTTCTAATAGGTTCAAAAGAGTGCTAGACTCGGTCATATCCCCT
ACGCAAGCGGCCTTTATTCCGGGAAGACTCATATCGGACAATGTGCTTGTGGGCTTTGAATGTATCCACGCGATAAACTGGAAGAAAAAAGGAAAAAGTGGTCAGGTTGC
TATCAAGCTCGATATGTCCAAGGCATACGATCGGGTTGAATGGATTTTCCTAAAGGAGATGCTCCTAAAGCTGGGATTTAGTAACAAATGGACTCATAACATCATGAAAT
GTGTGGAAACGGTGTCATTCTCGGTGCTAATCAACAGAAGACCTCAAGAAGAATTCAAGCCCAGTCGAGGAATCAGACAGGGAGATCCTTTATCACCTTACCTATTCCTG
GTGTGTGCGGAAGGCTTCTCAGGGCTCCTAAAAAGGGAAGTTTCAAATAACAACCTTGCCGGCTTTAGAATTAATAATCATTGCCCACCCCTAACTCACCTATTCTTCGC
TGATGATAGTCTTATTTTTTGCAGGTCAAGCAAATAG
Protein sequenceShow/hide protein sequence
MASGENNNEAPTVSATEKNEESTPERRNRSPATLQSKAEARDEKDSIQLMHQLVVSEATGKRVINGDKKETSETENMDEQVVAMGKNDQRSNPIVEETKEKWGPGEEIKK
EDLDVSIQYFSEGHIDAMISRKNDGWRFSGFYGNPETEKRIHSWNLLERLFESVDEGTLRIIGGDFNEIVSDDEKSGGAKRNPIQMDLFRNNIDYCKLIDAGYRGTTWAE
TSKAEEELEILLEEEEDFWKSRSREVWLENGDKNTKWFHVKASQRKRRNHIEGIRSKDDLWEEDELKIGKIATDYFQNLFQSSRPEDSVIKDMLECITPKVSDHQRRELD
RPYTREEIEATVRSFHPSKAPGKDGIHASFYQGYWNIVGEETVNVCLKILNDDADIKPLNKTIISLIPKVSSPQKMADFCPISLCNVGYKIIAKTISNRFKRVLDSVISP
TQAAFIPGRLISDNVLVGFECIHAINWKKKGKSGQVAIKLDMSKAYDRVEWIFLKEMLLKLGFSNKWTHNIMKCVETVSFSVLINRRPQEEFKPSRGIRQGDPLSPYLFL
VCAEGFSGLLKREVSNNNLAGFRINNHCPPLTHLFFADDSLIFCRSSK