; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy05g014890 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy05g014890
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr05:18383281..18385618
RNA-Seq ExpressionLcy05g014890
SyntenyLcy05g014890
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]6.2e-14537.03Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        ML LG  +  V  +M C++T ++  +  G   G I P  G+RQG P+SPYLFL C EG S +L   E    L GV++AR  P+++HL FADD +LF KA 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
         ++   +    +TY  +TGQ INY KS + LSPN        I  +L V +V  HE YLGLP   G    +  +  KD++W  I  WK    S  GKE+L
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        IK VLQAIPTY+MSCF++PK L KE N +MARFWW   + +R +H   W+ +C SK+ GGLGFRDLE FN+ALLAKQ WR++  P SL+AR+ + RY  +
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
          F++A+  TN S++W+SL WG+ LL  G RWRVG G SI++  D W+P     K++    +    RV  L  S G WN  +++ +F + +V+AIL IP 
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES
          L   D L+WHYE++G YSV+SGY+LA   +++ S   S       K+WK +WA +IP+K+K F+WR  +D LP    L  R +     C +C R  ES
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES

Query:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS
          HAVW C   K  WR+S +        + S  +L  W   +++    E+  F  +CW +WNRRN  +F         EG      + LS      +  S
Subjt:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS

Query:  EGVNTDIT--------------------VNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIR
        +  N   T                    V    ++  +G ++RN  GE M   ++ +         E MA  EGL  A++ GF+   +E D+   +  I 
Subjt:  EGVNTDIT--------------------VNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIR

Query:  SESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD
        S        G L+ ++  L    R     W  R  N++AH +A+ A       TW+EE PS +  V  A++ +++
Subjt:  SESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]8.1e-14536.31Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        ML LG  +  V  +M C++T ++  +  G   G I P  G+RQG P+SPYLFL C EG S +L   E    L GV++AR  P+++HL FADD +LF KA 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
            R +    +TY  ++GQ INY KS   LSPN        I  +L V +V  HE+YLGLP   G    +  +  KD++W  I  WK    S  GKE+L
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        +K VLQAIPTY+MSCF++PK L KE N +MARFWW   + +R +H   W+ +C SK+ GGLGFRDLE FN+ALLAKQ WR++  P SL+AR+ + RY  +
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
          F++A+  TN S++W+SL WG+ LL  G RWRVG+G SI++  D W+P     K++    +     V  L  S G WN  +++ +F + +V+A L IP 
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES
          L   D L+WHYE++G YSV+SGY+LAC  +++ S   S       K+WK +WA +IP+K+K F+WR  +D LP    L  R +     C  C R  ES
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES

Query:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS
          HAVW C   K  WR+S +        + S  +L  W   +++    E+  F  +CW +WNRRN  +F         EG     T+ L       +  S
Subjt:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS

Query:  EG------------------------------VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVET
                                        +N D  V    ++  +G ++RN  GE M   ++ ++        E MA  EGL  A++ GF+   +E 
Subjt:  EG------------------------------VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVET

Query:  DSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD
        D+   +  I S        G+L+ ++  L    R     W  RS N++AH +A+ A       TW+EE P  +  V  A++ +++
Subjt:  DSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD

XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]4.5e-14336.76Show/hide
Query:  LGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKADLRE
        LG   + V  IM CVT+V+Y F +NG+  G++ PS G+RQGDP+SPYLFL CAEG S +L   E+   + G+++AR+ P+ISHLFFADD LLF KA  R 
Subjt:  LGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKADLRE

Query:  ARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLPV--GLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVLIKV
           + N    YS  +GQ IN+ KS ++ SPN  +D+R +    L +    + E YLGLP+  G       +  K+++W+R+  W+   FS GGKE+L+K 
Subjt:  ARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLPV--GLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVLIKV

Query:  VLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSF
        V+QA+PTY MSCFK+P+   +E   ++AR+WW S   +R++H  +W K+   K  GGLGFR    +N+ALLAKQ WR++  P SLL++VL+ +YF + SF
Subjt:  VLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSF

Query:  MDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPRPRL
        +D+K     S  W+S++WG++LL  G R R+G+G+S R  +D W+ R PS   I + G   + +V   +   G WN E+IR  F   D+  IL IP  R 
Subjt:  MDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPRPRL

Query:  RHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVW
         H D+  WHY   G Y+V+SGYKL   + ++ S S+ +   KWWK+ WA +IP K+ IF WR Y+++LP+   L  R + +   C  C   ++S  HAV+
Subjt:  RHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVW

Query:  ECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGW------RWVTEYLSHFRAFCR-----
         C   +  W    +    G     S  D+L +  + +  D  +  ++  W +W  RNK++  +      +   W           Y+S  RA  R     
Subjt:  ECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGW------RWVTEYLSHFRAFCR-----

Query:  -RRSEGVN------TDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRSESIDFSELG
          + E V+       D  ++K+     +GA I     +   TL K +E +  V   EA+A+  GL  A   G++  +V TDS  +V  + SE+   +ELG
Subjt:  -RRSEGVN------TDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRSESIDFSELG

Query:  MLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEE
        +L+AD + L         S   R+ N  AH +A+ AL++  E +WLE+
Subjt:  MLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEE

XP_023880912.1 uncharacterized protein LOC111993298 [Quercus suber]1.1e-14436.52Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        M  LG   + V L++ C++TVSY  ++NGE +G I+PS G+RQGDP+SPYLFL C+EGL+RML     + R+ G  L +  P ISHLFFADD LLF +A 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
        L + + + + L  Y   +GQ +N  K+ I+ S  V ++ +  + + LQV  V  +E+YLGLP  VG     +L   K+R+WS++Q WK    S  G+EVL
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        +K V+QAIPT+ M CFKLP  L +E  +++ +F+W  +  +R++H   W+ +C  K  GGLGF+DLE FN A+LAKQ WRL+++ +SL  RV K +YF  
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPK-ARVDSLLN-SEGLWNEEVIRAMFREVDVNAILSI
        GS  +A + T GSY W+S++  R ++ +G RWR+GDGKSI I E+NW+P   S +++       K A+V +L+N +   WN +++   F   +V  I +I
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPK-ARVDSLLN-SEGLWNEEVIRAMFREVDVNAILSI

Query:  PRPRLRHPDNLMWHYEKDGRYSVRSGYKLAC--AIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEE
        P       D L+W    DG Y V+SGY++ C  A    AS S+S     +WK +W   +P+K+K F+WR+  ++LP++ NL++R +     C+ C++ +E
Subjt:  PRPRLRHPDNLMWHYEKDGRYSVRSGYKLAC--AIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEE

Query:  STFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDL--DRREEGGWRWVTEYLSHFRAF-CRR
        + FHA+W C  +K  W  +P F W           L          +  E F V+ W +W  RNK+   E  +  D+  E    ++ ++ S F+    ++
Subjt:  STFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDL--DRREEGGWRWVTEYLSHFRAF-CRR

Query:  RSEG------------VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRSESIDF
        R E              N D  V      + IG I+R+ KG+V+  L + + ++  V++LEA+A R      VE G +  E E DS  V   ++  +   
Subjt:  RSEG------------VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRSESIDF

Query:  SELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAE
        S +G+++ D   + G LR  SFS  RR  N +AH +AK A+       W+E VP+ +  V +++
Subjt:  SELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAE

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]1.0e-14237.02Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        M  LG   + + LIM C+++VS+  ++NG  +G IKP  G+RQG PISPYLF+ CAE  S +L   E  R + G+   +    ISHL FADD L+F +A 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
        + + + +   L  YS+ +GQ  N+ KS ++LS NV+     +I N+  + +V  +E YLGLP  VG          K ++ ++I  W+H  FS GGKEVL
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        IK  +QAIP + MS FK+P  + ++   ++  FWW S   RR +H + W+K+  +K  GG+GFRD   FN+ALLAKQGWR+ + P SL+ARVL+ RYF +
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
         +F++AK  +N SY+W+S+LWGR ++  G+RWR+G+G+ + I + NWIP+  + K + K  +  +A V  L+N E  W+EE+I   F ++D + I  IP 
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFH
        PR    D L+WH+ K G+Y+V+SGY+ A  IR  A  S+SE  K  W  +W+  +P K++IFVWR   +LLPS  NL +R +  +  C  C  G E+ FH
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFH

Query:  AVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRR------
        A+ +C+  K  WR S F+     +  +    LL    +  +    + F VM W  WN RN+ LF       + E     V +  +   A+ R +      
Subjt:  AVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRR------

Query:  ---------------SEG---VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRS
                        EG   +NTD   N   NL+ +GA+IR+E G+V  T +K+ +F   V   EA A+  GL +A +A    V +E+DS +VV  + +
Subjt:  ---------------SEG---VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRS

Query:  ESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVP
             SE+  +V +I+ L       S  +  RS N +AH + K AL       W    P
Subjt:  ESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVP

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein3.0e-14537.03Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        ML LG  +  V  +M C++T ++  +  G   G I P  G+RQG P+SPYLFL C EG S +L   E    L GV++AR  P+++HL FADD +LF KA 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
         ++   +    +TY  +TGQ INY KS + LSPN        I  +L V +V  HE YLGLP   G    +  +  KD++W  I  WK    S  GKE+L
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        IK VLQAIPTY+MSCF++PK L KE N +MARFWW   + +R +H   W+ +C SK+ GGLGFRDLE FN+ALLAKQ WR++  P SL+AR+ + RY  +
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
          F++A+  TN S++W+SL WG+ LL  G RWRVG G SI++  D W+P     K++    +    RV  L  S G WN  +++ +F + +V+AIL IP 
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES
          L   D L+WHYE++G YSV+SGY+LA   +++ S   S       K+WK +WA +IP+K+K F+WR  +D LP    L  R +     C +C R  ES
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES

Query:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS
          HAVW C   K  WR+S +        + S  +L  W   +++    E+  F  +CW +WNRRN  +F         EG      + LS      +  S
Subjt:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS

Query:  EGVNTDIT--------------------VNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIR
        +  N   T                    V    ++  +G ++RN  GE M   ++ +         E MA  EGL  A++ GF+   +E D+   +  I 
Subjt:  EGVNTDIT--------------------VNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIR

Query:  SESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD
        S        G L+ ++  L    R     W  R  N++AH +A+ A       TW+EE PS +  V  A++ +++
Subjt:  SESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD

A0A5E4FZN9 PREDICTED: retrotransposon3.9e-14536.31Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        ML LG  +  V  +M C++T ++  +  G   G I P  G+RQG P+SPYLFL C EG S +L   E    L GV++AR  P+++HL FADD +LF KA 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
            R +    +TY  ++GQ INY KS   LSPN        I  +L V +V  HE+YLGLP   G    +  +  KD++W  I  WK    S  GKE+L
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        +K VLQAIPTY+MSCF++PK L KE N +MARFWW   + +R +H   W+ +C SK+ GGLGFRDLE FN+ALLAKQ WR++  P SL+AR+ + RY  +
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
          F++A+  TN S++W+SL WG+ LL  G RWRVG+G SI++  D W+P     K++    +     V  L  S G WN  +++ +F + +V+A L IP 
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES
          L   D L+WHYE++G YSV+SGY+LAC  +++ S   S       K+WK +WA +IP+K+K F+WR  +D LP    L  R +     C  C R  ES
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES

Query:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS
          HAVW C   K  WR+S +        + S  +L  W   +++    E+  F  +CW +WNRRN  +F         EG     T+ L       +  S
Subjt:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS

Query:  EG------------------------------VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVET
                                        +N D  V    ++  +G ++RN  GE M   ++ ++        E MA  EGL  A++ GF+   +E 
Subjt:  EG------------------------------VNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVET

Query:  DSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD
        D+   +  I S        G+L+ ++  L    R     W  RS N++AH +A+ A       TW+EE P  +  V  A++ +++
Subjt:  DSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD

A0A803PAX6 Uncharacterized protein3.3e-14434.97Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        M+ LG + + V  IM C+TTVS+  ++NGE  GRI+P+ GIRQGDP+SPYLFL CAEGLS ++   E    + G++  +    +SHLFFADD  +F  A+
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
          E + +   L  YS+++GQ IN+ KSE+ +   +K +    +A +L V LV  H +YLG+P  +G    E  +  + +I +++Q W+   FS  G+EVL
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        +K ++QAIPTY MSCF+LPK L+K+ ++MMARFWW S + + ++H   W K+C  K  GG+GF++LE FN++LLAKQGW+++ NP SL+AR+LK  YF N
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
         SF++AK    GS++W+S++ GR ++  G RWRV  GK IR+ ED W+PR  +  L     V     +D+L    G W  ++I+  F   D+  IL +  
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFH
         +    D+L+WH+  DG Y+VRSGY++A     +A  S+    +KWW  +W  + P K++ F+WR+    +P  + L+RRGM+I   C  C + EE+  H
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFH

Query:  AVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGWRWV---TEYLSH-----------
         +W C   K  W+  P++     S   +  D+L     ++A ++F  F++M W +WNRRNK     +     +E   +W     +Y+ H           
Subjt:  AVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGWRWV---TEYLSH-----------

Query:  ------------FRAFCRRRSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRI
                       FC      +N+D +V+ +     +G +IR  KGEV     + V  V  +++ EA+A++ G+++AV+       +++D  +V+  I
Subjt:  ------------FRAFCRRRSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRI

Query:  RSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAEL
        + +    ++ G L+ ++   A  +   + S   R+ N++AH +AK A+       W    PS   +  +A+L
Subjt:  RSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAEL

A0A803PV25 Uncharacterized protein3.0e-14536.66Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        M+ LG     V  IM C+T+V + F++NGE +GR+ P  G+RQGDP+SP+LFL CAE  S ++   E   RL GV   R    +SHLFFADD L+F  A 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
          E R     L  YS+ +GQ +N+ KSE+    +V   +R  +A  + V +V ++ +YLGLP  VG T  +   +  +++W++++ WK + FS  GKEVL
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        IK ++QAIPTYTMSCF+LPK+ +   +SM ARFWW S E   ++H   W  +C  K  GGLGFRDL LFN+ALLAKQ WR +  P SL ++VLK  Y+ N
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
           ++AK   + S+VW+SL+WG+ ++  G RWR+G+G S+R+++D W+PR  + K+  K  +     V  L    G W+EE +RA+F   D   IL +  
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGE-ESTF
              D ++WHY KDG YSVRSGY++A A+      SN+E   +WW+ +W  +IP KVK FVW++ +  +P+ S L  R + I+  C RC  G  E+ F
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGE-ESTF

Query:  HAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGWR--------WVTEYLSHF----
        HA+W CR     W+ S F               L      +A + FE F+V+ W +W  RN V            GG +        W +++L+ F    
Subjt:  HAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMCWWVWNRRNKVLFGESDLDRREEGGWR--------WVTEYLSHF----

Query:  --------RAFCR-----RRSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRI
                RA  R     R S  +N D  V     L+S+ +++R+ +G V    +++VE        E  A+ +G+   ++       VETD  + V  +
Subjt:  --------RAFCR-----RRSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRI

Query:  RSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVP
          +     ++  LV  I+ L   +R    S+  R  NQ AHV+A  AL       W+  VP
Subjt:  RSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVP

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)3.0e-14537.03Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        ML LG  +  V  +M C++T ++  +  G   G I P  G+RQG P+SPYLFL C EG S +L   E    L GV++AR  P+++HL FADD +LF KA 
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL
         ++   +    +TY  +TGQ INY KS + LSPN        I  +L V +V  HE YLGLP   G    +  +  KD++W  I  WK    S  GKE+L
Subjt:  LREARMMLNTLRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLP--VGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVL

Query:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN
        IK VLQAIPTY+MSCF++PK L KE N +MARFWW   + +R +H   W+ +C SK+ GGLGFRDLE FN+ALLAKQ WR++  P SL+AR+ + RY  +
Subjt:  IKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVN

Query:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR
          F++A+  TN S++W+SL WG+ LL  G RWRVG G SI++  D W+P     K++    +    RV  L  S G WN  +++ +F + +V+AIL IP 
Subjt:  GSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR

Query:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES
          L   D L+WHYE++G YSV+SGY+LA   +++ S   S       K+WK +WA +IP+K+K F+WR  +D LP    L  R +     C +C R  ES
Subjt:  PRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEF---KKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEES

Query:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS
          HAVW C   K  WR+S +        + S  +L  W   +++    E+  F  +CW +WNRRN  +F         EG      + LS      +  S
Subjt:  TFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE--FVVMCWWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRS

Query:  EGVNTDIT--------------------VNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIR
        +  N   T                    V    ++  +G ++RN  GE M   ++ +         E MA  EGL  A++ GF+   +E D+   +  I 
Subjt:  EGVNTDIT--------------------VNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIR

Query:  SESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD
        S        G L+ ++  L    R     W  R  N++AH +A+ A       TW+EE PS +  V  A++ +++
Subjt:  SESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVD

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.8e-4526.34Show/hide
Query:  DRIWSRIQKWKHTCFSVGGKEVLIKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQ
        +R+ SR+  W+    S  G+  L K VL ++P ++MS   LP+ ++   + +   F W S   +++ HL  W KVC  K  GGLG R  +  N+AL++K 
Subjt:  DRIWSRIQKWKHTCFSVGGKEVLIKVVLQAIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQ

Query:  GWRLVENPASLLARVLKGRYFVNGSFMDAK---SRTNGSYVWKSLLWG-RSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLN
        GWRL++   SL   VL+ +Y V G   D++    + + S  W+S+  G R +++ G  W  GDG+ IR   D W+   P L+L   D        D+++ 
Subjt:  GWRLVENPASLLARVLKGRYFVNGSFMDAK---SRTNGSYVWKSLLWG-RSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLN

Query:  SEGLW--------------NEEVIRAMFREVDVNAILSIPRPRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKV
        ++ LW                   R   R V ++ +           D L W + +DG++SVRS Y++     +E    N   F   +  +W  ++P +V
Subjt:  SEGLW--------------NEEVIRAMFREVDVNAILSIPRPRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEEFKKWWKFMWACQIPSKV

Query:  KIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVWECR-------RVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE-----
        K F+W +    + +E    RR +     C  C  G ES  H + +C        RV  Q R   FF             L  W +  +      E     
Subjt:  KIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVWECR-------RVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEE-----

Query:  --FVVMCWWVWNRRNKVLFGESDLDR-REEGGWRWVTE-YLSH----FRAFCRRRSE-------------GVNTDITVNKSLNLSSIGAIIRNEKGEVMF
          F V+ WW W  R   +FGE+   R R +    W  E Y +H         + R E              VNTD     +  L+S G ++R+  G    
Subjt:  --FVVMCWWVWNRRNKVLFGESDLDR-REEGGWRWVTE-YLSH----FRAFCRRRSE-------------GVNTDITVNKSLNLSSIGAIIRNEKGEVMF

Query:  TLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGL
             +   S     E   V  GL  A E    +VE+E DS  +VG +++   D   L  LV    G              R  N++A  +A  A  + L
Subjt:  TLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGL

Query:  EGTWLEEVPSVVESV
             + VP  + S+
Subjt:  EGTWLEEVPSVVESV

P11369 LINE-1 retrotransposable element ORF2 protein3.9e-1725.29Show/hide
Query:  LNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKADLREARMMLNTLRTYSVMTGQAINYGK
        +NGE+   I    G RQG P+SPYLF    E L+R    +   + + G+++ +    IS L  ADD +++        R +LN + ++  + G  IN  K
Subjt:  LNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKADLREARMMLNTLRTYSVMTGQAINYGK

Query:  SEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLPVGLTGPEA----LKQTKDRIWSRIQKWKHTCFSVGGKEVLIKVVL--QAIPTYTMSCFKLPKRL
        S  +L    K      I      ++V ++ +YLG+ +     +      K  K  I   +++WK    S  G+  ++K+ +  +AI  +     K+P + 
Subjt:  SEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLPVGLTGPEA----LKQTKDRIWSRIQKWKHTCFSVGGKEVLIKVVL--QAIPTYTMSCFKLPKRL

Query:  VKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGW
          E    + +F W +++ R    L   K+       GG+   DL+L+ +A++ K  W
Subjt:  VKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGW

P92555 Uncharacterized mitochondrial protein AtMg012503.1e-1448.53Show/hide
Query:  FMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADD
        F++NG  +G + PS G+RQGDP+SPYLF+ C E LS +    ++  RL G+R++ + P I+HL FADD
Subjt:  FMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADD

P93295 Uncharacterized mitochondrial protein AtMg003103.0e-3346.85Show/hide
Query:  AIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMD
        A+P Y MSCF+L K L K+  S M  FWW S E +R++   +W+K+C SK   GGLGFRDL  FN+ALLAKQ +R++  P +LL+R+L+ RYF + S M+
Subjt:  AIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMD

Query:  AKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWI
            T  SY W+S++ GR LL+ G    +GDG   ++  D WI
Subjt:  AKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWI

Q05118 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.9e-0530.08Show/hide
Query:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD
        M A G+D  +   IM  +T      ++ G    +I    G++QGDP+SP LF      L  +++ L D++   G  +  +C  I+ L FADD LL    D
Subjt:  MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKAD

Query:  LREARMMLNTLRTYSVMTGQAIN
        + +    L T   Y    G  +N
Subjt:  LREARMMLNTLRTYSVMTGQAIN

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein3.4e-3228.09Show/hide
Query:  LKGRYFVNGSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEG---LWNEEVIRAMFRE
        +K RYF + S +DAK R   SY W SLL G +LL  G R  +GDG++IRI  DN +   P  + ++ +  + +  +++L   +G    W++  I     +
Subjt:  LKGRYFVNGSFMDAKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEG---LWNEEVIRAMFRE

Query:  VDVNAILSIPRPRLRHPDNLMWHYEKDGRYSVRSGYKL-----ACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDI
         D   I  I   + + PD ++W+Y   G Y+VRSGY L     +  I        S + K     +W   I  K+K F+WR     L +   L  RGM I
Subjt:  VDVNAILSIPRPRLRHPDNLMWHYEKDGRYSVRSGYKL-----ACAIREEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDI

Query:  QRGCARCMRGEESTFHAVWECRRVKHQWRDSPFFPWAGPSTIRSP----------TDLLWWCWKEMAMDKFEEF--VVMCWWVWNRRNKVLF-------G
           C RC R  ES  HA++ C      WR S        S IR+           +++L +  ++  M  F +   V + W +W  RN V+F        
Subjt:  QRGCARCMRGEESTFHAVWECRRVKHQWRDSPFFPWAGPSTIRSP----------TDLLWWCWKEMAMDKFEEF--VVMCWWVWNRRNKVLF-------G

Query:  ESDLDRREEGGWRWVTEYLSH------FRAFCRRRSEGVNTDITVNK-----SLNLSSI----GAIIRNEKG-EVMFTLMKLVEFVSDVDILEAMAVREG
        ++ L  + E    W+    SH       R     + E  N   T  K       ++  +    G IIRN  G  + +  MKL    + ++  E  A+   
Subjt:  ESDLDRREEGGWRWVTEYLSH------FRAFCRRRSEGVNTDITVNK-----SLNLSSI----GAIIRNEKG-EVMFTLMKLVEFVSDVDILEAMAVREG

Query:  LAIAVEAGFSQVEVETDSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAK
        L      G++QV +E D   ++  I   S   S L   + DI   A       F + RR  N++AHV+AK
Subjt:  LAIAVEAGFSQVEVETDSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAK

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.4e-1723.3Show/hide
Query:  VKDDMRMSIANLLQVTLVGSHERYLGLPVGLTGPEALKQ---TKDRIWSRIQKWKHTCFSVGGKEVLIKVVLQAIPTYTMSCFKLPKRLVKECNSMMARF
        VKD+ +  I +           RYLGLP+ LT            ++I  RI KW     S  G+  LI  V+ ++  + MS F+LP   +KE +S+ + F
Subjt:  VKDDMRMSIANLLQVTLVGSHERYLGLPVGLTGPEALKQ---TKDRIWSRIQKWKHTCFSVGGKEVLIKVVLQAIPTYTMSCFKLPKRLVKECNSMMARF

Query:  WWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMDAKSRTN-GSYVWKSLLWGRSLLALGARW
         W   E   +    +W  VC  K  GGLG R L+  NK                             GSF      T  GS++WK +L  R+L +   + 
Subjt:  WWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMDAKSRTN-GSYVWKSLLWGRSLLALGARW

Query:  RVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR--PRLRH------PDNLMWHYEKDGRYSVRSG
         + +G +     DNW   S   +LI   G   +  +D  +       E V+    R    + +L I      +RH       D + W    D        
Subjt:  RVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPR--PRLRH------PDNLMWHYEKDGRYSVRSG

Query:  YKLACAIREEASYSNSEEFK-KWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVWEC
        +K     +E  + +   + K  W+K +W      K  +  W    + L +   +          C  C    E+  H  + C
Subjt:  YKLACAIREEASYSNSEEFK-KWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVWEC

AT4G29090.1 Ribonuclease H-like superfamily protein3.6e-5828.75Show/hide
Query:  AIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMDA
        A+PTYTM+CF LPK + K+  S++A FWW +++  + +H  +W  +   K  GG+GF+D+E FN ALL KQ WR++  P SL+A+V K RYF     ++A
Subjt:  AIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMDA

Query:  KSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPK--ARVDSLLNSEGL-------WNEEVIRAMFREVDVNAILS
           +  S+VWKS+   + +L  GAR  VG+G+ I I    W+   P+   +    V P+  A V S+L    L       W ++VI  +F EV+   I  
Subjt:  KSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPK--ARVDSLLNSEGL-------WNEEVIRAMFREVDVNAILS

Query:  IPRPRLRHPDNLMWHYEKDGRYSVRSGYKLACAI---REEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRG
        +     R  D+  W Y   G Y+V+SGY +   I   R      +       ++ +W  Q   K++ F+W+   + LP    L  R +  +  C RC   
Subjt:  IPRPRLRHPDNLMWHYEKDGRYSVRSGYKLACAI---REEASYSNSEEFKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRG

Query:  EESTFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWK----EMAMDKFEEFVV-MCWWVWNRRNKVL-----FGESDLDRREEGG---WRWVT
        +E+  H +++C   +  W  S      G     S    L+W +         +K  + V  + W +W  RN+++     F   ++ RR E     WR  T
Subjt:  EESTFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWK----EMAMDKFEEFVV-MCWWVWNRRNKVL-----FGESDLDRREEGG---WRWVT

Query:  EYLS------HFRAFCRR------RSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAK
        E  S        R+ C R      +    NTD T N+      IG ++RNEKGEV +   + +  +  V   E  A+R  +       ++ V  E+DS  
Subjt:  EYLS------HFRAFCRR------RSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVEVETDSAK

Query:  VVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSAL
        ++  + ++ I +  L   + D++ L        F +  R  N +A  VA+ +L
Subjt:  VVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSAL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-3446.85Show/hide
Query:  AIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMD
        A+P Y MSCF+L K L K+  S M  FWW S E +R++   +W+K+C SK   GGLGFRDL  FN+ALLAKQ +R++  P +LL+R+L+ RYF + S M+
Subjt:  AIPTYTMSCFKLPKRLVKECNSMMARFWWESEEGRRRVHLASWKKVCVSKY-HGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMD

Query:  AKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWI
            T  SY W+S++ GR LL+ G    +GDG   ++  D WI
Subjt:  AKSRTNGSYVWKSLLWGRSLLALGARWRVGDGKSIRIVEDNWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.2e-1548.53Show/hide
Query:  FMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADD
        F++NG  +G + PS G+RQGDP+SPYLF+ C E LS +    ++  RL G+R++ + P I+HL FADD
Subjt:  FMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGCTTTGGGTATGGACAGTAAGGTGGTTGGCTTGATTATGGGATGTGTGACGACAGTCTCGTACTTTTTTATGCTGAATGGAGAGAGGAGAGGGAGAATTAAACC
ATCTATGGGGATTCGCCAAGGTGACCCTATTTCTCCTTATTTGTTTCTTTTCTGCGCTGAAGGTTTATCTAGGATGCTCTCATGGCTAGAGGATGATAGGAGGTTGACGG
GGGTCCGGTTAGCCCGGAGTTGTCCTACCATTTCTCACTTGTTTTTTGCTGATGATTGTCTATTGTTTTTTAAGGCAGATTTGAGGGAGGCAAGAATGATGCTGAATACG
TTGAGAACTTACTCAGTTATGACTGGGCAGGCGATCAACTATGGTAAGTCTGAGATTTATCTAAGTCCAAATGTTAAAGATGATATGAGAATGAGTATTGCTAACCTTCT
TCAGGTGACTCTGGTGGGATCTCATGAACGTTATTTAGGCCTGCCAGTGGGATTGACTGGTCCTGAAGCCCTGAAGCAAACTAAGGACCGAATTTGGTCCCGGATCCAAA
AGTGGAAGCATACATGTTTTTCAGTAGGGGGAAAAGAGGTCTTGATTAAGGTAGTTTTACAGGCTATCCCTACTTACACGATGTCATGTTTTAAGCTCCCGAAGAGGTTG
GTTAAGGAGTGCAACAGCATGATGGCGAGATTTTGGTGGGAATCAGAGGAGGGAAGGAGGAGGGTTCATTTGGCATCTTGGAAGAAGGTTTGTGTTTCGAAATATCATGG
AGGGTTGGGCTTTCGGGATCTCGAGTTGTTTAATAAAGCACTATTGGCAAAGCAAGGGTGGAGGTTGGTCGAGAATCCGGCCTCCCTATTGGCTCGGGTCCTTAAAGGAA
GATACTTTGTGAATGGTTCCTTCATGGATGCAAAGTCAAGGACTAATGGATCATATGTGTGGAAAAGTTTGCTATGGGGGAGGAGTTTGTTGGCACTAGGGGCTAGATGG
AGAGTGGGGGATGGAAAGTCTATCAGGATTGTGGAGGATAACTGGATTCCTAGGTCTCCTTCTTTAAAACTAATTCACAAGGATGGTGTTCATCCGAAAGCAAGAGTGGA
TAGTTTGTTGAATTCGGAAGGCTTGTGGAATGAGGAGGTTATTCGGGCAATGTTTAGAGAGGTTGATGTTAATGCTATTTTGAGTATTCCGAGACCTAGGTTGAGGCATC
CTGATAATCTAATGTGGCACTATGAAAAAGATGGTAGGTATTCAGTGAGAAGTGGATATAAGCTAGCTTGTGCTATTAGAGAGGAGGCTAGTTATTCTAATTCTGAGGAG
TTTAAAAAATGGTGGAAGTTTATGTGGGCTTGCCAAATTCCAAGCAAAGTTAAGATTTTTGTGTGGAGGCTGTACTATGATCTTCTCCCTTCTGAAAGCAATCTGAGAAG
AAGAGGTATGGATATTCAGAGAGGGTGCGCTCGATGTATGAGAGGTGAGGAGTCGACGTTTCATGCGGTGTGGGAGTGTAGAAGGGTGAAGCATCAATGGAGAGATTCTC
CCTTTTTCCCTTGGGCTGGGCCATCGACAATAAGGAGTCCGACAGATTTGTTATGGTGGTGCTGGAAGGAGATGGCGATGGATAAATTCGAGGAATTTGTGGTGATGTGC
TGGTGGGTGTGGAATAGGAGGAATAAGGTGTTGTTTGGAGAGAGTGATTTGGATAGAAGGGAGGAGGGTGGGTGGAGATGGGTGACTGAGTATTTGTCTCACTTTAGAGC
CTTTTGTCGAAGGAGAAGTGAAGGGGTAAATACTGATATAACAGTCAATAAAAGCCTCAATCTCAGCAGTATTGGCGCAATTATCAGAAATGAGAAGGGGGAAGTGATGT
TTACCTTGATGAAACTAGTTGAATTTGTGTCAGATGTTGACATTCTGGAGGCGATGGCCGTTCGTGAAGGGCTGGCAATAGCTGTTGAAGCCGGCTTCTCGCAGGTGGAG
GTGGAGACGGATTCGGCCAAAGTTGTCGGAAGGATTAGATCTGAATCCATCGATTTCTCTGAGCTGGGGATGTTGGTAGCAGATATCAAAGGCCTCGCCGGTGGGTTGAG
ATTTTGCTCTTTTAGTTGGTGTCGCCGGTCAAGAAATCAGATGGCGCACGTGGTGGCGAAATCAGCGCTGAGGATGGGTCTCGAGGGGACTTGGTTGGAGGAGGTGCCGT
CGGTAGTTGAGAGCGTTTATGTTGCGGAGCTGAGGACTGTCGATTTGAGTCTTAGGGATTTCTGTATTGATGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTGGCTTTGGGTATGGACAGTAAGGTGGTTGGCTTGATTATGGGATGTGTGACGACAGTCTCGTACTTTTTTATGCTGAATGGAGAGAGGAGAGGGAGAATTAAACC
ATCTATGGGGATTCGCCAAGGTGACCCTATTTCTCCTTATTTGTTTCTTTTCTGCGCTGAAGGTTTATCTAGGATGCTCTCATGGCTAGAGGATGATAGGAGGTTGACGG
GGGTCCGGTTAGCCCGGAGTTGTCCTACCATTTCTCACTTGTTTTTTGCTGATGATTGTCTATTGTTTTTTAAGGCAGATTTGAGGGAGGCAAGAATGATGCTGAATACG
TTGAGAACTTACTCAGTTATGACTGGGCAGGCGATCAACTATGGTAAGTCTGAGATTTATCTAAGTCCAAATGTTAAAGATGATATGAGAATGAGTATTGCTAACCTTCT
TCAGGTGACTCTGGTGGGATCTCATGAACGTTATTTAGGCCTGCCAGTGGGATTGACTGGTCCTGAAGCCCTGAAGCAAACTAAGGACCGAATTTGGTCCCGGATCCAAA
AGTGGAAGCATACATGTTTTTCAGTAGGGGGAAAAGAGGTCTTGATTAAGGTAGTTTTACAGGCTATCCCTACTTACACGATGTCATGTTTTAAGCTCCCGAAGAGGTTG
GTTAAGGAGTGCAACAGCATGATGGCGAGATTTTGGTGGGAATCAGAGGAGGGAAGGAGGAGGGTTCATTTGGCATCTTGGAAGAAGGTTTGTGTTTCGAAATATCATGG
AGGGTTGGGCTTTCGGGATCTCGAGTTGTTTAATAAAGCACTATTGGCAAAGCAAGGGTGGAGGTTGGTCGAGAATCCGGCCTCCCTATTGGCTCGGGTCCTTAAAGGAA
GATACTTTGTGAATGGTTCCTTCATGGATGCAAAGTCAAGGACTAATGGATCATATGTGTGGAAAAGTTTGCTATGGGGGAGGAGTTTGTTGGCACTAGGGGCTAGATGG
AGAGTGGGGGATGGAAAGTCTATCAGGATTGTGGAGGATAACTGGATTCCTAGGTCTCCTTCTTTAAAACTAATTCACAAGGATGGTGTTCATCCGAAAGCAAGAGTGGA
TAGTTTGTTGAATTCGGAAGGCTTGTGGAATGAGGAGGTTATTCGGGCAATGTTTAGAGAGGTTGATGTTAATGCTATTTTGAGTATTCCGAGACCTAGGTTGAGGCATC
CTGATAATCTAATGTGGCACTATGAAAAAGATGGTAGGTATTCAGTGAGAAGTGGATATAAGCTAGCTTGTGCTATTAGAGAGGAGGCTAGTTATTCTAATTCTGAGGAG
TTTAAAAAATGGTGGAAGTTTATGTGGGCTTGCCAAATTCCAAGCAAAGTTAAGATTTTTGTGTGGAGGCTGTACTATGATCTTCTCCCTTCTGAAAGCAATCTGAGAAG
AAGAGGTATGGATATTCAGAGAGGGTGCGCTCGATGTATGAGAGGTGAGGAGTCGACGTTTCATGCGGTGTGGGAGTGTAGAAGGGTGAAGCATCAATGGAGAGATTCTC
CCTTTTTCCCTTGGGCTGGGCCATCGACAATAAGGAGTCCGACAGATTTGTTATGGTGGTGCTGGAAGGAGATGGCGATGGATAAATTCGAGGAATTTGTGGTGATGTGC
TGGTGGGTGTGGAATAGGAGGAATAAGGTGTTGTTTGGAGAGAGTGATTTGGATAGAAGGGAGGAGGGTGGGTGGAGATGGGTGACTGAGTATTTGTCTCACTTTAGAGC
CTTTTGTCGAAGGAGAAGTGAAGGGGTAAATACTGATATAACAGTCAATAAAAGCCTCAATCTCAGCAGTATTGGCGCAATTATCAGAAATGAGAAGGGGGAAGTGATGT
TTACCTTGATGAAACTAGTTGAATTTGTGTCAGATGTTGACATTCTGGAGGCGATGGCCGTTCGTGAAGGGCTGGCAATAGCTGTTGAAGCCGGCTTCTCGCAGGTGGAG
GTGGAGACGGATTCGGCCAAAGTTGTCGGAAGGATTAGATCTGAATCCATCGATTTCTCTGAGCTGGGGATGTTGGTAGCAGATATCAAAGGCCTCGCCGGTGGGTTGAG
ATTTTGCTCTTTTAGTTGGTGTCGCCGGTCAAGAAATCAGATGGCGCACGTGGTGGCGAAATCAGCGCTGAGGATGGGTCTCGAGGGGACTTGGTTGGAGGAGGTGCCGT
CGGTAGTTGAGAGCGTTTATGTTGCGGAGCTGAGGACTGTCGATTTGAGTCTTAGGGATTTCTGTATTGATGTCTAG
Protein sequenceShow/hide protein sequence
MLALGMDSKVVGLIMGCVTTVSYFFMLNGERRGRIKPSMGIRQGDPISPYLFLFCAEGLSRMLSWLEDDRRLTGVRLARSCPTISHLFFADDCLLFFKADLREARMMLNT
LRTYSVMTGQAINYGKSEIYLSPNVKDDMRMSIANLLQVTLVGSHERYLGLPVGLTGPEALKQTKDRIWSRIQKWKHTCFSVGGKEVLIKVVLQAIPTYTMSCFKLPKRL
VKECNSMMARFWWESEEGRRRVHLASWKKVCVSKYHGGLGFRDLELFNKALLAKQGWRLVENPASLLARVLKGRYFVNGSFMDAKSRTNGSYVWKSLLWGRSLLALGARW
RVGDGKSIRIVEDNWIPRSPSLKLIHKDGVHPKARVDSLLNSEGLWNEEVIRAMFREVDVNAILSIPRPRLRHPDNLMWHYEKDGRYSVRSGYKLACAIREEASYSNSEE
FKKWWKFMWACQIPSKVKIFVWRLYYDLLPSESNLRRRGMDIQRGCARCMRGEESTFHAVWECRRVKHQWRDSPFFPWAGPSTIRSPTDLLWWCWKEMAMDKFEEFVVMC
WWVWNRRNKVLFGESDLDRREEGGWRWVTEYLSHFRAFCRRRSEGVNTDITVNKSLNLSSIGAIIRNEKGEVMFTLMKLVEFVSDVDILEAMAVREGLAIAVEAGFSQVE
VETDSAKVVGRIRSESIDFSELGMLVADIKGLAGGLRFCSFSWCRRSRNQMAHVVAKSALRMGLEGTWLEEVPSVVESVYVAELRTVDLSLRDFCIDV