; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030041 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030041
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:44021601..44023975
RNA-Seq ExpressionLag0030041
SyntenyLag0030041
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4277969.1 unnamed protein product [Prunus armeniaca]2.3e-15840.2Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FL Q++  MGF   ++ LIM CV +VS+S  + G   G +IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AE+ S + G+ IA S+P I
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEAL---------------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HLFFADDSLLF  A   EAL                +N  KS + FSP++    Q  I Q+L V   PCH +YLGLP+ + +++    + VKDRVW ++
Subjt:  SHLFFADDSLLFFRARAEEAL---------------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GW+GK  S AGKEVL+KS+ QAIP Y+M+ FRLP  L +EI   +AKFWW    D   IHW +W  +C+ K  GG+GFR +  FNQ+LL KQ WR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + K RYFPHSDF     G+ PSF W+S+LWGRDLL  G RWRIG+G  + IYG  W+P +    I S PTLP++SRV DLF+  G WD  K+ 
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQ------------PSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDR
          F   E E+IL IPL+ G   D  IW+F KNG +SVKSGY      A++Y+            PSSS+  S    W  +WKL++P K    LWR+  D 
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQ------------PSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDR

Query:  LPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLS---DIIWAAKENFVLLDFELMVVFW-WAVWNLRNTLTWGG
        LP+K  L +R +    +C  C +A E   H   GC V   +W    F + +   +   +    D +W    + +  D + +  F  W +WN RN + +G 
Subjt:  LPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLS---DIIWAAKENFVLLDFELMVVFW-WAVWNLRNTLTWGG

Query:  SSDDRDLWLW-ASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEG
              + +  A +Y A ++  +           SL   V   +W+PP    FKLNVD +   ++G    G ++RDS   ++ +     P    V   E 
Subjt:  SSDDRDLWLW-ASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEG

Query:  WAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLD---CVWIEEWPVEIS
        +A+  GI+ A  M  S     +ESDSL+ V + + E   ++  G L+D +RRLL       V   PRQ NK AH +A   FSL D    +W++  P+ + 
Subjt:  WAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLD---CVWIEEWPVEIS

Query:  NVLLGD
        + +  D
Subjt:  NVLLGD

ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]8.9e-16339.75Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FLR +M+++GF++TWV  +M C+ + +FS    G  VGH++P RGLRQG PLSPYLFL+C EG S LLRGAE+R  + G+++AR +P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEALT---------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HL FADDS+LF +A  ++ +                INY KS ++ SPN+       I  +L V    CH  YLGLP+   + R    + +KD++W+ I
Subjt:  SHLFFADDSLLFFRARAEEALT---------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GWK K  S AGKE+L+K+++QAIP Y+M+CFR+PK L KE++  MA+FWW  ++D   IHWV WE LCK K  GGLGFR++E FNQ+LLAKQCWR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + + RY P   F EA +G  PSFIWRSL WG++LL +G RWR+G+G+SI +Y   WLP     +I S P LPLS+RV DLF+  G W+   ++
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR
          F   E ++IL+IPL S    D LIWH+E+NG++SVKSGYRLA      +  +P S+  D    +W  +W L+IP+K KFFLWR   D LP    L  R
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR

Query:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCS--RICLSDIIWAAKENFVLLDFE-LMVVFWWAVWNLRNTLTWGGSSDDRDLWLWA
         +   P+C  C+   E   H  W C   + +W   +  A+   C   R+     +W A +     + + L     W +WN RN+  + G S+     L  
Subjt:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCS--RICLSDIIWAAKENFVLLDFE-LMVVFWWAVWNLRNTLTWGGSSDDRDLWLWA

Query:  SEYLAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYK
           LA     A        GR + P        Q     W+PPP          +V+        G V+R+++   + +    +   +G    E  A  +
Subjt:  SEYLAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYK

Query:  GITLAFQMGFSSGSFIVESDSLR-LVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD
        G+  A  MGF+    I+E D+   L  I+  E  +  + G L++++  LL+        +TPR GNKVAH LA  AF   + V WIEE P  +  VL  D
Subjt:  GITLAFQMGFSSGSFIVESDSLR-LVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]8.8e-17140.38Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FLR++M+++GF++TWV  +M C+ + +FS    G  VGH++P RGLRQG PLSPYLFL+C EG S LLRGAE+R  + G+++AR  P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRA-----RAEEAL----------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HL FADDS+LF +A     RA E L           INY KS  + SPN+       I  +L V    CH +YLGLP+   + R    + +KD++W+ I
Subjt:  SHLFFADDSLLFFRA-----RAEEAL----------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GWK K  S AGKE+L+K+++QAIP Y+M+CFR+PK L KE++  MA+FWW  ++D   IHWV WE LCK K  GGLGFR++E FNQ+LLAKQCWR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + + RY P   F EA +G  PSFIWRSL WG++LL +G RWR+GNG+SI +Y   WLP     +I S P LPLS+ V DLF+  G W+   ++
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR
          F   E ++ L+IPL S    D LIWH+E+NG++SVKSGYRLA      +  +P S   D    +W  +W L+IP+K KFFLWR   D LP    L  R
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR

Query:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWLWASEY
         +   P+C  C+   E   H  W C   + +W  S +             ++  A + +    +  L     W +WN RN+  + G S+     L     
Subjt:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWLWASEY

Query:  LAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGIT
        LA     A        GR + P        Q     W+PPP+G++K+NVD +V+        G V+R+++   + +    +   +G    E  A  +G+ 
Subjt:  LAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGIT

Query:  LAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEV----GLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD
         A  MGF++   ++E D+   +      +L   E     GLL++++  LLH        +TPR GNKVAH LA  AF   + V WIEE P  +  VL  D
Subjt:  LAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEV----GLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD

XP_030483669.1 uncharacterized protein LOC115700241 [Cannabis sativa]4.7e-16438.32Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        MSKA+DRVEW +L Q+M++MGF S  +DLI+RC+ SV++SF LNG+ +G ++P+RG+RQGDPLSPYLFL+CAEGLS LL+ +EQ   + GL+++R++P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEALTI---------------NYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        SHLFFADDS+LF RA  + A +I               N +K V++FSPN+   +Q +   +L +   PCH +YLGLPSF  R+++     + D++W+ +
Subjt:  SHLFFADDSLLFFRARAEEALTI---------------NYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
          WK + FS  GKEVLLK++VQAIP Y M+CFRLP  L  +I   M+ FWWG +   N IHW +W SLCK K  GGLGFRN  LFNQ+LLAKQ WR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P+SLLG VL  RYF + +   AGLGA PS  WRS++WG++LL +G +WR+G G +I     +WLP   +    S      S +V+DL      WD   I 
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGL
         +F  ++ + IL IPL    A+D+LIW+   +G ++VKSGY+ A SLA  ++ ++S   ++ +WW   WKL++PSK + F+W++FH+ LP    L ++ +
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGL

Query:  DILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWL-WASEYL
           P C LC    E   H  + C+  + +W  S     ++  +    +D +     N    +FE  +V  W++W  RN       S      L +A+ YL
Subjt:  DILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWL-WASEYL

Query:  AIYQ----------GVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKG
          YQ          G AT   +R  V     L      W  PP G  KLN DA+     G    G V+RDS   ++ +    +  C+  +  E  A+   
Subjt:  AIYQ----------GVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKG

Query:  ITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSL-LDCVWIEEWPVEISNVLL
        +  A  +G S     +E+DSL +V+      +  S   L+++D+  L+    R ++    R  N  A  LA  A ++  D  W+EE+P  +  V++
Subjt:  ITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSL-LDCVWIEEWPVEISNVLL

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]2.4e-16038.35Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        MSKA+DRVEW F+ Q+MI+MGF +  V+LI+RC+ SVS+SF LNG   G VIPSRG+RQGDPLSPYLFL+CAEGLS LL+  E    + GL+I+RS+P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEA---------------LTINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        SHLFFADDS+LF RA  + A                 IN EK V++FS N+    Q +   +L +   PCH QYLGLPSF  +N+      + D++W+ +
Subjt:  SHLFFADDSLLFFRARAEEA---------------LTINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
          WK   FS  GKEVLLK++VQAIP Y M+CFRLP  L  +I   MA+FWWG +     IHW +W  LCK K  GGLGFRN   FNQ+LLAKQ WR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P+SLL  +L+ RYF + ++  AGLG+ PS  WRSL+WG++LL +G RWR+G+G  I     +WLP   + + +       +  V+DL +    WD   + 
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGL
         +F  ++   +L IPL     DD+LIW+    G+++VKSGY  A SLA Q   + SN  S+  WW   WKL++P K + F+W++FH  LP    L +R +
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGL

Query:  DILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGS-SDDRDLWLWASEYL
           P C +CNS  E   H  + C   + +W  S F   +++  R   +D +     +    + EL +V  W++W+ RN +  G S      +  +A  YL
Subjt:  DILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGS-SDDRDLWLWASEYL

Query:  AIYQ-------------GVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAM
          +Q             G AT   +RP  +F     ++  +W  PP G  KLN DA++  +      G VLR+S   ++++A S  P        E  A+
Subjt:  AIYQ-------------GVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAM

Query:  YKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSL-LDCVWIEEWPVEISNVL
           ++L + +  +     +E+DSL +V+        +S    L+++I  L+    R ++    R  N  AH LA  A ++  DC+W+  +P  +  ++
Subjt:  YKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSL-LDCVWIEEWPVEISNVL

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein4.3e-16339.75Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FLR +M+++GF++TWV  +M C+ + +FS    G  VGH++P RGLRQG PLSPYLFL+C EG S LLRGAE+R  + G+++AR +P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEALT---------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HL FADDS+LF +A  ++ +                INY KS ++ SPN+       I  +L V    CH  YLGLP+   + R    + +KD++W+ I
Subjt:  SHLFFADDSLLFFRARAEEALT---------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GWK K  S AGKE+L+K+++QAIP Y+M+CFR+PK L KE++  MA+FWW  ++D   IHWV WE LCK K  GGLGFR++E FNQ+LLAKQCWR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + + RY P   F EA +G  PSFIWRSL WG++LL +G RWR+G+G+SI +Y   WLP     +I S P LPLS+RV DLF+  G W+   ++
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR
          F   E ++IL+IPL S    D LIWH+E+NG++SVKSGYRLA      +  +P S+  D    +W  +W L+IP+K KFFLWR   D LP    L  R
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR

Query:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCS--RICLSDIIWAAKENFVLLDFE-LMVVFWWAVWNLRNTLTWGGSSDDRDLWLWA
         +   P+C  C+   E   H  W C   + +W   +  A+   C   R+     +W A +     + + L     W +WN RN+  + G S+     L  
Subjt:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCS--RICLSDIIWAAKENFVLLDFE-LMVVFWWAVWNLRNTLTWGGSSDDRDLWLWA

Query:  SEYLAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYK
           LA     A        GR + P        Q     W+PPP          +V+        G V+R+++   + +    +   +G    E  A  +
Subjt:  SEYLAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYK

Query:  GITLAFQMGFSSGSFIVESDSLR-LVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD
        G+  A  MGF+    I+E D+   L  I+  E  +  + G L++++  LL+        +TPR GNKVAH LA  AF   + V WIEE P  +  VL  D
Subjt:  GITLAFQMGFSSGSFIVESDSLR-LVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD

A0A5E4FZN9 PREDICTED: retrotransposon4.3e-17140.38Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FLR++M+++GF++TWV  +M C+ + +FS    G  VGH++P RGLRQG PLSPYLFL+C EG S LLRGAE+R  + G+++AR  P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRA-----RAEEAL----------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HL FADDS+LF +A     RA E L           INY KS  + SPN+       I  +L V    CH +YLGLP+   + R    + +KD++W+ I
Subjt:  SHLFFADDSLLFFRA-----RAEEAL----------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GWK K  S AGKE+L+K+++QAIP Y+M+CFR+PK L KE++  MA+FWW  ++D   IHWV WE LCK K  GGLGFR++E FNQ+LLAKQCWR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + + RY P   F EA +G  PSFIWRSL WG++LL +G RWR+GNG+SI +Y   WLP     +I S P LPLS+ V DLF+  G W+   ++
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR
          F   E ++ L+IPL S    D LIWH+E+NG++SVKSGYRLA      +  +P S   D    +W  +W L+IP+K KFFLWR   D LP    L  R
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR

Query:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWLWASEY
         +   P+C  C+   E   H  W C   + +W  S +             ++  A + +    +  L     W +WN RN+  + G S+     L     
Subjt:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWLWASEY

Query:  LAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGIT
        LA     A        GR + P        Q     W+PPP+G++K+NVD +V+        G V+R+++   + +    +   +G    E  A  +G+ 
Subjt:  LAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGIT

Query:  LAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEV----GLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD
         A  MGF++   ++E D+   +      +L   E     GLL++++  LLH        +TPR GNKVAH LA  AF   + V WIEE P  +  VL  D
Subjt:  LAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEV----GLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD

A0A6J5UN51 Reverse transcriptase domain-containing protein1.1e-15840.2Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FL Q++  MGF   ++ LIM CV +VS+S  + G   G +IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AE+ S + G+ IA S+P I
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEAL---------------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HLFFADDSLLF  A   EAL                +N  KS + FSP++    Q  I Q+L V   PCH +YLGLP+ + +++    + VKDRVW ++
Subjt:  SHLFFADDSLLFFRARAEEAL---------------TINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GW+GK  S AGKEVL+KS+ QAIP Y+M+ FRLP  L +EI   +AKFWW    D   IHW +W  +C+ K  GG+GFR +  FNQ+LL KQ WR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + K RYFPHSDF     G+ PSF W+S+LWGRDLL  G RWRIG+G  + IYG  W+P +    I S PTLP++SRV DLF+  G WD  K+ 
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQ------------PSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDR
          F   E E+IL IPL+ G   D  IW+F KNG +SVKSGY      A++Y+            PSSS+  S    W  +WKL++P K    LWR+  D 
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQ------------PSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDR

Query:  LPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLS---DIIWAAKENFVLLDFELMVVFW-WAVWNLRNTLTWGG
        LP+K  L +R +    +C  C +A E   H   GC V   +W    F + +   +   +    D +W    + +  D + +  F  W +WN RN + +G 
Subjt:  LPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLS---DIIWAAKENFVLLDFELMVVFW-WAVWNLRNTLTWGG

Query:  SSDDRDLWLW-ASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEG
              + +  A +Y A ++  +           SL   V   +W+PP    FKLNVD +   ++G    G ++RDS   ++ +     P    V   E 
Subjt:  SSDDRDLWLW-ASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEG

Query:  WAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLD---CVWIEEWPVEIS
        +A+  GI+ A  M  S     +ESDSL+ V + + E   ++  G L+D +RRLL       V   PRQ NK AH +A   FSL D    +W++  P+ + 
Subjt:  WAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLD---CVWIEEWPVEIS

Query:  NVLLGD
        + +  D
Subjt:  NVLLGD

A0A803QJN9 Uncharacterized protein3.0e-16438.36Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        MSKA+DRVEW +L Q+M++MGF S  +DLI+RC+ SV++SF LNG+ +G ++P+RG+RQGDPLSPYLFL+CAEGLS LL+ +EQ   + GL+++R++P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEALTI---------------NYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        SHLFFADDS+LF RA  + A +I               N +K V++FSPN+   +Q +   +L +   PCH +YLGLPSF  R+++     + D++W+ +
Subjt:  SHLFFADDSLLFFRARAEEALTI---------------NYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
          WK + FS  GKEVLLK++VQAIP Y M+CFRLP  L  +I   M+ FWWG +   N IHW +W SLCK K  GGLGFRN  LFNQ+LLAKQ WR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P+SLLG VL  RYF + +   AGLGA PS  WRS++WG++LL +G +WR+G G +I     +WLP   +    S      S +V+DL      WD   I 
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGL
         +F  ++ + IL IPL    A+D+LIW+   +G ++VKSGY+ A SLA  ++ ++S   ++ +WW   WKL++PSK + F+W++FH+ LP    L ++ +
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGL

Query:  DILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWL-WASEYL
           P C LC    E   H  + C+  + +W  S     ++  +    +D +     N    +FE  +V  W++W  RN       S      L +A+ YL
Subjt:  DILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDLWL-WASEYL

Query:  AIYQ----------GVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKG
          YQ          G AT   +R  V     L      W  PP G  KLN DA+     G    G V+RDS   ++ +    +  C+  +  E  A+   
Subjt:  AIYQ----------GVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKG

Query:  ITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSL-LDCVWIEEWPVEISNVL
        +  A  +G S     +E+DSL +V+      +  S   L+++D+  L+    R ++    R  N  A  LA  A ++  D  W+EE+P  +  V+
Subjt:  ITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSL-LDCVWIEEWPVEISNVL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)4.3e-16339.75Show/hide
Query:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI
        M+KAYDRVEW FLR +M+++GF++TWV  +M C+ + +FS    G  VGH++P RGLRQG PLSPYLFL+C EG S LLRGAE+R  + G+++AR +P +
Subjt:  MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPI

Query:  SHLFFADDSLLFFRARAEEALT---------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI
        +HL FADDS+LF +A  ++ +                INY KS ++ SPN+       I  +L V    CH  YLGLP+   + R    + +KD++W+ I
Subjt:  SHLFFADDSLLFFRARAEEALT---------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD
         GWK K  S AGKE+L+K+++QAIP Y+M+CFR+PK L KE++  MA+FWW  ++D   IHWV WE LCK K  GGLGFR++E FNQ+LLAKQCWR+ + 
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKD

Query:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR
        P SL+  + + RY P   F EA +G  PSFIWRSL WG++LL +G RWR+G+G+SI +Y   WLP     +I S P LPLS+RV DLF+  G W+   ++
Subjt:  PSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIR

Query:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR
          F   E ++IL+IPL S    D LIWH+E+NG++SVKSGYRLA      +  +P S+  D    +W  +W L+IP+K KFFLWR   D LP    L  R
Subjt:  CHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLA--HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQR

Query:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCS--RICLSDIIWAAKENFVLLDFE-LMVVFWWAVWNLRNTLTWGGSSDDRDLWLWA
         +   P+C  C+   E   H  W C   + +W   +  A+   C   R+     +W A +     + + L     W +WN RN+  + G S+     L  
Subjt:  GLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCS--RICLSDIIWAAKENFVLLDFE-LMVVFWWAVWNLRNTLTWGGSSDDRDLWLWA

Query:  SEYLAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYK
           LA     A        GR + P        Q     W+PPP          +V+        G V+R+++   + +    +   +G    E  A  +
Subjt:  SEYLAIYQGVAT-------GRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYK

Query:  GITLAFQMGFSSGSFIVESDSLR-LVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD
        G+  A  MGF+    I+E D+   L  I+  E  +  + G L++++  LL+        +TPR GNKVAH LA  AF   + V WIEE P  +  VL  D
Subjt:  GITLAFQMGFSSGSFIVESDSLR-LVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCV-WIEEWPVEISNVLLGD

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.6e-5326.46Show/hide
Query:  LPSFMPRNRSGTLKFVKDRVWRQIQGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGG
        +P    R    T   + +RV  ++ GW+ K  S AG+  L K+++ ++P ++M+   LP+ ++  + +    F WG + +  + H V W  +C PK  GG
Subjt:  LPSFMPRNRSGTLKFVKDRVWRQIQGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGG

Query:  LGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEAGLGARPSF--IWRSLLWG-RDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIH
        LG R  +  N++L++K  WR+ ++ +SL  LVL+ +Y          L  + S+   WRS+  G RD+++ G  W  G+G  I  +   W+  +  L++ 
Subjt:  LGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEAGLGARPSF--IWRSLLWG-RDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQIH

Query:  SA--PTLPLSSRVSDLFSGLGNWDEAKIRCHFLAS---ECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGM
        +   PT   +    DL+     WD AKI  +   +   E  +++ + LV+G A D L W F ++G FSV+S Y +   L V   P  + A   +     +
Subjt:  SA--PTLPLSSRVSDLFSGLGNWDEAKIRCHFLAS---ECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHAWWIGM

Query:  WKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWL-------------CSKFDAFYRSC-SRICLSDIIWAA
        WK+R+P + K FLW + +  + T+    +R L    +C +C   VE   H+   C     +W+              S F+  Y +   R    DI W+ 
Subjt:  WKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWL-------------CSKFDAFYRSC-SRICLSDIIWAA

Query:  KENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRD----LWLWASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGF
                    V+ WW  W  R    +G ++  RD    +  WA E    + G     + +P V+  +        W  P  G  K+N D + R + G 
Subjt:  KENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRD----LWLWASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGF

Query:  ATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPR
        A+ G VLRD + A        + RC     AE W +Y G+  A++         +E DS  +V      + D   +  L+      L      +++   R
Subjt:  ATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPR

Query:  QGNKVAHALASLAFSL
        + N++A  LA+ AFSL
Subjt:  QGNKVAHALASLAFSL

P11369 LINE-1 retrotransposable element ORF2 protein2.4e-1423.18Show/hide
Query:  KAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISH
        KA+D+++  F+ +++ R G    ++++I         +  +NGEK+  +    G RQG PLSPYLF +  E L+  +R   Q+  I G++I +    IS 
Subjt:  KAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISH

Query:  LFFADDSLLFF---RARAEEALT------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLG--LPSFMPRNRSGTLKFVKDRVWRQI
        L  ADD +++    +    E L             IN  KS +AF       +++ I +    +    + +YLG  L   +        K +K  +   +
Subjt:  LFFADDSLLFF---RARAEEALT------------INYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLG--LPSFMPRNRSGTLKFVKDRVWRQI

Query:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNC--FRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVF
        + WK    S  G+  ++K  +     Y  N    ++P     E+  ++ KF W      N+   ++   L   +  GG+   +++L+ ++++ K  W  +
Subjt:  QGWKGKFFSTAGKEVLLKSIVQAIPCYTMNC--FRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVF

Query:  KD
        +D
Subjt:  KD

P92555 Uncharacterized mitochondrial protein AtMg012506.9e-1755.26Show/hide
Query:  VCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISHLFFADDS
        VC     F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+++  + G+R++ +SP I+HL FADD+
Subjt:  VCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003109.2e-3846.31Show/hide
Query:  AIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPK-CLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFE
        A+P Y M+CFRL K L K++  +M +FWW   E+  +I WV+W+ LCK K   GGLGFR++  FNQ+LLAKQ +R+   P +LL  +L+ RYFPHS   E
Subjt:  AIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPK-CLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFE

Query:  AGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSL
          +G RPS+ WRS++ GR+LL+RG    IG+G+   ++   W+ +E  L
Subjt:  AGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSL

Arabidopsis top hitse value%identityAlignment
AT2G02650.1 Ribonuclease H-like superfamily protein1.7e-1525.28Show/hide
Query:  VKSGYRLA------HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMW
        ++SGY +A         A+Q  P S+           +WKL +  K K FLWR     L T   L  R +D  P+C  C    E   H+ + C   + +W
Subjt:  VKSGYRLA------HSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMW

Query:  LCSKF---------DAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDL-----------WLWASEYLAIYQGVATGRV
          +            +F  + +R+    I  +  +    LD  L     W +W  RN   +       D            WL A+E            V
Subjt:  LCSKF---------DAFYRSCSRICLSDIIWAAKENFVLLDFELMVVFWWAVWNLRNTLTWGGSSDDRDL-----------WLWASEYLAIYQGVATGRV

Query:  ARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGITLAFQMGFSSGSFIVESDSL
        A   +Q S   + +  +W PPP G  K N D+     S +   G+ +R+ +  ++L   + L        AE       + + +  G     F  ESDS 
Subjt:  ARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGITLAFQMGFSSGSFIVESDSL

Query:  RLVK-IYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALAS
         LV  I +GE  D S +G L+ DIR  +       + F  R+ N  A ALAS
Subjt:  RLVK-IYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALAS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-1922.47Show/hide
Query:  QYLGLPSFMPRNRSGTLKFVKDRVWRQIQGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPK
        +YLGLP    +  +     + +++  +I  W  +  S AG+  L+ S++ ++  + M+ FRLP   IKEI    + F W G E   +   V+W  +C PK
Subjt:  QYLGLPSFMPRNRSGTLKFVKDRVWRQIQGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPK

Query:  CLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQI
          GGLG R+++  N                       KG ++  S     G     S++W+ +L  R L +   +  I NG +   +  NW         
Subjt:  CLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSLQI

Query:  HSAPTLPLSSRVSDLFSGLGNWDEAKIRCHFLASECESILKIPLVSGLADDIL-----IWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHA----
                 S++  L    G+     +     AS  E+++         D +L     I      G+ S +   R   +  + ++P  +  ++  A    
Subjt:  HSAPTLPLSSRVSDLFSGLGNWDEAKIRCHFLASECESILKIPLVSGLADDIL-----IWHFEKNGIFSVKSGYRLAHSLAVQYQPSSSNADSMHA----

Query:  -----WWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGC
             W+ G+W      K+    W    +RL T   ++         CVLC+  VE   HLF+ C
Subjt:  -----WWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGC

AT4G29090.1 Ribonuclease H-like superfamily protein4.5e-6430.42Show/hide
Query:  AIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEA
        A+P YTM CF LPK + K+I   +A FWW   ++A  +HW +W+ L   K  GG+GF+++E FN +LL KQ WR+   P SL+  V K RYF  SD   A
Subjt:  AIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEA

Query:  GLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWL---PNEFSLQIHSAPTLPLSS-----RVSDLFSGLG-NWDEAKIRCHFLASECESILK
         LG+RPSF+W+S+   +++L +G R  +GNG  I I+   WL   P   +L++   P    +S     +VSDL    G  W +  I   F   E + I +
Subjt:  GLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWL---PNEFSLQIHSAPTLPLSS-----RVSDLFSGLG-NWDEAKIRCHFLASECESILK

Query:  IPLVSGLADDILIWHFEKNGIFSVKSGY-RLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSA
        +        D   W +  +G ++VKSGY  L   +  +  P   +  S++  +  +WK +   K + FLW+   + LP    L  R L     C+ C S 
Subjt:  IPLVSGLADDILIWHFEKNGIFSVKSGY-RLAHSLAVQYQPSSSNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSA

Query:  VEDCFHLFWGCSVVRHMWLCSKF---------DAFYRSCSRICLSDIIWAAKENFVLLDFE--LMVVFW--WAVWNLRNTLTWGGSSDDRDLWLWASEYL
         E   HL + C+  R  W  S           D+ Y         ++ W          +E    +V W  W +W  RN L + G   +    L  +E  
Subjt:  VEDCFHLFWGCSVVRHMWLCSKF---------DAFYRSCSRICLSDIIWAAKENFVLLDFE--LMVVFW--WAVWNLRNTLTWGGSSDDRDLWLWASEYL

Query:  AIYQGVATGRVARPGVQFSLDLQVNRC---RWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGITLAFQM
             +   R+           QVNR    RW+PPP    K N DA+   D+     G+VLR+    V       LP+   V  AE  AM   +    + 
Subjt:  AIYQGVATGRVARPGVQFSLDLQVNRC---RWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRCWGVDLAEGWAMYKGITLAFQM

Query:  GFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLD
         F     I ESDS  L++I + + +  S +   + D++RLL      K +F PR+GN +A  +A  + S L+
Subjt:  GFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLD

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.6e-3946.31Show/hide
Query:  AIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPK-CLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFE
        A+P Y M+CFRL K L K++  +M +FWW   E+  +I WV+W+ LCK K   GGLGFR++  FNQ+LLAKQ +R+   P +LL  +L+ RYFPHS   E
Subjt:  AIPCYTMNCFRLPKCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPK-CLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFE

Query:  AGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSL
          +G RPS+ WRS++ GR+LL+RG    IG+G+   ++   W+ +E  L
Subjt:  AGLGARPSFIWRSLLWGRDLLARGCRWRIGNGLSIPIYGSNWLPNEFSL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)4.9e-1855.26Show/hide
Query:  VCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISHLFFADDS
        VC     F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+++  + G+R++ +SP I+HL FADD+
Subjt:  VCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTAAGGCATACGACCGGGTGGAGTGGTCCTTCTTGAGGCAAATCATGATTCGGATGGGGTTTGCTTCTACGTGGGTTGACCTGATTATGCGATGTGTGTGCTCGGT
TTCTTTTTCGTTCAATCTGAATGGTGAGAAGGTGGGCCATGTGATTCCTTCTAGGGGTCTTAGACAGGGTGATCCGTTATCTCCTTATCTCTTTCTGTTATGTGCTGAGG
GTTTATCCAGTCTGCTCCGTGGAGCTGAACAGAGGTCGCTTATTGCCGGTCTTCGGATTGCTCGGTCCAGTCCGCCGATTTCTCACTTGTTCTTTGCTGATGATAGCCTT
CTTTTCTTTAGAGCTAGAGCGGAGGAGGCCTTGACTATTAACTATGAGAAATCAGTTGTGGCTTTTAGTCCTAATTCGGGGGTCGACTCTCAACAGTATATCAGTCAGAT
TCTCTTTGTCGCACGCAGCCCGTGTCATCATCAGTATCTTGGCCTTCCTTCGTTTATGCCACGTAATCGCTCAGGGACTCTGAAGTTTGTCAAAGATAGGGTATGGCGTC
AGATTCAGGGATGGAAGGGCAAATTTTTTTCTACAGCAGGTAAGGAGGTGCTTCTTAAATCGATTGTTCAAGCTATTCCTTGCTATACTATGAACTGTTTTCGCCTTCCA
AAGTGTTTGATTAAAGAAATCCACCGGTCTATGGCCAAGTTTTGGTGGGGTGGCTCTGAGGATGCTAATCGAATCCATTGGGTGAGTTGGGAGTCATTGTGCAAGCCTAA
GTGTTTGGGGGGTCTGGGATTTCGAAATATGGAGTTGTTTAATCAATCTCTTTTGGCTAAGCAGTGTTGGCGGGTTTTTAAAGATCCTTCTTCATTGTTAGGGTTGGTGT
TGAAAGGGCGTTATTTTCCACATTCTGACTTCTTTGAGGCAGGCTTAGGTGCCCGACCGTCTTTTATATGGCGTAGTCTACTTTGGGGTCGTGATCTTCTAGCTCGGGGT
TGTAGGTGGCGGATTGGGAATGGCCTTTCCATCCCTATCTATGGTTCCAACTGGTTGCCGAATGAGTTTTCTCTCCAGATCCACTCGGCTCCTACCTTACCATTGTCTAG
TCGGGTCAGTGATCTTTTTTCGGGCTTGGGTAATTGGGATGAGGCTAAGATTAGGTGTCATTTTCTTGCATCTGAGTGCGAATCTATCTTGAAAATTCCATTAGTTTCTG
GCTTGGCTGATGATATACTTATTTGGCATTTTGAGAAGAATGGGATTTTTTCGGTCAAGAGTGGATATCGCCTAGCTCATTCTTTGGCTGTGCAGTATCAACCCTCGTCG
TCGAATGCGGACAGTATGCATGCATGGTGGATAGGTATGTGGAAATTGCGTATTCCTAGCAAACATAAGTTCTTTCTTTGGCGGTTGTTTCATGACCGTTTGCCTACGAA
GGTAAATCTTATTCAGCGGGGTCTTGATATTTTACCTTTATGTGTGTTATGTAATTCTGCTGTTGAGGATTGTTTTCACTTGTTTTGGGGTTGTTCTGTGGTGAGACATA
TGTGGTTGTGCTCAAAATTTGATGCCTTTTACCGATCATGTTCTAGGATTTGTCTTTCTGATATAATTTGGGCAGCTAAGGAAAATTTTGTTTTGCTAGATTTTGAACTT
ATGGTTGTTTTTTGGTGGGCTGTCTGGAACTTGAGGAACACTTTGACTTGGGGCGGTTCATCAGATGATAGGGATCTGTGGTTATGGGCTTCTGAGTACCTTGCTATTTA
TCAAGGGGTTGCAACGGGTCGAGTAGCGAGGCCTGGGGTTCAATTTTCTTTGGATTTGCAGGTTAATCGGTGTAGGTGGCAGCCGCCTCCTAGTGGTGTTTTTAAGCTGA
ATGTTGATGCATCGGTCCGACCTGATTCTGGGTTTGCTACTGGTGGGTATGTTCTGCGTGATTCCTCCTGTGCGGTCCTTTTATCGGCATGCTCAGTTTTGCCACGATGC
TGGGGTGTTGATTTGGCGGAAGGGTGGGCTATGTATAAAGGAATTACCCTTGCGTTCCAAATGGGGTTCTCATCGGGGTCTTTCATTGTGGAGTCTGATTCCTTGAGGTT
GGTGAAGATCTATCATGGTGAAGTGCTTGATGTTTCGGAAGTGGGTCTCTTGATGGATGATATTCGTCGGCTTCTCCACCCTGGTGGTAGGGATAAGGTTCTGTTTACTC
CTCGTCAGGGCAACAAAGTGGCGCATGCTTTAGCCAGCTTGGCCTTTTCGCTCTTGGATTGTGTTTGGATTGAAGAATGGCCTGTTGAGATCTCTAATGTGTTGTTGGGG
GATGCTCCTCTATCCCAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTAAGGCATACGACCGGGTGGAGTGGTCCTTCTTGAGGCAAATCATGATTCGGATGGGGTTTGCTTCTACGTGGGTTGACCTGATTATGCGATGTGTGTGCTCGGT
TTCTTTTTCGTTCAATCTGAATGGTGAGAAGGTGGGCCATGTGATTCCTTCTAGGGGTCTTAGACAGGGTGATCCGTTATCTCCTTATCTCTTTCTGTTATGTGCTGAGG
GTTTATCCAGTCTGCTCCGTGGAGCTGAACAGAGGTCGCTTATTGCCGGTCTTCGGATTGCTCGGTCCAGTCCGCCGATTTCTCACTTGTTCTTTGCTGATGATAGCCTT
CTTTTCTTTAGAGCTAGAGCGGAGGAGGCCTTGACTATTAACTATGAGAAATCAGTTGTGGCTTTTAGTCCTAATTCGGGGGTCGACTCTCAACAGTATATCAGTCAGAT
TCTCTTTGTCGCACGCAGCCCGTGTCATCATCAGTATCTTGGCCTTCCTTCGTTTATGCCACGTAATCGCTCAGGGACTCTGAAGTTTGTCAAAGATAGGGTATGGCGTC
AGATTCAGGGATGGAAGGGCAAATTTTTTTCTACAGCAGGTAAGGAGGTGCTTCTTAAATCGATTGTTCAAGCTATTCCTTGCTATACTATGAACTGTTTTCGCCTTCCA
AAGTGTTTGATTAAAGAAATCCACCGGTCTATGGCCAAGTTTTGGTGGGGTGGCTCTGAGGATGCTAATCGAATCCATTGGGTGAGTTGGGAGTCATTGTGCAAGCCTAA
GTGTTTGGGGGGTCTGGGATTTCGAAATATGGAGTTGTTTAATCAATCTCTTTTGGCTAAGCAGTGTTGGCGGGTTTTTAAAGATCCTTCTTCATTGTTAGGGTTGGTGT
TGAAAGGGCGTTATTTTCCACATTCTGACTTCTTTGAGGCAGGCTTAGGTGCCCGACCGTCTTTTATATGGCGTAGTCTACTTTGGGGTCGTGATCTTCTAGCTCGGGGT
TGTAGGTGGCGGATTGGGAATGGCCTTTCCATCCCTATCTATGGTTCCAACTGGTTGCCGAATGAGTTTTCTCTCCAGATCCACTCGGCTCCTACCTTACCATTGTCTAG
TCGGGTCAGTGATCTTTTTTCGGGCTTGGGTAATTGGGATGAGGCTAAGATTAGGTGTCATTTTCTTGCATCTGAGTGCGAATCTATCTTGAAAATTCCATTAGTTTCTG
GCTTGGCTGATGATATACTTATTTGGCATTTTGAGAAGAATGGGATTTTTTCGGTCAAGAGTGGATATCGCCTAGCTCATTCTTTGGCTGTGCAGTATCAACCCTCGTCG
TCGAATGCGGACAGTATGCATGCATGGTGGATAGGTATGTGGAAATTGCGTATTCCTAGCAAACATAAGTTCTTTCTTTGGCGGTTGTTTCATGACCGTTTGCCTACGAA
GGTAAATCTTATTCAGCGGGGTCTTGATATTTTACCTTTATGTGTGTTATGTAATTCTGCTGTTGAGGATTGTTTTCACTTGTTTTGGGGTTGTTCTGTGGTGAGACATA
TGTGGTTGTGCTCAAAATTTGATGCCTTTTACCGATCATGTTCTAGGATTTGTCTTTCTGATATAATTTGGGCAGCTAAGGAAAATTTTGTTTTGCTAGATTTTGAACTT
ATGGTTGTTTTTTGGTGGGCTGTCTGGAACTTGAGGAACACTTTGACTTGGGGCGGTTCATCAGATGATAGGGATCTGTGGTTATGGGCTTCTGAGTACCTTGCTATTTA
TCAAGGGGTTGCAACGGGTCGAGTAGCGAGGCCTGGGGTTCAATTTTCTTTGGATTTGCAGGTTAATCGGTGTAGGTGGCAGCCGCCTCCTAGTGGTGTTTTTAAGCTGA
ATGTTGATGCATCGGTCCGACCTGATTCTGGGTTTGCTACTGGTGGGTATGTTCTGCGTGATTCCTCCTGTGCGGTCCTTTTATCGGCATGCTCAGTTTTGCCACGATGC
TGGGGTGTTGATTTGGCGGAAGGGTGGGCTATGTATAAAGGAATTACCCTTGCGTTCCAAATGGGGTTCTCATCGGGGTCTTTCATTGTGGAGTCTGATTCCTTGAGGTT
GGTGAAGATCTATCATGGTGAAGTGCTTGATGTTTCGGAAGTGGGTCTCTTGATGGATGATATTCGTCGGCTTCTCCACCCTGGTGGTAGGGATAAGGTTCTGTTTACTC
CTCGTCAGGGCAACAAAGTGGCGCATGCTTTAGCCAGCTTGGCCTTTTCGCTCTTGGATTGTGTTTGGATTGAAGAATGGCCTGTTGAGATCTCTAATGTGTTGTTGGGG
GATGCTCCTCTATCCCAATGA
Protein sequenceShow/hide protein sequence
MSKAYDRVEWSFLRQIMIRMGFASTWVDLIMRCVCSVSFSFNLNGEKVGHVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAEQRSLIAGLRIARSSPPISHLFFADDSL
LFFRARAEEALTINYEKSVVAFSPNSGVDSQQYISQILFVARSPCHHQYLGLPSFMPRNRSGTLKFVKDRVWRQIQGWKGKFFSTAGKEVLLKSIVQAIPCYTMNCFRLP
KCLIKEIHRSMAKFWWGGSEDANRIHWVSWESLCKPKCLGGLGFRNMELFNQSLLAKQCWRVFKDPSSLLGLVLKGRYFPHSDFFEAGLGARPSFIWRSLLWGRDLLARG
CRWRIGNGLSIPIYGSNWLPNEFSLQIHSAPTLPLSSRVSDLFSGLGNWDEAKIRCHFLASECESILKIPLVSGLADDILIWHFEKNGIFSVKSGYRLAHSLAVQYQPSS
SNADSMHAWWIGMWKLRIPSKHKFFLWRLFHDRLPTKVNLIQRGLDILPLCVLCNSAVEDCFHLFWGCSVVRHMWLCSKFDAFYRSCSRICLSDIIWAAKENFVLLDFEL
MVVFWWAVWNLRNTLTWGGSSDDRDLWLWASEYLAIYQGVATGRVARPGVQFSLDLQVNRCRWQPPPSGVFKLNVDASVRPDSGFATGGYVLRDSSCAVLLSACSVLPRC
WGVDLAEGWAMYKGITLAFQMGFSSGSFIVESDSLRLVKIYHGEVLDVSEVGLLMDDIRRLLHPGGRDKVLFTPRQGNKVAHALASLAFSLLDCVWIEEWPVEISNVLLG
DAPLSQ