CuGenDBv2

Lag0037570 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0037570
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:7355452..7358480
RNA-Seq Expression: Lag0037570
Synteny: Lag0037570
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type
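
The Genome location field above uses a `chromosome:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention), the span of the locus could be computed as follows. The function names `parse_location` and `span_length` are hypothetical and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location: str) -> int:
    """Length of the locus, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return abs(end - start) + 1


print(span_length("chr2:7355452..7358480"))  # 3029
```

Under that assumption the locus spans 3029 bp on chr2.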


Homology
GenBank top hits (e value, % identity, alignment)
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica] (e value: 3.5e-70; % identity: 34.3)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +E FNQALLAKQCWR+L+ P SL+  + + RY P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+G+G +  +Y   WLP     +I S P L  +
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR
        + V +LFT+SG W+V LL+ IF   + +AIL+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +
Subjt:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN
        FFLWR   D LP    L  R +    +C  C    + +     +C +   +W                F E+  A++   +G +  L     W +W+ RN
Subjt:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN

Query:  NLFWGGQSDG--------RDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDL
        +  + G+S+           L    SD  +  H   GR         QS  Q     WRPPP            V+  N +              +    
Subjt:  NLFWGGQSDG--------RDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLR-LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEE
         E  A  +G++ A  +GF D ++E D+   L  I + E ++      + EV  L+++ R ++  W       TPR GNKVAH LA+ AF   + V W+EE
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLR-LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEE

Query:  WPSEVSDVLRGDFASV
         PS +  VL  D  S+
Subjt:  WPSEVSDVLRGDFASV

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] (e value: 9.3e-71; % identity: 33.4)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +E FNQALLAKQCWR+L+ P SL+  + + RY P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+GNG +  +Y   WLP     +I S P L  +
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR
        + V +LFT+SG W+V LL+ IF   + +A L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +
Subjt:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN
        FFLWR   D LP    L  R +    +C  C    + +     +C +   +W                F E+  A++   +G +  L     W +W+ RN
Subjt:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN

Query:  NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSV
        +  + G+S+      +    L A         +      QS  Q     WRPPP  + K+N+D +                                +  
Subjt:  NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSV

Query:  DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WL
           E  A  +G++ A  +GF   V+E D+   +  IL+ E  +      + EV  L+ + R ++  W       TPR GNKVAH LA+ AF   + V W+
Subjt:  DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WL

Query:  EEWPSEVSDVLRGDFASV
        EE P  +  VL  D  S+
Subjt:  EEWPSEVSDVLRGDFASV

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] (e value: 4.2e-71; % identity: 33.4)
Query:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTV
        FNQAL+AKQ WR+++ P+SL+  V+K RY+  S F  A +GS PSF+WRS+LWG +++ +G RWRIG+G+   +Y   W+P   + Q  S   L   + V
Subjt:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTV

Query:  SELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRL
        ++L  +   W V  L   F   D EAIL+I L  G  ED ++WHF+K G +SVKSGY+LA      + P SSNS      W   W L++P K + F+WR 
Subjt:  SELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRL

Query:  CHDRLPTKVNLLK-RGLTVSLCVFCAMMIQKIASIC----SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNN-LFWG
          + LPT  NL K R L   +C  C + ++ ++ +     +   +W                F   I  M  + +  + EL++++ W +WS RN  +F G
Subjt:  CHDRLPTKVNLLK-RGLTVSLCVFCAMMIQKIASIC----SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNN-LFWG

Query:  GQSDGRDLWAYSSDYLHAFH---VGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWS------------------------------VDL
         +SD R L A +   L A+      G   GA+D      G  +++  W+PP   VLKLN+DA+                                  V L
Subjt:  GQSDGRDLWAYSSDYLHAFH---VGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDASRCWS------------------------------VDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAF--SYVDRVWLEEWPSEVS
        AE  A++ G+Q+A Q+     +VE+D   +V++LN      +E+  ++ D+RR    +   +  F PR  N  AH LA+ A   S  D VW+  +P+EV 
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAF--SYVDRVWLEEWPSEVS

Query:  DVL
        +VL
Subjt:  DVL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] (e value: 8.1e-83; % identity: 36.25)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +ELFN+ALLAKQCWR+L  P+S+L  VLKGRYF    F+EA I   PS++WRS+LWGR+LL +G RWRIGNG +  IYG NW+PN+ +L+I S+P L   
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--HTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHR
        S VS L     GGW   ++R  F   + + IL IP+ +G+ EDRLIW++EK G +SV+SGY++A  +   +Q  P SS+SE VR WW+G W++++PNK +
Subjt:  STVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--HTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLTV-SLCVFCAMMIQ---KIASICS-GPALWFEEIIGAM---------RDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ
         FLWRLC DRLPT  NL KRG+ + + C FC    +    +  IC    ALW     G +          + L+  DFE + +  W +W+ RN   +   
Subjt:  FFLWRLCHDRLPTKVNLLKRGLTV-SLCVFCAMMIQ---KIASICS-GPALWFEEIIGAM---------RDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ

Query:  SD-----GRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDL
        +      G +L  +++ Y   F        A+ +            +W+PP   + K+N DAS                                 SVD+
Subjt:  SD-----GRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSD
        AE  A  +G+QLA ++G                ++  L D+SE G ++   +   +  ++    F  R+GNK AH+LAR A    +  +W+E+WP E+  
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSD

Query:  VL
         L
Subjt:  VL

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] (e value: 2.8e-75; % identity: 34.34)
Query:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTV
        FNQAL+AKQ WR++Q PSSL+  VLK RYF  +GF+ AG+GS+PSFVWRS++WGR++L +G RWRIGNG+   +YG+NW+P   + +  SAP +   +TV
Subjt:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTV

Query:  SELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRL
        +EL      W   L+   F   D EAI++IPL +   ED+LIWH++K G +SVKSGY++A  +   + P  SN +  +  W  +W+L +P K + FLWR 
Subjt:  SELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRL

Query:  CHDRLPTKVNLLKRG-LTVSLCVFCAMMIQKIASI---CS-GPALW-----FEEIIGAMR--------------DKLTGPDFELVVIFWWSVWSLRNN-L
         HD LPT  NL K+  L   +C  C   ++ ++     C+    +W      EE+ G  R               K+ G +   V    W++W  RN  L
Subjt:  CHDRLPTKVNLLKRG-LTVSLCVFCAMMIQKIASI---CS-GPALW-----FEEIIGAMR--------------DKLTGPDFELVVIFWWSVWSLRNN-L

Query:  FWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS-------------------RC-----------WSVDL
        F G + +   + A +   + +F     +    + ++   G  E +  W PPPN   K+N+DA+                    C            SV +
Subjt:  FWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS-------------------RC-----------WSVDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEV
        AE  A+  G+++A +      + E+DSL ++ ++N +   ++E+G L+ DI+  L  + N K   +PR  N  AH LA+LA    + V WL+E P E+
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEV

TrEMBL top hits (e value, % identity, alignment)
A0A251NPF0 Reverse transcriptase domain-containing protein (e value: 1.7e-70; % identity: 34.3)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +E FNQALLAKQCWR+L+ P SL+  + + RY P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+G+G +  +Y   WLP     +I S P L  +
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR
        + V +LFT+SG W+V LL+ IF   + +AIL+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +
Subjt:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN
        FFLWR   D LP    L  R +    +C  C    + +     +C +   +W                F E+  A++   +G +  L     W +W+ RN
Subjt:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN

Query:  NLFWGGQSDG--------RDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDL
        +  + G+S+           L    SD  +  H   GR         QS  Q     WRPPP            V+  N +              +    
Subjt:  NLFWGGQSDG--------RDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLR-LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEE
         E  A  +G++ A  +GF D ++E D+   L  I + E ++      + EV  L+++ R ++  W       TPR GNKVAH LA+ AF   + V W+EE
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLR-LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEE

Query:  WPSEVSDVLRGDFASV
         PS +  VL  D  S+
Subjt:  WPSEVSDVLRGDFASV

A0A5E4FZN9 PREDICTED: retrotransposon (e value: 4.5e-71; % identity: 33.4)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +E FNQALLAKQCWR+L+ P SL+  + + RY P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+GNG +  +Y   WLP     +I S P L  +
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR
        + V +LFT+SG W+V LL+ IF   + +A L+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +
Subjt:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN
        FFLWR   D LP    L  R +    +C  C    + +     +C +   +W                F E+  A++   +G +  L     W +W+ RN
Subjt:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN

Query:  NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSV
        +  + G+S+      +    L A         +      QS  Q     WRPPP  + K+N+D +                                +  
Subjt:  NLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSV

Query:  DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WL
           E  A  +G++ A  +GF   V+E D+   +  IL+ E  +      + EV  L+ + R ++  W       TPR GNKVAH LA+ AF   + V W+
Subjt:  DLAEGWAVYKGIQLARQLGFVDFVVETDSLRLV-KILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WL

Query:  EEWPSEVSDVLRGDFASV
        EE P  +  VL  D  S+
Subjt:  EEWPSEVSDVLRGDFASV

A0A6J1DAR4 uncharacterized protein LOC111018954 (e value: 3.9e-83; % identity: 36.25)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +ELFN+ALLAKQCWR+L  P+S+L  VLKGRYF    F+EA I   PS++WRS+LWGR+LL +G RWRIGNG +  IYG NW+PN+ +L+I S+P L   
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--HTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHR
        S VS L     GGW   ++R  F   + + IL IP+ +G+ EDRLIW++EK G +SV+SGY++A  +   +Q  P SS+SE VR WW+G W++++PNK +
Subjt:  STVSELFT-ASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLA--HTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLTV-SLCVFCAMMIQ---KIASICS-GPALWFEEIIGAM---------RDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ
         FLWRLC DRLPT  NL KRG+ + + C FC    +    +  IC    ALW     G +          + L+  DFE + +  W +W+ RN   +   
Subjt:  FFLWRLCHDRLPTKVNLLKRGLTV-SLCVFCAMMIQ---KIASICS-GPALWFEEIIGAM---------RDKLTGPDFELVVIFWWSVWSLRNNLFWGGQ

Query:  SD-----GRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDL
        +      G +L  +++ Y   F        A+ +            +W+PP   + K+N DAS                                 SVD+
Subjt:  SD-----GRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------------------------------RCWSVDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSD
        AE  A  +G+QLA ++G                ++  L D+SE G ++   +   +  ++    F  R+GNK AH+LAR A    +  +W+E+WP E+  
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSD

Query:  VL
         L
Subjt:  VL

A0A803QQT2 Uncharacterized protein (e value: 3.0e-67; % identity: 31.78)
Query:  LFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTST
        +FNQALLAKQ WR L+ P  L   VLK  YFP+ G LEAG G+  SFVWRSL+WG++L+++G RWR+GNG +  +    WLP   + ++   P L     
Subjt:  LFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTST

Query:  VSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWR
        V++L  A G WD   +R+IFN  D + IL IP      ED+++WH+ K+G +SVKSGYR+A +   +     SN   +  WW  LWRL +P K + F+W+
Subjt:  VSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWR

Query:  LCHDRLPTKVNLLKRGLTVS-LCVFCAMMIQKIASICSGPALW-------FEEIIGAMRD--KLTGPD----------------FELVVIFWWSVWSLRN
        + H+ LP  VNL KRG+  S +C  C+  + +  +     ALW       +  + G   D  ++ G D                 E  ++  W++W++RN
Subjt:  LCHDRLPTKVNLLKRGLTVS-LCVFCAMMIQKIASICSGPALW-------FEEIIGAMRD--KLTGPD----------------FELVVIFWWSVWSLRN

Query:  NLFWGG-QSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS---------------RCWSVDLA-----------
         +  GG      ++  +  ++L  F    GR         +S    E   W PP    + +N+DA                    V L+           
Subjt:  NLFWGG-QSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS---------------RCWSVDLA-----------

Query:  ----EGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWL
            E  A+ KGIQ+  Q     F VETD L+ V ++  + +   ++  L++ IR ++S      + F  R+ N+VAH LA  A  +    +W+
Subjt:  ----EGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) (e value: 1.7e-70; % identity: 34.3)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT
        +E FNQALLAKQCWR+L+ P SL+  + + RY P   FLEA +G+ PSF+WRSL WG+ELL +G RWR+G+G +  +Y   WLP     +I S P L  +
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPT

Query:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR
        + V +LFT+SG W+V LL+ IF   + +AIL+IPL   +G D LIWH+E++G +SVKSGYRLA     +D+     S RV +   +W  +W L +PNK +
Subjt:  STVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRM---WWSGLWRLNVPNKHR

Query:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN
        FFLWR   D LP    L  R +    +C  C    + +     +C +   +W                F E+  A++   +G +  L     W +W+ RN
Subjt:  FFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIAS---IC-SGPALW----------------FEEIIGAMRDKLTGPDFELVVIFWWSVWSLRN

Query:  NLFWGGQSDG--------RDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDL
        +  + G+S+           L    SD  +  H   GR         QS  Q     WRPPP            V+  N +              +    
Subjt:  NLFWGGQSDG--------RDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPN----------RVLKLNIDAS----------RCWSVDL

Query:  AEGWAVYKGIQLARQLGFVDFVVETDSLR-LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEE
         E  A  +G++ A  +GF D ++E D+   L  I + E ++      + EV  L+++ R ++  W       TPR GNKVAH LA+ AF   + V W+EE
Subjt:  AEGWAVYKGIQLARQLGFVDFVVETDSLR-LVKILNGELHD------VSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEE

Query:  WPSEVSDVLRGDFASV
         PS +  VL  D  S+
Subjt:  WPSEVSDVLRGDFASV

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 1.6e-20; % identity: 22.97)
Query:  NQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRP----SFVWRSLLWG-RELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSA--PVL
        N+AL++K  WR+LQ+ +SL   VL+ +Y    G +       P    S  WRS+  G R+++  G  W  G+G+    +   W+  +  L++ +   P  
Subjt:  NQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRP----SFVWRSLLWG-RELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSA--PVL

Query:  SPTSTVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSG-EDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKH
          T    +L+    GWD A +           +  + L   +G  DRL W F + G FSV+S Y +   L + + P       +  +++ LW++ VP + 
Subjt:  SPTSTVSELFTASGGWDVALLRTIFNGADCEAILRIPLQQGSG-EDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKH

Query:  RFFLWRLCHDRLPTKVNLLKRGLTVS-LCVFCAMMIQKIASICSG-PA---LW-----------------FEEIIGAMRDKLTGPDFE----LVVIFWWS
        + FLW + +  + T+    +R L+ S +C  C   ++ +  +    PA   +W                 FE +   + D+    D        VI WW 
Subjt:  RFFLWRLCHDRLPTKVNLLKRGLTVS-LCVFCAMMIQKIASICSG-PA---LW-----------------FEEIIGAMRDKLTGPDFE----LVVIFWWS

Query:  VWSLRNNLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGV-WRPPPNRVLKLNIDAS----------------------------
         W  R    +G  +  RD   +  ++    +    R  + + L   +  + ER + W  P    +K+N D +                            
Subjt:  VWSLRNNLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGV-WRPPPNRVLKLNIDAS----------------------------

Query:  --RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS
          RC S   AE W VY G+  A +       +E DS  +V  L   + D   +  L+      L      +++   R+ N++A  LA  AFS
Subjt:  --RCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFS

P93295 Uncharacterized mitochondrial protein AtMg00310 (e value: 1.2e-17; % identity: 47.67)
Query:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL
        FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E  +G+RPS+ WRS++ GRELL RG    IG+G  T ++   W+ +E  L
Subjt:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL
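
The % identity values listed for each hit can be related to the alignment blocks. As a minimal sketch, assuming the usual BLAST convention that letters on the middle line mark identical residues while `+` and blanks do not, and that identity is taken over the full alignment length (both are assumptions about how this page computes the figure), the value for the P93295 hit can be recomputed. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Identical columns divided by alignment length, as a percentage."""
    # Letters on the midline are identities; '+' (similar) and spaces are not.
    identical = sum(1 for ch in midline if ch.isalpha())
    # The aligned query line (gaps included) gives the alignment length.
    return 100.0 * identical / len(query_line)


# Query and middle lines copied from the P93295 alignment above.
query = "FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL"
midline = "FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E  +G+RPS+ WRS++ GRELL RG    IG+G  T ++   W+ +E  L"

print(round(percent_identity(query, midline), 2))  # 47.67
```

Here 41 identical positions over 86 alignment columns give 47.67, which agrees with the value listed for this hit.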

Arabidopsis top hits (e value, % identity, alignment)
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 3.5e-07; % identity: 24.51)
Query:  LVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGV---WRPPPNRVLKLNIDAS------RC---WSVDLA
        LV    W +W  RN L F G + D  ++   + +    +         R+     SG Q ER +   W+ PP + +K N DA+      RC   W +   
Subjt:  LVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGV---WRPPPNRVLKLNIDAS------RC---WSVDLA

Query:  EGWAVYKG---------------------IQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARL
         G  ++ G                     +    +  +   + E+D+  LV +LN +      +   ++DI+++L  +   K  FTPR GNKVA  +AR 
Subjt:  EGWAVYKG---------------------IQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARL

Query:  AFSY
        + S+
Subjt:  AFSY

AT3G09510.1 Ribonuclease H-like superfamily protein (e value: 1.5e-23; % identity: 23.95)
Query:  LKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGG---WDVALLRTIFNG
        +K RYF     L+A +  + S+ W SLL G  LL +G R  IG+G+   I   N + +     + +        T++ LF   G    WD + +    + 
Subjt:  LKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTASGG---WDVALLRTIFNG

Query:  ADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRL------AHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRG
        +D   I RI L +    D++IW++   G ++V+SGY L       +  AI    GS + +      + +W L +  K + FLWR     L T   L  RG
Subjt:  ADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRL------AHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRG

Query:  LTVS------------------LCVFCAMMIQKIASICSGPALW---FEE----IIGAMRDKLTGPDFELVVIFW--WSVWSLRNNLFWG--GQSDGRDL
        + +                    C F  M  +   S      L    FEE    I+  ++D  T  DF  ++  W  W +W  RNN+ +    +S  + +
Subjt:  LTVS------------------LCVFCAMMIQKIASICSGPALW---FEE----IIGAMRDKLTGPDFELVVIFW--WSVWSLRNNLFWG--GQSDGRDL

Query:  W---AYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS-RCWSVDLAEGW-----------------------------AVYK
            A + D+L+A          + +        E +  WR PP   +K N DA      ++   GW                             A+  
Subjt:  W---AYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS-RCWSVDLAEGW-----------------------------AVYK

Query:  GIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY
         +Q     G+    +E D   L+ ++NG +   S +   ++DI    + + + +  F  R+GNK+AHVLA+   +Y
Subjt:  GIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSY

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 1.0e-38; % identity: 27.72)
Query:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWL---PNEFSLQIQSAP--
        +E FN ALL KQ WR+L  P SL+  V K RYF +S  L A +GSRPSFVW+S+   +E+L +G R  +GNG    I+   WL   P   +L++Q  P  
Subjt:  MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWL---PNEFSLQIQSAP--

Query:  ---VLSPTSTVSELFTASG-GWDVALLRTIFNGADCEAILRIPLQQGSGE--DRLIWHFEKHGNFSVKSGY-RLAHTLAIQDRPGSSNSERVRMWWSGLW
            +S    VS+L   SG  W   ++  +F   + E  L   L+ G     D   W +   G+++VKSGY  L   +  +  P   +   +   +  +W
Subjt:  ---VLSPTSTVSELFTASG-GWDVALLRTIFNGADCEAILRIPLQQGSGE--DRLIWHFEKHGNFSVKSGY-RLAHTLAIQDRPGSSNSERVRMWWSGLW

Query:  RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIASI---CSGPAL-WFEEII-----GAMRDKL------------TGPDFE----
        +     K + FLW+   + LP    L  R L+  S C+ C    + +  +   C+   L W    I     G   D +              P +E    
Subjt:  RLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLT-VSLCVFCAMMIQKIASI---CSGPAL-WFEEII-----GAMRDKL------------TGPDFE----

Query:  LVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHV--GGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------RC---WSVDLAE
        LV    W +W  RN L F G + + +++   + D L  + +      CG +  +      +   G WRPPP++ +K N DA+      RC   W +   +
Subjt:  LVVIFWWSVWSLRNNL-FWGGQSDGRDLWAYSSDYLHAFHV--GGGRCGARDSLWAQSGEQEERGVWRPPPNRVLKLNIDAS------RC---WSVDLAE

Query:  GWAVYKGIQLARQLGFV--------------------DFVV-ETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLA
        G   + G +   +L  V                    ++V+ E+DS  L++ILN +      +   + D++R+LS +   K +F PR+GN +A  +AR +
Subjt:  GWAVYKGIQLARQLGFV--------------------DFVV-ETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLA

Query:  FSYVD
         S+++
Subjt:  FSYVD

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 8.8e-19; % identity: 47.67)
Query:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL
        FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E  +G+RPS+ WRS++ GRELL RG    IG+G  T ++   W+ +E  L
Subjt:  FNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL


Sequences
CDS sequence
ATGGAGCTTTTTAACCAAGCCCTGCTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCCTCCCTTCTAGGCTGTGTGCTCAAGGGCCGCTATTTTCCCCAGTCGGG
TTTCTTGGAGGCAGGTATCGGTTCACGTCCGTCTTTCGTCTGGCGTAGCTTGTTATGGGGGCGGGAGCTCTTAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTG
CTACGCCCATATATGGCTCAAACTGGCTGCCGAATGAGTTCTCGCTTCAAATACAGTCGGCTCCAGTACTTTCTCCTACTAGTACGGTGAGTGAGTTGTTCACTGCGTCT
GGTGGATGGGATGTGGCTTTACTCAGGACGATTTTCAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACAACAGGGCTCGGGGGAGGACCGCTTAATCTGGCA
CTTTGAGAAGCATGGGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCATACATTGGCTATTCAGGACCGACCTGGTTCCTCGAATTCCGAGAGAGTGCGCATGTGGT
GGTCCGGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACT
GTATCCCTTTGTGTGTTTTGTGCGATGATGATACAGAAGATTGCCTCCATCTGTTCTGGACCTGCCCTGTGGTTCGAGGAAATCATTGGGGCGATGAGGGACAAACTGAC
AGGGCCGGATTTTGAGCTTGTGGTGATTTTTTGGTGGTCTGTGTGGAGCCTACGAAATAACCTGTTTTGGGGTGGGCAGTCAGATGGTCGGGATCTCTGGGCATATTCGA
GTGATTACCTCCATGCCTTCCATGTTGGTGGGGGACGTTGCGGGGCAAGGGACTCCTTATGGGCTCAATCGGGAGAGCAAGAAGAGCGCGGTGTATGGAGACCGCCCCCT
AATAGGGTGCTGAAACTTAATATTGATGCTTCAAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAAAGGGATCCAACTTGCTCGACAGTTGGGGTTTGT
GGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTTTGAATGGGGAGCTGCACGATGTGTCGGAAGTGGGGCTGCTGATGGATGACATTCGACGGATCCTCA
GTCCTTGGGTCAACGGTAAGGTGTTGTTTACTCCACGTCAGGGGAACAAGGTTGCGCATGTTCTGGCCCGCCTGGCCTTTTCATATGTTGATCGTGTATGGCTTGAGGAG
TGGCCTAGCGAGGTCTCGGATGTCCTGAGGGGTGATTTTGCTTCAGTTGCGTATATTTCGCCCTTCTGGGTAGGATTTAGCTTTTGCCATGCAAGGGCGTTATCTATGAG
GGTTCCCAATTCCAGAGCCCTTCAGTGCAAAAATGTCAGCTCCTATGGTTTCTTGTTGGGAAGCTCAAGGTGCTGGAGGTGTGTCGTGGCAGATGGGCCTGTGGATAGGA
TGGACCAGATAATGCGCCATCCTGCCATATTGTGCCATCAAGATTCCAGGTAG
mRNA sequence
ATGGAGCTTTTTAACCAAGCCCTGCTGGCTAAACAGTGTTGGCGTGTTCTCCAGGATCCTTCCTCCCTTCTAGGCTGTGTGCTCAAGGGCCGCTATTTTCCCCAGTCGGG
TTTCTTGGAGGCAGGTATCGGTTCACGTCCGTCTTTCGTCTGGCGTAGCTTGTTATGGGGGCGGGAGCTCTTAGTTCGTGGATGTCGTTGGAGGATTGGTAATGGGCGTG
CTACGCCCATATATGGCTCAAACTGGCTGCCGAATGAGTTCTCGCTTCAAATACAGTCGGCTCCAGTACTTTCTCCTACTAGTACGGTGAGTGAGTTGTTCACTGCGTCT
GGTGGATGGGATGTGGCTTTACTCAGGACGATTTTCAATGGGGCTGATTGTGAGGCTATTTTGAGAATTCCTCTACAACAGGGCTCGGGGGAGGACCGCTTAATCTGGCA
CTTTGAGAAGCATGGGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCATACATTGGCTATTCAGGACCGACCTGGTTCCTCGAATTCCGAGAGAGTGCGCATGTGGT
GGTCCGGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCTGGCGTCTGTGCCACGACCGCTTGCCAACTAAGGTAAACCTTCTCAAACGTGGACTCACT
GTATCCCTTTGTGTGTTTTGTGCGATGATGATACAGAAGATTGCCTCCATCTGTTCTGGACCTGCCCTGTGGTTCGAGGAAATCATTGGGGCGATGAGGGACAAACTGAC
AGGGCCGGATTTTGAGCTTGTGGTGATTTTTTGGTGGTCTGTGTGGAGCCTACGAAATAACCTGTTTTGGGGTGGGCAGTCAGATGGTCGGGATCTCTGGGCATATTCGA
GTGATTACCTCCATGCCTTCCATGTTGGTGGGGGACGTTGCGGGGCAAGGGACTCCTTATGGGCTCAATCGGGAGAGCAAGAAGAGCGCGGTGTATGGAGACCGCCCCCT
AATAGGGTGCTGAAACTTAATATTGATGCTTCAAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAAAGGGATCCAACTTGCTCGACAGTTGGGGTTTGT
GGATTTTGTGGTGGAGACTGACTCTCTAAGACTGGTCAAAATTTTGAATGGGGAGCTGCACGATGTGTCGGAAGTGGGGCTGCTGATGGATGACATTCGACGGATCCTCA
GTCCTTGGGTCAACGGTAAGGTGTTGTTTACTCCACGTCAGGGGAACAAGGTTGCGCATGTTCTGGCCCGCCTGGCCTTTTCATATGTTGATCGTGTATGGCTTGAGGAG
TGGCCTAGCGAGGTCTCGGATGTCCTGAGGGGTGATTTTGCTTCAGTTGCGTATATTTCGCCCTTCTGGGTAGGATTTAGCTTTTGCCATGCAAGGGCGTTATCTATGAG
GGTTCCCAATTCCAGAGCCCTTCAGTGCAAAAATGTCAGCTCCTATGGTTTCTTGTTGGGAAGCTCAAGGTGCTGGAGGTGTGTCGTGGCAGATGGGCCTGTGGATAGGA
TGGACCAGATAATGCGCCATCCTGCCATATTGTGCCATCAAGATTCCAGGTAG
Protein sequence
MELFNQALLAKQCWRVLQDPSSLLGCVLKGRYFPQSGFLEAGIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSPTSTVSELFTAS
GGWDVALLRTIFNGADCEAILRIPLQQGSGEDRLIWHFEKHGNFSVKSGYRLAHTLAIQDRPGSSNSERVRMWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLT
VSLCVFCAMMIQKIASICSGPALWFEEIIGAMRDKLTGPDFELVVIFWWSVWSLRNNLFWGGQSDGRDLWAYSSDYLHAFHVGGGRCGARDSLWAQSGEQEERGVWRPPP
NRVLKLNIDASRCWSVDLAEGWAVYKGIQLARQLGFVDFVVETDSLRLVKILNGELHDVSEVGLLMDDIRRILSPWVNGKVLFTPRQGNKVAHVLARLAFSYVDRVWLEE
WPSEVSDVLRGDFASVAYISPFWVGFSFCHARALSMRVPNSRALQCKNVSSYGFLLGSSRCWRCVVADGPVDRMDQIMRHPAILCHQDSR
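
The protein sequence above can be checked against the CDS by translating it. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene; the function name `translate` is hypothetical, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, dropping any trailing stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate line breaks in pasted sequence
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# First 30 and last 15 nucleotides of the CDS sequence listed above.
print(translate("ATGGAGCTTTTTAACCAAGCCCTGCTGGCT"))  # MELFNQALLA
print(translate("CAAGATTCCAGGTAG"))                 # QDSR
```

Both fragments agree with the start (`MELFNQALLA`) and end (`QDSR`) of the listed protein sequence. Codons containing characters other than A, C, G, or T would raise a `KeyError` in this sketch.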