; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012867 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012867
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold1:19503090..19513000
RNA-Seq ExpressionSpg012867
SyntenySpg012867
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]7.3e-0552Show/hide
Query:  WKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV
        W+QRSR  WL+EGD+NT +FH RAS R K NR+ G+ D    WQ E+  +
Subjt:  WKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV

ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]6.1e-6030.84Show/hide
Query:  SGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPL
        S FL A IGS PS++WRS+LWGR+++  G RWRIGNG+   I+ +NW+P  F+ +    P + S + V EL      W+E L+   F+  D + I +IPL
Subjt:  SGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPL

Query:  RHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
             ED LIWHF K G ++VKSGY+ A  +     P SS S   +  W+ +W L +P K R F+WR   + LP+  NL KR +   P C LC    E+ 
Subjt:  RHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKF-------------ALLH--------QSFSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDC------SRDQSRDQ
         H    C   K +W  S F             +LLH              ++    + + R+ W +     +   V  +  A  +       S D S  +
Subjt:  LHLFWTCPVVKSMWLGSKF-------------ALLH--------QSFSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDC------SRDQSRDQ

Query:  EERCV---WRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELH
        +++     W PP    +K+NTDA+   +   AG G V+R  +G+V   A    +   SV  AE  A+  G+Q+A+     D ++E+DS  +V +++    
Subjt:  EERCV---WRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELH

Query:  DVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDR-VWLEEWPSEVSDVLRGDV
          SE+  ++ +IQ++   +D+   ++T R  N +AH L ++A    +  VW   +P +V      D+
Subjt:  DVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDR-VWLEEWPSEVSDVLRGDV

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.5e-7936.68Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFT-ASGGWNEALLRTIFNGADCEAILRIPLR
        F+EA I   PS++WRS+LWGR+LL +G RWRIGNG +  IYG NW+PN+ +L+I S+P L   S V  L     GGW   ++R  F   + + IL IP+ 
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFT-ASGGWNEALLRTIFNGADCEAILRIPLR

Query:  HGSGEDRLIWHFEKHGNFSVKSGYRLA-HTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
         G+ EDRLIW++EK G +SV+SGY++A         P SS+SE VR WW+G W++++PNK + FLWRLC DRLPT  NL KRG+ ++  C  C  + ED 
Subjt:  HGSGEDRLIWHFEKHGNFSVKSGYRLA-HTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFALL---------HQS-----FSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQEE-----------
        +HLFW C   +++W+ SKF  L         H+S     F  L +   G  + R+  A++    + F +G       +    + R+ +            
Subjt:  LHLFWTCPVVKSMWLGSKFALL---------HQS-----FSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQEE-----------

Query:  RCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEV
          +W+PP     K+NTDAS       AG G ++    G+V  AA   L+   SVD+AE  A   G+QLA ++G                +H  L D+SE 
Subjt:  RCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEV

Query:  GLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL
        G ++   +   +   +    F  R+GNK AH+LAR A    +  +W+E+WP E+   L
Subjt:  GLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]5.2e-1945.69Show/hide
Query:  SGNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEK---TA
        +GNF  R++ A   +QSAI DL  + +R+   QA   + ++L+EEE++W+QRSR+LW + GDRNT+WFH +AS+R++ N I GL D QG W++ K     
Subjt:  SGNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEK---TA

Query:  VIQSGFLEADIGSRPS
        +I+S F E    SRPS
Subjt:  VIQSGFLEADIGSRPS

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]3.2e-6935.79Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH
        FLEA++G+ PSF+WRSL WG+ELL +G RWR+GNG +  +Y   WLP     +I S P L  ++ V +LFT+SG WN  LL+ IF   + +A L+IPL  
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH

Query:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRV-----WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADA
         +G D LIWH+E++G +SVKSGYR    LA  ++   S    VRV     +W  +W L +PNK +FFLWR   D LP    L  R ++ +P+C  C   A
Subjt:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRV-----WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADA

Query:  EDCLHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGRD---LWAYSSDYLSAFHVGGRRCATGDCSR------------------
        E  LH  W C   K +W  S +             + +  L LS  G+  G      W   +   S    G    AT    R                  
Subjt:  EDCLHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGRD---LWAYSSDYLSAFHVGGRRCATGDCSR------------------

Query:  --DQSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKI
           QS  Q     WRPPP    K+N D +V+      G G V+R A+GE FMAAC+  +Q  +     E  A   G++ A  +GF   V+E D+   +  
Subjt:  --DQSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKI

Query:  LHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS
        +          GLL++++  +L  +      +TPR GNKVAH LA+ AF   + V W+EE P  +  VL  DV+S
Subjt:  LHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]9.4e-6933.55Show/hide
Query:  SGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPL
        +GF+ A +GS+PSFVWRS++WGR++L +G RWRIGNG+   +YG+NW+P   + +  SAP + + +TV EL      W E L+   F   D EAI++IPL
Subjt:  SGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPL

Query:  RHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
             ED+LIWH++K G +SVKSGY++A  +   + P  SN +  +  W  +W+L +P K + FLWR  HD LPT  NL K+ +   P+C  C    E  
Subjt:  RHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFALLHQSFSHLSLSW-------------GGQP--------DGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQ------
         H    C   + +W  S  A   +      + W             G +           R+ W +     +   V     A  +  +   + +      
Subjt:  LHLFWTCPVVKSMWLGSKFALLHQSFSHLSLSW-------------GGQP--------DGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQ------

Query:  ---EERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELH
           E +  W PPPN   K+N DA+V  +   AG G V+R +DG    AA  SL+   SV +AE  A+  G+++A +      + E+DSL ++ +++ +  
Subjt:  ---EERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELH

Query:  DVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEV
         ++E+G L+ DIQ  L  + N K   +PR  N  AH LA+LA    + V WL+E P E+
Subjt:  DVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEV

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]3.7e-0953.12Show/hide
Query:  EAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV
        E Q+E++L +EEVYWKQRSR  WL+EGD+NT++FH +AS R++ NRI G+ D   VW  ++  V
Subjt:  EAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]2.6e-6634.75Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH
        FLEA++G+ PSF+WRSL WG+ELL +G RWR+G+G +  +Y   WLP     +I S P L  ++ V +LFT+SG WN  LL+ IF   + +AIL+IPL  
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH

Query:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPG--SSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
         +G D LIWH+E++G +SVKSGYRLA  L      G  S+  ++   +W  +W L +PNK +FFLWR   D LP    L  R ++ +P+C  C   AE  
Subjt:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPG--SSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGR------DLWAYSSDYL------SAFHVGGRRCATGDCSRD-----------
        LH  W C   K +W  S +             + +  L LS  G+  G        LW   + ++      +A  +  R         D           
Subjt:  LHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGR------DLWAYSSDYL------SAFHVGGRRCATGDCSRD-----------

Query:  QSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHG
        QS  Q     WRPPP          +V+      G G V+R A+GE FMAAC+  +   +     E  A   G++ A  +GF D ++E D+   +  +  
Subjt:  QSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHG

Query:  ELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS
                G L++++  +L+ +      +TPR GNKVAH LA+ AF   + V W+EE PS +  VL  DV+S
Subjt:  ELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein3.5e-0552Show/hide
Query:  WKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV
        W+QRSR  WL+EGD+NT +FH RAS R K NR+ G+ D    WQ E+  +
Subjt:  WKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV

A0A251NPF0 Reverse transcriptase domain-containing protein1.2e-6432.98Show/hide
Query:  LEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAP-VLSSASTVRELFTAS-GGWNEALLRTIFNGADCEAILRIPLR
        LEA   +R SF WRS++  ++L++ G  WR+GNG+  PI+ S WL  E   +I + P      S+V EL   +   W+   +RTIF   D +AIL+IPL 
Subjt:  LEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAP-VLSSASTVRELFTAS-GGWNEALLRTIFNGADCEAILRIPLR

Query:  HGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVW-----WSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDAD
          S  D+LIWH  K+G ++V+SGY +         PGSS     R+W     W  +W L VP K R FLWR CHD LP+K+ L++R + VSP C  C + 
Subjt:  HGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVW-----WSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDAD

Query:  AEDCLHLFWTCPVVKSMW-----LGSKFALLHQSFSHLSLSWGGQPDGRDLWAYSSDYLSAFH----------------VGGRRCA------TGDCSRDQ
         EDCLH  W CP +  +W     L     ++H SF+ L  + G       L  ++      +H                +G R  A      +      +
Subjt:  AEDCLHLFWTCPVVKSMW-----LGSKFALLHQSFSHLSLSWGGQPDGRDLWAYSSDYLSAFH----------------VGGRRCA------TGDCSRDQ

Query:  SRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGEL
        ++  + R +WRPP   + K+N D ++  D  + G G V+R   G V       +    S +L E  A  R +Q AR++G +D + E DS  +++ L  + 
Subjt:  SRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGEL

Query:  HDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVLRGDVVS
           +  GL+++D + +L+ +       T R GN VAH LAR A    D  VW+E+ P ++  +L  D VS
Subjt:  HDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVLRGDVVS

A0A2N9ESZ1 Uncharacterized protein5.1e-0430.68Show/hide
Query:  GNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQG
        G+  ++++     ++ A  + +     D ++    ++  +L +EE  W+QRSR LWL++GD+NT++FH RA++R++ N +  L D  G
Subjt:  GNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQG

A0A6J1DAR4 uncharacterized protein LOC1110189547.5e-8036.68Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFT-ASGGWNEALLRTIFNGADCEAILRIPLR
        F+EA I   PS++WRS+LWGR+LL +G RWRIGNG +  IYG NW+PN+ +L+I S+P L   S V  L     GGW   ++R  F   + + IL IP+ 
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFT-ASGGWNEALLRTIFNGADCEAILRIPLR

Query:  HGSGEDRLIWHFEKHGNFSVKSGYRLA-HTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
         G+ EDRLIW++EK G +SV+SGY++A         P SS+SE VR WW+G W++++PNK + FLWRLC DRLPT  NL KRG+ ++  C  C  + ED 
Subjt:  HGSGEDRLIWHFEKHGNFSVKSGYRLA-HTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFALL---------HQS-----FSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQEE-----------
        +HLFW C   +++W+ SKF  L         H+S     F  L +   G  + R+  A++    + F +G       +    + R+ +            
Subjt:  LHLFWTCPVVKSMWLGSKFALL---------HQS-----FSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQEE-----------

Query:  RCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEV
          +W+PP     K+NTDAS       AG G ++    G+V  AA   L+   SVD+AE  A   G+QLA ++G                +H  L D+SE 
Subjt:  RCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEV

Query:  GLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL
        G ++   +   +   +    F  R+GNK AH+LAR A    +  +W+E+WP E+   L
Subjt:  GLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD-RVWLEEWPSEVSDVL

A0A6J1DAR4 uncharacterized protein LOC1110189542.5e-1945.69Show/hide
Query:  SGNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEK---TA
        +GNF  R++ A   +QSAI DL  + +R+   QA   + ++L+EEE++W+QRSR+LW + GDRNT+WFH +AS+R++ N I GL D QG W++ K     
Subjt:  SGNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEK---TA

Query:  VIQSGFLEADIGSRPS
        +I+S F E    SRPS
Subjt:  VIQSGFLEADIGSRPS

A0A6J1DAR4 uncharacterized protein LOC1110189541.6e-6935.79Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH
        FLEA++G+ PSF+WRSL WG+ELL +G RWR+GNG +  +Y   WLP     +I S P L  ++ V +LFT+SG WN  LL+ IF   + +A L+IPL  
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH

Query:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRV-----WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADA
         +G D LIWH+E++G +SVKSGYR    LA  ++   S    VRV     +W  +W L +PNK +FFLWR   D LP    L  R ++ +P+C  C   A
Subjt:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRV-----WWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADA

Query:  EDCLHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGRD---LWAYSSDYLSAFHVGGRRCATGDCSR------------------
        E  LH  W C   K +W  S +             + +  L LS  G+  G      W   +   S    G    AT    R                  
Subjt:  EDCLHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGRD---LWAYSSDYLSAFHVGGRRCATGDCSR------------------

Query:  --DQSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKI
           QS  Q     WRPPP    K+N D +V+      G G V+R A+GE FMAAC+  +Q  +     E  A   G++ A  +GF   V+E D+   +  
Subjt:  --DQSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKI

Query:  LHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS
        +          GLL++++  +L  +      +TPR GNKVAH LA+ AF   + V W+EE P  +  VL  DV+S
Subjt:  LHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)1.2e-6634.75Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH
        FLEA++G+ PSF+WRSL WG+ELL +G RWR+G+G +  +Y   WLP     +I S P L  ++ V +LFT+SG WN  LL+ IF   + +AIL+IPL  
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH

Query:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPG--SSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
         +G D LIWH+E++G +SVKSGYRLA  L      G  S+  ++   +W  +W L +PNK +FFLWR   D LP    L  R ++ +P+C  C   AE  
Subjt:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPG--SSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGR------DLWAYSSDYL------SAFHVGGRRCATGDCSRD-----------
        LH  W C   K +W  S +             + +  L LS  G+  G        LW   + ++      +A  +  R         D           
Subjt:  LHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGR------DLWAYSSDYL------SAFHVGGRRCATGDCSRD-----------

Query:  QSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHG
        QS  Q     WRPPP          +V+      G G V+R A+GE FMAAC+  +   +     E  A   G++ A  +GF D ++E D+   +  +  
Subjt:  QSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHG

Query:  ELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS
                G L++++  +L+ +      +TPR GNKVAH LA+ AF   + V W+EE PS +  VL  DV+S
Subjt:  ELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)2.0e-0840.24Show/hide
Query:  SAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV
        +A+    T+D   L  + E  + ++L+++E+ W+QRSR  WL+EGD+NT +FH RAS R K NR+ G+ D    WQ E+  +
Subjt:  SAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIGGLEDVQGVWQQEKTAV

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)1.2e-6634.75Show/hide
Query:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH
        FLEA++G+ PSF+WRSL WG+ELL +G RWR+G+G +  +Y   WLP     +I S P L  ++ V +LFT+SG WN  LL+ IF   + +AIL+IPL  
Subjt:  FLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADCEAILRIPLRH

Query:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPG--SSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
         +G D LIWH+E++G +SVKSGYRLA  L      G  S+  ++   +W  +W L +PNK +FFLWR   D LP    L  R ++ +P+C  C   AE  
Subjt:  GSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPG--SSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGR------DLWAYSSDYL------SAFHVGGRRCATGDCSRD-----------
        LH  W C   K +W  S +             + +  L LS  G+  G        LW   + ++      +A  +  R         D           
Subjt:  LHLFWTCPVVKSMWLGSKFA---------LLHQSFSHLSLSWGGQPDGR------DLWAYSSDYL------SAFHVGGRRCATGDCSRD-----------

Query:  QSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHG
        QS  Q     WRPPP          +V+      G G V+R A+GE FMAAC+  +   +     E  A   G++ A  +GF D ++E D+   +  +  
Subjt:  QSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACL-SLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHG

Query:  ELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS
                G L++++  +L+ +      +TPR GNKVAH LA+ AF   + V W+EE PS +  VL  DV+S
Subjt:  ELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRV-WLEEWPSEVSDVLRGDVVS

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.0e-2524.25Show/hide
Query:  SFVWRSLLWG-RELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTV--RELFTASGGWNEALLRTIFNGADCEAILRIPLRHGSG-EDR
        S  WRS+  G R+++  G  W  G+G+    +   W+  +  L++ +    +   TV  ++L+    GW+ A +           +  + L   +G  DR
Subjt:  SFVWRSLLWG-RELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTV--RELFTASGGWNEALLRTIFNGADCEAILRIPLRHGSG-EDR

Query:  LIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDCLHLFWTCP
        L W F + G FSV+S Y +  T+    RP  ++      +++ LW++ VP + + FLW + +  + T+    +R LS S +C +C    E  LH+   CP
Subjt:  LIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDCLHLFWTCP

Query:  VVKSMWLGSKFALLHQSFSHLSL-SW-----GGQPDGRDL-----------WAYS----------------SDYLSAFHVGGRRCATGDCSRDQSRDQEE
            +W+        Q F   SL  W     G +    D+           W +                   ++  + V   R  +G+     ++ + E
Subjt:  VVKSMWLGSKFALLHQSFSHLSL-SW-----GGQPDGRDL-----------WAYS----------------SDYLSAFHVGGRRCATGDCSRDQSRDQEE

Query:  RCV-WRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSE
        R + W  P    +K+NTD + R + G A  G VLR   G       L++ RC S   AE W VY G+  A +       +E DS  +V  L   + D   
Subjt:  RCV-WRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSE

Query:  VGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWLEEWPSEVSDVLRGDVVSA
        +  L+      L      +++   R+ N++A  LA  AFS  +     +  P  +S +LR D + +
Subjt:  VGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSY-VDRVWLEEWPSEVSDVLRGDVVSA

P93295 Uncharacterized mitochondrial protein AtMg003109.0e-0642.59Show/hide
Query:  SGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL
        S  +E  +G+RPS+ WRS++ GRELL RG    IG+G  T ++   W+ +E  L
Subjt:  SGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSL

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.8e-1027.78Show/hide
Query:  GADCEAILRIPLRHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSP
        G   +A+  I  +H   +D  IW  + H   ++ S  + +  L  Q+         +  W+  +W  N   KH F  W +  +RL T+  L   GLS+  
Subjt:  GADCEAILRIPLRHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSP

Query:  LCVLCDADAEDCLHLFWTCPVVKSMW
        +C+LC++  E   HLF+ CP   ++W
Subjt:  LCVLCDADAEDCLHLFWTCPVVKSMW

AT2G02650.1 Ribonuclease H-like superfamily protein8.6e-1223.17Show/hide
Query:  VKSGYRLA------HTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDCLHLFWTCPVVKSMW
        ++SGY +A         A Q  PGS+  +        +W+L+V  K + FLWR     L T   L  R +   P+C  C  + E   H+ + CP  +S+W
Subjt:  VKSGYRLA------HTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDCLHLFWTCPVVKSMW

Query:  LGSKFALLHQ-----SFSH-----LSLSWGGQPDGRD----------LW-----------AYSSDYLSAFHVGGRRCATGDCSRDQSRD-----------
          +   + +Q     SF       + LS     +  D          LW             S DY +     G + AT   + +++ +           
Subjt:  LGSKFALLHQ-----SFSH-----LSLSWGGQPDGRD----------LW-----------AYSSDYLSAFHVGGRRCATGDCSRDQSRD-----------

Query:  ---QEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKIL-HGE
           + +   W PPP   +K N D+     +     G  +R  +G + +     LQ       AE       +Q+    G      E+DS  LV ++ +GE
Subjt:  ---QEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKIL-HGE

Query:  LHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLA
         H  S +G L+ DI+  +       + F  R+ N  A  LA
Subjt:  LHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLA

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein7.3e-1131.39Show/hide
Query:  WRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEVGLL
        W+ PP + +K NTDA+ + +    G G +LR   G V      +L R  +V  AE  A+   V    +  +   + E+D+  LV +L+ +      +   
Subjt:  WRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEVGLL

Query:  MDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSY
        ++DIQ++L  ++  K  FTPR GNKVA  +AR + S+
Subjt:  MDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSY

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-2623.9Show/hide
Query:  LEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGG---WNEALLRTIFNGADCEAILRIPL
        L+A +  + S+ W SLL G  LL +G R  IG+G+   I   N + +     + +        T+  LF   G    W+++ +    + +D   I RI L
Subjt:  LEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGG---WNEALLRTIFNGADCEAILRIPL

Query:  RHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC
              D++IW++   G ++V+SGY L     + + P  +         + +W L +  K + FLWR     L T   L  RG+ + P C  C  + E  
Subjt:  RHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDC

Query:  LHLFWTCPVVKSMWLGSKFALLHQSFSHLSLSWGGQPDGRDLWAYSSD-YLSAFH------------------------------VGGRRCATGD-CSRD
         H  +TCP     W  S  +L+        +S   + +  ++  +  D  +S FH                              V   +  T D  +  
Subjt:  LHLFWTCPVVKSMWLGSKFALLHQSFSHLSLSWGGQPDGRDLWAYSSD-YLSAFH------------------------------VGGRRCATGD-CSRD

Query:  QSRDQ---------EERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSL
        QS  +         E +  WR PP   +K N DA       EA GG ++R   G       + L    +   AE  A+   +Q     G+    +E D  
Subjt:  QSRDQ---------EERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSL

Query:  RLVKILHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSY
         L+ +++G +   S +   ++DI    + + + +  F  R+GNK+AHVLA+   +Y
Subjt:  RLVKILHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSY

AT4G29090.1 Ribonuclease H-like superfamily protein2.9e-3627.68Show/hide
Query:  LEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWL---PNEFSLQIQSAP-----VLSSASTVRELFTASG-GWNEALLRTIFNGADCEA
        L A +GSRPSFVW+S+   +E+L +G R  +GNG    I+   WL   P   +L++Q  P      +SS   V +L   SG  W + ++  +F   + E 
Subjt:  LEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWL---PNEFSLQIQSAP-----VLSSASTVRELFTASG-GWNEALLRTIFNGADCEA

Query:  ILRIPLRHGSGE--DRLIWHFEKHGNFSVKSGY-RLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCV
         L   LR G     D   W +   G+++VKSGY  L   +  +  P   +   +   +  +W+     K + FLW+   + LP    L  R LS    C+
Subjt:  ILRIPLRHGSGE--DRLIWHFEKHGNFSVKSGY-RLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCV

Query:  LCDADAEDCLHLFWTCPVVKSMW--------LGSKFALLHQSFSHLSLSW-GGQPDGRDLWAYSSDYL-----------SAFHVGGR--------RCATG
         C +  E   HL + C   +  W        LG ++A       +++L W     +G   W  +S  +           +     GR        R A  
Subjt:  LCDADAEDCLHLFWTCPVVKSMW--------LGSKFALLHQSFSHLSLSW-GGQPDGRDLWAYSSDYL-----------SAFHVGGR--------RCATG

Query:  DCSRDQSRDQEERC------------VWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVD
        D    + R + E C             WRPPP++ +K NTDA+   D    G G VLR   GEV      +L +  SV  AE  A+   V    +  +  
Subjt:  DCSRDQSRDQEERC------------VWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVD

Query:  FVVETDSLRLVKILHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD
         + E+DS  L++IL+ +      +   + D+QR+LS +   K +F PR+GN +A  +AR + S+++
Subjt:  FVVETDSLRLVKILHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGAAGAATAAGTCCTCAAATGTTGAAATTGCAGCTTTCCCAGGGAAAAATACACTTTTGGTCCCTGAGTCACCAACCCGTCGGCCACCTCTGCTGCCGTCGCGCTT
CGTCGCCGACCGACATGCCGCTACCGCCGGAAATTTTTTTCCGCCACCGCCACCGCCGCGCTTCGTTCTCCTTGCTCGTCTGCCCGAACATCAAAAGACCGACGTTCTTT
CTCTCTCTCTTGGTCACGAACTCGAGCCTGAAGCAGATCAGCAACTTCGCCGTCGTTCGCCTGAAGCCGTCGCGTCGAGTCTAAAGCATTGTCGGAGATCGCCGCCAGAG
GTTTTCGGAGATCTTGTGGGTCACTTCTCCGGCAGCCGAGTTTCAGAGATCTTGTGGTTTTCGAAGATCCGAGTTTCTGGAACGTCCGGGAACTTCCCGAGGCGCATCCG
TAGTGCGAATCAGAGGGTTCAATCGGCCATCGCTGATCTTAGTACATCGGACTCTCGTGACTTGCTTGTTCAGGCTGAGGCTCAGTTGGAGGAGGTGTTACAGGAGGAGG
AGGTATACTGGAAACAGAGGTCCAGGGAGTTATGGCTTCGAGAAGGGGATCGCAACACTCGGTGGTTCCATTGTCGAGCCTCTTACCGCCAGAAGCTTAATCGCATTGGA
GGGTTGGAGGATGTTCAGGGAGTGTGGCAACAGGAGAAGACTGCAGTTATTCAGTCGGGTTTCTTGGAGGCAGATATTGGGTCACGTCCGTCTTTCGTCTGGCGCAGTTT
GTTATGGGGGCGGGAGCTCTTAGTTCGTGGATGCCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCGAACTGGCTGCCGAATGAGTTCTCACTTCAAA
TACAGTCGGCTCCAGTGCTTTCCTCTGCTAGTACGGTGAGGGAGTTGTTCACTGCGTCTGGGGGATGGAATGAGGCTTTGCTCAGAACGATTTTCAATGGGGCTGATTGT
GAGGCTATTTTGAGAATTCCTCTGCGACATGGCTCGGGGGAGGATCGCTTAATCTGGCACTTTGAGAAGCATGGGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCA
TACACTGGCTACCCAGGACCGGCCTGGCTCCTCGAACTCCGAGATAGTGCGCGTGTGGTGGTCCGGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCT
GGCGTCTGTGCCACGACCGCTTGCCAACGAAGGTAAACCTTCTCAAACGTGGACTCAGTGTATCCCCTTTGTGTGTTTTGTGTGATGCTGATGCAGAGGATTGTCTCCAT
CTGTTCTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCTAAATTTGCTCTCCTCCACCAATCCTTTTCCCATCTCAGCCTGAGTTGGGGTGGGCAGCCAGACGG
CCGAGATCTCTGGGCATACTCGAGTGATTACCTCAGTGCCTTCCATGTGGGTGGGAGGCGTTGCGCAACAGGGGACTGCTCACGGGATCAATCGAGAGATCAGGAAGAGC
GTTGTGTATGGAGACCGCCCCCTAATAGGGAGCTGAAACTTAATACCGATGCTTCTGTGAGGCCGGATACAGGGGAAGCGGGGGGTGGCTGTGTGCTGCGTGGGGCTGAT
GGTGAGGTCTTCATGGCAGCTTGTTTGAGTTTACAGAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAGAGGGGTCCAACTTGCTCGACAGTTGGGGTT
TGTAGATTTTGTGGTGGAGACTGACTCTCTAAGGCTGGTCAAAATCCTGCATGGTGAGCTGCATGATGTGTCGGAAGTGGGGTTGCTGATGGACGACATTCAAAGGATCC
TCAGTCCTTGGGACAACGGTAAGGTTTTGTTCACTCCGCGTCAGGGAAACAAGGTTGCGCATGTTCTGGCTCGCTTGGCCTTTTCATACGTTGATCGTGTCTGGCTTGAG
GAGTGGCCTAGCGAGGTCTCGGACGTTTTGAGGGGTGATGTTGTTTCAGCTGCTGCGAAAGCTAGTACCCTGCCTCTGAGTGGAAAGAAAATGGGGGAAGATGGATGGTG
GGTGCTTATAAATCTTGTCAGGGCTTCTAGGGCTGCTCTAGATTTAGCATTCGCCATGCAGGGTGTTATCTATGAGGGTTCCCAATTCCAGAGTCCTTCAGGGCAGAGTA
TGCTGCTTAGGCCTACTTGGGGTTTCCATGTGGGGTGCTTGTGTATGATTGCTTTCGTTAGTATGCCAGCTCCTATGGTTTCTTGCTGGGAAGCTCAAGGTGCTGGAGGC
GTGATTTGGCGGCTCAGGGATGTGCACGCTTTGCGAATATGGGGCGACGAGTGTATGAATGCTTTTGCTAAAATGTCGGTCGTGATCACCCTACCTCAAAAGACCAAATC
CTCGCCAAAATTGTGCACACCCGCGATCCGCAAAGCACAGATTCGACGTGAGCACAATCGCAAAGACGAAGATGGTCGCGAAGAGGAAGAAAACCCACGCGAACAGATCT
GGGATCGTCCTCGTGTCGTGGGTCTGAATCGCTGCACGTTATCGAAATCGAAGGGGAGCGGTGACGCGAGGATAGAGCTTCATGCGGTTCGTTGGAGAAGGAGGTGCTCG
CTGGAGAGAGGAAGCTTGCGTGGGTCTTCTCTTCGCGCACGAGATGTCGCCGGAGTGAGGAAGATTGCGTGTGGTTTGGAGGGAGGAAGATTGCGTCGAAAGGGAGGGCT
GAACAAAGGCAGGAAAAACGAGGAAGAAAATAAGAGAGAAAGGAGAAGGAAAAAAAAAAGGTCGTCGACAGTCGGCCGACAGCGGCAGGTGTGCTGGTGGCGCACGCCTA
GGTGCAAGGCTCAACGTTGGAGCCTCGCTTCGAAAAAGCGAGGCGCCATGAAGAAGGCGCATGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGGAAGAATAAGTCCTCAAATGTTGAAATTGCAGCTTTCCCAGGGAAAAATACACTTTTGGTCCCTGAGTCACCAACCCGTCGGCCACCTCTGCTGCCGTCGCGCTT
CGTCGCCGACCGACATGCCGCTACCGCCGGAAATTTTTTTCCGCCACCGCCACCGCCGCGCTTCGTTCTCCTTGCTCGTCTGCCCGAACATCAAAAGACCGACGTTCTTT
CTCTCTCTCTTGGTCACGAACTCGAGCCTGAAGCAGATCAGCAACTTCGCCGTCGTTCGCCTGAAGCCGTCGCGTCGAGTCTAAAGCATTGTCGGAGATCGCCGCCAGAG
GTTTTCGGAGATCTTGTGGGTCACTTCTCCGGCAGCCGAGTTTCAGAGATCTTGTGGTTTTCGAAGATCCGAGTTTCTGGAACGTCCGGGAACTTCCCGAGGCGCATCCG
TAGTGCGAATCAGAGGGTTCAATCGGCCATCGCTGATCTTAGTACATCGGACTCTCGTGACTTGCTTGTTCAGGCTGAGGCTCAGTTGGAGGAGGTGTTACAGGAGGAGG
AGGTATACTGGAAACAGAGGTCCAGGGAGTTATGGCTTCGAGAAGGGGATCGCAACACTCGGTGGTTCCATTGTCGAGCCTCTTACCGCCAGAAGCTTAATCGCATTGGA
GGGTTGGAGGATGTTCAGGGAGTGTGGCAACAGGAGAAGACTGCAGTTATTCAGTCGGGTTTCTTGGAGGCAGATATTGGGTCACGTCCGTCTTTCGTCTGGCGCAGTTT
GTTATGGGGGCGGGAGCTCTTAGTTCGTGGATGCCGTTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCGAACTGGCTGCCGAATGAGTTCTCACTTCAAA
TACAGTCGGCTCCAGTGCTTTCCTCTGCTAGTACGGTGAGGGAGTTGTTCACTGCGTCTGGGGGATGGAATGAGGCTTTGCTCAGAACGATTTTCAATGGGGCTGATTGT
GAGGCTATTTTGAGAATTCCTCTGCGACATGGCTCGGGGGAGGATCGCTTAATCTGGCACTTTGAGAAGCATGGGAATTTTTCGGTGAAGAGTGGGTATCGGCTTGCTCA
TACACTGGCTACCCAGGACCGGCCTGGCTCCTCGAACTCCGAGATAGTGCGCGTGTGGTGGTCCGGCCTCTGGAGGTTGAATGTGCCCAATAAGCATAGGTTCTTCCTCT
GGCGTCTGTGCCACGACCGCTTGCCAACGAAGGTAAACCTTCTCAAACGTGGACTCAGTGTATCCCCTTTGTGTGTTTTGTGTGATGCTGATGCAGAGGATTGTCTCCAT
CTGTTCTGGACCTGCCCTGTGGTTAAGAGTATGTGGTTGGGCTCTAAATTTGCTCTCCTCCACCAATCCTTTTCCCATCTCAGCCTGAGTTGGGGTGGGCAGCCAGACGG
CCGAGATCTCTGGGCATACTCGAGTGATTACCTCAGTGCCTTCCATGTGGGTGGGAGGCGTTGCGCAACAGGGGACTGCTCACGGGATCAATCGAGAGATCAGGAAGAGC
GTTGTGTATGGAGACCGCCCCCTAATAGGGAGCTGAAACTTAATACCGATGCTTCTGTGAGGCCGGATACAGGGGAAGCGGGGGGTGGCTGTGTGCTGCGTGGGGCTGAT
GGTGAGGTCTTCATGGCAGCTTGTTTGAGTTTACAGAGGTGTTGGAGCGTGGATTTGGCTGAGGGTTGGGCTGTGTATAGAGGGGTCCAACTTGCTCGACAGTTGGGGTT
TGTAGATTTTGTGGTGGAGACTGACTCTCTAAGGCTGGTCAAAATCCTGCATGGTGAGCTGCATGATGTGTCGGAAGTGGGGTTGCTGATGGACGACATTCAAAGGATCC
TCAGTCCTTGGGACAACGGTAAGGTTTTGTTCACTCCGCGTCAGGGAAACAAGGTTGCGCATGTTCTGGCTCGCTTGGCCTTTTCATACGTTGATCGTGTCTGGCTTGAG
GAGTGGCCTAGCGAGGTCTCGGACGTTTTGAGGGGTGATGTTGTTTCAGCTGCTGCGAAAGCTAGTACCCTGCCTCTGAGTGGAAAGAAAATGGGGGAAGATGGATGGTG
GGTGCTTATAAATCTTGTCAGGGCTTCTAGGGCTGCTCTAGATTTAGCATTCGCCATGCAGGGTGTTATCTATGAGGGTTCCCAATTCCAGAGTCCTTCAGGGCAGAGTA
TGCTGCTTAGGCCTACTTGGGGTTTCCATGTGGGGTGCTTGTGTATGATTGCTTTCGTTAGTATGCCAGCTCCTATGGTTTCTTGCTGGGAAGCTCAAGGTGCTGGAGGC
GTGATTTGGCGGCTCAGGGATGTGCACGCTTTGCGAATATGGGGCGACGAGTGTATGAATGCTTTTGCTAAAATGTCGGTCGTGATCACCCTACCTCAAAAGACCAAATC
CTCGCCAAAATTGTGCACACCCGCGATCCGCAAAGCACAGATTCGACGTGAGCACAATCGCAAAGACGAAGATGGTCGCGAAGAGGAAGAAAACCCACGCGAACAGATCT
GGGATCGTCCTCGTGTCGTGGGTCTGAATCGCTGCACGTTATCGAAATCGAAGGGGAGCGGTGACGCGAGGATAGAGCTTCATGCGGTTCGTTGGAGAAGGAGGTGCTCG
CTGGAGAGAGGAAGCTTGCGTGGGTCTTCTCTTCGCGCACGAGATGTCGCCGGAGTGAGGAAGATTGCGTGTGGTTTGGAGGGAGGAAGATTGCGTCGAAAGGGAGGGCT
GAACAAAGGCAGGAAAAACGAGGAAGAAAATAAGAGAGAAAGGAGAAGGAAAAAAAAAAGGTCGTCGACAGTCGGCCGACAGCGGCAGGTGTGCTGGTGGCGCACGCCTA
GGTGCAAGGCTCAACGTTGGAGCCTCGCTTCGAAAAAGCGAGGCGCCATGAAGAAGGCGCATGCCTAA
Protein sequenceShow/hide protein sequence
MRKNKSSNVEIAAFPGKNTLLVPESPTRRPPLLPSRFVADRHAATAGNFFPPPPPPRFVLLARLPEHQKTDVLSLSLGHELEPEADQQLRRRSPEAVASSLKHCRRSPPE
VFGDLVGHFSGSRVSEILWFSKIRVSGTSGNFPRRIRSANQRVQSAIADLSTSDSRDLLVQAEAQLEEVLQEEEVYWKQRSRELWLREGDRNTRWFHCRASYRQKLNRIG
GLEDVQGVWQQEKTAVIQSGFLEADIGSRPSFVWRSLLWGRELLVRGCRWRIGNGRATPIYGSNWLPNEFSLQIQSAPVLSSASTVRELFTASGGWNEALLRTIFNGADC
EAILRIPLRHGSGEDRLIWHFEKHGNFSVKSGYRLAHTLATQDRPGSSNSEIVRVWWSGLWRLNVPNKHRFFLWRLCHDRLPTKVNLLKRGLSVSPLCVLCDADAEDCLH
LFWTCPVVKSMWLGSKFALLHQSFSHLSLSWGGQPDGRDLWAYSSDYLSAFHVGGRRCATGDCSRDQSRDQEERCVWRPPPNRELKLNTDASVRPDTGEAGGGCVLRGAD
GEVFMAACLSLQRCWSVDLAEGWAVYRGVQLARQLGFVDFVVETDSLRLVKILHGELHDVSEVGLLMDDIQRILSPWDNGKVLFTPRQGNKVAHVLARLAFSYVDRVWLE
EWPSEVSDVLRGDVVSAAAKASTLPLSGKKMGEDGWWVLINLVRASRAALDLAFAMQGVIYEGSQFQSPSGQSMLLRPTWGFHVGCLCMIAFVSMPAPMVSCWEAQGAGG
VIWRLRDVHALRIWGDECMNAFAKMSVVITLPQKTKSSPKLCTPAIRKAQIRREHNRKDEDGREEEENPREQIWDRPRVVGLNRCTLSKSKGSGDARIELHAVRWRRRCS
LERGSLRGSSLRARDVAGVRKIACGLEGGRLRRKGGLNKGRKNEEENKRERRRKKKRSSTVGRQRQVCWWRTPRCKAQRWSLASKKRGAMKKAHA