CuGenDBv2

Lag0010820 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0010820
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr1:7206497..7207993
RNA-Seq Expression: Lag0010820
Synteny: Lag0010820
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR002156 - Ribonuclease H domain
  IPR012337 - Ribonuclease H-like superfamily
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036397 - Ribonuclease H superfamily
  IPR044730 - Ribonuclease H-like domain, plant type
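The genome location in this record uses a `seqid:start..end` notation. As a minimal sketch, the string can be split into its parts and the genomic span computed. The names `Location` and `parse_location` are hypothetical helpers, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as 'chr1:7206497..7207993'."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("chr1:7206497..7207993")
print(loc.seqid, loc.span)  # chr1 1497
```

Under the inclusive-coordinate assumption, the gene spans 1,497 bp on chr1.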


Homology
GenBank top hits | e value | % identity | Alignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis] | e value: 1.3e-63 | % identity: 31.41
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF
        +LAKQCWRIL+ P SL+  + R RY P   FLEA +G+ PSF+WR+L WG+ELL                               S P LP +++V DLF
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD
          SG W+  +L+  F   + +A L+IPL    G D LIWH+E++  ++VKSGY+LA     K    PS   D    +W  +W L +P K KFFLWR   D
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD

Query:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD
         LP    L  R +    +C  C    E   H  W C   + +W  S + ++   +    F E+  A++    G +  L     W +WN RN+  + G+S+
Subjt:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD

Query:  GRDLWV---------FSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACM-GLPRCWS
             +         FS          GR+     PL            WRPP  G  K+NVD +V+        G +VR A GE FMAAC+  +   + 
Subjt:  GRDLWV---------FSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACM-GLPRCWS

Query:  VDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLSKF
            E  A  +G++ A  +G +  V+E D+   +  +    +     G L+++V  +LH +      ++PR GN+VAH LA  AF       W +   ++
Subjt:  VDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLSKF

Query:  LKP
        L P
Subjt:  LKP

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis] | e value: 4.5e-61 | % identity: 31.06
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGREL-------------------------------LSVPSLPAASVVSDLF
        ++AKQ WR+++ P+SL+  V++ RY+  S F  A +GS PSF+WR++LWG ++                               +S  +LP  +VV+DL 
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGREL-------------------------------LSVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR
             W    L  HF   D EAIL+I L  G  ED ++WHF+K   ++VKSGYQLA      + P  SN       W   W L++P K K F+WR   + 
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR

Query:  LPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTW-GGQSD
        LPT  NL KR      +C  C   VE   H+  +C     +W  +             F   I  M  R    + EL++++ W +W+ RN   + G +SD
Subjt:  LPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTW-GGQSD

Query:  GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVY
         R L   ++  L  +    R    G+   A+ R  + +  W+PP+   LKLNVDA+V     +   G +VR AEG++             V LAE  A++
Subjt:  GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVY

Query:  KGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGW
         G+Q+A Q+  S  +VE+D   +V++L+      +E+  ++ DVR     +   +  F PR  N  AHALA  A    +   W
Subjt:  KGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGW

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia] | e value: 1.1e-72 | % identity: 33.75
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGREL-------------------------------LSVPSLPAASVVSDLF
        +LAKQCWRIL  P+S+L  VL+GRYF    F+EA +   PS++WR++LWGR+L                               LS P LP  S VS L 
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGREL-------------------------------LSVPSLPAASVVSDLF

Query:  -AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLA-QTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFH
            GGW   V+R  F   + + IL IP+  G  EDRLIW++EK   ++V+SGY++A         PS S+ + +  WW+G W++++P K K FLWRL  
Subjt:  -AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLA-QTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFH

Query:  DRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQS
        DRLPT  NL KR + + + C  C  + ED  HLFW C   E++W+ SKF  L           ++    E L   DFE + +  W +WN RN   +   +
Subjt:  DRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQS

Query:  D-----GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLA
              G +L  ++  Y   F     R    +P+  R  +     +W+PP  G  K+N DAS       A  G ++    G+V  AA   L    SVD+A
Subjt:  D-----GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLA

Query:  EGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLA
        E  A  +G+QLA ++G                +H  L+D+SE G ++   +       H+   F  R+GN+ AH LA  A
Subjt:  EGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLA

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina] | e value: 1.3e-63 | % identity: 31.35
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF
        ++AKQ WRI+Q PSSL+  VL+ RYF  +GF+ AGLGS+PSFVWR+++WGR++L                               S PS+   + V++L 
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR
             W E ++  HF   D EAI++IPL     ED+LIWH++K   ++VKSGYQ+A  +   + PS SN D+    W  +W+L +P K K FLWR  HD 
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR

Query:  LPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMK------ERLPGPDFELVVIFWWSVWNLRNTLTW
        LPT  NL K+ +    +C  C   VE   H   +C     +W  S  A   R   +    +++W ++       ++ G +   V    W++W  RN   +
Subjt:  LPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMK------ERLPGPDFELVVIFWWSVWNLRNTLTW

Query:  GGQSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAE
         G+ +     V + + +       R+    + +     + E +  W PP  G  K+NVDA+V  ++  A  G +VR ++G    AA   L    SV +AE
Subjt:  GGQSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAE

Query:  GWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGW
          A+  G+++A +  ++  + E+DSL ++ +++     ++E+G L+ D++  L  + + K   SPR  N  AH+LA LA        W
Subjt:  GWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGW

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis] | e value: 1.5e-61 | % identity: 32.22
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS--------------------VP-----------SLPAASVVSDLF
        ++AKQ WR+LQ P+SL+  VL+ RYF  S FL A  G+  S++WR+++WGR+++                     +P           SLP +SVV+DL 
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS--------------------VP-----------SLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR
             WDE  LR HF   D   IL+IPL     ED ++WH++K   ++VKSGYQLA  L  K   S S  +  H +WS LW L +P K K F+WR  ++ 
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR

Query:  LPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSDG
        LP+  NL KR +     C  C   VE   H   +C     +WL+S F++ R   +       +  M + L   D EL+V   WS W  RN   +    DG
Subjt:  LPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSDG

Query:  RDL-----WVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEG
        R+L        +E  LT F    +   +   +  + +  E    W PP     K+NVDA+    +  A  G ++R + G++  A         S  LAE 
Subjt:  RDL-----WVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEG

Query:  WAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLA
         AV  G+QLAR   +S  ++E+D L +V++++      SE+   +  ++  +  +    V   PR  N  AH LA +A
Subjt:  WAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLA

TrEMBL top hits | e value | % identity | Alignment
A0A251NPF0 Reverse transcriptase domain-containing protein | e value: 1.0e-58 | % identity: 31.09
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF
        +LAKQCWRIL+ P SL+  + R RY P   FLEA +G+ PSF+WR+L WG+ELL                               S P LP ++ V DLF
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD
          SG W+  +L+  F   + +AIL+IPL    G D LIWH+E++  ++VKSGY+LA     K    PS   D    +W  +W L +P K KFFLWR   D
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD

Query:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD
         LP    L  R +    +C  C    E   H  W C   + +W  S + ++   +    F E+  A++    G +  L     W +WN RN+  + G+S+
Subjt:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD

Query:  ---------GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEA-RG-GCLVRGAEGEVFMAACM-GLPRC
                  +    FS+    +    GR+     PL+           WRPP   K            SG++ RG G +VR A GE FMAAC+  +   
Subjt:  ---------GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEA-RG-GCLVRGAEGEVFMAACM-GLPRC

Query:  WSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLS
        +     E  A  +G++ A  +G +D ++E D+   +  +    +     G L+++V  +L+ +      ++PR GN+VAH LA  AF       W +   
Subjt:  WSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLS

Query:  KFLKP
         +L P
Subjt:  KFLKP

A0A5E4FZN9 PREDICTED: retrotransposon | e value: 6.1e-64 | % identity: 31.41
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF
        +LAKQCWRIL+ P SL+  + R RY P   FLEA +G+ PSF+WR+L WG+ELL                               S P LP +++V DLF
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD
          SG W+  +L+  F   + +A L+IPL    G D LIWH+E++  ++VKSGY+LA     K    PS   D    +W  +W L +P K KFFLWR   D
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD

Query:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD
         LP    L  R +    +C  C    E   H  W C   + +W  S + ++   +    F E+  A++    G +  L     W +WN RN+  + G+S+
Subjt:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD

Query:  GRDLWV---------FSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACM-GLPRCWS
             +         FS          GR+     PL            WRPP  G  K+NVD +V+        G +VR A GE FMAAC+  +   + 
Subjt:  GRDLWV---------FSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACM-GLPRCWS

Query:  VDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLSKF
            E  A  +G++ A  +G +  V+E D+   +  +    +     G L+++V  +LH +      ++PR GN+VAH LA  AF       W +   ++
Subjt:  VDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLSKF

Query:  LKP
        L P
Subjt:  LKP

A0A6J1DAR4 uncharacterized protein LOC111018954 | e value: 5.5e-73 | % identity: 33.75
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGREL-------------------------------LSVPSLPAASVVSDLF
        +LAKQCWRIL  P+S+L  VL+GRYF    F+EA +   PS++WR++LWGR+L                               LS P LP  S VS L 
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGREL-------------------------------LSVPSLPAASVVSDLF

Query:  -AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLA-QTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFH
            GGW   V+R  F   + + IL IP+  G  EDRLIW++EK   ++V+SGY++A         PS S+ + +  WW+G W++++P K K FLWRL  
Subjt:  -AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLA-QTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFH

Query:  DRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQS
        DRLPT  NL KR + + + C  C  + ED  HLFW C   E++W+ SKF  L           ++    E L   DFE + +  W +WN RN   +   +
Subjt:  DRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQS

Query:  D-----GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLA
              G +L  ++  Y   F     R    +P+  R  +     +W+PP  G  K+N DAS       A  G ++    G+V  AA   L    SVD+A
Subjt:  D-----GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLA

Query:  EGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLA
        E  A  +G+QLA ++G                +H  L+D+SE G ++   +       H+   F  R+GN+ AH LA  A
Subjt:  EGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLA

A0A803QQT2 Uncharacterized protein | e value: 4.7e-64 | % identity: 31.21
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF
        +LAKQ WR L+ P  L   VL+  YFP+ G LEAG G+  SFVWR+L+WG++L+                                 PSLPA   V+DL 
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR
           G WDE  +R+ F+ +D + IL IP      ED+++WH+ K+  ++VKSGY++A +   +     SN   +  WW  LWRL +P K K F+W++ H+ 
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDR

Query:  LPTKINLLKRDLNVPSVCVLCDEDVEDR-RHLFWDCPVVESMWLRSKFASLRRSFSQLRFEE---VIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGG
        LP  +NL KR +    VC  C   V++   H  W+C   +  W   + + L     Q+  E+   ++  +         E  ++  W++WN+RNT+  GG
Subjt:  LPTKINLLKRDLNVPSVCVLCDEDVEDR-RHLFWDCPVVESMWLRSKFASLRRSFSQLRFEE---VIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGG

Query:  -QSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEG
              ++  +  ++L  F         GD  R RS+       W PPA  ++ +NVDA V+     +  G +VR A G V  AA   L +       E 
Subjt:  -QSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEG

Query:  WAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGW
         A+ KG+Q+  Q  L  F VETD L+ V ++        ++  L++ +R ++       + F  R+ NRVAHALA+ A  +     W
Subjt:  WAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGW

M5W5F3 Reverse transcriptase domain-containing protein (Fragment) | e value: 1.0e-58 | % identity: 31.09
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF
        +LAKQCWRIL+ P SL+  + R RY P   FLEA +G+ PSF+WR+L WG+ELL                               S P LP ++ V DLF
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-------------------------------SVPSLPAASVVSDLF

Query:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD
          SG W+  +L+  F   + +AIL+IPL    G D LIWH+E++  ++VKSGY+LA     K    PS   D    +W  +W L +P K KFFLWR   D
Subjt:  AVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPS-NPDRMHAWWSGLWRLNVPGKHKFFLWRLFHD

Query:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD
         LP    L  R +    +C  C    E   H  W C   + +W  S + ++   +    F E+  A++    G +  L     W +WN RN+  + G+S+
Subjt:  RLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSD

Query:  ---------GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEA-RG-GCLVRGAEGEVFMAACM-GLPRC
                  +    FS+    +    GR+     PL+           WRPP   K            SG++ RG G +VR A GE FMAAC+  +   
Subjt:  ---------GRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEA-RG-GCLVRGAEGEVFMAACM-GLPRC

Query:  WSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLS
        +     E  A  +G++ A  +G +D ++E D+   +  +    +     G L+++V  +L+ +      ++PR GN+VAH LA  AF       W +   
Subjt:  WSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLS

Query:  KFLKP
         +L P
Subjt:  KFLKP

SwissProt top hits | e value | % identity | Alignment
P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 4.1e-33 | % identity: 26.11
Query:  VLAKQCWRILQEPSSLLCSVLRGRY-----------FPQSGF--------------LEAGLGSRPSFVWRNLLW------GRELLSV-----PSLPAASV
        +++K  WR+LQE +SL   VL+ +Y            P+  +              +  G+G  P    +   W      G+ LL +     P+     V
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRY-----------FPQSGF--------------LEAGLGSRPSFVWRNLLW------GRELLSV-----PSLPAASV

Query:  VSDLFAVSGGWDEAVLRAHFDLSDREAILRIPLRHGLG-EDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLW
          DL+    GWD A +  +   + R  +  + L    G  DRL W F +   F+V+S Y++   L V + P P+    M ++++ LW++ VP + K FLW
Subjt:  VSDLFAVSGGWDEAVLRAHFDLSDREAILRIPLRHGLG-EDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLW

Query:  RLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRS---FSQLRFEEVIWAMKERLPGPDFE----LVVIFWWSVWN
         + +  + T+    +R L+  +VC +C   VE   H+  DCP    +W+R      RR    FS+  FE +   + +R    D        VI WW  W 
Subjt:  RLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRS---FSQLRFEEVIWAMKERLPGPDFE----LVVIFWWSVWN

Query:  LRNTLTWGGQSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCV-WRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPR
         R    +G  +  RD   F +++    +    R  +G+ L   ++    R + W  P +G +K+N D + R + G A  G ++R   G       + + R
Subjt:  LRNTLTWGGQSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCV-WRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPR

Query:  CWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVF
        C S   AE W VY G+  A +  +    +E DS  +V  L  G+ D   +  L+      L      +++   R+ NR+A  LA+ AFS    F
Subjt:  CWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVF

P93295 Uncharacterized mitochondrial protein AtMg00310 | e value: 7.6e-11 | % identity: 52.73
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS
        +LAKQ +RI+ +P +LL  +LR RYFP S  +E  +G+RPS+ WR+++ GRELLS
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS
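In these alignments, the middle line shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space otherwise. As a rough sketch, the % identity of the P93295 hit can be recovered from that middle line. The function name `percent_identity` is hypothetical. Dividing by the aligned length is an assumption about how the reported figure was computed, and it is only straightforward here because this alignment has no gaps.

```python
def percent_identity(query: str, midline: str) -> float:
    """Estimate % identity from a BLAST-style midline.

    Letters in the midline mark identical residues; '+' and spaces do not.
    Assumption: the denominator is the aligned length, taken here as len(query).
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


query = "VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS"
midline = "+LAKQ +RI+ +P +LL  +LR RYFP S  +E  +G+RPS+ WR+++ GRELLS"

print(f"{percent_identity(query, midline):.2f}")  # 52.73
```

This gives 29 identical positions out of 55, which agrees with the 52.73 listed for this hit.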

Arabidopsis top hits | e value | % identity | Alignment
AT2G02650.1 Ribonuclease H-like superfamily protein | e value: 1.4e-20 | % identity: 23.98
Query:  VKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFA
        ++SGY +A    + +  +   P         +W+L+V  K K FLWR     L T   L  R+++   +C  C  + E   H+ ++CP  +S+W  +   
Subjt:  VKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKFA

Query:  SLRRSFSQLRFEE----VIWAMKERLPGPDFELVVIFW--WSVWNLRNTLTWGGQSDGRDL-----------WVFSEDYL--TVFHAGGRRGLAGDPLRA
           +      FE+    +I   K +      +  + FW  W +W  RN   +  +    D            W+ + +    T  H      +A +P++ 
Subjt:  SLRRSFSQLRFEE----VIWAMKERLPGPDFELVVIFW--WSVWNLRNTLTWGGQSDGRDL-----------WVFSEDYL--TVFHAGGRRGLAGDPLRA

Query:  RSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGG
          RD      W PP  G +K N D+     S   R G  +R   G + +     L        AE       +Q+    GL     E+DS  LV +++ G
Subjt:  RSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGG

Query:  LQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALAS
         +D S +G L+ D+R  +    +  + F  R+ N  A ALAS
Subjt:  LQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALAS

AT3G09510.1 Ribonuclease H-like superfamily protein | e value: 3.6e-32 | % identity: 23.9
Query:  LRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-----------------------SVPSLPAAS-------VVSDLFAVSGG---WDEAVLRAHFDLS
        ++ RYF     L+A +  + S+ W +LL G  LL                       S P  P  +        +++LF   G    WD++ +    D S
Subjt:  LRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL-----------------------SVPSLPAAS-------VVSDLFAVSGG---WDEAVLRAHFDLS

Query:  DREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVC
        D   I RI L      D++IW++     +TV+SGY L       + P+ + P       + +W L +  K K FLWR     L T   L  R + +   C
Subjt:  DREAILRIPLRHGLGEDRLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVC

Query:  VLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPG---PDFELVVIFW--WSVWNLRNTLTWGGQSDGRDLWVFSEDYLTV
          C  + E   H  + CP     W  S  + +R       FEE I  +   +      DF  ++  W  W +W  RN + +    +     V S    T 
Subjt:  VLCDEDVEDRRHLFWDCPVVESMWLRSKFASLRRSFSQLRFEEVIWAMKERLPG---PDFELVVIFW--WSVWNLRNTLTWGGQSDGRDLWVFSEDYLTV

Query:  FHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVYKGVQLARQLGLSDF
              +     P   R +  E +  WR P    +K N DA       EA GG ++R   G       M L    +   AE  A+   +Q     G +  
Subjt:  FHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVYKGVQLARQLGLSDF

Query:  VVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLSKFL
         +E D   L+ +++ G+   S +   ++D+    + +   +  F  R+GN++AH LA    +Y T +     L  +L
Subjt:  VVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSYGTVFGWKKGLSKFL

AT3G25270.1 Ribonuclease H-like superfamily protein | e value: 1.3e-13 | % identity: 22.15
Query:  LWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRS--KFASLRRS--FSQLRFEEVIWAMKERLPGPDF
        +W+L    K K FLW+L    L T  NL +R +     C  C ++ E  +HLF+DC   + +W  S      LR +    + + E ++ +         F
Subjt:  LWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRS--KFASLRRS--FSQLRFEEVIWAMKERLPGPDF

Query:  ELVVIFWWSVWNLRNTLTWGGQS-----------DGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEAR
         L +   W +W  RN L +  +S           +    W  +  Y+   +          P  AR++       W+ P    +K N D +    +  A+
Subjt:  ELVVIFWWSVWNLRNTLTWGGQS-----------DGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEAR

Query:  GGCLVRGAEGEVFMAACMGLPRCWSVDL-AEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGN
         G L+R   G V+M +   +    S  L +E  A+   +Q A   G    + E DS ++ ++++    +      + +  R     ++ +   + PR  N
Subjt:  GGCLVRGAEGEVFMAACMGLPRCWSVDL-AEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGN

Query:  RVAHALA
        + A  LA
Subjt:  RVAHALA

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 1.1e-38 | % identity: 27.33
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL----------------------------------SVPSLPAASV--
        +L KQ WR+L  P SL+  V + RYF +S  L A LGSRPSFVW+++   +E+L                                   VP    ASV  
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELL----------------------------------SVPSLPAASV--

Query:  ---VSDLFAVSG-GWDEAVLRAHFDLSDREAI--LRIPLRHGLGEDRLIWHFEKHEAFTVKSGY-QLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGK
           VSDL   SG  W + V+   F   +R+ I  LR   R  L  D   W +     +TVKSGY  L Q +  +  P   +   ++  +  +W+     K
Subjt:  ---VSDLFAVSG-GWDEAVLRAHFDLSDREAI--LRIPLRHGLGEDRLIWHFEKHEAFTVKSGY-QLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGK

Query:  HKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKF-ASLRRSFSQLRFEEVIWAMKERLPGPDFE----LVVIFWW
         + FLW+   + LP    L  R L+  S C+ C    E   HL + C      W  S     L   ++   +  + W        P +E    LV    W
Subjt:  HKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRSKF-ASLRRSFSQLRFEEVIWAMKERLPGPDFE----LVVIFWW

Query:  SVWNLRNTLTWGGQS-DGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACM
         +W  RN L + G+  + +++   +ED L  +         G   +  +R   GR  WRPP    +K N DA+   D+     G ++R  +GEV      
Subjt:  SVWNLRNTLTWGGQS-DGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDASVRPDSGEARGGCLVRGAEGEVFMAACM

Query:  GLPRCWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSY
         LP+  SV  AE  A+   V    +   +  + E+DS  L++IL+   +    +   + D++ +L  +   K +F PR+GN +A  +A  + S+
Subjt:  GLPRCWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRVAHALASLAFSY

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 5.4e-12 | % identity: 52.73
Query:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS
        +LAKQ +RI+ +P +LL  +LR RYFP S  +E  +G+RPS+ WR+++ GRELLS
Subjt:  VLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLS


Sequences
CDS sequence
ATGCATACATATAATATAGTACTAGCTAAGCAGTGCTGGCGTATCTTGCAGGAACCGTCCTCTCTTTTGTGTTCGGTGCTTAGGGGTCGTTATTTCCCCCAGTCAGGGTT
TTTGGAGGCAGGTCTTGGCTCACGACCTTCGTTTGTATGGCGCAATTTGTTGTGGGGGCGGGAGCTCTTGTCGGTCCCGTCGCTTCCTGCTGCTAGTGTGGTTAGTGATC
TTTTTGCTGTGTCTGGTGGGTGGGATGAGGCTGTGCTCAGAGCCCATTTTGATTTGTCGGATCGTGAGGCCATCTTGAGAATCCCATTGCGGCATGGTCTGGGGGAGGAT
CGATTAATTTGGCATTTTGAGAAGCATGAGGCCTTCACTGTGAAGAGTGGGTATCAGCTTGCTCAGACGTTGGCTGTGAAGGACCGACCCTCACCCTCGAACCCTGATAG
GATGCACGCGTGGTGGTCCGGCCTCTGGAGGCTAAATGTGCCTGGTAAGCATAAGTTCTTTTTATGGCGACTGTTCCATGACCGTTTGCCTACTAAGATAAACCTCCTCA
AGCGTGACCTAAATGTCCCTAGCGTGTGTGTTTTGTGCGATGAGGATGTCGAGGATCGTCGCCATTTGTTCTGGGACTGCCCTGTGGTTGAGAGTATGTGGTTGCGCTCA
AAGTTTGCCTCACTCCGTCGGTCCTTTTCTCAGCTACGGTTTGAAGAAGTCATTTGGGCGATGAAGGAAAGACTTCCGGGGCCGGATTTTGAGCTTGTGGTCATCTTCTG
GTGGTCTGTGTGGAATCTCCGGAATACTCTGACCTGGGGTGGCCAGTCAGACGGTCGAGACTTATGGGTCTTTTCTGAGGATTACCTCACTGTCTTCCACGCTGGTGGGA
GGCGTGGCCTGGCAGGGGACCCCTTACGGGCCCGGTCAAGGGACTACGAGGGTCGCTGTGTGTGGAGGCCGCCGGCTATTGGAAAGCTGAAGCTGAATGTCGATGCCTCT
GTCAGGCCGGATTCAGGGGAAGCTAGGGGTGGTTGTTTGGTGCGTGGGGCTGAGGGTGAGGTCTTTATGGCGGCATGTATGGGCTTACCGAGGTGTTGGAGCGTGGATTT
GGCTGAGGGTTGGGCTGTGTATAAAGGGGTTCAGCTTGCTCGTCAGCTGGGGTTGTCAGATTTTGTGGTGGAGACCGACTCCCTGAGATTGGTCAAAATCCTTCATGGTG
GTTTGCAGGATGTTTCGGAGGTAGGCCGACTAATGGATGACGTCCGAATGATCCTCCATCCTTGGGACCATAGCAAGGTTCTATTTTCGCCACGCCAGGGAAATAGGGTG
GCGCATGCCTTGGCAAGCTTGGCCTTTTCTTATGGGACTGTGTTTGGCTGGAAGAAGGGCCTATCGAAATTTTTGAAGCCCTAA
mRNA sequence
ATGCATACATATAATATAGTACTAGCTAAGCAGTGCTGGCGTATCTTGCAGGAACCGTCCTCTCTTTTGTGTTCGGTGCTTAGGGGTCGTTATTTCCCCCAGTCAGGGTT
TTTGGAGGCAGGTCTTGGCTCACGACCTTCGTTTGTATGGCGCAATTTGTTGTGGGGGCGGGAGCTCTTGTCGGTCCCGTCGCTTCCTGCTGCTAGTGTGGTTAGTGATC
TTTTTGCTGTGTCTGGTGGGTGGGATGAGGCTGTGCTCAGAGCCCATTTTGATTTGTCGGATCGTGAGGCCATCTTGAGAATCCCATTGCGGCATGGTCTGGGGGAGGAT
CGATTAATTTGGCATTTTGAGAAGCATGAGGCCTTCACTGTGAAGAGTGGGTATCAGCTTGCTCAGACGTTGGCTGTGAAGGACCGACCCTCACCCTCGAACCCTGATAG
GATGCACGCGTGGTGGTCCGGCCTCTGGAGGCTAAATGTGCCTGGTAAGCATAAGTTCTTTTTATGGCGACTGTTCCATGACCGTTTGCCTACTAAGATAAACCTCCTCA
AGCGTGACCTAAATGTCCCTAGCGTGTGTGTTTTGTGCGATGAGGATGTCGAGGATCGTCGCCATTTGTTCTGGGACTGCCCTGTGGTTGAGAGTATGTGGTTGCGCTCA
AAGTTTGCCTCACTCCGTCGGTCCTTTTCTCAGCTACGGTTTGAAGAAGTCATTTGGGCGATGAAGGAAAGACTTCCGGGGCCGGATTTTGAGCTTGTGGTCATCTTCTG
GTGGTCTGTGTGGAATCTCCGGAATACTCTGACCTGGGGTGGCCAGTCAGACGGTCGAGACTTATGGGTCTTTTCTGAGGATTACCTCACTGTCTTCCACGCTGGTGGGA
GGCGTGGCCTGGCAGGGGACCCCTTACGGGCCCGGTCAAGGGACTACGAGGGTCGCTGTGTGTGGAGGCCGCCGGCTATTGGAAAGCTGAAGCTGAATGTCGATGCCTCT
GTCAGGCCGGATTCAGGGGAAGCTAGGGGTGGTTGTTTGGTGCGTGGGGCTGAGGGTGAGGTCTTTATGGCGGCATGTATGGGCTTACCGAGGTGTTGGAGCGTGGATTT
GGCTGAGGGTTGGGCTGTGTATAAAGGGGTTCAGCTTGCTCGTCAGCTGGGGTTGTCAGATTTTGTGGTGGAGACCGACTCCCTGAGATTGGTCAAAATCCTTCATGGTG
GTTTGCAGGATGTTTCGGAGGTAGGCCGACTAATGGATGACGTCCGAATGATCCTCCATCCTTGGGACCATAGCAAGGTTCTATTTTCGCCACGCCAGGGAAATAGGGTG
GCGCATGCCTTGGCAAGCTTGGCCTTTTCTTATGGGACTGTGTTTGGCTGGAAGAAGGGCCTATCGAAATTTTTGAAGCCCTAA
Protein sequence
MHTYNIVLAKQCWRILQEPSSLLCSVLRGRYFPQSGFLEAGLGSRPSFVWRNLLWGRELLSVPSLPAASVVSDLFAVSGGWDEAVLRAHFDLSDREAILRIPLRHGLGED
RLIWHFEKHEAFTVKSGYQLAQTLAVKDRPSPSNPDRMHAWWSGLWRLNVPGKHKFFLWRLFHDRLPTKINLLKRDLNVPSVCVLCDEDVEDRRHLFWDCPVVESMWLRS
KFASLRRSFSQLRFEEVIWAMKERLPGPDFELVVIFWWSVWNLRNTLTWGGQSDGRDLWVFSEDYLTVFHAGGRRGLAGDPLRARSRDYEGRCVWRPPAIGKLKLNVDAS
VRPDSGEARGGCLVRGAEGEVFMAACMGLPRCWSVDLAEGWAVYKGVQLARQLGLSDFVVETDSLRLVKILHGGLQDVSEVGRLMDDVRMILHPWDHSKVLFSPRQGNRV
AHALASLAFSYGTVFGWKKGLSKFLKP
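The protein sequence above can be related to the CDS by translating it codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative helpers and are not part of the database. The example input is the first 33 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nt of the CDS for Lag0010820.
print(translate("ATGCATACATATAATATAGTACTAGCTAAGCAG"))  # MHTYNIVLAKQ
```

The output matches the first 11 residues of the protein sequence. Passing the full CDS, with its line breaks joined, would allow the same check over the whole record.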