; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0016086 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0016086
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:32972396..32973676
RNA-Seq ExpressionLag0016086
SyntenyLag0016086
Gene Ontology termsGO:0016740 - transferase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268750.1 unnamed protein product [Prunus armeniaca]6.9e-12550.72Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +IS +QSAFVPGR I DN I+ FE +H++ ++  G+ G+ ALKLDMSKAYDRVEW FLE  M  +G D R V+LIM C+++VSYSF LNG  VG ++P R
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ DP+SPYLFLLC E  S ++   E++  + GV + RG P++SHLFFADD  LF KA+      + ++L+ Y  ++GQ+IN  K  +  S NV ++ 
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         E  A  L V+ V  HD+YLGLP   G S+      +KD IW  IQ W+    S  GKEVL+KVV QA P Y MS F +PK L  +  +MMA +WWG  +
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
        G R+IHWLSWK +C  K EGGLGFR+L  FN AL+AKQ WRL ++P SL+ ++LK +YYRDCS L+A    S SYVW+SL   + +L  G RWR+GDG++
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRPPTLKLI
        +R+  DRW+P P + K++
Subjt:  IRVVGDRWIPRPPTLKLI

PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis]2.2e-12352.67Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +ISP QSAFVPGR I DN ++ FE  H LK++R GK G+ ALKLDMSKAYDRVEW FLE  M  +G     +  IM CVSTVSYSF +NG+  GR++PSR
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ D ISPYLFLLC E LSR +   E    + GVRI  G PSISHLFFADD  +FF+A   + ++V  +L  Y   +GQ++NY+K  +  S NVD  +
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         E IA +L V  V+ HDKYLGLP     SK      + +KI    Q WR    S  GKEVLIK V QA P+Y MS F +P+ L  + +R++A+FWWG++ 
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
          R+IHW++W+K+C  K +GGLGFR++ +FN ALL KQ WRLI  PDS+I K+LK KY+  CSFL+A+ KG  SY W+S++  R +L +G+R++VGDG S
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRP
        IRV  D W+P P
Subjt:  IRVVGDRWIPRP

XP_010682492.1 PREDICTED: uncharacterized protein LOC104897331 [Beta vulgaris subsp. vulgaris]1.8e-12049.64Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +ISP QSAFVPGR I DN ++ +E  HY+K+    KTG  A KLDMSKAYDRVEW FLE+ M  +G     V  IM C+S+VSY+F+LNG+V G I+PSR
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ DP+SPYLFLLC E  S +L    +D R+ G R+ R  P ISHLFFADD +LF +A ++E  +VA ++  Y   +GQ+IN+ K  +  S NVD+  
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
           I ++L V+ V  HDKYLGLP   G SK  V   +K+++W  +Q W+    S  GKEVLIK V+QA PTY MS F +P  ++ D N M ARFWW    
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
          R++HW+SW+K C+ K  GG+GFRDL+ FN+ALLAKQ WRL+ D  SL  ++++ +Y+++  FL A+     S+VW+S+   +SLL EGL+WRVG+G S
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRPPTLKL
        IRV    W+P   + K+
Subjt:  IRVVGDRWIPRPPTLKL

XP_024156142.1 uncharacterized protein LOC112164137 [Rosa chinensis]2.9e-12352.91Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +I+PTQSAFVPGR I DN +L FE  H+LK++  G  G+ ALKLDMSKAYDRVEW F+E  M ++G D   +  IM CV+TVSYSF LNG+  G ++P+R
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ D ISPYLFLLC EGLSRML++ EE  R+ G+ IA G PSI+HLFFADD  +F KA   E   V ++L+ Y   +GQ++N++K  +  S NVD   
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         E +A V  V+ V+ HDKYLGLP     SK    + I +K    ++ W+    SV GKEV+IK V+Q+ PTY MS F LPK L  + +R MA FWWG+ +
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
          R+IHWL+W KMC+ K EGGLGFR++E FN+ALLAKQ WR++  PDSL+GK LK KY+ +  F+ A      SY W+SLM  + LL +GLR++VG G  
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRP
        I V  D WIPRP
Subjt:  IRVVGDRWIPRP

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]1.7e-12352.67Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +I+PTQSAFVPGR I DN +L FE  H+LK++  G  G+ ALKLDMSKAYDRVEW F+E  M ++G D   ++ IM CV+TVSYSF LNG+  G ++P+R
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ D ISPYLFLLC EGLSRML++ EE  R+ G+ IA G PSI+HLFFADD  +F KA   E   V ++L+ Y   +GQ++N++K  +  S NVD   
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         E +A V  V+ V+ HDKYLGLP     SK    + I +K    ++ W+    SV GKEV+IK V+Q+ PTY MS F LPK L  + +R MA FWWG+ +
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
          R+IHWL+W KMC+ K +GGLGFR++E FN+ALLAKQ WR++  PDSL+GK LK KY+ +  F+ A      SY W+SLM  + LL +GLR++VG G  
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRP
        I V  D WIPRP
Subjt:  IRVVGDRWIPRP

TrEMBL top hitse value%identityAlignment
A0A2P6SDG4 Putative RNA-directed DNA polymerase1.1e-12352.67Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +ISP QSAFVPGR I DN ++ FE  H LK++R GK G+ ALKLDMSKAYDRVEW FLE  M  +G     +  IM CVSTVSYSF +NG+  GR++PSR
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ D ISPYLFLLC E LSR +   E    + GVRI  G PSISHLFFADD  +FF+A   + ++V  +L  Y   +GQ++NY+K  +  S NVD  +
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         E IA +L V  V+ HDKYLGLP     SK      + +KI    Q WR    S  GKEVLIK V QA P+Y MS F +P+ L  + +R++A+FWWG++ 
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
          R+IHW++W+K+C  K +GGLGFR++ +FN ALL KQ WRLI  PDS+I K+LK KY+  CSFL+A+ KG  SY W+S++  R +L +G+R++VGDG S
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRP
        IRV  D W+P P
Subjt:  IRVVGDRWIPRP

A0A6J5TXB8 Reverse transcriptase domain-containing protein3.3e-12550.72Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +IS +QSAFVPGR I DN I+ FE +H++ ++  G+ G+ ALKLDMSKAYDRVEW FLE  M  +G D R V+LIM C+++VSYSF LNG  VG ++P R
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ DP+SPYLFLLC E  S ++   E++  + GV + RG P++SHLFFADD  LF KA+      + ++L+ Y  ++GQ+IN  K  +  S NV ++ 
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         E  A  L V+ V  HD+YLGLP   G S+      +KD IW  IQ W+    S  GKEVL+KVV QA P Y MS F +PK L  +  +MMA +WWG  +
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
        G R+IHWLSWK +C  K EGGLGFR+L  FN AL+AKQ WRL ++P SL+ ++LK +YYRDCS L+A    S SYVW+SL   + +L  G RWR+GDG++
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRPPTLKLI
        +R+  DRW+P P + K++
Subjt:  IRVVGDRWIPRPPTLKLI

A0A803PBM9 Uncharacterized protein1.4e-12049.88Show/hide
Query:  ISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSRG
        IS  QSAFV GR I DN I+GFE +H +K++R G     ALKLDMSKAYDRVEW FL   M  LG +   +E IMRCV++VS+S  +NG+ +G+  P+RG
Subjt:  ISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSRG

Query:  LRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQVG
        LRQ D +SPYLFL+C EGLS ++   E    + GVR  +    +SHLFFADD  +F + N  E   ++ +L+ Y+ ++GQ+IN +K  + +   +  Q+G
Subjt:  LRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQVG

Query:  EAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDG
         ++A  L V LV  H KYLGLP+  G  K  V + IKDK+W  ++ W+ T FS  GKE+LIK V+QA P+YSMS FRLPK L+H  + + A FWWG+   
Subjt:  EAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDG

Query:  FRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSI
         ++IHW +W K+C  K EGGLGFR L  FN+ALLAKQ WRLI+ P SL+ +VLK  YY + SFL+AK +  AS +W+ + W R ++ EG RWRVG+GR++
Subjt:  FRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSI

Query:  RVVGDRWIPRP
        R+  D+WIPRP
Subjt:  RVVGDRWIPRP

A0A803Q9W0 Uncharacterized protein3.7e-12449.88Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +IS  QSAF+ GR I DN ILGFE +H +K+ R G     ALKLDMSKAYDRVEW FLE  M  LG D R V+ IM C+ ++S+S  LNG V G+I PSR
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ DP+SPY+FLLC EGLS ++   E   R+ G+R  R    +SHLFFADD  +F  A   + Q +  +L+ Y++++GQ IN+ K  LC+   ++   
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
        G  +AA+L V+LV+ H KYLG+PA  G  K  V + I+ KI   +Q W+ + FS  G+E+L+K ++QA PTY MS FRLPK L+ D + MMARFWWG  D
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
          ++ HW +WKK+C  K +GG+GF++LELFN++LLAKQ W++IN+P S++ +VLK  YY + +FL+AK  G  SY+W+S++W R ++ +G+RWRV  GR 
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRPPTLKL
        +R+  D+W+PRP T  L
Subjt:  IRVVGDRWIPRPPTLKL

M5W5K8 Uncharacterized protein5.1e-12650.12Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +IS TQSAFVPGR I DN I+ FE +H + +K +G+ G+ ALK+DMSKAYDRVEW FLE  M  +G   R ++LIM CV+TVSYSF LNG  VG ++P R
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        GLRQ DP+SPYLFLLC E LS ++   E    + GV + RG PS+SHLFFADD  LF +A+ ++ + ++ + + Y +++GQ+I+ +K  +  S N+D   
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD
         + +AAVL V+ V+ HD YLGLP   G S+      +K++IW  IQ W+    S  GKE+L+KVV QA P Y M+ F +PK L ++  ++MAR+WW E D
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDD

Query:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS
        G R+IHWLSW K+C+ K EGGLGFR+L  FN ALLAKQ WRLI  P+SL+  +LK +Y+++CS L+A+   S SY+W+SL   R L+ +G RWR+G+G S
Subjt:  GFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRS

Query:  IRVVGDRWIPRPPTLKL
        +R+ GDRW+P   + ++
Subjt:  IRVVGDRWIPRPPTLKL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-2323.85Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +I   Q  F+PG     N+      + ++   R        + +D  KA+D+++  F+ K +  LGIDG  +++I       + +  LNGQ +       
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        G RQ  P+SP LF + +E L+R    + ++K + G+++  G   +    FADD +++ +  I  AQ + K++  ++ ++G +IN +K    L  N + Q 
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAG------FGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSF--FRLPKSLVHDCNRMMA
           I   L   +     KYLG+         F  +   +LK+IK+       +W++   S  G+  ++K+ +     Y  +    +LP +   +  +   
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAG------FGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSF--FRLPKSLVHDCNRMMA

Query:  RFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRW
        +F W +         LS K        GG+   D +L+ KA + K  W
Subjt:  RFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRW

P08548 LINE-1 reverse transcriptase homolog2.0e-1823.23Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +I   Q  F+PG     N+      + ++  K K K     L +D  KA+D ++  F+ + +  +GI+G  ++LI    S  + +  LNG  +       
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        G RQ  P+SP LF + +E L+     + E+K + G+ I  G   I    FADD +++ +        + +V++ Y+ ++G +IN  K    +  N + Q 
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKD-----------KIWFYIQRWRHTCFSVGGKEVLIK--VVLQAFPTYSMSFFRLPKSLVHDC
         + +   +   +V    KYLG+          + K +KD           +I   + +W++   S  G+  ++K  ++ +A   ++    + P S   D 
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVVLKQIKD-----------KIWFYIQRWRHTCFSVGGKEVLIK--VVLQAFPTYSMSFFRLPKSLVHDC

Query:  NRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRW
         +++  F W +         LS K        GG+   DL L+ K+++ K  W
Subjt:  NRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRW

P0C2F6 Putative ribonuclease H protein At1g657502.4e-2735.29Show/hide
Query:  QIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALL
        +I +++   +  WR    S  G+  L K VL + P +SMS   LP+S+++  +++   F WG     ++ H + W K+C  K EGGLG R  +  N+AL+
Subjt:  QIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALL

Query:  AKQRWRLINDPDSLIGKVLKGKYY----RDCSFLKAKDKGSASYVWKSL-MWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPPTLKL
        +K  WRL+ + +SL   VL+ KY+    RD  +L    KGS S  W+S+ +  R +++ G+ W  GDG+ IR   DRW+   P L+L
Subjt:  AKQRWRLINDPDSLIGKVLKGKYY----RDCSFLKAKDKGSASYVWKSL-MWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPPTLKL

P11369 LINE-1 retrotransposable element ORF2 protein1.4e-2423.98Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +I P Q  F+PG     N+      +HY+  K K K     + LD  KA+D+++  F+ K +   GI G  + +I    S    + ++NG+ +  I    
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV
        G RQ  P+SPYLF + +E L+R    + + K + G++I +    IS L  ADD +++        + +  ++ ++  + G +IN  K  +      ++Q 
Subjt:  GLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQV

Query:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVV------LKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVL--QAFPTYSMSFFRLPKSLVHDCNRMMA
         + I       +V ++ KYLG+      +K V        K +K +I   ++RW+    S  G+  ++K+ +  +A   ++    ++P    ++    + 
Subjt:  GEAIAAVLNVQLVESHDKYLGLPAGFGGSKAVV------LKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVL--QAFPTYSMSFFRLPKSLVHDCNRMMA

Query:  RFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRW-----RLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWK------SLM
        +F W           L  K+       GG+   DL+L+ +A++ K  W     R ++  + +    +    Y    F    DKG+ +  WK      +  
Subjt:  RFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRW-----RLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWK------SLM

Query:  WRRSLLAEGLRWRVGDGRS-IRVVGDRWIP----RPPTLKLI
        W   LL+   R R+    S    V  +WI     +P TLKLI
Subjt:  WRRSLLAEGLRWRVGDGRS-IRVVGDRWIP----RPPTLKLI

P93295 Uncharacterized mitochondrial protein AtMg003106.6e-3043.05Show/hide
Query:  AFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSK-FEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLK
        A P Y+MS FRL K L       M  FWW   +  R+I W++W+K+C SK  +GGLGFRDL  FN+ALLAKQ +R+I+ P +L+ ++L+ +Y+   S ++
Subjt:  AFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSK-FEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLK

Query:  AKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWI----PRPP
               SY W+S++  R LL+ GL   +GDG   +V  DRWI    P PP
Subjt:  AKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWI----PRPP

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.2e-1123.3Show/hide
Query:  KYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSK
        +YLGLP             + +KI   I +W     S  G+  LI  V+ +   + MS FRLP + + + + + + F W   +   +   ++W  +C  K
Subjt:  KYLGLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSK

Query:  FEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPPTLKL
         EGGLG R L+  NK       W       S+ G    G                 S++WK ++  R+L +  ++  + +G +     D W      + +
Subjt:  FEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPPTLKL

Query:  IHNGGC
          + GC
Subjt:  IHNGGC

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.9e-1234.91Show/hide
Query:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR
        +I P Q++F+PGR   DN++   E +H +++K KG  GW  LKLD+ KAYDR+ W +LE  + + G      E+ +  ++  ++  R     VGR   S+
Subjt:  MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSR

Query:  GLRQRD
          R  D
Subjt:  GLRQRD

AT4G29090.1 Ribonuclease H-like superfamily protein1.3e-2838.36Show/hide
Query:  AFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKA
        A PTY+M+ F LPK++      ++A FWW      + +HW +W  +   K EGG+GF+D+E FN ALL KQ WR+++ P+SL+ KV K +Y+     L A
Subjt:  AFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKA

Query:  KDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPP
              S+VWKS+   + +L +G R  VG+G  I +   +W+   P
Subjt:  KDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.7e-3143.05Show/hide
Query:  AFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSK-FEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLK
        A P Y+MS FRL K L       M  FWW   +  R+I W++W+K+C SK  +GGLGFRDL  FN+ALLAKQ +R+I+ P +L+ ++L+ +Y+   S ++
Subjt:  AFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSK-FEGGLGFRDLELFNKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLK

Query:  AKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWI----PRPP
               SY W+S++  R LL+ GL   +GDG   +V  DRWI    P PP
Subjt:  AKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWI----PRPP

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.2e-1551.47Show/hide
Query:  FRLNGQVVGRIVPSRGLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADD
        F +NG   G + PSRGLRQ DP+SPYLF+LC E LS +    +E  R+ G+R++   P I+HL FADD
Subjt:  FRLNGQVVGRIVPSRGLRQRDPISPYLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTCCCCCACTCAAAGTGCCTTTGTTCCAGGACGGAATATTTGTGATAATGTTATTCTAGGGTTTGAGTGCATGCATTATCTAAAGCAGAAGAGAAAGGGGAAGAC
GGGGTGGGCGGCCTTGAAACTCGACATGAGCAAGGCTTATGATCGGGTAGAGTGGTTCTTTTTAGAGAAGTTCATGGGGGCACTGGGCATTGATGGGAGAGTGGTGGAGC
TTATCATGAGGTGTGTGAGTACTGTCTCATATTCTTTTAGATTGAATGGCCAGGTGGTGGGGAGGATTGTCCCTTCTAGGGGCCTCCGCCAAAGGGACCCCATTTCTCCT
TACCTATTCTTATTGTGTGTTGAAGGGTTGTCGAGGATGTTGAATTGGTTGGAGGAGGATAAGAGGATGACGGGGGTAAGGATTGCCAGGGGTTGCCCTTCCATATCCCA
CTTATTTTTTGCAGATGACTGTTTGTTGTTCTTTAAGGCGAATATAAGGGAAGCGCAAATGGTGGCTAAGGTGTTGGAGACCTATGCTATTATTACTGGTCAAGAGATTA
ATTACAAGAAGTATGGGCTTTGCCTTAGCCCAAACGTGGATGAGCAAGTAGGGGAGGCGATTGCTGCTGTTTTGAATGTTCAGTTAGTTGAGTCTCATGACAAGTATTTG
GGATTGCCTGCAGGGTTTGGGGGTAGTAAGGCAGTAGTCCTGAAACAAATAAAGGATAAGATCTGGTTTTACATCCAGAGGTGGAGGCATACGTGTTTTTCGGTGGGTGG
GAAGGAGGTTCTTATAAAGGTTGTGCTGCAAGCATTCCCGACCTATTCAATGTCTTTTTTTCGCCTACCGAAAAGTCTAGTTCATGATTGTAACCGAATGATGGCCAGAT
TCTGGTGGGGAGAGGATGATGGGTTTAGGAGGATACACTGGTTGTCGTGGAAGAAGATGTGTATGTCTAAGTTTGAGGGTGGGCTAGGCTTTCGGGACTTGGAGCTGTTT
AATAAGGCTCTCCTAGCAAAACAGAGGTGGAGGCTTATTAATGACCCCGACTCTCTGATTGGAAAGGTCTTGAAGGGGAAGTACTACCGTGATTGCTCCTTCCTGAAAGC
GAAGGATAAGGGGAGTGCCTCCTATGTGTGGAAGAGTTTGATGTGGAGGAGGAGTTTGCTTGCTGAGGGGCTTAGATGGAGGGTGGGGGATGGGAGGTCAATAAGGGTTG
TGGGAGATAGGTGGATTCCTCGACCCCCTACTCTGAAGCTTATTCACAATGGGGGGTGTGCCCGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTCCCCCACTCAAAGTGCCTTTGTTCCAGGACGGAATATTTGTGATAATGTTATTCTAGGGTTTGAGTGCATGCATTATCTAAAGCAGAAGAGAAAGGGGAAGAC
GGGGTGGGCGGCCTTGAAACTCGACATGAGCAAGGCTTATGATCGGGTAGAGTGGTTCTTTTTAGAGAAGTTCATGGGGGCACTGGGCATTGATGGGAGAGTGGTGGAGC
TTATCATGAGGTGTGTGAGTACTGTCTCATATTCTTTTAGATTGAATGGCCAGGTGGTGGGGAGGATTGTCCCTTCTAGGGGCCTCCGCCAAAGGGACCCCATTTCTCCT
TACCTATTCTTATTGTGTGTTGAAGGGTTGTCGAGGATGTTGAATTGGTTGGAGGAGGATAAGAGGATGACGGGGGTAAGGATTGCCAGGGGTTGCCCTTCCATATCCCA
CTTATTTTTTGCAGATGACTGTTTGTTGTTCTTTAAGGCGAATATAAGGGAAGCGCAAATGGTGGCTAAGGTGTTGGAGACCTATGCTATTATTACTGGTCAAGAGATTA
ATTACAAGAAGTATGGGCTTTGCCTTAGCCCAAACGTGGATGAGCAAGTAGGGGAGGCGATTGCTGCTGTTTTGAATGTTCAGTTAGTTGAGTCTCATGACAAGTATTTG
GGATTGCCTGCAGGGTTTGGGGGTAGTAAGGCAGTAGTCCTGAAACAAATAAAGGATAAGATCTGGTTTTACATCCAGAGGTGGAGGCATACGTGTTTTTCGGTGGGTGG
GAAGGAGGTTCTTATAAAGGTTGTGCTGCAAGCATTCCCGACCTATTCAATGTCTTTTTTTCGCCTACCGAAAAGTCTAGTTCATGATTGTAACCGAATGATGGCCAGAT
TCTGGTGGGGAGAGGATGATGGGTTTAGGAGGATACACTGGTTGTCGTGGAAGAAGATGTGTATGTCTAAGTTTGAGGGTGGGCTAGGCTTTCGGGACTTGGAGCTGTTT
AATAAGGCTCTCCTAGCAAAACAGAGGTGGAGGCTTATTAATGACCCCGACTCTCTGATTGGAAAGGTCTTGAAGGGGAAGTACTACCGTGATTGCTCCTTCCTGAAAGC
GAAGGATAAGGGGAGTGCCTCCTATGTGTGGAAGAGTTTGATGTGGAGGAGGAGTTTGCTTGCTGAGGGGCTTAGATGGAGGGTGGGGGATGGGAGGTCAATAAGGGTTG
TGGGAGATAGGTGGATTCCTCGACCCCCTACTCTGAAGCTTATTCACAATGGGGGGTGTGCCCGGAGATGA
Protein sequenceShow/hide protein sequence
MISPTQSAFVPGRNICDNVILGFECMHYLKQKRKGKTGWAALKLDMSKAYDRVEWFFLEKFMGALGIDGRVVELIMRCVSTVSYSFRLNGQVVGRIVPSRGLRQRDPISP
YLFLLCVEGLSRMLNWLEEDKRMTGVRIARGCPSISHLFFADDCLLFFKANIREAQMVAKVLETYAIITGQEINYKKYGLCLSPNVDEQVGEAIAAVLNVQLVESHDKYL
GLPAGFGGSKAVVLKQIKDKIWFYIQRWRHTCFSVGGKEVLIKVVLQAFPTYSMSFFRLPKSLVHDCNRMMARFWWGEDDGFRRIHWLSWKKMCMSKFEGGLGFRDLELF
NKALLAKQRWRLINDPDSLIGKVLKGKYYRDCSFLKAKDKGSASYVWKSLMWRRSLLAEGLRWRVGDGRSIRVVGDRWIPRPPTLKLIHNGGCARR