; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015896 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015896
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:28372100..28377452
RNA-Seq ExpressionLag0015896
SyntenyLag0015896
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]5.6e-10139.75Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        +A   +   +  L   YE+ +GQ INY KS ++ SPN                               G+G +Q F+  K        G + ++ S AGK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        E+L+K+++QAIP Y+M+CFR+P+GL KE++  MA+FWW  ++D R IHW+ WE LC  K  GGLGFRD+E FNQALLAKQCWR+L+ P SL+  + + RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
         P   FLEA +G+ PSFIWRSL WG+ELL +G RWR+G+G S  +Y   WLP     ++ S P LP  + V DLFT SG WN  +L   F + + +AIL+
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY
        IPL    G D LIWH+E++G++S+KSGYRLA +   +    PS    + + +W   W L +P+K KFFLWR   D LP    L  R +    +C  C   
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY

Query:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ
         E  LH  W C   + +W  S +  + + +    F E+  A++ S  G +  L     W +WN  NS  + G+
Subjt:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]5.6e-9341.45Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        +A       +  L   YE+ +GQ INY KS ++ SPN                               G+G +Q F+  K        G + ++ S AGK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        E+L+K+++QAIP Y+M+CF++P+GL KE++  MA+FWW  ++D R IHW+ WE LC  K  GGLGFRD+E FNQALLAKQCWR+L+ P SL+  + + RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
         P   FLEA +G+ PSFIW SL WG+ELL +G RWR+G+G S  +Y   WLP     ++ S P LP  + V DLFT SG WN  +L   F + + +AIL+
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY
        IPL    G D LIWH+E++G++S+KSGYRLA +   +    PS    + + +W   W L +P+K KFFLWR   D LP    L  R +    +C  C   
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY

Query:  VEDRLHLFWKCSVVR
         E  LH  W C   +
Subjt:  VEDRLHLFWKCSVVR

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]1.2e-10040.17Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        +A       +  L   YE+ SGQ INY KS  + SPN                               G+G +Q F+  K        G + ++ S AGK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        E+L+K+++QAIP Y+M+CFR+P+GL KE++  MA+FWW  ++D R IHW+ WE LC  K  GGLGFRD+E FNQALLAKQCWR+L+ P SL+  + + RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
         P   FLEA +G+ PSFIWRSL WG+ELL +G RWR+GNG S  +Y   WLP     ++ S P LP  ++V DLFT SG WN  +L   F + + +A L+
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPS-NSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY
        IPL    G D LIWH+E++G++S+KSGYRLA +   +    PS   D    +W   W L +P+K KFFLWR   D LP    L  R +    +C  C   
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPS-NSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY

Query:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ
         E  LH  W C   + +W  S +  + + +    F E+  A++ S  G +  L     W +WN  NS  + G+
Subjt:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]7.8e-11149.09Show/hide
Query:  GLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDP
        G + ++FSI GKEVL+K++ QAIPCYTM+CFRLP+ L +E H   A+FWW  S++ ++IHW++W SL LPKC GG+GFRD+ELFN+ALLAKQCWR+L  P
Subjt:  GLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDP

Query:  SSLLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFT-VSGGWNEAMLM
        +S+L  VLKGRYF    F+EA +   PS+IWRS+LWGR+LL +G RWRIGNG S  IYG NW+PN  +L++ S P LP  S VS L     GGW   ++ 
Subjt:  SSLLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFT-VSGGWNEAMLM

Query:  AHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHM-LAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRG
          F+  + + IL IP+  G  EDRLIW++EK GV+S++SGY++A +       PS S+S+ +R WW+ FW++++P+K K FLWRL  DRLPT  NL KRG
Subjt:  AHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHM-LAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRG

Query:  VNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSW
        V + + C  C    ED +HLFW C     +W  SKF  L           I+    ESL  +DFE + +  W +WN  N+ ++
Subjt:  VNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSW

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]2.8e-9237.29Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPNT------------------------------GEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        RA   +   ++ L   Y KASGQ  N+EKS + FS  T                              G  ++  F   K   L      + + F+  GK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPNT------------------------------GEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        EVL+K++ QAIP Y M+ F++P GL ++I   MA+FWW   +D + IHW  WE +   K  GG+GFRD+  FNQAL+AKQ WR++Q PSSL+  VLK RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
        F  +GF+ AGLGS+PSF+WRS++WGR++L +G RWRIGNG++  +YG+NW+P   + +  S P++  D+ V++L      W E +++ HF   D EAI++
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDYV
        IPL     ED+LIWH++K G +S+KSGY++A  +     PS SN D  +  W   W+L +P K K FLWR  HD LPT  NL K+ V    +C  C  +V
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDYV

Query:  EDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ
        E   H   +C+  R +W  S  A   +  +      ++          +   V    W++W   N   + G+
Subjt:  EDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein2.7e-10139.75Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        +A   +   +  L   YE+ +GQ INY KS ++ SPN                               G+G +Q F+  K        G + ++ S AGK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        E+L+K+++QAIP Y+M+CFR+P+GL KE++  MA+FWW  ++D R IHW+ WE LC  K  GGLGFRD+E FNQALLAKQCWR+L+ P SL+  + + RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
         P   FLEA +G+ PSFIWRSL WG+ELL +G RWR+G+G S  +Y   WLP     ++ S P LP  + V DLFT SG WN  +L   F + + +AIL+
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY
        IPL    G D LIWH+E++G++S+KSGYRLA +   +    PS    + + +W   W L +P+K KFFLWR   D LP    L  R +    +C  C   
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY

Query:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ
         E  LH  W C   + +W  S +  + + +    F E+  A++ S  G +  L     W +WN  NS  + G+
Subjt:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ

A0A5E4FZN9 PREDICTED: retrotransposon6.0e-10140.17Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        +A       +  L   YE+ SGQ INY KS  + SPN                               G+G +Q F+  K        G + ++ S AGK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        E+L+K+++QAIP Y+M+CFR+P+GL KE++  MA+FWW  ++D R IHW+ WE LC  K  GGLGFRD+E FNQALLAKQCWR+L+ P SL+  + + RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
         P   FLEA +G+ PSFIWRSL WG+ELL +G RWR+GNG S  +Y   WLP     ++ S P LP  ++V DLFT SG WN  +L   F + + +A L+
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPS-NSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY
        IPL    G D LIWH+E++G++S+KSGYRLA +   +    PS   D    +W   W L +P+K KFFLWR   D LP    L  R +    +C  C   
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPS-NSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY

Query:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ
         E  LH  W C   + +W  S +  + + +    F E+  A++ S  G +  L     W +WN  NS  + G+
Subjt:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ

A0A6J1DAR4 uncharacterized protein LOC1110189543.8e-11149.09Show/hide
Query:  GLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDP
        G + ++FSI GKEVL+K++ QAIPCYTM+CFRLP+ L +E H   A+FWW  S++ ++IHW++W SL LPKC GG+GFRD+ELFN+ALLAKQCWR+L  P
Subjt:  GLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDP

Query:  SSLLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFT-VSGGWNEAMLM
        +S+L  VLKGRYF    F+EA +   PS+IWRS+LWGR+LL +G RWRIGNG S  IYG NW+PN  +L++ S P LP  S VS L     GGW   ++ 
Subjt:  SSLLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFT-VSGGWNEAMLM

Query:  AHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHM-LAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRG
          F+  + + IL IP+  G  EDRLIW++EK GV+S++SGY++A +       PS S+S+ +R WW+ FW++++P+K K FLWRL  DRLPT  NL KRG
Subjt:  AHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHM-LAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRG

Query:  VNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSW
        V + + C  C    ED +HLFW C     +W  SKF  L           I+    ESL  +DFE + +  W +WN  N+ ++
Subjt:  VNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSW

A0A803Q1K6 Uncharacterized protein1.7e-9537.37Show/hide
Query:  ANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGKE
        AN       + LL  Y  ASGQ +NY KS   F  N                               G   ++  +  K    A   G +  +FS+AGKE
Subjt:  ANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGKE

Query:  VLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYF
        VL+K+IVQAIP YTM+CF+LP+     +H   ++FWW  S+  ++IHW  W  LC PK  GGLGFRD+ +FNQALLAKQ WR L+ P  L   VLK  YF
Subjt:  VLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYF

Query:  PQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILRI
        P+ G LEAG G+  SF+ RSL+WG++L+++G RWR+GNG S  +    WLP   + +V   P+LP +  V+DL    G W+E  + + F+ +D + IL I
Subjt:  PQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILRI

Query:  PLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDYVE
        P      ED+++WH+ K+G +S+KSGYR+A  L        SN   I  WW   WRLN P K K F+W++ H+ LP  VNL KRG+    +C  C  +VE
Subjt:  PLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDYVE

Query:  DRL-HLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGG
        + + H  W+C   +G W  S      +         ++  +         E  ++  W++WN+ N++  GG
Subjt:  DRL-HLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGG

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)2.7e-10139.75Show/hide
Query:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK
        +A   +   +  L   YE+ +GQ INY KS ++ SPN                               G+G +Q F+  K        G + ++ S AGK
Subjt:  RANGSEASVIRDLLIWYEKASGQTINYEKSVVAFSPN------------------------------TGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGK

Query:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY
        E+L+K+++QAIP Y+M+CFR+P+GL KE++  MA+FWW  ++D R IHW+ WE LC  K  GGLGFRD+E FNQALLAKQCWR+L+ P SL+  + + RY
Subjt:  EVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRY

Query:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR
         P   FLEA +G+ PSFIWRSL WG+ELL +G RWR+G+G S  +Y   WLP     ++ S P LP  + V DLFT SG WN  +L   F + + +AIL+
Subjt:  FPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILR

Query:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY
        IPL    G D LIWH+E++G++S+KSGYRLA +   +    PS    + + +W   W L +P+K KFFLWR   D LP    L  R +    +C  C   
Subjt:  IPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRA-WWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCCDY

Query:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ
         E  LH  W C   + +W  S +  + + +    F E+  A++ S  G +  L     W +WN  NS  + G+
Subjt:  VEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQ

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657505.0e-3626.34Show/hide
Query:  GLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDP
        G   +  S AG+  L K+++ ++P ++M+   LP+ +   +      F W  + + ++ H + W  +C PK  GGLG R  +  N+AL++K  WR+LQ+ 
Subjt:  GLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDP

Query:  SSLLCSVLKGRYFPQSGFLEAGLGSRP----SFIWRSLLWG-RELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQS--VPTLPGDSMVSDLFTVSGGW
        +SL   VL+ +Y    G +       P    S  WRS+  G R+++  G  W  G+G+    +   W+     L++ +   PT     +  DL+    GW
Subjt:  SSLLCSVLKGRYFPQSGFLEAGLGSRP----SFIWRSLLWG-RELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQS--VPTLPGDSMVSDLFTVSGGW

Query:  NEAMLMAHFSESDCEAILRIPLRHGLG-EDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKV
        + A +  + + +    +  + L    G  DRL W F + G FS++S Y    ML     P P+    + ++++  W++ VP + K FLW + +  + T+ 
Subjt:  NEAMLMAHFSESDCEAILRIPLRHGLG-EDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKV

Query:  NLLKRGVNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFH-FEEIIGAMK-----ESLPGSDFELVVIFWWSVWNLLN
           +R ++  ++C +C   VE  LH+   C    G+W         Q FF    FE +   +      E +P S    V+I+W   W   N
Subjt:  NLLKRGVNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFH-FEEIIGAMK-----ESLPGSDFELVVIFWWSVWNLLN

P93295 Uncharacterized mitochondrial protein AtMg003105.9e-3748.25Show/hide
Query:  AIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPK-CLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLE
        A+P Y M+CFRL + L K++ S M +FWW+  E+ R+I W++W+ LC  K   GGLGFRD+  FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E
Subjt:  AIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPK-CLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLE

Query:  AGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWL
          +G+RPS+ WRS++ GRELL RG    IG+G  T ++   W+
Subjt:  AGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWL

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.8e-1225.11Show/hide
Query:  LLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHF
        LL S L   +  Q  F      +  S+IWR L   RE+        +G+G +   +  NW  +G           P   +V  L   + G          
Subjt:  LLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDLFTVSGGWNEAMLMAHF

Query:  SESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVP
             +A+  I  +H   +D  IW  + H   ++ S  + +  L       P N   I  W+ + W  N   KH F  W +  +RL T+  L   G+++P
Subjt:  SESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVP

Query:  SLCVLCCDYVEDRLHLFWKCSVVRGMW
        ++C+LC  + E R HLF++C     +W
Subjt:  SLCVLCCDYVEDRLHLFWKCSVVRGMW

AT3G09510.1 Ribonuclease H-like superfamily protein6.5e-2326.83Show/hide
Query:  LKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLP-------GDSMVSDLFTVSGG---WNEAM
        +K RYF     L+A +  + S+ W SLL G  LL +G R  IG+G++  I        G    V S P  P        +  +++LF   G    W+++ 
Subjt:  LKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLP-------GDSMVSDLFTVSGG---WNEAM

Query:  LMAHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKR
        +     +SD   I RI L      D++IW++   G ++++SGY L     +   P+ +   G     +  W L +  K K FLWR L   L T   L  R
Subjt:  LMAHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKR

Query:  GVNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAM---KESLPGSDFELVVIFW--WSVWNLLNSM
        G+ +   C  C    E   H  + C      W  S  + +        FEE I  +    +    SDF  ++  W  W +W   N++
Subjt:  GVNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAM---KESLPGSDFELVVIFW--WSVWNLLNSM

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.4e-2226.85Show/hide
Query:  SIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSV
        S AG+  L+ S++ ++  + M+ FRLP    KEI S  + F W+G E   +   ++W  +C PK  GGLG R ++  N                      
Subjt:  SIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSV

Query:  LKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDL-FTVSGGWNEAMLMAHFSESD
         KG ++  SG    G     S++W+ +L  R L     +  I NG +T  +  NW   G  + V       G     D+  T+     EA++        
Subjt:  LKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPTLPGDSMVSDL-FTVSGGWNEAMLMAHFSESD

Query:  CEAILRI-----PLRH-GL--GEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRG
         + +LRI      +RH GL  GED + W   K      K  +      AA   P    +     W+   W  +   K+    W  + +RL T   +L   
Subjt:  CEAILRI-----PLRH-GL--GEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRG

Query:  VNVPSLCVLCCDYVEDRLHLFWKC
            S CVLC   VE R HLF+ C
Subjt:  VNVPSLCVLCCDYVEDRLHLFWKC

AT4G29090.1 Ribonuclease H-like superfamily protein1.0e-5532.75Show/hide
Query:  AIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLEA
        A+P YTM CF LP+ + K+I S +A FWW   ++ + +HW +W+ L   K  GG+GF+D+E FN ALL KQ WR+L  P SL+  V K RYF +S  L A
Subjt:  AIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLEA

Query:  GLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWL---PNGFSLQVQSVPTLPGDSM-----VSDLFTVSG-GWNEAMLMAHFSESDCEAI--
         LGSRPSF+W+S+   +E+L +G R  +GNG    I+   WL   P   +L++Q VP     S+     VSDL   SG  W + ++   F E + + I  
Subjt:  GLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWL---PNGFSLQVQSVPTLPGDSM-----VSDLFTVSG-GWNEAMLMAHFSESDCEAI--

Query:  LRIPLRHGLGEDRLIWHFEKHGVFSLKSGY-RLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCC
        LR   R  L  D   W +   G +++KSGY  L  ++  R  P   +   +   +   W+     K + FLW+ L + LP    L  R ++  S C+ C 
Subjt:  LRIPLRHGLGEDRLIWHFEKHGVFSLKSGY-RLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHDRLPTKVNLLKRGVNVPSLCVLCC

Query:  DYVEDRLHLFWKCSVVRGMWNCSKF-APLHQSFFHFHFEEIIGAMKESLPGSDFE----LVVIFWWSVWNLLNSMSW-GGQFDARDLWAFSSDYLRAFHM
           E   HL +KC+  R  W  S    PL   +    +  +            +E    LV    W +W   N + + G +F+A+++   + D L  + +
Subjt:  DYVEDRLHLFWKCSVVRGMWNCSKF-APLHQSFFHFHFEEIIGAMKESLPGSDFE----LVVIFWWSVWNLLNSMSW-GGQFDARDLWAFSSDYLRAFHM

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.2e-3848.25Show/hide
Query:  AIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPK-CLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLE
        A+P Y M+CFRL + L K++ S M +FWW+  E+ R+I W++W+ LC  K   GGLGFRD+  FNQALLAKQ +R++  P +LL  +L+ RYFP S  +E
Subjt:  AIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWESLCLPK-CLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLE

Query:  AGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWL
          +G+RPS+ WRS++ GRELL RG    IG+G  T ++   W+
Subjt:  AGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGGTGAACCCCTCTATTTATAGAGTTCTCGCAGGATGGGCCCTGTTCACAACCGTCGCTGCTCGCCGCCATCGCTCACCACCACACCACTGCCACACCACCGCCGC
CGCTCACCGCCACACCACTGCCGCCGCTCACCGCCACACTACCGCCGTCGCTCACCGCCGCAACTTGCCGCCACCCTTGCCCGTTGATTCCGACAAGGTTCCTAAGAGGG
AAGGAAAAGATAAAAGAGGAGGGAAGGTGGAAGAGTTAAAAGGAAAAAAGAGGGAAGGAAAAGAAAAAAGGGCCAATGGGAGTGAAGCGTCGGTTATTCGAGACCTGCTG
ATATGGTATGAGAAGGCATCAGGGCAGACTATCAATTACGAGAAGTCGGTTGTTGCTTTTAGCCCGAATACAGGAGAAGGGTCCCAACAGGACTTTGAAGTTTATAAAGG
ACCGTATCTGGCGTCAGATCCAGGGTTGGAAGGGCAAGTTTTTTCGATAGCAGGGAAAGAAGTCCTCCTTAAATCTATAGTGCAGGCTATTCCTTGTTATACGATGAACT
GCTTTCGGTTGCCTAGGGGTTTGACAAAGGAGATCCACAGTACCATGGCCAAGTTTTGGTGGAATGGGTCCGAGGATACGAGGCGAATACATTGGATGAGTTGGGAGTCG
CTATGCCTTCCCAAATGCTTGGGTGGGTTGGGTTTTCGTGATATGGAACTTTTCAACCAAGCCCTGTTGGCTAAGCAATGCTGGCGTGTTCTCCAGGATCCTTCTTCGCT
TTTGTGCTCTGTGCTTAAGGGCCGTTATTTTCCCCAATCAGGTTTCTTGGAGGCAGGTCTGGGTTCACGACCGTCTTTTATCTGGCGCAGCTTGTTATGGGGGCGGGAGC
TCTTGGTTCGAGGGTGCCGCTGGAGGATAGGTAACGGCCGATCTACACCTATTTATGGCTCGAACTGGCTGCCTAATGGGTTCTCTCTTCAAGTGCAGTCGGTTCCGACG
CTTCCTGGGGATAGTATGGTTAGTGATCTATTTACTGTGTCCGGTGGTTGGAATGAGGCTATGCTCATGGCCCATTTCAGTGAGTCTGACTGTGAGGCAATCTTGAGAAT
CCCATTACGACATGGTTTGGGGGAGGATCGGTTAATTTGGCATTTTGAGAAGCATGGGGTTTTTTCTTTGAAAAGTGGGTATCGGTTGGCTCATATGTTGGCCGCTCGGG
GTCGACCTTCACCTTCGAACTCTGATGGGATTCGCGCGTGGTGGTCTAGTTTTTGGAGGCTGAATGTGCCTAGCAAGCATAAGTTCTTTCTATGGCGGTTGCTCCATGAT
CGTCTGCCTACTAAGGTGAACCTTCTCAAACGTGGCGTCAATGTCCCGAGTCTGTGTGTTTTGTGCTGTGATTATGTTGAGGATCGCCTCCATCTGTTCTGGAAGTGCTC
GGTGGTCAGGGGTATGTGGAACTGCTCCAAGTTTGCTCCACTTCATCAGTCATTTTTCCATTTCCATTTCGAGGAGATCATTGGGGCGATGAAGGAAAGCCTCCCGGGTT
CGGATTTTGAGCTTGTGGTCATCTTCTGGTGGTCTGTGTGGAATCTTCTGAATAGCATGAGTTGGGGCGGTCAGTTCGATGCTCGAGACTTATGGGCTTTTTCGAGTGAC
TACCTTCGTGCTTTTCATATGGCTGGGGGCGTTGCCTGGCGAGGGATGCCTCGCGGGGCCAGCTGGGTGGTCATGAGGAGCGCTGTGTGTGGAGGCCGCCTCCTGCTGGG
GAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGGTGAACCCCTCTATTTATAGAGTTCTCGCAGGATGGGCCCTGTTCACAACCGTCGCTGCTCGCCGCCATCGCTCACCACCACACCACTGCCACACCACCGCCGC
CGCTCACCGCCACACCACTGCCGCCGCTCACCGCCACACTACCGCCGTCGCTCACCGCCGCAACTTGCCGCCACCCTTGCCCGTTGATTCCGACAAGGTTCCTAAGAGGG
AAGGAAAAGATAAAAGAGGAGGGAAGGTGGAAGAGTTAAAAGGAAAAAAGAGGGAAGGAAAAGAAAAAAGGGCCAATGGGAGTGAAGCGTCGGTTATTCGAGACCTGCTG
ATATGGTATGAGAAGGCATCAGGGCAGACTATCAATTACGAGAAGTCGGTTGTTGCTTTTAGCCCGAATACAGGAGAAGGGTCCCAACAGGACTTTGAAGTTTATAAAGG
ACCGTATCTGGCGTCAGATCCAGGGTTGGAAGGGCAAGTTTTTTCGATAGCAGGGAAAGAAGTCCTCCTTAAATCTATAGTGCAGGCTATTCCTTGTTATACGATGAACT
GCTTTCGGTTGCCTAGGGGTTTGACAAAGGAGATCCACAGTACCATGGCCAAGTTTTGGTGGAATGGGTCCGAGGATACGAGGCGAATACATTGGATGAGTTGGGAGTCG
CTATGCCTTCCCAAATGCTTGGGTGGGTTGGGTTTTCGTGATATGGAACTTTTCAACCAAGCCCTGTTGGCTAAGCAATGCTGGCGTGTTCTCCAGGATCCTTCTTCGCT
TTTGTGCTCTGTGCTTAAGGGCCGTTATTTTCCCCAATCAGGTTTCTTGGAGGCAGGTCTGGGTTCACGACCGTCTTTTATCTGGCGCAGCTTGTTATGGGGGCGGGAGC
TCTTGGTTCGAGGGTGCCGCTGGAGGATAGGTAACGGCCGATCTACACCTATTTATGGCTCGAACTGGCTGCCTAATGGGTTCTCTCTTCAAGTGCAGTCGGTTCCGACG
CTTCCTGGGGATAGTATGGTTAGTGATCTATTTACTGTGTCCGGTGGTTGGAATGAGGCTATGCTCATGGCCCATTTCAGTGAGTCTGACTGTGAGGCAATCTTGAGAAT
CCCATTACGACATGGTTTGGGGGAGGATCGGTTAATTTGGCATTTTGAGAAGCATGGGGTTTTTTCTTTGAAAAGTGGGTATCGGTTGGCTCATATGTTGGCCGCTCGGG
GTCGACCTTCACCTTCGAACTCTGATGGGATTCGCGCGTGGTGGTCTAGTTTTTGGAGGCTGAATGTGCCTAGCAAGCATAAGTTCTTTCTATGGCGGTTGCTCCATGAT
CGTCTGCCTACTAAGGTGAACCTTCTCAAACGTGGCGTCAATGTCCCGAGTCTGTGTGTTTTGTGCTGTGATTATGTTGAGGATCGCCTCCATCTGTTCTGGAAGTGCTC
GGTGGTCAGGGGTATGTGGAACTGCTCCAAGTTTGCTCCACTTCATCAGTCATTTTTCCATTTCCATTTCGAGGAGATCATTGGGGCGATGAAGGAAAGCCTCCCGGGTT
CGGATTTTGAGCTTGTGGTCATCTTCTGGTGGTCTGTGTGGAATCTTCTGAATAGCATGAGTTGGGGCGGTCAGTTCGATGCTCGAGACTTATGGGCTTTTTCGAGTGAC
TACCTTCGTGCTTTTCATATGGCTGGGGGCGTTGCCTGGCGAGGGATGCCTCGCGGGGCCAGCTGGGTGGTCATGAGGAGCGCTGTGTGTGGAGGCCGCCTCCTGCTGGG
GAGCTAA
Protein sequenceShow/hide protein sequence
MVVNPSIYRVLAGWALFTTVAARRHRSPPHHCHTTAAAHRHTTAAAHRHTTAVAHRRNLPPPLPVDSDKVPKREGKDKRGGKVEELKGKKREGKEKRANGSEASVIRDLL
IWYEKASGQTINYEKSVVAFSPNTGEGSQQDFEVYKGPYLASDPGLEGQVFSIAGKEVLLKSIVQAIPCYTMNCFRLPRGLTKEIHSTMAKFWWNGSEDTRRIHWMSWES
LCLPKCLGGLGFRDMELFNQALLAKQCWRVLQDPSSLLCSVLKGRYFPQSGFLEAGLGSRPSFIWRSLLWGRELLVRGCRWRIGNGRSTPIYGSNWLPNGFSLQVQSVPT
LPGDSMVSDLFTVSGGWNEAMLMAHFSESDCEAILRIPLRHGLGEDRLIWHFEKHGVFSLKSGYRLAHMLAARGRPSPSNSDGIRAWWSSFWRLNVPSKHKFFLWRLLHD
RLPTKVNLLKRGVNVPSLCVLCCDYVEDRLHLFWKCSVVRGMWNCSKFAPLHQSFFHFHFEEIIGAMKESLPGSDFELVVIFWWSVWNLLNSMSWGGQFDARDLWAFSSD
YLRAFHMAGGVAWRGMPRGASWVVMRSAVCGGRLLLGS