CuGenDBv2

Spg015060 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015060
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: DUF4283 domain-containing protein
Genome location: scaffold3:31733801..31736380
RNA-Seq Expression: Spg015060
Synteny: Spg015060
Gene Ontology terms: GO:0006281 - DNA repair (biological process)
GO:0004518 - nuclease activity (molecular function)
InterPro domains: IPR004808 - AP endonuclease 1
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0041398.1 hypothetical protein E6C27_scaffold206G00440 [Cucumis melo var. makuwa] | 2.9e-51 | 28.74
Query:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL
        R C  E+K F +   K   +  + I+E      F I + +    W+      LL TP + +FF +    +  +W QK+ N+RG  AEI +V    R+  +
Subjt:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL

Query:  IIPQGEDSTGW-----------------------------------------------KAFLAMVNDFF----------NVSECRR--------------
        ++P+G D TGW                                               K +  +V+ F           N S  +R              
Subjt:  IIPQGEDSTGW-----------------------------------------------KAFLAMVNDFF----------NVSECRR--------------

Query:  ---INWDQTLVITRRCFHDEWSKIFDTLRNAFQKD---LIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLR
           I+W++T++++RRCFHD+W KI D LR    K        PFH DKALL   D D A+++  N GW+ +G F +KFE W+  +H    V+PSYG W R
Subjt:  ---INWDQTLVITRRCFHDEWSKIFDTLRNAFQKD---LIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLR

Query:  FQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFG
        F+ IPL  W L+TF  IG+  GGFI+   + ++ +   E +I+V++NY GF+PA  ++   +G V I Q+VT  + + L  R   IHG F   AA     
Subjt:  FQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFG

Query:  EDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKDKAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWR
          E  +E  P  +         F+   A+    D+S S K G +    D+ +  T     + +K    +GS  +  K E  +  P  +      KK+V+R
Subjt:  EDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKDKAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWR

Query:  EKDLSDKG
         K  S +G
Subjt:  EKDLSDKG

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa] | 1.1e-53 | 25.83
Query:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL
        R C  E+K F +   K   +  + I+E      F I +      W+      LL TP + +FF +    +  +W Q + N+RG  AEI +V    R+  +
Subjt:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL

Query:  IIPQGEDSTGWKAFLAMV-------------NDFFN--------------------------------------VSECRRINWDQTLVITRRCFHDEWSK
        ++P+G D TGW  F  M+               ++N                                       S+C +     T  + RRCFHD+W+K
Subjt:  IIPQGEDSTGWKAFLAMV-------------NDFFN--------------------------------------VSECRRINWDQTLVITRRCFHDEWSK

Query:  IFDTLRN-AFQKDLIIN--PFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGG
        I D LR+   +KD      PFH DKALL   D + A+++  N GW+ +G F +KFE W+   H    V+PSYG W RF+ IPL  W L+TF  IG+ YGG
Subjt:  IFDTLRN-AFQKDLIIN--PFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGG

Query:  FIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRF
        FI+   + ++ +   E +I+V++NY GF+PA  ++   +G   I Q VT    + L  R   IHG F+  AA       E  +E  P  +         F
Subjt:  FIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRF

Query:  QIQAALNQAGDVSISYKRGFQSFEKDKAENMT--------------------SRKGVSEEKMRPRVGSSSTSEKEENN---TNEPRKAVGPTREKKQVWR
        +   A+    D+  S K+G +    D+    T                    S    S EK++ +   S  +EK++       E  + + P   K++V  
Subjt:  QIQAALNQAGDVSISYKRGFQSFEKDKAENMT--------------------SRKGVSEEKMRPRVGSSSTSEKEENN---TNEPRKAVGPTREKKQVWR

Query:  EKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECL--GQNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPIL
            S K  +F   + +   +         +A  E +     Q+ K K+  V S S E               RE S    S QK       ++  + + 
Subjt:  EKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECL--GQNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPIL

Query:  KETIEILWQNGLCIRPIPKKIGGGASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEG
           I +L          P  I    S         +K    ++      D+ +  KE +R         ++ ++ F  R +   +  +     + FNS+ 
Subjt:  KETIEILWQNGLCIRPIPKKIGGGASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEG

Query:  SS----GGILVLWKEKEV----KAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG
        +S      I +L     V     + DV+ G+F+VS+  +  N    W++A+YGP++ K RPLFW EL +L  +C   W LGG
Subjt:  SS----GGILVLWKEKEV----KAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.3e-56 | 24.18
Query:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL
        R C  ERK F +   K        ++E      F I +  +   WI   +  L+ TP + +FF +T      IW +K  N +G  AEI +V    R+  +
Subjt:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL

Query:  IIPQGEDSTGWKAFLAMVN---------------------------DFFNVSECRRIN-----------------------------------WDQTLVI
        ++P+G D +GW +FL+M+                            D+   S  + +                                     + T+VI
Subjt:  IIPQGEDSTGWKAFLAMVN---------------------------DFFNVSECRRIN-----------------------------------WDQTLVI

Query:  TRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFK
         RR FHD+W KI   LR   ++    N FH +KAL+       A ++  NKGWS +G ++++FE W+   H    ++PSYG W  F+ IPL  W + TF+
Subjt:  TRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFK

Query:  AIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE---------DEGSD
         IG    G I+  ++  S    +E  I+VR NY GF+PA+  +   +G+    QVVT  + + L  R V +HG F  +AA  SF +          EGS+
Subjt:  AIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE---------DEGSD

Query:  ELCP-----VDKARMENRSDR-------------------FQIQAALNQAGDVSISYKR-----------GFQSFEKDKAE-NMTSRKGVSEEKMRPRVG
         + P         R  +  D+                   F  +  +N +   + + K            G     K K +  +     ++ +K + +V 
Subjt:  ELCP-----VDKARMENRSDR-------------------FQIQAALNQAGDVSISYKR-----------GFQSFEKDKAE-NMTSRKGVSEEKMRPRVG

Query:  SSSTSEKEE--NNTNEPRK---AVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLGQNR---KDKMAIVVSDSQEPLIEAN
         +S S K    N  + P     ++    +K++V RE+ +  K  S    S+A +Q + +      F  + ++ +  +R   K  +++ V     P ++ N
Subjt:  SSSTSEKEE--NNTNEPRK---AVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLGQNR---KDKMAIVVSDSQEPLIEAN

Query:  PINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLCIRPIPKKIGGGASKGKRLR-----RCSMKILSW----NVRGLGDADKRRLV
           + ED     ++       +  V    ++ +P+ + +      N    + + K+      K ++ +         +++SW     ++   D D     
Subjt:  PINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLCIRPIPKKIGGGASKGKRLR-----RCSMKILSW----NVRGLGDADKRRLV

Query:  KELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEGSSGGILVLWKEKEVKAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLF
             +S   ++    S L   ++ I+K +W S  I+W + N+ GSSGGIL+LW  +    +    G F++S  F L N    W+T +YGP + +ER  F
Subjt:  KELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEGSSGGILVLWKEKEVKAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLF

Query:  WRELYDLNGLCSGAWCLGG
        W EL++L  L S  W LGG
Subjt:  WRELYDLNGLCSGAWCLGG

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | 1.9e-50 | 24.37
Query:  WIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSN--KRGVFAEISKVTSSRRRDNLIIPQGEDSTGWKAFLAMV-----------------------N
        WI +   DLL T  ++ FF +   E+  +W +K  N  K  + AEI ++ +  R+ ++++P+G DS GWK+FLA++                       +
Subjt:  WIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSN--KRGVFAEISKVTSSRRRDNLIIPQGEDSTGWKAFLAMV-----------------------N

Query:  DFFNV-------------------------------SECRRI-------------NWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLK
        D F+                                S  RR              ++++T++ITRRCFHD+W++I  +LR   +      PF  DKA+L 
Subjt:  DFFNV-------------------------------SECRRI-------------NWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLK

Query:  CPDPDFARIVVHNK---GWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYC
          +PD A+++  NK   GWS +GN+ +KFE W+  LH   +V+PSYG WLRF+ IPL  W  +TF+ IG   GGF++   + + +   ++  I+VR NY 
Subjt:  CPDPDFARIVVHNK---GWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYC

Query:  GFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKD
        GF+PA   +    G   I   V   + + L  R V +HG F ++AA      +  ++         +     R     +++ +   SISY    +     
Subjt:  GFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKD

Query:  KAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLG---------
        ++E     + +S+ +          ++++  + ++               R K +S++ VSF L         +  +I     + E+  +          
Subjt:  KAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLG---------

Query:  -QNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLC------------------IRPIPKKIGG
         Q  K K+   +    +   E + ++ +E          GS Q + SV+ G    I  L+  I+    +GL                    + +   +  
Subjt:  -QNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLC------------------IRPIPKKIGG

Query:  GASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRF-ISWSSFNSE-------GSSGGILVLWKEKEVK
        GA + K   R + +  S + +   + +  R  KE +      ++ ++E++L    ++      SS F +  S  N +       G  GGILVLW +   K
Subjt:  GASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRF-ISWSSFNSE-------GSSGGILVLWKEKEVK

Query:  AVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG
          D+ VG++++S L  L      W+T+VYGP +Y +R   W EL  L  LC   W + G
Subjt:  AVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG

XP_022149859.1 uncharacterized protein LOC111018186 [Momordica charantia] | 3.3e-55 | 49.77
Query:  ECRRINWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQ
        E RR+NW++T+VITRR FHD+WS+I   ++   +   IINPF  DKAL+KCP  D A +++ NKGW   G  T+K E WN  LHGR  + PSYG+W++ +
Subjt:  ECRRINWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQ

Query:  NIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSI-AQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE
        NIPL  W L TFKAIG+  GGFI+ DD     + C +V I+V+ NYCGFIPA  E+   DG +   A+VV+FED + L  + V IHGGFSSEAAR SF +
Subjt:  NIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSI-AQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE

Query:  DEGSDELCPVDKARMEN
                 +D+ R+EN
Subjt:  DEGSDELCPVDKARMEN

TrEMBL top hits | e value | % identity | Alignment
A0A5A7TEP0 DUF4283 domain-containing protein | 1.4e-51 | 28.74
Query:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL
        R C  E+K F +   K   +  + I+E      F I + +    W+      LL TP + +FF +    +  +W QK+ N+RG  AEI +V    R+  +
Subjt:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL

Query:  IIPQGEDSTGW-----------------------------------------------KAFLAMVNDFF----------NVSECRR--------------
        ++P+G D TGW                                               K +  +V+ F           N S  +R              
Subjt:  IIPQGEDSTGW-----------------------------------------------KAFLAMVNDFF----------NVSECRR--------------

Query:  ---INWDQTLVITRRCFHDEWSKIFDTLRNAFQKD---LIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLR
           I+W++T++++RRCFHD+W KI D LR    K        PFH DKALL   D D A+++  N GW+ +G F +KFE W+  +H    V+PSYG W R
Subjt:  ---INWDQTLVITRRCFHDEWSKIFDTLRNAFQKD---LIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLR

Query:  FQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFG
        F+ IPL  W L+TF  IG+  GGFI+   + ++ +   E +I+V++NY GF+PA  ++   +G V I Q+VT  + + L  R   IHG F   AA     
Subjt:  FQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFG

Query:  EDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKDKAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWR
          E  +E  P  +         F+   A+    D+S S K G +    D+ +  T     + +K    +GS  +  K E  +  P  +      KK+V+R
Subjt:  EDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKDKAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWR

Query:  EKDLSDKG
         K  S +G
Subjt:  EKDLSDKG

A0A5A7TTA1 DUF4283 domain-containing protein | 5.1e-54 | 25.83
Query:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL
        R C  E+K F +   K   +  + I+E      F I +      W+      LL TP + +FF +    +  +W Q + N+RG  AEI +V    R+  +
Subjt:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL

Query:  IIPQGEDSTGWKAFLAMV-------------NDFFN--------------------------------------VSECRRINWDQTLVITRRCFHDEWSK
        ++P+G D TGW  F  M+               ++N                                       S+C +     T  + RRCFHD+W+K
Subjt:  IIPQGEDSTGWKAFLAMV-------------NDFFN--------------------------------------VSECRRINWDQTLVITRRCFHDEWSK

Query:  IFDTLRN-AFQKDLIIN--PFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGG
        I D LR+   +KD      PFH DKALL   D + A+++  N GW+ +G F +KFE W+   H    V+PSYG W RF+ IPL  W L+TF  IG+ YGG
Subjt:  IFDTLRN-AFQKDLIIN--PFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGG

Query:  FIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRF
        FI+   + ++ +   E +I+V++NY GF+PA  ++   +G   I Q VT    + L  R   IHG F+  AA       E  +E  P  +         F
Subjt:  FIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRF

Query:  QIQAALNQAGDVSISYKRGFQSFEKDKAENMT--------------------SRKGVSEEKMRPRVGSSSTSEKEENN---TNEPRKAVGPTREKKQVWR
        +   A+    D+  S K+G +    D+    T                    S    S EK++ +   S  +EK++       E  + + P   K++V  
Subjt:  QIQAALNQAGDVSISYKRGFQSFEKDKAENMT--------------------SRKGVSEEKMRPRVGSSSTSEKEENN---TNEPRKAVGPTREKKQVWR

Query:  EKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECL--GQNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPIL
            S K  +F   + +   +         +A  E +     Q+ K K+  V S S E               RE S    S QK       ++  + + 
Subjt:  EKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECL--GQNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPIL

Query:  KETIEILWQNGLCIRPIPKKIGGGASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEG
           I +L          P  I    S         +K    ++      D+ +  KE +R         ++ ++ F  R +   +  +     + FNS+ 
Subjt:  KETIEILWQNGLCIRPIPKKIGGGASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEG

Query:  SS----GGILVLWKEKEV----KAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG
        +S      I +L     V     + DV+ G+F+VS+  +  N    W++A+YGP++ K RPLFW EL +L  +C   W LGG
Subjt:  SS----GGILVLWKEKEV----KAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein | 9.0e-51 | 24.37
Query:  WIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSN--KRGVFAEISKVTSSRRRDNLIIPQGEDSTGWKAFLAMV-----------------------N
        WI +   DLL T  ++ FF +   E+  +W +K  N  K  + AEI ++ +  R+ ++++P+G DS GWK+FLA++                       +
Subjt:  WIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSN--KRGVFAEISKVTSSRRRDNLIIPQGEDSTGWKAFLAMV-----------------------N

Query:  DFFNV-------------------------------SECRRI-------------NWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLK
        D F+                                S  RR              ++++T++ITRRCFHD+W++I  +LR   +      PF  DKA+L 
Subjt:  DFFNV-------------------------------SECRRI-------------NWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLK

Query:  CPDPDFARIVVHNK---GWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYC
          +PD A+++  NK   GWS +GN+ +KFE W+  LH   +V+PSYG WLRF+ IPL  W  +TF+ IG   GGF++   + + +   ++  I+VR NY 
Subjt:  CPDPDFARIVVHNK---GWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYC

Query:  GFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKD
        GF+PA   +    G   I   V   + + L  R V +HG F ++AA      +  ++         +     R     +++ +   SISY    +     
Subjt:  GFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGEDEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKD

Query:  KAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLG---------
        ++E     + +S+ +          ++++  + ++               R K +S++ VSF L         +  +I     + E+  +          
Subjt:  KAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLG---------

Query:  -QNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLC------------------IRPIPKKIGG
         Q  K K+   +    +   E + ++ +E          GS Q + SV+ G    I  L+  I+    +GL                    + +   +  
Subjt:  -QNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLC------------------IRPIPKKIGG

Query:  GASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRF-ISWSSFNSE-------GSSGGILVLWKEKEVK
        GA + K   R + +  S + +   + +  R  KE +      ++ ++E++L    ++      SS F +  S  N +       G  GGILVLW +   K
Subjt:  GASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRF-ISWSSFNSE-------GSSGGILVLWKEKEVK

Query:  AVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG
          D+ VG++++S L  L      W+T+VYGP +Y +R   W EL  L  LC   W + G
Subjt:  AVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGG

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein | 6.4e-57 | 24.18
Query:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL
        R C  ERK F +   K        ++E      F I +  +   WI   +  L+ TP + +FF +T      IW +K  N +G  AEI +V    R+  +
Subjt:  RKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNL

Query:  IIPQGEDSTGWKAFLAMVN---------------------------DFFNVSECRRIN-----------------------------------WDQTLVI
        ++P+G D +GW +FL+M+                            D+   S  + +                                     + T+VI
Subjt:  IIPQGEDSTGWKAFLAMVN---------------------------DFFNVSECRRIN-----------------------------------WDQTLVI

Query:  TRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFK
         RR FHD+W KI   LR   ++    N FH +KAL+       A ++  NKGWS +G ++++FE W+   H    ++PSYG W  F+ IPL  W + TF+
Subjt:  TRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQNIPLQNWCLDTFK

Query:  AIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE---------DEGSD
         IG    G I+  ++  S    +E  I+VR NY GF+PA+  +   +G+    QVVT  + + L  R V +HG F  +AA  SF +          EGS+
Subjt:  AIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE---------DEGSD

Query:  ELCP-----VDKARMENRSDR-------------------FQIQAALNQAGDVSISYKR-----------GFQSFEKDKAE-NMTSRKGVSEEKMRPRVG
         + P         R  +  D+                   F  +  +N +   + + K            G     K K +  +     ++ +K + +V 
Subjt:  ELCP-----VDKARMENRSDR-------------------FQIQAALNQAGDVSISYKR-----------GFQSFEKDKAE-NMTSRKGVSEEKMRPRVG

Query:  SSSTSEKEE--NNTNEPRK---AVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLGQNR---KDKMAIVVSDSQEPLIEAN
         +S S K    N  + P     ++    +K++V RE+ +  K  S    S+A +Q + +      F  + ++ +  +R   K  +++ V     P ++ N
Subjt:  SSSTSEKEE--NNTNEPRK---AVGPTREKKQVWREKDLSDKGVSFALKSEAWDQEEAIEDIAAMFANEEVECLGQNR---KDKMAIVVSDSQEPLIEAN

Query:  PINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLCIRPIPKKIGGGASKGKRLR-----RCSMKILSW----NVRGLGDADKRRLV
           + ED     ++       +  V    ++ +P+ + +      N    + + K+      K ++ +         +++SW     ++   D D     
Subjt:  PINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLCIRPIPKKIGGGASKGKRLR-----RCSMKILSW----NVRGLGDADKRRLV

Query:  KELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEGSSGGILVLWKEKEVKAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLF
             +S   ++    S L   ++ I+K +W S  I+W + N+ GSSGGIL+LW  +    +    G F++S  F L N    W+T +YGP + +ER  F
Subjt:  KELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEGSSGGILVLWKEKEVKAVDVVVGSFTVSVLFELENAKKCWITAVYGPSRYKERPLF

Query:  WRELYDLNGLCSGAWCLGG
        W EL++L  L S  W LGG
Subjt:  WRELYDLNGLCSGAWCLGG

A0A6J1D6X4 uncharacterized protein LOC111018186 | 1.6e-55 | 49.77
Query:  ECRRINWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQ
        E RR+NW++T+VITRR FHD+WS+I   ++   +   IINPF  DKAL+KCP  D A +++ NKGW   G  T+K E WN  LHGR  + PSYG+W++ +
Subjt:  ECRRINWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINVVPSYGSWLRFQ

Query:  NIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSI-AQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE
        NIPL  W L TFKAIG+  GGFI+ DD     + C +V I+V+ NYCGFIPA  E+   DG +   A+VV+FED + L  + V IHGGFSSEAAR SF +
Subjt:  NIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSI-AQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE

Query:  DEGSDELCPVDKARMEN
                 +D+ R+EN
Subjt:  DEGSDELCPVDKARMEN

SwissProt top hits | e value | % identity | Alignment
Q9SKZ1 Transcription factor Pur-alpha 1 | 2.0e-07 | 28.93
Query:  ERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQG
        E KLF  D  +    R +KISEK  A R  II+      W  D+ +  + +   + F ++   ++   +F    N+RG F ++S+ + SR R  +I+P G
Subjt:  ERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQG

Query:  ED-STGWKAFLAMVNDFFNVS
             GW AF  ++ +    S
Subjt:  ED-STGWKAFLAMVNDFFNVS

Arabidopsis top hits | e value | % identity | Alignment
AT2G32080.1 purin-rich alpha 1 | 1.4e-08 | 28.93
Query:  ERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQG
        E KLF  D  +    R +KISEK  A R  II+      W  D+ +  + +   + F ++   ++   +F    N+RG F ++S+ + SR R  +I+P G
Subjt:  ERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQG

Query:  ED-STGWKAFLAMVNDFFNVS
             GW AF  ++ +    S
Subjt:  ED-STGWKAFLAMVNDFFNVS

AT2G32080.2 purin-rich alpha 1 | 1.4e-08 | 28.93
Query:  ERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQG
        E KLF  D  +    R +KISEK  A R  II+      W  D+ +  + +   + F ++   ++   +F    N+RG F ++S+ + SR R  +I+P G
Subjt:  ERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQG

Query:  ED-STGWKAFLAMVNDFFNVS
             GW AF  ++ +    S
Subjt:  ED-STGWKAFLAMVNDFFNVS


Sequences
CDS sequence
ATGACTTCGAGATCAGCCCGTAAGTGCGTAGCTGAGAGGAAACTGTTTGAGATAGACTATATCAAAGAGAGGAACAAGAGAGCAGTTAAGATTTCTGAGAAAAACCACGC
TCTGCGGTTCGAGATTATTCTGGAAGTTAAACACGCAATTTGGATAGCAGATGTAGTCGATGACTTGCTCATTACCCCCATAAGCCAAAAATTTTTTAGAAAGACTTCGT
GCGAAAACGGTTTCATTTGGTTTCAAAAGGTGTCCAACAAGAGAGGGGTTTTTGCAGAAATATCGAAGGTGACTTCATCTAGAAGAAGAGATAATCTGATTATTCCCCAG
GGAGAAGACTCTACAGGATGGAAAGCTTTCCTCGCTATGGTTAACGACTTCTTCAATGTCAGTGAGTGTCGAAGGATCAACTGGGATCAAACTCTGGTTATTACGAGAAG
ATGCTTTCATGATGAATGGAGTAAAATCTTCGACACTCTTCGTAATGCTTTTCAGAAGGATCTAATCATCAACCCTTTCCACCCGGATAAGGCCCTCCTGAAGTGCCCGG
ACCCAGACTTTGCTAGAATCGTTGTTCATAATAAAGGTTGGTCAGTAATAGGGAACTTCACCCTGAAATTTGAATATTGGAACTATAAGTTGCATGGTAGAATTAATGTA
GTTCCTTCTTATGGCAGCTGGCTGAGATTCCAGAACATTCCCCTGCAGAATTGGTGTCTTGACACTTTTAAAGCGATAGGGGACGTGTATGGAGGGTTTATTGAATGCGA
TGATAAATGCTTGTCTTTAGTTGGTTGTATGGAAGTCGTTATTCGCGTAAGAGACAATTATTGTGGTTTTATTCCAGCCGATTTTGAGCTGATGCAGGCAGACGGCTCAG
TTTCGATCGCCCAAGTGGTTACTTTTGAAGATCCTCAACTCCTGGAAAGTAGACGAGTTTACATCCATGGAGGTTTTTCCAGTGAAGCAGCTAGGGTTTCCTTTGGGGAA
GACGAAGGTAGTGACGAGCTTTGCCCAGTGGATAAAGCCCGTATGGAAAACAGGTCGGACAGGTTTCAGATCCAGGCGGCTCTAAACCAGGCTGGGGATGTCAGTATAAG
CTACAAAAGGGGATTTCAAAGTTTTGAAAAGGATAAAGCTGAAAATATGACTTCTAGAAAGGGGGTGTCAGAAGAAAAGATGAGGCCCAGAGTGGGGAGTAGCAGTACCA
GCGAAAAAGAGGAAAATAATACAAATGAACCAAGGAAGGCAGTGGGGCCCACGCGCGAGAAAAAGCAGGTTTGGAGAGAGAAAGACTTGTCTGATAAAGGAGTCTCCTTT
GCCTTAAAATCTGAAGCGTGGGATCAAGAGGAGGCCATAGAGGACATAGCAGCCATGTTTGCGAACGAAGAAGTCGAATGCCTAGGTCAAAACCGAAAAGATAAGATGGC
CATAGTAGTTAGCGATTCTCAGGAACCTCTGATAGAAGCAAATCCGATTAATACTGAAGAGGATGATAGAAGGGAACCCTCGAGTCCTTGTGGATCTGGGCAGAAATCTG
GTAGTGTAGAATCGGGGTCAAAGCTCGCAATTCCGATATTAAAGGAAACGATAGAAATCCTGTGGCAAAATGGGCTCTGTATTAGGCCAATTCCAAAGAAGATAGGAGGT
GGGGCGAGCAAAGGCAAAAGGTTGCGACGTTGCTCAATGAAGATTCTCTCATGGAATGTCAGGGGGTTGGGGGATGCGGATAAAAGAAGATTGGTTAAGGAGTTAGTTAG
GAGTAGCGGCCCATATATTGTGCTTATTCAGGAGTCTAAATTGGGTTTTGTCCATAGGCACATTGTCAAGCAAGTGTGGAGCTCTCGTTTTATAAGTTGGTCTTCCTTTA
ATTCTGAAGGGTCTTCAGGTGGGATTCTGGTGTTGTGGAAGGAGAAAGAGGTCAAGGCAGTGGATGTGGTTGTGGGGAGTTTCACAGTGTCTGTCCTGTTTGAGCTTGAG
AACGCCAAGAAATGCTGGATTACCGCCGTGTATGGTCCTAGTAGATATAAAGAAAGACCTTTGTTTTGGCGTGAGTTATATGATCTGAATGGCTTATGCAGCGGGGCTTG
GTGTCTAGGGGGATTTCAACGTGGTCAGAAGAATTGA
mRNA sequence
ATGACTTCGAGATCAGCCCGTAAGTGCGTAGCTGAGAGGAAACTGTTTGAGATAGACTATATCAAAGAGAGGAACAAGAGAGCAGTTAAGATTTCTGAGAAAAACCACGC
TCTGCGGTTCGAGATTATTCTGGAAGTTAAACACGCAATTTGGATAGCAGATGTAGTCGATGACTTGCTCATTACCCCCATAAGCCAAAAATTTTTTAGAAAGACTTCGT
GCGAAAACGGTTTCATTTGGTTTCAAAAGGTGTCCAACAAGAGAGGGGTTTTTGCAGAAATATCGAAGGTGACTTCATCTAGAAGAAGAGATAATCTGATTATTCCCCAG
GGAGAAGACTCTACAGGATGGAAAGCTTTCCTCGCTATGGTTAACGACTTCTTCAATGTCAGTGAGTGTCGAAGGATCAACTGGGATCAAACTCTGGTTATTACGAGAAG
ATGCTTTCATGATGAATGGAGTAAAATCTTCGACACTCTTCGTAATGCTTTTCAGAAGGATCTAATCATCAACCCTTTCCACCCGGATAAGGCCCTCCTGAAGTGCCCGG
ACCCAGACTTTGCTAGAATCGTTGTTCATAATAAAGGTTGGTCAGTAATAGGGAACTTCACCCTGAAATTTGAATATTGGAACTATAAGTTGCATGGTAGAATTAATGTA
GTTCCTTCTTATGGCAGCTGGCTGAGATTCCAGAACATTCCCCTGCAGAATTGGTGTCTTGACACTTTTAAAGCGATAGGGGACGTGTATGGAGGGTTTATTGAATGCGA
TGATAAATGCTTGTCTTTAGTTGGTTGTATGGAAGTCGTTATTCGCGTAAGAGACAATTATTGTGGTTTTATTCCAGCCGATTTTGAGCTGATGCAGGCAGACGGCTCAG
TTTCGATCGCCCAAGTGGTTACTTTTGAAGATCCTCAACTCCTGGAAAGTAGACGAGTTTACATCCATGGAGGTTTTTCCAGTGAAGCAGCTAGGGTTTCCTTTGGGGAA
GACGAAGGTAGTGACGAGCTTTGCCCAGTGGATAAAGCCCGTATGGAAAACAGGTCGGACAGGTTTCAGATCCAGGCGGCTCTAAACCAGGCTGGGGATGTCAGTATAAG
CTACAAAAGGGGATTTCAAAGTTTTGAAAAGGATAAAGCTGAAAATATGACTTCTAGAAAGGGGGTGTCAGAAGAAAAGATGAGGCCCAGAGTGGGGAGTAGCAGTACCA
GCGAAAAAGAGGAAAATAATACAAATGAACCAAGGAAGGCAGTGGGGCCCACGCGCGAGAAAAAGCAGGTTTGGAGAGAGAAAGACTTGTCTGATAAAGGAGTCTCCTTT
GCCTTAAAATCTGAAGCGTGGGATCAAGAGGAGGCCATAGAGGACATAGCAGCCATGTTTGCGAACGAAGAAGTCGAATGCCTAGGTCAAAACCGAAAAGATAAGATGGC
CATAGTAGTTAGCGATTCTCAGGAACCTCTGATAGAAGCAAATCCGATTAATACTGAAGAGGATGATAGAAGGGAACCCTCGAGTCCTTGTGGATCTGGGCAGAAATCTG
GTAGTGTAGAATCGGGGTCAAAGCTCGCAATTCCGATATTAAAGGAAACGATAGAAATCCTGTGGCAAAATGGGCTCTGTATTAGGCCAATTCCAAAGAAGATAGGAGGT
GGGGCGAGCAAAGGCAAAAGGTTGCGACGTTGCTCAATGAAGATTCTCTCATGGAATGTCAGGGGGTTGGGGGATGCGGATAAAAGAAGATTGGTTAAGGAGTTAGTTAG
GAGTAGCGGCCCATATATTGTGCTTATTCAGGAGTCTAAATTGGGTTTTGTCCATAGGCACATTGTCAAGCAAGTGTGGAGCTCTCGTTTTATAAGTTGGTCTTCCTTTA
ATTCTGAAGGGTCTTCAGGTGGGATTCTGGTGTTGTGGAAGGAGAAAGAGGTCAAGGCAGTGGATGTGGTTGTGGGGAGTTTCACAGTGTCTGTCCTGTTTGAGCTTGAG
AACGCCAAGAAATGCTGGATTACCGCCGTGTATGGTCCTAGTAGATATAAAGAAAGACCTTTGTTTTGGCGTGAGTTATATGATCTGAATGGCTTATGCAGCGGGGCTTG
GTGTCTAGGGGGATTTCAACGTGGTCAGAAGAATTGA
Protein sequence
MTSRSARKCVAERKLFEIDYIKERNKRAVKISEKNHALRFEIILEVKHAIWIADVVDDLLITPISQKFFRKTSCENGFIWFQKVSNKRGVFAEISKVTSSRRRDNLIIPQ
GEDSTGWKAFLAMVNDFFNVSECRRINWDQTLVITRRCFHDEWSKIFDTLRNAFQKDLIINPFHPDKALLKCPDPDFARIVVHNKGWSVIGNFTLKFEYWNYKLHGRINV
VPSYGSWLRFQNIPLQNWCLDTFKAIGDVYGGFIECDDKCLSLVGCMEVVIRVRDNYCGFIPADFELMQADGSVSIAQVVTFEDPQLLESRRVYIHGGFSSEAARVSFGE
DEGSDELCPVDKARMENRSDRFQIQAALNQAGDVSISYKRGFQSFEKDKAENMTSRKGVSEEKMRPRVGSSSTSEKEENNTNEPRKAVGPTREKKQVWREKDLSDKGVSF
ALKSEAWDQEEAIEDIAAMFANEEVECLGQNRKDKMAIVVSDSQEPLIEANPINTEEDDRREPSSPCGSGQKSGSVESGSKLAIPILKETIEILWQNGLCIRPIPKKIGG
GASKGKRLRRCSMKILSWNVRGLGDADKRRLVKELVRSSGPYIVLIQESKLGFVHRHIVKQVWSSRFISWSSFNSEGSSGGILVLWKEKEVKAVDVVVGSFTVSVLFELE
NAKKCWITAVYGPSRYKERPLFWRELYDLNGLCSGAWCLGGFQRGQKN