; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006233 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006233
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationscaffold4:4510526..4512450
RNA-Seq ExpressionSpg006233
SyntenySpg006233
Gene Ontology termsNA
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025238.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-6038.04Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP  +    P A++P ++S GPY+HGK HL   E  KL     F+ RC  +     ++I + V +I  L+ L + Y + + D WK+ D   FL+LM
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------ANLLHPQISRSDRVITF
        +V+GCFML  L++     +    D   IK+DMLLLENQLPM                                           +LLHP I R+D  +  
Subjt:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------ANLLHPQISRSDRVITF

Query:  EYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVA
        ++   + Q       +I PAT+L+ AGI F+RS T SL+DV FD K GVL +P L VDD T+S L NV+A+E+L+   G +VTSF +LM++LID ++DVA
Subjt:  EYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVA

Query:  LLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYH
        +L  + +L + +G D+ AA  FN LG GAA+          V++ + ++C++PW+E   +L +  FQSPWTIISL AA  GF++L LQ  YQ   Y+
Subjt:  LLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYH

XP_022148888.1 UPF0481 protein At3g47200-like [Momordica charantia]9.4e-6638.95Show/hide
Query:  RFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSA-FDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAI
        R +++  E A +   RE L +   ++    +  +  S  +IP  +R V+P A F+P L+SFGPYHHG+SHL + E  K    + F+ R         + +
Subjt:  RFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSA-FDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAI

Query:  AKVVCSISMLEPLKKFYHQPDSDNWKDAD-EFSFLKLMLVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPMANL--LHPQISRS--------
               SMLE ++  Y + + + WK  D    FL+LM+++GCFMLE+LL+D   WL N +  E I RDMLLLENQLPM  L  LH   + S        
Subjt:  AKVVCSISMLEPLKKFYHQPDSDNWKDAD-EFSFLKLMLVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPMANL--LHPQISRS--------

Query:  --------DRV---ITFEY-------------------------DVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVD
                D+V   +  EY                             A+  +S   +I PATRL  AGI F RS++ S++DV FD KRGVLK+P + VD
Subjt:  --------DRV---ITFEY-------------------------DVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVD

Query:  DVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWW
        D T+S   NV+A+E+L+   G++VT F +LMN+LIDVD+DVALL S  I+ + LG D+ AAE F  L +GAAL+  +      V   + ++C K  H+W 
Subjt:  DVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWW

Query:  TSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYH
         SL +  FQ PW I+SLIAA+LGFV+L LQ  YQ+  Y+
Subjt:  TSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYH

XP_022960454.1 UPF0481 protein At3g47200-like [Cucurbita moschata]1.3e-5937.47Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP  +    P A++P ++S GPY+HGK HL   E  KL     F+ RC  +     ++I + V +I  L+ L + Y + + D WK+ D   FL+LM
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------------ANLLHPQISRS
        +V+GCFML  L++     +    D   IK+DMLLLENQLPM                                                 +LLHP I R+
Subjt:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------------ANLLHPQISRS

Query:  DRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLID
        D  +   +   + Q       +I PAT+L+ AGI F+RS+T SL+DV FD K GVL +P L VDD T+S L NV+A+E+L+   G +VTSF +LM++LID
Subjt:  DRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLID

Query:  VDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY
         ++DVA+L  + +L + +G D+ AA  FN LG GAA+       +  V++ + ++C++PW+E   +L +  FQSPWTIISL AA  GF++L LQ  YQ  
Subjt:  VDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY

Query:  GYH
         Y+
Subjt:  GYH

XP_023513987.1 UPF0481 protein At3g47200-like isoform X2 [Cucurbita pepo subsp. pepo]6.5e-5937.31Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP  +    P A++P ++S GPY+HGK HL   E  KL     F+ RC  +     ++I K V +I  L+ L + Y + + + W + D   FL+LM
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM-----------------------------------------------ANLLHPQISRSD
        +V+GCFML  L++     +    D   IK+DMLLLENQLPM                                                +LLHP I R+D
Subjt:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM-----------------------------------------------ANLLHPQISRSD

Query:  RVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDV
          +  ++   ++Q       +I PAT+L+ AGI F+RS+T SL+DV FD K GVL +P L VDD T+S L NV+A+E+L+   G  VTSF +LM++LID 
Subjt:  RVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDV

Query:  DKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYG
        ++DVA+L  + +L + +G D+ AA  FN LG GAA+          V++ + ++C++PW+E   +L +  FQSPWTIISL AA  GF++L LQ  YQ   
Subjt:  DKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYG

Query:  YH
        Y+
Subjt:  YH

XP_038875622.1 UPF0481 protein At3g47200-like [Benincasa hispida]9.7e-6337.47Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP+ +R ++  AF+P L+S GPYHHGK HL+  E  K    RRF      N  +  ++++      + LE L   Y + D + WK  D   FL++M
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLS-----DDRQWLRNDKDVETIKRDMLLLENQLPM-----------------------------------------------ANLLHPQ
        +V+GCFML+            +W         IKRDMLLLENQLPM                                               A+LL+P 
Subjt:  LVEGCFMLELLLS-----DDRQWLRNDKDVETIKRDMLLLENQLPM-----------------------------------------------ANLLHPQ

Query:  ISRSDR----VITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFT
        + R DR     +T++    + Q       +I  AT+L  AGI F+ S T +L+DVSF+ K+GVL++P++ VDD T++ L NV+A+E+L    G++VTSF 
Subjt:  ISRSDR----VITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFT

Query:  VLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLF
        +LMN+LIDVD+DVALL S  IL + LG DE+AA+ F++LG+GAA++  +      V+  + K+CS  W++W  SL +  FQ+PW IISL AA  GF +L 
Subjt:  VLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLF

Query:  LQTFYQVYGYH
        +Q  YQ+  YH
Subjt:  LQTFYQVYGYH

TrEMBL top hitse value%identityAlignment
A0A2R6P1L8 UPF0481 protein6.0e-5833.57Show/hide
Query:  DPSSCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFL
        +  S  R+P  + ++   A+ P  +SFGPYHHG+ HL+  E  K   L  F KR +       +++A V       + LK+ Y   DS +    D  +FL
Subjt:  DPSSCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFL

Query:  KLMLVEGCFMLELLLSDDRQWLRND--------------KDVETIKRDMLLLENQLPMANLLHPQISRSDRV----------------------------
        +LM+++GCFMLE+L + D    +ND                +  IKRDML+LENQLPM  L +     +D+                             
Subjt:  KLMLVEGCFMLELLLSDDRQWLRND--------------KDVETIKRDMLLLENQLPMANLLHPQISRSDRV----------------------------

Query:  ----------ITFEYDVRDAQRHNSTR----------PMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPN
                  + +E  +++++R  S             +I  A  L  AGI F++S++ SL D+SF  K GVL +P + VDD T+S   N+IAYER +  
Subjt:  ----------ITFEYDVRDAQRHNSTR----------PMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPN

Query:  IGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIA
         GNEVTS+   M+++ID  +DV+LL SK I+++ +G+D+A A+ FN L +   L+  +    D V++++ KYC K W+EW  +L++T F++PW I+S+IA
Subjt:  IGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIA

Query:  ASLGFVLLFLQTFYQVYGYHHPS
        A   F L  +QT Y +Y Y+HP+
Subjt:  ASLGFVLLFLQTFYQVYGYHHPS

A0A6J1D5C0 UPF0481 protein At3g47200-like4.5e-6638.95Show/hide
Query:  RFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSA-FDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAI
        R +++  E A +   RE L +   ++    +  +  S  +IP  +R V+P A F+P L+SFGPYHHG+SHL + E  K    + F+ R         + +
Subjt:  RFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSA-FDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAI

Query:  AKVVCSISMLEPLKKFYHQPDSDNWKDAD-EFSFLKLMLVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPMANL--LHPQISRS--------
               SMLE ++  Y + + + WK  D    FL+LM+++GCFMLE+LL+D   WL N +  E I RDMLLLENQLPM  L  LH   + S        
Subjt:  AKVVCSISMLEPLKKFYHQPDSDNWKDAD-EFSFLKLMLVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPMANL--LHPQISRS--------

Query:  --------DRV---ITFEY-------------------------DVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVD
                D+V   +  EY                             A+  +S   +I PATRL  AGI F RS++ S++DV FD KRGVLK+P + VD
Subjt:  --------DRV---ITFEY-------------------------DVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVD

Query:  DVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWW
        D T+S   NV+A+E+L+   G++VT F +LMN+LIDVD+DVALL S  I+ + LG D+ AAE F  L +GAAL+  +      V   + ++C K  H+W 
Subjt:  DVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWW

Query:  TSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYH
         SL +  FQ PW I+SLIAA+LGFV+L LQ  YQ+  Y+
Subjt:  TSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYH

A0A6J1HB25 UPF0481 protein At3g47200-like6.4e-6037.47Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP  +    P A++P ++S GPY+HGK HL   E  KL     F+ RC  +     ++I + V +I  L+ L + Y + + D WK+ D   FL+LM
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------------ANLLHPQISRS
        +V+GCFML  L++     +    D   IK+DMLLLENQLPM                                                 +LLHP I R+
Subjt:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------------ANLLHPQISRS

Query:  DRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLID
        D  +   +   + Q       +I PAT+L+ AGI F+RS+T SL+DV FD K GVL +P L VDD T+S L NV+A+E+L+   G +VTSF +LM++LID
Subjt:  DRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLID

Query:  VDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY
         ++DVA+L  + +L + +G D+ AA  FN LG GAA+       +  V++ + ++C++PW+E   +L +  FQSPWTIISL AA  GF++L LQ  YQ  
Subjt:  VDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY

Query:  GYH
         Y+
Subjt:  GYH

A0A6J1KVQ6 UPF0481 protein At3g47200-like isoform X21.3e-5737.06Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP  +    P A++P ++S GPY+HGK HL   E  KL     F+ RC  +     ++I + V +I  L+ L + Y + + + W   D   FL+LM
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM-----------------------------------------------ANLLHPQISRSD
        +V+GCFML  L+S     +    D   IK+DMLLLENQLPM                                                +LL+P I R D
Subjt:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM-----------------------------------------------ANLLHPQISRSD

Query:  RVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDV
             ++   + Q       +I PAT+L  AGI F+RS+T SL DV FD KRGVL +P L VDD T+S + NV+A+E+L+   G +VTSF +LM++LID 
Subjt:  RVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDV

Query:  DKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYG
        ++DVA+L  + IL + +G D+ AA  F+ LG GAA+   +      V++ +  +C++PW+E   +L +  FQSPWTIISL AA  GF++L LQ  YQ   
Subjt:  DKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYG

Query:  YH
        Y+
Subjt:  YH

A0A6J1KYV8 UPF0481 protein At3g47200-like isoform X11.7e-5736.97Show/hide
Query:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM
        S  +IP  +    P A++P ++S GPY+HGK HL   E  KL     F+ RC  +     ++I + V +I  L+ L + Y + + + W   D   FL+LM
Subjt:  SCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLM

Query:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------------ANLLHPQISRS
        +V+GCFML  L+S     +    D   IK+DMLLLENQLPM                                                 +LL+P I R 
Subjt:  LVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPM------------------------------------------------ANLLHPQISRS

Query:  DRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLID
        D     ++   + Q       +I PAT+L  AGI F+RS+T SL DV FD KRGVL +P L VDD T+S + NV+A+E+L+   G +VTSF +LM++LID
Subjt:  DRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLID

Query:  VDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY
         ++DVA+L  + IL + +G D+ AA  F+ LG GAA+   +      V++ +  +C++PW+E   +L +  FQSPWTIISL AA  GF++L LQ  YQ  
Subjt:  VDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY

Query:  GYH
         Y+
Subjt:  GYH

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026457.8e-1527.6Show/hide
Query:  VRDAQRHNSTRPMILPA-TRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALL
        + D ++      + +P+ + L  AG+ F+ +   ++S V+FD   G   +P + +D  T++ L N++AYE  N +     T +T L+N +ID ++DV LL
Subjt:  VRDAQRHNSTRPMILPA-TRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALL

Query:  TSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY
          + +L   L +D+ AAE +N + +   L +      D     + +Y +  W      LV       W I++ +AA L  +L+ LQ F  V+
Subjt:  TSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVY

Q9SD53 UPF0481 protein At3g472001.2e-2623.84Show/hide
Query:  GDPSSC-QRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFS
        G  S C  R+P+    + P A+ P ++S GPYH+G+ HL   ++ K   L+ F            +    V   + + + ++K Y    S+  K   +  
Subjt:  GDPSSC-QRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFS

Query:  FLKLMLVEGCFML--------ELLLSDDR----QWLRNDKDVETIKRDMLLLENQLP-------------------------------------------
        F  +M+++GCF+L         + LS+D      WL     + +I+ D+LLLENQ+P                                           
Subjt:  FLKLMLVEGCFML--------ELLLSDDR----QWLRNDKDVETIKRDMLLLENQLP-------------------------------------------

Query:  -----MANLLH----PQISRSDRVIT--FEYDVRDAQRHN------STRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSAL
             + +L+     P  S SD+  +   +  + + +  N         P+IL A RL+  GI F    +   S ++  LK+  L+IP L+ D    S  
Subjt:  -----MANLLH----PQISRSDRVIT--FEYDVRDAQRHN------STRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSAL

Query:  FNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALL-TSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNT
         N +A+E+   +  NE+T++ V M  L++ ++DV  L   K I+E+  G++   +EFF  + +       +    + V++ + +Y  K ++  W    +T
Subjt:  FNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALL-TSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNT

Query:  NFQSPWTIISLIAASLGFVLLFLQTFYQVYGY
        +F+SPWT +S  A     +L  LQ+   +  Y
Subjt:  NFQSPWTIISLIAASLGFVLLFLQTFYQVYGY

Arabidopsis top hitse value%identityAlignment
AT3G50130.1 Plant protein of unknown function (DUF247)8.6e-3324.95Show/hide
Query:  EELERFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRF
        EE    +++++E+A     RE+ T S+          D     R+PQ+++     ++ P  +S GP+HHG  HL+  +R K   +     R   +     
Subjt:  EELERFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRF

Query:  DAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVEGCFMLELLLSDDRQWL-----RNDK------DVETIKRDMLLLENQLPM--------
        DA+ +      + +  +  Y  P      D     F ++++++GCF+LEL    D  +      RND        + +I+RDM++LENQLP+        
Subjt:  DAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVEGCFMLELLLSDDRQWL-----RNDK------DVETIKRDMLLLENQLPM--------

Query:  -------------------------------------------------------------ANLLHPQISRSDRVITFEYDVRDAQRHNSTRPMILPATR
                                                                      NLL P  +   R+    +  R        + +I   T 
Subjt:  -------------------------------------------------------------ANLLHPQISRSDRVITFEYDVRDAQRHNSTRPMILPATR

Query:  LQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFF
        L+ AGI F   +T    D+ F  K G L+IP L + D TKS   N+IA+E+ + +  N++TS+ + M++LID  +DV  L    I+EH LG D   A+ F
Subjt:  LQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFF

Query:  NVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYHHP
        N L +  A +  N  L   +   +++  S+ W+     L +  F +PW   S  AA +  VL   Q+F+  Y Y +P
Subjt:  NVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYHHP

AT3G50150.1 Plant protein of unknown function (DUF247)2.6e-3726.46Show/hide
Query:  RIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVE
        R+P +++  +  ++ P  +S GPYHHGK HL   ER K   +     R   N     DA+ +      + E  +  Y  P   + K+++EF+  ++++++
Subjt:  RIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVE

Query:  GCFMLELLLSDDR-----QWLRNDKD------VETIKRDMLLLENQLPM---------------------------------------------------
        GCF+LEL     +      + RND        + +I+RDM++LENQLP+                                                   
Subjt:  GCFMLELLLSDDR-----QWLRNDKD------VETIKRDMLLLENQLPM---------------------------------------------------

Query:  -----------ANLLHPQISRSDRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYE
                    ++ H  + +S           D       + +I   T L+ AG+ F R ET  L D+ F  K G LKIP L + D TKS   N+IA+E
Subjt:  -----------ANLLHPQISRSDRVITFEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYE

Query:  RLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTI
        + +    N +TS+ + M++LI+  +DV+ L    I+EH LG+D   A+ FN L +    +  +      + R + +Y S+ W+    +L    F +PW  
Subjt:  RLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTI

Query:  ISLIAASLGFVLLFLQTFYQVYGYHHP
         S  AA +   L F Q+F+ VY Y+ P
Subjt:  ISLIAASLGFVLLFLQTFYQVYGYHHP

AT3G50160.1 Plant protein of unknown function (DUF247)3.1e-3525.24Show/hide
Query:  IGDKVIFGDPSSC-QRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNW
        +GD       + C  R+P +++  +  ++ P ++S GPYHHG  HL+  ER K   +     R   +     DA+ +      + E  +  Y  P + N 
Subjt:  IGDKVIFGDPSSC-QRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNW

Query:  KDADEFSFLKLMLVEGCFMLELLLS-----DDRQWLRNDKD------VETIKRDMLLLENQLP------MANLLHPQISRSDRVITFE------------
         +     F+++++++G F++E+         +  +  ND        +++I+RDM++LENQLP      +  L  P +     V  F+            
Subjt:  KDADEFSFLKLMLVEGCFMLELLLS-----DDRQWLRNDKD------VETIKRDMLLLENQLP------MANLLHPQISRSDRVITFE------------

Query:  -----------------------YDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNI
                                D   +  +   + +I   T L+ AG+ F R ET    D+ F  K G LKIP L + D TKS   N+IA+E+ +   
Subjt:  -----------------------YDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNI

Query:  GNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAA
          ++TS+ + M++LI+  +DV+ L    I+E+ LG+D   ++ FN LG+    +  N      +   +  Y  + W+    +L +  F +PW   S IAA
Subjt:  GNEVTSFTVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAA

Query:  SLGFVLLFLQTFYQVYGYHHPSPR
            +  F Q+F+ V+ Y  P P+
Subjt:  SLGFVLLFLQTFYQVYGYHHPSPR

AT5G22540.1 Plant protein of unknown function (DUF247)3.4e-3728.26Show/hide
Query:  RIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVE
        RIPQ +  +   A++P ++S GPYHHGK HL  T++ K    RRF K        +     ++V ++S LE + +  +  D       D  + +++M+++
Subjt:  RIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVE

Query:  GCFMLEL--LLSDDRQWLRNDKDV-------ETIKRDMLLLENQLPMANLLHPQISRSDRV-------ITFEY---------------------------
        GCF+L L  ++S   ++   D  +        +I+ D+LLLENQ+P   LL      S  V       I FE+                           
Subjt:  GCFMLEL--LLSDDRQWLRNDKDV-------ETIKRDMLLLENQLPMANLLHPQISRSDRV-------ITFEY---------------------------

Query:  --------DVRDAQRHNSTRP--------MILPATRLQTAGITFE-RSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSF
                  R  + H+S            +L A +L   GI F+ R  T S+ D+S+    GVL IP + +DD T S   N +A+E+L  +  N +TS+
Subjt:  --------DVRDAQRHNSTRP--------MILPATRLQTAGITFE-RSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSF

Query:  TVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLL
           M  LI+ + D + L+ + ILE+  GT++  + F+  +G+  AL+     L   V+  + +Y S+ +H      ++T+F SPWT  S  AA L  +  
Subjt:  TVLMNDLIDVDKDVALLTSKTILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLL

Query:  FLQTFYQVYGYHHP
         LQ F+  Y Y  P
Subjt:  FLQTFYQVYGYHHP

AT5G22550.2 Plant protein of unknown function (DUF247)3.8e-3327.18Show/hide
Query:  EKAGMEYEREE-----LTRSYPIIGDKVIFGDPSSC-QRIPQHIRNVEPSAFDPHLLSFGPYHHG--KSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAI
        E A +  E EE     L +  P +  K   GD   C  RIP  ++ V   A+ P ++S GPYHH   K HL   E  K   L  F  +   N       +
Subjt:  EKAGMEYEREE-----LTRSYPIIGDKVIFGDPSSC-QRIPQHIRNVEPSAFDPHLLSFGPYHHG--KSHLIQTERLKLDGLRRFRKRCNCNCSNRFDAI

Query:  AKVVCSISMLEPLKKFYHQPDSDNWKDADEFS---FLKLMLVEGCFMLEL-LLSDDRQWLRNDKD--------VETIKRDMLLLENQLPM--------AN
          +V  +S LE       Q   D++ +  EFS    +K+ML++GCF+L L L+   +    N KD        + T++ D+LLLENQ+P+         +
Subjt:  AKVVCSISMLEPLKKFYHQPDSDNWKDADEFS---FLKLMLVEGCFMLEL-LLSDDRQWLRNDKD--------VETIKRDMLLLENQLPM--------AN

Query:  LLHPQISRSDRVIT-FEYDVRDA----QRHN---------------------STRP--------------------------------------------
         L P  S +      F+Y ++      ++HN                     ST P                                            
Subjt:  LLHPQISRSDRVIT-FEYDVRDA----QRHN---------------------STRP--------------------------------------------

Query:  --------MILPATRLQTAGITFERSETWSLS-DVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKT
                +I+ A +L+  GI F R E      D+SF  K G+++IP L  DD   + L N +A+E+ N +   E+TSF + M  LI+ + D   L  K 
Subjt:  --------MILPATRLQTAGITFERSETWSLS-DVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKT

Query:  ILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYHHP
        ILE+  GT E  + FF  +G+  + + S   L + V+  + +Y S+ +H  W     T+F +PWT +S  AA +  +L   Q F+  Y Y  P
Subjt:  ILEHGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYHHP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCACAATGGCGCGCATGTTTATCCCTGGAAGGGGTTTTGTTGATTCACATTATAAGGTTGGACTGGAGGAACTGGAAAGATTTCTGCAAGAGGAGCTAGAAAAAGC
TGGCATGGAATATGAGAGAGAGGAGTTAACAAGAAGTTATCCGATCATAGGCGACAAAGTTATATTTGGAGATCCTTCTTCATGCCAGAGAATACCACAGCACATCCGAA
ACGTTGAGCCGAGTGCTTTCGATCCTCACTTGCTGTCGTTTGGGCCATACCACCATGGCAAATCGCATTTGATTCAAACCGAACGTCTTAAACTCGACGGCCTTCGTAGA
TTTCGCAAACGTTGCAATTGCAACTGTTCGAACAGGTTTGACGCCATAGCGAAAGTCGTGTGCAGCATCAGCATGTTGGAACCTCTCAAGAAATTCTACCATCAGCCTGA
TTCTGATAATTGGAAAGACGCTGATGAATTTTCATTCTTGAAGCTCATGCTCGTGGAGGGTTGTTTCATGCTGGAACTGCTGTTGAGCGATGATCGCCAATGGCTCAGAA
ATGACAAGGACGTTGAGACTATAAAGCGGGATATGCTGCTGCTTGAGAATCAGTTGCCCATGGCGAATCTATTGCATCCTCAGATTTCGCGGTCAGATCGAGTTATTACT
TTTGAATATGATGTTAGGGATGCGCAGAGGCATAACTCCACCAGACCGATGATTTTGCCCGCAACACGGCTTCAAACGGCGGGGATCACATTCGAAAGGAGCGAAACTTG
GAGCCTTTCCGACGTGTCTTTCGACTTGAAACGAGGTGTGTTGAAGATCCCACACTTGAAGGTAGACGATGTCACGAAATCAGCGTTGTTTAATGTGATAGCATATGAGA
GACTAAACCCCAATATTGGCAATGAAGTGACCTCTTTCACTGTCCTAATGAATGATCTGATCGATGTGGACAAAGATGTGGCGCTACTAACCTCCAAAACGATATTGGAG
CATGGTTTAGGAACCGACGAAGCTGCGGCGGAGTTTTTCAATGTGCTGGGAAGAGGGGCGGCTTTGAACCGATCGAACTGCGACCTCTTTGATCCAGTTTACAGGTCCAT
TGAAAAGTATTGCAGCAAGCCATGGCATGAATGGTGGACAAGTCTTGTAAACACCAATTTCCAAAGCCCATGGACCATCATCTCTCTCATTGCCGCTTCTTTGGGTTTTG
TGCTTCTCTTCCTTCAAACTTTCTACCAAGTATATGGATACCACCACCCATCACCACGTGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCACAATGGCGCGCATGTTTATCCCTGGAAGGGGTTTTGTTGATTCACATTATAAGGTTGGACTGGAGGAACTGGAAAGATTTCTGCAAGAGGAGCTAGAAAAAGC
TGGCATGGAATATGAGAGAGAGGAGTTAACAAGAAGTTATCCGATCATAGGCGACAAAGTTATATTTGGAGATCCTTCTTCATGCCAGAGAATACCACAGCACATCCGAA
ACGTTGAGCCGAGTGCTTTCGATCCTCACTTGCTGTCGTTTGGGCCATACCACCATGGCAAATCGCATTTGATTCAAACCGAACGTCTTAAACTCGACGGCCTTCGTAGA
TTTCGCAAACGTTGCAATTGCAACTGTTCGAACAGGTTTGACGCCATAGCGAAAGTCGTGTGCAGCATCAGCATGTTGGAACCTCTCAAGAAATTCTACCATCAGCCTGA
TTCTGATAATTGGAAAGACGCTGATGAATTTTCATTCTTGAAGCTCATGCTCGTGGAGGGTTGTTTCATGCTGGAACTGCTGTTGAGCGATGATCGCCAATGGCTCAGAA
ATGACAAGGACGTTGAGACTATAAAGCGGGATATGCTGCTGCTTGAGAATCAGTTGCCCATGGCGAATCTATTGCATCCTCAGATTTCGCGGTCAGATCGAGTTATTACT
TTTGAATATGATGTTAGGGATGCGCAGAGGCATAACTCCACCAGACCGATGATTTTGCCCGCAACACGGCTTCAAACGGCGGGGATCACATTCGAAAGGAGCGAAACTTG
GAGCCTTTCCGACGTGTCTTTCGACTTGAAACGAGGTGTGTTGAAGATCCCACACTTGAAGGTAGACGATGTCACGAAATCAGCGTTGTTTAATGTGATAGCATATGAGA
GACTAAACCCCAATATTGGCAATGAAGTGACCTCTTTCACTGTCCTAATGAATGATCTGATCGATGTGGACAAAGATGTGGCGCTACTAACCTCCAAAACGATATTGGAG
CATGGTTTAGGAACCGACGAAGCTGCGGCGGAGTTTTTCAATGTGCTGGGAAGAGGGGCGGCTTTGAACCGATCGAACTGCGACCTCTTTGATCCAGTTTACAGGTCCAT
TGAAAAGTATTGCAGCAAGCCATGGCATGAATGGTGGACAAGTCTTGTAAACACCAATTTCCAAAGCCCATGGACCATCATCTCTCTCATTGCCGCTTCTTTGGGTTTTG
TGCTTCTCTTCCTTCAAACTTTCTACCAAGTATATGGATACCACCACCCATCACCACGTGGATAA
Protein sequenceShow/hide protein sequence
MSTMARMFIPGRGFVDSHYKVGLEELERFLQEELEKAGMEYEREELTRSYPIIGDKVIFGDPSSCQRIPQHIRNVEPSAFDPHLLSFGPYHHGKSHLIQTERLKLDGLRR
FRKRCNCNCSNRFDAIAKVVCSISMLEPLKKFYHQPDSDNWKDADEFSFLKLMLVEGCFMLELLLSDDRQWLRNDKDVETIKRDMLLLENQLPMANLLHPQISRSDRVIT
FEYDVRDAQRHNSTRPMILPATRLQTAGITFERSETWSLSDVSFDLKRGVLKIPHLKVDDVTKSALFNVIAYERLNPNIGNEVTSFTVLMNDLIDVDKDVALLTSKTILE
HGLGTDEAAAEFFNVLGRGAALNRSNCDLFDPVYRSIEKYCSKPWHEWWTSLVNTNFQSPWTIISLIAASLGFVLLFLQTFYQVYGYHHPSPRG