; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G14950 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G14950
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPlant protein of unknown function (DUF247)
Genome locationChr3:11121257..11124258
RNA-Seq ExpressionCSPI03G14950
SyntenyCSPI03G14950
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018572.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-17674.31Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+P+KVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++LVVKSESR RKCYE+EFY  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++Y  V D  EE M 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+ + DNF+VEA+HLLEMV+SCFLSTYP VETNDK KSKELP+ASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +D+EEQII  LKW+ E ++SLSGT+FAGIVQKL EKPDR VARWR+LR N
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI +  VL+VVVIF AAFF+AFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_004134502.1 UPF0481 protein At3g47200 [Cucumis sativus]7.5e-244100Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
        STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_008438948.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]9.9e-20484.95Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD STPVSHTINI GISQES +EESLLS IE KLEANCSS TI+KAPSEINIE RNVF+PAKVSIGPFHHGA HL+SVE LKW YLSTFL H  SLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIK+VVKSESR +KCYEK+F S DRDEFS IMLLDCCFILELLLRY+KRRF+R NDPVFTTPGLL+D++CDL+LLENQIPYFLL+EIY KVLD  EENM+
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLT RFFRTMVPGDRKF+GDNF+VEA+HLLEMV+SCFLSTYPPV+TNDKLKSKELPSASKLK AGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQ+T
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNLAAYEI Q GTD QVKSYL FMSHLLQSD DVKILCRKKIL AL+D+EEQIIE LKWIREQK+SLSGT+FAGIVQKL EKPDRSV RWRRLR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
         TAI VA  LMVVVIFGAAFFAAFS+LQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_022955709.1 UPF0481 protein At3g47200-like [Cucurbita moschata]3.5e-17774.54Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++LVVKSESR RKCYE+EFY  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++Y  V D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+ + DNF+VEA+HLLEMV+SCFLSTYP VETNDK KSKELP+ASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +D+EEQII  LKW+ E ++SLSGT+FAGIVQKL EKPDR VARWR+LR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI +  VL+VVVIF AAFF+AFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_038880915.1 UPF0481 protein At3g47200-like [Benincasa hispida]2.4e-18677.08Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        M+ S P SH+I+IS I+Q S QEESLLS +E KLEA CSS TI++AP++I+IED+NVF+PAKVSIGPFHHGAPHLE +E LKW YLSTFL H PSLTL D
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LI+LVVKSESR RKCYE EFY  D D+FSQ+MLLDCCFILELLLRY+K+RFRR NDPVF TPGLL+DLRCDL+LLENQIPYFLL+E+Y  V D LEENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDRKF+ DNF+VEA+HLLEMV+SCFLSTYP +ETNDK KS+ELPSASKLK AGIKFKNARS KSLLDIKFQ GVLEIPPLRVYQ+T
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNLAAYEI QFG+DLQVKSY+NFMSHLLQSDEDVKILCR+KIL  L+D+EEQII+ LKW+RE+K+SLSGT+FAGIVQKL EKPDR + +WR LR N
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI VA V +VVVIF AAFF+A S+LQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

TrEMBL top hitse value%identityAlignment
A0A0A0L821 Uncharacterized protein3.6e-244100Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
        STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A1S3AY98 UPF0481 protein At3g47200-like4.8e-20484.95Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD STPVSHTINI GISQES +EESLLS IE KLEANCSS TI+KAPSEINIE RNVF+PAKVSIGPFHHGA HL+SVE LKW YLSTFL H  SLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIK+VVKSESR +KCYEK+F S DRDEFS IMLLDCCFILELLLRY+KRRF+R NDPVFTTPGLL+D++CDL+LLENQIPYFLL+EIY KVLD  EENM+
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLT RFFRTMVPGDRKF+GDNF+VEA+HLLEMV+SCFLSTYPPV+TNDKLKSKELPSASKLK AGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQ+T
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNLAAYEI Q GTD QVKSYL FMSHLLQSD DVKILCRKKIL AL+D+EEQIIE LKWIREQK+SLSGT+FAGIVQKL EKPDRSV RWRRLR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
         TAI VA  LMVVVIFGAAFFAAFS+LQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A6J1CA62 UPF0481 protein At3g47200-like2.2e-15364.2Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        M+PS  +SH I+I  IS+E   EESLL  +E K+EA CSS  I+K P EI+I++R VF+PAKVSIGPFHHGAPHLES+E LKW+YL  FL H PS+ L D
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++ V KSESR RKCYE EF+  D  +F+++M+LDCCF+LELLLR++ +R +R NDPVFTTPGLL DL+ DL+LLENQIPYFLL E+Y KV D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVET-NDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQK
        L+DL  RFFRT+V G+R+ + DNF  +A+HLL++V+SCFLSTYP +ET N+K K+ ELP ASKLK+AGIKFKNA + KS+LDIKFQNG LEIP L V + 
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVET-NDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQK

Query:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRS
        TETIL+NL AYEICQ G+  QVKSY++FMSHLLQSDED+K+LC +KIL  L+ +E QII  LKW+R+QK +LSGT+FAG+VQKL E PDR +  WRRLR 
Subjt:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRS

Query:  NSTAISVATVLMVVVIFGAAFFAAFSVLQRRYK
        N  AI V  V  +VVIF AAFF+A S+LQRRY+
Subjt:  NSTAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A6J1GVU1 UPF0481 protein At3g47200-like1.7e-17774.54Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++LVVKSESR RKCYE+EFY  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++Y  V D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+ + DNF+VEA+HLLEMV+SCFLSTYP VETNDK KSKELP+ASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +D+EEQII  LKW+ E ++SLSGT+FAGIVQKL EKPDR VARWR+LR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI +  VL+VVVIF AAFF+AFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A6J1IWZ4 UPF0481 protein At3g47200-like9.4e-16074.17Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IERKLEA CSS TI++A +EI+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LI+LVVKSESR RKCYE+EFY  D ++FSQIMLLDCCFILELLLRY+K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++YA V D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+F+ DNF+VEA+HLLEM++SCFLSTYP +ETND  KS+ELPSASKLK AGIK KN +SSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSV
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +++EEQII  LKW+RE K+SLSGT+FAGIVQKL +K DR V
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSV

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.6e-1521.88Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCS-------SFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHK
        M P  P+      S   Q  F E   +  +++ L+A          + +I+  P  +     + + P +VSIGP+H   P L  +E+ K           
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCS-------SFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHK

Query:  PSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLD
         S    DL++ +   E + R CY K +   + +    IM +D  F++E L  Y+ R+     + +    G    LR D++++ENQIP F+L +     L+
Subjt:  PSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLD

Query:  GLE--ENMYLSDLTSRFFRTMVPGDRKFIGDNFI----VEANHLLEMVYSC-------------------------------------------------
          E  +++ LS LT    + + P   KF  D  +     E NH+L+ +Y                                                   
Subjt:  GLE--ENMYLSDLTSRFFRTMVPGDRKFIGDNFI----VEANHLLEMVYSC-------------------------------------------------

Query:  -------FLSTYP-------------------------------PVETNDKLKSKELPSASKLKAAGIKFK-NARSSKSLLDIKFQNGVLEIPPLRVYQK
                +S  P                                +E    ++   +PS S L  AG++FK  A  + S +     +G   +P + +   
Subjt:  -------FLSTYP-------------------------------PVETNDKLKSKELPSASKLKAAGIKFK-NARSSKSLLDIKFQNGVLEIPPLRVYQK

Query:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKIL-NALKDEEE
        TET+LRNL AYE       L    Y   ++ ++ S+EDV++L  + +L + LK ++E
Subjt:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKIL-NALKDEEE

Q9SD53 UPF0481 protein At3g472001.0e-3028.33Show/hide
Query:  ANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD--LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIML
        A   S  I++ P      +   + P  VSIGP+H+G  HL+ +++ K   L  FL       +++  L+K VV  E + RK Y +E  +    +   +M+
Subjt:  ANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD--LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIML

Query:  LDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLE
        LD CFIL + L           DP+F+ P LL  ++ DL+LLENQ+P+F+L+ +Y     G+  +  L+ +   FF+  +  +  +   +   +A HLL+
Subjt:  LDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLE

Query:  MVYSCFL--------STYPPVETN---------DKLKSKELP---SASKLKAAGIKFKNARSSK-SLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEI
        ++   FL        ++ P V+             + SK +P   SA +L+  GIKF+  RS + S+L+++ +   L+IP LR      +   N  A+E 
Subjt:  MVYSCFL--------STYPPVETN---------DKLKSKELP---SASKLKAAGIKFKNARSSK-SLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEI

Query:  CQFGTDL--QVKSYLNFMSHLLQSDEDVKILCRKK-ILNALKDEEEQIIEKLKWIR-----EQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNS--T
         QF TD   ++ +Y+ FM  LL ++EDV  L   K I+        ++ E  K I      E   S     F G+ +  K+  +   A +R     S  T
Subjt:  CQFGTDL--QVKSYLNFMSHLLQSDEDVKILCRKK-ILNALKDEEEQIIEKLKWIR-----EQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNS--T

Query:  AISVATVLMVVVI
         +S   VL V+++
Subjt:  AISVATVLMVVVI

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)3.5e-4231.85Show/hide
Query:  TIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILE
        +I++ P  +   +   + P  VSIGP+H G   L+ +E+ KW YL+  LT   +LTL+D +K V   E   R+CY +  +  D +EF+++M+LD CF+LE
Subjt:  TIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILE

Query:  LLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIY--AKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCF
        L  +        PNDP+     +L     D + LENQIP+F+LE ++   +  +  E N  L  L   FF  M+    + +     + A HLL+++ S F
Subjt:  LLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIY--AKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCF

Query:  L---STYPPVETN---DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQ
        +     + P  TN   +K+ S  + S SKL+ AGIK +  + ++S L ++F++G +E+P + V     + L N  AYE C     +   +Y   +  L  
Subjt:  L---STYPPVETN---DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQ

Query:  SDEDVKILCRKKIL
        + +DV+ LC + I+
Subjt:  SDEDVKILCRKKIL

AT3G50150.1 Plant protein of unknown function (DUF247)1.3e-3625.53Show/hide
Query:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL
        IY+ P  +   D+  +LP  VSIGP+HHG  HL  +E+ KW  ++  +  +    ++  I  + + E   R CY+      + +EF+++++LD CF+LE 
Subjt:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL

Query:  LLRYTKRRFRR----PNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVP-------GDRKF--------IGDN
        L + T + F++     NDPVF   GL++ ++ D+++LENQ+P F+L+ +         +   ++++  RFF+T++P        +R          +GDN
Subjt:  LLRYTKRRFRR----PNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVP-------GDRKF--------IGDN

Query:  FIVEANHLLEMVYSCFLSTY------PPVETNDKLKSKE--LPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQF
              H L++ +   + +        P E    ++ ++  +   ++L+ AG+ F   + +  L DI+F+NG L+IP L ++  T+++  NL A+E C  
Subjt:  FIVEANHLLEMVYSCFLSTY------PPVETNDKLKSKE--LPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQF

Query:  GTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKL-----KWIREQKDSLSGTFFAGIVQKLKEKPDRSVAR-WRRLRSNST-------
         +   + SY+ FM +L+ S +DV  L    I+      + ++ +       + I + KD        G + +L  + +R  +R W  L++          
Subjt:  GTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKL-----KWIREQKDSLSGTFFAGIVQKLKEKPDRSVAR-WRRLRSNST-------

Query:  ----AISVATVLMVVVIFGAAFFAAFS
            + S A +L+ +  F  +FFA ++
Subjt:  ----AISVATVLMVVVIFGAAFFAAFS

AT3G50180.1 Plant protein of unknown function (DUF247)1.2e-3726.63Show/hide
Query:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL
        IYK P  ++  D+  + P  VS+GP+HHG    +S+E  KW  ++  L  + +  ++  +  +++ E + R CYE     S  +EF++++LLD CFILEL
Subjt:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL

Query:  LLRYTKRRFR---RPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEAN----HLLEMV
        L    +   +     NDPVF   G ++ ++ D+++LENQ+P F+L  +  ++  G +    L +L  RFF  ++P       ++     +    H L++ 
Subjt:  LLRYTKRRFR---RPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEAN----HLLEMV

Query:  YSCFLSTYPPVETN-----DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSH
        +   L      + N     DK   + +P+ ++L+ AG KFK  ++ +   DIKF NG LEIP L ++  T+++  NL A+E C   +   + SY+ FM +
Subjt:  YSCFLSTYPPVETN-----DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSH

Query:  LLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQ-----KDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISV-------------ATVLMV
        L+ S ED+  L    I+        ++ +    + ++     KD         + +  K+         R+L S  T + +             A V+++
Subjt:  LLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQ-----KDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISV-------------ATVLMV

Query:  VVIFGAAFFAAFS
        ++ F  ++FAA++
Subjt:  VVIFGAAFFAAFS

AT3G50190.1 Plant protein of unknown function (DUF247)4.5e-3730.54Show/hide
Query:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL
        IY+ P      D+N + P  VS+GP+HHG  HL  ++  KW  ++  +  +    ++  I  + + E R R CYE     S  ++F+Q+++LD CF+L+L
Subjt:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL

Query:  LL----RYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSC
               ++K  + R NDPVF   G ++ +R D+++LENQ+P         K+ +  E N       ++FF  +   D+         E  H L++    
Subjt:  LL----RYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSC

Query:  FLS-------------TYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYL
         L              ++ PV   DK + K L   ++L+ AGIKFK  R S  L DI+F+NG LEIP L ++  T+++L NL A+E C   +  Q+ SY+
Subjt:  FLS-------------TYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYL

Query:  NFMSHLLQSDEDVKILCRKKILNALKDEEEQIIE
         F+ +L+ S+EDV+ L    I+    + + ++++
Subjt:  NFMSHLLQSDEDVKILCRKKILNALKDEEEQIIE

AT4G31980.1 unknown protein3.9e-4130.49Show/hide
Query:  ESLLSCIERKLEANCSSFT---IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEF
        ++L+  I+ KL    S  T   IYK P+++   + + + P  VS GP H G   L+++E  K+ YL +F+    S +L+DL++L    E   R CY ++ 
Subjt:  ESLLSCIERKLEANCSSFT---IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEF

Query:  YSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENM-YLSDLTSRFFRTMVPGDRKF
             DEF +++++D  F++ELLLR    R R  ND +F    ++ D+  D++L+ENQ+P+F+++EI+  +L+  ++    +  L  R F   +    + 
Subjt:  YSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENM-YLSDLTSRFFRTMVPGDRKF

Query:  IGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDL
          + FI E  H ++++ SC+L  +P       +K    P A++L  AG++FK A +S  LLDI F +GVL+IP + V   TE++ +N+  +E C+  ++ 
Subjt:  IGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDL

Query:  QVKSYLNFMSHLLQSDEDVKILCRKKIL
            Y+  +   ++S  D  +L    I+
Subjt:  QVKSYLNFMSHLLQSDEDVKILCRKKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATCCATCAACACCGGTTTCCCATACGATTAATATTTCTGGAATCTCACAAGAAAGCTTTCAGGAAGAGTCTCTCCTATCTTGCATTGAACGAAAATTGGAAGCCAA
TTGTTCATCCTTTACCATCTACAAAGCTCCAAGCGAAATCAATATTGAAGATAGAAACGTCTTCCTTCCCGCCAAAGTCTCAATCGGCCCTTTCCACCACGGCGCTCCAC
ATCTCGAATCTGTTGAAAAGCTCAAGTGGCATTACTTGTCCACTTTCTTGACGCACAAACCGTCTCTCACTTTACAGGATCTTATTAAACTCGTTGTAAAATCGGAGAGC
CGGGGGAGGAAATGCTATGAGAAAGAGTTTTATAGTTCGGATAGGGATGAGTTTTCACAGATCATGTTGCTTGATTGCTGCTTCATTCTTGAGTTACTTTTGCGATACAC
GAAAAGGAGGTTCAGGCGCCCGAATGATCCTGTTTTCACTACTCCTGGTTTGCTTTACGATTTAAGATGCGATTTGGTGTTGCTTGAAAATCAGATTCCATACTTCCTTC
TCGAAGAAATTTATGCAAAGGTGCTAGATGGGTTAGAGGAAAATATGTATCTCAGTGACCTGACCTCCCGATTCTTTAGAACTATGGTACCTGGAGACCGGAAATTTATC
GGCGATAACTTCATAGTGGAAGCAAATCATTTACTCGAAATGGTGTATTCTTGTTTCCTCTCCACCTATCCGCCAGTTGAGACGAACGACAAATTGAAATCGAAAGAATT
ACCTAGTGCGTCGAAGCTTAAAGCTGCCGGAATCAAATTCAAAAACGCCAGATCTTCAAAAAGCTTATTGGACATCAAATTTCAGAACGGCGTCCTTGAAATTCCGCCTC
TAAGAGTGTACCAGAAAACGGAGACAATTCTGAGGAATCTTGCTGCTTATGAAATCTGTCAATTCGGAACCGATCTGCAAGTGAAATCGTATCTCAATTTCATGAGCCAC
CTTCTCCAGAGCGATGAAGACGTGAAGATACTGTGCAGAAAGAAAATCCTGAATGCTTTGAAGGACGAGGAGGAACAGATTATTGAGAAACTGAAATGGATACGCGAGCA
GAAGGATAGCCTATCGGGAACGTTCTTTGCCGGCATTGTTCAAAAATTAAAAGAGAAGCCGGACCGATCCGTCGCGCGGTGGCGGAGGCTGAGAAGCAATTCGACGGCCA
TCAGCGTTGCCACCGTTTTGATGGTGGTTGTGATCTTCGGCGCGGCCTTTTTCGCTGCATTTTCAGTACTTCAGCGCCGTTACAAATAA
mRNA sequenceShow/hide mRNA sequence
ATTATTCATTACACTAACACTTAATATTTCTTTTTTCTCTCTTTAATTATTTCTTCACTTCATCATATTTTCAAGGCATCACCAGATATTGAAGTCCTTTTAAGCACCAC
CGGAAAGTCTCTGACCGTCGCTGGAAAGATGGATCCATCAACACCGGTTTCCCATACGATTAATATTTCTGGAATCTCACAAGAAAGCTTTCAGGAAGAGTCTCTCCTAT
CTTGCATTGAACGAAAATTGGAAGCCAATTGTTCATCCTTTACCATCTACAAAGCTCCAAGCGAAATCAATATTGAAGATAGAAACGTCTTCCTTCCCGCCAAAGTCTCA
ATCGGCCCTTTCCACCACGGCGCTCCACATCTCGAATCTGTTGAAAAGCTCAAGTGGCATTACTTGTCCACTTTCTTGACGCACAAACCGTCTCTCACTTTACAGGATCT
TATTAAACTCGTTGTAAAATCGGAGAGCCGGGGGAGGAAATGCTATGAGAAAGAGTTTTATAGTTCGGATAGGGATGAGTTTTCACAGATCATGTTGCTTGATTGCTGCT
TCATTCTTGAGTTACTTTTGCGATACACGAAAAGGAGGTTCAGGCGCCCGAATGATCCTGTTTTCACTACTCCTGGTTTGCTTTACGATTTAAGATGCGATTTGGTGTTG
CTTGAAAATCAGATTCCATACTTCCTTCTCGAAGAAATTTATGCAAAGGTGCTAGATGGGTTAGAGGAAAATATGTATCTCAGTGACCTGACCTCCCGATTCTTTAGAAC
TATGGTACCTGGAGACCGGAAATTTATCGGCGATAACTTCATAGTGGAAGCAAATCATTTACTCGAAATGGTGTATTCTTGTTTCCTCTCCACCTATCCGCCAGTTGAGA
CGAACGACAAATTGAAATCGAAAGAATTACCTAGTGCGTCGAAGCTTAAAGCTGCCGGAATCAAATTCAAAAACGCCAGATCTTCAAAAAGCTTATTGGACATCAAATTT
CAGAACGGCGTCCTTGAAATTCCGCCTCTAAGAGTGTACCAGAAAACGGAGACAATTCTGAGGAATCTTGCTGCTTATGAAATCTGTCAATTCGGAACCGATCTGCAAGT
GAAATCGTATCTCAATTTCATGAGCCACCTTCTCCAGAGCGATGAAGACGTGAAGATACTGTGCAGAAAGAAAATCCTGAATGCTTTGAAGGACGAGGAGGAACAGATTA
TTGAGAAACTGAAATGGATACGCGAGCAGAAGGATAGCCTATCGGGAACGTTCTTTGCCGGCATTGTTCAAAAATTAAAAGAGAAGCCGGACCGATCCGTCGCGCGGTGG
CGGAGGCTGAGAAGCAATTCGACGGCCATCAGCGTTGCCACCGTTTTGATGGTGGTTGTGATCTTCGGCGCGGCCTTTTTCGCTGCATTTTCAGTACTTCAGCGCCGTTA
CAAATAAGTTTCGGAGTCCAAAGACGTCTAAACCGTATCAATAAGTTTCCGTTTTGGCAATCACTCCCTAAACTTACCGATTTTTACGATTATAAACCCTTCACTGTTGC
AAAATTAATTTCGGACATTCATTTTTTTAATTAAATTATATCAAAAAATAATTTGAGTAAAAATAAAGAAACTACGGAAAATAGTTTCTTTTAATTCTTATTTTATTTAA
TCTTAAAACTTCTAATTTGCTCATTGGTGGAAAGACATTTATATTCATTTTTTACCTACAAAATTTTAGGAGGTAAATATGA
Protein sequenceShow/hide protein sequence
MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSES
RGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFI
GDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSH
LLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISVATVLMVVVIFGAAFFAAFSVLQRRYK