; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G578 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G578
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionPlant protein of unknown function (DUF247)
Genome locationctg1:11143947..11147037
RNA-Seq ExpressionCucsat.G578
SyntenyCucsat.G578
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018572.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]9.83e-22274.31Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+P+KVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++LVVKSESR RKCYE+EFY  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++Y  V D  EE M 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+ + DNF+VEA+HLLEMV+SCFLSTYP VETNDK KSKELP+ASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +D+EEQII  LKW+ E+ +SLSGT+FAGIVQKL EKPDR VARWR+LR N
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI +  VL+VVVIF AAFF+AFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_004134502.1 UPF0481 protein At3g47200 [Cucumis sativus]2.45e-310100Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
        STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_008438948.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo]1.24e-25784.95Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD STPVSHTINI GISQES +EESLLS IE KLEANCSS TI+KAPSEINIE RNVF+PAKVSIGPFHHGA HL+SVE LKW YLSTFL H  SLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIK+VVKSESR +KCYEK+F S DRDEFS IMLLDCCFILELLLRY+KRRF+R NDPVFTTPGLL+D++CDL+LLENQIPYFLL+EIY KVLD  EENM+
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLT RFFRTMVPGDRKF+GDNF+VEA+HLLEMV+SCFLSTYPPV+TNDKLKSKELPSASKLK AGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQ+T
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNLAAYEI Q GTD QVKSYL FMSHLLQSD DVKILCRKKIL AL+D+EEQIIE LKWIREQK+SLSGT+FAGIVQKL EKPDRSV RWRRLR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
         TAI VA  LMVVVIFGAAFFAAFS+LQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_022955709.1 UPF0481 protein At3g47200-like [Cucurbita moschata]1.20e-22274.54Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++LVVKSESR RKCYE+EFY  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++Y  V D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+ + DNF+VEA+HLLEMV+SCFLSTYP VETNDK KSKELP+ASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +D+EEQII  LKW+ E+ +SLSGT+FAGIVQKL EKPDR VARWR+LR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI +  VL+VVVIF AAFF+AFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

XP_038880915.1 UPF0481 protein At3g47200-like [Benincasa hispida]8.43e-23577.08Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        M+ S P SH+I+IS I+Q S QEESLLS +E KLEA CSS TI++AP++I+IED+NVF+PAKVSIGPFHHGAPHLE +E LKW YLSTFL H PSLTL D
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LI+LVVKSESR RKCYE EFY  D D+FSQ+MLLDCCFILELLLRY+K+RFRR NDPVF TPGLL+DLRCDL+LLENQIPYFLL+E+Y  V D LEENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDRKF+ DNF+VEA+HLLEMV+SCFLSTYP +ETNDK KS+ELPSASKLK AGIKFKNARS KSLLDIKFQ GVLEIPPLRVYQ+T
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNLAAYEI QFG+DLQVKSY+NFMSHLLQSDEDVKILCR+KIL  L+D+EEQII+ LKW+RE+K+SLSGT+FAGIVQKL EKPDR + +WR LR N
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI VA V +VVVIF AAFF+A S+LQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

TrEMBL top hitse value%identityAlignment
A0A0A0L821 Uncharacterized protein1.18e-310100Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
        STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A1S3AY98 UPF0481 protein At3g47200-like6.02e-25884.95Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD STPVSHTINI GISQES +EESLLS IE KLEANCSS TI+KAPSEINIE RNVF+PAKVSIGPFHHGA HL+SVE LKW YLSTFL H  SLTLQD
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LIK+VVKSESR +KCYEK+F S DRDEFS IMLLDCCFILELLLRY+KRRF+R NDPVFTTPGLL+D++CDL+LLENQIPYFLL+EIY KVLD  EENM+
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        LSDLT RFFRTMVPGDRKF+GDNF+VEA+HLLEMV+SCFLSTYPPV+TNDKLKSKELPSASKLK AGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQ+T
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNLAAYEI Q GTD QVKSYL FMSHLLQSD DVKILCRKKIL AL+D+EEQIIE LKWIREQK+SLSGT+FAGIVQKL EKPDRSV RWRRLR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
         TAI VA  LMVVVIFGAAFFAAFS+LQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A6J1CA62 UPF0481 protein At3g47200-like1.82e-19164.2Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        M+PS  +SH I+I  IS+E   EESLL  +E K+EA CSS  I+K P EI+I++R VF+PAKVSIGPFHHGAPHLES+E LKW+YL  FL H PS+ L D
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++ V KSESR RKCYE EF+  D  +F+++M+LDCCF+LELLLR++ +R +R NDPVFTTPGLL DL+ DL+LLENQIPYFLL E+Y KV D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVET-NDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQK
        L+DL  RFFRT+V G+R+ + DNF  +A+HLL++V+SCFLSTYP +ET N+K K+ ELP ASKLK+AGIKFKNA + KS+LDIKFQNG LEIP L V + 
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVET-NDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQK

Query:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRS
        TETIL+NL AYEICQ G+  QVKSY++FMSHLLQSDED+K+LC +KIL  L+ +E QII  LKW+R+QK +LSGT+FAG+VQKL E PDR +  WRRLR 
Subjt:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRS

Query:  NSTAISVATVLMVVVIFGAAFFAAFSVLQRRYK
        N  AI V  V  +VVIF AAFF+A S+LQRRY+
Subjt:  NSTAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A6J1GVU1 UPF0481 protein At3g47200-like5.82e-22374.54Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IE KLEA CSS TI++AP+EI+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        L++LVVKSESR RKCYE+EFY  D D+FSQIMLLDCCFILELLLR++K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++Y  V D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+ + DNF+VEA+HLLEMV+SCFLSTYP VETNDK KSKELP+ASKLK AGIK KNARSSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +D+EEQII  LKW+ E+ +SLSGT+FAGIVQKL EKPDR VARWR+LR  
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSN

Query:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK
          AI +  VL+VVVIF AAFF+AFSVLQRRYK
Subjt:  STAISVATVLMVVVIFGAAFFAAFSVLQRRYK

A0A6J1IWZ4 UPF0481 protein At3g47200-like1.18e-20074.17Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD
        MD S  +SH+I++   SQ S +EESLLS IERKLEA CSS TI++A +EI+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLS FL + PS+ LQ 
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD

Query:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY
        LI+LVVKSESR RKCYE+EFY  D ++FSQIMLLDCCFILELLLRY+K+R RR ND VFTTPGLL+DLRCDL+LLENQIPYFLL+++YA V D  EENM 
Subjt:  LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMY

Query:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT
        L+DLT RFF+TMV GDR+F+ DNF+VEA+HLLEM++SCFLSTYP +ETND  KS+ELPSASKLK AGIK KN +SSKSLLDIKFQNGVLEIPPL+VYQKT
Subjt:  LSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKT

Query:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSV
        E ILRNL AYEI Q G+D QVKSY+NFMSHLLQSD+DVKIL R+KILN  +++EEQII  LKW+RE K+SLSGT+FAGIVQKL +K DR V
Subjt:  ETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSV

SwissProt top hitse value%identityAlignment
P0C897 Putative UPF0481 protein At3g026451.7e-1521.88Show/hide
Query:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCS-------SFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHK
        M P  P+      S   Q  F E   +  +++ L+A          + +I+  P  +     + + P +VSIGP+H   P L  +E+ K           
Subjt:  MDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCS-------SFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHK

Query:  PSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLD
         S    DL++ +   E + R CY K +   + +    IM +D  F++E L  Y+ R+     + +    G    LR D++++ENQIP F+L +     L+
Subjt:  PSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLD

Query:  GLE--ENMYLSDLTSRFFRTMVPGDRKFIGDNFI----VEANHLLEMVYSC-------------------------------------------------
          E  +++ LS LT    + + P   KF  D  +     E NH+L+ +Y                                                   
Subjt:  GLE--ENMYLSDLTSRFFRTMVPGDRKFIGDNFI----VEANHLLEMVYSC-------------------------------------------------

Query:  -------FLSTYP-------------------------------PVETNDKLKSKELPSASKLKAAGIKFK-NARSSKSLLDIKFQNGVLEIPPLRVYQK
                +S  P                                +E    ++   +PS S L  AG++FK  A  + S +     +G   +P + +   
Subjt:  -------FLSTYP-------------------------------PVETNDKLKSKELPSASKLKAAGIKFK-NARSSKSLLDIKFQNGVLEIPPLRVYQK

Query:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKIL-NALKDEEE
        TET+LRNL AYE       L    Y   ++ ++ S+EDV++L  + +L + LK ++E
Subjt:  TETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKIL-NALKDEEE

Q9SD53 UPF0481 protein At3g472001.1e-3028.33Show/hide
Query:  ANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD--LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIML
        A   S  I++ P      +   + P  VSIGP+H+G  HL+ +++ K   L  FL       +++  L+K VV  E + RK Y +E  +    +   +M+
Subjt:  ANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQD--LIKLVVKSESRGRKCYEKEFYSSDRDEFSQIML

Query:  LDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLE
        LD CFIL + L           DP+F+ P LL  ++ DL+LLENQ+P+F+L+ +Y     G+  +  L+ +   FF+  +  +  +   +   +A HLL+
Subjt:  LDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLE

Query:  MVYSCFL--------STYPPVETN---------DKLKSKELP---SASKLKAAGIKFKNARSSK-SLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEI
        ++   FL        ++ P V+             + SK +P   SA +L+  GIKF+  RS + S+L+++ +   L+IP LR      +   N  A+E 
Subjt:  MVYSCFL--------STYPPVETN---------DKLKSKELP---SASKLKAAGIKFKNARSSK-SLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEI

Query:  CQFGTDL--QVKSYLNFMSHLLQSDEDVKILCRKK-ILNALKDEEEQIIEKLKWIR-----EQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNS--T
         QF TD   ++ +Y+ FM  LL ++EDV  L   K I+        ++ E  K I      E   S     F G+ +  K+  +   A +R     S  T
Subjt:  CQFGTDL--QVKSYLNFMSHLLQSDEDVKILCRKK-ILNALKDEEEQIIEKLKWIR-----EQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNS--T

Query:  AISVATVLMVVVI
         +S   VL V+++
Subjt:  AISVATVLMVVVI

Arabidopsis top hitse value%identityAlignment
AT2G36430.1 Plant protein of unknown function (DUF247)4.9e-4231.02Show/hide
Query:  AGKMDPSTPVSHT---INISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKP
        A +  PST    +   I+I  + ++  +   LLS    K    CS   I++ P  +   +   + P  VSIGP+H G   L+ +E+ KW YL+  LT   
Subjt:  AGKMDPSTPVSHT---INISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKP

Query:  SLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIY--AKVL
        +LTL+D +K V   E   R+CY +  +  D +EF+++M+LD CF+LEL  +        PNDP+     +L     D + LENQIP+F+LE ++   +  
Subjt:  SLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIY--AKVL

Query:  DGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFL---STYPPVETN---DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQN
        +  E N  L  L   FF  M+    + +     + A HLL+++ S F+     + P  TN   +K+ S  + S SKL+ AGIK +  + ++S L ++F++
Subjt:  DGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFL---STYPPVETN---DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQN

Query:  GVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKIL
        G +E+P + V     + L N  AYE C     +   +Y   +  L  + +DV+ LC + I+
Subjt:  GVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKIL

AT3G50150.1 Plant protein of unknown function (DUF247)1.8e-3625.53Show/hide
Query:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL
        IY+ P  +   D+  +LP  VSIGP+HHG  HL  +E+ KW  ++  +  +    ++  I  + + E   R CY+      + +EF+++++LD CF+LE 
Subjt:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL

Query:  LLRYTKRRFRR----PNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVP-------GDRKF--------IGDN
        L + T + F++     NDPVF   GL++ ++ D+++LENQ+P F+L+ +         +   ++++  RFF+T++P        +R          +GDN
Subjt:  LLRYTKRRFRR----PNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVP-------GDRKF--------IGDN

Query:  FIVEANHLLEMVYSCFLSTY------PPVETNDKLKSKE--LPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQF
              H L++ +   + +        P E    ++ ++  +   ++L+ AG+ F   + +  L DI+F+NG L+IP L ++  T+++  NL A+E C  
Subjt:  FIVEANHLLEMVYSCFLSTY------PPVETNDKLKSKE--LPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQF

Query:  GTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKL-----KWIREQKDSLSGTFFAGIVQKLKEKPDRSVAR-WRRLRSNST-------
         +   + SY+ FM +L+ S +DV  L    I+      + ++ +       + I + KD        G + +L  + +R  +R W  L++          
Subjt:  GTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKL-----KWIREQKDSLSGTFFAGIVQKLKEKPDRSVAR-WRRLRSNST-------

Query:  ----AISVATVLMVVVIFGAAFFAAFS
            + S A +L+ +  F  +FFA ++
Subjt:  ----AISVATVLMVVVIFGAAFFAAFS

AT3G50180.1 Plant protein of unknown function (DUF247)2.1e-3726.63Show/hide
Query:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL
        IYK P  ++  D+  + P  VS+GP+HHG    +S+E  KW  ++  L  + +  ++  +  +++ E + R CYE     S  +EF++++LLD CFILEL
Subjt:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL

Query:  LLRYTKRRFR---RPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEAN----HLLEMV
        L    +   +     NDPVF   G ++ ++ D+++LENQ+P F+L  +  ++  G +    L +L  RFF  ++P       ++     +    H L++ 
Subjt:  LLRYTKRRFR---RPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEAN----HLLEMV

Query:  YSCFLSTYPPVETN-----DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSH
        +   L      + N     DK   + +P+ ++L+ AG KFK  ++ +   DIKF NG LEIP L ++  T+++  NL A+E C   +   + SY+ FM +
Subjt:  YSCFLSTYPPVETN-----DKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYLNFMSH

Query:  LLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQ-----KDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISV-------------ATVLMV
        L+ S ED+  L    I+        ++ +    + ++     KD         + +  K+         R+L S  T + +             A V+++
Subjt:  LLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQ-----KDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISV-------------ATVLMV

Query:  VVIFGAAFFAAFS
        ++ F  ++FAA++
Subjt:  VVIFGAAFFAAFS

AT3G50190.1 Plant protein of unknown function (DUF247)4.8e-3730.54Show/hide
Query:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL
        IY+ P      D+N + P  VS+GP+HHG  HL  ++  KW  ++  +  +    ++  I  + + E R R CYE     S  ++F+Q+++LD CF+L+L
Subjt:  IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILEL

Query:  LL----RYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSC
               ++K  + R NDPVF   G ++ +R D+++LENQ+P         K+ +  E N       ++FF  +   D+         E  H L++    
Subjt:  LL----RYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSC

Query:  FLS-------------TYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYL
         L              ++ PV   DK + K L   ++L+ AGIKFK  R S  L DI+F+NG LEIP L ++  T+++L NL A+E C   +  Q+ SY+
Subjt:  FLS-------------TYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDLQVKSYL

Query:  NFMSHLLQSDEDVKILCRKKILNALKDEEEQIIE
         F+ +L+ S+EDV+ L    I+    + + ++++
Subjt:  NFMSHLLQSDEDVKILCRKKILNALKDEEEQIIE

AT4G31980.1 unknown protein4.2e-4130.49Show/hide
Query:  ESLLSCIERKLEANCSSFT---IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEF
        ++L+  I+ KL    S  T   IYK P+++   + + + P  VS GP H G   L+++E  K+ YL +F+    S +L+DL++L    E   R CY ++ 
Subjt:  ESLLSCIERKLEANCSSFT---IYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKLKWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEF

Query:  YSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENM-YLSDLTSRFFRTMVPGDRKF
             DEF +++++D  F++ELLLR    R R  ND +F    ++ D+  D++L+ENQ+P+F+++EI+  +L+  ++    +  L  R F   +    + 
Subjt:  YSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKVLDGLEENM-YLSDLTSRFFRTMVPGDRKF

Query:  IGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDL
          + FI E  H ++++ SC+L  +P       +K    P A++L  AG++FK A +S  LLDI F +GVL+IP + V   TE++ +N+  +E C+  ++ 
Subjt:  IGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTETILRNLAAYEICQFGTDL

Query:  QVKSYLNFMSHLLQSDEDVKILCRKKIL
            Y+  +   ++S  D  +L    I+
Subjt:  QVKSYLNFMSHLLQSDEDVKILCRKKIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TCTAACAATGGCGGCTGTATAAAGGCATCACCAGATATTGAAGTCCTTTTAAGCACCACCGGAAAGTCTCTGACCGTCGCTGGAAAGATGGATCCATCAACACCGGTTTC
CCATACGATTAATATTTCTGGAATCTCACAAGAAAGCTTTCAGGAAGAGTCTCTCCTATCTTGCATTGAACGAAAATTGGAAGCCAATTGTTCATCCTTTACCATCTACA
AAGCTCCAAGCGAAATCAATATTGAAGATAGAAACGTCTTCCTTCCCGCCAAAGTCTCAATCGGCCCTTTCCACCACGGCGCTCCACATCTCGAATCTGTTGAAAAGCTC
AAGTGGCATTACTTGTCCACTTTCTTGACGCACAAACCGTCTCTCACTTTACAGGATCTTATTAAACTCGTTGTAAAATCGGAGAGCCGGGGGAGGAAATGCTATGAGAA
AGAGTTTTATAGTTCGGATAGGGATGAGTTTTCACAGATCATGTTGCTTGATTGCTGCTTCATTCTTGAGTTACTTTTGCGATACACGAAAAGGAGGTTCAGGCGCCCGA
ATGATCCTGTTTTCACTACTCCTGGTTTGCTTTACGATTTAAGATGCGATTTGGTGTTGCTTGAAAATCAGATTCCATACTTCCTTCTCGAAGAAATTTATGCAAAGGTG
CTAGATGGGTTAGAGGAAAATATGTATCTCAGTGACCTGACCTCCCGATTCTTTAGAACTATGGTACCTGGAGACCGGAAATTTATCGGCGATAACTTCATAGTGGAAGC
AAATCATTTACTCGAAATGGTGTATTCTTGTTTCCTCTCCACCTATCCGCCAGTTGAGACGAACGACAAATTGAAATCGAAAGAATTACCTAGTGCGTCGAAGCTTAAAG
CTGCCGGAATCAAATTCAAAAACGCCAGATCTTCAAAAAGCTTATTGGACATCAAATTTCAGAACGGCGTCCTTGAAATTCCGCCTCTAAGAGTGTACCAGAAAACGGAG
ACAATTCTGAGGAATCTTGCTGCTTATGAAATCTGTCAATTCGGAACCGATCTGCAAGTGAAATCGTATCTCAATTTCATGAGCCACCTTCTCCAGAGCGATGAAGACGT
GAAGATACTGTGCAGAAAGAAAATCCTGAATGCTTTGAAGGACGAGGAGGAACAGATTATTGAGAAACTGAAATGGATACGCGAGCAGAAGGATAGCCTATCGGGAACGT
TCTTTGCCGGCATTGTTCAAAAATTAAAAGAGAAGCCGGACCGATCCGTCGCGCGGTGGCGGAGGCTGAGAAGCAATTCGACGGCCATCAGCGTTGCCACCGTTTTGATG
GTGGTTGTGATCTTCGGCGCGGCCTTTTTCGCTGCATTTTCAGTACTTCAGCGCCGTTACAAATAA
mRNA sequenceShow/hide mRNA sequence
TCTAACAATGGCGGCTGTATAAAGGCATCACCAGATATTGAAGTCCTTTTAAGCACCACCGGAAAGTCTCTGACCGTCGCTGGAAAGATGGATCCATCAACACCGGTTTC
CCATACGATTAATATTTCTGGAATCTCACAAGAAAGCTTTCAGGAAGAGTCTCTCCTATCTTGCATTGAACGAAAATTGGAAGCCAATTGTTCATCCTTTACCATCTACA
AAGCTCCAAGCGAAATCAATATTGAAGATAGAAACGTCTTCCTTCCCGCCAAAGTCTCAATCGGCCCTTTCCACCACGGCGCTCCACATCTCGAATCTGTTGAAAAGCTC
AAGTGGCATTACTTGTCCACTTTCTTGACGCACAAACCGTCTCTCACTTTACAGGATCTTATTAAACTCGTTGTAAAATCGGAGAGCCGGGGGAGGAAATGCTATGAGAA
AGAGTTTTATAGTTCGGATAGGGATGAGTTTTCACAGATCATGTTGCTTGATTGCTGCTTCATTCTTGAGTTACTTTTGCGATACACGAAAAGGAGGTTCAGGCGCCCGA
ATGATCCTGTTTTCACTACTCCTGGTTTGCTTTACGATTTAAGATGCGATTTGGTGTTGCTTGAAAATCAGATTCCATACTTCCTTCTCGAAGAAATTTATGCAAAGGTG
CTAGATGGGTTAGAGGAAAATATGTATCTCAGTGACCTGACCTCCCGATTCTTTAGAACTATGGTACCTGGAGACCGGAAATTTATCGGCGATAACTTCATAGTGGAAGC
AAATCATTTACTCGAAATGGTGTATTCTTGTTTCCTCTCCACCTATCCGCCAGTTGAGACGAACGACAAATTGAAATCGAAAGAATTACCTAGTGCGTCGAAGCTTAAAG
CTGCCGGAATCAAATTCAAAAACGCCAGATCTTCAAAAAGCTTATTGGACATCAAATTTCAGAACGGCGTCCTTGAAATTCCGCCTCTAAGAGTGTACCAGAAAACGGAG
ACAATTCTGAGGAATCTTGCTGCTTATGAAATCTGTCAATTCGGAACCGATCTGCAAGTGAAATCGTATCTCAATTTCATGAGCCACCTTCTCCAGAGCGATGAAGACGT
GAAGATACTGTGCAGAAAGAAAATCCTGAATGCTTTGAAGGACGAGGAGGAACAGATTATTGAGAAACTGAAATGGATACGCGAGCAGAAGGATAGCCTATCGGGAACGT
TCTTTGCCGGCATTGTTCAAAAATTAAAAGAGAAGCCGGACCGATCCGTCGCGCGGTGGCGGAGGCTGAGAAGCAATTCGACGGCCATCAGCGTTGCCACCGTTTTGATG
GTGGTTGTGATCTTCGGCGCGGCCTTTTTCGCTGCATTTTCAGTACTTCAGCGCCGTTACAAATAA
Protein sequenceShow/hide protein sequence
SNNGGCIKASPDIEVLLSTTGKSLTVAGKMDPSTPVSHTINISGISQESFQEESLLSCIERKLEANCSSFTIYKAPSEINIEDRNVFLPAKVSIGPFHHGAPHLESVEKL
KWHYLSTFLTHKPSLTLQDLIKLVVKSESRGRKCYEKEFYSSDRDEFSQIMLLDCCFILELLLRYTKRRFRRPNDPVFTTPGLLYDLRCDLVLLENQIPYFLLEEIYAKV
LDGLEENMYLSDLTSRFFRTMVPGDRKFIGDNFIVEANHLLEMVYSCFLSTYPPVETNDKLKSKELPSASKLKAAGIKFKNARSSKSLLDIKFQNGVLEIPPLRVYQKTE
TILRNLAAYEICQFGTDLQVKSYLNFMSHLLQSDEDVKILCRKKILNALKDEEEQIIEKLKWIREQKDSLSGTFFAGIVQKLKEKPDRSVARWRRLRSNSTAISVATVLM
VVVIFGAAFFAAFSVLQRRYK