CuGenDBv2

Clc05G08170 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc05G08170
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Plant protein of unknown function (DUF247)
Genome location: ClcChr05:6222353..6223656
RNA-Seq Expression: Clc05G08170
Synteny: Clc05G08170
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR004158 - Protein of unknown function DUF247, plant
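The genome location field packs a chromosome name and two coordinates into one string. The sketch below shows one way to split it in Python. It assumes the coordinates are 1-based and inclusive, which this page does not state; `GenomeLocation` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    chromosome: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Span in base pairs under the inclusive-coordinate assumption.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a 'chromosome:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chromosome, start, end = match.groups()
    return GenomeLocation(chromosome, int(start), int(end))


loc = parse_location("ClcChr05:6222353..6223656")
print(loc.chromosome, loc.start, loc.end, loc.length)
```

Under the inclusive assumption, the locus of Clc05G08170 spans 1304 bp on ClcChr05.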


Homology
GenBank top hits (e value, % identity, alignment)
KAG7018572.1 UPF0481 protein, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 5.9e-182; % identity: 78.8)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSSR LSHSI++PA SQGSS EESLLSSIEGKLEAF SSITIFRAPN+ISIEDRNVFVP+KVSIGPFHHGAPHLESMENLKW YLS FLK+  NPS  L
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        + L+ELV                         +MLLDCCFILELLLR+ KKR RRRND VFTTPGLLFDLRCDLMLLENQIPYFLL DVYENVQDP EE 
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        MSLNDLTF+FFKT+V GDR+ VYDNFMVEADHLLEMVHSCFLSTYPR+ETN+KS S ELP+ASKLKTAGIK KNARS KSLLDIKF NGVLEIPPL+VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        +TE ILRNL AYEI Q G+D QVKSY+NFMSHLLQS++DVKIL RRKIL+D EDDEEQII+NLKW+ E +ESLSGTYFAGIVQKLNEKPDR VARWR+LR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        RNPVAIG+ AV VVVVIFVA FFSAF +LQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

XP_008438948.1 PREDICTED: UPF0481 protein At3g47200-like [Cucumis melo] (e value: 3.9e-178; % identity: 76.73)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSS P+SH+INIP ISQ SS EESLLSSIEGKLEA  SS+TIF+AP++I+IE RNVFVPAKVSIGPFHHGA HL+S+ENLKWRYLSTFLKH  N S TL
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        +DLI++V                         +MLLDCCFILELLLRY K+RF+RRNDPVFTTPGLLFD++CDLMLLENQIPYFLL+++YE V DP EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        M L+DLTF+FF+T+V GDRKF+ DNF+VEADHLLEMVHSCFLSTYP ++TN+K  S ELPSASKLKTAGIKFKNARS KSLLDIKF NGVLEIPPL VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        QTEAILRNLAAYEIRQ GTD QVKSYL FMSHLLQS+ DVKILCR+KIL  LEDDEEQII+NLKW+REQKESLSGTYFAGIVQKLNEKPDRSV RWRRLR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        R P AIGVAA  +VVVIF A FF+AF ILQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

XP_022955709.1 UPF0481 protein At3g47200-like [Cucurbita moschata] (e value: 1.2e-182; % identity: 79.03)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSSR LSHSI++PA SQGSS EESLLSSIEGKLEAF SSITIFRAPN+ISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKW YLS FLK+  NPS  L
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        + L+ELV                         +MLLDCCFILELLLR+ KKR RRRND VFTTPGLLFDLRCDLMLLENQIPYFLL DVYENVQDP EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        MSLNDLTF+FFKT+V GDR+ VYDNFMVEADHLLEMVHSCFLSTYPR+ETN+KS S ELP+ASKLKTAGIK KNARS KSLLDIKF NGVLEIPPL+VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        +TE ILRNL AYEI Q G+D QVKSY+NFMSHLLQS++DVKIL RRKIL+D EDDEEQII+NLKW+ E +ESLSGTYFAGIVQKLNEKPDR VARWR+LR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        R PVAIG+ AV VVVVIFVA FFSAF +LQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

XP_023526431.1 UPF0481 protein At3g47200-like [Cucurbita pepo subsp. pepo] (e value: 1.8e-183; % identity: 79.72)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSSR LSHSI++PA SQGSS EESLLSSIEGKLEAF SSITIFRAPN+ISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKW YLS FLK+  NPS  L
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        + L+ELV                         +MLLDCCFILELLLR+ KKR RRRND VFTTPGLLFDLRCDLMLLENQIPYFLL DVYENVQDP EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        MSLNDLTF+FFKT+VVGDR+ VYDNF VEADHLLEMVHSCFLSTYPR+ETN+KS S ELPSASKLKTAGIK KNARS KSLLDIKF NGVLEIPPL+VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        +TE ILRNL AYEI Q G+D QVKSY+NFMSHLLQS++DVKIL RRKIL D EDDEEQII+NLKW+ E KESLSGTYFAGIVQKLNEKPDR VARWR+LR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        RNPVAIG+ AV VVVVIFVA FFSAF +LQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

XP_038880915.1 UPF0481 protein At3g47200-like [Benincasa hispida] (e value: 3.4e-198; % identity: 83.87)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        M+SS+P SHSI+I AI+QGSS EESLLSS+EGKLEAF SSITIFRAPNDISIED+NVFVPAKVSIGPFHHGAPHLE MENLKWRYLSTFLKH  NPS TL
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
         DLIELV                         MMLLDCCFILELLLRY KKRFRR NDPVF TPGLLFDLRCDLMLLENQIPYFLL++VYENVQDPLEEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        MSLNDLTF+FFKT+V GDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETN+KS S ELPSASKLKTAGIKFKNARSPKSLLDIKF  GVLEIPPL VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        QTEAILRNLAAYEIRQFG+DLQVKSY+NFMSHLLQS+EDVKILCRRKIL DLEDDEEQIIQNLKW+RE+KESLSGTYFAGIVQKLNEKPDR + +WR LR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        RNPVAIGVAAVWVVVVIFVA FFSA  +LQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
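In BLAST-style alignments such as those above, the middle line of each triplet repeats the residue where query and subject agree, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts those columns for a single segment in Python. `midline_stats` is a hypothetical helper, and the input is the last segment of the XP_038880915.1 alignment copied from this page.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment segment.

    The midline is padded to the query length because trailing blanks
    (mismatches at the end of a segment) are easily lost when copying.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative changes
    return identities, positives, len(query)


query = "RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK"
midline = "RNPVAIGVAAVWVVVVIFVA FFSA  +LQRR+K"

identities, positives, columns = midline_stats(query, midline)
print(f"identities {identities}/{columns}, positives {positives}/{columns}")
```

The figures apply to this 34-column segment only. The % identity column on this page covers the whole alignment, so the two values are not expected to match.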

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L821 Uncharacterized protein (e value: 5.2e-168; % identity: 72.81)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MD S P+SH+INI  ISQ S  EESLLS IE KLEA  SS TI++AP++I+IEDRNVF+PAKVSIGPFHHGAPHLES+E LKW YLSTFL H   PS TL
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        +DLI+LV                         +MLLDCCFILELLLRY K+RFRR NDPVFTTPGLL+DLRCDL+LLENQIPYFLL ++Y  V D LEEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        M L+DLT +FF+T+V GDRKF+ DNF+VEA+HLLEMV+SCFLSTYP +ETN+K  S ELPSASKLK AGIKFKNARS KSLLDIKF NGVLEIPPL VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        +TE ILRNLAAYEI QFGTDLQVKSYLNFMSHLLQS+EDVKILCR+KIL+ L+D+EEQII+ LKW+REQK+SLSGT+FAGIVQKL EKPDRSVARWRRLR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
         N  AI VA V +VVVIF A FF+AF +LQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

A0A1S3AY98 UPF0481 protein At3g47200-like (e value: 1.9e-178; % identity: 76.73)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSS P+SH+INIP ISQ SS EESLLSSIEGKLEA  SS+TIF+AP++I+IE RNVFVPAKVSIGPFHHGA HL+S+ENLKWRYLSTFLKH  N S TL
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        +DLI++V                         +MLLDCCFILELLLRY K+RF+RRNDPVFTTPGLLFD++CDLMLLENQIPYFLL+++YE V DP EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        M L+DLTF+FF+T+V GDRKF+ DNF+VEADHLLEMVHSCFLSTYP ++TN+K  S ELPSASKLKTAGIKFKNARS KSLLDIKF NGVLEIPPL VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        QTEAILRNLAAYEIRQ GTD QVKSYL FMSHLLQS+ DVKILCR+KIL  LEDDEEQII+NLKW+REQKESLSGTYFAGIVQKLNEKPDRSV RWRRLR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        R P AIGVAA  +VVVIF A FF+AF ILQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

A0A6J1CA62 UPF0481 protein At3g47200-like (e value: 1.3e-155; % identity: 67.13)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        M+ SR LSH+I+IPAIS+  S EESLL S+E K+EAF SSI IF+ P++ISI++R VFVPAKVSIGPFHHGAPHLESME+LKW YL  FLKH  NPS  L
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
         DL+E V                         MM+LDCCF+LELLLR+  KR +RRNDPVFTTPGLL DL+ DL+LLENQIPYFLL +VYE VQD  EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMET-NEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVY
        M LNDL F+FF+T+V G+R+ VYDNF  +ADHLL++VHSCFLSTYPR+ET N KS + ELP ASKLK+AGIKFKNA +PKS+LDIKF NG LEIP LEV 
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMET-NEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVY

Query:  QQTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRL
        + TE IL+NL AYEI Q G+  QVKSY++FMSHLLQS+ED+K+LC RKIL +LE DE QII NLKW+R+QK +LSGTYFAG+VQKLNE PDR +  WRRL
Subjt:  QQTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRL

Query:  RRNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        RRNPVAIGV AVW +VVIFVA FFSA  +LQRR++
Subjt:  RRNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

A0A6J1GVU1 UPF0481 protein At3g47200-like (e value: 5.7e-183; % identity: 79.03)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSSR LSHSI++PA SQGSS EESLLSSIEGKLEAF SSITIFRAPN+ISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKW YLS FLK+  NPS  L
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        + L+ELV                         +MLLDCCFILELLLR+ KKR RRRND VFTTPGLLFDLRCDLMLLENQIPYFLL DVYENVQDP EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        MSLNDLTF+FFKT+V GDR+ VYDNFMVEADHLLEMVHSCFLSTYPR+ETN+KS S ELP+ASKLKTAGIK KNARS KSLLDIKF NGVLEIPPL+VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR
        +TE ILRNL AYEI Q G+D QVKSY+NFMSHLLQS++DVKIL RRKIL+D EDDEEQII+NLKW+ E +ESLSGTYFAGIVQKLNEKPDR VARWR+LR
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLR

Query:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
        R PVAIG+ AV VVVVIFVA FFSAF +LQRR+K
Subjt:  RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK

A0A6J1IWZ4 UPF0481 protein At3g47200-like (e value: 9.5e-162; % identity: 78.37)
Query:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL
        MDSSR LSHSI++PA SQGSS EESLLSSIE KLEAF SSITIFRA N+ISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKW YLS FLK+  NPS  L
Subjt:  MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTL

Query:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN
        + LIELV                         +MLLDCCFILELLLRY KKR RRRND VFTTPGLLFDLRCDLMLLENQIPYFLL DVY NVQDP EEN
Subjt:  KDLIELV-------------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEEN

Query:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ
        MSLNDLTF+FFKT+V GDR+FVYDNFMVEADHLLEM+HSCFLSTYPRMETN+ S S ELPSASKLKTAGIK KN +S KSLLDIKF NGVLEIPPL+VYQ
Subjt:  MSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQ

Query:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSV
        +TE ILRNL AYEI Q G+D QVKSY+NFMSHLLQS++DVKIL RRKIL+D E+DEEQII+NLKW+RE KESLSGTYFAGIVQKLN+K DR V
Subjt:  QTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSV

SwissProt top hits (e value, % identity, alignment)
P0C897 Putative UPF0481 protein At3g02645 (e value: 9.6e-10; % identity: 22.06)
Query:  SITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIE------------------------LVMMLLDCCF
        +++IF  P  +     + + P +VSIGP+H   P L  ME  +++ +      N   SF   DL+E                        L +M +D  F
Subjt:  SITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIE------------------------LVMMLLDCCF

Query:  ILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFF--------KTVVVGDRKFVYDNFMVEADH
        ++E    +LK    R+ + +    G    LR D+M++ENQIP F+L    + ++  LE   S +DL               V+  D   +      E +H
Subjt:  ILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFF--------KTVVVGDRKFVYDNFMVEADH

Query:  LLEMVHSCFLSTYPRME------------------------------------------------------------------------------TNEKS
        +L+ ++   +   PR+E                                                                              T ++S
Subjt:  LLEMVHSCFLSTYPRME------------------------------------------------------------------------------TNEKS

Query:  MSI------------ELPSASKLKTAGIKFK-NARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVK
        +SI             +PS S L  AG++FK  A    S +    N+G   +P + +   TE +LRNL AYE       L    Y   ++ ++ SEEDV+
Subjt:  MSI------------ELPSASKLKTAGIKFK-NARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVK

Query:  ILCRRKIL-SDLEDDEE
        +L  + +L S L+ D+E
Subjt:  ILCRRKIL-SDLEDDEE

Q9SD53 UPF0481 protein At3g47200 (e value: 7.8e-28; % identity: 29.86)
Query:  ISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFL----KHNPNPSFTLKDLIEL-----
        +S GS     LL S      A   S  IFR P      +   + P  VSIGP+H+G  HL+ ++  K R L  FL    K +   +  +K +++L     
Subjt:  ISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFL----KHNPNPSFTLKDLIEL-----

Query:  --------------VMMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVG
                       MM+LD CFIL + L  +        DP+F+ P LL  ++ DL+LLENQ+P+F+L  +Y  V   +  +  LN + F FFK  +  
Subjt:  --------------VMMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVG

Query:  DRKFVYDNFMVEADHLLEMVHSCFL----------STYPRMETNE-KSMSIE---------LPSASKLKTAGIKFKNARSPK-SLLDIKFNNGVLEIPPL
        +  +   +   +A HLL+++   FL          S + +++ +E KS ++          + SA +L+  GIKF+  RS + S+L+++     L+IP L
Subjt:  DRKFVYDNFMVEADHLLEMVHSCFL----------STYPRMETNE-KSMSIE---------LPSASKLKTAGIKFKNARSPK-SLLDIKFNNGVLEIPPL

Query:  EVYQQTEAILRNLAAYEIRQFGTDL--QVKSYLNFMSHLLQSEEDVKILCRRKIL
               +   N  A+E  QF TD   ++ +Y+ FM  LL +EEDV  L   K++
Subjt:  EVYQQTEAILRNLAAYEIRQFGTDL--QVKSYLNFMSHLLQSEEDVKILCRRKIL

Arabidopsis top hits (e value, % identity, alignment)
AT2G36430.1 Plant protein of unknown function (DUF247) (e value: 2.9e-30; % identity: 27.79)
Query:  INIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELV---
        I+I  + +       LLSS  GK      + +IFR P  +   +   + P  VSIGP+H G   L+ +E  KWRYL+  L    N   TL+D ++ V   
Subjt:  INIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELV---

Query:  ---------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQ--DPLEENMSLNDLTFQ
                             MM+LD CF+LEL  +         NDP+     +L     D + LENQIP+F+L  ++   +  +  E N SL  L F 
Subjt:  ---------------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQ--DPLEENMSLNDLTFQ

Query:  FFKTVVVGDRKFVYDNFMVEADHLLEMVHSCF-----LSTYPRMET-NEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTE
        FF  ++    + +     + A HLL+++ S F     L T P      EK  S  + S SKL+ AGIK +  +  +S L ++F +G +E+P + V     
Subjt:  FFKTVVVGDRKFVYDNFMVEADHLLEMVHSCF-----LSTYPRMET-NEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTE

Query:  AILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWL-REQKESLSGTYFAGIVQKLNE
        + L N  AYE       +   +Y   +  L  + +DV+ LC + I+ +    + ++ + +  L R+    ++  Y   + +++NE
Subjt:  AILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWL-REQKESLSGTYFAGIVQKLNE

AT3G47210.1 Plant protein of unknown function (DUF247) (e value: 9.1e-32; % identity: 29.18)
Query:  LEAFG-SSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNP-------NPSFTLKDLI-------------ELV-MMLL
        LE+ G  S  IFR P   +  +   + P  VSIGP+HHG  HLE ++  K R+L  FL+          N     +D I             ELV MM+L
Subjt:  LEAFG-SSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNP-------NPSFTLKDLI-------------ELV-MMLL

Query:  DCCFILELLLRYLKK-RFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLE
        D CFIL LLL   +K       DP+ T P +L  ++ DL+LLENQ+P+F+L  +++  +  +  +  LN + F FF   +    ++   +    A HLL+
Subjt:  DCCFILELLLRYLKK-RFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLE

Query:  MVHSCFLS---------TYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSY
        ++   FL          T  +      S    L SA++L   GI F       S+LDI+     L+IP L +     +IL N  A+E     +   + SY
Subjt:  MVHSCFLS---------TYPRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSY

Query:  LNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWL-REQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRRN
        + FM  LL  +ED   L RR+I+ +    E+++ +  K + ++    +  +Y   +  ++NE    + ++W  + R+
Subjt:  LNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWL-REQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRRN

AT3G50180.1 Plant protein of unknown function (DUF247) (e value: 1.5e-29; % identity: 28.43)
Query:  ITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKH-NPNPSFTLKDLIEL--------------------VMMLLDCCFILEL
        + I++ P+ +   D+  + P  VS+GP+HHG    +SME  KWR ++  LK  N      L  +IEL                     M+LLD CFILEL
Subjt:  ITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKH-NPNPSFTLKDLIEL--------------------VMMLLDCCFILEL

Query:  LL----RYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFF-------KTVVVGDRKFVYDNFMVEADHL
        L      +LK  +   NDPVF   G +  ++ D+++LENQ+P F+LN + E +Q   +    L +L  +FF       +T+          N  +   H 
Subjt:  LL----RYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFF-------KTVVVGDRKFVYDNFMVEADHL

Query:  LEMVHSCFLSTYPR-------METNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSY
        L++ H   L  +PR           +K +   +P+ ++L+ AG KFK  ++ +   DIKF+NG LEIP L ++  T+++  NL A+E     +   + SY
Subjt:  LEMVHSCFLSTYPR-------METNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSY

Query:  LNFMSHLLQSEEDVKILCRRKIL-------SDLEDDEEQIIQNLK------WLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRRNPVAI--GVAAV
        + FM +L+ S ED+  L    I+       S++ D   Q+ Q +       +L +    +   Y     +KLN      + ++     NP A     AAV
Subjt:  LNFMSHLLQSEEDVKILCRRKIL-------SDLEDDEEQIIQNLK------WLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRRNPVAI--GVAAV

Query:  WVVVVIFVAGFFSAF
         ++++ F   +F+A+
Subjt:  WVVVVIFVAGFFSAF

AT4G31980.1 unknown protein (e value: 1.8e-32; % identity: 26.27)
Query:  ESLLSSIEGKLEAFGSSIT----IFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELV------------
        ++L+ SI+ KL AF SS++    I++ PN +   + + + P  VS GP H G   L++ME+ K+RYL +F+   P  + +L+DL+ L             
Subjt:  ESLLSSIEGKLEAFGSSIT----IFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELV------------

Query:  ------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVY----ENVQDPLEENMSLNDLTFQFFKTVVV
                    M+++D  F++ELLLR    R R  ND +F    ++ D+  D++L+ENQ+P+F++ +++       Q      + L    F +F +  +
Subjt:  ------------MMLLDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVY----ENVQDPLEENMSLNDLTFQFFKTVVV

Query:  GDRKFVYDNFMVEADHLLEMVHSCFLSTYP-RMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIR
         D KF+      E +H ++++ SC+L  +P ++E     +    P A++L TAG++FK A +   LLDI F +GVL+IP + V   TE++ +N+  +E  
Subjt:  GDRKFVYDNFMVEADHLLEMVHSCFLSTYP-RMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIR

Query:  QFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRR-----NPVAIGVAA
        +  ++     Y+  +   ++S  D  +L    I+ +   +   +      + ++       YF+ + + L    +    RW+ + R     NP A  VA+
Subjt:  QFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRR-----NPVAIGVAA

Query:  VWVVVVIFVAGFFSA
        V+  +++ +  F  +
Subjt:  VWVVVVIFVAGFFSA

AT5G22540.1 Plant protein of unknown function (DUF247) (e value: 3.8e-38; % identity: 30.69)
Query:  ESLLSSIEG-KL--EAFGSSI-TIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELV------------
        +SL+S   G KL  E+ GS +  I R P  ++  +   + P  VSIGP+HHG  HL+  +  K R+L  F+       F  ++L++ V            
Subjt:  ESLLSSIEG-KL--EAFGSSI-TIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELV------------

Query:  ------------MMLLDCCFILELLLRYL-KKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVGDR
                    MM+LD CFIL L      K  +   +DP+F  P +L  +R DL+LLENQ+PY LL  ++E     L     LN++ F+FF   +    
Subjt:  ------------MMLLDCCFILELLLRYL-KKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVGDR

Query:  KFVYDNFMVEADHLLEMVHSCF--LSTYPRMETNEKSMSIE-------LPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLA
         F   ++ +EA HLL+++   F  + +  R++ +    S         + SA KL   GIKFK  ++  S+LDI ++NGVL IPP+ +   T +I  N  
Subjt:  KFVYDNFMVEADHLLEMVHSCF--LSTYPRMETNEKSMSIE-------LPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLA

Query:  AYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWL-REQKESLSGTYFAGIVQKLNE
        A+E     +   + SY+ FM+ L+  E D   L  R+IL +    E+++ +  K + ++    L  +Y A + + +NE
Subjt:  AYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDDEEQIIQNLKWL-REQKESLSGTYFAGIVQKLNE


Sequences
CDS sequence
ATGGATTCATCAAGACCGCTTTCTCATTCAATTAATATTCCGGCAATCTCACAAGGAAGCTCTCATGAAGAATCTCTCCTTTCTTCCATTGAAGGAAAATTGGAAGCCTT
CGGTTCATCCATTACCATCTTCAGAGCTCCAAACGATATCAGTATCGAAGACAGAAACGTCTTCGTCCCTGCCAAAGTCTCAATCGGCCCTTTCCACCACGGCGCTCCAC
ATCTTGAATCCATGGAAAATCTCAAGTGGCGATACTTGTCCACTTTCTTGAAGCACAATCCCAATCCATCTTTCACTTTGAAGGATCTTATTGAACTCGTTATGATGTTG
CTTGATTGTTGCTTCATTCTCGAGTTGCTTTTGCGATACTTGAAAAAGAGGTTCAGGCGCCGGAATGATCCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGATG
CGACTTGATGTTACTTGAAAATCAGATTCCATACTTCCTTCTCAATGATGTTTATGAAAATGTGCAAGATCCACTAGAGGAAAATATGTCTCTCAATGACCTAACCTTCC
AATTTTTCAAAACTGTGGTTGTTGGAGATCGGAAATTTGTCTACGACAATTTCATGGTGGAAGCAGATCATTTACTCGAAATGGTGCACTCTTGTTTCCTCTCAACTTAT
CCTCGAATGGAGACGAACGAAAAATCGATGTCGATTGAATTACCTAGTGCGTCGAAGCTTAAAACTGCCGGAATCAAATTCAAGAACGCCAGATCTCCAAAGAGTCTATT
GGACATCAAATTTAATAACGGCGTCCTCGAAATTCCGCCTCTCGAAGTGTACCAGCAGACGGAGGCGATTCTGAGGAATCTCGCCGCGTACGAAATCCGTCAATTCGGAA
CCGATCTGCAGGTGAAATCGTATCTCAATTTCATGAGCCACCTTCTCCAGTCCGAAGAAGACGTGAAAATACTCTGTAGAAGGAAAATCCTGAGCGATCTGGAGGACGAC
GAGGAGCAAATTATTCAGAATCTGAAATGGCTACGCGAGCAGAAGGAGAGCTTATCCGGAACGTACTTTGCCGGCATTGTTCAGAAATTAAACGAGAAGCCGGACCGAAG
CGTTGCACGGTGGCGGAGGTTAAGGAGGAATCCAGTGGCCATCGGCGTCGCCGCCGTTTGGGTGGTGGTTGTGATCTTCGTCGCGGGCTTCTTCTCTGCATTTTTAATAC
TTCAGCGCCGTCACAAATGA
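The CDS above can be checked against the protein sequence listed on this page by translating it codon by codon. The sketch below does this in Python with the standard genetic code (NCBI translation table 1); that this table applies to the gene is an assumption, and `translate` is a hypothetical helper. Only the first 30 and last 21 nucleotides of the CDS are used, to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and normalise case
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


cds_start = "ATGGATTCATCAAGACCGCTTTCTCATTCA"  # first 30 nt of the CDS
cds_end = "CTTCAGCGCCGTCACAAATGA"             # last 21 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to `MDSSRPLSHS` and `LQRRHK*`, which agree with the first and last residues of the listed protein; the trailing `*` is the TGA stop codon.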
mRNA sequence
ATGGATTCATCAAGACCGCTTTCTCATTCAATTAATATTCCGGCAATCTCACAAGGAAGCTCTCATGAAGAATCTCTCCTTTCTTCCATTGAAGGAAAATTGGAAGCCTT
CGGTTCATCCATTACCATCTTCAGAGCTCCAAACGATATCAGTATCGAAGACAGAAACGTCTTCGTCCCTGCCAAAGTCTCAATCGGCCCTTTCCACCACGGCGCTCCAC
ATCTTGAATCCATGGAAAATCTCAAGTGGCGATACTTGTCCACTTTCTTGAAGCACAATCCCAATCCATCTTTCACTTTGAAGGATCTTATTGAACTCGTTATGATGTTG
CTTGATTGTTGCTTCATTCTCGAGTTGCTTTTGCGATACTTGAAAAAGAGGTTCAGGCGCCGGAATGATCCTGTTTTCACTACTCCTGGTTTGCTCTTCGATTTGAGATG
CGACTTGATGTTACTTGAAAATCAGATTCCATACTTCCTTCTCAATGATGTTTATGAAAATGTGCAAGATCCACTAGAGGAAAATATGTCTCTCAATGACCTAACCTTCC
AATTTTTCAAAACTGTGGTTGTTGGAGATCGGAAATTTGTCTACGACAATTTCATGGTGGAAGCAGATCATTTACTCGAAATGGTGCACTCTTGTTTCCTCTCAACTTAT
CCTCGAATGGAGACGAACGAAAAATCGATGTCGATTGAATTACCTAGTGCGTCGAAGCTTAAAACTGCCGGAATCAAATTCAAGAACGCCAGATCTCCAAAGAGTCTATT
GGACATCAAATTTAATAACGGCGTCCTCGAAATTCCGCCTCTCGAAGTGTACCAGCAGACGGAGGCGATTCTGAGGAATCTCGCCGCGTACGAAATCCGTCAATTCGGAA
CCGATCTGCAGGTGAAATCGTATCTCAATTTCATGAGCCACCTTCTCCAGTCCGAAGAAGACGTGAAAATACTCTGTAGAAGGAAAATCCTGAGCGATCTGGAGGACGAC
GAGGAGCAAATTATTCAGAATCTGAAATGGCTACGCGAGCAGAAGGAGAGCTTATCCGGAACGTACTTTGCCGGCATTGTTCAGAAATTAAACGAGAAGCCGGACCGAAG
CGTTGCACGGTGGCGGAGGTTAAGGAGGAATCCAGTGGCCATCGGCGTCGCCGCCGTTTGGGTGGTGGTTGTGATCTTCGTCGCGGGCTTCTTCTCTGCATTTTTAATAC
TTCAGCGCCGTCACAAATGA
Protein sequence
MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKLEAFGSSITIFRAPNDISIEDRNVFVPAKVSIGPFHHGAPHLESMENLKWRYLSTFLKHNPNPSFTLKDLIELVMML
LDCCFILELLLRYLKKRFRRRNDPVFTTPGLLFDLRCDLMLLENQIPYFLLNDVYENVQDPLEENMSLNDLTFQFFKTVVVGDRKFVYDNFMVEADHLLEMVHSCFLSTY
PRMETNEKSMSIELPSASKLKTAGIKFKNARSPKSLLDIKFNNGVLEIPPLEVYQQTEAILRNLAAYEIRQFGTDLQVKSYLNFMSHLLQSEEDVKILCRRKILSDLEDD
EEQIIQNLKWLREQKESLSGTYFAGIVQKLNEKPDRSVARWRRLRRNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK
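The record annotates this protein as an integral component of membrane (GO:0016021). One rough way to see which part of the sequence is consistent with that annotation is a sliding-window hydropathy scan. The Python sketch below uses the Kyte-Doolittle scale with a 19-residue window and a mean of 1.6 as the cut-off. These are common heuristic choices made here as assumptions; nothing on this page says the annotation was derived this way, and `hydrophobic_windows` is a hypothetical helper.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_hydropathy(segment: str) -> float:
    """Average Kyte-Doolittle value over a peptide segment."""
    return sum(KYTE_DOOLITTLE[aa] for aa in segment) / len(segment)


def hydrophobic_windows(protein: str, window: int = 19,
                        threshold: float = 1.6) -> list[int]:
    """Return 0-based start positions of windows at or above the threshold."""
    return [
        i for i in range(len(protein) - window + 1)
        if mean_hydropathy(protein[i:i + window]) >= threshold
    ]


# First and last 34 residues of the protein sequence listed above.
n_terminus = "MDSSRPLSHSINIPAISQGSSHEESLLSSIEGKL"
c_terminus = "RNPVAIGVAAVWVVVVIFVAGFFSAFLILQRRHK"

print("N-terminal windows:", hydrophobic_windows(n_terminus))
print("C-terminal windows:", hydrophobic_windows(c_terminus))
```

With these settings the N-terminal stretch yields no window above the cut-off, while the C-terminal stretch does. This is a heuristic illustration rather than a transmembrane prediction.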