; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004151 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004151
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionProtein of unknown function DUF455
Genome locationChr08:14223899..14226648
RNA-Seq ExpressionHG10004151
SyntenyHG10004151
Gene Ontology termsGO:0030247 - polysaccharide binding (molecular function)
InterPro domainsIPR007402 - Protein of unknown function DUF455
IPR009078 - Ferritin-like superfamily
IPR011197 - Uncharacterised conserved protein UCP012318


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK11737.1 uncharacterized protein E5676_scaffold304G00540 [Cucumis melo var. makuwa]7.7e-15583.48Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSD---------------------------------------VK
        MA+ETLVEAALRVLNTSDPFEKA LGD VASRWLNG ISS YDPSADL VPDRPARLS+                                       VK
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSD---------------------------------------VK

Query:  LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS
        LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS
Subjt:  LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS

Query:  KDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVR
        KDLLARLAIEHCVHEARGLDVLPTTI RFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRS +RKLDEDDD AE NAMEMEKEETI+KFHE+VR
Subjt:  KDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVR

Query:  KYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        KYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTD T HP
Subjt:  KYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

XP_004138680.1 uncharacterized protein LOC101205330 [Cucumis sativus]4.5e-16395Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKAELGD VASRWLNG IS+PYDPSADL VPDRPARLS+VKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTI RFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        LLEKVVYPEEVTHCAAGVKWFKYLCQRS +RKLDEDDD AE NAMEMEKEETI KFHE+VRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPT HP
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

XP_008456568.1 PREDICTED: uncharacterized protein HI_0077 [Cucumis melo]1.2e-16094.33Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKA LGD VASRWLNG ISS YDPSADL VPDRPARLS+VKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTI RFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        LLEKVVYPEEVTHCAAGVKWFKYLCQRS +RKLDEDDD AE NAMEMEKEETI+KFHE+VRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTD T HP
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

XP_022954428.1 uncharacterized protein LOC111456685 [Cucurbita moschata]5.5e-15389.58Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKAELGDKVASRWLNG IS PYDPS+DLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQE MP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDED------DDEAEKNA-MEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK
        LLEKVVYPEEVTHCAAGVKWF+YLCQRSG + LD D       D AE NA +EME EE I KFH IVRK+FRGPLKPPFNEVARKAAGFGP+WYEPLAFK
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDED------DDEAEKNA-MEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK

Query:  TDPTLHP
         + TL+P
Subjt:  TDPTLHP

XP_038884770.1 uncharacterized protein HI_0077 [Benincasa hispida]1.4e-16496.33Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA ETLVEAALRVLNTSDPFEKAELGD VASRWLNG ISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        LLEKVVYPEEVTHCAAGVKWFKY+CQRSGNRKLDEDD  A+ NAMEMEKEETI KFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDP LHP
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

TrEMBL top hitse value%identityAlignment
A0A0A0LMC3 Uncharacterized protein2.2e-16395Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKAELGD VASRWLNG IS+PYDPSADL VPDRPARLS+VKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTI RFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        LLEKVVYPEEVTHCAAGVKWFKYLCQRS +RKLDEDDD AE NAMEMEKEETI KFHE+VRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPT HP
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

A0A1S3C3M9 uncharacterized protein HI_00775.9e-16194.33Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKA LGD VASRWLNG ISS YDPSADL VPDRPARLS+VKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTI RFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        LLEKVVYPEEVTHCAAGVKWFKYLCQRS +RKLDEDDD AE NAMEMEKEETI+KFHE+VRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTD T HP
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

A0A5D3CIH9 Uncharacterized protein3.7e-15583.48Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSD---------------------------------------VK
        MA+ETLVEAALRVLNTSDPFEKA LGD VASRWLNG ISS YDPSADL VPDRPARLS+                                       VK
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSD---------------------------------------VK

Query:  LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS
        LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS
Subjt:  LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS

Query:  KDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVR
        KDLLARLAIEHCVHEARGLDVLPTTI RFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRS +RKLDEDDD AE NAMEMEKEETI+KFHE+VR
Subjt:  KDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVR

Query:  KYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP
        KYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTD T HP
Subjt:  KYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP

A0A6J1GR27 uncharacterized protein LOC1114566852.7e-15389.58Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKAELGDKVASRWLNG IS PYDPS+DLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQE MP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDED------DDEAEKNA-MEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK
        LLEKVVYPEEVTHCAAGVKWF+YLCQRSG + LD D       D AE NA +EME EE I KFH IVRK+FRGPLKPPFNEVARKAAGFGP+WYEPLAFK
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDED------DDEAEKNA-MEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK

Query:  TDPTLHP
         + TL+P
Subjt:  TDPTLHP

A0A6J1JQ17 uncharacterized protein LOC1114872814.2e-15188.6Show/hide
Query:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        MA+ETLVEAALRVLNTSDPFEKAELGDKVASRWLNG IS PYDPS+DL+VPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
Subjt:  MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
        IIARFGKQE MP EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATS+DLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD
Subjt:  IIARFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETAD

Query:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDED------DDEAEKNA-MEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK
        LLEKVVYPEEVTHCAAG+KWFKYL QRSG + LD D       D AE NA +EME EE I KFH IVR YFRGPLKPPFNEVARKAAGFGP+WYEPLAFK
Subjt:  LLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDED------DDEAEKNA-MEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK

Query:  TDPTLHP
         + TL+P
Subjt:  TDPTLHP

SwissProt top hitse value%identityAlignment
P43935 Uncharacterized protein HI_00773.0e-2129.68Show/hide
Query:  VEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVK-LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARF
        VE AL+   T++P EK  L + +    L        +   ++   D  A   +   LV+P  +PK   A + +   A +H++ H E  AI+L  D   RF
Subjt:  VEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVK-LVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARF

Query:  GKQ------EGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETA
        G+       EG+   F  D++RVA++E  HF+L+   LK LG  YG   AH GLW+ A AT+ D+  R+A+   V EARGLD  P    +     D    
Subjt:  GKQ------EGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETA

Query:  DLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGF
        ++L+ ++  +E+ H   G  W+  L ++ G                     + ++ F E++ KY     K   N  AR  AGF
Subjt:  DLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGF

Arabidopsis top hitse value%identityAlignment
AT1G06240.1 Protein of unknown function DUF4552.8e-3835.52Show/hide
Query:  AEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLA-VPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD
        A  +L +    VL+TSDP  K+ +     SRW    +     P   ++ +P  PAR     LV+ + +P   K  +L     ++H+L H E  AIDL+WD
Subjt:  AEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLA-VPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWD

Query:  IIARFGKQEGMPG-EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETA
         +ARF     + G  FF DF  VA DE RHF   + RL ELG  YG +PA++ L      TS ++ ARLA    V EARGLD  P  + R    GDN T+
Subjt:  IIARFGKQEGMPG-EFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETA

Query:  DLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEP
         ++ K+   EEV H A GV WF  +CQ+                 M      T   F +++++Y    L+ PFN  AR+ AG    WY+P
Subjt:  DLLEKVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEP

AT5G04520.1 Protein of unknown function DUF4555.1e-13378.97Show/hide
Query:  ETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIA
        ETL+E+A+R+LNTSDP EKA LGD +A +WL G I+ PYDP+ D  VPDRPARL  VKLVSPSLMPKLG+AGSLQSRQAIVHSL HTESWAIDLSWDIIA
Subjt:  ETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIA

Query:  RFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLE
        RFGKQE MP +FFTDFVRVAQDEGRHFTLLAARL+E+GS YGALPAHDGLWDSA ATS DLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLE
Subjt:  RFGKQEGMPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLE

Query:  KVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK
        KVVYPEE+THCAAGVKWFKYLC+RS + +      E++ +      EE I KFH +VR++FRGPLKPPFN  ARKAAGFGPQWYEPLA K
Subjt:  KVVYPEEVTHCAAGVKWFKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGGAGACCTTGGTTGAAGCGGCGCTTCGAGTCCTCAACACTTCCGATCCCTTCGAGAAGGCTGAACTTGGCGATAAAGTAGCTTCTCGATGGCTTAACGGCAC
CATTTCAAGCCCTTACGATCCCTCCGCCGATCTTGCCGTCCCCGACCGCCCCGCCAGGCTTTCCGATGTCAAGTTGGTATCGCCGAGTCTCATGCCGAAGCTGGGGAAGG
CGGGAAGCTTACAGAGCAGGCAGGCTATTGTGCATAGTCTCGTCCACACTGAAAGTTGGGCGATCGATTTGTCGTGGGACATAATAGCTCGGTTTGGAAAGCAAGAAGGA
ATGCCAGGAGAATTCTTCACTGATTTTGTTAGGGTAGCTCAAGATGAAGGTAGGCATTTTACTCTTCTAGCTGCAAGGCTTAAGGAACTGGGCTCTTTCTATGGAGCACT
ACCCGCGCATGATGGCCTATGGGATTCTGCTATTGCTACTTCCAAGGATTTATTAGCACGCTTGGCAATTGAGCATTGCGTCCATGAGGCTAGAGGGCTGGATGTGCTTC
CCACAACCATCTCCCGATTCCGAAATGGAGGTGACAATGAGACTGCAGATTTATTGGAGAAAGTAGTGTACCCAGAAGAAGTAACCCATTGTGCTGCTGGAGTAAAATGG
TTCAAATATCTTTGCCAGAGGTCTGGAAATAGGAAGTTGGATGAGGATGATGATGAGGCAGAGAAGAATGCAATGGAAATGGAGAAGGAAGAAACCATTCAAAAGTTTCA
TGAAATTGTGAGAAAGTACTTCAGGGGGCCATTGAAGCCACCTTTCAATGAAGTGGCAAGAAAAGCTGCTGGTTTTGGCCCTCAATGGTATGAACCACTTGCTTTTAAAA
CAGACCCTACCTTGCATCCATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGAGGAGACCTTGGTTGAAGCGGCGCTTCGAGTCCTCAACACTTCCGATCCCTTCGAGAAGGCTGAACTTGGCGATAAAGTAGCTTCTCGATGGCTTAACGGCAC
CATTTCAAGCCCTTACGATCCCTCCGCCGATCTTGCCGTCCCCGACCGCCCCGCCAGGCTTTCCGATGTCAAGTTGGTATCGCCGAGTCTCATGCCGAAGCTGGGGAAGG
CGGGAAGCTTACAGAGCAGGCAGGCTATTGTGCATAGTCTCGTCCACACTGAAAGTTGGGCGATCGATTTGTCGTGGGACATAATAGCTCGGTTTGGAAAGCAAGAAGGA
ATGCCAGGAGAATTCTTCACTGATTTTGTTAGGGTAGCTCAAGATGAAGGTAGGCATTTTACTCTTCTAGCTGCAAGGCTTAAGGAACTGGGCTCTTTCTATGGAGCACT
ACCCGCGCATGATGGCCTATGGGATTCTGCTATTGCTACTTCCAAGGATTTATTAGCACGCTTGGCAATTGAGCATTGCGTCCATGAGGCTAGAGGGCTGGATGTGCTTC
CCACAACCATCTCCCGATTCCGAAATGGAGGTGACAATGAGACTGCAGATTTATTGGAGAAAGTAGTGTACCCAGAAGAAGTAACCCATTGTGCTGCTGGAGTAAAATGG
TTCAAATATCTTTGCCAGAGGTCTGGAAATAGGAAGTTGGATGAGGATGATGATGAGGCAGAGAAGAATGCAATGGAAATGGAGAAGGAAGAAACCATTCAAAAGTTTCA
TGAAATTGTGAGAAAGTACTTCAGGGGGCCATTGAAGCCACCTTTCAATGAAGTGGCAAGAAAAGCTGCTGGTTTTGGCCCTCAATGGTATGAACCACTTGCTTTTAAAA
CAGACCCTACCTTGCATCCATAA
Protein sequenceShow/hide protein sequence
MAEETLVEAALRVLNTSDPFEKAELGDKVASRWLNGTISSPYDPSADLAVPDRPARLSDVKLVSPSLMPKLGKAGSLQSRQAIVHSLVHTESWAIDLSWDIIARFGKQEG
MPGEFFTDFVRVAQDEGRHFTLLAARLKELGSFYGALPAHDGLWDSAIATSKDLLARLAIEHCVHEARGLDVLPTTISRFRNGGDNETADLLEKVVYPEEVTHCAAGVKW
FKYLCQRSGNRKLDEDDDEAEKNAMEMEKEETIQKFHEIVRKYFRGPLKPPFNEVARKAAGFGPQWYEPLAFKTDPTLHP