; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1424 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1424
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionHemerythrin domain-containing protein
Genome locationMC04:22189376..22193591
RNA-Seq ExpressionMC04g1424
SyntenyMC04g1424
Gene Ontology termsGO:0055072 - iron ion homeostasis (biological process)
InterPro domainsIPR012312 - Haemerythrin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456602.1 PREDICTED: uncharacterized protein LOC103496512 [Cucumis melo]2.67e-19382.09Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NPIVRL GPPN+ALTCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIGSE++SGSRER+LRFID+RFP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVA+ V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEA+EL KEQQ+ MLEQLLDVMKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

XP_022956168.1 uncharacterized protein LOC111457939 [Cucurbita moschata]2.62e-19782.63Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EA+EL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_022990403.1 uncharacterized protein LOC111487271 [Cucurbita maxima]1.76e-19582.04Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A N +VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        K SNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EA+EL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_023525990.1 uncharacterized protein LOC111789551 isoform X2 [Cucurbita pepo subsp. pepo]5.29e-19782.34Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EA+E+ KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_038884073.1 uncharacterized protein LOC120075009 [Benincasa hispida]7.01e-20083.88Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAP-NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHP
        NC+ SSKKS AEIVPQEFIRGCGD+A P NPIVRL GPPN+A TCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIG+E++SGSRER+LRFID+RFPHP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAP-NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHP

Query:  PLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGL
        PL L +RR D D+TT LVAV V ALQHKSVLWH+ER+LRWAKDLA+RGGRT VDPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILE ADRGL
Subjt:  PLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGL

Query:  CKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFF
        CKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALS+LSKRLKLLQEHCKHHF+DEEK +LP LEA++L KEQQ+ MLEQLLDVMKQTHSHLLNFF
Subjt:  CKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFF

Query:  LEGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        LEGLLP EALQYLDL+TSS D+ R SFG ML M V
Subjt:  LEGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

TrEMBL top hitse value%identityAlignment
A0A0A0K8J3 Hemerythrin domain-containing protein1.43e-19080.6Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NP VRL GPPN+A TCYIRFALLYKSVKLSF+PS+ PHFGSD+P IRIGSE++SGSRER+LRFID++FP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVAV V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEA+EL+KEQQ+ MLEQLLD+MKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

A0A1S3C379 uncharacterized protein LOC1034965121.29e-19382.09Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NPIVRL GPPN+ALTCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIGSE++SGSRER+LRFID+RFP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVA+ V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEA+EL KEQQ+ MLEQLLDVMKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

A0A2N9FWA8 Hemerythrin domain-containing protein1.55e-14968.88Show/hide
Query:  MGNCIA-SSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        MGNC   +SKKSTAEIVP + I+G   S +P+P VRL G P   +T YIRFALLYK+V L FVPSETP FGS+ PV++IGSE+VSGSRE LL +I++RFP
Subjt:  MGNCIA-SSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDG-----DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPIL
        HPPLV+  RR D      D+TTPLV V V  LQHKS+ WHVERL+RW  DL  RGG+ +VDPAVG+PRME+RKF RSYS+LLEVMLEHAQMEEKV+FPIL
Subjt:  HPPLVLPSRRDDG-----DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPIL

Query:  EMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTH
        +MADRGLCKA+N+EHARDLPIMNGIKEDIKS  VLD GS V QEAL +LS RLK L EH K HFM+E++ LLPL+EA+EL+KEQQ+  LEQ LDVM+ TH
Subjt:  EMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTH

Query:  SHLLNFFLEGLLPQEALQYLDLVTSSCDKNR
        SHL NF LEGLLP EA+QYLDL TS  D+ R
Subjt:  SHLLNFFLEGLLPQEALQYLDLVTSSCDKNR

A0A6J1GW18 uncharacterized protein LOC1114579391.27e-19782.63Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EA+EL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

A0A6J1JT65 uncharacterized protein LOC1114872718.54e-19682.04Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A N +VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        K SNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EA+EL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G54290.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Haemerythrin/HHE cation-binding motif (InterPro:IPR012312); Has 59 Blast hits to 59 proteins in 14 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).5.6e-9653.3Show/hide
Query:  MGNCIASSKKSTAEIVPQEFI-----RGCGDSAAP----------------NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIG
        MG C +SS KSTAEI P + +          +A P                   VRL GPPNS +T Y+RFALL+K V L FVPSE        P I++G
Subjt:  MGNCIASSKKSTAEIVPQEFI-----RGCGDSAAP----------------NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIG

Query:  SESVSGSRERLLRFIDSRFPHPPLVLPSRRDDG-DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEV
        SE+VSGSRE LLR+I+ +FP P L++     +G D+ TPL+ V +  LQH+S+LWH+ER+LRW++DLAARGG+ AVDP+VGTP+ME+RKF +SY+ L E+
Subjt:  SESVSGSRERLLRFIDSRFPHPPLVLPSRRDDG-DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEV

Query:  MLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQ
        MLEHAQMEE++LFP+LE  DRG+CK++NEEH R+LP+MNGIKEDIKS  VLD G  +C EAL +L+ R K LQ  CK HF +EEK LLP++EA E+ KE+
Subjt:  MLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQ

Query:  QENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRAS
        Q+ ++ Q L+VM  THS+  +F LEGL PQEA+QY+DL+ +  D N  S
Subjt:  QENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGCATTGCGAGTTCGAAGAAATCGACGGCGGAGATTGTGCCTCAGGAGTTCATCAGAGGCTGCGGCGATTCTGCGGCTCCGAATCCGATCGTACGGCTTTC
CGGCCCTCCGAATAGTGCCCTAACCTGCTATATCCGATTCGCGCTGCTCTACAAGTCCGTGAAACTCAGCTTCGTCCCTTCCGAGACTCCGCATTTCGGCTCCGACGCGC
CGGTCATCCGAATCGGGTCCGAGTCCGTTTCCGGTTCACGCGAAAGGTTGCTCCGGTTCATCGACAGTAGATTTCCTCATCCGCCGCTGGTACTGCCGAGCCGCCGTGAC
GATGGCGATCAAACGACGCCGTTGGTTGCGGTGAGTGTGGCGGCGCTGCAGCACAAGAGCGTATTGTGGCATGTGGAGAGGCTACTGAGATGGGCGAAGGATCTGGCTGC
TCGTGGAGGGAGAACGGCCGTTGATCCGGCGGTGGGGACGCCGAGGATGGAGCTGAGGAAGTTCGGGAGGAGTTACTCTCAGCTGCTGGAAGTGATGCTGGAGCACGCTC
AGATGGAGGAGAAAGTCCTCTTCCCGATCTTGGAGATGGCTGATCGAGGATTATGTAAAGCTTCAAATGAGGAGCATGCAAGAGATCTACCCATCATGAATGGCATCAAA
GAAGACATTAAATCCACTGTCGTTTTAGACTTGGGAAGTTTCGTTTGCCAAGAAGCACTCTCCAACCTTTCCAAACGGCTCAAGTTGTTGCAGGAACACTGTAAGCACCA
CTTCATGGATGAAGAGAAAAAACTATTACCTTTGCTTGAGGCTATAGAACTGACCAAAGAGCAGCAGGAGAACATGTTAGAGCAGCTCCTGGATGTGATGAAACAAACAC
ATTCACATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCAGGAAGCTCTGCAGTATTTGGATCTGGTTACGAGCAGCTGCGATAAAAACCGCGCTAGCTTCGGCTTA
ATGCTCCAGATGACTGTTGCCTAA
mRNA sequenceShow/hide mRNA sequence
CAAATGCGGGTCCTACATATATTGAATAATAAATATATAAAGATGAGTGTGAAGTATTTGAAAGGGATATAGTGATAGGTGACTAGTTAGTACGTTATCCATTAATTTAG
GAGAAAAAGAAAAAAAAAAAAGGAAAGAATGGAATCAAAAGAACAAAGGAGCGTGGGCTGTCCATTTCACCAAGCAAACTTCCTTTAATATCAACGCCCTCATTATATCC
CAATCCCAATCCCCTCCCATTTTCCCCTCAACCGAATTCCCTTTCGTTACAAGCAACCACCGCCGCCGTCGCCGCCGCACCTCCACAACCGCCTGGTTTCTCCTCTTAGC
TTCATTGTTTATTATTCCATCTTGCTTTTTCCTTTCGCGCGTTGTTGTTGTTGTTGGATTTATTCTTCTTCCTGCTTTTTATACACTCATCCACATTCCTTATTCGCTTT
CTCTCTCTCTCTCACGCCGATTCTTCCCTAACTGAACCGACGCCTACGCCTTTTTCCATCCGCTCTGTATTTTGTTCCTTCTGATTCTGTTTTCCACTCTGTTTTTGTGC
CATTTTGGAATTTCCGACGGAGTTTTTCGCTATGGGAAATTGCATTGCGAGTTCGAAGAAATCGACGGCGGAGATTGTGCCTCAGGAGTTCATCAGAGGCTGCGGCGATT
CTGCGGCTCCGAATCCGATCGTACGGCTTTCCGGCCCTCCGAATAGTGCCCTAACCTGCTATATCCGATTCGCGCTGCTCTACAAGTCCGTGAAACTCAGCTTCGTCCCT
TCCGAGACTCCGCATTTCGGCTCCGACGCGCCGGTCATCCGAATCGGGTCCGAGTCCGTTTCCGGTTCACGCGAAAGGTTGCTCCGGTTCATCGACAGTAGATTTCCTCA
TCCGCCGCTGGTACTGCCGAGCCGCCGTGACGATGGCGATCAAACGACGCCGTTGGTTGCGGTGAGTGTGGCGGCGCTGCAGCACAAGAGCGTATTGTGGCATGTGGAGA
GGCTACTGAGATGGGCGAAGGATCTGGCTGCTCGTGGAGGGAGAACGGCCGTTGATCCGGCGGTGGGGACGCCGAGGATGGAGCTGAGGAAGTTCGGGAGGAGTTACTCT
CAGCTGCTGGAAGTGATGCTGGAGCACGCTCAGATGGAGGAGAAAGTCCTCTTCCCGATCTTGGAGATGGCTGATCGAGGATTATGTAAAGCTTCAAATGAGGAGCATGC
AAGAGATCTACCCATCATGAATGGCATCAAAGAAGACATTAAATCCACTGTCGTTTTAGACTTGGGAAGTTTCGTTTGCCAAGAAGCACTCTCCAACCTTTCCAAACGGC
TCAAGTTGTTGCAGGAACACTGTAAGCACCACTTCATGGATGAAGAGAAAAAACTATTACCTTTGCTTGAGGCTATAGAACTGACCAAAGAGCAGCAGGAGAACATGTTA
GAGCAGCTCCTGGATGTGATGAAACAAACACATTCACATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCAGGAAGCTCTGCAGTATTTGGATCTGGTTACGAGCAG
CTGCGATAAAAACCGCGCTAGCTTCGGCTTAATGCTCCAGATGACTGTTGCCTAAGATCAAAGATGTATATAGATTTCTATTTTCTTTTCTTTTTTCTCTTCTAGGATTG
GATTGGCAACATCCTTCATGTGTCAATAGGTCATAATGGGCTGTTCTTTCTTTGGCATTTTGTTTTGGTTTGGAGATGAAGTATAGTGTCATTTATTGCATATTTTTTTA
GCACAGTGATTAGTCCTGATTCAAGTGTATGTGCCACAGAAATGAAGAAAATTTAATGTTTGTGATTGTAGGCTTTGGAACTTATAATTGACCATTGTTGTTTTGTTGAC
ATATGTAATTAGCATCAAAATCATATTTTAAAACAAG
Protein sequenceShow/hide protein sequence
MGNCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPPLVLPSRRD
DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIK
EDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAIELTKEQQENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRASFGL
MLQMTVA