; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g33220 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g33220
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionHemerythrin domain-containing protein
Genome locationchr4:24991219..24994541
RNA-Seq ExpressionMoc04g33220
SyntenyMoc04g33220
Gene Ontology termsGO:0055072 - iron ion homeostasis (biological process)
InterPro domainsIPR012312 - Haemerythrin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456602.1 PREDICTED: uncharacterized protein LOC103496512 [Cucumis melo]1.4e-15282.39Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NPIVRL GPPN+ALTCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIGSE++SGSRER+LRFID+RFP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVA+ V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEAVEL KEQQ+ MLEQLLDVMKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

XP_022956168.1 uncharacterized protein LOC111457939 [Cucurbita moschata]1.3e-15582.93Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_022990403.1 uncharacterized protein LOC111487271 [Cucurbita maxima]3.3e-15482.34Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A N +VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        K SNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_023525990.1 uncharacterized protein LOC111789551 isoform X2 [Cucurbita pepo subsp. pepo]2.3e-15582.63Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVE+ KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_038884073.1 uncharacterized protein LOC120075009 [Benincasa hispida]1.4e-15784.18Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAP-NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHP
        NC+ SSKKS AEIVPQEFIRGCGD+A P NPIVRL GPPN+A TCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIG+E++SGSRER+LRFID+RFPHP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAP-NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHP

Query:  PLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGL
        PL L +RR D D+TT LVAV V ALQHKSVLWH+ER+LRWAKDLA+RGGRT VDPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILE ADRGL
Subjt:  PLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGL

Query:  CKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFF
        CKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALS+LSKRLKLLQEHCKHHF+DEEK +LP LEAV+L KEQQ+ MLEQLLDVMKQTHSHLLNFF
Subjt:  CKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFF

Query:  LEGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        LEGLLP EALQYLDL+TSS D+ R SFG ML M V
Subjt:  LEGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

TrEMBL top hitse value%identityAlignment
A0A0A0K8J3 Hemerythrin domain-containing protein1.4e-15080.9Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NP VRL GPPN+A TCYIRFALLYKSVKLSF+PS+ PHFGSD+P IRIGSE++SGSRER+LRFID++FP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVAV V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEAVEL+KEQQ+ MLEQLLD+MKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

A0A1S3C379 uncharacterized protein LOC1034965126.6e-15382.39Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NPIVRL GPPN+ALTCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIGSE++SGSRER+LRFID+RFP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVA+ V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGS VCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEAVEL KEQQ+ MLEQLLDVMKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

A0A2N9FWA8 Hemerythrin domain-containing protein1.7e-11969Show/hide
Query:  MGNCIA-SSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        MGNC   +SKKSTAEIVP + I+G   S +P+P VRL G P   +T YIRFALLYK+V L FVPSETP FGS+ PV++IGSE+VSGSRE LL +I++RFP
Subjt:  MGNCIA-SSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRD---DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEM
        HPPLV+    D   + D+TTPLV V V  LQHKS+ WHVERL+RW  DL  RGG+ +VDPAVG+PRME+RKF RSYS+LLEVMLEHAQMEEKV+FPIL+M
Subjt:  HPPLVLPSRRD---DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEM

Query:  ADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSH
        ADRGLCKA+N+EHARDLPIMNGIKEDIKS  VLD GS V QEAL +LS RLK L EH K HFM+E++ LLPL+EAVEL+KEQQ+  LEQ LDVM+ THSH
Subjt:  ADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSH

Query:  LLNFFLEGLLPQEALQYLDLVTSSCDKNR
        L NF LEGLLP EA+QYLDL TS  D+ R
Subjt:  LLNFFLEGLLPQEALQYLDLVTSSCDKNR

A0A6J1GW18 uncharacterized protein LOC1114579396.4e-15682.93Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

A0A6J1JT65 uncharacterized protein LOC1114872711.6e-15482.34Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A N +VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        K SNEEHARDLPIMNGIKEDIKSTVVLD+GS VCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G54290.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Haemerythrin/HHE cation-binding motif (InterPro:IPR012312); Has 59 Blast hits to 59 proteins in 14 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).5.6e-9653.3Show/hide
Query:  MGNCIASSKKSTAEIVPQEFI-----RGCGDSAAP----------------NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIG
        MG C +SS KSTAEI P + +          +A P                   VRL GPPNS +T Y+RFALL+K V L FVPSE        P I++G
Subjt:  MGNCIASSKKSTAEIVPQEFI-----RGCGDSAAP----------------NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIG

Query:  SESVSGSRERLLRFIDSRFPHPPLVLPSRRDDG-DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEV
        SE+VSGSRE LLR+I+ +FP P L++     +G D+ TPL+ V +  LQH+S+LWH+ER+LRW++DLAARGG+ AVDP+VGTP+ME+RKF +SY+ L E+
Subjt:  SESVSGSRERLLRFIDSRFPHPPLVLPSRRDDG-DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEV

Query:  MLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQ
        MLEHAQMEE++LFP+LE  DRG+CK++NEEH R+LP+MNGIKEDIKS  VLD G  +C EAL +L+ R K LQ  CK HF +EEK LLP++EA E+ KE+
Subjt:  MLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQ

Query:  QENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRAS
        Q+ ++ Q L+VM  THS+  +F LEGL PQEA+QY+DL+ +  D N  S
Subjt:  QENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGCATTGCGAGTTCGAAGAAATCGACGGCGGAGATTGTGCCTCAGGAGTTCATCAGAGGCTGCGGCGATTCTGCGGCTCCGAATCCGATCGTACGGCTTTC
CGGCCCTCCGAATAGTGCCCTAACCTGCTATATCCGATTCGCGCTGCTCTACAAGTCCGTGAAACTCAGCTTCGTCCCTTCCGAGACTCCGCATTTCGGCTCCGACGCGC
CGGTCATCCGAATCGGGTCCGAGTCCGTTTCCGGTTCACGCGAAAGGTTGCTCCGGTTCATCGACAGTAGATTTCCTCATCCGCCGCTGGTACTGCCGAGCCGCCGTGAC
GATGGCGATCAAACGACGCCGTTGGTTGCGGTGAGTGTGGCGGCGCTGCAGCACAAGAGCGTATTGTGGCATGTGGAGAGGCTACTGAGATGGGCGAAGGATCTGGCTGC
TCGTGGAGGGAGAACGGCCGTTGATCCGGCGGTGGGGACGCCGAGGATGGAGCTGAGGAAGTTCGGGAGGAGTTACTCTCAGCTGCTGGAAGTGATGCTGGAGCACGCTC
AGATGGAGGAGAAAGTCCTCTTCCCGATCTTGGAGATGGCTGATCGAGGATTATGTAAAGCTTCAAATGAGGAGCATGCAAGAGATCTACCCATCATGAATGGCATCAAA
GAAGACATTAAATCCACTGTCGTTTTAGACTTGGGAAGTTTCGTTTGCCAAGAAGCACTCTCCAACCTTTCCAAACGGCTCAAGTTGTTGCAGGAACACTGTAAGCACCA
CTTCATGGATGAAGAGAAAAAACTATTACCTTTGCTTGAGGCTGTAGAACTGACCAAAGAGCAGCAGGAGAACATGTTAGAGCAGCTCCTGGATGTGATGAAACAAACAC
ATTCACATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCAGGAAGCTCTGCAGTATTTGGATCTGGTTACGAGCAGCTGCGATAAAAACCGCGCTAGCTTCGGCTTA
ATGCTCCAGATGACTGTTGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAATTGCATTGCGAGTTCGAAGAAATCGACGGCGGAGATTGTGCCTCAGGAGTTCATCAGAGGCTGCGGCGATTCTGCGGCTCCGAATCCGATCGTACGGCTTTC
CGGCCCTCCGAATAGTGCCCTAACCTGCTATATCCGATTCGCGCTGCTCTACAAGTCCGTGAAACTCAGCTTCGTCCCTTCCGAGACTCCGCATTTCGGCTCCGACGCGC
CGGTCATCCGAATCGGGTCCGAGTCCGTTTCCGGTTCACGCGAAAGGTTGCTCCGGTTCATCGACAGTAGATTTCCTCATCCGCCGCTGGTACTGCCGAGCCGCCGTGAC
GATGGCGATCAAACGACGCCGTTGGTTGCGGTGAGTGTGGCGGCGCTGCAGCACAAGAGCGTATTGTGGCATGTGGAGAGGCTACTGAGATGGGCGAAGGATCTGGCTGC
TCGTGGAGGGAGAACGGCCGTTGATCCGGCGGTGGGGACGCCGAGGATGGAGCTGAGGAAGTTCGGGAGGAGTTACTCTCAGCTGCTGGAAGTGATGCTGGAGCACGCTC
AGATGGAGGAGAAAGTCCTCTTCCCGATCTTGGAGATGGCTGATCGAGGATTATGTAAAGCTTCAAATGAGGAGCATGCAAGAGATCTACCCATCATGAATGGCATCAAA
GAAGACATTAAATCCACTGTCGTTTTAGACTTGGGAAGTTTCGTTTGCCAAGAAGCACTCTCCAACCTTTCCAAACGGCTCAAGTTGTTGCAGGAACACTGTAAGCACCA
CTTCATGGATGAAGAGAAAAAACTATTACCTTTGCTTGAGGCTGTAGAACTGACCAAAGAGCAGCAGGAGAACATGTTAGAGCAGCTCCTGGATGTGATGAAACAAACAC
ATTCACATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCAGGAAGCTCTGCAGTATTTGGATCTGGTTACGAGCAGCTGCGATAAAAACCGCGCTAGCTTCGGCTTA
ATGCTCCAGATGACTGTTGCCTAA
Protein sequenceShow/hide protein sequence
MGNCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPPLVLPSRRD
DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIK
EDIKSTVVLDLGSFVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRASFGL
MLQMTVA