; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009142 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009142
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionHemerythrin domain-containing protein
Genome locationscaffold220:202547..205892
RNA-Seq ExpressionMS009142
SyntenyMS009142
Gene Ontology termsGO:0055072 - iron ion homeostasis (biological process)
InterPro domainsIPR012312 - Haemerythrin-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008456602.1 PREDICTED: uncharacterized protein LOC103496512 [Cucumis melo]3.6e-15382.69Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NPIVRL GPPN+ALTCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIGSE++SGSRER+LRFID+RFP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVA+ V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGSSVCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEAVEL KEQQ+ MLEQLLDVMKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

XP_022956168.1 uncharacterized protein LOC111457939 [Cucurbita moschata]4.6e-15683.23Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GSSVCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_022990403.1 uncharacterized protein LOC111487271 [Cucurbita maxima]1.1e-15482.63Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A N +VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        K SNEEHARDLPIMNGIKEDIKSTVVLD+GSSVCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_023525990.1 uncharacterized protein LOC111789551 isoform X2 [Cucurbita pepo subsp. pepo]7.8e-15682.93Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GSSVCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVE+ KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

XP_038884073.1 uncharacterized protein LOC120075009 [Benincasa hispida]4.9e-15884.48Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAP-NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHP
        NC+ SSKKS AEIVPQEFIRGCGD+A P NPIVRL GPPN+A TCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIG+E++SGSRER+LRFID+RFPHP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAP-NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHP

Query:  PLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGL
        PL L +RR D D+TT LVAV V ALQHKSVLWH+ER+LRWAKDLA+RGGRT VDPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILE ADRGL
Subjt:  PLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGL

Query:  CKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFF
        CKASNEEHARDLPIMNGIKEDIKS VVLDLGSSVCQEALS+LSKRLKLLQEHCKHHF+DEEK +LP LEAV+L KEQQ+ MLEQLLDVMKQTHSHLLNFF
Subjt:  CKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFF

Query:  LEGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        LEGLLP EALQYLDL+TSS D+ R SFG ML M V
Subjt:  LEGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

TrEMBL top hitse value%identityAlignment
A0A0A0K8J3 Hemerythrin domain-containing protein3.6e-15181.19Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NP VRL GPPN+A TCYIRFALLYKSVKLSF+PS+ PHFGSD+P IRIGSE++SGSRER+LRFID++FP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVAV V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGSSVCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEAVEL+KEQQ+ MLEQLLD+MKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

A0A1S3C379 uncharacterized protein LOC1034965121.7e-15382.69Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        NC  SS KS AEIVPQE  R C +   +AA NPIVRL GPPN+ALTCYIRFALLYKSVKLSF+PSETPHFGSD+P IRIGSE++SGSRER+LRFID+RFP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGD---SAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR
        HPPL L SRR D D+T+ LVA+ V ALQHKSVLWH+ER+LRW KDLA RGGRT  DPAVGTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPIL+ ADR
Subjt:  HPPLVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADR

Query:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN
        GLCKASNEEHARDLPIMNGIKEDIKS VVLDLGSSVCQEALSNLSKRLKLLQEHCKHHF+DEEK LLP LEAVEL KEQQ+ MLEQLLDVMKQTHSHLLN
Subjt:  GLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLN

Query:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM
        FFLEGLLP EALQYLDL+TSS D+ R SFG ML M
Subjt:  FFLEGLLPQEALQYLDLVTSSCDKNRASFGLMLQM

A0A2N9FWA8 Hemerythrin domain-containing protein1.3e-11969Show/hide
Query:  MGNCIA-SSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP
        MGNC   +SKKSTAEIVP + I+G   S +P+P VRL G P   +T YIRFALLYK+V L FVPSETP FGS+ PV++IGSE+VSGSRE LL +I++RFP
Subjt:  MGNCIA-SSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFP

Query:  HPPLVLPSRRD---DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEM
        HPPLV+    D   + D+TTPLV V V  LQHKS+ WHVERL+RW  DL  RGG+ +VDPAVG+PRME+RKF RSYS+LLEVMLEHAQMEEKV+FPIL+M
Subjt:  HPPLVLPSRRD---DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEM

Query:  ADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSH
        ADRGLCKA+N+EHARDLPIMNGIKEDIKS  VLD GS V QEAL +LS RLK L EH K HFM+E++ LLPL+EAVEL+KEQQ+  LEQ LDVM+ THSH
Subjt:  ADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSH

Query:  LLNFFLEGLLPQEALQYLDLVTSSCDKNR
        L NF LEGLLP EA+QYLDL TS  D+ R
Subjt:  LLNFFLEGLLPQEALQYLDLVTSSCDKNR

A0A6J1GW18 uncharacterized protein LOC1114579392.2e-15683.23Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A NP+VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        KASNEEHARDLPIMNGIKEDIKSTVVLD+GSSVCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

A0A6J1JT65 uncharacterized protein LOC1114872715.4e-15582.63Show/hide
Query:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP
        NC+ S  KSTAEIVPQEFIRGC DS A N +VRL GPPN+ALTCYIRFALLYKSVK SF+PSET HFGSD+P IRIG+E+VSGSR+RLLR+ID++FPHPP
Subjt:  NCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPP

Query:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC
        L + SRR D D+TT LVA++V +LQHKSVLWH+ER+LRWAKDLAARGGRTAVDP +GTPRMELRKFG+SYSQLLEVMLEHAQMEE+VLFPILEMADRGLC
Subjt:  LVLPSRRDDGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLC

Query:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL
        K SNEEHARDLPIMNGIKEDIKSTVVLD+GSSVCQEALSNLSKRLKLLQEHCKHHFM+EEK LLP  EAVEL KEQQ+  LEQLLDVMKQTHSHLLNFFL
Subjt:  KASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFL

Query:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV
        EGLLP EALQYLDL+TSS DK R S G ML M V
Subjt:  EGLLPQEALQYLDLVTSSCDKNRASFGLMLQMTV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G54290.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: N-terminal protein myristoylation; LOCATED IN: chloroplast; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Haemerythrin/HHE cation-binding motif (InterPro:IPR012312); Has 59 Blast hits to 59 proteins in 14 species: Archae - 0; Bacteria - 2; Metazoa - 0; Fungi - 0; Plants - 56; Viruses - 0; Other Eukaryotes - 1 (source: NCBI BLink).7.4e-9653.3Show/hide
Query:  MGNCIASSKKSTAEIVPQEFI-----RGCGDSAAP----------------NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIG
        MG C +SS KSTAEI P + +          +A P                   VRL GPPNS +T Y+RFALL+K V L FVPSE        P I++G
Subjt:  MGNCIASSKKSTAEIVPQEFI-----RGCGDSAAP----------------NPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIG

Query:  SESVSGSRERLLRFIDSRFPHPPLVLPSRRDDG-DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEV
        SE+VSGSRE LLR+I+ +FP P L++     +G D+ TPL+ V +  LQH+S+LWH+ER+LRW++DLAARGG+ AVDP+VGTP+ME+RKF +SY+ L E+
Subjt:  SESVSGSRERLLRFIDSRFPHPPLVLPSRRDDG-DQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEV

Query:  MLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQ
        MLEHAQMEE++LFP+LE  DRG+CK++NEEH R+LP+MNGIKEDIKS  VLD  S +C EAL +L+ R K LQ  CK HF +EEK LLP++EA E+ KE+
Subjt:  MLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIKEDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQ

Query:  QENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRAS
        Q+ ++ Q L+VM  THS+  +F LEGL PQEA+QY+DL+ +  D N  S
Subjt:  QENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGCATTGCGAGTTCGAAGAAATCGACGGCGGAGATTGTGCCTCAGGAGTTCATCAGAGGCTGCGGCGATTCCGCGGCTCCGAATCCGATCGTACGGCTTTC
CGGCCCTCCGAATAGTGCCCTAACCTGCTATATCCGATTCGCGCTGCTCTACAAGTCCGTGAAACTCAGCTTCGTCCCTTCCGAGACTCCGCATTTCGGCTCCGACGCGC
CGGTCATCCGAATCGGGTCCGAGTCCGTTTCCGGTTCACGCGAAAGGTTGCTCCGGTTCATCGACAGTAGATTTCCTCATCCGCCGCTGGTACTGCCGAGCCGCCGTGAC
GATGGCGATCAAACGACGCCGTTGGTTGCGGTGAGCGTGGCGGCGCTGCAGCACAAGAGCGTATTGTGGCATGTGGAGAGGCTATTGAGATGGGCGAAGGATCTGGCTGC
TCGTGGAGGGAGAACGGCCGTTGATCCGGCGGTGGGGACGCCGAGGATGGAGCTGAGGAAGTTCGGGAGGAGTTACTCTCAGCTGCTGGAAGTGATGCTGGAGCACGCTC
AGATGGAGGAGAAAGTCCTCTTCCCGATCTTGGAGATGGCTGATCGAGGATTATGTAAAGCTTCAAATGAGGAGCATGCAAGAGATCTACCCATCATGAATGGCATCAAA
GAAGACATTAAATCCACTGTCGTTTTAGACTTGGGAAGTTCCGTTTGCCAAGAAGCACTCTCCAACCTTTCCAAACGGCTCAAGTTGTTGCAGGAACACTGTAAGCACCA
CTTCATGGATGAAGAGAAAAAACTATTACCTTTGCTTGAGGCTGTAGAACTGACCAAAGAGCAGCAGGAGAACATGTTAGAGCAGCTCCTGGATGTGATGAAACAAACAC
ATTCACATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCAGGAAGCTCTGCAGTATTTGGATCTGGTTACGAGCAGCTGCGATAAAAACCGCGCTAGCTTCGGCTTA
ATGCTCCAGATGACTGTTGCC
mRNA sequenceShow/hide mRNA sequence
ATGGGAAATTGCATTGCGAGTTCGAAGAAATCGACGGCGGAGATTGTGCCTCAGGAGTTCATCAGAGGCTGCGGCGATTCCGCGGCTCCGAATCCGATCGTACGGCTTTC
CGGCCCTCCGAATAGTGCCCTAACCTGCTATATCCGATTCGCGCTGCTCTACAAGTCCGTGAAACTCAGCTTCGTCCCTTCCGAGACTCCGCATTTCGGCTCCGACGCGC
CGGTCATCCGAATCGGGTCCGAGTCCGTTTCCGGTTCACGCGAAAGGTTGCTCCGGTTCATCGACAGTAGATTTCCTCATCCGCCGCTGGTACTGCCGAGCCGCCGTGAC
GATGGCGATCAAACGACGCCGTTGGTTGCGGTGAGCGTGGCGGCGCTGCAGCACAAGAGCGTATTGTGGCATGTGGAGAGGCTATTGAGATGGGCGAAGGATCTGGCTGC
TCGTGGAGGGAGAACGGCCGTTGATCCGGCGGTGGGGACGCCGAGGATGGAGCTGAGGAAGTTCGGGAGGAGTTACTCTCAGCTGCTGGAAGTGATGCTGGAGCACGCTC
AGATGGAGGAGAAAGTCCTCTTCCCGATCTTGGAGATGGCTGATCGAGGATTATGTAAAGCTTCAAATGAGGAGCATGCAAGAGATCTACCCATCATGAATGGCATCAAA
GAAGACATTAAATCCACTGTCGTTTTAGACTTGGGAAGTTCCGTTTGCCAAGAAGCACTCTCCAACCTTTCCAAACGGCTCAAGTTGTTGCAGGAACACTGTAAGCACCA
CTTCATGGATGAAGAGAAAAAACTATTACCTTTGCTTGAGGCTGTAGAACTGACCAAAGAGCAGCAGGAGAACATGTTAGAGCAGCTCCTGGATGTGATGAAACAAACAC
ATTCACATTTACTAAATTTCTTTCTTGAAGGTCTTCTCCCTCAGGAAGCTCTGCAGTATTTGGATCTGGTTACGAGCAGCTGCGATAAAAACCGCGCTAGCTTCGGCTTA
ATGCTCCAGATGACTGTTGCC
Protein sequenceShow/hide protein sequence
MGNCIASSKKSTAEIVPQEFIRGCGDSAAPNPIVRLSGPPNSALTCYIRFALLYKSVKLSFVPSETPHFGSDAPVIRIGSESVSGSRERLLRFIDSRFPHPPLVLPSRRD
DGDQTTPLVAVSVAALQHKSVLWHVERLLRWAKDLAARGGRTAVDPAVGTPRMELRKFGRSYSQLLEVMLEHAQMEEKVLFPILEMADRGLCKASNEEHARDLPIMNGIK
EDIKSTVVLDLGSSVCQEALSNLSKRLKLLQEHCKHHFMDEEKKLLPLLEAVELTKEQQENMLEQLLDVMKQTHSHLLNFFLEGLLPQEALQYLDLVTSSCDKNRASFGL
MLQMTVA