; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10015210 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10015210
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionHeme-binding protein 2-like
Genome locationChr02:24859699..24860410
RNA-Seq ExpressionHG10015210
SyntenyHG10015210
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsIPR006917 - SOUL haem-binding protein
IPR011256 - Regulatory factor, effector binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034292.1 heme-binding protein 2-like [Cucumis melo var. makuwa]2.3e-9785.58Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG +LINFAL I  FFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGT+SFEKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM STR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPE L+RYYL  +NAE PPL +SELN+ FEKW++NCLAVRRFPGFAKDDNINKEIDALKS+LSK+LPESAAISEYTIAQYNSSRRL GRLNEVWLDVS  
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

XP_008446180.1 PREDICTED: uncharacterized protein LOC103488984 [Cucumis melo]2.3e-9785.58Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG +LINFAL I  FFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGT+SFEKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM STR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPE L+RYYL  +NAE PPL +SELN+ FEKW++NCLAVRRFPGFAKDDNINKEIDALKS+LSK+LPESAAISEYTIAQYNSSRRL GRLNEVWLDVS  
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

XP_011655613.1 uncharacterized protein LOC101213086 [Cucumis sativus]9.8e-10188.46Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG +LINFAL I  FFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGTASFEKST+QGFHRLYQY+HGANSNS HFL TSPVTTTIM  TR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPERL+RYYL I+NAE PPL +SELNV FEKWR+NCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAA+SEYTIAQYNSSRRL GRLNEVWLDVSG 
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

XP_022944900.1 heme-binding protein 2-like [Cucurbita moschata]1.8e-9483.5Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        M G ++INFAL  ICFFCCSSGRVIESP Y VIHVE++FEIRQYKQ+SW+SALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
         PERL+RYYL  +  E PPL +SELNVQFEKWRSNCLAVRRF GFAKDDNINKE++ALKSSL+KYLP+S+AISEYT+AQYNSSR LSGRLNEVW+DVS +
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQ
        T+EGCQ
Subjt:  TAEGCQ

XP_038892072.1 heme-binding protein 2-like [Benincasa hispida]1.2e-10392.79Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG LLINFAL IICFFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSH LITSPVTTT++AS  
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPE LIRYYL IVNAE PPL +SELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALK SLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSG 
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        TAEGCQPL
Subjt:  TAEGCQPL

TrEMBL top hitse value%identityAlignment
A0A0A0KVJ5 Uncharacterized protein4.8e-10188.46Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG +LINFAL I  FFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGTASFEKST+QGFHRLYQY+HGANSNS HFL TSPVTTTIM  TR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPERL+RYYL I+NAE PPL +SELNV FEKWR+NCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAA+SEYTIAQYNSSRRL GRLNEVWLDVSG 
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

A0A1S3BEF7 uncharacterized protein LOC1034889841.1e-9785.58Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG +LINFAL I  FFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGT+SFEKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM STR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPE L+RYYL  +NAE PPL +SELN+ FEKW++NCLAVRRFPGFAKDDNINKEIDALKS+LSK+LPESAAISEYTIAQYNSSRRL GRLNEVWLDVS  
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

A0A5A7ST56 Heme-binding protein 2-like1.1e-9785.58Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        MKG +LINFAL I  FFCCSSGRVIESP YKVIHVESDFEIRQYKQISWMSALVQGT+SFEKSTQQGFHRLYQY+HGANSNS  FL TSPVTTTIM STR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
        EPE L+RYYL  +NAE PPL +SELN+ FEKW++NCLAVRRFPGFAKDDNINKEIDALKS+LSK+LPESAAISEYTIAQYNSSRRL GRLNEVWLDVS  
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQPL
        T EGCQPL
Subjt:  TAEGCQPL

A0A6J1FZD2 heme-binding protein 2-like8.7e-9583.5Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        M G ++INFAL  ICFFCCSSGRVIESP Y VIHVE++FEIRQYKQ+SW+SALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
         PERL+RYYL  +  E PPL +SELNVQFEKWRSNCLAVRRF GFAKDDNINKE++ALKSSL+KYLP+S+AISEYT+AQYNSSR LSGRLNEVW+DVS +
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQ
        T+EGCQ
Subjt:  TAEGCQ

A0A6J1HVL2 heme-binding protein 2-like8.1e-9382.04Show/hide
Query:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR
        M G +L+NFAL I  F CCSSGRVIESP Y VIHVE++FEIRQYKQ+SW+SALVQGTASFEKSTQQGFHRLYQYIHGAN NSSHFLITSPVTTTIMAST 
Subjt:  MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR

Query:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL
         PERL+RYYL  +  E PPL +SELNVQFEKWRSNCLAVRRF GFAKDDNINKE++ALKSSL+KYLP+S+AISEYT+AQYNSSR LSGRLNEVWLDVS +
Subjt:  EPERLIRYYLAIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGL

Query:  TAEGCQ
        T+EGCQ
Subjt:  TAEGCQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G17100.1 SOUL heme-binding family protein5.7e-2235.14Show/hide
Query:  IESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR---EPERLIRYYLAIVNAETPPLA
        IE P Y+++H  + +EIR+Y    W+S       S   +T+  F +L+ YI G N       +T+PV + +  S     E    + +Y+   N   P  A
Subjt:  IESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTR---EPERLIRYYLAIVNAETPPLA

Query:  DSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSL-----------SKYLPESAAISEYTIAQYNSSRRLSGRLNEVWL
         SE N+  +KW S  +AVR+F GF  DD+I ++  AL SSL           SK      + S YT+AQYNS    SGR+NE+WL
Subjt:  DSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSL-----------SKYLPESAAISEYTIAQYNSSRRLSGRLNEVWL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGGTAATTTGTTGATCAATTTTGCACTAATAATAATTTGCTTCTTCTGTTGCAGCTCAGGCAGAGTGATTGAATCTCCACGTTATAAAGTGATTCATGTAGAATC
AGATTTTGAAATCAGACAGTACAAACAAATTTCATGGATGTCTGCTCTTGTTCAAGGAACAGCCTCCTTTGAAAAGTCCACCCAACAAGGCTTCCACAGGTTGTATCAAT
ACATTCATGGTGCTAACAGCAATTCTTCTCACTTTCTTATTACTTCTCCTGTCACAACCACCATTATGGCATCGACACGCGAACCCGAGCGTTTGATTAGGTATTATCTG
GCAATTGTGAATGCCGAAACCCCGCCCCTGGCTGATTCTGAACTGAATGTTCAGTTTGAGAAGTGGAGAAGCAATTGCTTGGCAGTCAGGAGGTTTCCTGGATTTGCTAA
AGATGATAATATCAACAAAGAAATTGATGCTCTAAAGAGCAGCTTGAGCAAGTATCTACCTGAGAGTGCAGCTATTTCAGAATACACCATTGCTCAGTATAATTCTTCAC
GTCGCTTGTCGGGGCGTTTGAATGAAGTCTGGCTCGATGTTTCGGGTTTAACTGCAGAGGGATGTCAACCCCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGGTAATTTGTTGATCAATTTTGCACTAATAATAATTTGCTTCTTCTGTTGCAGCTCAGGCAGAGTGATTGAATCTCCACGTTATAAAGTGATTCATGTAGAATC
AGATTTTGAAATCAGACAGTACAAACAAATTTCATGGATGTCTGCTCTTGTTCAAGGAACAGCCTCCTTTGAAAAGTCCACCCAACAAGGCTTCCACAGGTTGTATCAAT
ACATTCATGGTGCTAACAGCAATTCTTCTCACTTTCTTATTACTTCTCCTGTCACAACCACCATTATGGCATCGACACGCGAACCCGAGCGTTTGATTAGGTATTATCTG
GCAATTGTGAATGCCGAAACCCCGCCCCTGGCTGATTCTGAACTGAATGTTCAGTTTGAGAAGTGGAGAAGCAATTGCTTGGCAGTCAGGAGGTTTCCTGGATTTGCTAA
AGATGATAATATCAACAAAGAAATTGATGCTCTAAAGAGCAGCTTGAGCAAGTATCTACCTGAGAGTGCAGCTATTTCAGAATACACCATTGCTCAGTATAATTCTTCAC
GTCGCTTGTCGGGGCGTTTGAATGAAGTCTGGCTCGATGTTTCGGGTTTAACTGCAGAGGGATGTCAACCCCTGTAA
Protein sequenceShow/hide protein sequence
MKGNLLINFALIIICFFCCSSGRVIESPRYKVIHVESDFEIRQYKQISWMSALVQGTASFEKSTQQGFHRLYQYIHGANSNSSHFLITSPVTTTIMASTREPERLIRYYL
AIVNAETPPLADSELNVQFEKWRSNCLAVRRFPGFAKDDNINKEIDALKSSLSKYLPESAAISEYTIAQYNSSRRLSGRLNEVWLDVSGLTAEGCQPL