; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Bhi04G001030 (gene) of Wax gourd (B227) v1 genome

Gene IDBhi04G001030
OrganismBenincasa hispida cv. B227 (Wax gourd (B227) v1)
DescriptionUnknown protein
Genome locationchr4:32744228..32749468
RNA-Seq ExpressionBhi04G001030
SyntenyBhi04G001030
Gene Ontology termsGO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138679.1 uncharacterized protein LOC101205088 [Cucumis sativus]3.4e-6288.06Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M +NPKF+ +KEDE++PKS  PFPWFSFLPKFDFRLPFP+NGGKKPP VVVDE RKAD+DAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAV+WQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSN
        LGGFLILSWAWARWKERRPQRRSNDD+EDEDSS+
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSN

XP_008456569.1 PREDICTED: uncharacterized protein LOC103496485 [Cucumis melo]7.1e-6087.12Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M +NPKF+ MKEDE++PKS  PFPWFSFLPKFDFRLPFP+NGGKK P VVVDE RKAD+DAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAV+WQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRPQRRSNDD+ED+ S
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

XP_023550786.1 uncharacterized protein LOC111808822 isoform X1 [Cucurbita pepo subsp. pepo]2.0e-5484.09Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M+E PK IGMKEDE SPK  FPF WFSFLPKFD RLP PINGGKKPPA V+DEG K DDDAQKPEFVRFPKA L V SVE EADVSGKTSNPAVIWQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRP+RRS+DD+E+EDS
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

XP_023550789.1 uncharacterized protein LOC111808822 isoform X3 [Cucurbita pepo subsp. pepo]2.0e-5484.09Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M+E PK IGMKEDE SPK  FPF WFSFLPKFD RLP PINGGKKPPA V+DEG K DDDAQKPEFVRFPKA L V SVE EADVSGKTSNPAVIWQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRP+RRS+DD+E+EDS
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

XP_038886165.1 uncharacterized protein LOC120076415 [Benincasa hispida]6.2e-72100Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSNI
        LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSNI
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSNI

TrEMBL top hitse value%identityAlignment
A0A0A0LSR5 Uncharacterized protein1.7e-6288.06Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M +NPKF+ +KEDE++PKS  PFPWFSFLPKFDFRLPFP+NGGKKPP VVVDE RKAD+DAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAV+WQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSN
        LGGFLILSWAWARWKERRPQRRSNDD+EDEDSS+
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDSSN

A0A1S3C3J3 uncharacterized protein LOC1034964853.5e-6087.12Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M +NPKF+ MKEDE++PKS  PFPWFSFLPKFDFRLPFP+NGGKK P VVVDE RKAD+DAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAV+WQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRPQRRSNDD+ED+ S
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

A0A5D3CKR2 Uncharacterized protein3.5e-6087.12Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M +NPKF+ MKEDE++PKS  PFPWFSFLPKFDFRLPFP+NGGKK P VVVDE RKAD+DAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAV+WQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRPQRRSNDD+ED+ S
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

A0A6J1FFF8 uncharacterized protein LOC1114449955.3e-5382.58Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M+ENPK IGMKED  SPK  FPF WFSFLPKFD RLP PINGGKKPPA V+DE  K DD AQKPEFVRFPKA L V SVE EADVSGKTSNPAVIWQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRP+RRS+DD+E+EDS
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

A0A6J1JVP7 uncharacterized protein LOC1114892981.8e-5381.82Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA
        M+E+PK IG+KE E+SPK  FPF WFSFLPKFD RLP PINGGKKPPA V+DEG K D+DAQKPEFVRFPKA L V SVE EADVSGKTSNPAVIWQVYA
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYA

Query:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS
        LGGFLILSWAWARWKERRP+RRS+DD+E+EDS
Subjt:  LGGFLILSWAWARWKERRPQRRSNDDEEDEDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G52230.1 unknown protein1.2e-1235.81Show/hide
Query:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKP-----EFVRF----PKAELPVASVEAEADVSGKTSN
        MAE  + +      DS   P   P F     F F  P       KPP   +D         ++P     E V F    PK+  P+   EAE   SG+TSN
Subjt:  MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKP-----EFVRF----PKAELPVASVEAEADVSGKTSN

Query:  PAVIWQVYALGGFLILSWAWARWKER-----RPQRRSNDDEEDEDSSN
          ++WQVYALGGFL+L WAWARW ER     + +   +DD++D+D  +
Subjt:  PAVIWQVYALGGFLILSWAWARWKER-----RPQRRSNDDEEDEDSSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGAGAACCCAAAATTTATTGGGATGAAAGAAGACGAGGATTCCCCAAAATCCCCCTTCCCCTTTCCTTGGTTCTCTTTTCTTCCCAAGTTTGACTTCCGA
TTGCCTTTTCCGATCAACGGCGGCAAGAAACCGCCGGCTGTGGTGGTTGATGAAGGCCGAAAGGCCGACGACGATGCTCAGAAGCCGGAGTTCGTGAGGTTTCCT
AAAGCAGAGTTGCCTGTCGCTTCGGTGGAGGCTGAAGCTGATGTCTCCGGCAAGACTTCCAATCCGGCGGTCATCTGGCAGGTATATGCCCTGGGTGGGTTTCTT
ATATTGAGTTGGGCATGGGCAAGATGGAAGGAGAGAAGGCCCCAAAGACGTTCAAATGATGACGAGGAGGACGAAGATTCCAGTAATATCTAG
mRNA sequenceShow/hide mRNA sequence
AAGGATCACAACAATGGACTGAGATTGTCTCTTTCTCTCATATTCATTGTAGCCATAATATTACAGCAAAAAAATGGCAGAGAACCCAAAATTTATTGGGATGAA
AGAAGACGAGGATTCCCCAAAATCCCCCTTCCCCTTTCCTTGGTTCTCTTTTCTTCCCAAGTTTGACTTCCGATTGCCTTTTCCGATCAACGGCGGCAAGAAACC
GCCGGCTGTGGTGGTTGATGAAGGCCGAAAGGCCGACGACGATGCTCAGAAGCCGGAGTTCGTGAGGTTTCCTAAAGCAGAGTTGCCTGTCGCTTCGGTGGAGGC
TGAAGCTGATGTCTCCGGCAAGACTTCCAATCCGGCGGTCATCTGGCAGGTATATGCCCTGGGTGGGTTTCTTATATTGAGTTGGGCATGGGCAAGATGGAAGGA
GAGAAGGCCCCAAAGACGTTCAAATGATGACGAGGAGGACGAAGATTCCAGTAATATCTAGTTTTGTGTTTGCCTTGAACAGCCCTTAATAAATGTTGAAGTTAC
ATTTTTAGTGCTAGAATCAATCTTGAGTTTATACCCGTTATTTGTAATTTCGAGTTAAATCAGTCATTTTCTAACCCTTTGTCAATCCTTGAAAATGATCTTAGT
ATGAAATTAAAACCCATCTCCATGAGGGTTTTCTAATGTATTGGTGCAAAAGCAACAATATTTGATCTTAATAAGAGGCTTCCTGCTTGCAAACGGTAATATGCA
AAAGCAAGTATTTAACGTTAGCCACCGTCTTCATGAGGCTTCAGAGTAAAGAAAACTTGTATGTTTGAATCCTTTGCTATCAGTCATTGCTATATATACATGATA
CAGAATTTGCAAGAGGTCAATGAATTGAAAACCAGCATTGGTTTATTACTATTATTAATCACAAAAGTAGTTGCCGGTAAAAGAATGATACATCTCGAGTAAAAC
ATGTACAGAGACCAAATTTATGCAGGAACTCAATCAAGTTTCTAACAAGCTATCAAGAAAATTTGACTAACTAAACCAACCTATTCCGTGTAGCCCTGAACAAGT
GTGGTAGTTTTGATTTGCCAGCTAACTCAACATTCTATGTACACCATCCTGAGACCCTTTCCTAATAACTGGGGTCCATGATATGCGACTGGTACCGCATATAAC
TTAATGGTATATCCTTCGCAAGAGCAACGAGAAAATCCGGTTCATACCAAAACCAAGTAGTTCACTCTGGCTTGAAATAACCATCATATATGGATAGTTGATCGA
ATGCATCAAAATTTGCCTTCAGTCCAATAAATTGGCTACCCAAATGAGCACAACCAGCATCAGGCCACCGACACCCTCCGTCCCGAGTGCGTAATGGGCAAAGAG
CTTCACAGATGTGACCGAGGGGGTTACTTGGAACATCATCTGCTGTAACTGTGATAGAAAGGGAGAGGTCATTCTCTTCACTGGGGGATGGCAGTGAACGATGTG
AAACCCTTCTTGAAGCAGGATGACTTTTCTGGTTGGATTTTTTCATCAGGTCGCCCTCAATATCAACAGTACAATAACCATTACTAGAGACTTGAACCTTGCAAC
CTAGAGACTGAATTGCTCTCTTTATGGCTTCACATTCTTTGCTTGCACTTTTTGCAGCATTCCTAGCAGATATACTTAATTGTTTCTCGTTTTGCAACTCTGATT
TTAATGATTCCATGGCAGCCGTCATCTCCTGGGCTTTTCTCTCGGCCATTTCTTTGTCCTATTAGGCATTTTATGGTATGCAAGAATCAATCAGGTTGACACTGA
GGACAACAAAACATTGTACATAAACGACGCCCATACATAAAATTAACTTTCAAGAAACAACAGGTTCACCAACTATGAATAAACCGAATCTTAAGTGGTTATAGA
AAGCCACTTCTCTCTTTTCGGTTTTGGCTACTGTTCATTTGCTCCAAAACAATCAGATTAAAGTAAAAGTCATATAATATCAGCAATCCAATTACTATAGAACTA
CTTCAATGAAGTAAGGGGAAGTTGTGCATTGCAACAGAATAAATAGTTTTGTAAGTTCCTTGGCTTGGAAGTTGGAATTCTTTGGCTATCCCCCCTCTTTTTGTG
ATTTCATACCATCAACAGGAGAATATAAAAAATGAAAACAATTTGTTCTTGTGCAGAGAACTCCCAAGATAATTTTTAAAAACAGTTCATTAAAAAAAAAAAAAA
AAACTGCAGCTGTTTTTTGTTGTATTCTTTAGTGTGTTACAACTGTCCCTGCGTTTGGAACTAAAACGATAGAAGATAAATTTTGAGACAAACCAATTTGAAAAG
GAGTTTTTTAAAACAAAGTTTAAAAATACAACCAAACAATGAATTAAATTTCTTCTGCAACTGTTTCTAGAGTTCGGGACATTTATTCATGATCCAAGTATGAGA
AAAGAATCGTACCTTCTGCATTTCAGAGAATTTTATAGCCATTTTCCTGCATCTGACTTCAAGTTCCCCAGCTTGGGATTCTTTTGGACAAGCCCAACGAAGATG
GAATTGTGGCCAGAGAGTTGGAGCTAGAGCTGCTGCCGGAGGTAAGATAGGGCCTTCATGTTTGGTAGGTTCATAGAAGAGGTTATAATGCACATGTGAACTTCC
TTCCGAGGCTCGTAAATCAGCTAAATATGCCCATAGACAGCCACAAACTTCAGAAACTGCACATTGCTGCCGCTCCTTCTCACTGAGACATCAGAAAACAATCAT
TCACCAAGTCCCTCCAATAAGTGCTATTATTAACGTTTAACAATTTAGGGCGCGTTTGGGGTACTGAGTTGGTTATTATAGTCAATGGGTTATAAGAGTTTGTGT
TTAAGGGTGCAAACTATTTTTATTTGGATAAGAAATAGTAAATGTTGTAGCAAGAAAAGAAGACAGGATGATTGAAATATAATAAATACGGTAGCAAATGGTAAA
TACTATAGCAAAAAGGAGTTTGAAATAGTATTTATTATAGTTAATTGTAGGTTATGCTTAGAATTCAAGGATACAATTGTACAACCTATAGAATAGATCCTGTTC
AAGTAAGGTCGTTATGCTTAATTGTAAGTTATAAATAATTGTTCACACCACTATTATACTTGGTGCGCCAAAGCATGGAGTGGGCTATAATAACCTACTCCACCA
ATTTCTAGTTGGTGCCCCCAAACGGCCCCTTAAGATCAGTAGTGTAAGATATATATAACATGGGCCAGTTACACGGAACTATTCCAAGAAATAAACTGTCTTAAA
AGTTCTAATCATGGAATAATGATCTTTATCTACAGACAATAGTTAAACAGG
Protein sequenceShow/hide protein sequence
MAENPKFIGMKEDEDSPKSPFPFPWFSFLPKFDFRLPFPINGGKKPPAVVVDEGRKADDDAQKPEFVRFPKAELPVASVEAEADVSGKTSNPAVIWQVYALGGFL
ILSWAWARWKERRPQRRSNDDEEDEDSSNI