; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g09270 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g09270
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:6849139..6852590
RNA-Seq ExpressionMoc04g09270
SyntenyMoc04g09270
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7011540.1 hypothetical protein SDJN02_26446, partial [Cucurbita argyrosperma subsp. argyrosperma]8.4e-4255.76Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWV
        MSS SGHFWS T++     LLQL +K       GG  RR  S PP  SSVA++PY+           FR FG                SS    F P   
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWV

Query:  KWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQ
        KWIFGSLLSLL+P+W    NK Q  E EAE +IEEAE+VAEVVEK AE+ EK SAEI +K+ E+SK+KEAAEVVE YSK+IAH A L Q ILHKVEEWKQ
Subjt:  KWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQ

Query:  KLDKSETAINEQIRKKE
        KLDKSE  INEQI+KKE
Subjt:  KLDKSETAINEQIRKKE

XP_008455165.1 PREDICTED: uncharacterized protein LOC103495399 [Cucumis melo]2.2e-4252.86Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGG----GRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNRE--KDQ--SSTPYF
        MSSK GH W +        LLQLF KT+N            RR S P Q S ++    P  +   +  + R   SY+    M+  ++  KD   SS+ +F
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGG----GRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNRE--KDQ--SSTPYF

Query:  FFPGWVKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHK
        FFP W KWIFG+LLSLL+P W    N LQ +E EAEMV+EE E+VAEVVEK AE+ EK S EI++KLPEKSKLKEAA+VVE YSK+IAHDAHLTQDILHK
Subjt:  FFPGWVKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHK

Query:  VEEWKQKLDKSETAINEQIRKKEGPAN
        VEEWKQK+DKS+  +NE   K+   AN
Subjt:  VEEWKQKLDKSETAINEQIRKKEGPAN

XP_022141966.1 uncharacterized protein LOC111012212 [Momordica charantia]6.1e-117100Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKW
        MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKW
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKW

Query:  IFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKL
        IFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKL
Subjt:  IFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKL

Query:  DKSETAINEQIRKKEGPANK
        DKSETAINEQIRKKEGPANK
Subjt:  DKSETAINEQIRKKEGPANK

XP_022972002.1 uncharacterized protein LOC111470651 [Cucurbita maxima]2.9e-4256.94Show/hide
Query:  MSSKSGHFWSSTVVLRLRS-LLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGW
        MSS SGHFWS T +LR RS LLQL HK       GG  RR  S PP  SSVA++PY+           FR FG                SS    + P  
Subjt:  MSSKSGHFWSSTVVLRLRS-LLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGW

Query:  VKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWK
         KWIFGSLLSL +P+W    NK Q LE EAE  IEEAE VAEVVEK AE+ EK SAEI +KLPEKS++K+AAE VE YSK+IAHDA L Q ILHKVEEWK
Subjt:  VKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWK

Query:  QKLDKSETAINEQIRK
        QKLDKSE  INEQ++K
Subjt:  QKLDKSETAINEQIRK

XP_038888803.1 uncharacterized protein LOC120078589 [Benincasa hispida]5.2e-5260.37Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTE-NCHGGGGGGRRSSPPPQ-SSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSST-PYFFFPGW
        MSSK GHFW +        L QLFHK + +   GG   RR S PPQ SS+ M+ +    +      + RL+ S      M+ ++EKDQ ST  +FFFPGW
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTE-NCHGGGGGGRRSSPPPQ-SSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSST-PYFFFPGW

Query:  VKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWK
         KW+FGSLLSLL+P+W    N+L+TLE EAEMVIEEAE+VA+VVE+ AE+ EK SAEIA+KLPEKSKLKEAA+VVE YSK++AHDAHLTQDILHKVEEWK
Subjt:  VKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWK

Query:  QKLDKSETAINEQIRKK
        QKLD SET +NEQI+KK
Subjt:  QKLDKSETAINEQIRKK

TrEMBL top hitse value%identityAlignment
A0A1S3C099 uncharacterized protein LOC1034953991.1e-4252.86Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGG----GRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNRE--KDQ--SSTPYF
        MSSK GH W +        LLQLF KT+N            RR S P Q S ++    P  +   +  + R   SY+    M+  ++  KD   SS+ +F
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGG----GRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNRE--KDQ--SSTPYF

Query:  FFPGWVKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHK
        FFP W KWIFG+LLSLL+P W    N LQ +E EAEMV+EE E+VAEVVEK AE+ EK S EI++KLPEKSKLKEAA+VVE YSK+IAHDAHLTQDILHK
Subjt:  FFPGWVKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHK

Query:  VEEWKQKLDKSETAINEQIRKKEGPAN
        VEEWKQK+DKS+  +NE   K+   AN
Subjt:  VEEWKQKLDKSETAINEQIRKKEGPAN

A0A6J1CKS7 uncharacterized protein LOC1110122122.9e-117100Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKW
        MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKW
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKW

Query:  IFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKL
        IFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKL
Subjt:  IFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKL

Query:  DKSETAINEQIRKKEGPANK
        DKSETAINEQIRKKEGPANK
Subjt:  DKSETAINEQIRKKEGPANK

A0A6J1GLX4 uncharacterized protein LOC111455450 isoform X12.6e-4154.84Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWV
        MSS SGHFWS T++     LLQL +K       GG  RR  S PP  SSVA++PY+           FR FG                SS    F P   
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWV

Query:  KWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQ
        KWIFGSLLSLL+P+W    NK Q  E EAE +IEEAE+VAEVVEK AE+ EK SAEI +K+ E+SK+KEAAEVVE YSK+IAH A L Q ILHKVEEWKQ
Subjt:  KWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQ

Query:  KLDKSETAINEQIRKKE
        KLDKS+  INEQ++KKE
Subjt:  KLDKSETAINEQIRKKE

A0A6J1GN56 uncharacterized protein LOC111455450 isoform X22.6e-4154.84Show/hide
Query:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWV
        MSS SGHFWS T++     LLQL +K       GG  RR  S PP  SSVA++PY+           FR FG                SS    F P   
Subjt:  MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWV

Query:  KWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQ
        KWIFGSLLSLL+P+W    NK Q  E EAE +IEEAE+VAEVVEK AE+ EK SAEI +K+ E+SK+KEAAEVVE YSK+IAH A L Q ILHKVEEWKQ
Subjt:  KWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQ

Query:  KLDKSETAINEQIRKKE
        KLDKS+  INEQ++KKE
Subjt:  KLDKSETAINEQIRKKE

A0A6J1I3G6 uncharacterized protein LOC1114706511.4e-4256.94Show/hide
Query:  MSSKSGHFWSSTVVLRLRS-LLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGW
        MSS SGHFWS T +LR RS LLQL HK       GG  RR  S PP  SSVA++PY+           FR FG                SS    + P  
Subjt:  MSSKSGHFWSSTVVLRLRS-LLQLFHKTENCHGGGGGGRR--SSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGW

Query:  VKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWK
         KWIFGSLLSL +P+W    NK Q LE EAE  IEEAE VAEVVEK AE+ EK SAEI +KLPEKS++K+AAE VE YSK+IAHDA L Q ILHKVEEWK
Subjt:  VKWIFGSLLSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWK

Query:  QKLDKSETAINEQIRK
        QKLDKSE  INEQ++K
Subjt:  QKLDKSETAINEQIRK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G14095.1 unknown protein1.5e-2546.62Show/hide
Query:  GSYFTSATMKLNREKDQSSTP--YFFFPGWVKWIFGSLLSLLIPTW-KQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLK
        G YF++ +   N  K Q S P  +F FP W +W+ GS +SL++  W  +   KL+ +EGEAE+V+E  E+VAE+VEK A   ++ + E+A+KLPEK+KLK
Subjt:  GSYFTSATMKLNREKDQSSTP--YFFFPGWVKWIFGSLLSLLIPTW-KQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLK

Query:  EAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKLDKSETAINEQIRKK
        + A V+E  S+  AH+AHLTQD LHKVE+  Q +D  E  I   I KK
Subjt:  EAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKLDKSETAINEQIRKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCGAAATCTGGTCATTTCTGGAGTTCTACTGTAGTGCTCAGATTGCGAAGCTTGCTTCAACTTTTCCATAAGACAGAGAACTGCCACGGCGGTGGCGGC
GGCGGTCGTCGGAGTTCTCCACCGCCGCAATCCTCCGTCGCGATGCTTCCTTACACGCCATGTTATAACCAACCACCAATTTATCCCATGTTTCGGTTATTTGGT
TCTTATTTCACCAGTGCTACGATGAAACTTAACAGAGAAAAGGACCAGTCATCTACACCTTATTTCTTCTTCCCTGGTTGGGTAAAATGGATTTTTGGCTCTCTA
TTGTCTCTCTTGATACCCACTTGGAAGCAAAGTTCGAATAAACTGCAAACTCTTGAAGGAGAAGCGGAAATGGTGATTGAGGAGGCTGAAAGTGTAGCAGAAGTA
GTAGAAAAGGCAGCAGAAATAGCAGAGAAGGCATCAGCAGAAATTGCAAAGAAACTTCCTGAGAAGAGTAAGCTGAAAGAAGCAGCTGAAGTGGTAGAAACTTAT
TCAAAGCAAATTGCCCATGATGCTCACTTAACACAAGACATCCTCCACAAGGTGGAAGAATGGAAGCAAAAGCTAGACAAGTCAGAGACAGCCATTAACGAACAG
ATCAGGAAGAAAGAAGGCCCAGCAAACAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCGAAATCTGGTCATTTCTGGAGTTCTACTGTAGTGCTCAGATTGCGAAGCTTGCTTCAACTTTTCCATAAGACAGAGAACTGCCACGGCGGTGGCGGC
GGCGGTCGTCGGAGTTCTCCACCGCCGCAATCCTCCGTCGCGATGCTTCCTTACACGCCATGTTATAACCAACCACCAATTTATCCCATGTTTCGGTTATTTGGT
TCTTATTTCACCAGTGCTACGATGAAACTTAACAGAGAAAAGGACCAGTCATCTACACCTTATTTCTTCTTCCCTGGTTGGGTAAAATGGATTTTTGGCTCTCTA
TTGTCTCTCTTGATACCCACTTGGAAGCAAAGTTCGAATAAACTGCAAACTCTTGAAGGAGAAGCGGAAATGGTGATTGAGGAGGCTGAAAGTGTAGCAGAAGTA
GTAGAAAAGGCAGCAGAAATAGCAGAGAAGGCATCAGCAGAAATTGCAAAGAAACTTCCTGAGAAGAGTAAGCTGAAAGAAGCAGCTGAAGTGGTAGAAACTTAT
TCAAAGCAAATTGCCCATGATGCTCACTTAACACAAGACATCCTCCACAAGGTGGAAGAATGGAAGCAAAAGCTAGACAAGTCAGAGACAGCCATTAACGAACAG
ATCAGGAAGAAAGAAGGCCCAGCAAACAAGTGA
Protein sequenceShow/hide protein sequence
MSSKSGHFWSSTVVLRLRSLLQLFHKTENCHGGGGGGRRSSPPPQSSVAMLPYTPCYNQPPIYPMFRLFGSYFTSATMKLNREKDQSSTPYFFFPGWVKWIFGSL
LSLLIPTWKQSSNKLQTLEGEAEMVIEEAESVAEVVEKAAEIAEKASAEIAKKLPEKSKLKEAAEVVETYSKQIAHDAHLTQDILHKVEEWKQKLDKSETAINEQ
IRKKEGPANK