; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc09g32390 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc09g32390
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionARM repeat superfamily protein
Genome locationchr9:24473688..24476512
RNA-Seq ExpressionMoc09g32390
SyntenyMoc09g32390
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004146451.1 uncharacterized protein LOC101212969 isoform X1 [Cucumis sativus]2.3e-8587.89Show/hide
Query:  MSMLASKLNTY--LCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSID
        M ++ASKL T+  LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV+LL+VKDPLFKRMGASRLARFSID
Subjt:  MSMLASKLNTY--LCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSID

Query:  DKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        D++RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  DKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_011655026.1 uncharacterized protein LOC101212969 isoform X2 [Cucumis sativus]9.5e-8788.83Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL T+LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV+LL+VKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        +RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_022146798.1 uncharacterized protein LOC111015917 [Momordica charantia]1.3e-96100Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_022150134.1 uncharacterized protein LOC111018388 [Momordica charantia]2.1e-8992.55Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MS+L  KLN YLCRREPVRTLQFRTFSAY ESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGD+LMQQSVALLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKS P+SVED K+NE+KSNL+KRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

XP_038891777.1 uncharacterized protein LOC120081166 isoform X1 [Benincasa hispida]4.3e-8789.36Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL T+LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATF+GYQIFPYMGDNL+QQSV+LLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DL YDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

TrEMBL top hitse value%identityAlignment
A0A0A0KM85 Uncharacterized protein4.6e-8788.83Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL T+LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV+LL+VKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        +RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEA  ALHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A1S3C497 uncharacterized protein LOC103496719 isoform X29.6e-8587.77Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ASKL  +LCRREPVRTLQFRTFSAY+E EIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNL+QQSV LL+VKDPLFKRMGASRLARFSIDDK
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVE GGAQELLNML  AKDDRTRKEALKAL+AISHSDEAV  LHKAGAILVIKSTPDS ED K+NE+KSNL+KRF DLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A6J1CZI6 uncharacterized protein LOC1110159176.4e-97100Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A6J1D932 uncharacterized protein LOC1110183889.9e-9092.55Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        MS+L  KLN YLCRREPVRTLQFRTFSAY ESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGD+LMQQSVALLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKS P+SVED K+NE+KSNL+KRFEDLRYDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

A0A6J1FTA5 uncharacterized protein LOC111447100 isoform X29.6e-8586.7Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK
        M ++ SKL  +LCRREP RTLQFR FSAY+E EIEKEAERKVGWLLKLIFAGTATFLGY IFPYMGDNL+QQSV+LLQVKDPLFKRMGASRLARFSIDD+
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDK

Query:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        +RMKIVE+GGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAV ALHKAGAILVIKSTPDS EDT++NE+KSNL+KRF DL YDVSS
Subjt:  RRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G56210.1 ARM repeat superfamily protein1.4e-5656.54Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

AT3G56210.2 ARM repeat superfamily protein1.4e-5656.54Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

AT3G56210.4 ARM repeat superfamily protein1.4e-5656.54Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS

AT3G56210.5 ARM repeat superfamily protein4.6e-5555.96Show/hide
Query:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI
        M +L +++  + CR       + R FS+ N+ +   +E+EAERK+GW LK+ FAGTAT++GYQ FPY+GDNL+ QS++LL VKDPLFKRMGASRL+RF+I
Subjt:  MSMLASKLNTYLCRREPVRTLQFRTFSAYNESE---IEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSI

Query:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS
        DD+RRMK+VEMGGAQELL+MLG+AKDD+TRKEALKAL A+S S   EA   L   GA+ ++KSTP+S+ED+ ++ +KSN++++  +    VSS
Subjt:  DDKRRMKIVEMGGAQELLNMLGAAKDDRTRKEALKALHAISHS--DEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCATGCTCGCATCGAAGTTGAACACTTATCTCTGCAGAAGAGAGCCTGTGCGGACCCTGCAATTTCGCACTTTTTCAGCTTACAACGAAAGCGAGATCGAGAAGGA
GGCTGAAAGAAAAGTAGGATGGTTATTGAAACTAATCTTTGCTGGGACTGCCACATTTCTGGGTTACCAGATTTTTCCATACATGGGGGATAACTTGATGCAGCAATCTG
TGGCGCTCTTGCAAGTCAAGGATCCATTGTTTAAGAGGATGGGAGCATCTAGATTGGCTCGTTTTTCGATAGACGATAAAAGAAGAATGAAAATAGTGGAGATGGGTGGA
GCTCAAGAGCTCTTAAACATGTTGGGGGCTGCCAAAGACGACCGCACGCGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCTCATTCAGATGAAGCTGTCAGTGCTCT
GCATAAAGCAGGGGCAATCTTGGTTATTAAATCTACTCCGGATTCGGTTGAAGATACGAAACTGAACGAGTTCAAGTCCAACCTAATAAAGAGATTTGAAGATCTTAGAT
ACGATGTCTCGTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCATGCTCGCATCGAAGTTGAACACTTATCTCTGCAGAAGAGAGCCTGTGCGGACCCTGCAATTTCGCACTTTTTCAGCTTACAACGAAAGCGAGATCGAGAAGGA
GGCTGAAAGAAAAGTAGGATGGTTATTGAAACTAATCTTTGCTGGGACTGCCACATTTCTGGGTTACCAGATTTTTCCATACATGGGGGATAACTTGATGCAGCAATCTG
TGGCGCTCTTGCAAGTCAAGGATCCATTGTTTAAGAGGATGGGAGCATCTAGATTGGCTCGTTTTTCGATAGACGATAAAAGAAGAATGAAAATAGTGGAGATGGGTGGA
GCTCAAGAGCTCTTAAACATGTTGGGGGCTGCCAAAGACGACCGCACGCGTAAGGAAGCTTTGAAGGCTTTACATGCCATCTCTCATTCAGATGAAGCTGTCAGTGCTCT
GCATAAAGCAGGGGCAATCTTGGTTATTAAATCTACTCCGGATTCGGTTGAAGATACGAAACTGAACGAGTTCAAGTCCAACCTAATAAAGAGATTTGAAGATCTTAGAT
ACGATGTCTCGTCTTGA
Protein sequenceShow/hide protein sequence
MSMLASKLNTYLCRREPVRTLQFRTFSAYNESEIEKEAERKVGWLLKLIFAGTATFLGYQIFPYMGDNLMQQSVALLQVKDPLFKRMGASRLARFSIDDKRRMKIVEMGG
AQELLNMLGAAKDDRTRKEALKALHAISHSDEAVSALHKAGAILVIKSTPDSVEDTKLNEFKSNLIKRFEDLRYDVSS