CuGenDBv2

MS021223 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021223
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Unknown protein
Genome location: scaffold358:118956..119369
RNA-Seq Expression: MS021223
Synteny: MS021223
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, %identity, alignment)
KAG7024654.1 hypothetical protein SDJN02_13472, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value: 2.5e-36; %identity: 63.4
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG--------GGGGETPVRLLGGE---EELGFGSRNSTAAV
        MEALWNLEDK +LST+QA IL T  A AV G CA AW K R       R+     RW   G        GGGGETP RLL GE   E   FGSRNSTAAV
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG--------GGGGETPVRLLGGE---EELGFGSRNSTAAV

Query:  WQRPILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRKSFRNY
        WQRPILMGEKCEMLKYSGLILYD+RGRLL D IA  AME   KVLLRK FR++
Subjt:  WQRPILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRKSFRNY

XP_022141812.1 uncharacterized protein LOC111012091 isoform X1 [Momordica charantia]; e value: 1.7e-64; %identity: 97.71
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
        MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVK RGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS

Query:  GLILYDQRGRLLHDSIAATAMEGAPKVLLRK
        GLILYDQRGRLLHDSIAATAMEGA KVLLR+
Subjt:  GLILYDQRGRLLHDSIAATAMEGAPKVLLRK

XP_022141821.1 uncharacterized protein LOC111012091 isoform X2 [Momordica charantia]; e value: 2.7e-62; %identity: 98.41
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
        MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVK RGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS

Query:  GLILYDQRGRLLHDSIAATAMEGAPK
        GLILYDQRGRLLHDSIAATAMEGA K
Subjt:  GLILYDQRGRLLHDSIAATAMEGAPK

XP_022976856.1 uncharacterized protein LOC111477103 isoform X1 [Cucurbita maxima]; e value: 2.5e-36; %identity: 64.86
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP
        MEALWNLEDK +LST+QA IL T  A AV G CA  W K R       R+     RW   G        GGGETPVRLLGGE E  FGSRNSTAAVWQRP
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP

Query:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRKSFRN
        ILMGEKCEMLKYSGLILYD+RGRLL D IA  AME   KVLLR+  RN
Subjt:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRKSFRN

XP_022976858.1 uncharacterized protein LOC111477103 isoform X3 [Cucurbita maxima]; e value: 1.3e-35; %identity: 65.97
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP
        MEALWNLEDK +LST+QA IL T  A AV G CA  W K R       R+     RW   G        GGGETPVRLLGGE E  FGSRNSTAAVWQRP
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP

Query:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRK
        ILMGEKCEMLKYSGLILYD+RGRLL D IA  AME   KVLLRK
Subjt:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRK

TrEMBL top hits (e value, %identity, alignment)
A0A6J1CKD7 uncharacterized protein LOC111012091 isoform X2; e value: 1.3e-62; %identity: 98.41
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
        MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVK RGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS

Query:  GLILYDQRGRLLHDSIAATAMEGAPK
        GLILYDQRGRLLHDSIAATAMEGA K
Subjt:  GLILYDQRGRLLHDSIAATAMEGAPK

A0A6J1CKW9 uncharacterized protein LOC111012091 isoform X1; e value: 8.1e-65; %identity: 97.71
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
        MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVK RGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYS

Query:  GLILYDQRGRLLHDSIAATAMEGAPKVLLRK
        GLILYDQRGRLLHDSIAATAMEGA KVLLR+
Subjt:  GLILYDQRGRLLHDSIAATAMEGAPKVLLRK

A0A6J1F750 uncharacterized protein LOC111442724 isoform X1; e value: 1.1e-34; %identity: 63.51
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG--------GGGGETPVRLLGGE---EELGFGSRNSTAAV
        MEALWNLEDK +LST+QA IL T  A AV G CA AW K R       R+     RW   G        GGGGETP RLL GE   E   FGSRNSTAAV
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG--------GGGGETPVRLLGGE---EELGFGSRNSTAAV

Query:  WQRPILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRK
        WQRPILMGEKCEMLKYSGLILYD+RGRLL D IA  AME   KVLLR+
Subjt:  WQRPILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRK

A0A6J1II18 uncharacterized protein LOC111477103 isoform X1; e value: 1.2e-36; %identity: 64.86
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP
        MEALWNLEDK +LST+QA IL T  A AV G CA  W K R       R+     RW   G        GGGETPVRLLGGE E  FGSRNSTAAVWQRP
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP

Query:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRKSFRN
        ILMGEKCEMLKYSGLILYD+RGRLL D IA  AME   KVLLR+  RN
Subjt:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRKSFRN

A0A6J1IND6 uncharacterized protein LOC111477103 isoform X3; e value: 6.1e-36; %identity: 65.97
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP
        MEALWNLEDK +LST+QA IL T  A AV G CA  W K R       R+     RW   G        GGGETPVRLLGGE E  FGSRNSTAAVWQRP
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNR------GRRRSGWARWGGGG-------GGGGETPVRLLGGEEELGFGSRNSTAAVWQRP

Query:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRK
        ILMGEKCEMLKYSGLILYD+RGRLL D IA  AME   KVLLRK
Subjt:  ILMGEKCEMLKYSGLILYDQRGRLLHDSIAATAMEGAPKVLLRK

SwissProt top hits (e value, %identity, alignment)
No hits found
Arabidopsis top hits (e value, %identity, alignment)
AT1G49000.1 unknown protein; e value: 4.9e-06; %identity: 62.5
Query:  VWQRPILMGEKCEMLKYSGLILYDQRGRLLHD
        +WQR ILMG KCE L +SG+I YD  GRLL++
Subjt:  VWQRPILMGEKCEMLKYSGLILYDQRGRLLHD

AT1G71740.1 unknown protein; e value: 2.7e-04; %identity: 39.22
Query:  PVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGRLL
        P  LL G++  G         +WQ+ ILMG KC++  +SG+ILYD  G+++
Subjt:  PVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGRLL

AT3G14760.1 unknown protein; e value: 2.1e-12; %identity: 37.67
Query:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRR--RSGWARWGGGGGGGGETPVR------------LLGG----------EEE-----
        MEALW LE+KL+L+TK+A+++    AAAV  LC  A   NR  R  +   A W      G  T               L+G           E E     
Subjt:  MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRR--RSGWARWGGGGGGGGETPVR------------LLGG----------EEE-----

Query:  ----LGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGRLLH
            +   S N+   VWQRPILMGEKCE+ ++SGLILYD+ G   H
Subjt:  ----LGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGRLLH

AT3G18560.1 unknown protein; e value: 5.5e-05; %identity: 46.55
Query:  GGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGRLL
        GG   +T V ++  E+E  +G       VWQR ILMG KCE L YSG+I YD  G  L
Subjt:  GGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGRLL


Sequences
CDS sequence
ATGGAAGCTCTGTGGAACTTGGAAGACAAATTGAGGCTCTCCACGAAGCAAGCCCTAATTCTCTTCACATTCGCGGCGGCGGCCGTGGCCGGGCTCTGCGCGACGGCGTG
GGTGAAGAACAGAGGCCGGCGACGGAGCGGGTGGGCGCGGTGGGGCGGAGGAGGCGGAGGCGGAGGAGAGACGCCGGTGCGGCTGTTGGGAGGGGAAGAGGAATTAGGGT
TCGGAAGCCGGAATTCGACGGCGGCGGTGTGGCAGAGGCCGATATTAATGGGGGAGAAATGCGAGATGCTCAAGTACAGTGGGCTTATTCTGTACGACCAAAGGGGAAGA
TTGCTGCACGATTCCATTGCCGCCACCGCCATGGAAGGCGCTCCCAAGGTGCTGCTTCGTAAGTCGTTCAGAAATTATTTTTTT
mRNA sequence
ATGGAAGCTCTGTGGAACTTGGAAGACAAATTGAGGCTCTCCACGAAGCAAGCCCTAATTCTCTTCACATTCGCGGCGGCGGCCGTGGCCGGGCTCTGCGCGACGGCGTG
GGTGAAGAACAGAGGCCGGCGACGGAGCGGGTGGGCGCGGTGGGGCGGAGGAGGCGGAGGCGGAGGAGAGACGCCGGTGCGGCTGTTGGGAGGGGAAGAGGAATTAGGGT
TCGGAAGCCGGAATTCGACGGCGGCGGTGTGGCAGAGGCCGATATTAATGGGGGAGAAATGCGAGATGCTCAAGTACAGTGGGCTTATTCTGTACGACCAAAGGGGAAGA
TTGCTGCACGATTCCATTGCCGCCACCGCCATGGAAGGCGCTCCCAAGGTGCTGCTTCGTAAGTCGTTCAGAAATTATTTTTTT
Protein sequence
MEALWNLEDKLRLSTKQALILFTFAAAAVAGLCATAWVKNRGRRRSGWARWGGGGGGGGETPVRLLGGEEELGFGSRNSTAAVWQRPILMGEKCEMLKYSGLILYDQRGR
LLHDSIAATAMEGAPKVLLRKSFRNYFF
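
The relationship between the genome location, the CDS, and the protein sequence in this record can be checked with a short script. What follows is a minimal sketch, not part of the CuGenDB record: it assumes the standard genetic code (NCBI translation table 1), and the helper names `translate` and `span_length` are hypothetical. It translates the first and last codons of the CDS shown above and computes the length of the inclusive coordinate span scaffold358:118956..119369.

```python
# Standard genetic code (NCBI translation table 1), laid out in TCAG order.
# Assumption: this record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0; trailing partial codons are ignored."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate span such as 118956..119369."""
    return end - start + 1


# First 30 and last 24 nucleotides of the CDS, copied from the record above.
cds_start = "ATGGAAGCTCTGTGGAACTTGGAAGACAAA"
cds_end = "AAGTCGTTCAGAAATTATTTTTTT"

print(translate(cds_start))         # MEALWNLEDK
print(translate(cds_end))           # KSFRNYFF
print(span_length(118956, 119369))  # 414
```

The two translated fragments match the start and end of the protein sequence listed above, and the 414 bp span is a multiple of three (138 codons). The final codon TTT encodes F, not a stop, so the listed CDS ends without a stop codon.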