CuGenDBv2

MS001937 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS001937
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: DUF4228 domain-containing protein
Genome location: scaffold30:1158926..1159745
RNA-Seq Expression: MS001937
Synteny: MS001937
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
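
The genome location above is written as scaffold:start..end. As a minimal sketch, the following parses that string and computes the span length. The helper names `parse_location` and `span_length` are hypothetical, and the code assumes 1-based inclusive coordinates, which this record does not state.

```python
import re

# Pattern for "<sequence id>:<start>..<end>", e.g. "scaffold30:1158926..1159745".
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (sequence id, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("scaffold30:1158926..1159745")
print(seqid, span_length(start, end))  # scaffold30 820
```

Under that assumption the gene spans 820 bp on scaffold30.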


Homology
GenBank top hits (e value, % identity, alignment)
KAA0057887.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa] (e value: 1.4e-74; % identity: 86.1)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNES NNSN  +NETSSNSVRLTRIKLLRPADMLVLGQVYRLIT+QEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GLSAKK AKV ++ LEAA+KP+RRKER    SD+AA RS  ED + QA KHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_004138125.1 uncharacterized protein LOC101211887 [Cucumis sativus] (e value: 7.8e-73; % identity: 84.21)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNN N  +NETSSNSVRLTRIKLLRPADMLVLGQVYRLIT+QEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAAR---RSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GLSAKK AKV ++ LEAA+KP+RRK+R    SD+AA    RS  ED + QA KHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAAR---RSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_008453120.1 PREDICTED: uncharacterized protein LOC103493930 [Cucumis melo] (e value: 1.4e-74; % identity: 86.1)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNES NNSN  +NETSSNSVRLTRIKLLRPADMLVLGQVYRLIT+QEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GLSAKK AKV ++ LEAA+KP+RRKER    SD+AA RS  ED + QA KHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_022135561.1 uncharacterized protein LOC111007486 isoform X1 [Momordica charantia] (e value: 2.0e-92; % identity: 100)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

XP_022135563.1 uncharacterized protein LOC111007486 isoform X2 [Momordica charantia] (e value: 1.4e-90; % identity: 99.47)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLV QATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BVG8 uncharacterized protein LOC103493930 (e value: 6.9e-75; % identity: 86.1)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNES NNSN  +NETSSNSVRLTRIKLLRPADMLVLGQVYRLIT+QEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GLSAKK AKV ++ LEAA+KP+RRKER    SD+AA RS  ED + QA KHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A5A7UT51 DUF4228 domain-containing protein (e value: 6.9e-75; % identity: 86.1)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNES NNSN  +NETSSNSVRLTRIKLLRPADMLVLGQVYRLIT+QEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GLSAKK AKV ++ LEAA+KP+RRKER    SD+AA RS  ED + QA KHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A5D3CU41 DUF4228 domain-containing protein (e value: 6.9e-75; % identity: 86.1)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNES NNSN  +NETSSNSVRLTRIKLLRPADMLVLGQVYRLIT+QEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GLSAKK AKV ++ LEAA+KP+RRKER    SD+AA RS  ED + QA KHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A6J1C138 uncharacterized protein LOC111007486 isoform X1 (e value: 9.6e-93; % identity: 100)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

A0A6J1C554 uncharacterized protein LOC111007486 isoform X2 (e value: 6.8e-91; % identity: 99.47)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLV QATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G10530.1 unknown protein (e value: 9.3e-24; % identity: 39.15)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN--NSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQ
        MGNCQA++AA LV+QHP G +D+ Y  V+  E+M M PGHYV+L+I     +  E  N   +   +++    +VR TR++LLRP + LVLG  YRLITSQ
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN--NSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQ

Query:  EVMRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        EVM+ L  KK AK  K+ +E             T +   + +  PE         +K  +       +++  +S+TW+PSL SISEA S
Subjt:  EVMRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT1G60010.1 unknown protein (e value: 3.5e-31; % identity: 46.32)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN---NSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITS
        MGNCQA+DAA LV+QHP GK+D+ Y PV+  EIM+M PGHYV+L+I      P    N    +   ++++    VR TR+KLLRP + LVLG  YRLITS
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNN---NSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITS

Query:  QEVMRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        QEVM+ L AKK+AK  K+  E ++  E++K  +    D  + +++        TK EK    + S  T SA++RS+TW+PSL SISEA S
Subjt:  QEVMRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G50090.1 unknown protein (e value: 3.5e-31; % identity: 47.06)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQA+D A +VIQHP+GK +KL  PV+A  +MKMNPGH V+LLISTT  +   S +             +RLTRIKLLRP D LVLG VYRLIT++EV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GL AKK +K+ K    + +K E  K   +T  D+       ED +Q   + ++ +R             SR+WQPSL SISE GS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G50090.2 unknown protein (e value: 6.7e-30; % identity: 45.45)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQA+D A +VIQHP+GK +KL  PV+A  +MKMNPGH V+LLISTT  +   S +             +RLTRIKLLRP D LVLG VYRLIT++EV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+GL AKK +K+ K    + +K E  K   +T  D+  +  E   +                         SR+WQPSL SISE GS
Subjt:  MRGLSAKKHAKVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS

AT5G62900.1 unknown protein (e value: 1.7e-25; % identity: 41.67)
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV
        MGNCQA +AAT VIQ P GK  + Y  V A E++K +PGH+VALL+S+ +                    S+R+TRIKLLRP+D L+LG VYRLI+S+EV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEV

Query:  MRGLSAKKHAKVNK-----NLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
        M+G+ AKK  K+ K     ++ E    P   +  +A+  D+            Q   HEK    R   +T  AT + R WQPSL SISE+ S
Subjt:  MRGLSAKKHAKVNK-----NLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS


Sequences
CDS sequence
ATGGGAAATTGCCAAGCCATTGATGCTGCAACACTTGTGATACAACATCCAAGTGGGAAAGTGGACAAATTGTATTGGCCTGTGACTGCTAGAGAGATCATGAAGATGAA
TCCTGGTCACTATGTTGCTCTTCTCATATCCACCACCATGTTTACACCAAATGAAAGTAACAACAACAGCAACAACAACAACAATGAAACCAGCAGTAATTCAGTTCGTT
TAACTCGAATCAAGCTTCTCCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGGCTCATCACCTCTCAAGAGGTTATGAGAGGTTTATCAGCAAAGAAACACGCA
AAGGTTAATAAAAACCTGTTAGAAGCAGCAGAGAAGCCAGAGAGGAGGAAAGAACGTGCAGCCACAGGGTCAGATTCAGCGGCCAGAAGATCTGAACCTGAAGACCTTGT
TCAGCAGGCGACCAAACATGAGAAAAACAATAGACCAAGGACAAGTACATCGACAACCTCGGCCACAGCCAGGTCAAGAACATGGCAACCTTCATTACATAGCATCTCAG
AAGCTGGAAGC
mRNA sequence
ATGGGAAATTGCCAAGCCATTGATGCTGCAACACTTGTGATACAACATCCAAGTGGGAAAGTGGACAAATTGTATTGGCCTGTGACTGCTAGAGAGATCATGAAGATGAA
TCCTGGTCACTATGTTGCTCTTCTCATATCCACCACCATGTTTACACCAAATGAAAGTAACAACAACAGCAACAACAACAACAATGAAACCAGCAGTAATTCAGTTCGTT
TAACTCGAATCAAGCTTCTCCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGGCTCATCACCTCTCAAGAGGTTATGAGAGGTTTATCAGCAAAGAAACACGCA
AAGGTTAATAAAAACCTGTTAGAAGCAGCAGAGAAGCCAGAGAGGAGGAAAGAACGTGCAGCCACAGGGTCAGATTCAGCGGCCAGAAGATCTGAACCTGAAGACCTTGT
TCAGCAGGCGACCAAACATGAGAAAAACAATAGACCAAGGACAAGTACATCGACAACCTCGGCCACAGCCAGGTCAAGAACATGGCAACCTTCATTACATAGCATCTCAG
AAGCTGGAAGC
Protein sequence
MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNSNNNNNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKHA
KVNKNLLEAAEKPERRKERAATGSDSAARRSEPEDLVQQATKHEKNNRPRTSTSTTSATARSRTWQPSLHSISEAGS
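
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code and a CDS that starts in frame; the `translate` helper is hypothetical and only the first 30 nucleotides of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence; stop codons become '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nt of the CDS sequence listed in this record.
cds_prefix = "ATGGGAAATTGCCAAGCCATTGATGCTGCA"
print(translate(cds_prefix))  # MGNCQAIDAA
```

The output matches the first ten residues of the protein sequence listed above. Passing the full CDS (with its line breaks) to the same function would allow the whole protein to be compared.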