; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS012847 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS012847
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold63:3928640..3929242
RNA-Seq ExpressionMS012847
SyntenyMS012847
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577133.1 hypothetical protein SDJN03_24707, partial [Cucurbita argyrosperma subsp. sororia]3.6e-7676.35Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKG+  +CEKPIRILTDSGNIINFHG KQV QILK+YPP  YGVFRRPNLSSPLP S  LDAGKSYFLLPLSRAAE+   D           AAE
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL
        DL  GSG+EVLP GGDG+WRVKLVIDTKQL EILAEE NTEALIERMRAAAA AA +SPRR KIGGWKP  GNW K  PID+G N+KAQIKDF+SGNGCL
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL

Query:  NAT
         AT
Subjt:  NAT

XP_022136823.1 uncharacterized protein LOC111008428 [Momordica charantia]5.6e-109100Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNA
        DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNA
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNA

Query:  T
        T
Subjt:  T

XP_022985422.1 uncharacterized protein LOC111483432 [Cucurbita maxima]2.8e-7675.86Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKG+  +CEKPIRILTDSGNIINFHG KQV QILK+YPP  YGVFRRPNLSSPLP S PLDAGKSYFLLPLSRAAE+   D           AA 
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL
        DL  GSG+EVLP GGDG+WRVKLVIDTKQL EILAE+ NTEALIERMRAAAA AA +SPRR KIGGWKP  GNW K FPID+G N+KAQ+KDF+SGNGCL
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL

Query:  NAT
         AT
Subjt:  NAT

XP_023522023.1 uncharacterized protein LOC111785895 [Cucurbita pepo subsp. pepo]4.3e-7776.85Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKG+  +CEKPIRILTDSGNIINFHG KQV QILK+YPP  YGVFRRPNLSSPLP S PLDAGKSYFLLPLSRAAE+   D           AAE
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLGN-SKAQIKDFNSGNGCL
        DL  GSG+EVLP GGDG+WRVKLVIDTKQL EILAEE NTEALIERMRAAAA AA +SPRR KIGGWKP  GNW K  PID+GN +KAQIKDF+SGNGCL
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLGN-SKAQIKDFNSGNGCL

Query:  NAT
         AT
Subjt:  NAT

XP_038907067.1 uncharacterized protein LOC120092893 [Benincasa hispida]3.6e-7676.88Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKGMT +CEKPIRILTDSGNIINFHG KQVHQIL +YPP  YGVFRRPNLSSPLP S PLDAGKSYFLLPLSR A++V  DDG  P        +
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL
        +LG GSG+EVLPAGG+GVWRVKLVIDTKQL EILAEE NTEALIERMRAAAA   ESP+R KIGGWK   GN LK FPID+G N+KAQIKDF++GNGCL
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL

TrEMBL top hitse value%identityAlignment
A0A0A0KYF7 Uncharacterized protein1.6e-6974.23Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKGM  +CEKPIRILTDSG+IINFHG KQVHQIL +YPP  YGVFRRPNLSSPLP S PLDAGKSYFLLPLS++      +DG +P       ++
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLGNS-KAQIKDFN
        D+G  SG+EVLPAGG+GVWRVKLVIDTKQL EILAEE NTEALIERMRAAAA AA +SPRR KIGGWKP+ GNW K FPID+GNS KAQ+K FN
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLGNS-KAQIKDFN

A0A5A7TTS6 Uncharacterized protein1.8e-6872.31Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKGM  +C KPIRILTDSG+IINFHG KQVHQIL +YPP  YGVFRRPNLSSPLP S PLDAGKSYFLLPLS+            PS+       
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNS
        DLG  SG+EVLPA G+GVWRVKLVIDTKQL EILAEE NTEALIER+RAAAA AA +SPRR KI GWKP+ GNWLK FP+D G N+KAQIK+FNS
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNS

A0A6J1C8K7 uncharacterized protein LOC1110084282.7e-109100Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNA
        DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNA
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNA

Query:  T
        T
Subjt:  T

A0A6J1EZE8 uncharacterized protein LOC1114376145.1e-7675.86Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKG+  +CEKPIRILTDSGNIINFHG KQV QILK+YPP  YGVFRRPNLSSPLP S  LDAGKSYFLLPLSRA E+   D           AAE
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL
        DL  GSG+EVLP GGDG+WRVKLVIDTKQL EILAEE NTEALIERMRAAAA AA +SPRR KIGGWKP  GNW K  PID+G N+KAQIKDF+SGNGCL
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL

Query:  NAT
         AT
Subjt:  NAT

A0A6J1J4V0 uncharacterized protein LOC1114834321.3e-7675.86Show/hide
Query:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE
        MGNCSLKG+  +CEKPIRILTDSGNIINFHG KQV QILK+YPP  YGVFRRPNLSSPLP S PLDAGKSYFLLPLSRAAE+   D           AA 
Subjt:  MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAE

Query:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL
        DL  GSG+EVLP GGDG+WRVKLVIDTKQL EILAE+ NTEALIERMRAAAA AA +SPRR KIGGWKP  GNW K FPID+G N+KAQ+KDF+SGNGCL
Subjt:  DLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAA-AATESPRRAKIGGWKPVLGNWLKIFPIDLG-NSKAQIKDFNSGNGCL

Query:  NAT
         AT
Subjt:  NAT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G61920.1 unknown protein8.0e-1332.91Show/hide
Query:  MGNCSLKG------MTTECEKPIRILTDSGNIINFHGAKQVHQILKDYP-PAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSR
        MGNC  KG      +  + +  I+++T +G ++  H       I  ++P    +      + S PL +   L  G  Y+LLPLS +A    + D  +  +
Subjt:  MGNCSLKG------MTTECEKPIRILTDSGNIINFHGAKQVHQILKDYP-PAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSR

Query:  RSGAAAEDLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAA
         S       G    +  L  GG GVW+V+LVI  +QL EILAE+V TEAL+E +R  A
Subjt:  RSGAAAEDLGGGSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAA

AT4G10910.1 unknown protein5.0e-0752.63Show/hide
Query:  GSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRA
        G  ++V P   +GVW+ K+VI +KQLEEILA E NT ALI+++R AAA A  S   A
Subjt:  GSGVEVLPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACTGTTCTCTCAAAGGAATGACCACCGAATGCGAGAAGCCCATCAGAATCTTAACCGATTCTGGAAACATAATCAATTTCCATGGCGCAAAGCAAGTCCATCA
AATCCTCAAGGATTACCCGCCCGCCGCCTACGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCGCTGCCCAATTCGGCGCCCCTCGACGCCGGAAAATCCTACTTTCTCC
TCCCGCTTTCCCGAGCCGCCGAGAGAGTTCCGAAGGACGACGGAGGGGCGCCGTCGAGGAGGTCCGGCGCGGCGGCGGAGGATCTGGGAGGTGGGTCGGGGGTGGAGGTG
CTTCCGGCGGGTGGCGACGGCGTTTGGAGGGTGAAATTGGTGATAGATACGAAACAATTGGAGGAGATTTTGGCGGAGGAAGTGAACACGGAGGCGTTGATTGAGAGAAT
GAGGGCAGCGGCGGCGGCGGCGACGGAAAGTCCACGGCGGGCGAAGATCGGAGGGTGGAAGCCGGTGCTGGGGAATTGGCTGAAGATTTTTCCAATTGATTTGGGTAACA
GTAAAGCACAAATTAAGGATTTTAATTCTGGAAATGGGTGTTTAAATGCAACA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACTGTTCTCTCAAAGGAATGACCACCGAATGCGAGAAGCCCATCAGAATCTTAACCGATTCTGGAAACATAATCAATTTCCATGGCGCAAAGCAAGTCCATCA
AATCCTCAAGGATTACCCGCCCGCCGCCTACGGCGTTTTCCGGCGCCCCAATCTCTCTTCGCCGCTGCCCAATTCGGCGCCCCTCGACGCCGGAAAATCCTACTTTCTCC
TCCCGCTTTCCCGAGCCGCCGAGAGAGTTCCGAAGGACGACGGAGGGGCGCCGTCGAGGAGGTCCGGCGCGGCGGCGGAGGATCTGGGAGGTGGGTCGGGGGTGGAGGTG
CTTCCGGCGGGTGGCGACGGCGTTTGGAGGGTGAAATTGGTGATAGATACGAAACAATTGGAGGAGATTTTGGCGGAGGAAGTGAACACGGAGGCGTTGATTGAGAGAAT
GAGGGCAGCGGCGGCGGCGGCGACGGAAAGTCCACGGCGGGCGAAGATCGGAGGGTGGAAGCCGGTGCTGGGGAATTGGCTGAAGATTTTTCCAATTGATTTGGGTAACA
GTAAAGCACAAATTAAGGATTTTAATTCTGGAAATGGGTGTTTAAATGCAACA
Protein sequenceShow/hide protein sequence
MGNCSLKGMTTECEKPIRILTDSGNIINFHGAKQVHQILKDYPPAAYGVFRRPNLSSPLPNSAPLDAGKSYFLLPLSRAAERVPKDDGGAPSRRSGAAAEDLGGGSGVEV
LPAGGDGVWRVKLVIDTKQLEEILAEEVNTEALIERMRAAAAAATESPRRAKIGGWKPVLGNWLKIFPIDLGNSKAQIKDFNSGNGCLNAT