CuGenDBv2

MS020378 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS020378
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Hth-type transcriptional regulator
Genome location: scaffold211:64400..64777
RNA-Seq Expression: MS020378
Synteny: MS020378
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits
KAA0045798.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa] (e value: 1.3e-31, identity: 83.13%)
Query:  MDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSALRN
        MDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P   AH+PLSLPDLCN AIKASSALRN
Subjt:  MDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSALRN

KAE8651930.1 hypothetical protein Csa_006329 [Cucumis sativus] (e value: 9.4e-38, identity: 66.67%)
Query:  MGVCVSTQT--TNSSRRGITV----SQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRC
        MGVC STQT  T + + G+       Q + +    +IK+VHMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P  
Subjt:  MGVCVSTQT--TNSSRRGITV----SQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRC

Query:  QAHTPLSLPDLCNLAIKASSALRNHRAASLFR
         AH+PLSLPDLCN AIKASSALRN + +SLFR
Subjt:  QAHTPLSLPDLCNLAIKASSALRNHRAASLFR

KAG6571421.1 hypothetical protein SDJN03_30336, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.8e-31, identity: 63.03%)
Query:  MGVCVSTQTTNSS---RRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAH
        MGVC ST T   S   +RG+ + Q +  +  S IK++HMDG ++EF+DPIKASKI S NPN LLCNS++M IGS VP+LS DE+LQLGQIYFL+P   A 
Subjt:  MGVCVSTQTTNSS---RRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAH

Query:  TPLSLPDLCNLAIKASSAL
        +PLSLPDLCN AIKASSAL
Subjt:  TPLSLPDLCNLAIKASSAL

XP_008457799.2 PREDICTED: uncharacterized protein LOC103497401 [Cucumis melo] (e value: 4.6e-37, identity: 66.93%)
Query:  MGVCVSTQT--TNSSRRGITVS-------QLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLI
        MGVC STQT  T + + GI  S       Q   +    +IK++HMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+
Subjt:  MGVCVSTQT--TNSSRRGITVS-------QLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLI

Query:  PRCQAHTPLSLPDLCNLAIKASSALRN
        P   AH+PLSLPDLCN AIKASSALRN
Subjt:  PRCQAHTPLSLPDLCNLAIKASSALRN

XP_022158695.1 uncharacterized protein LOC111025158 [Momordica charantia] (e value: 3.8e-63, identity: 100%)
Query:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
        MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
Subjt:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL

Query:  SLPDLCNLAIKASSALRNHRAASLFR
        SLPDLCNLAIKASSALRNHRAASLFR
Subjt:  SLPDLCNLAIKASSALRNHRAASLFR

TrEMBL top hits
A0A0A0LJJ9 Uncharacterized protein (e value: 4.5e-38, identity: 66.67%)
Query:  MGVCVSTQT--TNSSRRGITV----SQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRC
        MGVC STQT  T + + G+       Q + +    +IK+VHMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P  
Subjt:  MGVCVSTQT--TNSSRRGITV----SQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRC

Query:  QAHTPLSLPDLCNLAIKASSALRNHRAASLFR
         AH+PLSLPDLCN AIKASSALRN + +SLFR
Subjt:  QAHTPLSLPDLCNLAIKASSALRNHRAASLFR

A0A1S3C6C0 uncharacterized protein LOC103497401 (e value: 2.2e-37, identity: 66.93%)
Query:  MGVCVSTQT--TNSSRRGITVS-------QLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLI
        MGVC STQT  T + + GI  S       Q   +    +IK++HMDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+
Subjt:  MGVCVSTQT--TNSSRRGITVS-------QLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLI

Query:  PRCQAHTPLSLPDLCNLAIKASSALRN
        P   AH+PLSLPDLCN AIKASSALRN
Subjt:  PRCQAHTPLSLPDLCNLAIKASSALRN

A0A2C9UTH4 Uncharacterized protein (e value: 1.5e-25, identity: 53.39%)
Query:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
        MG+C S+Q+ NS +             +STIK+V  DG ++E   PIKAS I ++NPNF LC+SE M++G CVP +S DE+LQLGQIYFL+P  QAH PL
Subjt:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL

Query:  SLPDLCNLAIKASSALRN
         LPDLC LA  ASS++ N
Subjt:  SLPDLCNLAIKASSALRN

A0A5A7TSG7 DUF4228 domain-containing protein (e value: 6.3e-32, identity: 83.13%)
Query:  MDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSALRN
        MDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P   AH+PLSLPDLCN AIKASSALRN
Subjt:  MDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSALRN

A0A6J1DWJ0 uncharacterized protein LOC111025158 (e value: 1.8e-63, identity: 100%)
Query:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
        MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
Subjt:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL

Query:  SLPDLCNLAIKASSALRNHRAASLFR
        SLPDLCNLAIKASSALRNHRAASLFR
Subjt:  SLPDLCNLAIKASSALRNHRAASLFR

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G76600.1 unknown protein (e value: 4.2e-12, identity: 32.54%)
Query:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKI----------ASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFL
        MG+CVS            V++ E +   +T KIV ++G + E+  P+ AS++          +S + ++ LCNS+ +     +PA+  DE LQ  QIYF+
Subjt:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKI----------ASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFL

Query:  IPRCQAHTPLSLPDLCNLAIKASSAL
        +P  +    LS  D+  LA+KAS A+
Subjt:  IPRCQAHTPLSLPDLCNLAIKASSAL

AT2G23690.1 unknown protein (e value: 2.4e-15, identity: 35.34%)
Query:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
        MG+C S ++T                + +T K++  DG + EF  P+K   +  +NP   +CNS+ M   + V A+S DE+ QLGQ+YF +P    H  L
Subjt:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL

Query:  SLPDLCNLAIKASSAL
           ++  LA+KASSAL
Subjt:  SLPDLCNLAIKASSAL

AT3G50800.1 unknown protein (e value: 5.3e-15, identity: 40%)
Query:  RRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSAL
        R  T K++  DG ++EF+ P+K  +I  +NP   +CNS+ M     V A+ G EDL+ G++YF++P    + PL   ++  LA+KASSAL
Subjt:  RRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSAL

AT4G37240.1 unknown protein (e value: 4.4e-17, identity: 37.93%)
Query:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL
        MG+C S+++T                + +T K++  DG + EFA+P+K   +  + P   +CNS+ M     V A+S DE+LQLGQIYF +P C    PL
Subjt:  MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPL

Query:  SLPDLCNLAIKASSAL
           ++  LA+KASSAL
Subjt:  SLPDLCNLAIKASSAL

AT5G66580.1 unknown protein (e value: 6.3e-16, identity: 38.89%)
Query:  RRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSAL
        R  + K++ +DG ++EF+ P+K  +I  +NP   +CNS++M     V A++G+E+L+ GQ+YF++P    + PL   ++  LA+KASSAL
Subjt:  RRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSAL
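
The %identity figures listed for the hits above can be sanity-checked against the alignment text. The following is a minimal sketch, assuming the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap). The function name `percent_identity` is hypothetical and not part of CuGenDB; the two strings are the Query and midline rows of the KAA0045798.1 hit.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over the aligned length, read from a BLAST-style midline.

    Assumption: letters in the midline mark identical residues; '+' and
    spaces do not count. The aligned length is taken from the query row
    (gap characters included), so a midline whose trailing spaces were
    trimmed during copying still gives the same result.
    """
    identities = sum(1 for ch in midline[:len(query)] if ch.isalpha())
    return 100.0 * identities / len(query)


# Query and midline rows of the KAA0045798.1 alignment.
QUERY = "MDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAIKASSALRN"
MIDLINE = "MDG IEEF+DPIKASKI SRNPNF LCNSEQMLIGSCVP+LS DE+LQ+GQIYFL+P   AH+PLSLPDLCN AIKASSALRN"

# 69 identical residues over 83 aligned positions.
print(round(percent_identity(QUERY, MIDLINE), 2))  # 83.13
```

The result agrees with the identity listed for that hit. The same function could be applied to multi-row alignments by concatenating the Query rows and the midline rows before calling it.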


Sequences
CDS sequence
ATGGGCGTCTGCGTTTCCACTCAAACCACGAATTCTTCAAGGCGCGGAATCACAGTGTCGCAACTGGAATCAATCCACCGCCGATCCACGATCAAGATCGTTCACATGGA
CGGATTAATCGAGGAGTTCGCGGATCCGATCAAAGCCTCCAAAATCGCCTCCCGGAACCCTAACTTCCTCCTCTGCAATTCCGAGCAAATGTTGATCGGCAGCTGTGTCC
CGGCGCTCTCCGGCGACGAGGACCTCCAGCTCGGCCAAATCTACTTCCTCATTCCCCGCTGTCAGGCTCACACCCCTCTCTCCCTCCCCGACCTCTGCAATCTCGCCATT
AAAGCCAGCTCTGCTCTTCGCAACCACCGCGCCGCGTCGCTGTTCAGG
mRNA sequence
ATGGGCGTCTGCGTTTCCACTCAAACCACGAATTCTTCAAGGCGCGGAATCACAGTGTCGCAACTGGAATCAATCCACCGCCGATCCACGATCAAGATCGTTCACATGGA
CGGATTAATCGAGGAGTTCGCGGATCCGATCAAAGCCTCCAAAATCGCCTCCCGGAACCCTAACTTCCTCCTCTGCAATTCCGAGCAAATGTTGATCGGCAGCTGTGTCC
CGGCGCTCTCCGGCGACGAGGACCTCCAGCTCGGCCAAATCTACTTCCTCATTCCCCGCTGTCAGGCTCACACCCCTCTCTCCCTCCCCGACCTCTGCAATCTCGCCATT
AAAGCCAGCTCTGCTCTTCGCAACCACCGCGCCGCGTCGCTGTTCAGG
Protein sequence
MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAI
KASSALRNHRAASLFR
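
The three sequences and the genome location can be cross-checked against each other. The sketch below is illustrative only: it assumes the standard genetic code (NCBI translation table 1) and that the coordinates `64400..64777` are 1-based and inclusive. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# CDS and protein sequences as listed for MS020378.
CDS = (
    "ATGGGCGTCTGCGTTTCCACTCAAACCACGAATTCTTCAAGGCGCGGAATCACAGTGTCGCAACTGGAATCAATCCACCGCCGATCCACGATCAAGATCGTTCACATGGA"
    "CGGATTAATCGAGGAGTTCGCGGATCCGATCAAAGCCTCCAAAATCGCCTCCCGGAACCCTAACTTCCTCCTCTGCAATTCCGAGCAAATGTTGATCGGCAGCTGTGTCC"
    "CGGCGCTCTCCGGCGACGAGGACCTCCAGCTCGGCCAAATCTACTTCCTCATTCCCCGCTGTCAGGCTCACACCCCTCTCTCCCTCCCCGACCTCTGCAATCTCGCCATT"
    "AAAGCCAGCTCTGCTCTTCGCAACCACCGCGCCGCGTCGCTGTTCAGG"
)
PROTEIN = (
    "MGVCVSTQTTNSSRRGITVSQLESIHRRSTIKIVHMDGLIEEFADPIKASKIASRNPNFLLCNSEQMLIGSCVPALSGDEDLQLGQIYFLIPRCQAHTPLSLPDLCNLAI"
    "KASSALRNHRAASLFR"
)


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, ignoring any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Genome location scaffold211:64400..64777, assumed 1-based inclusive.
start, end = 64400, 64777
print(end - start + 1 == len(CDS))   # True: the span and the CDS are both 378 nt
print(translate(CDS) == PROTEIN)     # True: 126 codons give the listed protein
```

Under these assumptions the genomic span, the CDS and the mRNA all have the same length, which is consistent with a single-exon gene model without annotated UTRs. The listed CDS ends on the codon for the final arginine and does not include a stop codon.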