; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS008208 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS008208
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDNA binding protein
Genome locationscaffold45:203439..204140
RNA-Seq ExpressionMS008208
SyntenyMS008208
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052154.1 DNA binding protein [Cucumis melo var. makuwa]1.6e-7872.49Show/hide
Query:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL
        MNR   +L  +  SHCQECG SQS CWILH VR KA+FRRLCTNCVLK+NLS FCP+CFDVY+DS+PPPSH RVMC+RCPSISH SCVS  FSSTFLC L
Subjt:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL

Query:  CSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS
        CSDPRF FFDGF S  +L QS S       + VD KS +AIVAAARV+AQSMRRAA DARA AE KIKNAAFAKKQATLALE+LA+LVLQEKD+NG  KS
Subjt:  CSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS

Query:  NNGDAVAGERTVEES--KLQDKEVTAILE
         NGDAVA ER VEE   KL +KEVTAI +
Subjt:  NNGDAVAGERTVEES--KLQDKEVTAILE

KAE8650412.1 hypothetical protein Csa_010702 [Cucumis sativus]3.5e-7872.61Show/hide
Query:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS
        MNR H N L  +  SHCQ+CG SQS CWILH VR KA+FRRLCTNCVLK+NLS FCP+CFDVY+DS+PPPSH RVMCFRCPSISH SCVS  FSSTFLC 
Subjt:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS

Query:  LCSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVK
        LCSDPRF FFDGF S  +L QS S       + VD KS +AIVAAARV+AQSMRRAA DARA AE KIKNAAFAKKQATLALE+LA+LVLQEKD+NG  K
Subjt:  LCSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVK

Query:  SNNGDAVAGERTVEES--KLQDKEVTAILE
        S NGDAV  ER VEE   KLQ+K+VTAI +
Subjt:  SNNGDAVAGERTVEES--KLQDKEVTAILE

XP_022137907.1 uncharacterized protein LOC111009209 [Momordica charantia]9.3e-12498.72Show/hide
Query:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL
        MNRPHRNLLLHPNSHCQECGTSQSHCWILH VRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS 
Subjt:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL

Query:  CSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKSN
        CSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQE+DRNGCVKSN
Subjt:  CSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKSN

Query:  NGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
        NGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
Subjt:  NGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ

XP_022955770.1 uncharacterized protein LOC111457661 [Cucurbita moschata]4.5e-7871.06Show/hide
Query:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS
        MNRPH N L  +  S+C ECG SQS CWILH VR KASFRRLCTNCVLKNNLS FCP+CFD+YDDS+PP SHQRVMCFRCPSISH SC S  FSSTFLC 
Subjt:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS

Query:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS
        +CSDP F FFDGF S  L QS  A  +   R  D KSA+AIVAAARVAAQSMRRAAADARA AE KI+NA FAKKQATLALERLAFLVLQEKDRNG  K+
Subjt:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS

Query:  NNGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
        N   A       EE++LQ + VTAILER K N +Q
Subjt:  NNGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ

XP_038902666.1 uncharacterized protein LOC120089301 [Benincasa hispida]1.3e-8576.27Show/hide
Query:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS
        MNR H N L  +  SHCQECG SQS CWILH VRLKASFRRLCTNCVLK+NLS FCP+CFDVYDDS+PPPSHQRVMCFRCPSISH SC S  FSSTFLC 
Subjt:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS

Query:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS
        LCSDP F FFDGF S  L QS S       R VD KSA+AIVAAARV+AQSMRRAA DARA AE KIKNAAFAKKQATLALERLA+LVLQEKDRNG  K+
Subjt:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS

Query:  NNGDAVAGERTVEE--SKLQDKEVTAILERKKWNQS
         NGDAVAGERTVEE  SKLQ+KEVT+I  R K N++
Subjt:  NNGDAVAGERTVEE--SKLQDKEVTAILERKKWNQS

TrEMBL top hitse value%identityAlignment
A0A0A0L5G4 Uncharacterized protein1.7e-7872.61Show/hide
Query:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS
        MNR H N L  +  SHCQ+CG SQS CWILH VR KA+FRRLCTNCVLK+NLS FCP+CFDVY+DS+PPPSH RVMCFRCPSISH SCVS  FSSTFLC 
Subjt:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS

Query:  LCSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVK
        LCSDPRF FFDGF S  +L QS S       + VD KS +AIVAAARV+AQSMRRAA DARA AE KIKNAAFAKKQATLALE+LA+LVLQEKD+NG  K
Subjt:  LCSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVK

Query:  SNNGDAVAGERTVEES--KLQDKEVTAILE
        S NGDAV  ER VEE   KLQ+K+VTAI +
Subjt:  SNNGDAVAGERTVEES--KLQDKEVTAILE

A0A5A7U9X0 DNA binding protein7.5e-7972.49Show/hide
Query:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL
        MNR   +L  +  SHCQECG SQS CWILH VR KA+FRRLCTNCVLK+NLS FCP+CFDVY+DS+PPPSH RVMC+RCPSISH SCVS  FSSTFLC L
Subjt:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL

Query:  CSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS
        CSDPRF FFDGF S  +L QS S       + VD KS +AIVAAARV+AQSMRRAA DARA AE KIKNAAFAKKQATLALE+LA+LVLQEKD+NG  KS
Subjt:  CSDPRFTFFDGFHS-AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS

Query:  NNGDAVAGERTVEES--KLQDKEVTAILE
         NGDAVA ER VEE   KL +KEVTAI +
Subjt:  NNGDAVAGERTVEES--KLQDKEVTAILE

A0A6J1C9K9 uncharacterized protein LOC1110092094.5e-12498.72Show/hide
Query:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL
        MNRPHRNLLLHPNSHCQECGTSQSHCWILH VRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS 
Subjt:  MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSL

Query:  CSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKSN
        CSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQE+DRNGCVKSN
Subjt:  CSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKSN

Query:  NGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
        NGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
Subjt:  NGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ

A0A6J1GVZ7 uncharacterized protein LOC1114576612.2e-7871.06Show/hide
Query:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS
        MNRPH N L  +  S+C ECG SQS CWILH VR KASFRRLCTNCVLKNNLS FCP+CFD+YDDS+PP SHQRVMCFRCPSISH SC S  FSSTFLC 
Subjt:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS

Query:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS
        +CSDP F FFDGF S  L QS  A  +   R  D KSA+AIVAAARVAAQSMRRAAADARA AE KI+NA FAKKQATLALERLAFLVLQEKDRNG  K+
Subjt:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS

Query:  NNGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
        N   A       EE++LQ + VTAILER K N +Q
Subjt:  NNGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ

A0A6J1IXP5 uncharacterized protein LOC1114794684.9e-7870.64Show/hide
Query:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS
        MNRPH N L  +  S+C ECG SQS CWILH VR KASFRRLCTNCVLKNNLS FCP+CFD+YDDS+PP SHQRVMCFRCPSISH SC S  FSSTFLC 
Subjt:  MNRPHRN-LLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCS

Query:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS
        LCSDP F FFDGF S +L QS  A  +   R  D KSA+AIVAAARV AQSMRRAAADARA AE K +NA FAKKQATLALERLA+LVLQEKDRNG  K+
Subjt:  LCSDPRFTFFDGFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKS

Query:  NNGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ
        N   A       EE++LQ + VTAILER K NQ+Q
Subjt:  NNGDAVAGERTVEESKLQDKEVTAILERKKWNQSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G09520.1 LOCATED IN: chloroplast1.8e-2439.01Show/hide
Query:  CQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCF--RCPSISHFSCVSVHFSSTFLCSLCSDPR-FTFFDGF
        C +CG+S +  W++H VRL+AS R  CT+C+L+N+ + FCP CF +YD S  PPS +RV C    C S++H  C       ++LC  C DP  F+FF   
Subjt:  CQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCF--RCPSISHFSCVSVHFSSTFLCSLCSDPR-FTFFDGF

Query:  HSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRN
            + ++GS       R VD   + A + AA++AA SM +A   A+   + + K AA AKK+A  ALE++  L  +EK R+
Subjt:  HSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRN

AT3G17460.1 PHD finger family protein5.2e-1634.25Show/hide
Query:  LHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFS--------------STFLCSLCSDPRFTFFDGFHS
        +H V    +FRRLCT+C+LK     FC VCF+++D++ PP +  R++C  CPS +H SC +   S              S+F C  CS+P FTFF     
Subjt:  LHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFS--------------STFLCSLCSDPRFTFFDGFHS

Query:  AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNG
            +S     + D   +  KSA A+VAA  ++  +M +A A  +  A  KI  A  AK +A  AL  L  +V+++    G
Subjt:  AALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATCGGCCGCACAGGAACCTTCTTCTTCACCCCAATTCTCACTGCCAAGAATGCGGCACCTCTCAATCCCACTGCTGGATCCTCCACCAAGTCCGTCTCAAAGCCTC
CTTCCGCCGTCTCTGCACCAATTGCGTTCTCAAGAACAATCTCTCCGGTTTCTGCCCCGTTTGCTTCGATGTTTACGACGATTCCTCTCCCCCGCCCTCTCACCAGCGAG
TTATGTGCTTCAGATGCCCTTCAATCTCCCACTTCTCCTGCGTTTCCGTCCATTTTTCTTCTACCTTCTTGTGCTCTCTCTGCTCTGATCCTCGTTTTACCTTCTTCGAC
GGTTTCCACTCCGCCGCCCTCTCCCAGTCCGGCTCGGCCGCCGTTCTGGCTGACAACAGAGTCGTTGATTGTAAATCGGCTAGGGCGATCGTCGCTGCCGCACGTGTCGC
TGCTCAATCTATGCGGAGAGCGGCGGCTGACGCTAGGGCTGCGGCGGAGACGAAGATCAAGAATGCTGCATTTGCCAAGAAGCAGGCTACTCTCGCATTGGAGCGACTGG
CTTTTCTCGTGCTACAGGAGAAGGACAGAAACGGATGCGTGAAGAGTAATAATGGAGATGCTGTTGCAGGTGAGAGGACGGTGGAAGAATCCAAGCTACAGGACAAGGAG
GTAACAGCCATTCTCGAGCGGAAGAAGTGGAATCAGAGTCAG
mRNA sequenceShow/hide mRNA sequence
ATGAATCGGCCGCACAGGAACCTTCTTCTTCACCCCAATTCTCACTGCCAAGAATGCGGCACCTCTCAATCCCACTGCTGGATCCTCCACCAAGTCCGTCTCAAAGCCTC
CTTCCGCCGTCTCTGCACCAATTGCGTTCTCAAGAACAATCTCTCCGGTTTCTGCCCCGTTTGCTTCGATGTTTACGACGATTCCTCTCCCCCGCCCTCTCACCAGCGAG
TTATGTGCTTCAGATGCCCTTCAATCTCCCACTTCTCCTGCGTTTCCGTCCATTTTTCTTCTACCTTCTTGTGCTCTCTCTGCTCTGATCCTCGTTTTACCTTCTTCGAC
GGTTTCCACTCCGCCGCCCTCTCCCAGTCCGGCTCGGCCGCCGTTCTGGCTGACAACAGAGTCGTTGATTGTAAATCGGCTAGGGCGATCGTCGCTGCCGCACGTGTCGC
TGCTCAATCTATGCGGAGAGCGGCGGCTGACGCTAGGGCTGCGGCGGAGACGAAGATCAAGAATGCTGCATTTGCCAAGAAGCAGGCTACTCTCGCATTGGAGCGACTGG
CTTTTCTCGTGCTACAGGAGAAGGACAGAAACGGATGCGTGAAGAGTAATAATGGAGATGCTGTTGCAGGTGAGAGGACGGTGGAAGAATCCAAGCTACAGGACAAGGAG
GTAACAGCCATTCTCGAGCGGAAGAAGTGGAATCAGAGTCAG
Protein sequenceShow/hide protein sequence
MNRPHRNLLLHPNSHCQECGTSQSHCWILHQVRLKASFRRLCTNCVLKNNLSGFCPVCFDVYDDSSPPPSHQRVMCFRCPSISHFSCVSVHFSSTFLCSLCSDPRFTFFD
GFHSAALSQSGSAAVLADNRVVDCKSARAIVAAARVAAQSMRRAAADARAAAETKIKNAAFAKKQATLALERLAFLVLQEKDRNGCVKSNNGDAVAGERTVEESKLQDKE
VTAILERKKWNQSQ