; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021779 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021779
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein canopy-1 isoform X1
Genome locationscaffold1:299012..301154
RNA-Seq ExpressionMS021779
SyntenyMS021779
Gene Ontology termsNA
InterPro domainsIPR021852 - Domain of unknown function DUF3456


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452994.1 PREDICTED: protein canopy-1 isoform X1 [Cucumis melo]4.5e-6982.14Show/hide
Query:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ
        TMKFNAW LLLLVIYSG V+CIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQ
Subjt:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ

Query:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS
        WIKVD+WDNLTN+QEARAYSKDISTYCGR    +      LAELIKKGSV  GDVSKVLCHDLSRHC+
Subjt:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS

XP_022141591.1 protein seele isoform X1 [Momordica charantia]4.2e-7588.17Show/hide
Query:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
        MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
Subjt:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW

Query:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA
        IKVDSWDNLTNQQEARAYSKDISTYCGR    +      LAELIKKGSVREGDVSKVLCHDLSRHCSNA
Subjt:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA

XP_022141601.1 protein seele isoform X2 [Momordica charantia]4.2e-7588.17Show/hide
Query:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
        MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
Subjt:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW

Query:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA
        IKVDSWDNLTNQQEARAYSKDISTYCGR    +      LAELIKKGSVREGDVSKVLCHDLSRHCSNA
Subjt:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA

XP_038896985.1 protein canopy-1 isoform X1 [Benincasa hispida]2.0e-6982.35Show/hide
Query:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ
        TMK NAW LLLLVIYSG V+CIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQ
Subjt:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ

Query:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA
        WIKVD+WDNLTN+QEARAYSKDISTYCGR    +      LAELIKKGSVR GDVSKVLCHDLSRHCS A
Subjt:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA

XP_038896987.1 protein canopy-1 isoform X2 [Benincasa hispida]2.0e-6982.35Show/hide
Query:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ
        TMK NAW LLLLVIYSG V+CIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQ
Subjt:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ

Query:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA
        WIKVD+WDNLTN+QEARAYSKDISTYCGR    +      LAELIKKGSVR GDVSKVLCHDLSRHCS A
Subjt:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA

TrEMBL top hitse value%identityAlignment
A0A1S3BUL2 protein canopy-1 isoform X22.2e-6982.14Show/hide
Query:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ
        TMKFNAW LLLLVIYSG V+CIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQ
Subjt:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ

Query:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS
        WIKVD+WDNLTN+QEARAYSKDISTYCGR    +      LAELIKKGSV  GDVSKVLCHDLSRHC+
Subjt:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS

A0A1S3BWB0 protein canopy-1 isoform X12.2e-6982.14Show/hide
Query:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ
        TMKFNAW LLLLVIYSG V+CIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQ
Subjt:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ

Query:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS
        WIKVD+WDNLTN+QEARAYSKDISTYCGR    +      LAELIKKGSV  GDVSKVLCHDLSRHC+
Subjt:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS

A0A5D3D8N5 Protein canopy-1 isoform X12.2e-6982.14Show/hide
Query:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ
        TMKFNAW LLLLVIYSG V+CIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKT S+GQQ
Subjt:  TMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQ

Query:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS
        WIKVD+WDNLTN+QEARAYSKDISTYCGR    +      LAELIKKGSV  GDVSKVLCHDLSRHC+
Subjt:  WIKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCS

A0A6J1CJM8 protein seele isoform X12.0e-7588.17Show/hide
Query:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
        MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
Subjt:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW

Query:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA
        IKVDSWDNLTNQQEARAYSKDISTYCGR    +      LAELIKKGSVREGDVSKVLCHDLSRHCSNA
Subjt:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA

A0A6J1CJN9 protein seele isoform X22.0e-7588.17Show/hide
Query:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
        MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA         EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW
Subjt:  MKFNAWGLLLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQW

Query:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA
        IKVDSWDNLTNQQEARAYSKDISTYCGR    +      LAELIKKGSVREGDVSKVLCHDLSRHCSNA
Subjt:  IKVDSWDNLTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA

SwissProt top hitse value%identityAlignment
Q5HZV5 Protein canopy homolog 34.1e-0427.88Show/hide
Query:  WGLLLLVIYSGAVNCID-------DKCAACNAVAEKPRNHLD----MRHRLDSK-----GQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQ
        W LLLL ++ G     D        KC  C  VA + ++  D     R  +D++       +K K I Y  S++R++E+ +GLC ++ +Y + K  S   
Subjt:  WGLLLLVIYSGAVNCID-------DKCAACNAVAEKPRNHLD----MRHRLDSK-----GQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQ

Query:  QWIK
        ++ K
Subjt:  QWIK

Arabidopsis top hitse value%identityAlignment
AT1G42480.1 unknown protein4.0e-4757.59Show/hide
Query:  LLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDN
        +++ I+S   + +DDKCAACNAVA         EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K     +QW+KV ++DN
Subjt:  LLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDN

Query:  LTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHC
        LTN+QEA+A++ DISTYCGR    +     +L E+IK GS++ G+V KVLC  LS HC
Subjt:  LTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHC

AT1G42480.2 unknown protein1.5e-3060.19Show/hide
Query:  LLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDN
        +++ I+S   + +DDKCAACNAVA         EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K     +QW+KV ++DN
Subjt:  LLLVIYSGAVNCIDDKCAACNAVA---------EKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDN

Query:  LTN
        LT+
Subjt:  LTN

AT1G42480.3 unknown protein2.1e-3247.47Show/hide
Query:  LLLVIYSGAVNCIDDKCAACNAVAEKPRNHL---DMRHR-LDSKGQRKGKVIDYR-----VSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDN
        +++ I+S   + +DDKCAACNAVAE+    L    ++ R L +    KGK  D       +S+LRVV+LLDGLC++MQDYT++K     +QW+KV ++DN
Subjt:  LLLVIYSGAVNCIDDKCAACNAVAEKPRNHL---DMRHR-LDSKGQRKGKVIDYR-----VSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDN

Query:  LTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHC
        LTN+QEA+A++ DISTYCGR    +     +L E+IK GS++ G+V KVLC  LS HC
Subjt:  LTNQQEARAYSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ACGACGATGAAGTTCAACGCGTGGGGGCTACTGTTGTTAGTGATCTACTCCGGTGCCGTCAATTGCATTGATGATAAATGCGCCGCTTGCAATGCTGTTGCCGAAAAACC
GAGGAATCATTTAGATATGAGACATCGGTTGGATTCAAAAGGTCAGCGTAAGGGGAAGGTAATTGATTACAGGGTCAGTGAGCTAAGAGTTGTAGAACTCCTGGATGGGC
TTTGTGAAAAGATGCAAGATTACACTATTGAGAAGACAGCTTCATCCGGACAACAATGGATCAAGGTGGATAGCTGGGACAACTTGACAAATCAACAAGAAGCTCGAGCT
TACTCAAAAGATATATCAACTTATTGTGGGAGATCTTACTCAATTTCTTCATATATGATCCTCCAGTTGGCAGAATTGATTAAGAAAGGTTCTGTCAGAGAAGGTGACGT
TAGCAAAGTCCTATGCCATGATTTGAGCAGGCACTGCAGCAACGCC
mRNA sequenceShow/hide mRNA sequence
ACGACGATGAAGTTCAACGCGTGGGGGCTACTGTTGTTAGTGATCTACTCCGGTGCCGTCAATTGCATTGATGATAAATGCGCCGCTTGCAATGCTGTTGCCGAAAAACC
GAGGAATCATTTAGATATGAGACATCGGTTGGATTCAAAAGGTCAGCGTAAGGGGAAGGTAATTGATTACAGGGTCAGTGAGCTAAGAGTTGTAGAACTCCTGGATGGGC
TTTGTGAAAAGATGCAAGATTACACTATTGAGAAGACAGCTTCATCCGGACAACAATGGATCAAGGTGGATAGCTGGGACAACTTGACAAATCAACAAGAAGCTCGAGCT
TACTCAAAAGATATATCAACTTATTGTGGGAGATCTTACTCAATTTCTTCATATATGATCCTCCAGTTGGCAGAATTGATTAAGAAAGGTTCTGTCAGAGAAGGTGACGT
TAGCAAAGTCCTATGCCATGATTTGAGCAGGCACTGCAGCAACGCC
Protein sequenceShow/hide protein sequence
TTMKFNAWGLLLLVIYSGAVNCIDDKCAACNAVAEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKTASSGQQWIKVDSWDNLTNQQEARA
YSKDISTYCGRSYSISSYMILQLAELIKKGSVREGDVSKVLCHDLSRHCSNA