; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS001844 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS001844
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionprotein indeterminate-domain 16
Genome locationscaffold30:343288..343791
RNA-Seq ExpressionMS001844
SyntenyMS001844
Gene Ontology termsGO:0009630 - gravitropism (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR039288 - Zinc finger protein SHOOT GRAVITROPISM 5-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0057771.1 protein indeterminate-domain 16 [Cucumis melo var. makuwa]2.7e-5678.02Show/hide
Query:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK
        MEDEDQKELQLLPTPHS+A SSSHMSSR  +SS RFR    SDP FE G      +G S+DLQLSIS+RPIR  +          E+MKPEAA   EALK
Subjt:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK

Query:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        WQAAEQIRLAA+EKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

XP_004138047.1 protein indeterminate-domain 16 [Cucumis sativus]3.5e-5676.76Show/hide
Query:  MEDEDQKELQLLPTPHSMASS--SHMSSRAGE-SSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS-----------ELMKPEAA---E
        MEDEDQKELQLLPTPHS+ASS  SH+SSR  + SS RFR    SDPLFE G      +G S+DLQLSIS+RPIR  +           E+MKPEAA   +
Subjt:  MEDEDQKELQLLPTPHSMASS--SHMSSRAGE-SSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS-----------ELMKPEAA---E

Query:  ALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        ALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  ALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

XP_008464429.1 PREDICTED: protein indeterminate-domain 16 [Cucumis melo]9.2e-5778.57Show/hide
Query:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK
        MEDEDQKELQLLPTPHS+A SSSHMSSR  +SS RFR    SDP FE G      +G S+DLQLSIS+RPIR  +          E+MKPEAA   EALK
Subjt:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK

Query:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

XP_022135320.1 protein indeterminate-domain 16 [Momordica charantia]5.0e-7999.4Show/hide
Query:  MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEK
        MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTT SDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEK
Subjt:  MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEK

Query:  AYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        AYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  AYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

XP_038878928.1 protein indeterminate-domain 16 [Benincasa hispida]7.0e-5779.33Show/hide
Query:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS-------ELMKPEAA---EALKWQA
        MEDEDQKELQLLPTPHS+A SSSHMSSR  +SS RFRS       FE  G G      S+DLQLSIS+RPIR A+       E+MKPEAA   EALKWQA
Subjt:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS-------ELMKPEAA---EALKWQA

Query:  AEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        AEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  AEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

TrEMBL top hitse value%identityAlignment
A0A0A0LRS8 Uncharacterized protein1.7e-5676.76Show/hide
Query:  MEDEDQKELQLLPTPHSMASS--SHMSSRAGE-SSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS-----------ELMKPEAA---E
        MEDEDQKELQLLPTPHS+ASS  SH+SSR  + SS RFR    SDPLFE G      +G S+DLQLSIS+RPIR  +           E+MKPEAA   +
Subjt:  MEDEDQKELQLLPTPHSMASS--SHMSSRAGE-SSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS-----------ELMKPEAA---E

Query:  ALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        ALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  ALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

A0A1S3CN02 protein indeterminate-domain 164.4e-5778.57Show/hide
Query:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK
        MEDEDQKELQLLPTPHS+A SSSHMSSR  +SS RFR    SDP FE G      +G S+DLQLSIS+RPIR  +          E+MKPEAA   EALK
Subjt:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK

Query:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

A0A5A7UWD1 Protein indeterminate-domain 161.3e-5678.02Show/hide
Query:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK
        MEDEDQKELQLLPTPHS+A SSSHMSSR  +SS RFR    SDP FE G      +G S+DLQLSIS+RPIR  +          E+MKPEAA   EALK
Subjt:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK

Query:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        WQAAEQIRLAA+EKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

A0A5D3BFD1 Protein indeterminate-domain 164.4e-5778.57Show/hide
Query:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK
        MEDEDQKELQLLPTPHS+A SSSHMSSR  +SS RFR    SDP FE G      +G S+DLQLSIS+RPIR  +          E+MKPEAA   EALK
Subjt:  MEDEDQKELQLLPTPHSMA-SSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAAS----------ELMKPEAA---EALK

Query:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  WQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

A0A6J1C0T1 protein indeterminate-domain 162.4e-7999.4Show/hide
Query:  MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEK
        MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTT SDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEK
Subjt:  MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEK

Query:  AYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
        AYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS
Subjt:  AYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS

SwissProt top hitse value%identityAlignment
F4IPE3 Zinc finger protein SHOOT GRAVITROPISM 51.7e-0841.77Show/hide
Query:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR
        A+EQI ++A  EKAYAE  +   KR+ E+A+ EFA A+++ ++A+ E+E+A+ +KE++ +++ ST M++TCQ+C+ +F+
Subjt:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR

Q9C9X7 Protein indeterminate-domain 141.5e-0644.44Show/hide
Query:  ERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR
        E  R  TKR+IE+A+ EFA A+++ + AR E+ KA   +E A+R++ +T M+ITC +C+Q F+
Subjt:  ERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR

Q9FRH4 Protein indeterminate-domain 161.8e-0733.59Show/hide
Query:  RSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEV
        R +    P F      + A   SL+LQLSI +    A +   +       K +A E+ R        AE  R+  KR+IE+A+ +F +A+++ E A+ E+
Subjt:  RSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEV

Query:  EKAERMKERATRQMDSTCMEITCQSCRQRFR
        EKA  ++E A +++++T MEITC SC+Q F+
Subjt:  EKAERMKERATRQMDSTCMEITCQSCRQRFR

Arabidopsis top hitse value%identityAlignment
AT1G25250.1 indeterminate(ID)-domain 161.3e-0833.59Show/hide
Query:  RSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEV
        R +    P F      + A   SL+LQLSI +    A +   +       K +A E+ R        AE  R+  KR+IE+A+ +F +A+++ E A+ E+
Subjt:  RSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEV

Query:  EKAERMKERATRQMDSTCMEITCQSCRQRFR
        EKA  ++E A +++++T MEITC SC+Q F+
Subjt:  EKAERMKERATRQMDSTCMEITCQSCRQRFR

AT2G01940.1 C2H2-like zinc finger protein1.2e-0941.77Show/hide
Query:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR
        A+EQI ++A  EKAYAE  +   KR+ E+A+ EFA A+++ ++A+ E+E+A+ +KE++ +++ ST M++TCQ+C+ +F+
Subjt:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR

AT2G01940.2 C2H2-like zinc finger protein1.2e-0941.77Show/hide
Query:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR
        A+EQI ++A  EKAYAE  +   KR+ E+A+ EFA A+++ ++A+ E+E+A+ +KE++ +++ ST M++TCQ+C+ +F+
Subjt:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR

AT2G01940.3 C2H2-like zinc finger protein1.2e-0941.77Show/hide
Query:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR
        A+EQI ++A  EKAYAE  +   KR+ E+A+ EFA A+++ ++A+ E+E+A+ +KE++ +++ ST M++TCQ+C+ +F+
Subjt:  AAEQI-RLAAMEKAYAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFR

AT3G48550.1 BEST Arabidopsis thaliana protein match is: C2H2-like zinc finger protein (TAIR:AT2G01940.3)4.0e-3453.61Show/hide
Query:  EDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKA
        + +  KELQLLP+P S  S             + R T ++          +      LDL+LSIS+  I  A EL      EALKWQAAEQIRLAA+EKA
Subjt:  EDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKA

Query:  YAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRP
        YAERVRELT+RE+E+AQ+EFARAR MW++AREEVE+AER+KER+  ++D+ C+EITC SCRQRFRP
Subjt:  YAERVRELTKREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGACGAAGATCAGAAGGAGTTGCAGCTCCTCCCGACGCCTCACTCGATGGCATCGTCATCCCACATGTCGTCTCGGGCGGGCGAGTCGTCGTTCCGGTTCCGGTC
GACGACGATGTCGGATCCGCTGTTCGAAGGCGGGGGTGCAGGTGCAGGTGCAGCAGGTGCGTCGCTGGACCTGCAGCTGTCGATCAGCGTGAGGCCAATCCGGGCGGCGT
CGGAGCTGATGAAGCCGGAGGCGGCGGAGGCGCTGAAGTGGCAGGCGGCGGAGCAAATCCGGCTGGCGGCGATGGAGAAGGCGTACGCGGAGAGGGTGAGGGAGCTGACG
AAGCGGGAGATAGAATTGGCGCAGACGGAATTCGCGAGGGCGCGGCAGATGTGGGAGAGGGCTCGCGAAGAGGTTGAAAAGGCGGAGCGTATGAAAGAGAGAGCCACGCG
CCAAATGGATTCTACGTGCATGGAGATCACCTGCCAATCCTGCCGCCAGAGGTTTCGGCCTTCC
mRNA sequenceShow/hide mRNA sequence
ATGGAAGACGAAGATCAGAAGGAGTTGCAGCTCCTCCCGACGCCTCACTCGATGGCATCGTCATCCCACATGTCGTCTCGGGCGGGCGAGTCGTCGTTCCGGTTCCGGTC
GACGACGATGTCGGATCCGCTGTTCGAAGGCGGGGGTGCAGGTGCAGGTGCAGCAGGTGCGTCGCTGGACCTGCAGCTGTCGATCAGCGTGAGGCCAATCCGGGCGGCGT
CGGAGCTGATGAAGCCGGAGGCGGCGGAGGCGCTGAAGTGGCAGGCGGCGGAGCAAATCCGGCTGGCGGCGATGGAGAAGGCGTACGCGGAGAGGGTGAGGGAGCTGACG
AAGCGGGAGATAGAATTGGCGCAGACGGAATTCGCGAGGGCGCGGCAGATGTGGGAGAGGGCTCGCGAAGAGGTTGAAAAGGCGGAGCGTATGAAAGAGAGAGCCACGCG
CCAAATGGATTCTACGTGCATGGAGATCACCTGCCAATCCTGCCGCCAGAGGTTTCGGCCTTCC
Protein sequenceShow/hide protein sequence
MEDEDQKELQLLPTPHSMASSSHMSSRAGESSFRFRSTTMSDPLFEGGGAGAGAAGASLDLQLSISVRPIRAASELMKPEAAEALKWQAAEQIRLAAMEKAYAERVRELT
KREIELAQTEFARARQMWERAREEVEKAERMKERATRQMDSTCMEITCQSCRQRFRPS