; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS005120 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS005120
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionSerine/arginine repetitive matrix protein 1-like
Genome locationscaffold176:2825992..2826687
RNA-Seq ExpressionMS005120
SyntenyMS005120
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR007592 - GLABROUS1 enhancer-binding protein family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037301.1 serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa]1.1e-5754.24Show/hide
Query:  MSTPPLPAPAPAA----------LSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFA
        MSTP    P+ A+             +  T+Q  FT+E+E+ LLK YL+I+RS    +NS P+LDSPA DR++ A+GPKFS   IADKLHRLKLLYHKFA
Subjt:  MSTPPLPAPAPAA----------LSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFA

Query:  RTKSFVKTPHHRRILDIGRRIWGKSPTPTTR----TKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGL
        RTKSF+KTPH R+ILD+GR IWGKSPTP TR      P +++ R  ++ S T          ++ GVG   DL  FP+LV EFS+ FPGN VWREG+R +
Subjt:  RTKSFVKTPHHRRILDIGRRIWGKSPTPTTR----TKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGL

Query:  EEKILMGMNQNWVLLHIEEAELNARRAALLHHQLRS
        EEK L  MN+ WVLLHIE AEL ARRAALL  QL++
Subjt:  EEKILMGMNQNWVLLHIEEAELNARRAALLHHQLRS

KAG6570827.1 hypothetical protein SDJN03_29742, partial [Cucurbita argyrosperma subsp. sororia]6.7e-5858.82Show/hide
Query:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSA---ENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPH
        MSTP    P+ A+  +  H     F+EE+E+ LL CYL+IARSA    NSQP+LDSPALDRIE AL PKF+  HIADKLHRLKL YHK ARTKS +KTPH
Subjt:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSA---ENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPH

Query:  HRRILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVL
        HRRIL+IGR IWGK   PT+RTKP  ++    R+           + ++      G DLN FP+L+ EFSR FPGN VW+EG+RG+EEK L  MN+ WVL
Subjt:  HRRILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVL

Query:  LHIEEAELNARRAALLHHQLR
        LHIEEAEL ARRAALL  QLR
Subjt:  LHIEEAELNARRAALLHHQLR

KAG6606028.1 hypothetical protein SDJN03_03345, partial [Cucurbita argyrosperma subsp. sororia]8.2e-5660Show/hide
Query:  APAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRRILDIGRR
        AP P   S      Q  FT+E+E+ LL CYL++A S +NSQP+LDSPALDRI  ALG KF   HIADKLHRLK+ YHKFARTKSF+KTPHH RIL+IGR 
Subjt:  APAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRRILDIGRR

Query:  IWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHIEEAELNA
        IWGK   P  RTKP +VI R  R           SVAKK+     G DL  FP+LV EFSR FPGN VWREG++G+EE  L GMN+ WVLLHIEEAEL A
Subjt:  IWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHIEEAELNA

Query:  RRAALLHHQLRSTLT
        RR AL+  Q+ +T T
Subjt:  RRAALLHHQLRSTLT

KGN46890.1 hypothetical protein Csa_020607 [Cucumis sativus]3.3e-5756.64Show/hide
Query:  TPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHR
        +PP  A       S+   +Q  F +E+E+ LL  YL+I+RS    +NS P+LDS A DR++ A+GPKFS   IADKLHRLKLLYHKFARTKSF+KTPH R
Subjt:  TPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHR

Query:  RILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH
        +ILD+GR IWGKSPTP TR KP   ++  +R  S    Q   S+ +KE GVG   DL  FP+LV EFSR FPGN VWR+G+R + EK L  MN+ WVLLH
Subjt:  RILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH

Query:  IEEAELNARRAALLHHQLRSTLTDTE
        IE AEL ARRAALL  QLR+T T+ +
Subjt:  IEEAELNARRAALLHHQLRSTLTDTE

XP_022756636.1 probable transcription factor At1g61730 [Durio zibethinus]5.4e-3139.11Show/hide
Query:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRR
        MST P P+  P   SS        F+E +E ++L+C ++  +S      ++ +P ++RI + L  +FS   I DK+ RL++ YHK ARTKS V+T H RR
Subjt:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRR

Query:  ILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGA-DLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH
        I  + +RIWGK  TP              +K  E          + E G G G  DL+KFP LV EFSR+ P N VW+E M+GLEE+ L  M+Q WVL+ 
Subjt:  ILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGA-DLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH

Query:  IEEAELNARRAALLHHQLRSTLTDT
        +EEA+L A++A L+  Q+   + +T
Subjt:  IEEAELNARRAALLHHQLRSTLTDT

TrEMBL top hitse value%identityAlignment
A0A0A0KAX7 Uncharacterized protein1.6e-5756.64Show/hide
Query:  TPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHR
        +PP  A       S+   +Q  F +E+E+ LL  YL+I+RS    +NS P+LDS A DR++ A+GPKFS   IADKLHRLKLLYHKFARTKSF+KTPH R
Subjt:  TPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHR

Query:  RILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH
        +ILD+GR IWGKSPTP TR KP   ++  +R  S    Q   S+ +KE GVG   DL  FP+LV EFSR FPGN VWR+G+R + EK L  MN+ WVLLH
Subjt:  RILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH

Query:  IEEAELNARRAALLHHQLRSTLTDTE
        IE AEL ARRAALL  QLR+T T+ +
Subjt:  IEEAELNARRAALLHHQLRSTLTDTE

A0A5A7T6T1 Serine/arginine repetitive matrix protein 1-like5.6e-5854.24Show/hide
Query:  MSTPPLPAPAPAA----------LSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFA
        MSTP    P+ A+             +  T+Q  FT+E+E+ LLK YL+I+RS    +NS P+LDSPA DR++ A+GPKFS   IADKLHRLKLLYHKFA
Subjt:  MSTPPLPAPAPAA----------LSSDGHTNQFQFTEEEEVKLLKCYLKIARS---AENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFA

Query:  RTKSFVKTPHHRRILDIGRRIWGKSPTPTTR----TKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGL
        RTKSF+KTPH R+ILD+GR IWGKSPTP TR      P +++ R  ++ S T          ++ GVG   DL  FP+LV EFS+ FPGN VWREG+R +
Subjt:  RTKSFVKTPHHRRILDIGRRIWGKSPTPTTR----TKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGL

Query:  EEKILMGMNQNWVLLHIEEAELNARRAALLHHQLRS
        EEK L  MN+ WVLLHIE AEL ARRAALL  QL++
Subjt:  EEKILMGMNQNWVLLHIEEAELNARRAALLHHQLRS

A0A6A2Z6N0 Uncharacterized protein6.4e-3038.39Show/hide
Query:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRR
        MSTP    P P   + +G +++  F+E +E+++LKC ++  +S      ++ +P ++RI + L  KFS   I DKL RL+  YHK AR KS V+T H RR
Subjt:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRR

Query:  ILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHI
        I  + +RIWGK   P              +K +E  S+       KE GV    +L  F  LV EFSR+ PGN VW+E M G+ E  L  M+Q WV L +
Subjt:  ILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHI

Query:  EEAELNARRAALLHHQLRSTLTDT
        EEA+L AR+A L+  Q+   + +T
Subjt:  EEAELNARRAALLHHQLRSTLTDT

A0A6J0ZSJ3 probable transcription factor At1g617305.4e-2935.91Show/hide
Query:  APAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRRILDIGRR
        A +P + + +G +++  F+E +E+++LKC  +  +S      ++ +  ++RI + L  KF+   I DKL RL+L YHK AR KS V+T H RRI  + +R
Subjt:  APAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRRILDIGRR

Query:  IWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAG---ADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHIEEAE
        IWGK  + T   K                        ++E G G      +L KFP LV EFSR+ P N VW+E M+GLEE+ L  M+Q W+ + +EEA+
Subjt:  IWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAG---ADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHIEEAE

Query:  LNARRAALLHHQLRSTLTDT
        L A++A L+  Q+   L +T
Subjt:  LNARRAALLHHQLRSTLTDT

A0A6P5ZW34 probable transcription factor At1g617302.6e-3139.11Show/hide
Query:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRR
        MST P P+  P   SS        F+E +E ++L+C ++  +S      ++ +P ++RI + L  +FS   I DK+ RL++ YHK ARTKS V+T H RR
Subjt:  MSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRR

Query:  ILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGA-DLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH
        I  + +RIWGK  TP              +K  E          + E G G G  DL+KFP LV EFSR+ P N VW+E M+GLEE+ L  M+Q WVL+ 
Subjt:  ILDIGRRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGA-DLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLH

Query:  IEEAELNARRAALLHHQLRSTLTDT
        +EEA+L A++A L+  Q+   + +T
Subjt:  IEEAELNARRAALLHHQLRSTLTDT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CTATCCCTTAAGCCGATGTCAACGCCCCCGCTGCCGGCGCCGGCGCCGGCCGCCCTTTCTTCCGACGGCCACACCAATCAATTTCAATTCACGGAGGAGGAGGAAGTGAA
GCTTCTCAAGTGCTACCTGAAAATCGCAAGATCCGCGGAGAATTCCCAACCCAGTCTGGATTCTCCGGCCTTGGATCGCATCGAGAGAGCCCTCGGCCCCAAATTCAGCC
AGATCCACATCGCCGACAAGCTCCACAGGCTCAAGCTCCTGTACCACAAATTCGCCAGAACCAAGTCCTTCGTCAAGACCCCCCACCACCGCCGGATCCTCGACATCGGC
CGCCGCATCTGGGGCAAATCCCCCACCCCCACAACCAGAACAAAACCCCATAAGGTAATTTTACGCACAACCAGAAAAAATTCAGAAACCCCTTCTCAACTCTCATTCTC
AGTAGCAAAAAAAGAAGTAGGGGTTGGAGCTGGGGCTGATCTTAACAAGTTCCCTCTTCTCGTGGGCGAATTTTCTCGGCTGTTTCCGGGAAATGTGGTGTGGAGAGAGG
GGATGAGGGGTCTAGAGGAGAAGATTTTGATGGGTATGAATCAGAATTGGGTTTTGTTGCACATTGAAGAGGCAGAGTTGAACGCTAGAAGGGCTGCGTTATTACACCAC
CAACTCAGAAGCACACTCACAGACACTGAGGATCAG
mRNA sequenceShow/hide mRNA sequence
CTATCCCTTAAGCCGATGTCAACGCCCCCGCTGCCGGCGCCGGCGCCGGCCGCCCTTTCTTCCGACGGCCACACCAATCAATTTCAATTCACGGAGGAGGAGGAAGTGAA
GCTTCTCAAGTGCTACCTGAAAATCGCAAGATCCGCGGAGAATTCCCAACCCAGTCTGGATTCTCCGGCCTTGGATCGCATCGAGAGAGCCCTCGGCCCCAAATTCAGCC
AGATCCACATCGCCGACAAGCTCCACAGGCTCAAGCTCCTGTACCACAAATTCGCCAGAACCAAGTCCTTCGTCAAGACCCCCCACCACCGCCGGATCCTCGACATCGGC
CGCCGCATCTGGGGCAAATCCCCCACCCCCACAACCAGAACAAAACCCCATAAGGTAATTTTACGCACAACCAGAAAAAATTCAGAAACCCCTTCTCAACTCTCATTCTC
AGTAGCAAAAAAAGAAGTAGGGGTTGGAGCTGGGGCTGATCTTAACAAGTTCCCTCTTCTCGTGGGCGAATTTTCTCGGCTGTTTCCGGGAAATGTGGTGTGGAGAGAGG
GGATGAGGGGTCTAGAGGAGAAGATTTTGATGGGTATGAATCAGAATTGGGTTTTGTTGCACATTGAAGAGGCAGAGTTGAACGCTAGAAGGGCTGCGTTATTACACCAC
CAACTCAGAAGCACACTCACAGACACTGAGGATCAG
Protein sequenceShow/hide protein sequence
LSLKPMSTPPLPAPAPAALSSDGHTNQFQFTEEEEVKLLKCYLKIARSAENSQPSLDSPALDRIERALGPKFSQIHIADKLHRLKLLYHKFARTKSFVKTPHHRRILDIG
RRIWGKSPTPTTRTKPHKVILRTTRKNSETPSQLSFSVAKKEVGVGAGADLNKFPLLVGEFSRLFPGNVVWREGMRGLEEKILMGMNQNWVLLHIEEAELNARRAALLHH
QLRSTLTDTEDQ