; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G047040 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G047040
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionSerine/arginine repetitive matrix protein 1-like
Genome locationCiama_Chr02:34781843..34782538
RNA-Seq ExpressionCaUC02G047040
SyntenyCaUC02G047040
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR007592 - GLABROUS1 enhancer-binding protein family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAY33107.1 hypothetical protein CUMW_281630 [Citrus unshiu]1.1e-3138.57Show/hide
Query:  NNPTDQ--LHFTDEDELNLLNCYLQIAR-----SENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFARTKSFIKTPHHRQILELGR
        ++PT Q    FTD DE+ LL  + ++A      S + T NS  TLDS + +RI ++LG KFTH+ I DK+ RL++ YHK AR+KS +KTPH + + ++ +
Subjt:  NNPTDQ--LHFTDEDELNLLNCYLQIAR-----SENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFARTKSFIKTPHHRQILELGR

Query:  CIWGKSPTPITRKPQVISPSRAL---SRRIKQRSISRLSR-KEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWVLLHIEGAELKARR
         IWGK  T   ++ + +  S +        +Q++     R +E  +L  YPVL GE S+  P N VWRE L+++    LK+MN+KW+L+ IE A++ +++
Subjt:  CIWGKSPTPITRKPQVISPSRAL---SRRIKQRSISRLSR-KEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWVLLHIEGAELKARR

Query:  ASLLKQQLRI
        A L+K+Q ++
Subjt:  ASLLKQQLRI

KAA0037301.1 serine/arginine repetitive matrix protein 1-like [Cucumis melo var. makuwa]9.9e-9481.58Show/hide
Query:  MSTPRWSPPSNASPG--TLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFA
        MSTP  S PSNASP   TLTLTLTQ NN TDQLHFTDEDE+NLL  YLQI+RSENPTKNS PTLDSP  DR+QTA+GPKF+HS IADKLHRLKLLYHKFA
Subjt:  MSTPRWSPPSNASPG--TLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFA

Query:  RTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKD
        RTKSFIKTPH RQIL+LGR IWGKSPTPITRKPQVISPSR LSRRIKQRS    +RKE     +DLKN+PVLV EFS+ FPGNGVWREGLR MEEK+LKD
Subjt:  RTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKD

Query:  MNEKWVLLHIEGAELKARRASLLKQQLR
        MNEKWVLLHIEGAELKARRA+LLKQQL+
Subjt:  MNEKWVLLHIEGAELKARRASLLKQQLR

KAG6570827.1 hypothetical protein SDJN03_29742, partial [Cucurbita argyrosperma subsp. sororia]2.2e-7771.56Show/hide
Query:  MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFART
        MSTP   PPSNASP TLT             HF++EDELNLLNCYLQIARS NPT NSQPTLDSP LDRI+TAL PKFT+SHIADKLHRLKL YHK ART
Subjt:  MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFART

Query:  KSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWV
        KS IKTPHHR+ILE+GR IWGK PT  T KPQ I   RA+ RRIKQRS    S K+ VDL ++PVL+ EFSR FPGNGVW+EGLR MEEKSL+DMNEKWV
Subjt:  KSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWV

Query:  LLHIEGAELKARRASLLKQQLRITE
        LLHIE AELKARRA+LL++QLR+ +
Subjt:  LLHIEGAELKARRASLLKQQLRITE

KAG6606028.1 hypothetical protein SDJN03_03345, partial [Cucurbita argyrosperma subsp. sororia]1.8e-7167.1Show/hide
Query:  MSTPRWSPPS---NASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKF
        MSTPR SPPS    AS       LTQS     Q+HFTDEDEL+LLNCYL++A S    KNSQPTLDSP LDRI TALG KF HSHIADKLHRLK+ YHKF
Subjt:  MSTPRWSPPS---NASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKF

Query:  ARTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNE
        ARTKSFIKTPHH +ILE+GR IWGK   P T KPQVI      SRRI+ RS+   ++K+ VDLKN+PVLV EFSR FPGNGVWREGL+ MEE SLK MNE
Subjt:  ARTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNE

Query:  KWVLLHIEGAELKARRASLLKQQLRITETSE
        KWVLLHIE AELKARR +L+++Q+  T+T +
Subjt:  KWVLLHIEGAELKARRASLLKQQLRITETSE

KGN46890.1 hypothetical protein Csa_020607 [Cucumis sativus]2.6e-9479.57Show/hide
Query:  MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFART
        MST   SPPS+ASP     TLTQSNN  DQLHF DEDE+NLLN YLQI+RSENPTKNS PTLDS   DR+QTA+GPKF+HS IADKLHRLKLLYHKFART
Subjt:  MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFART

Query:  KSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMN
        KSFIKTPH RQIL+LGR IWGKSPTP+TRKPQVISPSR LSRRIKQRSI   +RKE     VDLKN+PVLV EFSR FPGNGVWR+GLR M EK+LKDMN
Subjt:  KSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMN

Query:  EKWVLLHIEGAELKARRASLLKQQLRITETSEDVN
        EKWVLLHIEGAELKARRA+LLK+QLR TET+EDVN
Subjt:  EKWVLLHIEGAELKARRASLLKQQLRITETSEDVN

TrEMBL top hitse value%identityAlignment
A0A0A0KAX7 Uncharacterized protein1.3e-9479.57Show/hide
Query:  MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFART
        MST   SPPS+ASP     TLTQSNN  DQLHF DEDE+NLLN YLQI+RSENPTKNS PTLDS   DR+QTA+GPKF+HS IADKLHRLKLLYHKFART
Subjt:  MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFART

Query:  KSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMN
        KSFIKTPH RQIL+LGR IWGKSPTP+TRKPQVISPSR LSRRIKQRSI   +RKE     VDLKN+PVLV EFSR FPGNGVWR+GLR M EK+LKDMN
Subjt:  KSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMN

Query:  EKWVLLHIEGAELKARRASLLKQQLRITETSEDVN
        EKWVLLHIEGAELKARRA+LLK+QLR TET+EDVN
Subjt:  EKWVLLHIEGAELKARRASLLKQQLRITETSEDVN

A0A2H5MZL8 Uncharacterized protein5.2e-3238.57Show/hide
Query:  NNPTDQ--LHFTDEDELNLLNCYLQIAR-----SENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFARTKSFIKTPHHRQILELGR
        ++PT Q    FTD DE+ LL  + ++A      S + T NS  TLDS + +RI ++LG KFTH+ I DK+ RL++ YHK AR+KS +KTPH + + ++ +
Subjt:  NNPTDQ--LHFTDEDELNLLNCYLQIAR-----SENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFARTKSFIKTPHHRQILELGR

Query:  CIWGKSPTPITRKPQVISPSRAL---SRRIKQRSISRLSR-KEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWVLLHIEGAELKARR
         IWGK  T   ++ + +  S +        +Q++     R +E  +L  YPVL GE S+  P N VWRE L+++    LK+MN+KW+L+ IE A++ +++
Subjt:  CIWGKSPTPITRKPQVISPSRAL---SRRIKQRSISRLSR-KEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWVLLHIEGAELKARR

Query:  ASLLKQQLRI
        A L+K+Q ++
Subjt:  ASLLKQQLRI

A0A5A7T6T1 Serine/arginine repetitive matrix protein 1-like4.8e-9481.58Show/hide
Query:  MSTPRWSPPSNASPG--TLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFA
        MSTP  S PSNASP   TLTLTLTQ NN TDQLHFTDEDE+NLL  YLQI+RSENPTKNS PTLDSP  DR+QTA+GPKF+HS IADKLHRLKLLYHKFA
Subjt:  MSTPRWSPPSNASPG--TLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFA

Query:  RTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKD
        RTKSFIKTPH RQIL+LGR IWGKSPTPITRKPQVISPSR LSRRIKQRS    +RKE     +DLKN+PVLV EFS+ FPGNGVWREGLR MEEK+LKD
Subjt:  RTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKE----KVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKD

Query:  MNEKWVLLHIEGAELKARRASLLKQQLR
        MNEKWVLLHIEGAELKARRA+LLKQQL+
Subjt:  MNEKWVLLHIEGAELKARRASLLKQQLR

A0A6P3Z773 probable transcription factor At1g617301.4e-2938.3Show/hide
Query:  STPRWSPPSNASPGTLTLT---LTQSNNPTDQLH----------FTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLH
        S+ + + PS+ SP T T T    + S +P +Q            FT+EDE +LL  +  I +S  P+      ++SP LDRI+ +LG +F+H+ I DKL 
Subjt:  STPRWSPPSNASPGTLTLT---LTQSNNPTDQLH----------FTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLH

Query:  RLKLLYHKFARTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAME
        RL+L YHK ARTKS IKT H R++ ++ R IWG+            S SR   +  +++      R   V L+ +PVL  E   +  GN VW++GL  +E
Subjt:  RLKLLYHKFARTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAME

Query:  EKSLKDMNEKWVLLHIEGAELKARRASLLKQQLRI
        EK+LK +NEKW+LL +E AE+ A RA L K+ + +
Subjt:  EKSLKDMNEKWVLLHIEGAELKARRASLLKQQLRI

V4RSW9 Uncharacterized protein1.2e-3137.23Show/hide
Query:  STPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIAR-----SENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHK
        STPR       SP      ++   +P  ++ FTD DE+ LL  + ++A      S + T NS  TLDS + +RI ++LG KFTH+ I DK+ RL++ YHK
Subjt:  STPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIAR-----SENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHK

Query:  FARTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRAL---SRRIKQRSISRLSR-KEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSL
         AR+KS IKTPH + + ++ + IWGK  T   ++ + +  S +        +Q++     R +E  +L  YPVL GE S+  P N VWRE L+++    L
Subjt:  FARTKSFIKTPHHRQILELGRCIWGKSPTPITRKPQVISPSRAL---SRRIKQRSISRLSR-KEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSL

Query:  KDMNEKWVLLHIEGAELKARRASLLKQQLRI
        K+MN+KW+L+ IE A++ +++A L+K+Q ++
Subjt:  KDMNEKWVLLHIEGAELKARRASLLKQQLRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAACGCCCCGCTGGTCACCGCCGTCCAACGCCTCTCCCGGAACCCTAACCCTAACCCTAACCCAATCCAACAATCCAACCGATCAACTCCATTTCACCGACGAAGA
TGAACTCAATCTCCTGAATTGCTACCTCCAAATCGCCAGATCCGAAAACCCCACCAAGAATTCTCAACCCACTCTCGATTCTCCGACCTTGGACCGCATCCAGACCGCTC
TCGGACCCAAATTCACCCACTCCCACATCGCCGATAAGCTCCACCGCCTCAAACTCCTGTACCACAAATTCGCAAGAACCAAGTCTTTCATCAAGACTCCCCACCACCGT
CAGATCCTCGAGCTCGGCCGCTGCATCTGGGGAAAATCCCCCACACCTATAACAAGAAAACCACAGGTAATTTCACCTTCACGAGCACTTTCACGAAGAATCAAACAAAG
GTCAATCTCAAGACTCTCAAGGAAAGAAAAGGTTGATCTGAAGAACTATCCTGTTCTTGTGGGTGAATTTTCTCGGCTGTTTCCGGGCAATGGGGTGTGGAGAGAAGGGC
TGCGGGCAATGGAAGAGAAGAGTTTGAAGGATATGAATGAGAAGTGGGTTTTGTTGCATATTGAAGGGGCAGAGCTTAAGGCAAGAAGGGCTTCATTATTAAAGCAGCAA
CTTAGAATTACAGAGACTTCTGAAGATGTGAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAACGCCCCGCTGGTCACCGCCGTCCAACGCCTCTCCCGGAACCCTAACCCTAACCCTAACCCAATCCAACAATCCAACCGATCAACTCCATTTCACCGACGAAGA
TGAACTCAATCTCCTGAATTGCTACCTCCAAATCGCCAGATCCGAAAACCCCACCAAGAATTCTCAACCCACTCTCGATTCTCCGACCTTGGACCGCATCCAGACCGCTC
TCGGACCCAAATTCACCCACTCCCACATCGCCGATAAGCTCCACCGCCTCAAACTCCTGTACCACAAATTCGCAAGAACCAAGTCTTTCATCAAGACTCCCCACCACCGT
CAGATCCTCGAGCTCGGCCGCTGCATCTGGGGAAAATCCCCCACACCTATAACAAGAAAACCACAGGTAATTTCACCTTCACGAGCACTTTCACGAAGAATCAAACAAAG
GTCAATCTCAAGACTCTCAAGGAAAGAAAAGGTTGATCTGAAGAACTATCCTGTTCTTGTGGGTGAATTTTCTCGGCTGTTTCCGGGCAATGGGGTGTGGAGAGAAGGGC
TGCGGGCAATGGAAGAGAAGAGTTTGAAGGATATGAATGAGAAGTGGGTTTTGTTGCATATTGAAGGGGCAGAGCTTAAGGCAAGAAGGGCTTCATTATTAAAGCAGCAA
CTTAGAATTACAGAGACTTCTGAAGATGTGAATTAG
Protein sequenceShow/hide protein sequence
MSTPRWSPPSNASPGTLTLTLTQSNNPTDQLHFTDEDELNLLNCYLQIARSENPTKNSQPTLDSPTLDRIQTALGPKFTHSHIADKLHRLKLLYHKFARTKSFIKTPHHR
QILELGRCIWGKSPTPITRKPQVISPSRALSRRIKQRSISRLSRKEKVDLKNYPVLVGEFSRLFPGNGVWREGLRAMEEKSLKDMNEKWVLLHIEGAELKARRASLLKQQ
LRITETSEDVN