; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G022780 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G022780
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationCicolChr02:5448181..5448882
RNA-Seq ExpressionCcUC02G022780
SyntenyCcUC02G022780
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572024.1 hypothetical protein SDJN03_28752, partial [Cucurbita argyrosperma subsp. sororia]4.0e-6662.96Show/hide
Query:  PQPVD---NETSSHQQQEEEEEEEEETMEQ---------PRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYE
        PQ V+   N++++  Q  E+   +  T+ Q         PR+RR RTRADTRR+EPPYPWS ++RA IH LEYLQ+NNIVTIKG+V+CKKCER YE+EY 
Subjt:  PQPVD---NETSSHQQQEEEEEEEEETMEQ---------PRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYE

Query:  LMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRL
        LMNKF EI RFIE E+++MHDRAP CW NPILPNC  C +E CVEP+I  +E   DD+QFSRINWLFLLLG+ +G LKLKQL+YFCA    HRTGAK+RL
Subjt:  LMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRL

Query:  LYLVYLALCQQLQPSN
        ++L YLALC+QLQPSN
Subjt:  LYLVYLALCQQLQPSN

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-6662.96Show/hide
Query:  PQPVD---NETSSHQQQEEEEEEEEETMEQ---------PRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYE
        PQ V+   N++++  Q  E+   +  T+ Q         PR+RR RTRADTRR+EPPYPWS ++RA IH LEYLQ+NNIVTIKG+V+CKKCER YE+EY 
Subjt:  PQPVD---NETSSHQQQEEEEEEEEETMEQ---------PRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYE

Query:  LMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRL
        LMNKF EI RFIE E+++MHDRAP CW NPILPNC +C +E CVEP+I  +E   DD+QFSRINWLFLLLG+ +G LKLKQL+YFCA    HRTGAK+RL
Subjt:  LMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRL

Query:  LYLVYLALCQQLQPSN
        ++L YLALC+QLQPSN
Subjt:  LYLVYLALCQQLQPSN

XP_008463189.1 PREDICTED: uncharacterized protein LOC103501397 [Cucumis melo]4.0e-6663.44Show/hide
Query:  LNLDLSLHPPSHSPP------QPVDNETSSHQQQEEEE-----EEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVK
        L+LDLSL  PS  PP      QP  +   S    EE E       E    +Q ++RRRRTRAD  R+EPPYPWSTDRRAV+H+L+YLQ NNI+TIKGEV 
Subjt:  LNLDLSLHPPSHSPP------QPVDNETSSHQQQEEEE-----EEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVK

Query:  CKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCA
        CKKCE KYEMEY+LMNK  EITRF E E +SMHDRAPSCW NP LPNC+ CN+EKCV PV  SQE+       ++INWLFL LG+FLGCLKL+QL+YFC 
Subjt:  CKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCA

Query:  QNNIHRTGAKNRLLYLVYLALCQQLQP
        Q NIHRTGAKNRLLYL Y  L +QLQP
Subjt:  QNNIHRTGAKNRLLYLVYLALCQQLQP

XP_011656206.1 uncharacterized protein LOC105435666 [Cucumis sativus]2.7e-6757.85Show/hide
Query:  MRTQEKSRQGRLNLDLSLHPPSHSPP--------------QPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEY
        M  ++ + +    LDLSL PPS  PP                + NE S+   Q   E   ++   + R RRRRTRAD  R+EPPYPW+TD+RAV+H+L+Y
Subjt:  MRTQEKSRQGRLNLDLSLHPPSHSPP--------------QPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEY

Query:  LQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKF
        LQ+NNI+ IKGEV CKKCE KYE+EY+LMNK  EITRF E E +SMHDRAP+CW  P LPNCNFCN+EKCV PVI  +++       S+INWLFL LG+F
Subjt:  LQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKF

Query:  LGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQPS
        LGCL+LKQL++FCAQ+NIHRTGAKNRLLYL Y AL  QLQPS
Subjt:  LGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQPS

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]4.5e-7873.56Show/hide
Query:  SHSPPQPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEI
        S   P   +NETS+HQQQ  E  E      QPR RRRRTRAD  R+EPPYPWSTDRRAVIH+L+YLQ+NNIVTIKGEVKCKKCE+KYEMEY+LMNKF EI
Subjt:  SHSPPQPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEI

Query:  TRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLAL
         RFIE EK+SMHDRAP CW  PILPNCN CNKE+CVEPVI        ++ +++INWLFLLLGKFLGCLKLKQL+YFCAQ NIHRTGAKNRLLYL+YL L
Subjt:  TRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLAL

Query:  CQQLQPSN
        C QLQPSN
Subjt:  CQQLQPSN

TrEMBL top hitse value%identityAlignment
A0A0A0KMQ2 Uncharacterized protein1.3e-6757.85Show/hide
Query:  MRTQEKSRQGRLNLDLSLHPPSHSPP--------------QPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEY
        M  ++ + +    LDLSL PPS  PP                + NE S+   Q   E   ++   + R RRRRTRAD  R+EPPYPW+TD+RAV+H+L+Y
Subjt:  MRTQEKSRQGRLNLDLSLHPPSHSPP--------------QPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEY

Query:  LQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKF
        LQ+NNI+ IKGEV CKKCE KYE+EY+LMNK  EITRF E E +SMHDRAP+CW  P LPNCNFCN+EKCV PVI  +++       S+INWLFL LG+F
Subjt:  LQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKF

Query:  LGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQPS
        LGCL+LKQL++FCAQ+NIHRTGAKNRLLYL Y AL  QLQPS
Subjt:  LGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQPS

A0A1S3AZB1 protein PAF1 homolog4.7e-6561.71Show/hide
Query:  SLHPPSHSPPQPVD----NETSSHQQQEEE-----EEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERK
        SL  P  SPP P D      T   Q Q  E      + + +  E P+ +RRRT+AD  R+EPPYPWST++ AVIHKLEYL+ANNI+TIKGEVKCK+C+RK
Subjt:  SLHPPSHSPPQPVD----NETSSHQQQEEE-----EEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERK

Query:  YEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRT
         E+EYEL++KF EI RFIE EK++MHDRAP  W NPIL NCNFCNKE+CVEP+I         +  S INWLFLLLG FLGCLKL QL+YFC Q NIHRT
Subjt:  YEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRT

Query:  GAKNRLLYLVYLALCQQLQPSN
        GAK+RL+YL YLALC+QLQP++
Subjt:  GAKNRLLYLVYLALCQQLQPSN

A0A1S3CK70 uncharacterized protein LOC1035013971.9e-6663.44Show/hide
Query:  LNLDLSLHPPSHSPP------QPVDNETSSHQQQEEEE-----EEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVK
        L+LDLSL  PS  PP      QP  +   S    EE E       E    +Q ++RRRRTRAD  R+EPPYPWSTDRRAV+H+L+YLQ NNI+TIKGEV 
Subjt:  LNLDLSLHPPSHSPP------QPVDNETSSHQQQEEEE-----EEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVK

Query:  CKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCA
        CKKCE KYEMEY+LMNK  EITRF E E +SMHDRAPSCW NP LPNC+ CN+EKCV PV  SQE+       ++INWLFL LG+FLGCLKL+QL+YFC 
Subjt:  CKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCA

Query:  QNNIHRTGAKNRLLYLVYLALCQQLQP
        Q NIHRTGAKNRLLYL Y  L +QLQP
Subjt:  QNNIHRTGAKNRLLYLVYLALCQQLQP

A0A6J1GM83 mucin-16-like1.9e-6662.96Show/hide
Query:  PQPVD---NETSSHQQQEEEEEEEEETMEQ---------PRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYE
        PQ V+   N++++  Q  E+   +  T+ Q         PR+RR RTRADTRR+EPPYPWS ++RA IH LEYLQ+NNIVTIKG+V+CKKCER YE+EY 
Subjt:  PQPVD---NETSSHQQQEEEEEEEEETMEQ---------PRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYE

Query:  LMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRL
        LMNKF EI RFIE E+++MHDRAP CW NPILPNC  C +E CVEP+I  +E   DD+QFSRINWLFLLLG+ +G LKLKQL+YFCA    HRTGAK+RL
Subjt:  LMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRL

Query:  LYLVYLALCQQLQPSN
        ++L YLALC+QLQPSN
Subjt:  LYLVYLALCQQLQPSN

A0A6J1I8I0 uncharacterized protein KIAA0754-like1.4e-6461.32Show/hide
Query:  LHPPSHSPPQPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNK
        +  P+     P   E + +Q     +     +  +PR+RR RTRADTRR+EPPYPWS ++RA IH LEYLQ+NNIV IKG+V+CKKCER YE+EY LMNK
Subjt:  LHPPSHSPPQPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNK

Query:  FYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLV
        F EI RFIE E+++MHDRAP CW NPILPNC  C +E CVEP+I  +E   DD+QF RINWLFLLLG+ +G LKLKQL+YFCA    HRTGAK+RL++L 
Subjt:  FYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLV

Query:  YLALCQQLQPSN
        YLALC+QLQPSN
Subjt:  YLALCQQLQPSN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein7.2e-4247.34Show/hide
Query:  RADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPV
        ++DT  + PP+PW+T+RR  I  LEYL++N I TI GEV+C+ CE+ Y++ Y L  +F E+ +F  +EK  M DRA   WA P    C  C +EK V+PV
Subjt:  RADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPV

Query:  IISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQPSN
        I  ++        S+INWLFLLLG+ LG   L+QL+ FC  +  HRTGAK+R+LYL Y+ LC+ LQP +
Subjt:  IISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQPSN

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)3.5e-3645.4Show/hide
Query:  RRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQ
        R + PPYPW+T +   I     L +NNI  I G+V CK C+R   +EY L  KF E+  +I+  K  M  RAP  W+ P L  C  C  E  ++PV+  +
Subjt:  RRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQ

Query:  EEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQP
        +E         INWLFLLLG+ LGC  L QLRYFC  N+ HRTG+K+R++Y+ YL+LC+QL P
Subjt:  EEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLALCQQLQP

AT2G16190.2 FUNCTIONS IN: molecular_function unknown9.8e-2341.98Show/hide
Query:  RRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQ
        R + PPYPW+T +   I     L +NNI  I G+V CK C+R   +EY L  KF E+  +I+  K  M  RAP  W+ P L  C  C  E  ++PV+  +
Subjt:  RRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEMEYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQ

Query:  EEGDDDDQFSRINWLFLLLGKFLGCLKLKQL
        +E         INWLFLLLG+ LGC  L QL
Subjt:  EEGDDDDQFSRINWLFLLLGKFLGCLKLKQL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGACCCAAGAGAAAAGTAGGCAAGGCCGCCTCAATCTTGATCTCTCTCTCCATCCACCGTCGCACTCTCCACCGCAACCGGTGGACAATGAAACTTCAAGCCACCA
ACAACAAGAAGAGGAGGAGGAGGAGGAGGAGGAGACGATGGAACAACCGAGACAGAGACGACGTAGAACGAGAGCCGACACTAGAAGGATGGAGCCACCATATCCATGGT
CAACTGACCGACGAGCGGTAATCCACAAACTGGAGTACCTTCAAGCAAACAACATAGTGACAATCAAGGGGGAAGTAAAATGCAAAAAATGCGAGAGAAAGTATGAGATG
GAGTATGAGTTAATGAATAAGTTTTATGAGATAACAAGGTTTATTGAAAGTGAAAAGAATAGTATGCATGACAGAGCTCCAAGTTGTTGGGCAAACCCTATTTTACCAAA
TTGCAATTTTTGCAATAAAGAAAAATGTGTTGAGCCAGTGATAATTAGTCAAGAGGAAGGGGATGATGACGATCAATTCAGTAGAATCAATTGGCTGTTCTTGCTTTTGG
GAAAATTTCTTGGATGTTTGAAGCTCAAACAACTCAGATACTTTTGTGCTCAAAATAATATTCATCGAACTGGGGCCAAGAATCGTCTTCTTTATCTCGTATATCTTGCT
CTGTGTCAGCAACTTCAACCCTCCAATATACACTTTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGACCCAAGAGAAAAGTAGGCAAGGCCGCCTCAATCTTGATCTCTCTCTCCATCCACCGTCGCACTCTCCACCGCAACCGGTGGACAATGAAACTTCAAGCCACCA
ACAACAAGAAGAGGAGGAGGAGGAGGAGGAGGAGACGATGGAACAACCGAGACAGAGACGACGTAGAACGAGAGCCGACACTAGAAGGATGGAGCCACCATATCCATGGT
CAACTGACCGACGAGCGGTAATCCACAAACTGGAGTACCTTCAAGCAAACAACATAGTGACAATCAAGGGGGAAGTAAAATGCAAAAAATGCGAGAGAAAGTATGAGATG
GAGTATGAGTTAATGAATAAGTTTTATGAGATAACAAGGTTTATTGAAAGTGAAAAGAATAGTATGCATGACAGAGCTCCAAGTTGTTGGGCAAACCCTATTTTACCAAA
TTGCAATTTTTGCAATAAAGAAAAATGTGTTGAGCCAGTGATAATTAGTCAAGAGGAAGGGGATGATGACGATCAATTCAGTAGAATCAATTGGCTGTTCTTGCTTTTGG
GAAAATTTCTTGGATGTTTGAAGCTCAAACAACTCAGATACTTTTGTGCTCAAAATAATATTCATCGAACTGGGGCCAAGAATCGTCTTCTTTATCTCGTATATCTTGCT
CTGTGTCAGCAACTTCAACCCTCCAATATACACTTTCTTTGA
Protein sequenceShow/hide protein sequence
MRTQEKSRQGRLNLDLSLHPPSHSPPQPVDNETSSHQQQEEEEEEEEETMEQPRQRRRRTRADTRRMEPPYPWSTDRRAVIHKLEYLQANNIVTIKGEVKCKKCERKYEM
EYELMNKFYEITRFIESEKNSMHDRAPSCWANPILPNCNFCNKEKCVEPVIISQEEGDDDDQFSRINWLFLLLGKFLGCLKLKQLRYFCAQNNIHRTGAKNRLLYLVYLA
LCQQLQPSNIHFL