; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ChyUNG227170 (gene) of Cucumber (hystrix) v1 genome

Gene IDChyUNG227170
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionHydroxyproline-rich glycoprotein family protein
Genome locationscaffold25_size3045174:2120858..2124646
RNA-Seq ExpressionChyUNG227170
SyntenyChyUNG227170
Gene Ontology termsGO:0006353 - DNA-templated transcription, termination (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003690 - double-stranded DNA binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648356.1 hypothetical protein Csa_023118, partial [Cucumis sativus]3.74e-7390.84Show/hide
Query:  MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQ-QRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKY
        MNAE+GN+E SP LDLSLRPPSAQPPPAPEYPQ+SSTLPSSQIPNEESNA PQPNIETSN+QQQ +RRLRRRRTRAD TRIEPPYPWATDKRAVVHELKY
Subjt:  MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQ-QRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMTIKGEVICKKCEMKYEMEYDLMNK
        LQSNNIM IKGEVICKKCEMKYE+EYDLMNK
Subjt:  LQSNNIMTIKGEVICKKCEMKYEMEYDLMNK

KAG7011696.1 hypothetical protein SDJN02_26602, partial [Cucurbita argyrosperma subsp. argyrosperma]1.35e-2156.31Show/hide
Query:  EYPQMSSTLPSS--QIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEVICKKCEMKYEMEYDL
        E P  S+T+  +  Q PN+ S  IPQ    T+     + R RR RTRADT RIEPPYPW+ ++RA +H L+YLQSNNI+TIKG+V CKKCE  YE+EY+L
Subjt:  EYPQMSSTLPSS--QIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEVICKKCEMKYEMEYDL

Query:  MNK
        MNK
Subjt:  MNK

XP_008463189.1 PREDICTED: uncharacterized protein LOC103501397 [Cucumis melo]1.10e-4780.51Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQM-SSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEV
        LDLSLR PS +PP  P+Y Q  +STLPS QI NEES AI +PN ETSN QQQQRR  RRRTRAD TRIEPPYPW+TD+RAVVHELKYLQ NNIMTIKGEV
Subjt:  LDLSLRPPSAQPPPAPEYPQM-SSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEV

Query:  ICKKCEMKYEMEYDLMNK
        ICKKCEMKYEMEYDLMNK
Subjt:  ICKKCEMKYEMEYDLMNK

XP_011656206.1 uncharacterized protein LOC105435666 [Cucumis sativus]5.18e-7290.84Show/hide
Query:  MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQ-QRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKY
        MNAE+GN+E SP LDLSLRPPSAQPPPAPEYPQ+SSTLPSSQIPNEESNA PQPNIETSN+QQQ +RRLRRRRTRAD TRIEPPYPWATDKRAVVHELKY
Subjt:  MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQ-QRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMTIKGEVICKKCEMKYEMEYDLMNK
        LQSNNIM IKGEVICKKCEMKYE+EYDLMNK
Subjt:  LQSNNIMTIKGEVICKKCEMKYEMEYDLMNK

XP_038895979.1 junction-mediating and -regulatory protein-like [Benincasa hispida]6.12e-3659.31Show/hide
Query:  LDLSLR----PPSAQPPPAP---------------EYPQMSST--LPSSQIPNEESNAIPQPNIETSNEQQQ-------QRRLRRRRTRADTTRIEPPYP
        L+LSLR    PP   PPP P               EYP +S+T  L   + PNE      Q N ETSN QQQ       Q R RRRRTRAD TRIEPPYP
Subjt:  LDLSLR----PPSAQPPPAP---------------EYPQMSST--LPSSQIPNEESNAIPQPNIETSNEQQQ-------QRRLRRRRTRADTTRIEPPYP

Query:  WATDKRAVVHELKYLQSNNIMTIKGEVICKKCEMKYEMEYDLMNK
        W+TD+RAV+HELKYLQSNNI+TIKGEV CKKCE KYEMEYDLMNK
Subjt:  WATDKRAVVHELKYLQSNNIMTIKGEVICKKCEMKYEMEYDLMNK

TrEMBL top hitse value%identityAlignment
A0A0A0KMQ2 Uncharacterized protein1.7e-5690.15Show/hide
Query:  MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQ-QRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKY
        MNAE+GN+E SP LDLSLRPPSAQPPPAPEYPQ+SSTLPSSQIPNEESNA PQPNIETSN+QQQ +RRLRRRRTRAD TRIEPPYPWATDKRAVVHELKY
Subjt:  MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQ-QRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKY

Query:  LQSNNIMTIKGEVICKKCEMKYEMEYDLMNKM
        LQSNNIM IKGEVICKKCEMKYE+EYDLMNK+
Subjt:  LQSNNIMTIKGEVICKKCEMKYEMEYDLMNKM

A0A0A0LAK2 Uncharacterized protein1.6e-1739.61Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQMSSTLPSSQ--------IPNEESNAIPQPNIETSNEQQQQR---------------------RLRRRRTRADTTRIEPPY
        L LSL PP   PPP    P   S LPS+         +P++  + +P   ++  + QQ Q                      R +R R +ADT+RIEPPY
Subjt:  LDLSLRPPSAQPPPAPEYPQMSSTLPSSQ--------IPNEESNAIPQPNIETSNEQQQQR---------------------RLRRRRTRADTTRIEPPY

Query:  PWATDKRAVVHELKYLQSNNIMTIKGEVICKKCEMKYEMEYDLMNKMLLRLKLV
        PW+T++ A++H+L+YLQ+NNI TIKGEV CK+C+ K E+EYDLM+K    +K +
Subjt:  PWATDKRAVVHELKYLQSNNIMTIKGEVICKKCEMKYEMEYDLMNKMLLRLKLV

A0A1S3CK70 uncharacterized protein LOC1035013978.1e-3879.83Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQ-MSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEV
        LDLSLR PS +PP  P+Y Q  +STLP SQI NEES AI +PN ETSN QQQQR  RRRRTRAD TRIEPPYPW+TD+RAVVHELKYLQ NNIMTIKGEV
Subjt:  LDLSLRPPSAQPPPAPEYPQ-MSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEV

Query:  ICKKCEMKYEMEYDLMNKM
        ICKKCEMKYEMEYDLMNK+
Subjt:  ICKKCEMKYEMEYDLMNKM

A0A5D3CY08 Transcription termination factor MTEF12.1e-1735.5Show/hide
Query:  SPGLDLSLRPPSAQPPPAP-EYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIK
        SP    S  P   Q PP P +Y Q   T  +   P E    IP+P  +T N+  +  + +RRRT+AD +RIEPPYPW+T+K AV+H+L+YL++NNI+TIK
Subjt:  SPGLDLSLRPPSAQPPPAP-EYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIK

Query:  GEVICKKCEMKYEMEYDLMNKMLLRLKLVYVNITLNDIGYHSSERREACVLYHLLLFSSTSRLFSFRAAAAPRFTSFILRPVQSSSKLRSVRIRERPRAI
        GEV CK+C+ K E+EY+L++K     +++ +    +   +  SE    C L  L   S++S + S               P  +S    +V+++  P+A+
Subjt:  GEVICKKCEMKYEMEYDLMNKMLLRLKLVYVNITLNDIGYHSSERREACVLYHLLLFSSTSRLFSFRAAAAPRFTSFILRPVQSSSKLRSVRIRERPRAI

A0A6J1GM83 mucin-16-like8.4e-1953.45Show/hide
Query:  SLRPPSAQPPPAPEYPQMSSTLPSS--QIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEVIC
        +L  P A P    E P  S+T+  +  Q PN +S  IPQ    T+     + R RR RTRADT RIEPPYPW+ ++RA +H L+YLQSNNI+TIKG+V C
Subjt:  SLRPPSAQPPPAPEYPQMSSTLPSS--QIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKGEVIC

Query:  KKCEMKYEMEYDLMNK
        KKCE  YE+EY+LMNK
Subjt:  KKCEMKYEMEYDLMNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein8.6e-0835Show/hide
Query:  PPSAQPP--------PAPEYPQM-SSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKG
        PPS Q P          P  PQ+ +   P S +    SN  P P         +  R R   ++   T I PP+PWAT++R  +  L+YL+SN I TI G
Subjt:  PPSAQPP--------PAPEYPQM-SSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKG

Query:  EVICKKCEMKYEMEYDLMNK
        EV C+ CE  Y++ Y+L  +
Subjt:  EVICKKCEMKYEMEYDLMNK

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)1.2e-0429.75Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRR----RTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIK
        L+ ++ PP+        Y      LP  Q+    + A+  P        Q +R  +R             I PPYPWAT K   +   + L SNNI  I 
Subjt:  LDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRR----RTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIK

Query:  GEVICKKCEMKYEMEYDLMNK
        G+V CK C+    +EY+L  K
Subjt:  GEVICKKCEMKYEMEYDLMNK

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.2e-0429.75Show/hide
Query:  LDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRR----RTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIK
        L+ ++ PP+        Y      LP  Q+    + A+  P        Q +R  +R             I PPYPWAT K   +   + L SNNI  I 
Subjt:  LDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRR----RTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIK

Query:  GEVICKKCEMKYEMEYDLMNK
        G+V CK C+    +EY+L  K
Subjt:  GEVICKKCEMKYEMEYDLMNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCCGAAAATGGCAATGACGAAACGAGCCCCGGTCTCGATCTTTCTCTCCGTCCGCCGTCTGCTCAACCTCCTCCAGCACCTGAATATCCACAGATGTCATCGAC
ACTGCCCTCGTCGCAAATTCCAAACGAAGAATCCAACGCAATCCCTCAACCCAACATTGAAACTTCAAATGAGCAACAACAACAACGGAGACTGAGACGACGTAGAACGA
GAGCAGACACGACAAGGATTGAGCCACCGTATCCATGGGCGACTGACAAACGAGCGGTAGTCCACGAACTCAAGTACCTTCAATCGAACAACATAATGACAATCAAGGGG
GAAGTGATATGCAAAAAATGCGAGATGAAGTATGAAATGGAGTATGATCTAATGAATAAGATGTTGTTAAGACTGAAACTTGTATATGTTAACATTACGTTGAATGATAT
TGGTTATCATTCAAGTGAACGTCGTGAAGCTTGCGTTCTTTATCATCTCCTTCTCTTCTCATCCACCAGTCGTCTATTTTCCTTTCGTGCTGCTGCTGCTCCACGCTTTA
CTAGCTTCATCCTCCGTCCCGTGCAATCGTCTTCGAAGCTCCGTTCGGTCCGCATCAGAGAACGTCCTCGAGCCATCCCTCTCTGTCTCAAGGTGCGTGCGTCGGTCACC
AATTGTTCGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGCCGAAAATGGCAATGACGAAACGAGCCCCGGTCTCGATCTTTCTCTCCGTCCGCCGTCTGCTCAACCTCCTCCAGCACCTGAATATCCACAGATGTCATCGAC
ACTGCCCTCGTCGCAAATTCCAAACGAAGAATCCAACGCAATCCCTCAACCCAACATTGAAACTTCAAATGAGCAACAACAACAACGGAGACTGAGACGACGTAGAACGA
GAGCAGACACGACAAGGATTGAGCCACCGTATCCATGGGCGACTGACAAACGAGCGGTAGTCCACGAACTCAAGTACCTTCAATCGAACAACATAATGACAATCAAGGGG
GAAGTGATATGCAAAAAATGCGAGATGAAGTATGAAATGGAGTATGATCTAATGAATAAGATGTTGTTAAGACTGAAACTTGTATATGTTAACATTACGTTGAATGATAT
TGGTTATCATTCAAGTGAACGTCGTGAAGCTTGCGTTCTTTATCATCTCCTTCTCTTCTCATCCACCAGTCGTCTATTTTCCTTTCGTGCTGCTGCTGCTCCACGCTTTA
CTAGCTTCATCCTCCGTCCCGTGCAATCGTCTTCGAAGCTCCGTTCGGTCCGCATCAGAGAACGTCCTCGAGCCATCCCTCTCTGTCTCAAGGTGCGTGCGTCGGTCACC
AATTGTTCGAGTTGA
Protein sequenceShow/hide protein sequence
MNAENGNDETSPGLDLSLRPPSAQPPPAPEYPQMSSTLPSSQIPNEESNAIPQPNIETSNEQQQQRRLRRRRTRADTTRIEPPYPWATDKRAVVHELKYLQSNNIMTIKG
EVICKKCEMKYEMEYDLMNKMLLRLKLVYVNITLNDIGYHSSERREACVLYHLLLFSSTSRLFSFRAAAAPRFTSFILRPVQSSSKLRSVRIRERPRAIPLCLKVRASVT
NCSS