; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg22026 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg22026
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionBEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein .
Genome locationCarg_Chr19:6558086..6558577
RNA-Seq ExpressionCarg22026
SyntenyCarg22026
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572022.1 hypothetical protein SDJN03_28750, partial [Cucurbita argyrosperma subsp. sororia]5.0e-9298.77Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
        GPIEPPYPWSTDRIAVV TLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        WEKINWVFLLLGEM+GALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

KAG7011694.1 hypothetical protein SDJN02_26600, partial [Cucurbita argyrosperma subsp. argyrosperma]9.1e-94100Show/hide
Query:  MGPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPK
        MGPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPK
Subjt:  MGPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPK

Query:  EWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        EWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
Subjt:  EWEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

XP_022952797.1 uncharacterized protein LOC111455388 [Cucurbita moschata]9.4e-9197.53Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
        GPIEPPYPWSTDRIAVVHTL YLTSNQILTITGEVKCQQCRRIYE+EYDVVSKFNEIG FVEH MESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

XP_022972401.1 uncharacterized protein LOC111470968 [Cucurbita maxima]1.3e-8794.44Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
        GPIEPPYPWSTDRIAVVHTL YLT NQILTITG+VKCQQCRRIYE+EY+VVSKFNEIG FVEH MESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS I
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

XP_023511615.1 uncharacterized protein LOC111776409 [Cucurbita pepo subsp. pepo]8.8e-8995.68Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
        G IEPPYPWSTDRIAVVHTL YLTSNQI+TITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVE+KMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        WEKINWVFLLLGEMVGAL+LNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQI PSGRF+PI
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

TrEMBL top hitse value%identityAlignment
A0A0A0K3Q8 Uncharacterized protein1.1e-6370Show/hide
Query:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE
        IEPPYPWST+R A+V TL  L SNQIL ITG+V+C+QC+  Y +EYD+ SKF EI  FVE    SFRDRAP+ WM PNYPTCRFCG E G +PVIPK+W 
Subjt:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE

Query:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        KINW+FLLLGEM+G L LNHLKYFCS T NHRTG+K+RL+YLTYITLC Q+DPSGRF+ +
Subjt:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

A0A1S3BHR1 uncharacterized protein LOC1034897702.1e-6470Show/hide
Query:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE
        IEPPYPWST+R A+V TL  L S+QIL ITG+V+C+QC+  Y +EYD+VSKF EI  FVE     FRDRAP+ WM PNYPTCRFCG E G +PVIP EW 
Subjt:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE

Query:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        KINW+FLLLGEM+G L LNHLKYFCSYT NHRTG+K+RL+YLTYITLC Q+DPSGRF+ +
Subjt:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

A0A5A7T547 Uncharacterized protein8.1e-5669.66Show/hide
Query:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE
        IEPPYPWST+R A+V TL  L S+QIL ITG+V+C+QC+  Y +EYD+VSKF EI  FVE     FRDRAP+ WM PNYPTCRFCG E G +PVIP EW 
Subjt:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE

Query:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYI
        KINW+FLLLGEM+G L LNHLKYFCSYT NHRTG+K+RL+YLT I
Subjt:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYI

A0A6J1GLD4 uncharacterized protein LOC1114553884.6e-9197.53Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
        GPIEPPYPWSTDRIAVVHTL YLTSNQILTITGEVKCQQCRRIYE+EYDVVSKFNEIG FVEH MESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

A0A6J1I5V9 uncharacterized protein LOC1114709686.2e-8894.44Show/hide
Query:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE
        GPIEPPYPWSTDRIAVVHTL YLT NQILTITG+VKCQQCRRIYE+EY+VVSKFNEIG FVEH MESFRDRAPK+WMQPNYPTCRFCGAEKGVKPVIPKE
Subjt:  GPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKE

Query:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI
        WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS I
Subjt:  WEKINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49330.1 hydroxyproline-rich glycoprotein family protein2.4e-4448.37Show/hide
Query:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE
        I PP+PW+T+R   + +L+YL SNQI TITGEV+C+ C ++Y++ Y++  +F E+ +F   +    RDRA K+W  P    C  CG EK VKPVI +   
Subjt:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE

Query:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDP
        +INW+FLLLG+ +G   L  LK FC ++KNHRTG+KDR++YLTY+ LC+ + P
Subjt:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDP

AT2G16190.1 BEST Arabidopsis thaliana protein match is: hydroxyproline-rich glycoprotein family protein (TAIR:AT1G49330.1)4.7e-4044.94Show/hide
Query:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE
        I PPYPW+T +   + + + L+SN I  I+G+V C+ C R   +EY++  KF+E+  +++   E  R RAP  W  P    CR C +E  +KPV+ +  E
Subjt:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE

Query:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS
        +INW+FLLLG+M+G   L+ L+YFC     HRTGSKDR+VY+TY++LC+Q+DP G F+
Subjt:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFS

AT2G16190.2 FUNCTIONS IN: molecular_function unknown1.4e-2339.85Show/hide
Query:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE
        I PPYPW+T +   + + + L+SN I  I+G+V C+ C R   +EY++  KF+E+  +++   E  R RAP  W  P    CR C +E  +KPV+ +  E
Subjt:  IEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWE

Query:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRT
        +INW+FLLLG+M+G   L+ L    S  K+H T
Subjt:  KINWVFLLLGEMVGALKLNHLKYFCSYTKNHRT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCCGATCGAGCCACCATATCCGTGGTCGACGGACCGAATAGCGGTGGTTCATACGCTACAGTATTTGACATCGAACCAAATCCTGACGATCACCGGGGAAGTCAA
GTGCCAACAATGTCGGAGAATTTACGAGATGGAATACGACGTTGTTTCGAAGTTTAACGAGATTGGGAGGTTCGTAGAGCACAAGATGGAGTCGTTCCGGGACCGGGCGC
CGAAGGAGTGGATGCAGCCGAATTATCCGACGTGTCGGTTTTGCGGGGCGGAAAAAGGAGTGAAGCCGGTGATTCCAAAGGAATGGGAGAAGATCAATTGGGTGTTCTTG
CTTTTGGGGGAAATGGTTGGAGCTTTGAAACTGAATCATTTGAAGTACTTTTGTAGTTACACGAAGAATCATCGAACAGGTTCAAAGGATCGTCTTGTTTATCTCACTTA
TATCACTTTGTGCCGCCAAATTGATCCTTCTGGTCGTTTCAGTCCAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGCCGATCGAGCCACCATATCCGTGGTCGACGGACCGAATAGCGGTGGTTCATACGCTACAGTATTTGACATCGAACCAAATCCTGACGATCACCGGGGAAGTCAA
GTGCCAACAATGTCGGAGAATTTACGAGATGGAATACGACGTTGTTTCGAAGTTTAACGAGATTGGGAGGTTCGTAGAGCACAAGATGGAGTCGTTCCGGGACCGGGCGC
CGAAGGAGTGGATGCAGCCGAATTATCCGACGTGTCGGTTTTGCGGGGCGGAAAAAGGAGTGAAGCCGGTGATTCCAAAGGAATGGGAGAAGATCAATTGGGTGTTCTTG
CTTTTGGGGGAAATGGTTGGAGCTTTGAAACTGAATCATTTGAAGTACTTTTGTAGTTACACGAAGAATCATCGAACAGGTTCAAAGGATCGTCTTGTTTATCTCACTTA
TATCACTTTGTGCCGCCAAATTGATCCTTCTGGTCGTTTCAGTCCAATTTGA
Protein sequenceShow/hide protein sequence
MGPIEPPYPWSTDRIAVVHTLQYLTSNQILTITGEVKCQQCRRIYEMEYDVVSKFNEIGRFVEHKMESFRDRAPKEWMQPNYPTCRFCGAEKGVKPVIPKEWEKINWVFL
LLGEMVGALKLNHLKYFCSYTKNHRTGSKDRLVYLTYITLCRQIDPSGRFSPI