; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C11G221925 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C11G221925
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionExtensin-like protein
Genome locationCla97Chr11:27929315..27959870
RNA-Seq ExpressionCla97C11G221925
SyntenyCla97C11G221925
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649619.1 hypothetical protein Csa_012837 [Cucumis sativus]4.3e-6585.8Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK
        MD DEFYRQPAAVPFKWEIKPGVPRNHHRLR SPTHS PQHH QKLKPPPAVS+F HP NSLHSS RT+S+RWRF RS     EQV SSGCFPSPLPNRK
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK

Query:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWA
        SPK+VSRK PEPDYSS+L+TLSRWSVSSRKSISPFRYSVSSSPSSFSS QSSPRPTSDT+WA
Subjt:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWA

XP_004142634.1 uncharacterized protein LOC101220757 [Cucumis sativus]8.6e-6685.89Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK
        MD DEFYRQPAAVPFKWEIKPGVPRNHHRLR SPTHS PQHH QKLKPPPAVS+F HP NSLHSS RT+S+RWRF RS     EQV SSGCFPSPLPNRK
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK

Query:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG
        SPK+VSRK PEPDYSS+L+TLSRWSVSSRKSISPFRYSVSSSPSSFSS QSSPRPTSDT+WAG
Subjt:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG

XP_008444194.1 PREDICTED: uncharacterized protein LOC103487607 [Cucumis melo]9.5e-6585.98Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK
        MD DEFYR+PAAVPFKWEIKPGVPRNHHR RQSPTHS PQHH QKLKPPPAVS+F HPSNSLHSS RTRSDRWRF RS     EQV SSGCFPSPLPNRK
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK

Query:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSV-SSSPSSFSSNQSSPRPTSDTDWAG
        SPK +SRK PEPDYSS+L+TLSRWSVSSRKSISPFRYSV SSSPSSFSS QSSPRPTSDT+WAG
Subjt:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSV-SSSPSSFSSNQSSPRPTSDTDWAG

XP_022131529.1 uncharacterized protein DKFZp434B061-like [Momordica charantia]7.8e-5977.58Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAVSNFLHPSN----SLHSSSRTRSDRWRFARSNLLEPEQVS--SGCFPSPLP
        MD DEFYR+PAAVPFKWEIKPGVPR HHRL  SP+  P   QKLKPPP VS+F  PS     SLHSSSRTRSDRWRFARS+L EP QVS  +GCFPSP P
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAVSNFLHPSN----SLHSSSRTRSDRWRFARSNLLEPEQVS--SGCFPSPLP

Query:  NRKSPKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG
        NRKS K+++RKPEP+Y++ELETLSRWSVSSRKSISPFR SVSSSPSSFSS QSSPRPTSDT+WAG
Subjt:  NRKSPKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG

XP_038899347.1 uncharacterized protein LOC120086669 [Benincasa hispida]5.7e-7086.88Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHHQKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKSP
        MDVDEFYRQPAAVPFKWEIKPGVP+NHHRLR SPTHS PQHHQKLKPPP+VSNFLHPSNSLHSSSRTRSDRWRF+     +PEQVSSGCFPSPLPNRKS 
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHHQKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKSP

Query:  KTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG
        K++SR PEPDYSS LE+LSRWSVSSRKSISPFRYSVSSSPSS+SS  SSPRPTSDT+WAG
Subjt:  KTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG

TrEMBL top hitse value%identityAlignment
A0A0A0KY52 Uncharacterized protein1.8e-4582.93Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK
        MD DEFYRQPAAVPFKWEIKPGVPRNHHRLR SPTHS PQHH QKLKPPPAVS+F HP NSLHSS RT+S+RWRF RS     EQV SSGCFPSPLPNRK
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK

Query:  SPKTVSRK-PEPDYSSELETLSR
        SPK+VSRK PEPDYSS+L+TLSR
Subjt:  SPKTVSRK-PEPDYSSELETLSR

A0A1S3BAM4 uncharacterized protein LOC1034876074.6e-6585.98Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK
        MD DEFYR+PAAVPFKWEIKPGVPRNHHR RQSPTHS PQHH QKLKPPPAVS+F HPSNSLHSS RTRSDRWRF RS     EQV SSGCFPSPLPNRK
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHS-PQHH-QKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQV-SSGCFPSPLPNRK

Query:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSV-SSSPSSFSSNQSSPRPTSDTDWAG
        SPK +SRK PEPDYSS+L+TLSRWSVSSRKSISPFRYSV SSSPSSFSS QSSPRPTSDT+WAG
Subjt:  SPKTVSRK-PEPDYSSELETLSRWSVSSRKSISPFRYSV-SSSPSSFSSNQSSPRPTSDTDWAG

A0A6J1BQH3 uncharacterized protein DKFZp434B061-like3.8e-5977.58Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAVSNFLHPSN----SLHSSSRTRSDRWRFARSNLLEPEQVS--SGCFPSPLP
        MD DEFYR+PAAVPFKWEIKPGVPR HHRL  SP+  P   QKLKPPP VS+F  PS     SLHSSSRTRSDRWRFARS+L EP QVS  +GCFPSP P
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAVSNFLHPSN----SLHSSSRTRSDRWRFARSNLLEPEQVS--SGCFPSPLP

Query:  NRKSPKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG
        NRKS K+++RKPEP+Y++ELETLSRWSVSSRKSISPFR SVSSSPSSFSS QSSPRPTSDT+WAG
Subjt:  NRKSPKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG

A0A6J1FHC7 uncharacterized protein LOC1114457754.6e-5777.02Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAV--SNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKS
        MDVDEFYRQPAAVPFKWEIKPGVPRNHHRL Q PTHSPQ H+KLKPPPAV  + F   SNSL    RTRSDRW   +S L EPEQVS GCF SPLPNRK+
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAV--SNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKS

Query:  PKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG
         K V+RKPEPDY+SELETL RWSVSS+KSISPFR SVSS  SS SS QSSPRPTSD++WAG
Subjt:  PKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAG

A0A6J1ISY3 uncharacterized protein LOC1114803257.4e-5574.23Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAV--SNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKS
        MDVDEFYRQPAAVPFKWEIKPGVPRNHH L   PTHSPQ H+KLKPPPAV  + F   SNSL    RTRSDRW  ++S L EPEQVS GCF SPLPNRK+
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAV--SNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKS

Query:  PKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSS--SPSSFSSNQSSPRPTSDTDWAG
         K ++RKPEPD +SELETL RWS+SS+KSISPFR SVSS  SPSS SS QSSPRPTSD++WAG
Subjt:  PKTVSRKPEPDYSSELETLSRWSVSSRKSISPFRYSVSS--SPSSFSSNQSSPRPTSDTDWAG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21695.1 hydroxyproline-rich glycoprotein family protein2.0e-0431.4Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPR-----NHHRLRQSPTHSPQHHQKLK-PPPAVSNFLHPSNSLHSSSRTR--SDRWRFARSNLLEPEQVSSGCFPSP
        +D D+ +++P AVPFKWEI+PGVP+     +   L Q P  +     KLK  PP+ S     S+S  S SR+R  S         L  P    S C  SP
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPR-----NHHRLRQSPTHSPQHHQKLK-PPPAVSNFLHPSNSLHSSSRTR--SDRWRFARSNLLEPEQVSSGCFPSP

Query:  LP--NRKSPKTVSRKP-----------EPDYSSELETLSRWSVSSRKSISP--FRYSVSSSPSSFSSNQSSP
         P     SP+     P           +   S +L+   +   SS  S S   +    + SPS   S+Q SP
Subjt:  LP--NRKSPKTVSRKP-----------EPDYSSELETLSRWSVSSRKSISP--FRYSVSSSPSSFSSNQSSP

AT1G77400.1 CONTAINS InterPro DOMAIN/s: Protein of unknown function DUF688 (InterPro:IPR007789)1.9e-1531.58Show/hide
Query:  MDVDEFYRQPAAVPFKWEIKPGVPR--------------------------NHHRLRQSPTHSPQHH-----------------------QKLKP---PP
        +DVD+ +++P  +PF WEI+PGVP+                          +H +    P  SP                           KLKP   P 
Subjt:  MDVDEFYRQPAAVPFKWEIKPGVPR--------------------------NHHRLRQSPTHSPQHH-----------------------QKLKP---PP

Query:  AVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQ----------VSSGCFPSP---LPNRKS----PKTVSRKPEPDYSSELETLSRWSVSSRKSISPF
        ++S F  P  S  SS R  S+RW+  R N + PE              GCFPSP   L   KS     K+ SR     Y S++ET+S W+VSSR+S+SP 
Subjt:  AVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQ----------VSSGCFPSP---LPNRKS----PKTVSRKPEPDYSSELETLSRWSVSSRKSISPF

Query:  RYSVSSSPSSFSSNQSSPRPTSDTDWAG
             S  SSFSS + SPR  ++ +W G
Subjt:  RYSVSSSPSSFSSNQSSPRPTSDTDWAG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACTTTCTGTCAATGGACGTCGATGAATTTTACCGGCAACCGGCTGCCGTTCCTTTCAAATGGGAGATCAAACCGGGCGTCCCCAGAAACCACCACCGCCTCCGACA
GTCTCCAACTCACTCTCCTCAGCACCATCAAAAGCTGAAGCCTCCTCCTGCTGTATCCAACTTCCTCCATCCTTCAAATTCCCTCCACTCCTCCTCTCGAACCCGGTCCG
ACCGGTGGCGGTTTGCCCGCTCCAACCTCCTCGAGCCCGAGCAAGTCTCTTCCGGTTGCTTCCCCTCGCCTTTGCCCAACCGGAAATCGCCCAAGACCGTGAGCCGGAAA
CCCGAACCGGATTACTCCTCTGAATTGGAGACTTTGTCGCGGTGGTCTGTTTCCAGCCGGAAGTCGATTTCGCCGTTCCGATATTCAGTTTCGTCGTCGCCGTCGTCGTT
CTCGTCGAACCAGTCATCTCCCCGTCCGACTAGTGATACTGATTGGGCCGGTTGTTCTCCCACAATCTATTTACACCTGTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCACTTTCTGTCAATGGACGTCGATGAATTTTACCGGCAACCGGCTGCCGTTCCTTTCAAATGGGAGATCAAACCGGGCGTCCCCAGAAACCACCACCGCCTCCGACA
GTCTCCAACTCACTCTCCTCAGCACCATCAAAAGCTGAAGCCTCCTCCTGCTGTATCCAACTTCCTCCATCCTTCAAATTCCCTCCACTCCTCCTCTCGAACCCGGTCCG
ACCGGTGGCGGTTTGCCCGCTCCAACCTCCTCGAGCCCGAGCAAGTCTCTTCCGGTTGCTTCCCCTCGCCTTTGCCCAACCGGAAATCGCCCAAGACCGTGAGCCGGAAA
CCCGAACCGGATTACTCCTCTGAATTGGAGACTTTGTCGCGGTGGTCTGTTTCCAGCCGGAAGTCGATTTCGCCGTTCCGATATTCAGTTTCGTCGTCGCCGTCGTCGTT
CTCGTCGAACCAGTCATCTCCCCGTCCGACTAGTGATACTGATTGGGCCGGTTGTTCTCCCACAATCTATTTACACCTGTCTTAA
Protein sequenceShow/hide protein sequence
MHFLSMDVDEFYRQPAAVPFKWEIKPGVPRNHHRLRQSPTHSPQHHQKLKPPPAVSNFLHPSNSLHSSSRTRSDRWRFARSNLLEPEQVSSGCFPSPLPNRKSPKTVSRK
PEPDYSSELETLSRWSVSSRKSISPFRYSVSSSPSSFSSNQSSPRPTSDTDWAGCSPTIYLHLS