; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G23540 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G23540
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionDefensin-like protein
Genome locationClcChr01:34289051..34292657
RNA-Seq ExpressionClc01G23540
SyntenyClc01G23540
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7026578.1 hypothetical protein SDJN02_10580 [Cucurbita argyrosperma subsp. argyrosperma]9.0e-3661.94Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSE                                        ECFHAC SGCGFKFDV+S  ADKIQ 
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRP KPVP+VSKP+PRPPI E +AKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

XP_004134712.1 uncharacterized protein LOC101206260 [Cucumis sativus]3.9e-3967.16Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSE                                        ECFHACVSGCGFKF+VE EKADKIQP
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRPAKP PVVSKPSPRPPI ESIAKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

XP_008439869.1 PREDICTED: uncharacterized protein LOC103484528 isoform X1 [Cucumis melo]1.5e-3866.42Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFK+HQWSAYIDRSPGSASYSE                                        ECFHACVSGCGFKF+VE EKADKIQP
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRPAKP PVVSKPSPRPPI ESIAKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

XP_038883449.1 uncharacterized protein LOC120074406 isoform X1 [Benincasa hispida]1.6e-4069.4Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSE                                        ECFHACVSGCGFKFDVE EKADKIQP
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

XP_038883451.1 uncharacterized protein LOC120074406 isoform X2 [Benincasa hispida]4.1e-4461.4Show/hide
Query:  MQMGRPGRNQVYMPSHLCVQLISLLRFIEILGHRSQGLEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLT
        MQMGRPGRN                         SQGLEECTEICYKDPVFKDHQWSAYIDRSPGSASYSE                             
Subjt:  MQMGRPGRNQVYMPSHLCVQLISLLRFIEILGHRSQGLEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLT

Query:  YFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQPNRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
                   ECFHACVSGCGFKFDVE EKADKIQPNRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
Subjt:  YFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQPNRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

TrEMBL top hitse value%identityAlignment
A0A1S3AZC9 uncharacterized protein LOC103484528 isoform X17.2e-3966.42Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFK+HQWSAYIDRSPGSASYSE                                        ECFHACVSGCGFKF+VE EKADKIQP
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRPAKP PVVSKPSPRPPI ESIAKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

A0A6J1CKR8 uncharacterized protein LOC1110125091.3e-3561.94Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPV KDHQWSAYIDRSPG+ASYSE                                        ECFHACVSGCGFKF+V+SEKADKIQP
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRP KP PVV  PSPRPPI ES  KNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

A0A6J1EJ45 uncharacterized protein LOC1114337864.4e-3661.94Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSE                                        ECFHAC SGCGFKFDV+S  ADKIQ 
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRP KPVP+VSKP+PRPPI E +AKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

A0A6J1KVS0 uncharacterized protein LOC1114968774.4e-3661.94Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSE                                        ECFHAC SGCGFKFDV+S  ADKIQ 
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRP KPVP+VSKP+PRPPI E +AKNEDLPSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

A0A7N2LNQ0 Uncharacterized protein8.6e-2448.51Show/hide
Query:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP
        LEEC EICYKDPV KDHQWS+YIDRSPG ++YSE                                        ECFHACVSGCG+KFD+ SEK DK++P
Subjt:  LEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQP

Query:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA
        NRP K  P V  P P  P  + I   ED+PSTSA
Subjt:  NRPAKPVPVVSKPSPRPPIAESIAKNEDLPSTSA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G21720.1 unknown protein7.4e-2041.18Show/hide
Query:  ECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQPNR
        +C+EICYKDPV KD  W+A IDRSPG A YSE                                        ECFHACV+GCG+KFDVE+E  +K++P R
Subjt:  ECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQPNR

Query:  PAKPVPVVSKPSPRPPIAES----IAKNEDLPSTSA
        P  P P   KP P PP   S        ED+ +TSA
Subjt:  PAKPVPVVSKPSPRPPIAES----IAKNEDLPSTSA

AT4G21720.2 unknown protein1.1e-2041.13Show/hide
Query:  SQGLEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADK
        S G+ +C+EICYKDPV KD  W+A IDRSPG A YSE                                        ECFHACV+GCG+KFDVE+E  +K
Subjt:  SQGLEECTEICYKDPVFKDHQWSAYIDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADK

Query:  IQPNRPAKPVPVVSKPSPRPPIAES----IAKNEDLPSTSA
        ++P RP  P P   KP P PP   S        ED+ +TSA
Subjt:  IQPNRPAKPVPVVSKPSPRPPIAES----IAKNEDLPSTSA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGTCTCCTTGTGGAAACCAATGCACGCAGAAATACGCAGCCCTCACCCAGATTCCTTAAACTCACGTCGTGTATGTTCGCTCTCTATAACTCGAAATGTTGAATT
CCTCGCGTCTGACCGGGGCGGGTATTTTGCAAAAAGGGGTGTGATGCAGATGGGGAGGCCTGGGAGGAATCAAGTGTACATGCCATCACACTTGTGCGTTCAATTGATTA
GCTTGTTGAGGTTCATTGAAATTCTGGGGCACAGGTCGCAAGGCTTGGAAGAATGCACTGAGATATGCTATAAGGATCCTGTATTTAAGGACCACCAGTGGAGTGCTTAC
ATTGACCGATCTCCTGGGTCTGCCAGTTACTCAGAGGTATTTAATTGTCCCATTGTTATTTTTTTGGCATTTTTATTGACTCTGATGATTTCACCATATTTGAGACAATG
TGTCTATTTAACATACTTTTGGATTTTGACTCTTTGCTGCTCACAGGAGTGTTTCCATGCTTGTGTATCTGGTTGCGGCTTTAAGTTTGATGTTGAATCAGAGAAAGCTG
ACAAAATCCAACCAAACAGGCCTGCAAAGCCAGTGCCTGTTGTTTCTAAACCATCCCCACGACCTCCAATCGCTGAATCCATTGCCAAAAATGAAGATTTACCAAGCACT
TCTGCATAG
mRNA sequenceShow/hide mRNA sequence
TAGGCCTAATCAAACTCTAAATTTTAGGCCTAATCAATCTAAATCTGTAATTGAGATTAAACCAGTAATTTACCGAAGGATCATAATAAGGAAAAAGAAAATTCAGAGAA
TGCGAGCATTCGCAATTTCATTCCTTTTGGGCTTCTCCTCAACATTAAAAGAAGAAAACAATCGGCAGAGAGCGGTGAAGATTTGAACTTTGAAGGAGGCTGTGTATTAT
ATGATGATACCGCAACAATGGGCGTCTCCTTGTGGAAACCAATGCACGCAGAAATACGCAGCCCTCACCCAGATTCCTTAAACTCACGTCGTGTATGTTCGCTCTCTATA
ACTCGAAATGTTGAATTCCTCGCGTCTGACCGGGGCGGGTATTTTGCAAAAAGGGGTGTGATGCAGATGGGGAGGCCTGGGAGGAATCAAGTGTACATGCCATCACACTT
GTGCGTTCAATTGATTAGCTTGTTGAGGTTCATTGAAATTCTGGGGCACAGGTCGCAAGGCTTGGAAGAATGCACTGAGATATGCTATAAGGATCCTGTATTTAAGGACC
ACCAGTGGAGTGCTTACATTGACCGATCTCCTGGGTCTGCCAGTTACTCAGAGGTATTTAATTGTCCCATTGTTATTTTTTTGGCATTTTTATTGACTCTGATGATTTCA
CCATATTTGAGACAATGTGTCTATTTAACATACTTTTGGATTTTGACTCTTTGCTGCTCACAGGAGTGTTTCCATGCTTGTGTATCTGGTTGCGGCTTTAAGTTTGATGT
TGAATCAGAGAAAGCTGACAAAATCCAACCAAACAGGCCTGCAAAGCCAGTGCCTGTTGTTTCTAAACCATCCCCACGACCTCCAATCGCTGAATCCATTGCCAAAAATG
AAGATTTACCAAGCACTTCTGCATAGAAACACACAATGCAGATAAACGAGGAAACCTGGCGCATCAAAATTATAGTCAGTCTTTTAGGCTTACGTCTCCCGTTGCTATGT
TTTCCCGCACGTGGGCTTCCATATTCTGTTGGATTATATATTATGACAGTCACTTGTAAGTGCAGTCCTTCCTTGCCTGTGAAAGGACTTAGGTCTCTGACATAGATGTT
TGTCCTGCATGGTGAGAGTTGCTAATTGTCTTCCTTGTTGTAGCTTCCACTTCTCTGGCTTCTAAAGTTGGAGAATTTTGTGAAACGATAATTTATAATAGCGAATTGGG
GAATTGGGGACCTGTTGTTTCGGAAAAGAAGTAAATTTCTGCTGTAGCTTACCTTTCTTCTGTGTTTTTACAATGAGCTCGTAAAAGTATATTAGCAGGATTCAATTCAT
TAGGGTTTTGAGAAATGTAGTTTTCATACATTTTTTTTCATTGAAATCGAGAGTTGGGTCTTTGGGAGGTGTTCTCCCACTGCCAACCTCATGAAATGGCTGTTCAGATG
ATCTGCTTCTAAGTTCTAATGATATTTCTCTATTCCAAGAAATGTTATGTTATGGGTCAATTTACTGATAACGAGAGGAAGACCAATTTCTATGAGTCTTGTTGGATCTT
TTAACGTCCTTGGAAAAGAAAAGCTCTGG
Protein sequenceShow/hide protein sequence
MGVSLWKPMHAEIRSPHPDSLNSRRVCSLSITRNVEFLASDRGGYFAKRGVMQMGRPGRNQVYMPSHLCVQLISLLRFIEILGHRSQGLEECTEICYKDPVFKDHQWSAY
IDRSPGSASYSEVFNCPIVIFLAFLLTLMISPYLRQCVYLTYFWILTLCCSQECFHACVSGCGFKFDVESEKADKIQPNRPAKPVPVVSKPSPRPPIAESIAKNEDLPST
SA