; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G013340 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G013340
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionAT-rich interactive domain-containing protein
Genome locationCG_Chr08:26190286..26193604
RNA-Seq ExpressionClCG08G013340
SyntenyClCG08G013340
Gene Ontology termsNA
InterPro domainsIPR025124 - Domain of unknown function DUF4050


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064925.1 uncharacterized protein E6C27_scaffold82G002430 [Cucumis melo var. makuwa]2.3e-6889.26Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        +R   GC +ASALGNAMDGPSKGLRVKD+EAKKQCLPEN PSSSTCEMDNSTVWSQRSM  AQS +S SNIGSSTDFVNSGLLLWNETRKQWVGNKMS+S
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISWNATYDSLLTTNKPFPE IPL EMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

KAG6585468.1 hypothetical protein SDJN03_18201, partial [Cucurbita argyrosperma subsp. sororia]5.1e-6878.49Show/hide
Query:  WRRVNALSREYLASALTDYNTNERRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDF
        WRRVN              ++NERRSC G  T  ALG+AMDGPS GLRV+DQEAKKQCLPEN  SSSTCEMDNSTVWSQRSM  AQS +SH+N+GSST+F
Subjt:  WRRVNALSREYLASALTDYNTNERRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDF

Query:  VNSGLLLWNETRKQWVGNKMSESQKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        VNSGLLLWNETRKQWVGNK SESQK+V+EPKISWNATYDSLLTTNKPFPE IPLAEMIEFLVDVWEQEGLYD
Subjt:  VNSGLLLWNETRKQWVGNKMSESQKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

XP_004138726.1 uncharacterized protein LOC101216869 [Cucumis sativus]1.5e-6787.92Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        +R   GC TASALGNAMDGPSKGLRVK++EAKKQCLPEN PSSSTCEMDNSTVWSQRSM   Q+ +SHSNIGSSTDFVNSGLLLWNETRKQWVGNKMS S
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISWNATYD+LLTTNKPFPE IPL EMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

XP_008445211.1 PREDICTED: uncharacterized protein LOC103488310 isoform X1 [Cucumis melo]2.3e-6889.26Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        +R   GC +ASALGNAMDGPSKGLRVKD+EAKKQCLPEN PSSSTCEMDNSTVWSQRSM  AQS +S SNIGSSTDFVNSGLLLWNETRKQWVGNKMS+S
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISWNATYDSLLTTNKPFPE IPL EMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

XP_038885342.1 uncharacterized protein LOC120075759 isoform X1 [Benincasa hispida]4.9e-7189.93Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        RRSCHGC TASAL NAMDGPSKGLRVKDQEAKKQCLPEN PSSSTCEMDNSTVWSQRSM  A S +SHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSE 
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISW+ATYDSLL TNKPFPEP+PL EMI+FLVDVWEQ+GLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

TrEMBL top hitse value%identityAlignment
A0A0A0LPL3 Uncharacterized protein7.2e-6887.92Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        +R   GC TASALGNAMDGPSKGLRVK++EAKKQCLPEN PSSSTCEMDNSTVWSQRSM   Q+ +SHSNIGSSTDFVNSGLLLWNETRKQWVGNKMS S
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISWNATYD+LLTTNKPFPE IPL EMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

A0A1S3BC47 uncharacterized protein LOC103488310 isoform X11.1e-6889.26Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        +R   GC +ASALGNAMDGPSKGLRVKD+EAKKQCLPEN PSSSTCEMDNSTVWSQRSM  AQS +S SNIGSSTDFVNSGLLLWNETRKQWVGNKMS+S
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISWNATYDSLLTTNKPFPE IPL EMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

A0A5A7VGA9 Uncharacterized protein1.1e-6889.26Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        +R   GC +ASALGNAMDGPSKGLRVKD+EAKKQCLPEN PSSSTCEMDNSTVWSQRSM  AQS +S SNIGSSTDFVNSGLLLWNETRKQWVGNKMS+S
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QKQVQEPKISWNATYDSLLTTNKPFPE IPL EMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

A0A6J1GIP5 uncharacterized protein LOC111454240 isoform X13.9e-6685.91Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        RRSC G  T  ALG+AMDGPS GLRV+DQEAKKQCLPEN  SSSTCEMDNSTVWSQRSM  AQS +SH+N+GSST+FVNSGLLLWNETRKQWVGNK SES
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QK+V+EPKISWNATYDSLLTTNKPFPE IPLAEMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

A0A6J1KQM2 uncharacterized protein LOC111496323 isoform X12.0e-6585.23Show/hide
Query:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES
        RRSC G  T  ALG+AMDGPS GLRV DQEAKKQCLP+N  SSSTCEMDNSTVWSQRSM  AQS +SH+N+GSST+FVNSGLLLWNETRKQWVGNK SES
Subjt:  RRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        QK+V+EPKISWNATYDSLLTTNKPFPE IPLAEMIEFLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G15350.1 unknown protein1.6e-2746.1Show/hide
Query:  CHGCYT--ASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPA----QSQNSHSNIGSSTDFVNSGLLLWNETRKQWVG-NK
        C GCY    S   +  D PS  +    +  KK  + E+  S+ST +MDN T  SQ S+  +     SQ++  N  +  ++VN GLLLWN+TR++WVG +K
Subjt:  CHGCYT--ASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPA----QSQNSHSNIGSSTDFVNSGLLLWNETRKQWVG-NK

Query:  MSESQKQVQEPKISWN-ATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
         +      Q  K++WN ATYDSLL +NK FP+PIPL EM++FLVD+WEQEGLYD
Subjt:  MSESQKQVQEPKISWN-ATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

AT1G15350.2 unknown protein1.6e-2746.1Show/hide
Query:  CHGCYT--ASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPA----QSQNSHSNIGSSTDFVNSGLLLWNETRKQWVG-NK
        C GCY    S   +  D PS  +    +  KK  + E+  S+ST +MDN T  SQ S+  +     SQ++  N  +  ++VN GLLLWN+TR++WVG +K
Subjt:  CHGCYT--ASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPA----QSQNSHSNIGSSTDFVNSGLLLWNETRKQWVG-NK

Query:  MSESQKQVQEPKISWN-ATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
         +      Q  K++WN ATYDSLL +NK FP+PIPL EM++FLVD+WEQEGLYD
Subjt:  MSESQKQVQEPKISWN-ATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

AT4G32342.1 unknown protein3.7e-3253.02Show/hide
Query:  SCHGCYTAS-ALGNAMDGPSKGLRVKDQEAKK-QCLPENCPSSSTCEMD-NSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSE
        +C GC      L   +D PSKGL+++ +  KK     ++  S+STC+MD N T+ SQ S PP   Q S SN   ST+FVN GL+LWN TR+QW    ++ 
Subjt:  SCHGCYTAS-ALGNAMDGPSKGLRVKDQEAKK-QCLPENCPSSSTCEMD-NSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSE

Query:  SQKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLY
         Q  V EP ISWN+TYDSLL+TNK FP+PIPL EM+ FLVDVWE+EGLY
Subjt:  SQKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLY

AT5G25360.1 unknown protein5.3e-3955.7Show/hide
Query:  CHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSES
        C GC     L  A+D PSKGLR++ +  KK  + E+  S+STCEMDNST+ SQRSM      N+ S   S+   T+FVN GL LWN+TR+QW+ N  S+ 
Subjt:  CHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        + +V+EP ISWNATY+SLL  NK F  PIPL EM++FLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD

AT5G25360.2 unknown protein5.3e-3955.7Show/hide
Query:  CHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSES
        C GC     L  A+D PSKGLR++ +  KK  + E+  S+STCEMDNST+ SQRSM      N+ S   S+   T+FVN GL LWN+TR+QW+ N  S+ 
Subjt:  CHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSSTCEMDNSTVWSQRSMPPAQSQNSHSNIGSS---TDFVNSGLLLWNETRKQWVGNKMSES

Query:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD
        + +V+EP ISWNATY+SLL  NK F  PIPL EM++FLVDVWEQEGLYD
Subjt:  QKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTATTTATTTCGACCCTTATAAAATCTGCCCAATCTTCTACTGTGGACCGTCTGGACATTATTCGAAGGCTTCTCTCTTTCGCGGTGTTTTTCAACGTCTGCTC
TTCTTCGGATAATTTAGCTTGGAGACGCGTAAATGCATTATCCAGGGAATATTTGGCTTCCGCTCTAACAGATTATAACACGAATGAGAGAAGAAGCTGTCATGGATGCT
ACACTGCATCTGCACTAGGTAATGCAATGGATGGGCCGTCTAAAGGTCTGAGAGTTAAAGACCAAGAAGCAAAGAAACAATGCTTACCAGAAAATTGCCCGAGCTCTAGC
ACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGCCACCAGCCCAGTCACAAAATTCTCACAGCAATATTGGGAGCAGTACAGACTTTGTAAACTCTGG
ACTACTTCTTTGGAATGAGACCAGGAAGCAATGGGTTGGAAATAAAATGTCCGAGAGCCAAAAGCAAGTTCAAGAACCCAAAATAAGCTGGAATGCTACTTATGACAGCT
TATTAACAACAAACAAGCCGTTCCCCGAGCCCATACCTCTTGCTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTATATGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTATTTATTTCGACCCTTATAAAATCTGCCCAATCTTCTACTGTGGACCGTCTGGACATTATTCGAAGGCTTCTCTCTTTCGCGGTGTTTTTCAACGTCTGCTC
TTCTTCGGATAATTTAGCTTGGAGACGCGTAAATGCATTATCCAGGGAATATTTGGCTTCCGCTCTAACAGATTATAACACGAATGAGAGAAGAAGCTGTCATGGATGCT
ACACTGCATCTGCACTAGGTAATGCAATGGATGGGCCGTCTAAAGGTCTGAGAGTTAAAGACCAAGAAGCAAAGAAACAATGCTTACCAGAAAATTGCCCGAGCTCTAGC
ACATGTGAAATGGACAACAGTACAGTTTGGTCCCAGAGAAGCATGCCACCAGCCCAGTCACAAAATTCTCACAGCAATATTGGGAGCAGTACAGACTTTGTAAACTCTGG
ACTACTTCTTTGGAATGAGACCAGGAAGCAATGGGTTGGAAATAAAATGTCCGAGAGCCAAAAGCAAGTTCAAGAACCCAAAATAAGCTGGAATGCTACTTATGACAGCT
TATTAACAACAAACAAGCCGTTCCCCGAGCCCATACCTCTTGCTGAGATGATAGAGTTTCTTGTTGATGTCTGGGAGCAGGAGGGTCTATATGACTGA
Protein sequenceShow/hide protein sequence
MVLFISTLIKSAQSSTVDRLDIIRRLLSFAVFFNVCSSSDNLAWRRVNALSREYLASALTDYNTNERRSCHGCYTASALGNAMDGPSKGLRVKDQEAKKQCLPENCPSSS
TCEMDNSTVWSQRSMPPAQSQNSHSNIGSSTDFVNSGLLLWNETRKQWVGNKMSESQKQVQEPKISWNATYDSLLTTNKPFPEPIPLAEMIEFLVDVWEQEGLYD