CuGenDBv2

Cla97C11G217810 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C11G217810
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: DUF4228 domain-containing protein
Genome location: Cla97Chr11:23059229..23059720
RNA-Seq Expression: Cla97C11G217810
Synteny: Cla97C11G217810
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits    e value    % identity    Alignment
KAE8651677.1 hypothetical protein Csa_021330 [Cucumis sativus]    3.0e-52    77.46
Query:  VVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIARRLTAAAKSAAKTGAPPPCE
        VVHL+G VQHFH PITA QV G+P P AEYFICTAAQLVS AASPAL+PD +LQPGKVYFILP STLHPDVS ADLA IARRLTAAAKSAAK+G+ PPCE
Subjt:  VVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIARRLTAAAKSAAKTGAPPPCE

Query:  ATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE
        A  GG+D +C  A KSRQWRPLLDTI EKP NN  RI+SD E
Subjt:  ATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE

KAG7029841.1 hypothetical protein SDJN02_08184, partial [Cucurbita argyrosperma subsp. argyrosperma]    3.5e-53    72.39
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MGGCISRRSSSAVAAAD IQ+VHL+G VQHFH PITA QV G   P AEYFI TAAQLVS A SPAL+PDAILQPGKVYF+LPFSTLHPDVSP+DL+ IA
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED
        R+LTAAAKSA +    PPC A GGGD  K P  AKSRQW+P LDTI EK  N   + ESD +D
Subjt:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED

XP_008460258.1 PREDICTED: uncharacterized protein LOC103499134 [Cucumis melo]    5.0e-60    78.4
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MGGC+S RSSS  AAADR+QVVHL+G VQHFH PITA QV  KP P  EYFICTAAQLVS AASPAL PDA+LQPGKVYFILPFSTLHPDVS ADLA IA
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE
        RRLTAAAKSAAK+G+ PPCE   GG++ KC AA KSRQWRPLLDTI EKPAN+ +RIESD E
Subjt:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE

XP_011650123.1 uncharacterized protein LOC105434722 [Cucumis sativus]    1.3e-60    77.91
Query:  MGGCISRRSSS-AVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACI
        MGGCIS RSSS A AAADR+QVVHL+G VQHFH PITA QV G+P P AEYFICTAAQLVS AASPAL+PD +LQPGKVYFILP STLHPDVS ADLA I
Subjt:  MGGCISRRSSS-AVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACI

Query:  ARRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE
        ARRLTAAAKSAAK+G+ PPCEA  GG+D +C  A KSRQWRPLLDTI EKP NN  RI+SD E
Subjt:  ARRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE

XP_022996489.1 uncharacterized protein LOC111491721 [Cucurbita maxima]    7.1e-54    73.01
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MG CISRRSSSAVAAAD IQ+VHL+G VQHFH PITA QV G   P AEYFI TAAQLVS A SPAL+PDAILQPGKVYF+LPFSTLHPDVSP+DL+ IA
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED
        R+LTAAAKSA +   PPPC A GGG+D K P AAKSRQW+P LDTI EK  N   + ESD +D
Subjt:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED

TrEMBL top hits    e value    % identity    Alignment
A0A061GRG6 Uncharacterized protein    1.5e-22    42.16
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MG   SR SSS V   + I+VVHL+G V+ F  P++  Q+ GKP    + F+CT AQL+S   S  L PD IL+ G++YF+LP+STLHPDVSPADLA +A
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSA-AKTGAPPPCEATGGGDDL---------------------KCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED
        R+L+A AKS   K  +P   + + G   L                        ++++ R WRP+LDTI EK  N  +R ESD ++
Subjt:  RRLTAAAKSA-AKTGAPPPCEATGGGDDL---------------------KCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED

A0A0A0LHC0 Uncharacterized protein    6.4e-61    77.91
Query:  MGGCISRRSSS-AVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACI
        MGGCIS RSSS A AAADR+QVVHL+G VQHFH PITA QV G+P P AEYFICTAAQLVS AASPAL+PD +LQPGKVYFILP STLHPDVS ADLA I
Subjt:  MGGCISRRSSS-AVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACI

Query:  ARRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE
        ARRLTAAAKSAAK+G+ PPCEA  GG+D +C  A KSRQWRPLLDTI EKP NN  RI+SD E
Subjt:  ARRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE

A0A1S3CC70 uncharacterized protein LOC103499134    2.4e-60    78.4
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MGGC+S RSSS  AAADR+QVVHL+G VQHFH PITA QV  KP P  EYFICTAAQLVS AASPAL PDA+LQPGKVYFILPFSTLHPDVS ADLA IA
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE
        RRLTAAAKSAAK+G+ PPCE   GG++ KC AA KSRQWRPLLDTI EKPAN+ +RIESD E
Subjt:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE

A0A5D3CAJ8 DUF4228 domain-containing protein    2.4e-60    78.4
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MGGC+S RSSS  AAADR+QVVHL+G VQHFH PITA QV  KP P  EYFICTAAQLVS AASPAL PDA+LQPGKVYFILPFSTLHPDVS ADLA IA
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE
        RRLTAAAKSAAK+G+ PPCE   GG++ KC AA KSRQWRPLLDTI EKPAN+ +RIESD E
Subjt:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE

A0A6J1K8V3 uncharacterized protein LOC111491721    3.4e-54    73.01
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA
        MG CISRRSSSAVAAAD IQ+VHL+G VQHFH PITA QV G   P AEYFI TAAQLVS A SPAL+PDAILQPGKVYF+LPFSTLHPDVSP+DL+ IA
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA

Query:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED
        R+LTAAAKSA +   PPPC A GGG+D K P AAKSRQW+P LDTI EK  N   + ESD +D
Subjt:  RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT1G76600.1 unknown protein    8.7e-10    29.66
Query:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVV-------GKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSP
        MG C+S   +  V+++   ++V ++G ++ + +P+ A QV+          S S+ YF+C +  L      PA+  D ILQ  ++YF+LP S     +S 
Subjt:  MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVV-------GKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSP

Query:  ADLACIARRLTAAAKSAA
        +D+A +A + + A + AA
Subjt:  ADLACIARRLTAAAKSAA
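
The % identity values in the hit tables can be cross-checked against the alignment midlines, where identical positions appear as letters, similar residues as `+`, and mismatches or gaps as spaces. The sketch below is a minimal illustration, assuming identity is the number of identical columns divided by the alignment length (gap columns included). The function name `percent_identity` is hypothetical and not part of CuGenDB or BLAST. It uses the Cucumis melo hit XP_008460258.1 listed above.

```python
def percent_identity(query_segments, midline_segments):
    """Return identical columns / alignment columns * 100 (assumed definition)."""
    # Alignment length: total width of the Query rows; '-' gap columns count too.
    columns = sum(len(q) for q in query_segments)
    # Only identical positions are written as letters in a BLAST midline;
    # '+' (similar residue) and ' ' (mismatch or gap) are not counted.
    identical = sum(ch.isalpha() for m in midline_segments for ch in m)
    return 100.0 * identical / columns


# Query and midline rows copied from the XP_008460258.1 alignment.
query = [
    "MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIA",
    "RRLTAAAKSAAKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSE",
]
midline = [
    "MGGC+S RSSS  AAADR+QVVHL+G VQHFH PITA QV  KP P  EYFICTAAQLVS AASPAL PDA+LQPGKVYFILPFSTLHPDVS ADLA IA",
    "RRLTAAAKSAAK+G+ PPCE   GG++ KC AA KSRQWRPLLDTI EKPAN+ +RIESD E",
]

print(round(percent_identity(query, midline), 2))
```

Under this definition the melon hit has 127 identical columns out of 162, which rounds to the 78.4 reported in the table.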


Sequences
CDS sequence
ATGGGTGGCTGTATTTCCCGCCGATCATCTTCCGCCGTCGCTGCCGCCGACAGAATCCAAGTTGTCCACCTCAGTGGCCAAGTACAACACTTCCACCTCCCCATCACCGC
CTGCCAGGTCGTCGGAAAGCCGTCCCCCTCGGCGGAGTACTTCATCTGCACGGCGGCGCAGTTGGTCTCCCCCGCCGCCAGCCCAGCGCTGAGCCCCGACGCCATCCTGC
AGCCGGGCAAAGTGTATTTTATTCTGCCGTTCTCCACTCTTCATCCCGACGTTTCTCCCGCCGACTTGGCCTGCATAGCCAGAAGGCTCACCGCCGCCGCCAAGTCCGCC
GCGAAGACCGGCGCTCCGCCTCCGTGTGAGGCCACCGGTGGTGGCGACGATTTGAAGTGTCCGGCAGCGGCGAAGTCGAGACAGTGGAGACCGTTGTTGGATACGATAAC
GGAAAAGCCGGCGAATAATTACCAGAGGATCGAATCAGATTCGGAAGATTAG
mRNA sequence
ATGGGTGGCTGTATTTCCCGCCGATCATCTTCCGCCGTCGCTGCCGCCGACAGAATCCAAGTTGTCCACCTCAGTGGCCAAGTACAACACTTCCACCTCCCCATCACCGC
CTGCCAGGTCGTCGGAAAGCCGTCCCCCTCGGCGGAGTACTTCATCTGCACGGCGGCGCAGTTGGTCTCCCCCGCCGCCAGCCCAGCGCTGAGCCCCGACGCCATCCTGC
AGCCGGGCAAAGTGTATTTTATTCTGCCGTTCTCCACTCTTCATCCCGACGTTTCTCCCGCCGACTTGGCCTGCATAGCCAGAAGGCTCACCGCCGCCGCCAAGTCCGCC
GCGAAGACCGGCGCTCCGCCTCCGTGTGAGGCCACCGGTGGTGGCGACGATTTGAAGTGTCCGGCAGCGGCGAAGTCGAGACAGTGGAGACCGTTGTTGGATACGATAAC
GGAAAAGCCGGCGAATAATTACCAGAGGATCGAATCAGATTCGGAAGATTAG
Protein sequence
MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIARRLTAAAKSA
AKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED
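
As a consistency check on the record, the CDS can be translated and compared with the listed protein, and its length compared with the genome location span. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and 1-based inclusive coordinates. The helper names `translate` and `span_length` are illustrative and do not come from any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1) in the conventional TCAG codon order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

CDS = (
    "ATGGGTGGCTGTATTTCCCGCCGATCATCTTCCGCCGTCGCTGCCGCCGACAGAATCCAAGTTGTCCACCTCAGTGGCCAAGTACAACACTTCCACCTCCCCATCACCGC"
    "CTGCCAGGTCGTCGGAAAGCCGTCCCCCTCGGCGGAGTACTTCATCTGCACGGCGGCGCAGTTGGTCTCCCCCGCCGCCAGCCCAGCGCTGAGCCCCGACGCCATCCTGC"
    "AGCCGGGCAAAGTGTATTTTATTCTGCCGTTCTCCACTCTTCATCCCGACGTTTCTCCCGCCGACTTGGCCTGCATAGCCAGAAGGCTCACCGCCGCCGCCAAGTCCGCC"
    "GCGAAGACCGGCGCTCCGCCTCCGTGTGAGGCCACCGGTGGTGGCGACGATTTGAAGTGTCCGGCAGCGGCGAAGTCGAGACAGTGGAGACCGTTGTTGGATACGATAAC"
    "GGAAAAGCCGGCGAATAATTACCAGAGGATCGAATCAGATTCGGAAGATTAG"
)
PROTEIN = (
    "MGGCISRRSSSAVAAADRIQVVHLSGQVQHFHLPITACQVVGKPSPSAEYFICTAAQLVSPAASPALSPDAILQPGKVYFILPFSTLHPDVSPADLACIARRLTAAAKSA"
    "AKTGAPPPCEATGGGDDLKCPAAAKSRQWRPLLDTITEKPANNYQRIESDSED"
)
LOCATION = "Cla97Chr11:23059229..23059720"


def translate(cds):
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the open reading frame
            break
        residues.append(aa)
    return "".join(residues)


def span_length(location):
    """Return (chromosome, length) for 'chr:start..end', 1-based inclusive."""
    chrom, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return chrom, end - start + 1


print(span_length(LOCATION), len(CDS))
print(translate(CDS) == PROTEIN)
```

The span and the CDS are both 492 bp, which is consistent with a single-exon gene without annotated UTRs. That also fits the mRNA and CDS sequences being identical in this record.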