; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G03590 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G03590
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGAG1At protein
Genome locationClcChr09:2743617..2745810
RNA-Seq ExpressionClc09G03590
SyntenyClc09G03590
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135879.1 uncharacterized protein LOC101214375 [Cucumis sativus]1.1e-3190.12Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSGEAKSGG SGG GGFRSRMEHYLYSGDKKHVAAGIV+ GIIFGIPWALMNRGSKH+SHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

XP_022953259.1 uncharacterized protein LOC111455862 [Cucurbita moschata]2.9e-3292.59Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSGEAKSGG  GGGGGFRSRMEHYLYSG+KKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

XP_022992483.1 uncharacterized protein LOC111488802 [Cucurbita maxima]6.4e-3291.36Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSG+AKSGG  GGGGGFRSRMEHYLYSG+KKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

XP_038897163.1 uncharacterized protein LOC120085312 isoform X1 [Benincasa hispida]1.6e-3886.27Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS-AP-ISCIFPLPLLTLNEGKLYS
        MSGEAK GG +GG GGFRSRMEHYLYSG+KKHVAAGIVVIGIIFGIPW LMNRGSKHQSHQDYME+ADKARSQRLSS AP ISCI P P+LTL EGKLYS
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS-AP-ISCIFPLPLLTLNEGKLYS

Query:  SR
         R
Subjt:  SR

XP_038897164.1 uncharacterized protein LOC120085312 isoform X2 [Benincasa hispida]2.7e-3878.57Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS-AP-ISCIFPLPLLTLNEGKLYS
        MSGEAK GG +GG GGFRSRMEHYLYSG+KKHVAAGIVVIGIIFGIPW LMNRGSKHQSHQDYME+ADKARSQRLSS AP ISCI P P+LTL EGKLYS
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS-AP-ISCIFPLPLLTLNEGKLYS

Query:  SRILMQSIALYS
           + +    Y+
Subjt:  SRILMQSIALYS

TrEMBL top hitse value%identityAlignment
A0A1S3CEK3 uncharacterized protein LOC1034998459.0e-3290.12Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSGEAKSGG SGG GGFRSRME+YLYSGDKKHVAAGIV+ GIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

A0A5D3CFG0 Uncharacterized protein9.0e-3290.12Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSGEAKSGG SGG GGFRSRME+YLYSGDKKHVAAGIV+ GIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

A0A6J1DAT5 uncharacterized protein LOC1110189663.0e-2780.95Show/hide
Query:  MSGEAKSGGGSGGGG-------GFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS
        M GE K GG SGGGG       GFRSRMEH+LYSGDKKHVAAGI VI IIFGIPW LM+RGSKHQSHQDYMERADKARSQRLSS
Subjt:  MSGEAKSGGGSGGGG-------GFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS

A0A6J1GP50 uncharacterized protein LOC1114558621.4e-3292.59Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSGEAKSGG  GGGGGFRSRMEHYLYSG+KKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

A0A6J1JZC0 uncharacterized protein LOC1114888023.1e-3291.36Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        MSG+AKSGG  GGGGGFRSRMEHYLYSG+KKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSS   S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16000.1 unknown protein3.1e-2464.2Show/hide
Query:  MSGEAKSGGG---SGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSA
        M  E K+ GG     GGGGFR++MEHY+YSG+KKHV  GI ++ IIFG+PW LM +GSKHQSHQDYM++ADKAR  RLSS+
Subjt:  MSGEAKSGGG---SGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSA

AT1G80890.1 unknown protein1.8e-2466.67Show/hide
Query:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS
        M  E+KS   + GGGG R++MEHY+YSG+KKHV AGI +I IIFGIPW LMN+GSKH+SHQDY+E+ADKAR  RLSS+  S
Subjt:  MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCGGCGAGGCTAAAAGCGGCGGCGGGAGCGGCGGAGGAGGTGGTTTCAGATCGAGAATGGAGCACTATTTATACAGCGGCGACAAAAAGCACGTCGCCGCTGGGAT
AGTCGTCATTGGTATCATCTTCGGCATCCCTTGGGCCCTCATGAATCGAGGATCAAAACATCAGTCGCATCAAGACTATATGGAAAGAGCTGATAAAGCTCGAAGTCAGA
GACTCTCTTCAGCTCCTATATCTTGCATTTTCCCCCTCCCTCTTCTGACATTAAATGAAGGCAAATTATACTCATCAAGAATATTAATGCAATCTATTGCATTGTATTCC
CAACCTGTGCCGAATCATAAAACCACTCTAAATAGTGAAGTCGTCCTTGAAGAAGGGACCCAGAGTTATGCCATTGTTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCGGCGAGGCTAAAAGCGGCGGCGGGAGCGGCGGAGGAGGTGGTTTCAGATCGAGAATGGAGCACTATTTATACAGCGGCGACAAAAAGCACGTCGCCGCTGGGAT
AGTCGTCATTGGTATCATCTTCGGCATCCCTTGGGCCCTCATGAATCGAGGATCAAAACATCAGTCGCATCAAGACTATATGGAAAGAGCTGATAAAGCTCGAAGTCAGA
GACTCTCTTCAGCTCCTATATCTTGCATTTTCCCCCTCCCTCTTCTGACATTAAATGAAGGCAAATTATACTCATCAAGAATATTAATGCAATCTATTGCATTGTATTCC
CAACCTGTGCCGAATCATAAAACCACTCTAAATAGTGAAGTCGTCCTTGAAGAAGGGACCCAGAGTTATGCCATTGTTGAATGATTCCAATTTAGACTATGTATGTTCTA
ATCTTGGGTGCTTCAAGAGTTGTTCTTTGTTTTTTTCTATTATTATTACTATTATTGTTCATATTTTTATGAGAAAATTTCCATGAACAAAGAAACAAGGGGTGAAATTA
GTATTATTGCCGTCTAGAGTTGGGAAAAATCTATTCAATCCTAATGAAGCTATAGCTTTAACCCTCAAACTGTTCCCATG
Protein sequenceShow/hide protein sequence
MSGEAKSGGGSGGGGGFRSRMEHYLYSGDKKHVAAGIVVIGIIFGIPWALMNRGSKHQSHQDYMERADKARSQRLSSAPISCIFPLPLLTLNEGKLYSSRILMQSIALYS
QPVPNHKTTLNSEVVLEEGTQSYAIVE