CuGenDBv2

Clc08G03870 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G03870
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Gag-pol polyprotein
Genome location: ClcChr08:11498497..11498865
RNA-Seq Expression: Clc08G03870
Synteny: Clc08G03870
Gene Ontology terms: NA
InterPro domains: NA
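
The genome location field packs a sequence name and a coordinate range into one string. The sketch below splits it and computes the span length. It assumes the coordinates are 1-based and inclusive; the page does not state its convention, although a 369 bp span would match the 369 nt CDS listed under Sequences. `Location` and `parse_location` are hypothetical names used only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the one shown on this page."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("ClcChr08:11498497..11498865")
print(loc.seqid, loc.start, loc.end, loc.length)
```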


Homology
GenBank top hits | e value | % identity (alignment follows each hit)
KAA0032368.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 1.4e-09 | 40.18
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKV-SRSLNLRSTRFKQRMKKLLGILVHAMPYSMVW-IKNMFCLINTYTFAKE
        +EGGSTT PP+L+G+NYAYWK  MT FLKS+DN      + +     + +  +S   R + +     +    L ++   + ++  +N+F LINT T AKE
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKV-SRSLNLRSTRFKQRMKKLLGILVHAMPYSMVW-IKNMFCLINTYTFAKE

Query:  AWNILAIAHEGT
        AW IL +A+EGT
Subjt:  AWNILAIAHEGT

KAA0035422.1 gag-proteinase polyprotein [Cucumis melo var. makuwa] | 2.8e-07 | 38.6
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA
        REG STT P +L+G NYAYWK  +  F+KS+D+     +++AGL     + +    +L    T  K + +   G             +N+F  INT  FA
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA

Query:  KEAWNILAIAHEGT
        KEAWNIL  A+EGT
Subjt:  KEAWNILAIAHEGT

PNX92658.1 gag-protease polyprotein [Trifolium pratense] | 3.7e-07 | 36.67
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIR----------KLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLI
        ++GGS   PP+L+G NY YWK  M  FLKS+D      +++ GL             LK     ++          K L  + + +       KNMF LI
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIR----------KLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLI

Query:  NTYTFAKEAWNILAIAHEGT
        NT T AK+AW IL  AHEGT
Subjt:  NTYTFAKEAWNILAIAHEGT

XP_008464095.1 PREDICTED: uncharacterized protein LOC103502061 [Cucumis melo] | 2.8e-07 | 38.6
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA
        REG STT P +L+G NYAYWK  +  F+KS+D+     +++AGL     + +    +L    T  K + +   G             +N+F  INT  FA
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA

Query:  KEAWNILAIAHEGT
        KEAWNIL  A+EGT
Subjt:  KEAWNILAIAHEGT

XP_024028691.1 uncharacterized protein LOC112093781 [Morus notabilis] | 2.2e-07 | 35.83
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLL----------AGLIRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLI
        REGGS +HPP+L+G N +YWKV M  F+K+LD       L+          AG    +K     +    R      + L  + + +        N F LI
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLL----------AGLIRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLI

Query:  NTYTFAKEAWNILAIAHEGT
        +T   AKEAW IL +AHEGT
Subjt:  NTYTFAKEAWNILAIAHEGT

TrEMBL top hits | e value | % identity (alignment follows each hit)
A0A1S3CKP5 uncharacterized protein LOC103502061 | 1.4e-07 | 38.6
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA
        REG STT P +L+G NYAYWK  +  F+KS+D+     +++AGL     + +    +L    T  K + +   G             +N+F  INT  FA
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA

Query:  KEAWNILAIAHEGT
        KEAWNIL  A+EGT
Subjt:  KEAWNILAIAHEGT

A0A2K3MPC9 Gag-protease polyprotein | 1.8e-07 | 36.67
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIR----------KLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLI
        ++GGS   PP+L+G NY YWK  M  FLKS+D      +++ GL             LK     ++          K L  + + +       KNMF LI
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIR----------KLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLI

Query:  NTYTFAKEAWNILAIAHEGT
        NT T AK+AW IL  AHEGT
Subjt:  NTYTFAKEAWNILAIAHEGT

A0A5A7SP99 Gag-pol polyprotein | 6.6e-10 | 40.18
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKV-SRSLNLRSTRFKQRMKKLLGILVHAMPYSMVW-IKNMFCLINTYTFAKE
        +EGGSTT PP+L+G+NYAYWK  MT FLKS+DN      + +     + +  +S   R + +     +    L ++   + ++  +N+F LINT T AKE
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKV-SRSLNLRSTRFKQRMKKLLGILVHAMPYSMVW-IKNMFCLINTYTFAKE

Query:  AWNILAIAHEGT
        AW IL +A+EGT
Subjt:  AWNILAIAHEGT

A0A5A7SVY3 Gag-proteinase polyprotein | 1.4e-07 | 38.6
Query:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA
        REG STT P +L+G NYAYWK  +  F+KS+D+     +++AGL     + +    +L    T  K + +   G             +N+F  INT  FA
Subjt:  REGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGL----IRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFA

Query:  KEAWNILAIAHEGT
        KEAWNIL  A+EGT
Subjt:  KEAWNILAIAHEGT

A0A5A7U4G8 Gag-proteinase polyprotein | 2.3e-07 | 40.37
Query:  EGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFAKEAWN
        EG STT   ML+  NYAYWK  M  FL S+DN            +  K     +L +TR            ++A+   + W  N+F LINT T AKEAWN
Subjt:  EGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFAKEAWN

Query:  ILAIAHEGT
        IL +A+EGT
Subjt:  ILAIAHEGT

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found
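
The % identity column can be related to the unlabelled middle row of each alignment. In BLAST-style output, a letter on that row marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for the final 12-column block of the first GenBank hit. `midline_stats` is a hypothetical helper, and the idea that the page derives its percentage as identities divided by alignment columns is an assumption, not something the page states.

```python
from typing import Tuple


def midline_stats(midline: str) -> Tuple[int, int, int]:
    """Return (identities, positives, columns) for a BLAST-style midline."""
    identities = sum(ch.isalpha() for ch in midline)
    # Positives include identities plus conservative substitutions ('+').
    positives = identities + midline.count("+")
    return identities, positives, len(midline)


# Final block of the first GenBank hit (KAA0032368.1), copied from the page.
query = "AWNILAIAHEGT"
midline = "AW IL +A+EGT"
assert len(query) == len(midline)  # the midline must span the same columns

identities, positives, columns = midline_stats(midline)
pct_identity = round(100 * identities / columns, 2)
print(identities, positives, columns, pct_identity)
```

For a whole alignment, the midline blocks would be concatenated first, each padded with spaces to the width of its Query row, because trailing spaces are easily lost in copied text. Counted this way, the first GenBank hit appears to give 45 identities over 112 columns, which is consistent with the listed 40.18.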

Sequences
CDS sequence
ATGCTGTCGTTTTTGTGTTTCAGAGAAGGAGGTTCTACCACTCATCCACCTATGTTAAATGGGATGAACTATGCCTACTGGAAGGTAATAATGACTGTTTTCCTG
AAGTCGCTGGACAATAGTCTTGGAAATCTATCATTGTTGGCTGGACTCATCCGGAAGTTAAAGGTGAGTCGCAGCCTAAACCTGAGATCAACGAGATTCAAGCAG
AGAATGAAGAAGCTCTTGGGAATTCTCGTGCATGCAATGCCATATTCAATGGTGTGGATAAAAAACATGTTTTGTCTAATCAACACTTATACTTTTGCCAAGGAA
GCATGGAATATTCTTGCCATAGCTCATGAAGGAACTGTCACACCCTACTTTTAA
mRNA sequence
ATGCTGTCGTTTTTGTGTTTCAGAGAAGGAGGTTCTACCACTCATCCACCTATGTTAAATGGGATGAACTATGCCTACTGGAAGGTAATAATGACTGTTTTCCTG
AAGTCGCTGGACAATAGTCTTGGAAATCTATCATTGTTGGCTGGACTCATCCGGAAGTTAAAGGTGAGTCGCAGCCTAAACCTGAGATCAACGAGATTCAAGCAG
AGAATGAAGAAGCTCTTGGGAATTCTCGTGCATGCAATGCCATATTCAATGGTGTGGATAAAAAACATGTTTTGTCTAATCAACACTTATACTTTTGCCAAGGAA
GCATGGAATATTCTTGCCATAGCTCATGAAGGAACTGTCACACCCTACTTTTAA
Protein sequence
MLSFLCFREGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFAKE
AWNILAIAHEGTVTPYF
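
The protein sequence above should follow from the CDS by codon-wise translation. As a rough consistency check, the sketch below translates the CDS with the standard genetic code and compares the result with the listed protein. It assumes the standard code (NCBI translation table 1) applies to this nuclear gene; `translate` is a hypothetical helper written for illustration, not a database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# CDS and protein copied from this page, joined across line breaks.
CDS = "".join((
    "ATGCTGTCGTTTTTGTGTTTCAGAGAAGGAGGTTCTACCACTCATCCACCTATGTTAAATGGGATGAACTATGCCTACTGGAAGGTAATAATGACTGTTTTCCTG",
    "AAGTCGCTGGACAATAGTCTTGGAAATCTATCATTGTTGGCTGGACTCATCCGGAAGTTAAAGGTGAGTCGCAGCCTAAACCTGAGATCAACGAGATTCAAGCAG",
    "AGAATGAAGAAGCTCTTGGGAATTCTCGTGCATGCAATGCCATATTCAATGGTGTGGATAAAAAACATGTTTTGTCTAATCAACACTTATACTTTTGCCAAGGAA",
    "GCATGGAATATTCTTGCCATAGCTCATGAAGGAACTGTCACACCCTACTTTTAA",
))
PROTEIN = (
    "MLSFLCFREGGSTTHPPMLNGMNYAYWKVIMTVFLKSLDNSLGNLSLLAGLIRKLKVSRSLNLRSTRFKQRMKKLLGILVHAMPYSMVWIKNMFCLINTYTFAKE"
    "AWNILAIAHEGTVTPYF"
)


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


print(len(CDS), len(PROTEIN), translate(CDS) == PROTEIN)
```

A CDS of 369 nt corresponds to 122 codons plus one stop codon, which matches the 122 residues of the listed protein.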