; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G010655 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G010655
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionGag protease polyprotein
Genome locationCG_Chr06:22943852..22944553
RNA-Seq ExpressionClCG06G010655
SyntenyClCG06G010655
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022156662.1 uncharacterized protein LOC111023512 [Momordica charantia]6.4e-1852.58Show/hide
Query:  SISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRA
        SISIE KYLRDFKK+DP+  DG S DP++ E WL  +E IF+ M CLEE                WWE  ER IDVS G VTW QF+E FF++Y+ A
Subjt:  SISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRA

XP_038875077.1 uncharacterized protein LOC120067606 [Benincasa hispida]1.1e-1743.36Show/hide
Query:  PLIVVTGQDRDTRSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFR
        PL     QD DT+ +S+E K+LRDF+K+DP+  +GS GDP   ++WL ++E IF  M C EEH               WW  AE+ ID+   L TW QF+
Subjt:  PLIVVTGQDRDTRSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFR

Query:  EVFFRKYFRATIR
        E F+ KYF A  R
Subjt:  EVFFRKYFRATIR

XP_038883046.1 uncharacterized protein LOC120074107 [Benincasa hispida]1.0e-1538.24Show/hide
Query:  LVAMDMVEEVEGVHEAGDP----PLIVVTGQDRDTRSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH--------------
        L+   MVE+        +P     L     Q+RD + +S+E K+LRDF+K+DP+  DGS GDP   ++WL  +E IF+ M C EEH              
Subjt:  LVAMDMVEEVEGVHEAGDP----PLIVVTGQDRDTRSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH--------------

Query:  -WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR
         WW   E+ ID    L TW QF+E F+ KYF A  R
Subjt:  -WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR

XP_038887018.1 uncharacterized protein LOC120077183 [Benincasa hispida]1.9e-1443.56Show/hide
Query:  RSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATI
        + +S+E K+LRDF+K+D +  DGS  DP   ++WL  +E IF  M CLEEH               WW  AE+ ID S GL TW QF+E F+  YF A  
Subjt:  RSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATI

Query:  R
        R
Subjt:  R

XP_038889305.1 uncharacterized protein LOC120079215 [Benincasa hispida]2.3e-1544.34Show/hide
Query:  QDRDTRSISIETKYLRDFKKWDP--QDGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKY
        Q+RD + + +E K+LRDF+K+DP   DGS GDP   ++WL  +E IF  M C EEH               WW  AE+ ID    LVTW QF+E F+ KY
Subjt:  QDRDTRSISIETKYLRDFKKWDP--QDGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKY

Query:  FRATIR
        F A  R
Subjt:  FRATIR

TrEMBL top hitse value%identityAlignment
A0A5A7TTY2 Ty3-gypsy retrotransposon protein3.3e-1237.12Show/hide
Query:  MDMVEEVEGVHEAGDP---PLIVVTGQDRDTRSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWE
        M M E+ + V  A  P   P+ VV     D   +S E K+LRDF+K++P   DGS  DP   +LWL ++E IF+ M C E+                WWE
Subjt:  MDMVEEVEGVHEAGDP---PLIVVTGQDRDTRSISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWE

Query:  FAERSIDVSNGLVTWTQFREVFFRKYFRATIR
          ER ++   G +TW QF+E F+ K+F A++R
Subjt:  FAERSIDVSNGLVTWTQFREVFFRKYFRATIR

A0A5A7TYE6 Ty3-gypsy retrotransposon protein3.3e-1240.4Show/hide
Query:  ISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR
        +S E K+LRDF+K++P   DGS  DP   +LWL F+E IF+ M C E+                WWE  ER +    G +TW QF+E F+ K+F A++R
Subjt:  ISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR

A0A5A7UIF4 Reverse transcriptase1.9e-1240.4Show/hide
Query:  ISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR
        +S E K+LRDFKK++P   DGS  DP   ++WL F+E IF+ M C E+                WW+ AER +    G +TW QF+E F+ K+F A++R
Subjt:  ISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR

A0A5D3BU69 Gag protease polyprotein1.1e-1241.41Show/hide
Query:  ISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR
        +S E K+LRDF+K++P   DGS  DP   +LWL  +E IFQ M C E+                WWE AER ++   G +TW QF+E F+ K+F A++R
Subjt:  ISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRATIR

A0A6J1DSJ6 uncharacterized protein LOC1110235123.1e-1852.58Show/hide
Query:  SISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRA
        SISIE KYLRDFKK+DP+  DG S DP++ E WL  +E IF+ M CLEE                WWE  ER IDVS G VTW QF+E FF++Y+ A
Subjt:  SISIETKYLRDFKKWDPQ--DGSSGDPIIVELWLLFVEAIFQRMTCLEEH---------------WWEFAERSIDVSNGLVTWTQFREVFFRKYFRA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGACAATGATCCTCTATCGTCGCTTGGAGAGATGCTCGTGGCCATGGACATGGTAGAAGAGGTCGAGGGGGTTCATGAGGCAGGGGACCCGCCCCTAATAGTTGT
CACCGGTCAGGACAGAGATACTAGAAGTATTTCTATAGAGACCAAGTATCTACGGGATTTCAAGAAGTGGGATCCCCAAGATGGGTCGTCGGGTGACCCAATCATTGTAG
AATTGTGGCTGTTATTTGTCGAAGCCATTTTTCAACGGATGACCTGTCTTGAAGAGCATTGGTGGGAGTTTGCGGAGAGATCTATCGACGTGAGTAATGGACTGGTTACT
TGGACTCAGTTTAGGGAGGTGTTTTTCAGAAAGTACTTCCGTGCCACCATTCGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGACAATGATCCTCTATCGTCGCTTGGAGAGATGCTCGTGGCCATGGACATGGTAGAAGAGGTCGAGGGGGTTCATGAGGCAGGGGACCCGCCCCTAATAGTTGT
CACCGGTCAGGACAGAGATACTAGAAGTATTTCTATAGAGACCAAGTATCTACGGGATTTCAAGAAGTGGGATCCCCAAGATGGGTCGTCGGGTGACCCAATCATTGTAG
AATTGTGGCTGTTATTTGTCGAAGCCATTTTTCAACGGATGACCTGTCTTGAAGAGCATTGGTGGGAGTTTGCGGAGAGATCTATCGACGTGAGTAATGGACTGGTTACT
TGGACTCAGTTTAGGGAGGTGTTTTTCAGAAAGTACTTCCGTGCCACCATTCGTTAA
Protein sequenceShow/hide protein sequence
MADNDPLSSLGEMLVAMDMVEEVEGVHEAGDPPLIVVTGQDRDTRSISIETKYLRDFKKWDPQDGSSGDPIIVELWLLFVEAIFQRMTCLEEHWWEFAERSIDVSNGLVT
WTQFREVFFRKYFRATIR