; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG11G010370 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG11G010370
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationCG_Chr11:19618711..19620586
RNA-Seq ExpressionClCG11G010370
SyntenyClCG11G010370
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047357.1 uncharacterized protein E6C27_scaffold754G00030 [Cucumis melo var. makuwa]2.8e-1446.46Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH
        M+ + +DRQERR QQQREERVLQEDE ++DL E   Q                    L   +E++   +K++I   S         + E YLEWERKIEH
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH

Query:  VFDCNTNSENKKMRLVIAKFTNHAENW
        VFDCNT S+NKKM+L+  +FT+HAENW
Subjt:  VFDCNTNSENKKMRLVIAKFTNHAENW

KAA0057113.1 putative gag protein [Cucumis melo var. makuwa]6.1e-1445.38Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK
        +E+M E+RQERR QQQRE R LQEDEGM DL     Q              L N+          E+    +K++I   +         D+E YL+WERK
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK

Query:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW
        IEHVFDCNT SEN+KM+L IA+FTN+A  W
Subjt:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW

TYK22420.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]8.0e-1445.38Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK
        +E+M E+RQERR QQQRE R  QEDEGM DL     Q              L N+          E+    +K++I   +         D+E YL+WERK
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK

Query:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW
        IEHVFDCNT SENKKM+L IA+FTN+A  W
Subjt:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW

XP_016902063.1 PREDICTED: uncharacterized protein LOC107991521 [Cucumis melo]2.8e-1446.46Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH
        M+ + +DRQERR QQQREERVLQEDE ++DL E   Q                    L   +E++   +K++I   S         + E YLEWERKIEH
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH

Query:  VFDCNTNSENKKMRLVIAKFTNHAENW
        VFDCNT S+NKKM+L+  +FT+HAENW
Subjt:  VFDCNTNSENKKMRLVIAKFTNHAENW

XP_022158402.1 uncharacterized protein LOC111024891 [Momordica charantia]1.9e-1550.83Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQTLCNLEEEKEFMKIEIAELSSRFHHLVE-------------QDTEAYLEWERKIEHVFDCNTN
        +EMM EDRQERR QQ+REERVLQEDEGM D +  + +          F  I       R H   +              D EAYL+WERKIEHVFD  T 
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQTLCNLEEEKEFMKIEIAELSSRFHHLVE-------------QDTEAYLEWERKIEHVFDCNTN

Query:  SENKKMRLVIAKFTNHAENW
        SENKKMRLVIA+F NHA  W
Subjt:  SENKKMRLVIAKFTNHAENW

TrEMBL top hitse value%identityAlignment
A0A1S4E1G3 uncharacterized protein LOC1079915211.3e-1446.46Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH
        M+ + +DRQERR QQQREERVLQEDE ++DL E   Q                    L   +E++   +K++I   S         + E YLEWERKIEH
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH

Query:  VFDCNTNSENKKMRLVIAKFTNHAENW
        VFDCNT S+NKKM+L+  +FT+HAENW
Subjt:  VFDCNTNSENKKMRLVIAKFTNHAENW

A0A5A7TZH9 Retrotrans_gag domain-containing protein1.3e-1446.46Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH
        M+ + +DRQERR QQQREERVLQEDE ++DL E   Q                    L   +E++   +K++I   S         + E YLEWERKIEH
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQT-------------------LCNLEEEK-EFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEH

Query:  VFDCNTNSENKKMRLVIAKFTNHAENW
        VFDCNT S+NKKM+L+  +FT+HAENW
Subjt:  VFDCNTNSENKKMRLVIAKFTNHAENW

A0A5A7UMH2 Putative gag protein3.0e-1445.38Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK
        +E+M E+RQERR QQQRE R LQEDEGM DL     Q              L N+          E+    +K++I   +         D+E YL+WERK
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK

Query:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW
        IEHVFDCNT SEN+KM+L IA+FTN+A  W
Subjt:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW

A0A5D3DRJ1 F15O4.133.9e-1445.38Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK
        +E+M E+RQERR QQQRE R  QEDEGM DL     Q              L N+          E+    +K++I   +         D+E YL+WERK
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQ-------------TLCNL----------EEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERK

Query:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW
        IEHVFDCNT SENKKM+L IA+FTN+A  W
Subjt:  IEHVFDCNTNSENKKMRLVIAKFTNHAENW

A0A6J1DX52 uncharacterized protein LOC1110248919.2e-1650.83Show/hide
Query:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQTLCNLEEEKEFMKIEIAELSSRFHHLVE-------------QDTEAYLEWERKIEHVFDCNTN
        +EMM EDRQERR QQ+REERVLQEDEGM D +  + +          F  I       R H   +              D EAYL+WERKIEHVFD  T 
Subjt:  MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQTLCNLEEEKEFMKIEIAELSSRFHHLVE-------------QDTEAYLEWERKIEHVFDCNTN

Query:  SENKKMRLVIAKFTNHAENW
        SENKKMRLVIA+F NHA  W
Subjt:  SENKKMRLVIAKFTNHAENW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATGATGAGCGAAGATAGACAAGAAAGGAGGACACAACAACAAAGAGAAGAACGAGTCTTACAAGAAGATGAAGGAATGCTTGATCTAGAAGAAATAATCTTGCA
AACCTTATGCAATCTAGAAGAGGAGAAAGAATTCATGAAGATAGAGATAGCAGAGTTAAGCTCAAGATTTCACCATTTAGTGGAACAAGACACGGAGGCATACTTAGAAT
GGGAAAGGAAGATAGAACACGTGTTTGATTGCAACACAAATAGTGAGAATAAGAAGATGAGACTTGTCATTGCCAAATTCACCAATCATGCAGAAAACTGGCGTATGCGG
GTGTGCGTGTGCGTATGTGTGTGCGCGTGTGGGTGGGTTGGTGTGTGTGTGTGGGCATGTGTGTGCTACATGGAAGAAGATTACACTGTAGATATATTCTTTAGGAGAAA
AAGTGTTAGAAGAAGTGTTCTTCAAAGAATGGGAGCACTGGGCAAGGTCAAAGGCCTTCTTGCAACTGTCATCAAACACAAAGGACACATCTTTCTGGAAGAGGGAGAGT
AA
mRNA sequenceShow/hide mRNA sequence
ATGGAAATGATGAGCGAAGATAGACAAGAAAGGAGGACACAACAACAAAGAGAAGAACGAGTCTTACAAGAAGATGAAGGAATGCTTGATCTAGAAGAAATAATCTTGCA
AACCTTATGCAATCTAGAAGAGGAGAAAGAATTCATGAAGATAGAGATAGCAGAGTTAAGCTCAAGATTTCACCATTTAGTGGAACAAGACACGGAGGCATACTTAGAAT
GGGAAAGGAAGATAGAACACGTGTTTGATTGCAACACAAATAGTGAGAATAAGAAGATGAGACTTGTCATTGCCAAATTCACCAATCATGCAGAAAACTGGCGTATGCGG
GTGTGCGTGTGCGTATGTGTGTGCGCGTGTGGGTGGGTTGGTGTGTGTGTGTGGGCATGTGTGTGCTACATGGAAGAAGATTACACTGTAGATATATTCTTTAGGAGAAA
AAGTGTTAGAAGAAGTGTTCTTCAAAGAATGGGAGCACTGGGCAAGGTCAAAGGCCTTCTTGCAACTGTCATCAAACACAAAGGACACATCTTTCTGGAAGAGGGAGAGT
AA
Protein sequenceShow/hide protein sequence
MEMMSEDRQERRTQQQREERVLQEDEGMLDLEEIILQTLCNLEEEKEFMKIEIAELSSRFHHLVEQDTEAYLEWERKIEHVFDCNTNSENKKMRLVIAKFTNHAENWRMR
VCVCVCVCACGWVGVCVWACVCYMEEDYTVDIFFRRKSVRRSVLQRMGALGKVKGLLATVIKHKGHIFLEEGE