CuGenDBv2

ClCG02G013860 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG02G013860
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrovirus-related Pol polyprotein from transposon RE1
Genome location: CG_Chr02:27980590..27983212
RNA-Seq Expression: ClCG02G013860
Synteny: ClCG02G013860
Gene Ontology terms: NA
InterPro domains: NA
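The genome location field can be split into a chromosome name and two coordinates. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(loc: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("CG_Chr02:27980590..27983212")

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 2,623 bp on CG_Chr02.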


Homology
GenBank top hits (e value, % identity, alignment)
CAN63649.1 hypothetical protein VITISV_037657 [Vitis vinifera] (e value: 8.7e-23; % identity: 37.36)
Query:  PTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLDVSDN----FTYWHPLNRILMCWIYSSLPQKKMGEIIKEITYQFAAIGEPISY
        P+L Q  + +L+ +NYL  +TQ++N +I N       G    P +FL  S+N    ++ W   NR++MCWIYSSL +  +            AIGE I+ 
Subjt:  PTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLDVSDN----FTYWHPLNRILMCWIYSSLPQKKMGEIIKEITYQFAAIGEPISY

Query:  RDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQ
        +D + ++L  LR EYN+FV ++ ++ +  +LE+I S+LL +E RLE+    ++ NL QA+IT+++IQ +NKK+Q
Subjt:  RDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQ

GFY85402.1 hypothetical protein Acr_04g0001400 [Actinidia rufa] (e value: 3.3e-22; % identity: 33.62)
Query:  PPQPLYQQYPQYQPPN-FPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNR
        PP P     P    PN  PQ L   P P      P++ QPL+ KL+D NY+  K QL+N VI N      DGS  CPP+FLD      +  F  W   NR
Subjt:  PPQPLYQQYPQYQPPN-FPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNR

Query:  ILMCWIYSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNA
        ++M WIY+S+ +  +G+I+                                                + +    A+IGEP++Y DHL + L  L  +YN 
Subjt:  ILMCWIYSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNA

Query:  FVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVD
        FVTSIQ+Q+  P++E++ SLLL Y+ARLE+ +  D
Subjt:  FVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVD

PON47862.1 hypothetical protein TorRG33x02_321990 [Trema orientale] (e value: 2.1e-24; % identity: 34.29)
Query:  QYQPPNFPQFL-----PRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNRILMCWI
        Q  P N P         +PP P   P+ P++ QP + KL+  NYL  K QL+N +I N      DGS PCPP+F D     V+  +  W   NR++M WI
Subjt:  QYQPPNFPQFL-----PRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNRILMCWI

Query:  YSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQ
        Y+SL Q  MG+I+                                                K I    AA+GEP+S +DHL ++   L  EYNAFVTSI 
Subjt:  YSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQ

Query:  NQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPK-------FPPNF----PQFRSFPQTSH
         + D   LE+I SLLL YE RLE      QL+  QA++  L+I  N K  +P        F  NF     QF+S P  S+
Subjt:  NQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPK-------FPPNF----PQFRSFPQTSH

RVW69807.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera] (e value: 1.4e-20; % identity: 31.62)
Query:  PRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNRILMCWIYSSLPQKKMGEII---
        P P         P+L Q LS KL++TN L  K+QL+N +I N      D  +  PP++LD     V+  F  W  LN+++M WIYSSL    +G+I+   
Subjt:  PRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNRILMCWIYSSLPQKKMGEII---

Query:  ---------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLL
                                                     K +  +FA IGEP+SYRD L  IL+ L  EY+ FVTSI N+SD P+L+++ SLL 
Subjt:  ---------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLL

Query:  RYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQFRSFPQTSH
         YE RL + ++   LN  QA+             QP +  + PQ +   ++ H
Subjt:  RYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQFRSFPQTSH

XP_022155181.1 uncharacterized protein LOC111022315 [Momordica charantia] (e value: 2.4e-49; % identity: 47.51)
Query:  PPNFPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD---VSDNFTY--WHPLNRILMCWIYSSLPQKK
        PP  P FL +PP P  A  FPTLPQPL+ KLND N+L  K QL+NAVI N      DG+   PPQFLD   +  N  Y  W   NR+LMCWIYSSL ++K
Subjt:  PPNFPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD---VSDNFTY--WHPLNRILMCWIYSSLPQKK

Query:  MGEI------------------------------------------------IKEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTL
        MGE+                                                IKEI  +FAA+GEP+SYRDHL H+LD L  EYNAFVTSI N++D+P+L
Subjt:  MGEI------------------------------------------------IKEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTL

Query:  EDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQF--RSFPQT
        ED+RSLLL YEARL+K   VDQLN+AQA++ +LS+Q N+K+  PKF  +FP     SFP +
Subjt:  EDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQF--RSFPQT

TrEMBL top hits (e value, % identity, alignment)
A0A2P5BGF8 Uncharacterized protein (e value: 1.0e-24; % identity: 34.29)
Query:  QYQPPNFPQFL-----PRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNRILMCWI
        Q  P N P         +PP P   P+ P++ QP + KL+  NYL  K QL+N +I N      DGS PCPP+F D     V+  +  W   NR++M WI
Subjt:  QYQPPNFPQFL-----PRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNRILMCWI

Query:  YSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQ
        Y+SL Q  MG+I+                                                K I    AA+GEP+S +DHL ++   L  EYNAFVTSI 
Subjt:  YSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQ

Query:  NQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPK-------FPPNF----PQFRSFPQTSH
         + D   LE+I SLLL YE RLE      QL+  QA++  L+I  N K  +P        F  NF     QF+S P  S+
Subjt:  NQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPK-------FPPNF----PQFRSFPQTSH

A0A6J1DQX7 uncharacterized protein LOC111022315 (e value: 1.2e-49; % identity: 47.51)
Query:  PPNFPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD---VSDNFTY--WHPLNRILMCWIYSSLPQKK
        PP  P FL +PP P  A  FPTLPQPL+ KLND N+L  K QL+NAVI N      DG+   PPQFLD   +  N  Y  W   NR+LMCWIYSSL ++K
Subjt:  PPNFPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD---VSDNFTY--WHPLNRILMCWIYSSLPQKK

Query:  MGEI------------------------------------------------IKEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTL
        MGE+                                                IKEI  +FAA+GEP+SYRDHL H+LD L  EYNAFVTSI N++D+P+L
Subjt:  MGEI------------------------------------------------IKEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTL

Query:  EDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQF--RSFPQT
        ED+RSLLL YEARL+K   VDQLN+AQA++ +LS+Q N+K+  PKF  +FP     SFP +
Subjt:  EDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQF--RSFPQT

A0A7J0EGI5 Uncharacterized protein (e value: 1.6e-22; % identity: 33.62)
Query:  PPQPLYQQYPQYQPPN-FPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNR
        PP P     P    PN  PQ L   P P      P++ QPL+ KL+D NY+  K QL+N VI N      DGS  CPP+FLD      +  F  W   NR
Subjt:  PPQPLYQQYPQYQPPN-FPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLD-----VSDNFTYWHPLNR

Query:  ILMCWIYSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNA
        ++M WIY+S+ +  +G+I+                                                + +    A+IGEP++Y DHL + L  L  +YN 
Subjt:  ILMCWIYSSLPQKKMGEII------------------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNA

Query:  FVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVD
        FVTSIQ+Q+  P++E++ SLLL Y+ARLE+ +  D
Subjt:  FVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVD

A0A803NL56 Uncharacterized protein (e value: 5.0e-24; % identity: 31.2)
Query:  PGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQF-----LDVSDNFTYWHPLNRILMCWIYSSLPQKKMGEII--------
        P   P F +  Q +S KL+DTNYL  + Q+ N +I N      DG+  C  QF       V   FT WH  N++LM W+Y+SL    +G+I+        
Subjt:  PGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQF-----LDVSDNFTYWHPLNRILMCWIYSSLPQKKMGEII--------

Query:  ----------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEAR
                                                K +    A++G+PIS ++HL ++L+ L +EYNAFVT I  +   PT+E++ +LLL YEAR
Subjt:  ----------------------------------------KEITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEAR

Query:  LEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQFRSFPQTSHNP
        LE+       +  QA+  +LS  +   KS  + P + P+F S PQ +  P
Subjt:  LEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQFRSFPQTSHNP

A5BMF5 Uncharacterized protein (e value: 4.2e-23; % identity: 37.36)
Query:  PTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLDVSDN----FTYWHPLNRILMCWIYSSLPQKKMGEIIKEITYQFAAIGEPISY
        P+L Q  + +L+ +NYL  +TQ++N +I N       G    P +FL  S+N    ++ W   NR++MCWIYSSL +  +            AIGE I+ 
Subjt:  PTLPQPLSTKLNDTNYLSLKTQLMNAVIVN------DGSEPCPPQFLDVSDN----FTYWHPLNRILMCWIYSSLPQKKMGEIIKEITYQFAAIGEPISY

Query:  RDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQ
        +D + ++L  LR EYN+FV ++ ++ +  +LE+I S+LL +E RLE+    ++ NL QA+IT+++IQ +NKK+Q
Subjt:  RDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQ

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGCCCCCACCCCAGCCTCTATATCAACAATATCCACAGTATCAACCTCCTAATTTTCCCCAATTTCTTCCCAGACCACCTTATCCTGGCTTTGCCCCAATGTTCCCCAC
ATTACCACAACCTCTGTCCACCAAGCTCAATGACACCAATTACCTCTCCTTGAAGACCCAACTCATGAATGCAGTCATCGTCAATGATGGTAGTGAGCCCTGCCCCCCTC
AGTTCCTTGATGTCAGTGACAATTTTACTTACTGGCACCCATTAAATAGGATATTGATGTGTTGGATTTACTCCTCTCTTCCCCAAAAAAAGATGGGTGAGATTATTAAG
GAAATTACATATCAATTTGCTGCCATAGGGGAACCAATTTCATATCGAGATCATTTGGGTCACATTTTGGATGACTTAAGAGTGGAATACAATGCTTTTGTTACTTCTAT
TCAGAATCAATCAGATACTCCTACCCTAGAAGATATACGTAGTTTATTATTAAGATATGAGGCAAGATTGGAAAAACACACTTTGGTTGATCAGTTAAATTTGGCTCAAG
CTCACATTACTAGCCTTTCCATTCAACAAAATAACAAGAAGTCTCAACCTAAATTCCCCCCAAACTTCCCTCAATTTCGATCATTCCCACAAACTTCTCATAATCCCCTT
mRNA sequence
ATGCCCCCACCCCAGCCTCTATATCAACAATATCCACAGTATCAACCTCCTAATTTTCCCCAATTTCTTCCCAGACCACCTTATCCTGGCTTTGCCCCAATGTTCCCCAC
ATTACCACAACCTCTGTCCACCAAGCTCAATGACACCAATTACCTCTCCTTGAAGACCCAACTCATGAATGCAGTCATCGTCAATGATGGTAGTGAGCCCTGCCCCCCTC
AGTTCCTTGATGTCAGTGACAATTTTACTTACTGGCACCCATTAAATAGGATATTGATGTGTTGGATTTACTCCTCTCTTCCCCAAAAAAAGATGGGTGAGATTATTAAG
GAAATTACATATCAATTTGCTGCCATAGGGGAACCAATTTCATATCGAGATCATTTGGGTCACATTTTGGATGACTTAAGAGTGGAATACAATGCTTTTGTTACTTCTAT
TCAGAATCAATCAGATACTCCTACCCTAGAAGATATACGTAGTTTATTATTAAGATATGAGGCAAGATTGGAAAAACACACTTTGGTTGATCAGTTAAATTTGGCTCAAG
CTCACATTACTAGCCTTTCCATTCAACAAAATAACAAGAAGTCTCAACCTAAATTCCCCCCAAACTTCCCTCAATTTCGATCATTCCCACAAACTTCTCATAATCCCCTT
Protein sequence
MPPPQPLYQQYPQYQPPNFPQFLPRPPYPGFAPMFPTLPQPLSTKLNDTNYLSLKTQLMNAVIVNDGSEPCPPQFLDVSDNFTYWHPLNRILMCWIYSSLPQKKMGEIIK
EITYQFAAIGEPISYRDHLGHILDDLRVEYNAFVTSIQNQSDTPTLEDIRSLLLRYEARLEKHTLVDQLNLAQAHITSLSIQQNNKKSQPKFPPNFPQFRSFPQTSHNPL
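
The protein sequence above should correspond to a codon-by-codon translation of the CDS. The following is a minimal sketch of that check, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB. Only the first 30 nucleotides of the CDS are used here, so the full sequence would need to be pasted in to check the whole record.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: this gene uses the standard nuclear table).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 nucleotides of the CDS listed above.
cds_start = "ATGCCCCCACCCCAGCCTCTATATCAACAA"
print(translate(cds_start))  # MPPPQPLYQQ
```

The output matches the first ten residues of the protein sequence listed above. A stop codon would appear as `*` in the translated string.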