CuGenDBv2

Clc02G12220 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G12220
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr02:23892466..23893173
RNA-Seq Expression: Clc02G12220
Synteny: Clc02G12220
Gene Ontology terms:
GO:0006810 - transport (biological process)
GO:0009987 - cellular process (biological process)
GO:0000325 - plant-type vacuole (cellular component)
GO:0016020 - membrane (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0033139.1 uncharacterized protein E6C27_scaffold269G002790 [Cucumis melo var. makuwa] (e value: 4.4e-73; % identity: 65.38)
Query:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE
        M  ++N+V+SNVI LASKITEHKLNG NYYDWR+TI FYL+STDMDDHMT + P++ K+KK+WL DD RLYLQIKNSIESE+IGL+DH            
Subjt:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL
                                 +ESVTNYFMRLK+I AEL LLLPF+PDVKVQQAQREKMAVMI LNGLLPEFGM KTQILS+SKIPSL++AF+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL

Query:  RIESSQSSLSIPQSSSAIISKNNNSRAPRGMDNN
        RIESS + +SIPQS+S +ISKNNN RAPR MD N
Subjt:  RIESSQSSLSIPQSSSAIISKNNNSRAPRGMDNN

TYJ97232.1 putative Polyprotein [Cucumis melo var. makuwa] (e value: 1.2e-70; % identity: 81.67)
Query:  MDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELT
        MDDH T   P+D K KKDWL DD RLYLQIKNSIESE+IGLVDHC S+KELLEFL+FLYSGKEQV RMFEVCMQF RAEQKA SVTNYFMRLK+I AEL 
Subjt:  MDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELT

Query:  LLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIISKNNN
        LLLPFSPDVKVQQAQR+KMAVMIFLNG LPEFGM K QILSDSKIPSL++AF+ VL IESS + +SIPQSSSA+ISKNNN
Subjt:  LLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIISKNNN

XP_021674814.1 uncharacterized protein LOC110660727 [Hevea brasiliensis] (e value: 2.0e-62; % identity: 58.9)
Query:  VISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKE
        VIS+VIP+ +KITEHKLNG NY +W +T+  YLRS D DDH+T DPP D+  ++ WLR+D RL+LQ++NSI SEVI L++HC  VKEL+++L+FLYSGK 
Subjt:  VISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKE

Query:  QVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQS
         + R+++VC  F+RAE++ +S+T YFM  KR+  EL +LLPFSPDVKVQQAQRE++AVM FL GL  E+  AK+QILS S+I SL E F+RVLR ES+QS
Subjt:  QVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQS

Query:  SLSIPQSSSAIISKNNNSR
        S     +SSA+IS+N N +
Subjt:  SLSIPQSSSAIISKNNNSR

XP_031744753.1 uncharacterized protein LOC101212255 isoform X1 [Cucumis sativus] (e value: 1.8e-106; % identity: 88.21)
Query:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE
        M  +KNLV+SNVIPLASKITEHKLNG NYYDWRRTILFYLRSTDMDDHMT DPPKD KQKKDWLRDD RLYLQIKNSIESE+IGLVDHC SVKELLEFL+
Subjt:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL
        FLYSGKEQVHRMFEVCMQFFRAEQKAESVT+YFMRLK+I AEL LLLPFSPDVKVQQ QREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSL++AF+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL

Query:  RIESSQSSLSIPQSSSAIISKNNNSRAPR
        RIESS +S+SIPQ SSA+ SKNNN RAP+
Subjt:  RIESSQSSLSIPQSSSAIISKNNNSRAPR

XP_038898187.1 uncharacterized protein LOC120085933 [Benincasa hispida] (e value: 1.2e-81; % identity: 80)
Query:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE
        MD +K++V+SNVIPLASKIT+HKLNG NYYDWRRTI FYL ST+MDDH+  +P KDEK KK WL DD RLYLQIKNSIESEV+GLVDHC+ VKELLEFLE
Subjt:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL
        FLYSGKE V+RMF+V MQFFR EQK E VT+YFMRLK+  AEL LLLP+S DVKVQQAQREKM VMIFLNGL  EFGMAKTQILSDS+IPSLEEAFSRVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL

Query:  RIESS
         IESS
Subjt:  RIESS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SR90 Gag-pol polyprotein (e value: 3.2e-61; % identity: 71.05)
Query:  MDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELT
        MDDHMT D PKD K+KKDWLRDD RLYLQIKNSIESE+IGLV          +++E               C++FFRAEQKAESVTNYFMRLK+I A L 
Subjt:  MDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELT

Query:  LLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIISKNNNSRAPRGMDNN
        LLLPFSPDVKVQQAQREKM V IFLNGLLPEFGM K QILSDSKIPSL++AF+RVLRIESS + +SIPQSSSA+ISKNNN RAPR MD N
Subjt:  LLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIISKNNNSRAPRGMDNN

A0A5A7SVC9 Uncharacterized protein (e value: 2.1e-73; % identity: 65.38)
Query:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE
        M  ++N+V+SNVI LASKITEHKLNG NYYDWR+TI FYL+STDMDDHMT + P++ K+KK+WL DD RLYLQIKNSIESE+IGL+DH            
Subjt:  MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLE

Query:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL
                                 +ESVTNYFMRLK+I AEL LLLPF+PDVKVQQAQREKMAVMI LNGLLPEFGM KTQILS+SKIPSL++AF+RVL
Subjt:  FLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVL

Query:  RIESSQSSLSIPQSSSAIISKNNNSRAPRGMDNN
        RIESS + +SIPQS+S +ISKNNN RAPR MD N
Subjt:  RIESSQSSLSIPQSSSAIISKNNNSRAPRGMDNN

A0A5D3BBM8 Putative Polyprotein (e value: 5.8e-71; % identity: 81.67)
Query:  MDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELT
        MDDH T   P+D K KKDWL DD RLYLQIKNSIESE+IGLVDHC S+KELLEFL+FLYSGKEQV RMFEVCMQF RAEQKA SVTNYFMRLK+I AEL 
Subjt:  MDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELT

Query:  LLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIISKNNN
        LLLPFSPDVKVQQAQR+KMAVMIFLNG LPEFGM K QILSDSKIPSL++AF+ VL IESS + +SIPQSSSA+ISKNNN
Subjt:  LLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIISKNNN

A0A5N5JC74 Uncharacterized protein (e value: 1.0e-59; % identity: 53.12)
Query:  ISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQ
        I+ ++P  SKIT+HKLNG NY +W +TI  YLRS + DDH+  +PP DE +KK W+RDD RL+LQI+NSI+SE++GL++HC  VKEL+++LEFLYSGK  
Subjt:  ISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQ

Query:  VHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSS
        + RM++VC  F+RAE++A+S+T YFM  K+   EL +LLPFS D+KVQQ QREKMAVM FL GL  E    K+QILS  +I SL+E FSR+LR E    +
Subjt:  VHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSS

Query:  LSIPQSSSAIISKNNNSRAPRGMD
        LSI  +++ +++K   +   R ++
Subjt:  LSIPQSSSAIISKNNNSRAPRGMD

A0A5N5JJ99 Uncharacterized protein (e value: 1.0e-59; % identity: 53.12)
Query:  ISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQ
        I+ ++P  SKIT+HKLNG NY +W +TI  YLRS + DDH+  +PP DE +KK W+RDD RL+LQI+NSI+SE++GL++HC  VKEL+++LEFLYSGK  
Subjt:  ISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQ

Query:  VHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSS
        + RM++VC  F+RAE++A+S+T YFM  K+   EL +LLPFS D+KVQQ QREKMAVM FL GL  E    K+QILS  +I SL+E FSR+LR E    +
Subjt:  VHRMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSS

Query:  LSIPQSSSAIISKNNNSRAPRGMD
        LSI  +++ +++K   +   R ++
Subjt:  LSIPQSSSAIISKNNNSRAPRGMD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
atggatgcaatgaagaatttagtgatatctaatgtgattcccctagcctcaaagatcacagaacacaagttaaatggatttaattactatgattggcgtcggacaattct
gttttatttacggagtacggatatggatgatcatatgactacagacccacctaaagatgaaaaacagaagaaggattggcttcgtgatgacactcgactttatcttcaga
tcaaaaattcgattgagagtgaggtaattggtttggtcgatcattgtaattcagtgaaggagctcttagaatttttagaatttctctactcaggtaaagagcaagtccat
agaatgtttgaagtttgtatgcagttctttcgcgccgaacagaaagcagagtctgtgaccaactactttatgagacttaagagaatagatgccgaacttactttgttact
acctttcagcccagatgttaaagtgcaacaggctcagcgagaaaagatggctgttatgatctttttgaatggacttttacctgaatttggaatggccaaaacacaaattc
tttctgactctaaaatcccgtcattagaagaggctttcagtcgtgttcttcgtattgagagttctcaatccagtttatctattcctcaatccagcagcgccattattagc
aagaataataactccagggctcctcgagggatggataacaacttttag
mRNA sequence
atggatgcaatgaagaatttagtgatatctaatgtgattcccctagcctcaaagatcacagaacacaagttaaatggatttaattactatgattggcgtcggacaattct
gttttatttacggagtacggatatggatgatcatatgactacagacccacctaaagatgaaaaacagaagaaggattggcttcgtgatgacactcgactttatcttcaga
tcaaaaattcgattgagagtgaggtaattggtttggtcgatcattgtaattcagtgaaggagctcttagaatttttagaatttctctactcaggtaaagagcaagtccat
agaatgtttgaagtttgtatgcagttctttcgcgccgaacagaaagcagagtctgtgaccaactactttatgagacttaagagaatagatgccgaacttactttgttact
acctttcagcccagatgttaaagtgcaacaggctcagcgagaaaagatggctgttatgatctttttgaatggacttttacctgaatttggaatggccaaaacacaaattc
tttctgactctaaaatcccgtcattagaagaggctttcagtcgtgttcttcgtattgagagttctcaatccagtttatctattcctcaatccagcagcgccattattagc
aagaataataactccagggctcctcgagggatggataacaacttttag
Protein sequence
MDAMKNLVISNVIPLASKITEHKLNGFNYYDWRRTILFYLRSTDMDDHMTTDPPKDEKQKKDWLRDDTRLYLQIKNSIESEVIGLVDHCNSVKELLEFLEFLYSGKEQVH
RMFEVCMQFFRAEQKAESVTNYFMRLKRIDAELTLLLPFSPDVKVQQAQREKMAVMIFLNGLLPEFGMAKTQILSDSKIPSLEEAFSRVLRIESSQSSLSIPQSSSAIIS
KNNNSRAPRGMDNNF
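
Consistency check (illustrative sketch, not part of the database record): the genome location, CDS, and protein sequence listed for Clc02G12220 can be cross-checked with a few lines of Python. The helper names `locus_length` and `translate` are hypothetical and introduced only for this example. The codon table is the standard genetic code, which is assumed to apply to this nuclear gene.

```python
import re

# Standard genetic code, codons enumerated in t, c, a, g order (assumption:
# the standard table applies to this nuclear-encoded gene).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def locus_length(location):
    """Return the inclusive length in bp of a 'chrom:start..end' location."""
    match = re.fullmatch(r"[^:]+:(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


def translate(cds):
    """Translate an in-frame CDS, stopping at the first stop codon.

    Codons containing characters other than a, c, g, t raise KeyError.
    """
    cds = cds.lower()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# Genome location as listed in the record.
print(locus_length("ClcChr02:23892466..23893173"))  # 708

# Last line of the CDS as listed in the record; the preceding 660 nt are a
# multiple of three, so this fragment is in frame.
tail = "aagaataataactccagggctcctcgagggatggataacaacttttag"
print(translate(tail))  # KNNNSRAPRGMDNNF
```

The 708 bp span equals the length of the listed CDS (236 codons including the stop), which is consistent with a single-exon gene model, and the translated tail matches the last line of the protein sequence.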