CuGenDBv2

CmUC01G010880 (gene) of Watermelon (USVL531) v1 genome

Gene ID: CmUC01G010880
Organism: Citrullus mucosospermus (Watermelon (USVL531) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CmU531Chr01:16409802..16413442
RNA-Seq Expression: CmUC01G010880
Synteny: CmUC01G010880
Gene Ontology terms: NA
InterPro domains: NA
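
The genome location field above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be parsed and the span of the locus computed as follows. The names `Locus` and `parse_location` are hypothetical helpers introduced for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both end points count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'seqid:start..end' string such as the Genome location field."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return Locus(seqid, start, end)


locus = parse_location("CmU531Chr01:16409802..16413442")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption, the gene spans 3641 bp on CmU531Chr01.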


Homology
GenBank top hits (e value, % identity, alignment)
KAA0065687.1 retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] (e value: 1.8e-24; identity: 57.43%)
Query:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV
        ++RE+FFTFKMD  +SL ENL EFKK+  +FK LG+K+GD+NE+++LLNSLPEAY+EVK AL+Y R+ ITTG +ISA++  EL+L++ +++Q + E LF 
Subjt:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV

Query:  K
        K
Subjt:  K

XP_038875093.1 uncharacterized protein LOC120067620 [Benincasa hispida] (e value: 2.0e-23; identity: 52.94%)
Query:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV
        FLRE+FFT+KM +A+SLT+NL+E K++  EF+S+ + +G++NEA++LLNSL E++++VK A+KY RE ITT AIISA+K+ ELEL  +KK+Q   + LF 
Subjt:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV

Query:  KSKDKPNQSKGGKQQSNED
        K K+K N   G  +Q+N D
Subjt:  KSKDKPNQSKGGKQQSNED

XP_038885928.1 uncharacterized protein LOC120076236 [Benincasa hispida] (e value: 1.2e-31; identity: 48.31%)
Query:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV
        FLRE+FFT+KMD A+SLT+NL+EFK +  +F+S+G+ +G++NEA++LLNSLPE +++VK ALKY RE ITT AIISA+ + ELEL  TKK+QP  E  F 
Subjt:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV

Query:  KSKDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDCYSLKRKNQEKEKDAKGKQPKASIVEGSYFYSDALAST
        K  +        K     +  ++  IR + +K       DCY+LKRK  ++ K   GKQ +A++ E S  YSDALA+T
Subjt:  KSKDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDCYSLKRKNQEKEKDAKGKQPKASIVEGSYFYSDALAST

XP_038890043.1 uncharacterized protein LOC120079747 [Benincasa hispida] (e value: 3.1e-24; identity: 56.76%)
Query:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV
        FLRE+FFT+KMD+A+SLT+ L+EFK++  EF+S+G  +G++NEA++LLNSLPE++++ K A+KY RE ITT AIISA+++ ELEL  +KK Q   E LF 
Subjt:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV

Query:  KSKDKPNQSKG
        K K+K N  KG
Subjt:  KSKDKPNQSKG

XP_038896323.1 uncharacterized protein LOC120084587 [Benincasa hispida] (e value: 1.1e-24; identity: 56.91%)
Query:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF
        M+L EKFFTFKMDS+++LT N DEFKKIV EFK+LGEKL D NEAYVL NSLPE+Y+E+KNALKY R+S +T  IISAL+  ELEL   +K  P+ E + 
Subjt:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF

Query:  VKSKDKPNQSKGGKQQSNEDHKL
           + +  + KGG  +  +D K+
Subjt:  VKSKDKPNQSKGGKQQSNEDHKL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UB25 Putative gag-pol polyprotein (e value: 2.0e-05; identity: 45.76%)
Query:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTKLVLKLKLLKCLELIN
        NP +H R+KHID+K+H++R+ I   +VE++KVHT EN++DMLTK +   +    L+ +N
Subjt:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTKLVLKLKLLKCLELIN

A0A5A7UB25 Putative gag-pol polyprotein (e value: 3.8e-20; identity: 38.55%)
Query:  REKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFVKS
        +EKFF +KMD ++SL ENLDEF+KI+++  ++GEK+ D+N+A +LLNSLPE Y EVK A+KY R+S+T   ++ ALK   LE+   KKE+   E L  + 
Subjt:  REKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFVKS

Query:  KDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDC-YSLKRKNQEKEKDAKGKQPKASIVEGSYFYSDALASTRD
        + +    KG ++ S    K K K R  +   KGH  ++C  +  R+    E +       A I +G       + S RD
Subjt:  KDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDC-YSLKRKNQEKEKDAKGKQPKASIVEGSYFYSDALASTRD

A0A5D3CAI4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 8.7e-25; identity: 57.43%)
Query:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV
        ++RE+FFTFKMD  +SL ENL EFKK+  +FK LG+K+GD+NE+++LLNSLPEAY+EVK AL+Y R+ ITTG +ISA++  EL+L++ +++Q + E LF 
Subjt:  FLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFV

Query:  K
        K
Subjt:  K

A0A5D3CG50 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 5.8e-21; identity: 42.25%)
Query:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF
        ++++EKFF +KMD ++SL ENLDEF+KI+++  ++GEK+ D+N+A +LLNSLPE Y EVK A+KY R+S+    ++ ALK   LE+   KKE    E L 
Subjt:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF

Query:  VKSKDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDC
         + + +    KG ++ S    K K++ +C    K+GHL ++C
Subjt:  VKSKDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDC

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 9.9e-21; identity: 38.46%)
Query:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF
        ++++EKFF +KMD ++SL ENLDEF+KIV++  ++GEK+ D+N+A +LLNSLPE Y EVK A+KY R+S+T   ++ ALK   LE+   KKE+   E L 
Subjt:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF

Query:  VKSKDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDC-YSLKRKNQEKEKDAKGKQPKASIVEG
         + + +    KG ++      K K++ +C    K+GH  ++C  +  R+    E +       A I +G
Subjt:  VKSKDKPNQSKGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDC-YSLKRKNQEKEKDAKGKQPKASIVEG

A0A5D3DNU1 Putative gag-pol polyprotein (e value: 5.3e-06; identity: 56.82%)
Query:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK
        NP +H R+KHID+K+H++R+ I   +VE++KVHT EN++DMLTK
Subjt:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 6.2e-04; identity: 43.18%)
Query:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK
        NP  H R KHIDIKYHF R++++   + +  + T   +AD+ TK
Subjt:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK

P04146 Copia protein (e value: 3.1e-03; identity: 22.89%)
Query:  LREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALK-YDRESITTGAIISALKINELELHATKKE-QPSVERLF
        LR++  + K+ S  SL  +   F +++ E  + G K+ + ++   LL +LP  Y+ +  A++    E++T   + + L   E+++     +    V    
Subjt:  LREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALK-YDRESITTGAIISALKINELELHATKKE-QPSVERLF

Query:  VKSKDKPNQSKGGKQQSNEDHKL-----KTKIRCNYRKKKGHLMRDCYSLKR----KNQEKEKDAK
        V + +   ++   K +  +  K+     K K++C++  ++GH+ +DC+  KR    KN+E EK  +
Subjt:  VKSKDKPNQSKGGKQQSNEDHKL-----KTKIRCNYRKKKGHLMRDCYSLKR----KNQEKEKDAK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.7e-09; identity: 50%)
Query:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTKLVLKLKLLKCLELI
        N  YH RTKHID++YH++R+ ++   ++VLK+ T+EN ADMLTK+V + K   C EL+
Subjt:  NPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTKLVLKLKLLKCLELI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.3e-05; identity: 21.91%)
Query:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF
        ++L+++ +   M    +   +L+ F  ++ +  +LG K+ ++++A +LLNSLP +Y+ +   + + + +I    + SAL +NE      KK +   + L 
Subjt:  MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLF

Query:  VKSKDKPNQSKG---------GKQQSNEDHKLKTKIRCNYRKKKGHLMRDCYSLKRKNQEKEKDAKGKQPKASIVEGS
         + + +  Q            GK ++    +++    CN   + GH  RDC +  RK + +    K     A++V+ +
Subjt:  VKSKDKPNQSKG---------GKQQSNEDHKLKTKIRCNYRKKKGHLMRDCYSLKRKNQEKEKDAKGKQPKASIVEGS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 1.5e-05; identity: 43.75%)
Query:  FFLINPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK
        +   NP +H+R KHI I YHF+R++++ G + V+ V T + +AD LTK
Subjt:  FFLINPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 2.5e-05; identity: 41.67%)
Query:  FFLINPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK
        +   NP +H+R KHI + YHF+R++++ G + V+ V T + +AD LTK
Subjt:  FFLINPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTK

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTTCTGAGAGAGAAGTTCTTCACATTCAAGATGGATTCGGCTAGATCACTTACTGAAAATTTAGATGAGTTTAAGAAGATAGTAATTGAATTCAAGAGTCTTGGTGA
GAAACTAGGGGATGACAATGAAGCCTATGTTCTATTGAATTCCCTTCCAGAAGCATATGAAGAGGTAAAGAATGCTTTAAAATACGACAGAGAGTCAATTACCACTGGTG
CAATAATTTCAGCTCTTAAAATCAATGAATTAGAGCTTCATGCTACTAAGAAAGAGCAGCCTAGTGTGGAAAGGCTATTTGTTAAGAGCAAGGACAAACCAAATCAGTCT
AAGGGTGGTAAACAACAGTCTAATGAGGATCATAAGCTTAAGACGAAGATAAGGTGCAACTATCGCAAGAAGAAGGGCCACCTCATGAGAGATTGCTACAGCTTAAAGAG
GAAAAATCAAGAAAAAGAGAAAGATGCTAAGGGGAAACAACCTAAAGCTTCTATAGTGGAAGGCTCTTATTTCTACTCCGATGCACTAGCTTCTACAAGAGACAAGGCCA
ACCAAGCGAGATGGACGCATAGTATCAATCTCGCCGCCAATCAAGTAGATTTGCCATTAATTGCGGACGTATCATCTATAAAAGGGCAGCTCTGCGAGATGAGAAAGGAC
TTCAGCCGGAGACGAATAGCTCGAAAGAGAAGAGTCTTCCCTCCGCCGGACCACTTCACAGCAGAACCTCACGCTTCCATCTGGGCGACTCAAGACATTGACGCCTTTCC
TGCTTTCTTTCTTATTAATCCTCAGTATCATACAAGAACAAAGCATATAGACATAAAGTACCATTTCGTGAGAGATAAAATAGAAGGAGGAGAGGTAGAAGTGCTGAAAG
TTCATACCTCTGAAAATGTTGCCGACATGCTAACCAAACTAGTGTTGAAGCTAAAGCTGCTCAAGTGTCTCGAGCTGATCAACTTTGACCTGCCAGAGAAAGGGCAAGCT
CGAATTCTTCTCCCTCAAGCCCAAGTCGAAACTCTGGCAACATTTTATCTAAATAGAGGGGACGAGGAAGAGGTTGTGGGCGATGGAACAACAAGTCGATTTGCTAGGTC
TGTGGCAAGGTTGGTCACACAGCCGCTATCTGCTATAATCGCTTCAACAAAGAATTCAATAATCCTAAAAAATTACAATCTTAATCCACTCTTTCGAGGAGCACCCAACA
CTTATGTGGCAAACTCGGTCATGGCTACGCCTGAGACTATCATCAACGCCCAACTGCTAAGCTGA
mRNA sequence
ATGTTTCTGAGAGAGAAGTTCTTCACATTCAAGATGGATTCGGCTAGATCACTTACTGAAAATTTAGATGAGTTTAAGAAGATAGTAATTGAATTCAAGAGTCTTGGTGA
GAAACTAGGGGATGACAATGAAGCCTATGTTCTATTGAATTCCCTTCCAGAAGCATATGAAGAGGTAAAGAATGCTTTAAAATACGACAGAGAGTCAATTACCACTGGTG
CAATAATTTCAGCTCTTAAAATCAATGAATTAGAGCTTCATGCTACTAAGAAAGAGCAGCCTAGTGTGGAAAGGCTATTTGTTAAGAGCAAGGACAAACCAAATCAGTCT
AAGGGTGGTAAACAACAGTCTAATGAGGATCATAAGCTTAAGACGAAGATAAGGTGCAACTATCGCAAGAAGAAGGGCCACCTCATGAGAGATTGCTACAGCTTAAAGAG
GAAAAATCAAGAAAAAGAGAAAGATGCTAAGGGGAAACAACCTAAAGCTTCTATAGTGGAAGGCTCTTATTTCTACTCCGATGCACTAGCTTCTACAAGAGACAAGGCCA
ACCAAGCGAGATGGACGCATAGTATCAATCTCGCCGCCAATCAAGTAGATTTGCCATTAATTGCGGACGTATCATCTATAAAAGGGCAGCTCTGCGAGATGAGAAAGGAC
TTCAGCCGGAGACGAATAGCTCGAAAGAGAAGAGTCTTCCCTCCGCCGGACCACTTCACAGCAGAACCTCACGCTTCCATCTGGGCGACTCAAGACATTGACGCCTTTCC
TGCTTTCTTTCTTATTAATCCTCAGTATCATACAAGAACAAAGCATATAGACATAAAGTACCATTTCGTGAGAGATAAAATAGAAGGAGGAGAGGTAGAAGTGCTGAAAG
TTCATACCTCTGAAAATGTTGCCGACATGCTAACCAAACTAGTGTTGAAGCTAAAGCTGCTCAAGTGTCTCGAGCTGATCAACTTTGACCTGCCAGAGAAAGGGCAAGCT
CGAATTCTTCTCCCTCAAGCCCAAGTCGAAACTCTGGCAACATTTTATCTAAATAGAGGGGACGAGGAAGAGGTTGTGGGCGATGGAACAACAAGTCGATTTGCTAGGTC
TGTGGCAAGGTTGGTCACACAGCCGCTATCTGCTATAATCGCTTCAACAAAGAATTCAATAATCCTAAAAAATTACAATCTTAATCCACTCTTTCGAGGAGCACCCAACA
CTTATGTGGCAAACTCGGTCATGGCTACGCCTGAGACTATCATCAACGCCCAACTGCTAAGCTGA
Protein sequence
MFLREKFFTFKMDSARSLTENLDEFKKIVIEFKSLGEKLGDDNEAYVLLNSLPEAYEEVKNALKYDRESITTGAIISALKINELELHATKKEQPSVERLFVKSKDKPNQS
KGGKQQSNEDHKLKTKIRCNYRKKKGHLMRDCYSLKRKNQEKEKDAKGKQPKASIVEGSYFYSDALASTRDKANQARWTHSINLAANQVDLPLIADVSSIKGQLCEMRKD
FSRRRIARKRRVFPPPDHFTAEPHASIWATQDIDAFPAFFLINPQYHTRTKHIDIKYHFVRDKIEGGEVEVLKVHTSENVADMLTKLVLKLKLLKCLELINFDLPEKGQA
RILLPQAQVETLATFYLNRGDEEEVVGDGTTSRFARSVARLVTQPLSAIIASTKNSIILKNYNLNPLFRGAPNTYVANSVMATPETIINAQLLS
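
The CDS and protein sequences listed above should agree with each other. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene (an assumption, not stated in the record), a CDS can be translated and compared with the listed protein. The function `translate` is a hypothetical helper written for illustration; it does not handle ambiguity codes such as N.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, dropping a single trailing stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence lines
    can be pasted in directly. Codons outside A/C/G/T raise KeyError.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGTTTCTGAGAGAGAAGTTCTTCACATTC"))
print(translate("CTGCTAAGCTGA"))
```

The two fragments translate to `MFLREKFFTF` and `LLS`, which match the start and end of the listed protein sequence. Running the full CDS through the same function would check the rest.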