; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC11G215910 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC11G215910
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmU531Chr11:23949909..23951528
RNA-Seq ExpressionCmUC11G215910
SyntenyCmUC11G215910
Gene Ontology termsGO:0044237 - cellular metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0016740 - transferase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050070.1 putative polyprotein [Cucumis melo var. makuwa]3.0e-4547.8Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        S +FE+ KF+  GDF  W+KK++A+LVQ KV+K LD   +LP +++++EK +M ++ YST++LYL+D VLR V     + +LW KL+ L++ KSL+NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN
        +KE+ F +KMD SKGLEEN+D+F +I +DL N  EK+  ENQ VILLNSLP+ Y+EVK+ I+Y  D L   I+ DAL+ R     K  +  E+L+    +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN

Query:  ETDEW
        E   W
Subjt:  ETDEW

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.3e-4549.27Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        S +FE+ KF+  GDF  W+KK++A+LVQ KV+K LD   +LP +++++EK +M ++AYST++LYL+D VLR V       +LW KL+ L+L+KSL NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN
        +KE+ F +KMD SK LEEN+D+F +I +DL N  EK+  ENQ VILLNSLP+ Y+EVK+ I+Y  D LT  IV DAL+ R     K  +  E+L+    +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN

Query:  ETDEW
        E   W
Subjt:  ETDEW

PON64464.1 hypothetical protein TorRG33x02_273130 [Trema orientale]2.7e-4655.62Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        ++K E++KFD KGDF  WKKKMKA+LVQQK +K L   S LP +M  +EK E+ + AYS +IL LADNVLR+V   D  + +W+KLD L+L+K+L+NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQ
        LKE+LF FKMDS+K LE+N+DDF RI + L N  EK++ ENQ +I+LNSLP+ YK++KS I+Y  + L+ D V  AL+
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQ

TXG64595.1 hypothetical protein EZV62_011589 [Acer yangbiense]3.0e-4540.65Show/hide
Query:  MSAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKI
        MS KF++DKFD  GDFG W++K+KALL QQK+ KA++ P KLP S++  +K +M ++A  T+IL L+DNVLR ++      D+W KL+ L+L+KSL NKI
Subjt:  MSAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKI

Query:  YLKERLFDFKMDSSKGLEENIDDFCRICLDLVNA--SEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRRKGYESAEVLVITESNET
        YLKERLF FKMD+SKGL +N+DDF ++ ++L NA   EKL  EN+ +ILLNSLPD +K+VK+ I+Y    L+ +    AL      +S E+ +  E  + 
Subjt:  YLKERLFDFKMDSSKGLEENIDDFCRICLDLVNA--SEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRRKGYESAEVLVITESNET

Query:  DEWILDSGSLQSTTYQSE---VDEGYTHRLQYISNKEMSTFQVQRRPY---KNTKTKVKRMLLKSLDFGHLKQTVSKE
         E +    +++   ++ E     +G     ++ +N   +T +V  + +    NT +K+K  +  S   GH+ +T + E
Subjt:  DEWILDSGSLQSTTYQSE---VDEGYTHRLQYISNKEMSTFQVQRRPY---KNTKTKVKRMLLKSLDFGHLKQTVSKE

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]1.3e-4549.27Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        S +FE+ KF+  GDF  W+KK++A+LVQ KV+K LD   +LP +++++EK +M ++AYST++LYL+D VLR V       +LW KL+ L+L+KSL NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN
        +KE+ F +KMD SK LEEN+D+F +I +DL N  EK+  ENQ VILLNSLP+ Y+EVK+ I+Y  D LT  IV DAL+ R     K  +  E+L+    +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN

Query:  ETDEW
        E   W
Subjt:  ETDEW

TrEMBL top hitse value%identityAlignment
A0A2P5CTT8 Uncharacterized protein1.3e-4655.62Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        ++K E++KFD KGDF  WKKKMKA+LVQQK +K L   S LP +M  +EK E+ + AYS +IL LADNVLR+V   D  + +W+KLD L+L+K+L+NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQ
        LKE+LF FKMDS+K LE+N+DDF RI + L N  EK++ ENQ +I+LNSLP+ YK++KS I+Y  + L+ D V  AL+
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQ

A0A5A7U459 Putative polyprotein1.4e-4547.8Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        S +FE+ KF+  GDF  W+KK++A+LVQ KV+K LD   +LP +++++EK +M ++ YST++LYL+D VLR V     + +LW KL+ L++ KSL+NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN
        +KE+ F +KMD SKGLEEN+D+F +I +DL N  EK+  ENQ VILLNSLP+ Y+EVK+ I+Y  D L   I+ DAL+ R     K  +  E+L+    +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN

Query:  ETDEW
        E   W
Subjt:  ETDEW

A0A5A7UB25 Putative gag-pol polyprotein6.5e-4649.27Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        S +FE+ KF+  GDF  W+KK++A+LVQ KV+K LD   +LP +++++EK +M ++AYST++LYL+D VLR V       +LW KL+ L+L+KSL NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN
        +KE+ F +KMD SK LEEN+D+F +I +DL N  EK+  ENQ VILLNSLP+ Y+EVK+ I+Y  D LT  IV DAL+ R     K  +  E+L+    +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN

Query:  ETDEW
        E   W
Subjt:  ETDEW

A0A5C7I661 Uncharacterized protein1.4e-4540.65Show/hide
Query:  MSAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKI
        MS KF++DKFD  GDFG W++K+KALL QQK+ KA++ P KLP S++  +K +M ++A  T+IL L+DNVLR ++      D+W KL+ L+L+KSL NKI
Subjt:  MSAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKI

Query:  YLKERLFDFKMDSSKGLEENIDDFCRICLDLVNA--SEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRRKGYESAEVLVITESNET
        YLKERLF FKMD+SKGL +N+DDF ++ ++L NA   EKL  EN+ +ILLNSLPD +K+VK+ I+Y    L+ +    AL      +S E+ +  E  + 
Subjt:  YLKERLFDFKMDSSKGLEENIDDFCRICLDLVNA--SEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRRKGYESAEVLVITESNET

Query:  DEWILDSGSLQSTTYQSE---VDEGYTHRLQYISNKEMSTFQVQRRPY---KNTKTKVKRMLLKSLDFGHLKQTVSKE
         E +    +++   ++ E     +G     ++ +N   +T +V  + +    NT +K+K  +  S   GH+ +T + E
Subjt:  DEWILDSGSLQSTTYQSE---VDEGYTHRLQYISNKEMSTFQVQRRPY---KNTKTKVKRMLLKSLDFGHLKQTVSKE

A0A5D3DNU1 Putative gag-pol polyprotein6.5e-4649.27Show/hide
Query:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        S +FE+ KF+  GDF  W+KK++A+LVQ KV+K LD   +LP +++++EK +M ++AYST++LYL+D VLR V       +LW KL+ L+L+KSL NKIY
Subjt:  SAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN
        +KE+ F +KMD SK LEEN+D+F +I +DL N  EK+  ENQ VILLNSLP+ Y+EVK+ I+Y  D LT  IV DAL+ R     K  +  E+L+    +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRR-----KGYESAEVLVITESN

Query:  ETDEW
        E   W
Subjt:  ETDEW

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.0e-0926.83Show/hide
Query:  AKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDA-EKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY
        AK  +  FD +  +  WK +++ALL +Q V K +D    +P  + D+ +KAE  + A ST+I YL+D+ L   ++      +   LD ++  KSLA+++ 
Subjt:  AKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDA-EKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIY

Query:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIR-YECDKLTFDIVFDALQRRKGYESAEVLVITESNETDE
        L++RL   K+ S   L  +   F  +  +L+ A  K++  +++  LL +LP  Y  + + I     + LT   V + L         E+ +  + N+T +
Subjt:  LKERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIR-YECDKLTFDIVFDALQRRKGYESAEVLVITESNETDE

Query:  WILDS
         ++++
Subjt:  WILDS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.7e-2230.05Show/hide
Query:  KFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIYLK
        K+E+ KF+    F  W+++M+ LL+QQ + K LD  SK P +M   + A++ + A S + L+L+D+V+  +   D    +WT+L+ L++SK+L NK+YLK
Subjt:  KFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIYLK

Query:  ERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRY-----ECDKLTFDIVFDALQRRKGYESAEVLV
        ++L+   M        +++ F  +   L N   K++ E++ ++LLNSLP  Y  + + I +     E   +T  ++ +   R+K     + L+
Subjt:  ERLFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRY-----ECDKLTFDIVFDALQRRKGYESAEVLV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCCAAGTTTGAATTAGACAAATTCGATGAAAAAGGTGATTTCGGCTTTTGGAAGAAGAAGATGAAGGCTCTTCTGGTTCAACAAAAGGTTTCGAAAGCT
CTAGATGCTCCAAGTAAGTTGCCCACTTCTATGAGTGATGCCGAAAAGGCTGAGATGGCTAAAATTGCATATAGTACTGTCATTCTATATTTGGCGGATAATGTT
TTGAGACGTGTTAGTGCTATTGATAATGTTTCTGATTTATGGACTAAGCTTGATGAATTGTTCTTATCCAAGTCATTAGCGAACAAGATCTACTTAAAGGAGAGG
TTATTTGATTTTAAAATGGATTCATCTAAGGGTCTAGAAGAGAATATTGATGATTTTTGTAGGATTTGTCTAGATCTTGTTAATGCTAGTGAGAAATTAGATGGA
GAAAACCAAGTTGTCATCCTTCTTAATTCATTACCTGATAAATATAAGGAAGTGAAATCAGATATTAGATATGAATGCGATAAGTTAACTTTTGATATTGTCTTT
GATGCACTTCAACGAAGGAAAGGCTATGAATCTGCTGAAGTACTTGTTATTACTGAATCTAATGAAACTGATGAATGGATATTAGATTCAGGATCACTACAATCA
ACCACCTACCAGTCAGAAGTTGACGAAGGATATACACATCGTCTGCAGTACATCAGCAACAAGGAAATGTCAACATTCCAAGTACAACGCAGACCCTATAAAAAC
ACCAAGACAAAGGTAAAAAGAATGTTGTTGAAATCTTTGGACTTTGGCCATTTGAAGCAGACCGTTTCCAAGGAGACAACTAGAGATAAAGAACTCGAGAGAGTA
ACCTTCTCGTCGACCAACCATTCCACCACCACACGTAGTGAAACACACGTCCACTGCCGACGGGAACAAAAGATTGTCATCGCCGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCCAAGTTTGAATTAGACAAATTCGATGAAAAAGGTGATTTCGGCTTTTGGAAGAAGAAGATGAAGGCTCTTCTGGTTCAACAAAAGGTTTCGAAAGCT
CTAGATGCTCCAAGTAAGTTGCCCACTTCTATGAGTGATGCCGAAAAGGCTGAGATGGCTAAAATTGCATATAGTACTGTCATTCTATATTTGGCGGATAATGTT
TTGAGACGTGTTAGTGCTATTGATAATGTTTCTGATTTATGGACTAAGCTTGATGAATTGTTCTTATCCAAGTCATTAGCGAACAAGATCTACTTAAAGGAGAGG
TTATTTGATTTTAAAATGGATTCATCTAAGGGTCTAGAAGAGAATATTGATGATTTTTGTAGGATTTGTCTAGATCTTGTTAATGCTAGTGAGAAATTAGATGGA
GAAAACCAAGTTGTCATCCTTCTTAATTCATTACCTGATAAATATAAGGAAGTGAAATCAGATATTAGATATGAATGCGATAAGTTAACTTTTGATATTGTCTTT
GATGCACTTCAACGAAGGAAAGGCTATGAATCTGCTGAAGTACTTGTTATTACTGAATCTAATGAAACTGATGAATGGATATTAGATTCAGGATCACTACAATCA
ACCACCTACCAGTCAGAAGTTGACGAAGGATATACACATCGTCTGCAGTACATCAGCAACAAGGAAATGTCAACATTCCAAGTACAACGCAGACCCTATAAAAAC
ACCAAGACAAAGGTAAAAAGAATGTTGTTGAAATCTTTGGACTTTGGCCATTTGAAGCAGACCGTTTCCAAGGAGACAACTAGAGATAAAGAACTCGAGAGAGTA
ACCTTCTCGTCGACCAACCATTCCACCACCACACGTAGTGAAACACACGTCCACTGCCGACGGGAACAAAAGATTGTCATCGCCGACTAG
Protein sequenceShow/hide protein sequence
MSAKFELDKFDEKGDFGFWKKKMKALLVQQKVSKALDAPSKLPTSMSDAEKAEMAKIAYSTVILYLADNVLRRVSAIDNVSDLWTKLDELFLSKSLANKIYLKER
LFDFKMDSSKGLEENIDDFCRICLDLVNASEKLDGENQVVILLNSLPDKYKEVKSDIRYECDKLTFDIVFDALQRRKGYESAEVLVITESNETDEWILDSGSLQS
TTYQSEVDEGYTHRLQYISNKEMSTFQVQRRPYKNTKTKVKRMLLKSLDFGHLKQTVSKETTRDKELERVTFSSTNHSTTTRSETHVHCRREQKIVIAD