; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G009785 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G009785
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr06:19297134..19298446
RNA-Seq ExpressionClCG06G009785
SyntenyClCG06G009785
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016740 - transferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIM97577.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]1.3e-13056.28Show/hide
Query:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------
        S  PWYADIVNYL C   P +L+AQQKKK  ++++ Y WD+P+L++ G D+ILRRCVPE E + IL  CH +PYGGHF G                    
Subjt:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------

Query:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE
                ++ QRT NIS ++EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDYVSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD 
Subjt:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE

Query:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------
        GT F NR    LL+K+ V H+++T YHPQT+GQ E++NREIK ILEK VS++RKDW +RLDEALWAYRTA+KTPIGMSPY L                  
Subjt:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------

Query:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG
                  A+GE R LQLNEL E+R  AYENAK+YKE+ K+WH+K I ++    GQ VLLFNS+L+LFPGKLKSRWSGPF I EV PHGAV+L N++ 
Subjt:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG

Query:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE
           FKVN QR+K Y  E ++ +  SI L++
Subjt:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE

PIN14790.1 DNA-directed DNA polymerase [Handroanthus impetiginosus]7.0e-12956.05Show/hide
Query:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------
        S  PWYADIVNYL C   P +L+AQQKKK+ ++++ Y WD+ +L++ G D+ILRRCVPE E + IL  CH +PYGGHF G                    
Subjt:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------

Query:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE
                ++ QRT NIS ++EMPLN++LEVELFDVWGIDFMG F PS GN YILVAVDYVSKWVEA A   ND+  V  F+KK IF+RFGTPRAIIS+ 
Subjt:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE

Query:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------
        GT F NR    LL+K+ V H+++T YHPQT+GQ E++NREIK ILEK VS++RKDW +RLDEALWAYRTAFKTPIGMSPY L                  
Subjt:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------

Query:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG
                  A+GE R LQLNEL E+R  AYENAK+YKE+TK+WHDK I ++    GQ VLLFNS+L+LFPGKLKSRWSGPF + EV  HGAV+L NE+ 
Subjt:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG

Query:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE
           FKVN QR+K Y    ++   TSI L+E
Subjt:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE

PNY03357.1 hypothetical protein L195_g026684, partial [Trifolium pratense]2.3e-12755.29Show/hide
Query:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------
        PW+AD  NY+V    P +  +QQ+KK  ++ KFY WDEP+LY+ G D +LRRCVPE E   +L  CH++ YGGHF G                       
Subjt:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------

Query:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD
             ++ QRT NIS +NEMP N +LEVE+FDVWGIDFMGPFP S    YILVAVDYVSKWVEA A   NDA  V  FLKK IFSRFG PRA+ISDEGT 
Subjt:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD

Query:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYA----------------------
        F+NR +  LL K+NV HR+AT YHPQT+GQ E++NR+IK ILEK V++SRKDW  +LD+ALWAYRTAFKTPIGMSP+                       
Subjt:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYA----------------------

Query:  -----LDASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS
             L  +GE+R LQL+EL E+R+ AYENAK++KE+TKKWHDK I  +    GQ VLLFNS+L+LFPGKLKSRWSGPF IK+V PHGAV+L + D   +
Subjt:  -----LDASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS

Query:  FKVNGQRVKPYHIEELEIEKTSIDL
        FKVNGQR+KPY  +E  + + SI L
Subjt:  FKVNGQRVKPYHIEELEIEKTSIDL

WP_217833161.1 DDE-type integrase/transposase/recombinase, partial [Synechococcus sp. PCC 7002]8.0e-14977.62Show/hide
Query:  MNAESQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR---------------
        M A+SQEPWY DIVNYLVCNQWPEE NA QKKKL++ESKFYCWDEPYLYRLG DHILRRCVPEYETHSIL+SCHEAPYGGHFGG+               
Subjt:  MNAESQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR---------------

Query:  ------------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAI
                    ++ QRT NISN+NEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAI
Subjt:  ------------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAI

Query:  ISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL--------------
        ISDEGT FINRIITNLLTKFNVSHRVAT YHPQTN QAEITN+EIKSILEKVVSTSRKDW ERLDEALWAYRT FKTPIGMSPYAL              
Subjt:  ISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL--------------

Query:  -------------DASGEARKLQLNELLEWRHSAYENAKLYKER
                     DASGEARKLQLNEL+EWRHSAYENAKLYKE+
Subjt:  -------------DASGEARKLQLNELLEWRHSAYENAKLYKER

XP_012842899.1 PREDICTED: uncharacterized protein LOC105963074 [Erythranthe guttata]1.7e-12755.83Show/hide
Query:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------
        PWYAD+ N+L     P++L+  QKKK  ++S+FY WDEP L+R G D ++RRCVPE E   IL  CH +P GGH G                        
Subjt:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------

Query:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD
             ++ QRT N+SN+++MPLN+M EVELFDVWGIDFMGPFP S G  YIL+AVDYVSKWVEA A   NDA TV KF  K IFSRFGTPRAIISDEG+ 
Subjt:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD

Query:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL---------------------
        F N+++TNL  K  + H++A  YHPQTNG AE++NREIK ILEK VST+RKDW  +LD+ALWAYRTAFKTPIGMSPY L                     
Subjt:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL---------------------

Query:  ------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS
               A+G+ R LQLNE+ E+R+ AYENAK+YKE+TKKWHDK I+K+    G +VLLFNS+LRLFPGKLKSRWSGPF++  V+P G +++   DG  S
Subjt:  ------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS

Query:  FKVNGQRVKPYH
        FKVNGQRVK Y+
Subjt:  FKVNGQRVKPYH

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase6.2e-13156.28Show/hide
Query:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------
        S  PWYADIVNYL C   P +L+AQQKKK  ++++ Y WD+P+L++ G D+ILRRCVPE E + IL  CH +PYGGHF G                    
Subjt:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------

Query:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE
                ++ QRT NIS ++EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDYVSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD 
Subjt:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE

Query:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------
        GT F NR    LL+K+ V H+++T YHPQT+GQ E++NREIK ILEK VS++RKDW +RLDEALWAYRTA+KTPIGMSPY L                  
Subjt:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------

Query:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG
                  A+GE R LQLNEL E+R  AYENAK+YKE+ K+WH+K I ++    GQ VLLFNS+L+LFPGKLKSRWSGPF I EV PHGAV+L N++ 
Subjt:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG

Query:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE
           FKVN QR+K Y  E ++ +  SI L++
Subjt:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE

A0A2G9G6G2 Reverse transcriptase2.4e-12755.09Show/hide
Query:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------
        S  PWYADIVNYL C   P +L+ QQKKK+ ++++ Y W++P+L + G D+ILRRCVPE E + IL  CH +PYGGHF G                    
Subjt:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------

Query:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE
                ++ QRT NIS ++EMPLN++LEVELFDVWGIDFMGPF PS GN YILVAVDYVSKWVEAAA   ND+  V  F+KK IF+RFGTPRAIISD 
Subjt:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE

Query:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------
         T F NR    LL+K+ V H++ T YHPQT+G  E++NREIK ILEK VS++RKDW +RLDEALWAYRTA+KTPIGMSPY L                  
Subjt:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------

Query:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG
                  A+GE R LQLNEL E+R  AYENAK+YKE+TK+WHDK I ++    GQ VLLFNS+L+LFPGKLKSRW G F I EV PHGAV+L NE+ 
Subjt:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG

Query:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHECN
           FK+N +R+K Y    ++ + TSI L++ N
Subjt:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHECN

A0A2G9HBV9 DNA-directed DNA polymerase3.4e-12956.05Show/hide
Query:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------
        S  PWYADIVNYL C   P +L+AQQKKK+ ++++ Y WD+ +L++ G D+ILRRCVPE E + IL  CH +PYGGHF G                    
Subjt:  SQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR-------------------

Query:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE
                ++ QRT NIS ++EMPLN++LEVELFDVWGIDFMG F PS GN YILVAVDYVSKWVEA A   ND+  V  F+KK IF+RFGTPRAIIS+ 
Subjt:  --------EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDE

Query:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------
        GT F NR    LL+K+ V H+++T YHPQT+GQ E++NREIK ILEK VS++RKDW +RLDEALWAYRTAFKTPIGMSPY L                  
Subjt:  GTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL------------------

Query:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG
                  A+GE R LQLNEL E+R  AYENAK+YKE+TK+WHDK I ++    GQ VLLFNS+L+LFPGKLKSRWSGPF + EV  HGAV+L NE+ 
Subjt:  ---------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDG

Query:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE
           FKVN QR+K Y    ++   TSI L+E
Subjt:  TTSFKVNGQRVKPYHIEELEIEKTSIDLHE

A0A2K3NJZ5 Integrase catalytic domain-containing protein (Fragment)1.1e-12755.29Show/hide
Query:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------
        PW+AD  NY+V    P +  +QQ+KK  ++ KFY WDEP+LY+ G D +LRRCVPE E   +L  CH++ YGGHF G                       
Subjt:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------

Query:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD
             ++ QRT NIS +NEMP N +LEVE+FDVWGIDFMGPFP S    YILVAVDYVSKWVEA A   NDA  V  FLKK IFSRFG PRA+ISDEGT 
Subjt:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD

Query:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYA----------------------
        F+NR +  LL K+NV HR+AT YHPQT+GQ E++NR+IK ILEK V++SRKDW  +LD+ALWAYRTAFKTPIGMSP+                       
Subjt:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYA----------------------

Query:  -----LDASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS
             L  +GE+R LQL+EL E+R+ AYENAK++KE+TKKWHDK I  +    GQ VLLFNS+L+LFPGKLKSRWSGPF IK+V PHGAV+L + D   +
Subjt:  -----LDASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS

Query:  FKVNGQRVKPYHIEELEIEKTSIDL
        FKVNGQR+KPY  +E  + + SI L
Subjt:  FKVNGQRVKPYHIEELEIEKTSIDL

A0A4Y1RSJ3 Transposable element protein1.6e-12654.57Show/hide
Query:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------
        PWYAD VNYL C   P +++  QKKK     K Y WD+PYL++ G D ++RRCVPE E   IL  CH    GGH+G                        
Subjt:  PWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGR----------------------

Query:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD
             +  QRT NIS++N+MPLN++LEVELFDVWGIDFMGPFP S GN YILVAVDYVSKWVEAAA   NDA  V +FL+K IF+RFG PRAIISD GT 
Subjt:  -----EQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTD

Query:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL---------------------
        F NR   +LL K+ ++H+V+T YHPQT+GQ E++NRE+K ILEK VS SRKDW  +LD+ALWAYRTAFK PIGMSPY L                     
Subjt:  FINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKSILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL---------------------

Query:  ------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS
               ++GE RKLQLNEL E R+ +YENAK+YK+RTKKWHDK+I KK  YVGQ VLL+NS+L+LFPGKL+SRWSGPF +  V P+G V++ N+   T+
Subjt:  ------DASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSGPFIIKEVSPHGAVKLTNEDGTTS

Query:  FKVNGQRVKPYHIEELEIEKTSIDLHE
        FKVNG R+KPY       E+T+I L +
Subjt:  FKVNGQRVKPYHIEELEIEKTSIDLHE

SwissProt top hitse value%identityAlignment
P08361 Gag-Pol polyprotein1.1e-1835.37Show/hide
Query:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S W+EA    K  A  V+K L ++IF RFG P+ + +D G  F++++   +     +  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI

Query:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL
         NR IK  L K+ ++T  +DW+  L  AL+  R     P G++PY +
Subjt:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL

P26808 Gag-Pol polyprotein6.2e-1936.73Show/hide
Query:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    K  A  V+K L ++IF RFG P+ + +D G  F++++   +     V  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI

Query:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL
         NR IK  L K+ ++T  +DW+  L  AL+  R     P G++PY +
Subjt:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL

P26809 Gag-Pol polyprotein8.1e-1936.05Show/hide
Query:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI
        W IDF    P   G +Y+LV +D  S WVEA    K  A  V+K L ++IF RFG P+ + +D G  F++++   +     V  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI

Query:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL
         NR IK  L K+ ++T  +DW+  L  AL+  R     P G++PY +
Subjt:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL

P26810 Gag-Pol polyprotein6.2e-1936.73Show/hide
Query:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    K  A  V+K L ++IF RFG P+ + +D G  F++++   +     V  ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI

Query:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL
         NR IK  L K+ ++T  +DW+  L  AL+  R     P G++PY +
Subjt:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL

P31792 Pol polyprotein (Fragment)1.8e-1836.73Show/hide
Query:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI
        W IDF    P   G +Y+LV VD  S WVEA    +  A+ V+K + ++IF RFG P+ I SD G  F++++   L     ++ ++   Y PQ++GQ E 
Subjt:  WGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEI

Query:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL
         NR IK  L K+ + T  KDW   L  AL   R       G++PY +
Subjt:  TNREIKSILEKV-VSTSRKDWIERLDEALWAYRTAFKTPIGMSPYAL

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein4.4e-0450Show/hide
Query:  GHFGGREQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFM
        G     +  QR  N + +NEMP + +LEVE+FDVWGI FM
Subjt:  GHFGGREQLQRTVNISNQNEMPLNSMLEVELFDVWGIDFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACGCAGAGAGTCAGGAACCGTGGTATGCAGACATAGTAAATTACTTGGTCTGCAACCAATGGCCTGAAGAATTGAATGCTCAACAAAAGAAAAAGCTCCAATATGA
AAGTAAGTTTTACTGCTGGGATGAGCCATATCTATACAGACTTGGCTCGGACCACATACTGCGTCGATGTGTTCCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTC
ATGAAGCACCTTACGGAGGACACTTTGGGGGCAGAGAACAACTGCAAAGGACAGTCAACATTTCCAACCAAAATGAGATGCCTCTAAACTCAATGCTGGAAGTTGAGTTG
TTTGACGTATGGGGAATCGATTTCATGGGACCATTTCCTCCCTCTTGCGGTAATCAATATATCCTAGTAGCGGTTGACTACGTATCAAAATGGGTAGAAGCAGCAGCCTG
TGCGAAGAACGACGCAAACACAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGTACAGATTTTATAAATC
GCATAATCACTAATTTACTGACAAAATTTAATGTCTCGCACAGGGTAGCAACTACTTATCACCCACAGACAAACGGCCAAGCCGAAATAACAAACCGGGAGATCAAGTCC
ATACTTGAAAAAGTCGTGAGCACATCAAGGAAAGATTGGATAGAGAGATTAGATGAAGCTCTATGGGCATACAGAACGGCATTCAAAACACCTATAGGCATGTCACCCTA
TGCGCTGGACGCAAGTGGCGAAGCAAGAAAGCTTCAATTAAACGAACTCCTCGAATGGAGACATTCAGCTTACGAAAACGCAAAGCTGTATAAGGAAAGGACCAAGAAAT
GGCACGACAAAAATATTAGTAAGAAAACTCTATACGTCGGCCAGAAGGTCCTATTATTTAACTCAAAGTTGCGTTTATTTCCAGGTAAGCTGAAATCTCGCTGGTCTGGA
CCATTTATAATCAAGGAAGTGTCCCCGCATGGTGCTGTCAAGCTCACAAATGAAGACGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAACCCTACCACATTGA
AGAGCTCGAAATCGAAAAAACCTCCATTGACCTACACGAGTGTAATGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAACGCAGAGAGTCAGGAACCGTGGTATGCAGACATAGTAAATTACTTGGTCTGCAACCAATGGCCTGAAGAATTGAATGCTCAACAAAAGAAAAAGCTCCAATATGA
AAGTAAGTTTTACTGCTGGGATGAGCCATATCTATACAGACTTGGCTCGGACCACATACTGCGTCGATGTGTTCCAGAATATGAAACGCATAGCATTTTGAGAAGCTGTC
ATGAAGCACCTTACGGAGGACACTTTGGGGGCAGAGAACAACTGCAAAGGACAGTCAACATTTCCAACCAAAATGAGATGCCTCTAAACTCAATGCTGGAAGTTGAGTTG
TTTGACGTATGGGGAATCGATTTCATGGGACCATTTCCTCCCTCTTGCGGTAATCAATATATCCTAGTAGCGGTTGACTACGTATCAAAATGGGTAGAAGCAGCAGCCTG
TGCGAAGAACGACGCAAACACAGTGTCCAAGTTCTTAAAGAAACAAATCTTCTCTCGATTTGGGACACCAAGGGCGATAATTAGTGATGAAGGTACAGATTTTATAAATC
GCATAATCACTAATTTACTGACAAAATTTAATGTCTCGCACAGGGTAGCAACTACTTATCACCCACAGACAAACGGCCAAGCCGAAATAACAAACCGGGAGATCAAGTCC
ATACTTGAAAAAGTCGTGAGCACATCAAGGAAAGATTGGATAGAGAGATTAGATGAAGCTCTATGGGCATACAGAACGGCATTCAAAACACCTATAGGCATGTCACCCTA
TGCGCTGGACGCAAGTGGCGAAGCAAGAAAGCTTCAATTAAACGAACTCCTCGAATGGAGACATTCAGCTTACGAAAACGCAAAGCTGTATAAGGAAAGGACCAAGAAAT
GGCACGACAAAAATATTAGTAAGAAAACTCTATACGTCGGCCAGAAGGTCCTATTATTTAACTCAAAGTTGCGTTTATTTCCAGGTAAGCTGAAATCTCGCTGGTCTGGA
CCATTTATAATCAAGGAAGTGTCCCCGCATGGTGCTGTCAAGCTCACAAATGAAGACGGAACCACATCCTTCAAGGTCAATGGACAAAGGGTAAAACCCTACCACATTGA
AGAGCTCGAAATCGAAAAAACCTCCATTGACCTACACGAGTGTAATGACTGA
Protein sequenceShow/hide protein sequence
MNAESQEPWYADIVNYLVCNQWPEELNAQQKKKLQYESKFYCWDEPYLYRLGSDHILRRCVPEYETHSILRSCHEAPYGGHFGGREQLQRTVNISNQNEMPLNSMLEVEL
FDVWGIDFMGPFPPSCGNQYILVAVDYVSKWVEAAACAKNDANTVSKFLKKQIFSRFGTPRAIISDEGTDFINRIITNLLTKFNVSHRVATTYHPQTNGQAEITNREIKS
ILEKVVSTSRKDWIERLDEALWAYRTAFKTPIGMSPYALDASGEARKLQLNELLEWRHSAYENAKLYKERTKKWHDKNISKKTLYVGQKVLLFNSKLRLFPGKLKSRWSG
PFIIKEVSPHGAVKLTNEDGTTSFKVNGQRVKPYHIEELEIEKTSIDLHECND