; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc09G12310 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc09G12310
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTransposon Ty3-I Gag-Pol polyprotein
Genome locationClcChr09:11065761..11068704
RNA-Seq ExpressionClc09G12310
SyntenyClc09G12310
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66228.1 hypothetical protein VITISV_012977 [Vitis vinifera]5.3e-4329.24Show/hide
Query:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQAL
        K++KEL GFLGLT Y R+FV  YG+I+ PLTQ LKK+ F+W+ + + A ++LK+ M  IL+L LPNF + F IE+DAS  G G        P+AYF+Q L
Subjt:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQAL

Query:  LSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVV----------VEDLVLSEIQQ--------------------------------------------
         +  R K +Y+ ELMAI  AIQKWR YLL RHF+V          +E  V++E+ Q                                            
Subjt:  LSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVV----------VEDLVLSEIQQ--------------------------------------------

Query:  ---------------------------------------MLRAGQ----------SALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMKAKSIVSDRDK
                                               +L  G+            L  E H S +GGH  A        +DV        SIVSDRDK
Subjt:  ---------------------------------------MLRAGQ----------SALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMKAKSIVSDRDK

Query:  IFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAEVTAR---------------------------------------------------------------
        +F SLFW +LF  LGT LC S AYHPQTDG+ EV  R                                                               
Subjt:  IFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAEVTAR---------------------------------------------------------------

Query:  ------------------------------VGKVAYRFALPADSGIHSVFHVSLRSTVFGVCD---------------------------SP-TKEGIVE
                                      +  V Y+  LP+   IH VFHVS      G  D                           SP   +  +E
Subjt:  ------------------------------VGKVAYRFALPADSGIHSVFHVSLRSTVFGVCD---------------------------SP-TKEGIVE

Query:  VLIRWENGLP-IDATWVVVAVIKEQYPDFYLEDKVAL
        VLI+W+ GLP  +A+W +V  IK+ +PDF+LEDKV+L
Subjt:  VLIRWENGLP-IDATWVVVAVIKEQYPDFYLEDKVAL

KAA0031986.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]1.6e-4730.04Show/hide
Query:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL
        N++E+ GFLGLT YYR+FV NYGS+A PLTQLLKK  + W     E  ++LK  M+ + IL LP+F   FEIE+DASG G G        P+AYFSQ L 
Subjt:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDL------------------------------------------VLSEIQQMLRAGQ-------
           R K+VYE ELMA++ A+Q+WR YLL + FVV  D                                            LS +       Q       
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDL------------------------------------------VLSEIQQMLRAGQ-------

Query:  --SALRAEFH-----------------------CSPIGGHQGALKTYQRLARDVYWTDMKA---------------------------------------
          S LR+E                          +  GGH G L+TY+RL  ++YW  MKA                                       
Subjt:  --SALRAEFH-----------------------CSPIGGHQGALKTYQRLARDVYWTDMKA---------------------------------------

Query:  --------------------------KSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAE--------------------------------
                                  KSIVSDR+K+F S FW+E+FW   T+L RS AYHPQ++G+ E                                
Subjt:  --------------------------KSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAE--------------------------------

Query:  ------------------------------------VTARVGKVAYRFALPADSGIHSVFHVS--------------------------LRSTVFGVCD-
                                            V AR+G VAY+  LP  + IH VFHVS                          + + +FG    
Subjt:  ------------------------------------VTARVGKVAYRFALPADSGIHSVFHVS--------------------------LRSTVFGVCD-

Query:  SPTKEGIVEVLIRWENGLPI-DATWVVVAVIKEQYPDFYLEDKVAL
         PTKE   EVLI W+ GLP  +ATW   AV  +Q+P F+LEDKV+L
Subjt:  SPTKEGIVEVLIRWENGLPI-DATWVVVAVIKEQYPDFYLEDKVAL

KAF8393178.1 hypothetical protein HHK36_021419 [Tetracentron sinense]3.8e-4131.66Show/hide
Query:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL
        N++ L GFLGLT YYRKFVA Y  IALPLT+ LKK+KF WN E + + + LK  M  +L+L +P+F ++F IE+DASG G G        P+A+FSQAL 
Subjt:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED-----------LVLSEIQQML------------RAGQ--------------------------
           R K++YE ELMAI+FA+ KWR YLL R F+V  D           +V +E Q+ +            R+G                           
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED-----------LVLSEIQQML------------RAGQ--------------------------

Query:  --------------------SALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMK---------------------------------------------
                            S L  EFH SPIGGH G  KTYQR+A +++W  M+                                             
Subjt:  --------------------SALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMK---------------------------------------------

Query:  ---------------------------------------------------AKSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAEVTAR
                                                            +S++SDRD++F S FW ELF   GT L RS AYHPQ+DG+ EV  R
Subjt:  ---------------------------------------------------AKSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAEVTAR

KYP46337.1 Retrovirus-related Pol polyprotein from transposon 297 family [Cajanus cajan]1.4e-4034.26Show/hide
Query:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTG-----KCCPLAYFSQALL
        N++ + GFLGLT YYRKF+ +YG +A  LT L KK+ F WN +   A + LK  +  + IL LPNF++ FE+E DASG+G G     +  P+AYFS+AL 
Subjt:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTG-----KCCPLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED-------LVLSEIQQ--------------MLRAGQSALR----------AEFHCSPIGGHQGA
          +  K+ YE ELMA+  AIQ WR YLL R F V  D        V+ +++               +L  G+  +            EFH +P+GGH G 
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED-------LVLSEIQQ--------------MLRAGQSALR----------AEFHCSPIGGHQGA

Query:  LKTYQRLARDVYWTDMK------------------------------------------------------------AKSIVSDRDKIFTSLFWEELFWP
         +TY+R+A ++YW  MK                                                              S++SDRD +F SLFW+E F  
Subjt:  LKTYQRLARDVYWTDMK------------------------------------------------------------AKSIVSDRDKIFTSLFWEELFWP

Query:  LGTQLCRSMAYHPQTDGEAEVTAR
         GT L  S AYHPQ++G+ EV  R
Subjt:  LGTQLCRSMAYHPQTDGEAEVTAR

OAO89457.1 hypothetical protein AXX17_ATUG00140 [Arabidopsis thaliana]1.3e-4136.69Show/hide
Query:  MTKNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQ
        + K++ EL GFLG T YYR+FV NYG IA PLT  L+K  F WN     A Q LK  +  +L+L LP+F++ F +++DASGVG G        P+AY SQ
Subjt:  MTKNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQ

Query:  ALLSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDLVLSEIQQMLRAGQSALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMK-----------
        A  S  R K+VYE EL+AI+ A+ KW+ YL  R FV+  D             Q +LR     + +GGH+GALKT++RL  +VYW  M+           
Subjt:  ALLSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDLVLSEIQQMLRAGQSALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMK-----------

Query:  ---------------------AKSIVSD----------------------------------------RDKIFTSLFWEELFWPLGTQLCRSMAYHPQTD
                             ++ I SD                                        RDK+F S FW ELF   GT L +S AYHPQTD
Subjt:  ---------------------AKSIVSD----------------------------------------RDKIFTSLFWEELFWPLGTQLCRSMAYHPQTD

Query:  GEAEVTAR
        G+ EV  R
Subjt:  GEAEVTAR

TrEMBL top hitse value%identityAlignment
A0A151RUW5 Retrovirus-related Pol polyprotein from transposon 297 family7.0e-4134.26Show/hide
Query:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTG-----KCCPLAYFSQALL
        N++ + GFLGLT YYRKF+ +YG +A  LT L KK+ F WN +   A + LK  +  + IL LPNF++ FE+E DASG+G G     +  P+AYFS+AL 
Subjt:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTG-----KCCPLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED-------LVLSEIQQ--------------MLRAGQSALR----------AEFHCSPIGGHQGA
          +  K+ YE ELMA+  AIQ WR YLL R F V  D        V+ +++               +L  G+  +            EFH +P+GGH G 
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED-------LVLSEIQQ--------------MLRAGQSALR----------AEFHCSPIGGHQGA

Query:  LKTYQRLARDVYWTDMK------------------------------------------------------------AKSIVSDRDKIFTSLFWEELFWP
         +TY+R+A ++YW  MK                                                              S++SDRD +F SLFW+E F  
Subjt:  LKTYQRLARDVYWTDMK------------------------------------------------------------AKSIVSDRDKIFTSLFWEELFWP

Query:  LGTQLCRSMAYHPQTDGEAEVTAR
         GT L  S AYHPQ++G+ EV  R
Subjt:  LGTQLCRSMAYHPQTDGEAEVTAR

A0A178U8F9 Uncharacterized protein6.3e-4236.69Show/hide
Query:  MTKNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQ
        + K++ EL GFLG T YYR+FV NYG IA PLT  L+K  F WN     A Q LK  +  +L+L LP+F++ F +++DASGVG G        P+AY SQ
Subjt:  MTKNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQ

Query:  ALLSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDLVLSEIQQMLRAGQSALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMK-----------
        A  S  R K+VYE EL+AI+ A+ KW+ YL  R FV+  D             Q +LR     + +GGH+GALKT++RL  +VYW  M+           
Subjt:  ALLSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDLVLSEIQQMLRAGQSALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMK-----------

Query:  ---------------------AKSIVSD----------------------------------------RDKIFTSLFWEELFWPLGTQLCRSMAYHPQTD
                             ++ I SD                                        RDK+F S FW ELF   GT L +S AYHPQTD
Subjt:  ---------------------AKSIVSD----------------------------------------RDKIFTSLFWEELFWPLGTQLCRSMAYHPQTD

Query:  GEAEVTAR
        G+ EV  R
Subjt:  GEAEVTAR

A0A5A7SNK3 Putative retroelement pol polyprotein2.0e-4026.79Show/hide
Query:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQAL
        KN++EL GFLGLT YYR+FVANYG+IA PLT+L KK  F W+ E   A + LK  MV + +L LP+F++ FEIE+DASG G G        P+AYFSQ L
Subjt:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQAL

Query:  LSTHRYKAVYEWELMAILFAIQKWRSYLLERHFV------------------------------------------------VVEDLVLSEIQQML----
          T R K+VYE ELMAI+ A+ KWR YLL   FV                                                V ED  L +I + L    
Subjt:  LSTHRYKAVYEWELMAILFAIQKWRSYLLERHFV------------------------------------------------VVEDLVLSEIQQML----

Query:  --------RAGQSALRAE----------------FHCSPIGGHQGALKTYQRLARDVYWTDMK-------------------------------------
                R G+   +                  FH S IGGH G L+TY+R+A ++YW  MK                                     
Subjt:  --------RAGQSALRAE----------------FHCSPIGGHQGALKTYQRLARDVYWTDMK-------------------------------------

Query:  -----------------------------------------------------------AKSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGE
                                                                    +SIV DRD++F S FW+ELF   GTQL RS  YHPQTDG+
Subjt:  -----------------------------------------------------------AKSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGE

Query:  AEVT------------------------------------------------------------------------------------------------
         EV                                                                                                 
Subjt:  AEVT------------------------------------------------------------------------------------------------

Query:  ---------------------ARVGKVAYRFALPADSGIHSVFHVSLRSTVFG-----------------VCDSPTK---------EGIVEVLIRWENGL
                              RVG+VAY   LP  + IHSVFHVS    V G                 +   P K         E   E L+ W++  
Subjt:  ---------------------ARVGKVAYRFALPADSGIHSVFHVSLRSTVFG-----------------VCDSPTK---------EGIVEVLIRWENGL

Query:  PIDATWVVVAVIKEQYPDFYLEDKVAL
          +ATW   A +  Q+PDF+LEDKVAL
Subjt:  PIDATWVVVAVIKEQYPDFYLEDKVAL

A0A5A7SPQ8 Transposon Ty3-I Gag-Pol polyprotein7.7e-4830.04Show/hide
Query:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL
        N++E+ GFLGLT YYR+FV NYGS+A PLTQLLKK  + W     E  ++LK  M+ + IL LP+F   FEIE+DASG G G        P+AYFSQ L 
Subjt:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDL------------------------------------------VLSEIQQMLRAGQ-------
           R K+VYE ELMA++ A+Q+WR YLL + FVV  D                                            LS +       Q       
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVEDL------------------------------------------VLSEIQQMLRAGQ-------

Query:  --SALRAEFH-----------------------CSPIGGHQGALKTYQRLARDVYWTDMKA---------------------------------------
          S LR+E                          +  GGH G L+TY+RL  ++YW  MKA                                       
Subjt:  --SALRAEFH-----------------------CSPIGGHQGALKTYQRLARDVYWTDMKA---------------------------------------

Query:  --------------------------KSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAE--------------------------------
                                  KSIVSDR+K+F S FW+E+FW   T+L RS AYHPQ++G+ E                                
Subjt:  --------------------------KSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAE--------------------------------

Query:  ------------------------------------VTARVGKVAYRFALPADSGIHSVFHVS--------------------------LRSTVFGVCD-
                                            V AR+G VAY+  LP  + IH VFHVS                          + + +FG    
Subjt:  ------------------------------------VTARVGKVAYRFALPADSGIHSVFHVS--------------------------LRSTVFGVCD-

Query:  SPTKEGIVEVLIRWENGLPI-DATWVVVAVIKEQYPDFYLEDKVAL
         PTKE   EVLI W+ GLP  +ATW   AV  +Q+P F+LEDKV+L
Subjt:  SPTKEGIVEVLIRWENGLPI-DATWVVVAVIKEQYPDFYLEDKVAL

A5C0Y9 Uncharacterized protein2.6e-4329.24Show/hide
Query:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQAL
        K++KEL GFLGLT Y R+FV  YG+I+ PLTQ LKK+ F+W+ + + A ++LK+ M  IL+L LPNF + F IE+DAS  G G        P+AYF+Q L
Subjt:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQAL

Query:  LSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVV----------VEDLVLSEIQQ--------------------------------------------
         +  R K +Y+ ELMAI  AIQKWR YLL RHF+V          +E  V++E+ Q                                            
Subjt:  LSTHRYKAVYEWELMAILFAIQKWRSYLLERHFVV----------VEDLVLSEIQQ--------------------------------------------

Query:  ---------------------------------------MLRAGQ----------SALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMKAKSIVSDRDK
                                               +L  G+            L  E H S +GGH  A        +DV        SIVSDRDK
Subjt:  ---------------------------------------MLRAGQ----------SALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMKAKSIVSDRDK

Query:  IFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAEVTAR---------------------------------------------------------------
        +F SLFW +LF  LGT LC S AYHPQTDG+ EV  R                                                               
Subjt:  IFTSLFWEELFWPLGTQLCRSMAYHPQTDGEAEVTAR---------------------------------------------------------------

Query:  ------------------------------VGKVAYRFALPADSGIHSVFHVSLRSTVFGVCD---------------------------SP-TKEGIVE
                                      +  V Y+  LP+   IH VFHVS      G  D                           SP   +  +E
Subjt:  ------------------------------VGKVAYRFALPADSGIHSVFHVSLRSTVFGVCD---------------------------SP-TKEGIVE

Query:  VLIRWENGLP-IDATWVVVAVIKEQYPDFYLEDKVAL
        VLI+W+ GLP  +A+W +V  IK+ +PDF+LEDKV+L
Subjt:  VLIRWENGLP-IDATWVVVAVIKEQYPDFYLEDKVAL

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.7e-1840.88Show/hide
Query:  KELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEK--FSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL
        KE+  FLGLT YYRKF+ N+  IA P+T+ LKK     + N E D A ++LK ++ E  IL +P+F + F + +DAS V  G        PL+Y S+ L 
Subjt:  KELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEK--FSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED
              +  E EL+AI++A + +R YLL RHF +  D
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED

P10394 Retrovirus-related Pol polyprotein from transposon 4121.5e-1133.33Show/hide
Query:  FLGLTEYYRKFVANYGSIALPLTQLLKKE-KFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKC---------CPLAYFSQALLST
        F+    YYR+F+ N+   +  +T+L KK   F W  E  +A   LKS ++   +L  P+F + F I +DAS    G            P+AY S+A    
Subjt:  FLGLTEYYRKFVANYGSIALPLTQLLKKE-KFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKC---------CPLAYFSQALLST

Query:  HRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED
           K+  E EL AI +AI  +R Y+  +HF V  D
Subjt:  HRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED

P20825 Retrovirus-related Pol polyprotein from transposon 2972.3e-1737.96Show/hide
Query:  KELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEK--FSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL
        KE+  FLGLT YYRKF+ NY  IA P+T  LKK     +  +E  EA ++LK++++   IL LP+F++ F + +DAS +  G        P+++ S+ L 
Subjt:  KELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEK--FSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCC-----PLAYFSQALL

Query:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED
              +  E EL+AI++A + +R YLL R F++  D
Subjt:  STHRYKAVYEWELMAILFAIQKWRSYLLERHFVVVED

P92523 Uncharacterized mitochondrial protein AtMg008602.5e-1149.3Show/hide
Query:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETF
        KN  EL GFLGLT YYR+FV NYG I  PLT+LLKK    W      A + LK  +  + +L LP+ K  F
Subjt:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETF

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.8e-1232.87Show/hide
Query:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLK--------KEKFSWNVEVDEACQR----LKSMMVEILILGLPNFKETFEIESDAS--GVGT-----
        ++KEL  FLG+T YYRKF+ +Y  +A PLT L +         +     + +DE   +    LKS++    IL  P F + F + +DAS   +G      
Subjt:  NIKELCGFLGLTEYYRKFVANYGSIALPLTQLLK--------KEKFSWNVEVDEACQR----LKSMMVEILILGLPNFKETFEIESDAS--GVGT-----

Query:  --GKCCPLAYFSQALLSTHRYKAVYEWELMAILFAIQKWRSYL
          G+  P+AY S++L  T    A  E E++AI++++   R+YL
Subjt:  --GKCCPLAYFSQALLSTHRYKAVYEWELMAILFAIQKWRSYL

Arabidopsis top hitse value%identityAlignment
AT5G44200.1 CAP-binding protein 206.9e-0992.86Show/hide
Query:  WG-QDGRQWGRGRSGGQVRDEYRTDYDP
        WG Q+GRQWGRGRSGGQVRDEYRTDYDP
Subjt:  WG-QDGRQWGRGRSGGQVRDEYRTDYDP

AT5G44200.2 CAP-binding protein 206.9e-0992.86Show/hide
Query:  WG-QDGRQWGRGRSGGQVRDEYRTDYDP
        WG Q+GRQWGRGRSGGQVRDEYRTDYDP
Subjt:  WG-QDGRQWGRGRSGGQVRDEYRTDYDP

ATMG00860.1 DNA/RNA polymerases superfamily protein1.8e-1249.3Show/hide
Query:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETF
        KN  EL GFLGLT YYR+FV NYG I  PLT+LLKK    W      A + LK  +  + +L LP+ K  F
Subjt:  KNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCAAAAATATTAAGGAATTATGCGGGTTCCTCGGGTTGACTGAGTACTATCGAAAATTTGTTGCAAACTATGGGTCGATAGCACTGCCATTGACACAATTGCTAAA
AAAGGAAAAATTTTCATGGAATGTGGAAGTGGATGAAGCTTGTCAAAGGTTAAAATCAATGATGGTGGAGATACTGATATTGGGATTGCCCAATTTTAAAGAGACATTTG
AGATTGAAAGCGATGCTTCAGGTGTGGGAACAGGGAAATGTTGTCCTTTGGCCTATTTCAGTCAAGCTTTACTTTCAACACACCGTTACAAAGCAGTGTATGAATGGGAA
TTAATGGCGATATTGTTTGCCATTCAAAAGTGGCGATCTTACTTGTTGGAGAGGCATTTTGTGGTAGTAGAAGATCTGGTATTGAGTGAGATTCAGCAAATGCTTAGAGC
TGGACAGTCAGCCCTAAGGGCGGAATTCCATTGCAGCCCTATTGGAGGACACCAAGGAGCACTCAAAACGTACCAAAGGTTAGCTAGAGATGTGTATTGGACCGATATGA
AAGCCAAGAGTATTGTGTCGGATAGGGATAAAATATTCACTAGCCTTTTTTGGGAGGAATTGTTCTGGCCATTGGGGACTCAACTATGCCGAAGCATGGCATATCACCCA
CAAACAGATGGCGAGGCGGAGGTAACTGCACGAGTGGGGAAAGTGGCATATAGATTTGCCTTACCAGCAGATTCGGGAATTCATTCGGTGTTCCATGTGTCGTTACGATC
AACGGTGTTTGGTGTGTGCGACTCGCCTACGAAGGAGGGGATCGTAGAAGTTCTAATCCGGTGGGAAAATGGGCTGCCTATTGATGCTACTTGGGTGGTTGTTGCAGTCA
TTAAAGAACAATATCCAGATTTTTACCTTGAGGACAAGGTGGCTCTTTGGGGGCAGGATGGCAGGCAATGGGGCCGTGGTCGAAGTGGTGGACAGGTGCGTGATGAATAT
CGAACAGACTATGATCCTGATATCCTTTTTTTCTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGACCAAAAATATTAAGGAATTATGCGGGTTCCTCGGGTTGACTGAGTACTATCGAAAATTTGTTGCAAACTATGGGTCGATAGCACTGCCATTGACACAATTGCTAAA
AAAGGAAAAATTTTCATGGAATGTGGAAGTGGATGAAGCTTGTCAAAGGTTAAAATCAATGATGGTGGAGATACTGATATTGGGATTGCCCAATTTTAAAGAGACATTTG
AGATTGAAAGCGATGCTTCAGGTGTGGGAACAGGGAAATGTTGTCCTTTGGCCTATTTCAGTCAAGCTTTACTTTCAACACACCGTTACAAAGCAGTGTATGAATGGGAA
TTAATGGCGATATTGTTTGCCATTCAAAAGTGGCGATCTTACTTGTTGGAGAGGCATTTTGTGGTAGTAGAAGATCTGGTATTGAGTGAGATTCAGCAAATGCTTAGAGC
TGGACAGTCAGCCCTAAGGGCGGAATTCCATTGCAGCCCTATTGGAGGACACCAAGGAGCACTCAAAACGTACCAAAGGTTAGCTAGAGATGTGTATTGGACCGATATGA
AAGCCAAGAGTATTGTGTCGGATAGGGATAAAATATTCACTAGCCTTTTTTGGGAGGAATTGTTCTGGCCATTGGGGACTCAACTATGCCGAAGCATGGCATATCACCCA
CAAACAGATGGCGAGGCGGAGGTAACTGCACGAGTGGGGAAAGTGGCATATAGATTTGCCTTACCAGCAGATTCGGGAATTCATTCGGTGTTCCATGTGTCGTTACGATC
AACGGTGTTTGGTGTGTGCGACTCGCCTACGAAGGAGGGGATCGTAGAAGTTCTAATCCGGTGGGAAAATGGGCTGCCTATTGATGCTACTTGGGTGGTTGTTGCAGTCA
TTAAAGAACAATATCCAGATTTTTACCTTGAGGACAAGGTGGCTCTTTGGGGGCAGGATGGCAGGCAATGGGGCCGTGGTCGAAGTGGTGGACAGGTGCGTGATGAATAT
CGAACAGACTATGATCCTGATATCCTTTTTTTCTCATAA
Protein sequenceShow/hide protein sequence
MTKNIKELCGFLGLTEYYRKFVANYGSIALPLTQLLKKEKFSWNVEVDEACQRLKSMMVEILILGLPNFKETFEIESDASGVGTGKCCPLAYFSQALLSTHRYKAVYEWE
LMAILFAIQKWRSYLLERHFVVVEDLVLSEIQQMLRAGQSALRAEFHCSPIGGHQGALKTYQRLARDVYWTDMKAKSIVSDRDKIFTSLFWEELFWPLGTQLCRSMAYHP
QTDGEAEVTARVGKVAYRFALPADSGIHSVFHVSLRSTVFGVCDSPTKEGIVEVLIRWENGLPIDATWVVVAVIKEQYPDFYLEDKVALWGQDGRQWGRGRSGGQVRDEY
RTDYDPDILFFS