CuGenDBv2

CmoCh12G006090 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh12G006090
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Genome location: Cmo_Chr12:3805555..3806613
RNA-Seq Expression: CmoCh12G006090
Synteny: CmoCh12G006090
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment):
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] | e value 6.9e-163 | identity 80.4%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVF+EYLDQFV+VYLD+IVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I FLGHV+  GQISMD+DK+KAIQEW+VPTSV ELRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRF+EGFSRRA P+TELLKK   W WS + Q AFE+LK  M +GPVLGL DVTKPFEVETDASD+ALGGVL+Q+ HPI YESRKLN+AE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLR WRQYLLGS FVVKTDNSA CHFF+QPKLT+KQARWQE LAEFDFKF+HK GKSNQAADALSRKGEHA LCMLAHIH+SK DGS+RD+I
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
         E+L   PSA+ VVELAK  KTRQFWVEGDLL T+GN LYVPRTG LRKKL+
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

XP_022975176.1 uncharacterized protein LOC111474215 [Cucurbita maxima] | e value 2.9e-161 | identity 84.09%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVFYEYLDQFVIVYLD+IVVYSTTLEEHKVHLKL                                   ISMDSDKIKAIQEWKVPTSVS+LRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFSRRAAPL ELLKKDHPW WSNDCQMAFE+LKTTM  GPVLGLVDVTKPFE+ETDASDFALGGVLIQEGHPIA+ESRKLNDAE+RY VSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        K+ML VVHCLRVWRQYLLGSQFVVKTDNS  CHFFDQPKLTAKQARWQESLA+FDFKFKHK GKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
        KEHLHKD SAKAVVELAKA KTRQFWVEGDLLITKGNRL V RTGELRKKLI
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima] | e value 3.8e-169 | identity 89.61%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVFYEYLDQFVIVYLD+IVVYSTTLEEHKVHLKLVFDKLRQNQL                      CGQISMDSDKIKAIQEWKVPTSVS+LRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFSRRAAPLTELLKKDH WSWS+DCQMAFE+LKTTMTRGPVLGLVDVTKPFE+ETDASDFALGGVLIQEGHPIA+ESRKLNDAE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLRVWRQYLLGSQFVVKTDNSA CHFFDQPKLTAKQARWQ+SLAEFDFKF+HK GKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGN
        KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLL+TKGN
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGN

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo] | e value 1.2e-199 | identity 98.58%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVFYEYLDQFVIVYLD+IVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKF+HK GKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
        KEHLHKDPSAK VVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo] | e value 1.6e-199 | identity 98.58%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVFYEYLDQFVIVYLD+IVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI+FLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKF+HK GKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
        KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

TrEMBL top hits (e value, % identity, alignment):
A0A5D3BRZ6 Reverse transcriptase | e value 9.7e-155 | identity 75.28%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVF+EYLD+FV+VYLD+IVVYSTT+EEH+ HL+ VF KL++NQLYVK+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD+ALGGVL+Q GHPIAYESRKLN AE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLR WRQYLLGS FVVKTDNSATCHFF QPKLT+KQARWQE LAEFDF+F+HK G SNQAADALSRK EHAA+C+LAH+  S+I GS+RD +
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
        +E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LRKKL+
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

A0A5D3C4R1 Reverse transcriptase | e value 9.7e-155 | identity 75.28%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVF+EYLD+FV+VYLD+IVVYSTT+EEH+ HL+ VF KL++NQLYVK+EKC+FAQ  INFLGHV+ CG+I M+  KI AI++W +P SVSELRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP+LG+ DVTKPFEVETDASD+ALGGVL+Q GHPIAYESRKLN AE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLR WRQYLLGS FVVKTDNSATCHFF QPKLT+KQARWQE LAEFDF+F+HK G SNQAADALSRK EHAA+C+LAH+  S+I GS+RD +
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
        +E L KD +A+ V+ LAKAGKTRQFWVE DLL+TKGNRLYVPR G LRKKL+
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

A0A6J1D906 Reverse transcriptase | e value 3.3e-163 | identity 80.4%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVF+EYLDQFV+VYLD+IVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I FLGHV+  GQISMD+DK+KAIQEW+VPTSV ELRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRF+EGFSRRA P+TELLKK   W WS + Q AFE+LK  M +GPVLGL DVTKPFEVETDASD+ALGGVL+Q+ HPI YESRKLN+AE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLR WRQYLLGS FVVKTDNSA CHFF+QPKLT+KQARWQE LAEFDFKF+HK GKSNQAADALSRKGEHA LCMLAHIH+SK DGS+RD+I
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
         E+L   PSA+ VVELAK  KTRQFWVEGDLL T+GN LYVPRTG LRKKL+
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

A0A6J1IDF7 uncharacterized protein LOC111474215 | e value 1.4e-161 | identity 84.09%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVFYEYLDQFVIVYLD+IVVYSTTLEEHKVHLKL                                   ISMDSDKIKAIQEWKVPTSVS+LRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFSRRAAPL ELLKKDHPW WSNDCQMAFE+LKTTM  GPVLGLVDVTKPFE+ETDASDFALGGVLIQEGHPIA+ESRKLNDAE+RY VSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        K+ML VVHCLRVWRQYLLGSQFVVKTDNS  CHFFDQPKLTAKQARWQESLA+FDFKFKHK GKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI
        KEHLHKD SAKAVVELAKA KTRQFWVEGDLLITKGNRL V RTGELRKKLI
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGNRLYVPRTGELRKKLI

A0A6J1IEF9 uncharacterized protein LOC111474945 | e value 1.8e-169 | identity 89.61%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MNQVFYEYLDQFVIVYLD+IVVYSTTLEEHKVHLKLVFDKLRQNQL                      CGQISMDSDKIKAIQEWKVPTSVS+LRSFLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE
        ANYYRRFVEGFSRRAAPLTELLKKDH WSWS+DCQMAFE+LKTTMTRGPVLGLVDVTKPFE+ETDASDFALGGVLIQEGHPIA+ESRKLNDAE+RYTVSE
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSE

Query:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
        KEMLAVVHCLRVWRQYLLGSQFVVKTDNSA CHFFDQPKLTAKQARWQ+SLAEFDFKF+HK GKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII
Subjt:  KEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDII

Query:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGN
        KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLL+TKGN
Subjt:  KEHLHKDPSAKAVVELAKAGKTRQFWVEGDLLITKGN

SwissProt top hits (e value, % identity, alignment):
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e value 3.1e-57 | identity 42.18%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MN +    L++  +VYLD+I+V+ST+L+EH   L LVF+KL +  L ++ +KC F +    FLGHV+    I  + +KI+AIQ++ +PT   E+++FLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVS
          YYR+F+  F+  A P+T+ LKK+     +N +   AF+ LK  ++  P+L + D TK F + TDASD ALG VL Q+GHP++Y SR LN+ E  Y+  
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVS

Query:  EKEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR
        EKE+LA+V   + +R YLLG  F + +D+      +      +K  RW+  L+EFDF  K+  GK N  ADALSR
Subjt:  EKEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR

P0CT42 Transposon Tf2-7 polyprotein | e value 2.1e-45 | identity 36.04%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        +N +  E  +  V+ Y+DNI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +     +   + I  + +WK P +  ELR FLG 
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEG-----HPIAYESRKLNDAEQR
         NY R+F+   S+   PL  LLKKD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+G VL Q+      +P+ Y S K++ A+  
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEG-----HPIAYESRKLNDAEQR

Query:  YTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDNSATCHFF--DQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR
        Y+VS+KEMLA++  L+ WR YL  +   F + TD+         +      + ARWQ  L +F+F+  ++ G +N  ADALSR
Subjt:  YTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDNSATCHFF--DQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR

P0CT43 Transposon Tf2-8 polyprotein | e value 2.1e-45 | identity 36.04%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        +N +  E  +  V+ Y+DNI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +     +   + I  + +WK P +  ELR FLG 
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEG-----HPIAYESRKLNDAEQR
         NY R+F+   S+   PL  LLKKD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+G VL Q+      +P+ Y S K++ A+  
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEG-----HPIAYESRKLNDAEQR

Query:  YTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDNSATCHFF--DQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR
        Y+VS+KEMLA++  L+ WR YL  +   F + TD+         +      + ARWQ  L +F+F+  ++ G +N  ADALSR
Subjt:  YTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDNSATCHFF--DQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e value 4.2e-54 | identity 41.82%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        MN +    L++  +VYLD+I+++ST+L EH   ++LVF KL    L ++ +KC F +   NFLGH+V    I  +  K+KAI  + +PT   E+R+FLGL
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVS
          YYR+F+  ++  A P+T  LKK         +   AFE LK  + R P+L L D  K F + TDAS+ ALG VL Q GHPI++ SR LND E  Y+  
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVS

Query:  EKEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR
        EKE+LA+V   + +R YLLG QF++ +D+       +  +  AK  RW+  L+E+ FK  +  GK N  ADALSR
Subjt:  EKEMLAVVHCLRVWRQYLLGSQFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR

Q9UR07 Transposon Tf2-11 polyprotein | e value 2.1e-45 | identity 36.04%
Query:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL
        +N +  E  +  V+ Y+DNI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+G+ +     +   + I  + +WK P +  ELR FLG 
Subjt:  MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGL

Query:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEG-----HPIAYESRKLNDAEQR
         NY R+F+   S+   PL  LLKKD  W W+     A EN+K  +   PVL   D +K   +ETDASD A+G VL Q+      +P+ Y S K++ A+  
Subjt:  ANYYRRFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEG-----HPIAYESRKLNDAEQR

Query:  YTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDNSATCHFF--DQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR
        Y+VS+KEMLA++  L+ WR YL  +   F + TD+         +      + ARWQ  L +F+F+  ++ G +N  ADALSR
Subjt:  YTVSEKEMLAVVHCLRVWRQYLLGS--QFVVKTDNSATCHFF--DQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSR

Arabidopsis top hits (e value, % identity, alignment):
ATMG00860.1 DNA/RNA polymerases superfamily protein | e value 6.7e-23 | identity 42.31%
Query:  HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHPWSW
        HL +V     Q+Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P + +ELR FLGL  YYRRFV+ + +   PLTELLKK +   W
Subjt:  HLKLVFDKLRQNQLYVKKEKCAFAQTCINFLG--HVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEGFSRRAAPLTELLKKDHPWSW

Query:  SNDCQMAFENLKTTMTRGPVLGLVDVTKPF
        +    +AF+ LK  +T  PVL L D+  PF
Subjt:  SNDCQMAFENLKTTMTRGPVLGLVDVTKPF


Sequences
CDS sequence
ATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACAACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCATTTGAAGTTAGT
GTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTA
TGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGATTCGTCGAAGGG
TTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAG
GGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTCGGTGGAGTCCTTATTCAAGAAGGCCACCCCATCGCTT
ACGAAAGTCGAAAGCTCAACGATGCCGAACAAAGATACACGGTCTCCGAGAAAGAAATGTTGGCTGTAGTCCATTGCCTTCGAGTCTGGAGACAGTATCTCTTAGGATCA
CAGTTCGTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAA
GTTCAAACACAAAACAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATG
GATCGATGCGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGAC
CTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACAGGGGAACTGAGGAAGAAGCTCATTTAG
mRNA sequence
ATGAACCAGGTTTTCTACGAATACCTGGATCAGTTCGTCATAGTATACCTCGACAACATTGTTGTTTACAGCACAACCCTCGAGGAACACAAAGTGCATTTGAAGTTAGT
GTTTGACAAGCTCCGGCAGAACCAGCTGTATGTGAAGAAGGAGAAATGTGCATTCGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAGATGTGGACAAATTAGTA
TGGATAGCGACAAGATAAAAGCTATCCAAGAATGGAAGGTTCCTACTTCCGTATCTGAGTTGCGGTCCTTCTTAGGACTAGCCAACTACTATAGGCGATTCGTCGAAGGG
TTTTCACGACGAGCCGCCCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTGAAAACAACCATGACGAG
GGGTCCTGTCCTCGGGTTGGTAGACGTCACAAAGCCATTTGAAGTAGAAACCGATGCTTCCGACTTTGCTCTCGGTGGAGTCCTTATTCAAGAAGGCCACCCCATCGCTT
ACGAAAGTCGAAAGCTCAACGATGCCGAACAAAGATACACGGTCTCCGAGAAAGAAATGTTGGCTGTAGTCCATTGCCTTCGAGTCTGGAGACAGTATCTCTTAGGATCA
CAGTTCGTAGTGAAGACGGATAACAGCGCCACTTGCCACTTCTTTGATCAACCAAAATTAACAGCAAAACAAGCCCGGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAA
GTTCAAACACAAAACAGGAAAGAGTAATCAAGCAGCCGACGCGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATTCATTCAAGTAAGATCGATG
GATCGATGCGCGACATCATCAAGGAACATTTACACAAAGACCCATCGGCCAAAGCTGTTGTCGAACTAGCTAAGGCTGGAAAGACACGACAGTTTTGGGTTGAGGGGGAC
CTCCTGATAACCAAAGGAAACAGATTGTACGTCCCAAGAACAGGGGAACTGAGGAAGAAGCTCATTTAG
Protein sequence
MNQVFYEYLDQFVIVYLDNIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLGHVVRCGQISMDSDKIKAIQEWKVPTSVSELRSFLGLANYYRRFVEG
FSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTRGPVLGLVDVTKPFEVETDASDFALGGVLIQEGHPIAYESRKLNDAEQRYTVSEKEMLAVVHCLRVWRQYLLGS
QFVVKTDNSATCHFFDQPKLTAKQARWQESLAEFDFKFKHKTGKSNQAADALSRKGEHAALCMLAHIHSSKIDGSMRDIIKEHLHKDPSAKAVVELAKAGKTRQFWVEGD
LLITKGNRLYVPRTGELRKKLI
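
The sequences above can be cross-checked against each other with a short script. The following is a minimal Python sketch, not part of the CuGenDB record: it assumes the standard genetic code (NCBI translation table 1), and the helper names `translate` and `span_length` are hypothetical, introduced only for illustration. It translates an in-frame CDS fragment and computes the length of a `chrom:start..end` location, assuming 1-based inclusive coordinates.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this gene uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped record
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def span_length(location: str) -> int:
    """Length in bp of a 'chrom:start..end' location (1-based, inclusive)."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


print(translate("ATGAACCAGGTTTTCTACGAATAC"))      # MNQVFYEY (first 8 codons of the CDS)
print(span_length("Cmo_Chr12:3805555..3806613"))  # 1059
```

Passing the full CDS (line breaks included) to `translate` should reproduce the protein sequence listed above. The 1059 bp span is a multiple of three, which is consistent with the CDS and mRNA sequences being identical in this record.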