; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh17G005430 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh17G005430
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr17:5391897..5393864
RNA-Seq ExpressionCmoCh17G005430
SyntenyCmoCh17G005430
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037573.1 reverse transcriptase [Cucumis melo var. makuwa]2.7e-28573.17Show/hide
Query:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        MGALKFLS+LQ+KV E    +E+GLM+VD  IN +P+KST++DSGATHN I + EA+RL L   KD E+MKAVNS ALPI+G+ K    ++G W+G +D 
Subjt:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVV+MDDF+VVLGMEFLLEH+VIPMPLAKCLVIT   P+V+   ++QP  L+MISA+QLK+GL+R+EPTFMAIPL     + ETVP+EI  VL  Y D+M
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK PYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFD+LHGAKYF+KLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+F+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
         K+++NQLYVK+EKC+FAQ  INFLGHV++CG+I M+  KI AI++W +P SVSE+RSFLGLANYYR+FVEGFS+RA+PLTELLKKD  W+W  +CQ AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        + LK  + +GP+L + DVTKPFE+ETDASD+ALG VL+Q GHPIAYESRKLN AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSA CHFF Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDF+F+HK G SNQA D LSRK EHAA+C+LAH+  S+I G
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia]2.2e-30378.66Show/hide
Query:  MGALKFLSALQRKVE-PKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        M ALKFLSA+Q++V  PK   EKGLMFVDATIN   +KST++DSGATHN I++QEARRL LTI KD  KMK VN EALPIVGVSK V  K+G WTG +D 
Subjt:  MGALKFLSALQRKVE-PKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVVRMDDF+VVLGMEFL+EHKVIPMPLAKC+++T ++PTV+  SIKQPG ++MISA+QLK+GL REEPTFMAIP++E+      VP EI+ V+  Y DIM
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP+TLPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK P+GAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFDQLHGAKYFTKLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVM FGLTNAPATFC +MNQ+F+EYLDQFV+VYLDDIVVYS TL+EH++HL+LVF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
        DK+RQNQLYVKKEKCAFAQ  I FLGHV++ GQISMD+DK+KAIQEW++PTSV E+RSFLGLANYYR+F+EGFSRRA P+TELLKK   W WS + Q AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        E+LK  M KGPVL L DVTKPFE+ETDASD+ALG VL+Q+ HPI YESRKLN+AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSAICHFF+Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDFKF+HKAGKSNQA D LSRKGEHA LCMLAHIHTSK DG
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo]0.0e+0094.66Show/hide
Query:  MGALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLV
        MGALKFLSALQRKVEPKEIIEKGLMFVDATINSR SKSTLIDSGATHN IADQEARRLGLTIGKDP KMKAVNSEALPIVGVSKGVPFKIGDWTGELDLV
Subjt:  MGALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLV

Query:  VVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMP
        VVRMDDF+VVLGMEFLLEHKVIPMPLAKCLVITD NPTVIPASIKQPGNL+MISAIQLKRGLAREEPTFMAIPL+EEATTEETVPEEIKEVL++YTDIMP
Subjt:  VVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMP

Query:  ESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISD
        ESLPQTLPPRR IDHEIELLPGVKPPAKNAYRMA PELAELRKQLDELLKAGFIRPAK PYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISD
Subjt:  ESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISD

Query:  LFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFD
        LFDQLHGAKYFTKLDLRS YYQVRI EGDEPKTTCVTRYGAFEFLVMPFGLTNAPATF  LMNQ+FYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFD
Subjt:  LFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFD

Query:  KIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFE
        K+RQNQLYVKKEKCAFAQTCINFLGHVV+CGQISMDSDKIKAIQEWK+PTSVSE+RSFLGLANYYR+FVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFE
Subjt:  KIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFE

Query:  NLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLK
        NLKTTMT+GPVL LVDVTKPFE+ETDASDFALG VLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVV+CLRVWRQY LGSQFVVKTDNSA CHFFDQ K
Subjt:  NLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLK

Query:  LTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        LTAKQA+WQESLAEFDFKF+HKAGKSNQA D LSRKGEHAALCMLAHIH+SKIDG
Subjt:  LTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo]1.2e-28595.01Show/hide
Query:  ALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVV
        A KFLSALQRKVEPKEIIEKGLMFVDATINSR SKSTLIDSGATHN IADQEARRLGLTIGKDP KMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVV
Subjt:  ALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVV

Query:  RMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMPES
        RMDDF+VVLGMEFLLEHKVIPMPLAKCLVITD NP VIPASIKQPGNL+MISAIQLKRGLAREEPTFMAIPL+EEATTEETVPEEIKEVL+SYTDIMPES
Subjt:  RMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMPES

Query:  LPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLF
        LPQTLPPRR IDHEIELLPGVKPPAKNAYRMA PELAELRKQLDELLKAGFIRPAK PYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLF
Subjt:  LPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLF

Query:  DQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKI
        DQLHGAKYFTKLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+FYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDK+
Subjt:  DQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKI

Query:  RQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENL
        RQNQLYVKKEKCAFAQTCI+FLGHVV+CGQISMDSDKIKAIQEWK+PTSVSE+RSFLGLANYYR+FVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENL
Subjt:  RQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENL

Query:  KTTMTKGPVLRLVDVTKPFEI
        KTTMT+GPVL LVD+TKPFE+
Subjt:  KTTMTKGPVLRLVDVTKPFEI

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo]0.0e+0094.81Show/hide
Query:  MGALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLV
        MGALKFLSALQRKVEPKEIIEKGLMFVDATINSR SKSTLIDSGATHN IADQEARRLGLTIGKDP KMKAVNSEALPIVGVSKGVPFKIGDWTGELDLV
Subjt:  MGALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLV

Query:  VVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMP
        VVRMDDF+VVLGMEFLLEHKVIPMPLAKCLVITD NPTVIPASIKQPGNL+MISAIQLKRGLAREEPTFMAIPL+EEATTEETVPEEIK+VL+SYTDIMP
Subjt:  VVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMP

Query:  ESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISD
        ESLPQTLPPRR IDHEIELLPGVKPPAKNAYRMA PELAELRKQLDELLKAGFIRPAK PYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISD
Subjt:  ESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISD

Query:  LFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFD
        LFDQLHGAKYFTKLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+FYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFD
Subjt:  LFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFD

Query:  KIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFE
        K+RQNQLYVKKEKCAFAQTCI+FLGHVV+CGQISMDSDKIKAIQEWK+PTSVSE+RSFLGLANYYR+FVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFE
Subjt:  KIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFE

Query:  NLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLK
        NLKTTMT+GPVL LVDVTKPFE+ETDASDFALG VLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVV+CLRVWRQY LGSQFVVKTDNSA CHFFDQ K
Subjt:  NLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLK

Query:  LTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        LTAKQA+WQESLAEFDFKF+HKAGKSNQA D LSRKGEHAALCMLAHIH+SKIDG
Subjt:  LTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

TrEMBL top hitse value%identityAlignment
A0A5A7T3M3 Reverse transcriptase1.3e-28573.17Show/hide
Query:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        MGALKFLS+LQ+KV E    +E+GLM+VD  IN +P+KST++DSGATHN I + EA+RL L   KD E+MKAVNS ALPI+G+ K    ++G W+G +D 
Subjt:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVV+MDDF+VVLGMEFLLEH+VIPMPLAKCLVIT   P+V+   ++QP  L+MISA+QLK+GL+R+EPTFMAIPL     + ETVP+EI  VL  Y D+M
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK PYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFD+LHGAKYF+KLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+F+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
         K+++NQLYVK+EKC+FAQ  INFLGHV++CG+I M+  KI AI++W +P SVSE+RSFLGLANYYR+FVEGFS+RA+PLTELLKKD  W+W  +CQ AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        + LK  + +GP+L + DVTKPFE+ETDASD+ALG VL+Q GHPIAYESRKLN AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSA CHFF Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDF+F+HK G SNQA D LSRK EHAA+C+LAH+  S+I G
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

A0A5D3BRZ6 Reverse transcriptase8.3e-28573.02Show/hide
Query:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        MGALKFLS+LQ+KV E    +E+GLM+VD  IN +P+KST++DSGATHN I + EA+RL L   KD  +MKAVNS ALPI+G+ K    ++G W+G +D 
Subjt:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVV+MDDF+VVLGMEFLLEH+VIPMPLAKCLVIT   P+V+   ++QP  L+MISA+QLK+GL+R+EPTFMAIPL     + ETVP+EI  VL  Y D+M
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK PYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFD+LHGAKYF+KLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+F+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
         K+++NQLYVK+EKC+FAQ  INFLGHV++CG+I M+  KI AI++W +P SVSE+RSFLGLANYYR+FVEGFS+RA+PLTELLKKD  W+W  +CQ AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        + LK  + +GP+L + DVTKPFE+ETDASD+ALG VL+Q GHPIAYESRKLN AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSA CHFF Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDF+F+HK G SNQA D LSRK EHAA+C+LAH+  S+I G
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

A0A5D3C4R1 Reverse transcriptase8.3e-28573.02Show/hide
Query:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        MGALKFLS+LQ+KV E    +E+GLM+VD  IN +P+KST++DSGATHN I + EA+RL L   KD  +MKAVNS ALPI+G+ K    ++G W+G +D 
Subjt:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVV+MDDF+VVLGMEFLLEH+VIPMPLAKCLVIT   P+V+   ++QP  L+MISA+QLK+GL+R+EPTFMAIPL     + ETVP+EI  VL  Y D+M
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK PYGAPVLFQ+KKDG+LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFD+LHGAKYF+KLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+F+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
         K+++NQLYVK+EKC+FAQ  INFLGHV++CG+I M+  KI AI++W +P SVSE+RSFLGLANYYR+FVEGFS+RA+PLTELLKKD  W+W  +CQ AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        + LK  + +GP+L + DVTKPFE+ETDASD+ALG VL+Q GHPIAYESRKLN AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSA CHFF Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDF+F+HK G SNQA D LSRK EHAA+C+LAH+  S+I G
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

A0A5D3C9P8 Reverse transcriptase4.9e-28573.17Show/hide
Query:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        MGALKFLS+LQ+KV E    +E+GLM+VD  IN +P+KST++DSGATHN I + EA+RL L   KD  +MKAVNS ALPI+G+ K    ++G W+G +D 
Subjt:  MGALKFLSALQRKV-EPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVV+MDDF+VVLGMEFLLEH+VIPMPLAKCLVIT   P+V+   ++QP  L+MISA+QLK+GL+R+EPTFMAIPL     + ETVP+EI  VL  Y D+M
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP++LPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK PYGAPVLFQKKKDG+LRLCIDYRALNK+TVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFD+LHGAKYF+KLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFC LMNQ+F+EYLD+FV+VYLDDIVVYSTT+EEH+ HL+ VF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
         K+++NQLYVK+EKC+FAQ  INFLGHV++CG+I M+  KI AI++W +P SVSE+RSFLGLANYYR+FVEGFS+RA+PLTELLKKD  W+W  +CQ AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        + LK  + +GP+L + DVTKPFE+ETDASD+ALG VL+Q GHPIAYESRKLN AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSA CHFF Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDF+F+HK G SNQA D LSRK EHAA+C+LAH+  S+I G
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

A0A6J1D906 Reverse transcriptase1.0e-30378.66Show/hide
Query:  MGALKFLSALQRKVE-PKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL
        M ALKFLSA+Q++V  PK   EKGLMFVDATIN   +KST++DSGATHN I++QEARRL LTI KD  KMK VN EALPIVGVSK V  K+G WTG +D 
Subjt:  MGALKFLSALQRKVE-PKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDL

Query:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM
        VVVRMDDF+VVLGMEFL+EHKVIPMPLAKC+++T ++PTV+  SIKQPG ++MISA+QLK+GL REEPTFMAIP++E+      VP EI+ V+  Y DIM
Subjt:  VVVRMDDFNVVLGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIM

Query:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS
        P+SLP+TLPPRR IDHEIEL+PG KPPAKNAYRMA PELAELRKQLDELL AGFIRPAK P+GAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPII+
Subjt:  PESLPQTLPPRRDIDHEIELLPGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIIS

Query:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF
        DLFDQLHGAKYFTKLDLRS YYQVRIAEGDEPKTTCVTRYGAFEFLVM FGLTNAPATFC +MNQ+F+EYLDQFV+VYLDDIVVYS TL+EH++HL+LVF
Subjt:  DLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVF

Query:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF
        DK+RQNQLYVKKEKCAFAQ  I FLGHV++ GQISMD+DK+KAIQEW++PTSV E+RSFLGLANYYR+F+EGFSRRA P+TELLKK   W WS + Q AF
Subjt:  DKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAF

Query:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL
        E+LK  M KGPVL L DVTKPFE+ETDASD+ALG VL+Q+ HPI YESRKLN+AERRYTVSEKEMLAVV+CLR WRQY LGS FVVKTDNSAICHFF+Q 
Subjt:  ENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQL

Query:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG
        KLT+KQA+WQE LAEFDFKF+HKAGKSNQA D LSRKGEHA LCMLAHIHTSK DG
Subjt:  KLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.2e-8539.52Show/hide
Query:  AKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYY
        +K +Y  A  +  E+  Q+ ++L  G IR +  PY +P+ +  KK+D +     R+ IDYR LN++TV +++P+P + ++  +L    YFT +DL   ++
Subjt:  AKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYY

Query:  QVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCI
        Q+ +      KT   T++G +E+L MPFGL NAPATF   MN I    L++  +VYLDDI+V+ST+L+EH   L LVF+K+ +  L ++ +KC F +   
Subjt:  QVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCI

Query:  NFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTKGPVLRLVDVTKP
         FLGHV+    I  + +KI+AIQ++ IPT   E+++FLGL  YYR+F+  F+  A P+T+ LKK+     +N +   AF+ LK  +++ P+L++ D TK 
Subjt:  NFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTKGPVLRLVDVTKP

Query:  FEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLKLTAKQAQWQESLAEFDFKFK
        F + TDASD ALG VL Q+GHP++Y SR LN+ E  Y+  EKE+LA+V+  + +R Y LG  F + +D+  +   +      +K  +W+  L+EFDF  K
Subjt:  FEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLKLTAKQAQWQESLAEFDFKFK

Query:  HKAGKSNQATDTLSR
        +  GK N   D LSR
Subjt:  HKAGKSNQATDTLSR

P0CT34 Transposon Tf2-1 polyprotein9.7e-8136.09Show/hide
Query:  EIKEVLNSYTDIMPESLPQTLP-PRRDIDHEIELL-PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRAL
        E+ ++   + DI  E+  + LP P + ++ E+EL     + P +N Y +   ++  +  ++++ LK+G IR +K     PV+F  KK+GTLR+ +DY+ L
Subjt:  EIKEVLNSYTDIMPESLPQTLP-PRRDIDHEIELL-PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N I  E  +  V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVY

Query:  STTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLK
        S +  EH  H+K V  K++   L + + KC F Q+ + F+G+ +     +   + I  + +WK P +  E+R FLG  NY R+F+   S+   PL  LLK
Subjt:  STTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEG-----HPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRL
        KD  W W+     A EN+K  +   PVLR  D +K   +ETDASD A+G VL Q+      +P+ Y S K++ A+  Y+VS+KEMLA++  L+ WR Y  
Subjt:  KDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEG-----HPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRL

Query:  GS--QFVVKTDN-SAICHFFDQLKLTAKQ-AQWQESLAEFDFKFKHKAGKSNQATDTLSR
         +   F + TD+ + I    ++ +   K+ A+WQ  L +F+F+  ++ G +N   D LSR
Subjt:  GS--QFVVKTDN-SAICHFFDQLKLTAKQ-AQWQESLAEFDFKFKHKAGKSNQATDTLSR

P0CT35 Transposon Tf2-2 polyprotein9.7e-8136.09Show/hide
Query:  EIKEVLNSYTDIMPESLPQTLP-PRRDIDHEIELL-PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRAL
        E+ ++   + DI  E+  + LP P + ++ E+EL     + P +N Y +   ++  +  ++++ LK+G IR +K     PV+F  KK+GTLR+ +DY+ L
Subjt:  EIKEVLNSYTDIMPESLPQTLP-PRRDIDHEIELL-PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N I  E  +  V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVY

Query:  STTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLK
        S +  EH  H+K V  K++   L + + KC F Q+ + F+G+ +     +   + I  + +WK P +  E+R FLG  NY R+F+   S+   PL  LLK
Subjt:  STTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEG-----HPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRL
        KD  W W+     A EN+K  +   PVLR  D +K   +ETDASD A+G VL Q+      +P+ Y S K++ A+  Y+VS+KEMLA++  L+ WR Y  
Subjt:  KDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEG-----HPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRL

Query:  GS--QFVVKTDN-SAICHFFDQLKLTAKQ-AQWQESLAEFDFKFKHKAGKSNQATDTLSR
         +   F + TD+ + I    ++ +   K+ A+WQ  L +F+F+  ++ G +N   D LSR
Subjt:  GS--QFVVKTDN-SAICHFFDQLKLTAKQ-AQWQESLAEFDFKFKHKAGKSNQATDTLSR

P0CT41 Transposon Tf2-12 polyprotein9.7e-8136.09Show/hide
Query:  EIKEVLNSYTDIMPESLPQTLP-PRRDIDHEIELL-PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRAL
        E+ ++   + DI  E+  + LP P + ++ E+EL     + P +N Y +   ++  +  ++++ LK+G IR +K     PV+F  KK+GTLR+ +DY+ L
Subjt:  EIKEVLNSYTDIMPESLPQTLP-PRRDIDHEIELL-PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRAL

Query:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVY
        NK    N YPLP+I  L  ++ G+  FTKLDL+S Y+ +R+ +GDE K       G FE+LVMP+G++ APA F   +N I  E  +  V+ Y+DDI+++
Subjt:  NKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVY

Query:  STTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLK
        S +  EH  H+K V  K++   L + + KC F Q+ + F+G+ +     +   + I  + +WK P +  E+R FLG  NY R+F+   S+   PL  LLK
Subjt:  STTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLK

Query:  KDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEG-----HPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRL
        KD  W W+     A EN+K  +   PVLR  D +K   +ETDASD A+G VL Q+      +P+ Y S K++ A+  Y+VS+KEMLA++  L+ WR Y  
Subjt:  KDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEG-----HPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRL

Query:  GS--QFVVKTDN-SAICHFFDQLKLTAKQ-AQWQESLAEFDFKFKHKAGKSNQATDTLSR
         +   F + TD+ + I    ++ +   K+ A+WQ  L +F+F+  ++ G +N   D LSR
Subjt:  GS--QFVVKTDN-SAICHFFDQLKLTAKQ-AQWQESLAEFDFKFKHKAGKSNQATDTLSR

P20825 Retrovirus-related Pol polyprotein from transposon 2975.2e-8239.33Show/hide
Query:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSM
        P     Y +A     E+  Q+ E+L  G IR +  PY +P  +  KK D +     R+ IDYR LN++T+ ++YP+P + ++  +L   +YFT +DL   
Subjt:  PPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPV-LFQKKKDGT----LRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSM

Query:  YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQT
        ++Q+ + E    KT   T+ G +E+L MPFGL NAPATF   MN I    L++  +VYLDDI+++ST+L EH   ++LVF K+    L ++ +KC F + 
Subjt:  YYQVRIAEGDEPKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQT

Query:  CINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTKGPVLRLVDVT
          NFLGH+V    I  +  K+KAI  + IPT   E+R+FLGL  YYR+F+  ++  A P+T  LKK         +   AFE LK  + + P+L+L D  
Subjt:  CINFLGHVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSN-DCQMAFENLKTTMTKGPVLRLVDVT

Query:  KPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLKLTAKQAQWQESLAEFDFK
        K F + TDAS+ ALG VL Q GHPI++ SR LND E  Y+  EKE+LA+V+  + +R Y LG QF++ +D+  +    +  +  AK  +W+  L+E+ FK
Subjt:  KPFEIETDASDFALGVVLIQEGHPIAYESRKLNDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLKLTAKQAQWQESLAEFDFK

Query:  FKHKAGKSNQATDTLSR
          +  GK N   D LSR
Subjt:  FKHKAGKSNQATDTLSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.3e-2140.77Show/hide
Query:  HLKLVFDKIRQNQLYVKKEKCAFAQTCINFLG--HVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSW
        HL +V     Q+Q Y  ++KCAF Q  I +LG  H++    +S D  K++A+  W  P + +E+R FLGL  YYR+FV+ + +   PLTELLKK +   W
Subjt:  HLKLVFDKIRQNQLYVKKEKCAFAQTCINFLG--HVVKCGQISMDSDKIKAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSW

Query:  SNDCQMAFENLKTTMTKGPVLRLVDVTKPF
        +    +AF+ LK  +T  PVL L D+  PF
Subjt:  SNDCQMAFENLKTTMTKGPVLRLVDVTKPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGCGCTCAAATTCTTGTCAGCCCTCCAACGGAAAGTCGAACCGAAGGAGATAATAGAGAAAGGGCTCATGTTCGTGGATGCGACAATAAATTCTCGACCGAGCAA
GAGCACTCTGATAGACTCTGGAGCAACCCACAACTCCATTGCCGATCAAGAAGCCCGAAGATTGGGACTCACCATAGGAAAGGACCCGGAAAAAATGAAAGCTGTCAACT
CCGAGGCCTTGCCTATTGTGGGAGTTTCCAAAGGAGTCCCCTTCAAAATAGGGGATTGGACTGGAGAGTTAGACCTTGTCGTGGTTCGCATGGACGACTTCAACGTGGTA
CTTGGAATGGAGTTTCTCCTAGAACACAAGGTCATCCCAATGCCACTGGCGAAATGCTTGGTGATCACCGACCACAACCCCACAGTAATACCTGCAAGCATCAAGCAACC
AGGTAATCTTCAGATGATCTCGGCCATACAACTGAAAAGGGGACTCGCACGAGAGGAACCTACATTCATGGCCATACCACTGATGGAAGAAGCGACCACCGAAGAGACTG
TCCCAGAGGAAATCAAGGAAGTACTAAACAGTTATACCGACATAATGCCAGAGAGTCTACCCCAAACACTACCACCTCGTCGAGACATTGACCATGAAATCGAACTCCTC
CCCGGGGTTAAACCGCCAGCGAAGAACGCATACCGGATGGCTCTGCCCGAGCTAGCCGAGTTGAGGAAACAACTAGATGAGTTGCTGAAGGCGGGATTCATTCGCCCGGC
AAAGGTACCTTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATC
CACTGCCGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAATGTATTACCAAGTACGTATTGCCGAAGGAGATGAA
CCAAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCCCCAGCTACATTTTGCATGTTGATGAACCAGATTTTCTA
CGAATACTTGGATCAGTTCGTCATAGTATACCTCGACGACATCGTTGTTTACAGCACAACCCTAGAGGAACACAAAGTGCACTTGAAGCTGGTGTTTGACAAGATACGGC
AGAACCAACTGTATGTGAAGAAGGAGAAATGTGCATTTGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACAAATTAGTATGGATAGTGACAAGATA
AAAGCTATCCAAGAATGGAAAATTCCTACTTCCGTATCCGAGGTGCGGTCCTTCTTAGGATTGGCCAACTACTATAGGCAGTTCGTCGAAGGGTTTTCACGACGAGCCGC
CCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTAAAAACAACCATGACGAAGGGTCCTGTCCTCAGGT
TGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGATGCTTCCGACTTTGCTCTAGGTGTAGTCCTCATTCAGGAAGGCCACCCCATCGCTTACGAGAGTCGAAAGCTC
AACGATGCCGAACGGAGATACACTGTCTCCGAGAAAGAAATGCTGGCTGTAGTCTACTGCCTTCGAGTCTGGAGACAGTATCGCTTAGGATCACAGTTCGTAGTGAAGAC
GGATAACAGCGCCATTTGCCACTTCTTTGATCAACTAAAATTAACGGCAAAACAAGCCCAGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCAAACACAAAGCAG
GAAAGAGTAATCAAGCAACCGACACGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATCCATACAAGTAAGATCGATGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCGCGCTCAAATTCTTGTCAGCCCTCCAACGGAAAGTCGAACCGAAGGAGATAATAGAGAAAGGGCTCATGTTCGTGGATGCGACAATAAATTCTCGACCGAGCAA
GAGCACTCTGATAGACTCTGGAGCAACCCACAACTCCATTGCCGATCAAGAAGCCCGAAGATTGGGACTCACCATAGGAAAGGACCCGGAAAAAATGAAAGCTGTCAACT
CCGAGGCCTTGCCTATTGTGGGAGTTTCCAAAGGAGTCCCCTTCAAAATAGGGGATTGGACTGGAGAGTTAGACCTTGTCGTGGTTCGCATGGACGACTTCAACGTGGTA
CTTGGAATGGAGTTTCTCCTAGAACACAAGGTCATCCCAATGCCACTGGCGAAATGCTTGGTGATCACCGACCACAACCCCACAGTAATACCTGCAAGCATCAAGCAACC
AGGTAATCTTCAGATGATCTCGGCCATACAACTGAAAAGGGGACTCGCACGAGAGGAACCTACATTCATGGCCATACCACTGATGGAAGAAGCGACCACCGAAGAGACTG
TCCCAGAGGAAATCAAGGAAGTACTAAACAGTTATACCGACATAATGCCAGAGAGTCTACCCCAAACACTACCACCTCGTCGAGACATTGACCATGAAATCGAACTCCTC
CCCGGGGTTAAACCGCCAGCGAAGAACGCATACCGGATGGCTCTGCCCGAGCTAGCCGAGTTGAGGAAACAACTAGATGAGTTGCTGAAGGCGGGATTCATTCGCCCGGC
AAAGGTACCTTACGGAGCCCCCGTACTGTTCCAGAAGAAGAAGGATGGGACGTTGCGTCTGTGCATAGACTATAGAGCCTTAAACAAGGTGACGGTACGCAACAAATATC
CACTGCCGATAATATCCGACTTGTTCGACCAACTTCACGGGGCCAAGTACTTCACGAAGTTGGACTTACGATCAATGTATTACCAAGTACGTATTGCCGAAGGAGATGAA
CCAAAGACGACGTGCGTAACAAGATATGGGGCCTTTGAATTCCTAGTAATGCCCTTTGGCTTGACAAACGCCCCAGCTACATTTTGCATGTTGATGAACCAGATTTTCTA
CGAATACTTGGATCAGTTCGTCATAGTATACCTCGACGACATCGTTGTTTACAGCACAACCCTAGAGGAACACAAAGTGCACTTGAAGCTGGTGTTTGACAAGATACGGC
AGAACCAACTGTATGTGAAGAAGGAGAAATGTGCATTTGCACAAACATGCATCAACTTCCTTGGACATGTCGTCAAATGTGGACAAATTAGTATGGATAGTGACAAGATA
AAAGCTATCCAAGAATGGAAAATTCCTACTTCCGTATCCGAGGTGCGGTCCTTCTTAGGATTGGCCAACTACTATAGGCAGTTCGTCGAAGGGTTTTCACGACGAGCCGC
CCCATTGACAGAGCTGTTGAAGAAAGACCACCCTTGGTCGTGGTCGAATGATTGTCAAATGGCCTTTGAAAATCTAAAAACAACCATGACGAAGGGTCCTGTCCTCAGGT
TGGTAGATGTTACAAAGCCATTTGAAATAGAAACAGATGCTTCCGACTTTGCTCTAGGTGTAGTCCTCATTCAGGAAGGCCACCCCATCGCTTACGAGAGTCGAAAGCTC
AACGATGCCGAACGGAGATACACTGTCTCCGAGAAAGAAATGCTGGCTGTAGTCTACTGCCTTCGAGTCTGGAGACAGTATCGCTTAGGATCACAGTTCGTAGTGAAGAC
GGATAACAGCGCCATTTGCCACTTCTTTGATCAACTAAAATTAACGGCAAAACAAGCCCAGTGGCAGGAGTCGTTGGCCGAATTCGACTTCAAGTTCAAACACAAAGCAG
GAAAGAGTAATCAAGCAACCGACACGCTGAGTCGGAAGGGCGAACATGCGGCCCTGTGCATGTTAGCCCATATCCATACAAGTAAGATCGATGGATAG
Protein sequenceShow/hide protein sequence
MGALKFLSALQRKVEPKEIIEKGLMFVDATINSRPSKSTLIDSGATHNSIADQEARRLGLTIGKDPEKMKAVNSEALPIVGVSKGVPFKIGDWTGELDLVVVRMDDFNVV
LGMEFLLEHKVIPMPLAKCLVITDHNPTVIPASIKQPGNLQMISAIQLKRGLAREEPTFMAIPLMEEATTEETVPEEIKEVLNSYTDIMPESLPQTLPPRRDIDHEIELL
PGVKPPAKNAYRMALPELAELRKQLDELLKAGFIRPAKVPYGAPVLFQKKKDGTLRLCIDYRALNKVTVRNKYPLPIISDLFDQLHGAKYFTKLDLRSMYYQVRIAEGDE
PKTTCVTRYGAFEFLVMPFGLTNAPATFCMLMNQIFYEYLDQFVIVYLDDIVVYSTTLEEHKVHLKLVFDKIRQNQLYVKKEKCAFAQTCINFLGHVVKCGQISMDSDKI
KAIQEWKIPTSVSEVRSFLGLANYYRQFVEGFSRRAAPLTELLKKDHPWSWSNDCQMAFENLKTTMTKGPVLRLVDVTKPFEIETDASDFALGVVLIQEGHPIAYESRKL
NDAERRYTVSEKEMLAVVYCLRVWRQYRLGSQFVVKTDNSAICHFFDQLKLTAKQAQWQESLAEFDFKFKHKAGKSNQATDTLSRKGEHAALCMLAHIHTSKIDG