; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G10200 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G10200
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr02:16115694..16118393
RNA-Seq ExpressionClc02G10200
SyntenyClc02G10200
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012833379.1 PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata]0.0e+0072.03Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL G+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DK R Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT  NDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LVFGKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F +G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata]0.0e+0072.03Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL G+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DK R Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F +G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_012846413.1 PREDICTED: uncharacterized protein LOC105966405 [Erythranthe guttata]0.0e+0072.03Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL G+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VLQRC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DK R Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  +E  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F +G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_012847037.1 PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata]0.0e+0072.03Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL G+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DK R Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F +G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]0.0e+0075.34Show/hide
Query:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP
        MH+ILME+ +KPSIEHQRRLNP MKEVV+ E+LKLLNAGIIYAISDS WVSPVQVVPKKGGMTVVKN+ +E I TRTVTGWRVC+DYRKLNKATRKDHFP
Subjt:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP

Query:  LPFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN
        LPFIDQMLDRLAGYS+YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMM+IF  ++EDIME+FMDDFSVFG+SFD CL N
Subjt:  LPFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN

Query:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA
        L  VLQRC+D NLVLNWEKCHFMV EGI+LGH+VS KG+EVDRAKI  IE+LPPP NVK +RSFLGHAGFYRRFIKDFSK++KPL NLLE  + F FDD 
Subjt:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA

Query:  CLQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHA
        CLQAFN +KE+LI+AP++ VPDWSQ FE+MCDASD+ALGAVLGQRRD +FRAIYYASRTL+  Q  YTT+EKE+LAVVFA DK RSYL+ +K++V TDHA
Subjt:  CLQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHA

Query:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRF
        AL+YLF KKD+KPRL+RWILLLQEFDLE++D+KG EN VADHLSR+E EE +    I E FPDEQL+  +  LPW+ADIVN+LA   LPPD+ Y Q+K+F
Subjt:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE
        LH+VK Y W++PLL+K C D +IR+CVP+EE+ +IL+ CH+S YGGHFG TRTAAKVLQS F+W S+F+D YT VK+CDRCQR GNISR+ ELP K ILE
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE

Query:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ
        VELFDVWGIDFMGPFP SF  ++YIL+ VDYVSKWVEAIAT TNDA+ V+KFLHKNIFTRFGTPRAIISDEG+HFCN LF +++ KY V HKIA AYHPQ
Subjt:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ
        TNG AE+SNREIK +LEK V  NRKDWA KLDDALWAYRTAFKTPIG SPYRLVFGKACHLPVELEH+AYWA+KK N+D + AGEKRLL+LNEM+EFR  
Subjt:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ

Query:  AYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE
        AYENAK+YKERT +WHDK+I  R F  GQ+VLLFNSRL+LFPGKLR+RW+GP+ I KVS  GA++L+   G  F+VN QRLKHY G++
Subjt:  AYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase0.0e+0070.34Show/hide
Query:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP
        MH+IL+ED  KPS+E QRRLNP MKEVVKKE++K L+AGIIY ISDS WVSPVQ VPKKGG+TVV N  +E+I TRTVTGWRVC+DYRKLNKATRKDHFP
Subjt:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP

Query:  LPFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN
        L FIDQMLDRLAG   YCFLDGY GYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAPATFQRCMM+IF  ++E+ +EVFMDDFSV+G+SFD CL N
Subjt:  LPFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN

Query:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA
        L+ VL+RC+D NL+LNWEKCHFMV EGI+LGHKVS +G+EVD+AK+  IE+LPPPT+VK VRSFLGHAGFYRRFIKDFSKI+KPL NLLE +  F FDDA
Subjt:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA

Query:  CLQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHA
        C  AFN LK RLI+APII VPDWS  FE+MCDASD+A+GAVLGQR+D +FR+IYYAS+TL++ Q  YTT+EKELLAVVFA DK RSYL+G+K++V+TDHA
Subjt:  CLQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHA

Query:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFPDEQLYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKK
        A++YL  KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADHLSR+E+      P ++ + FPDEQL   V   +PW+ADIVNYL  G +P D++ QQKK
Subjt:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFPDEQLYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKK

Query:  RFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPI
        +FL + + Y W+DP L+K   DN++R+CVP+ E+  IL  CHASPYGGHF   RTAAK+LQS F+W +LFKD ++FV +CDRCQRTGNISR+HE+P   I
Subjt:  RFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPI

Query:  LEVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYH
        LEVELFDVWGIDFMGPF  SF + +YILV VDYVSKWVEA A   ND++ VV F+ KNIFTRFGTPRAIISD G+HFCN  F++++ KY V HKI+T YH
Subjt:  LEVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYH

Query:  PQTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFR
        PQT+G  E+SNREIK++LEK V + RKDW+ +LD+ALWAYRTA+KTPIG SPYRLVFGKACHLPVELEH AYWAI+KLN D + AGEKRLL+LNE++EFR
Subjt:  PQTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFR

Query:  AQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKVNVQRLKHYIGD
          AYENAK+YKE+  RWH+KKI  R F  GQ VLLFNSRL+LFPGKL++RWSGPF I +V PHGAVEL+  N    FKVN QR+KHY G+
Subjt:  AQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKVNVQRLKHYIGD

A0A4Y1RS99 Transposable element protein0.0e+0069.11Show/hide
Query:  MEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFID
        ME+  KPS+EHQRRLNP MKEVV+ EVLKLL+AGIIY ISDS WVSP QVVPKKGGMTVVKNE +E++ TRTVTGWRVCIDYRKLN ATRKDHFPLPFID
Subjt:  MEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFID

Query:  QMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVL
        QML+RLAG+++YCFLDGY GYNQI IAPEDQEKTTFTCP+GTFA++RMPFGLCNAPATFQRCMMSIF  ++E  +EVFMDDFSVFGSSFDSCL NL  VL
Subjt:  QMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVL

Query:  QRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAF
         RC++ NLVLNWEKCHFMV EGI+LGHK+S +G+EVDRAKI  IE+LPPP+ VK +RSFLGHAGFYRRFIKDFSKI KPL  LL  +++F FD  CL+AF
Subjt:  QRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAF

Query:  NTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYL
        N LK +L  AP+I+ PDW   FEIMCDASDYA+GAVLGQR++ +   I+YASRTL++ Q  Y T+EKELLAVVFA+DK RSYLLG+K++V+TDHAALK+L
Subjt:  NTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYL

Query:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFPDEQLYQVKDS----LPWFADIVNYLAGGHLPPDMNYQQKKR
          KK++KPRL+RW+LLLQEFD+EI+D+KG ENVVADHLSR+  E+E  +   PI+E FPDEQLY +  +     PW+AD VNYLA G LPPDM++ QKK+
Subjt:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFPDEQLYQVKDS----LPWFADIVNYLAGGHLPPDMNYQQKKR

Query:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL
        FL  VK Y+W+DP L+K   D +IR+CVP+ E+  IL  CH    GGH+G ++T AKVLQS F+W +LFKD   FV  CD CQRTGNIS ++++P   IL
Subjt:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL

Query:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP
        EVELFDVWGIDFMGPFP S+ + LYILV VDYVSKWVEA A  TNDA+ VV+FL KNIFTRFG PRAIISD G+HFCN  F S++ KY + HK++T YHP
Subjt:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP

Query:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA
        QT+G  E+SNRE+K++LEK V  +RKDW+LKLDDALWAYRTAFK PIG SPYRLVFGKACHLPVELEH+A+WAIK LN D   AGEKR L+LNE+EE R 
Subjt:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA

Query:  QAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGN-NGTTFKVNVQRLKHYI
        ++YENAK+YK+RT +WHDK I  + F  GQ VLL+NSRL+LFPGKLR+RWSGPF ++ V P+G VE++ + +GTTFKVN  RLK Y+
Subjt:  QAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGN-NGTTFKVNVQRLKHYI

A0A5H2XID6 Reverse transcriptase0.0e+0068.49Show/hide
Query:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP
        MHRILME+  KPS + QRRLNP MKEVV+ EVLKLL+ GIIY ISDS WVSPVQVVPKK G+TVVKNEK+E++ TRT+TGWRVCIDYRKLN +TRKDHFP
Subjt:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP

Query:  LPFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN
        LPFIDQMLDRL+G+++YCFLDG+ GYNQI IAPEDQEKTTFTCP+GTFA++RMPFGLCNAPATFQRCMM+IF  ++E  MEVFMDDFSVFGSSFD CL +
Subjt:  LPFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN

Query:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA
        L+ VL RCQ+ NL+LNWEKCHFMV +GI+LGH VS KG++VD+AKI  I  LPPP++VK VRSFLGHAGFYRRFIK+FS I++PL NLL  +A F FD+ 
Subjt:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA

Query:  CLQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHA
        C++AF TLK+ L +APIII PDWS  FEIMCDASD+A+GAVLGQ+++ +   I+YASRTL++ Q  Y+T+EKELLAVVFA++K R YL+GSK++V++DHA
Subjt:  CLQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHA

Query:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI-ENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKR
        AL+YL  KKD+KPRL+RWILLLQEFDLEI+D+KGCENVVADHLSRI   E+ ++  P+ E FPDEQLY  +   PW+AD VNYLA G L  D+ YQ KK+
Subjt:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI-ENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKR

Query:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL
        F   VK Y W++P L+K C D +IR+CVP+EE  SIL   H    GGHFG  +TA K+LQS F+W +LFKD + F   CDRCQR GNISR++ELP K IL
Subjt:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL

Query:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP
         VELFDVWGIDFMGPFP SF  Y YILV VDYVSKWVEAIAT+TND + V+KFL  NIFTRFGTPRA+ISD GSHFCN LF+++M+KYN+ H+++T YHP
Subjt:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP

Query:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA
        QT+G  E+SNREIKQ+LEKVV + RKDWA KL+DALWAYRTA+KTPIG SPYRLVFGKACHLP+ELEH A+WAIKKLN D +KAG  R  +LNE+EE R 
Subjt:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA

Query:  QAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQG-NNGTTFKVNVQRLKHYIGDEERGLE
        ++YENAKLYKERT  +HD+ I  + F  G  VLLFNSRLRLFPGKL++RW GPF +V VSP+GAVE+Q   +G+TFKVN QRLK +      G++
Subjt:  QAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQG-NNGTTFKVNVQRLKHYIGDEERGLE

A0A6P8CBX2 Reverse transcriptase0.0e+0070.05Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        HRI++E   KP ++ QRRLNPT+KEVVKKEVLKLL+AGIIY ISDS WVSPVQVVPKKGGMTVVKNE +++I TRTVTGWRVCIDYRKLN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQML++LAG+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +E+FMDDFSVFG SF+SCL NL
Subjt:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC++ NL+LNWEKCHFMV EGI+LGHKVS+KG+EVDRAK+  IE+LPPPT+ K VRSFLGHAGFYRRFIKDFSKI++PL NLLE ++ F+F+D C
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA
        LQAFN LKE+L +AP+I+ P+W   FE+MCDASDYA+GAVLGQRR  +F AIYYASRTL+  Q+ Y T+EKELLAV+FA DK R YL+GSKI+V+TDHAA
Subjt:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF
        LKYLF K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKFPDEQL+  +   LPW+ADIVNY+     P  ++ QQKK+F
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE
        LH+VK Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L   FYW  +F DC  ++ SC  CQRTGNISR+HE+P   IL 
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE

Query:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ
        +ELFDVWGIDFMGPFP SF++  YILV VDYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRAIISD GSHFCN  F+ ++ KY V HKIAT YHPQ
Subjt:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ
        T G  E+SNREIK++LEK V  +RKDW+LKLDDALWAYRTAFKTPIG SPY++V+GK+CHLPVELEH+AYWAIK LN D + AGEKRLL+LN+M E R +
Subjt:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ

Query:  AYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE
        AYENA++YKER  RWHD+ I  R FL GQ+VLL+NSRL+LFPGKL++RWSGPFVI  V P+GAVEL+  +  TFKVN   LKHY   E
Subjt:  AYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE

A0A6P8DLJ8 Reverse transcriptase0.0e+0069.82Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        HRI++E   KP ++ QRRLNPT+KEVVKKEVLKLL+AGIIY ISDS WVSPVQVVPKKGGMTVVKNE +E+I TRTVTGWRVCIDYRKLN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQML++L G+ +YCFLDGY GYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +E+FMDDFSVFG SF+SCL NL
Subjt:  PFIDQMLDRLAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC++ NL+LNWEKCHFMV EGI+LGHKVS+KG+EVDRAK+  IE+LPPPT+ K VRSFLGHAGFYRRFIKDFSKI++PL NLLE ++ F+F+D C
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA
        LQAFN LKE+L +AP+I+ P+W   FE+MC ASDYA+GAVLGQRR  +F AIYYASRTL+  Q+ Y T+EKELLAV+FA DK R YL+GSKI+V+TDHAA
Subjt:  LQAFNTLKERLIAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF
        LKYLF K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKFPDEQL+  +   LPW+ADIVNY+     P  ++ QQKK+F
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE
        LH+VK Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L   FYW  +F DC  ++ SC  CQRTGNISR+HE+P   IL 
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE

Query:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ
        +ELFDVWGIDFMGPFP SF++  YILV VDYVSKWVEA+A ++NDAR V++FL KNIF+R G PRAIISD GSHFCN  F+ ++ KY V HKIAT YHPQ
Subjt:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ
        T G  E+SNREIK++LEK V  +RKDW+LKLDDALWAYRTAFKTPIG SPY++V+GK+CHLPVELEH+AYWAIK LN D + AGEKRLL+LN+M E R +
Subjt:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ

Query:  AYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE
        AYENA++YKER  RWHD+ I  R FL GQ+VLL+NSRL+LFPGKL++RWSGPFVI  V P+GAVEL+  +  TFKVN   LKHY   E
Subjt:  AYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.3e-8040.52Show/hide
Query:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGYL
        ++ V+ ++  +LN GII   S+SP+ SP+ VVPKK   +  +              +R+ IDYRKLN+ T  D  P+P +D++L +L   +++  +D   
Subjt:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGYL

Query:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV
        G++QI + PE   KT F+  +G + + RMPFGL NAPATFQRCM  I + L+     V++DD  VF +S D  L +L  V ++   ANL L  +KC F+ 
Subjt:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV

Query:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKF-IFDDACLQAFNTLKERLIAAPIIIVPDW
         E   LGH ++  G++ +  KI AI++ P PT  K +++FLG  G+YR+FI +F+ IAKP++  L+   K    +     AF  LK  +   PI+ VPD+
Subjt:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKF-IFDDACLQAFNTLKERLIAAPIIIVPDW

Query:  SQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQ
        ++ F +  DASD ALGAVL Q        + Y SRTL+  +  Y+T EKELLA+V+A    R YLLG    + +DH  L +L+  KD   +L RW + L 
Subjt:  SQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQ

Query:  EFDLEIKDRKGCENVVADHLSRIENEE
        EFD +IK  KG EN VAD LSRI+ EE
Subjt:  EFDLEIKDRKGCENVVADHLSRIENEE

P0CT41 Transposon Tf2-12 polyprotein1.5e-7626.88Show/hide
Query:  LNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCF
        L P   + +  E+ + L +GII   S +    PV  VPKK G                    R+ +DY+ LNK  + + +PLP I+Q+L ++ G + +  
Subjt:  LNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCF

Query:  LDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEK
        LD    Y+ I +   D+ K  F CP G F +  MP+G+  APA FQ  + +I     E  +  +MDD  +   S    + ++  VLQ+ ++ANL++N  K
Subjt:  LDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEK

Query:  CHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIII
        C F  ++   +G+ +S+KG    +  I  + Q   P N K +R FLG   + R+FI   S++  PL+NLL+ + ++ +     QA   +K+ L++ P++ 
Subjt:  CHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIII

Query:  VPDWSQSFEIMCDASDYALGAVLGQRR-DSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGS--KIVVHTDHAALKYLFVKKDSKP---
          D+S+   +  DASD A+GAVL Q+  D  +  + Y S  +   Q  Y+ S+KE+LA++ ++   R YL  +     + TDH  L    +  +S+P   
Subjt:  VPDWSQSFEIMCDASDYALGAVLGQRR-DSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGS--KIVVHTDHAALKYLFVKKDSKP---

Query:  RLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQL---YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWE
        RL RW L LQ+F+ EI  R G  N +AD LSRI +E      PI +   D  +    Q+  +  +   +V          ++   + KR   N++    +
Subjt:  RLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQL---YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWE

Query:  DPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHEL--PTKPILEVEL-FDVW
        D LL  + + + I      +   +I+   H      H G       ++  RF W  + K    +V++C  CQ   N SR H+   P +PI   E  ++  
Subjt:  DPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHEL--PTKPILEVEL-FDVW

Query:  GIDFMGPFPISFNDYLYILVGVDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAE
         +DF+   P S + Y  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F +  +K    KYN   K +  Y PQT+G  E
Subjt:  GIDFMGPFPISFNDYLYILVGVDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAE

Query:  LSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAK
         +N+ ++++L  V   +   W   +     +Y  A  +    +P+ +V            HR   A+  L  +     +K      E  +      E+  
Subjt:  LSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAK

Query:  LYKERTARWHDKKITS-RTFLSGQRVLLFNSRLRLF--PGKLRTRWSGPFVIVKVSPHGAVEL
            +  ++ D KI     F  G  V++  ++        KL   ++GPF +++ S     EL
Subjt:  LYKERTARWHDKKITS-RTFLSGQRVLLFNSRLRLF--PGKLRTRWSGPFVIVKVSPHGAVEL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.1e-9029.47Show/hide
Query:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGYL
        ++ + K V KLL+   I   S SP  SPV +VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD + 
Subjt:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGYL

Query:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV
        GY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F+ L    + V++DD  +F  S +    +L  VL+R ++ NL++  +KC F  
Subjt:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV

Query:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIIIVPDWS
         E   LG+ +  + +   + K  AI   P P  VK+ + FLG   +YRRFI + SKIA+P+   + ++++  + +   +A   LK  L  +P+++  +  
Subjt:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIIIVPDWS

Query:  QSFEIMCDASDYALGAVLGQ--RRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL
         ++ +  DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+   R  L G    + TDH +L  L  K +   R+ RW+  L
Subjt:  QSFEIMCDASDYALGAVLGQ--RRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL

Query:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS
          +D  ++   G +NVVAD +SR    I  E     + +SW    +  P     L  +K+                 P DM    +YQ+K        K+
Subjt:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL
        Y  ED ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YW  L      ++++C +CQ    +  R H  L   PI E   
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL

Query:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN
         D+  +DF+   P + N+   ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD       + ++ + ++  +   +++A HPQT+
Subjt:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF
        G +E + + + ++L   V  N ++W + L    + Y +     +G SP+ +  G   + P      E+  R++ A+       E A   + L +   E+ 
Subjt:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF

Query:  RAQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLK
             E+A++  E       K +       G  VL+       +    K++  + GPF +VK     A EL  N+       +NVQ LK
Subjt:  RAQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLK

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.2e-7637.47Show/hide
Query:  MKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGY
        M+  V++++ +LL  GII   S+SP+ SP+ +VPKK        EK           +R+ +D+++LN  T  D +P+P I+  L  L    ++  LD  
Subjt:  MKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGY

Query:  LGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFM
         G++QI +   D  KT F+   G + F R+PFGL NAPA FQR +  I +  I  +  V++DD  VF   +D+   NL  VL     ANL +N EK HF+
Subjt:  LGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFM

Query:  VTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLL-----------ENEAKFIFDDACLQAFNTLKERL
         T+   LG+ V+  G++ D  K+ AI ++PPPT+VK ++ FLG   +YR+FI+D++K+AKPL+NL             ++     D+  LQ+FN LK  L
Subjt:  VTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLL-----------ENEAKFIFDDACLQAFNTLKERL

Query:  IAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGS-KIVVHTDHAALKYLFVKKDS
         ++ I+  P +++ F +  DAS++A+GAVL Q      R I Y SR+L+ T++ Y T EKE+LA+++++D LR+YL G+  I V+TDH  L +    ++ 
Subjt:  IAAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGS-KIVVHTDHAALKYLFVKKDS

Query:  KPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI
          +L RW   ++E++ E+  + G  NVVAD LSRI
Subjt:  KPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.3e-9129.26Show/hide
Query:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGYL
        ++ + K V KLL+   I   S SP  SPV +VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD + 
Subjt:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAGYSHYCFLDGYL

Query:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV
        GY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F+ L    + V++DD  +F  S +    +L  VL+R ++ NL++  +KC F  
Subjt:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV

Query:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIIIVPDWS
         E   LG+ +  + +   + K  AI   P P  VK+ + FLG   +YRRFI + SKIA+P+   + ++++  + +   +A + LK+ L  +P+++  +  
Subjt:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIIIVPDWS

Query:  QSFEIMCDASDYALGAVLGQ--RRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL
         ++ +  DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+   R  L G    + TDH +L  L  K +   R+ RW+  L
Subjt:  QSFEIMCDASDYALGAVLGQ--RRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL

Query:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS
          +D  ++   G +NVVAD +SR    I  E     + +SW    +  P     L  +K+                 P DM    +YQ+K        K+
Subjt:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL
        Y  ED ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YW  L      ++++C +CQ    +  R H  L   PI E   
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL

Query:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN
         D+  +DF+   P + N+   ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD       + ++ + ++  +   +++A HPQT+
Subjt:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF
        G +E + + + ++L      N ++W + L    + Y +     +G SP+ +  G   + P      E+  R++ A+       E A   + L +   E+ 
Subjt:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF

Query:  RAQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLKHYI
             E+A++  E       K +       G  VL+       +    K++  + GPF +VK     A EL  N+       +NVQ LK ++
Subjt:  RAQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLKHYI

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein1.7e-1460.71Show/hide
Query:  VLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEVELFDVWGIDFM
        VLQ+ FYW + FKD + FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM
Subjt:  VLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEVELFDVWGIDFM

ATMG00860.1 DNA/RNA polymerases superfamily protein2.0e-1535.38Show/hide
Query:  NLTRVLQRCQDANLVLNWEKCHFMVTEGIILGHK--VSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIF
        +L  VLQ  +      N +KC F   +   LGH+  +S +G+  D AK+ A+   P P N   +R FLG  G+YRRF+K++ KI +PL+ LL+  +    
Subjt:  NLTRVLQRCQDANLVLNWEKCHFMVTEGIILGHK--VSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIF

Query:  DDACLQAFNTLKERLIAAPIIIVPDWSQSF
        + A L AF  LK  +   P++ +PD    F
Subjt:  DDACLQAFNTLKERLIAAPIIIVPDWSQSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAGAATCTTGATGGAAGATACCTTTAAACCATCAATAGAGCACCAAAGGAGACTTAATCCCACCATGAAGGAGGTAGTTAAGAAAGAAGTGCTCAAGCTTTTGAA
TGCAGGAATCATATATGCCATATCTGACAGTCCATGGGTAAGTCCTGTTCAAGTGGTACCCAAGAAAGGTGGCATGACTGTGGTTAAAAATGAAAAAGATGAAATGATAG
CCACTAGAACTGTGACAGGATGGCGTGTATGCATTGATTATAGGAAGTTGAACAAAGCTACTAGGAAAGATCATTTTCCCCTTCCTTTTATAGATCAAATGCTTGATAGA
CTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTTAGGTTATAACCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAACGACATTTACATGTCCCTATGGGAC
ATTTGCTTTTAAAAGAATGCCTTTTGGTTTATGTAATGCCCCTGCTACTTTCCAACGCTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTA
TGGATGATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACTCGTGTTTTGCAGAGGTGCCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGT
CACTTCATGGTGACGGAAGGCATCATCCTGGGGCACAAAGTCTCTCAAAAGGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCCCCCACCAA
TGTGAAAAGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAATGAAGCTA
AATTTATTTTTGATGATGCATGTCTACAAGCTTTTAATACTCTGAAGGAAAGACTAATTGCTGCACCTATTATTATTGTACCAGATTGGAGTCAGTCTTTTGAGATCATG
TGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAGCATGTTTAGGGCAATTTATTATGCTAGTAGGACTCTTGATAATACTCAACAGAAATA
CACTACTTCTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAACTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGATCATGCTGCTTTAAAGT
ACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCA
GACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGC
TGACATAGTTAATTATCTTGCAGGAGGACATTTGCCTCCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATGTTAAATCTTACCATTGGGAGGACCCACTTC
TCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCCCCTTATGGAGGTCATTTTGGACCC
ACCAGAACTGCAGCTAAGGTACTTCAGTCACGATTTTATTGGCTATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATAT
TTCTAGACAACATGAGCTTCCAACGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCTATTTCTTTTAATGATTACCTAT
ACATTCTAGTTGGAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGT
TTTGGTACACCTAGAGCTATTATTAGTGATGAGGGCTCTCACTTTTGCAATAATTTGTTTAAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAGCTTA
TCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGGTAGTCAAGAACAATAGGAAGGATTGGGCCCTTAAGCTCGATGATG
CACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCTTAT
TGGGCCATCAAGAAGCTGAACATGGATTTTGAGAAAGCTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACT
TTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACATCACGAACCTTCCTTTCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGTA
AGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGTAACAATGGAACAACCTTCAAAGTGAATGTTCAACGA
TTAAAGCACTACATCGGTGATGAAGAACGTGGACTTGAGAACCTGACTTTCATTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAGAATCTTGATGGAAGATACCTTTAAACCATCAATAGAGCACCAAAGGAGACTTAATCCCACCATGAAGGAGGTAGTTAAGAAAGAAGTGCTCAAGCTTTTGAA
TGCAGGAATCATATATGCCATATCTGACAGTCCATGGGTAAGTCCTGTTCAAGTGGTACCCAAGAAAGGTGGCATGACTGTGGTTAAAAATGAAAAAGATGAAATGATAG
CCACTAGAACTGTGACAGGATGGCGTGTATGCATTGATTATAGGAAGTTGAACAAAGCTACTAGGAAAGATCATTTTCCCCTTCCTTTTATAGATCAAATGCTTGATAGA
CTTGCAGGTTATTCTCATTATTGCTTTCTAGATGGATATTTAGGTTATAACCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAACGACATTTACATGTCCCTATGGGAC
ATTTGCTTTTAAAAGAATGCCTTTTGGTTTATGTAATGCCCCTGCTACTTTCCAACGCTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTA
TGGATGATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACTCGTGTTTTGCAGAGGTGCCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGT
CACTTCATGGTGACGGAAGGCATCATCCTGGGGCACAAAGTCTCTCAAAAGGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCCCCCACCAA
TGTGAAAAGAGTCAGAAGCTTCCTAGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAATGAAGCTA
AATTTATTTTTGATGATGCATGTCTACAAGCTTTTAATACTCTGAAGGAAAGACTAATTGCTGCACCTATTATTATTGTACCAGATTGGAGTCAGTCTTTTGAGATCATG
TGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAGCATGTTTAGGGCAATTTATTATGCTAGTAGGACTCTTGATAATACTCAACAGAAATA
CACTACTTCTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAACTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGATCATGCTGCTTTAAAGT
ACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCA
GACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGC
TGACATAGTTAATTATCTTGCAGGAGGACATTTGCCTCCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATGTTAAATCTTACCATTGGGAGGACCCACTTC
TCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCCCCTTATGGAGGTCATTTTGGACCC
ACCAGAACTGCAGCTAAGGTACTTCAGTCACGATTTTATTGGCTATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATAT
TTCTAGACAACATGAGCTTCCAACGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCTATTTCTTTTAATGATTACCTAT
ACATTCTAGTTGGAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGT
TTTGGTACACCTAGAGCTATTATTAGTGATGAGGGCTCTCACTTTTGCAATAATTTGTTTAAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAGCTTA
TCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGGTAGTCAAGAACAATAGGAAGGATTGGGCCCTTAAGCTCGATGATG
CACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCTTAT
TGGGCCATCAAGAAGCTGAACATGGATTTTGAGAAAGCTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACT
TTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACATCACGAACCTTCCTTTCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGTA
AGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGTAACAATGGAACAACCTTCAAAGTGAATGTTCAACGA
TTAAAGCACTACATCGGTGATGAAGAACGTGGACTTGAGAACCTGACTTTCATTGCATGA
Protein sequenceShow/hide protein sequence
MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDR
LAGYSHYCFLDGYLGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKC
HFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKRVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIAAPIIIVPDWSQSFEIM
CDASDYALGAVLGQRRDSMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKLRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVA
DHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGP
TRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTR
FGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAY
WAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITSRTFLSGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQR
LKHYIGDEERGLENLTFIA