; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cla97C02G036177 (gene) of Watermelon (97103) v2.5 genome

Gene IDCla97C02G036177
OrganismCitrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
DescriptionReverse transcriptase
Genome locationCla97Chr02:16185882..16188581
RNA-Seq ExpressionCla97C02G036177
SyntenyCla97C02G036177
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_012833379.1 PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata]0.0e+0072.14Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL  + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DKFR Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT  NDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LVFGKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F  G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata]0.0e+0072.14Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL  + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DKFR Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F  G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_012846413.1 PREDICTED: uncharacterized protein LOC105966405 [Erythranthe guttata]0.0e+0072.14Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL  + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VLQRC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DKFR Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  +E  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F  G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_012847037.1 PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata]0.0e+0072.14Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        H+ILMED +KP ++ QRRLNP+MKEVVKKEVLKLL AG+IYAISDSPWVSPVQVVPK+GGMTV+ N+K+E+I +RTVTGWRVC DYR LN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQMLDRL  + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +EVFMDDFSVFGSSFD C+ NL
Subjt:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC + NLVLNWEKCHFMV EGI+LGHKVS+KGLEVDRAKI  IE+LPPP +VK VRSFLGHAGFYRRFIKDFSKI KPL +LLE EA F FD AC
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA
        LQAF  LKE+L  +PI+I P+W + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+T+EKE+LAVV+A+DKFR Y+LGS+++++TDHAA
Subjt:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL
        ++YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADHLSR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FL
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV
        H+ + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G +RTAAKVLQS F+W +LF+D Y FVK CDRCQRTGN+S + ++P   + EV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEV

Query:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPFP S N  LYIL+ VDYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAIISDEGSHFCN L  ++  K  + HKIA AYHPQT
Subjt:  ELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA
        NGLAELSNREIKQ+LEK V  NRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  A
Subjt:  NGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQA

Query:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY
        YENAK+YKE+T +WHDK+IT R F  G +VLLFNSRLRLFPGKL++RWSGPFV++  +P G +E++G +G +FKVN QR+KHY
Subjt:  YENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHY

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]0.0e+0075.56Show/hide
Query:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP
        MH+ILME+ +KPSIEHQRRLNP MKEVV+ E+LKLLNAGIIYAISDS WVSPVQVVPKKGGMTVVKN+ +E I TRTVTGWRVC+DYRKLNKATRKDHFP
Subjt:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP

Query:  LPFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN
        LPFIDQMLDRLA YS+YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMM+IF  ++EDIME+FMDDFSVFG+SFD CL N
Subjt:  LPFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN

Query:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA
        L  VLQRC+D NLVLNWEKCHFMV EGI+LGH+VS KG+EVDRAKI  IE+LPPP NVK +RSFLGHAGFYRRFIKDFSK++KPL NLLE  + F FDD 
Subjt:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA

Query:  CLQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHA
        CLQAFN +KE+LI AP++ VPDWSQ FE+MCDASD+ALGAVLGQRRD +FRAIYYASRTL+  Q  YTT+EKE+LAVVFA DKFRSYL+ +K++V TDHA
Subjt:  CLQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHA

Query:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRF
        AL+YLF KKD+KPRL+RWILLLQEFDLE++D+KG EN VADHLSR+E EE +    I E FPDEQL+  +  LPW+ADIVN+LA   LPPD+ Y Q+K+F
Subjt:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE
        LH+VK Y W++PLL+K C D +IR+CVP+EE+ +IL+ CH+S YGGHFG TRTAAKVLQS F+W S+F+D YT VK+CDRCQR GNISR+ ELP K ILE
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE

Query:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ
        VELFDVWGIDFMGPFP SF  ++YIL+ VDYVSKWVEAIAT TNDA+ V+KFLHKNIFTRFGTPRAIISDEG+HFCN LF +++ KY V HKIA AYHPQ
Subjt:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ
        TNG AE+SNREIK +LEK V  NRKDWA KLDDALWAYRTAFKTPIG SPYRLVFGKACHLPVELEH+AYWA+KK N+D + AGEKRLL+LNEM+EFR  
Subjt:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ

Query:  AYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE
        AYENAK+YKERT +WHDK+I  R F PGQ+VLLFNSRL+LFPGKLR+RW+GP+ I KVS  GA++L+   G  F+VN QRLKHY G++
Subjt:  AYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase0.0e+0070.56Show/hide
Query:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP
        MH+IL+ED  KPS+E QRRLNP MKEVVKKE++K L+AGIIY ISDS WVSPVQ VPKKGG+TVV N  +E+I TRTVTGWRVC+DYRKLNKATRKDHFP
Subjt:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP

Query:  LPFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN
        L FIDQMLDRLA    YCFLDGYSGYNQI IAPEDQEK TFTCPYGTFAF+RMPFGLCNAPATFQRCMM+IF  ++E+ +EVFMDDFSV+G+SFD CL N
Subjt:  LPFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN

Query:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA
        L+ VL+RC+D NL+LNWEKCHFMV EGI+LGHKVS +G+EVD+AK+  IE+LPPPT+VK VRSFLGHAGFYRRFIKDFSKI+KPL NLLE +  F FDDA
Subjt:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA

Query:  CLQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHA
        C  AFN LK RLI APII VPDWS  FE+MCDASD+A+GAVLGQR+D +FR+IYYAS+TL++ Q  YTT+EKELLAVVFA DKFRSYL+G+K++V+TDHA
Subjt:  CLQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHA

Query:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFPDEQLYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKK
        A++YL  KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADHLSR+E+      P ++ + FPDEQL   V   +PW+ADIVNYL  G +P D++ QQKK
Subjt:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIV-EKFPDEQLYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKK

Query:  RFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPI
        +FL + + Y W+DP L+K   DN++R+CVP+ E+  IL  CHASPYGGHF   RTAAK+LQS F+W +LFKD ++FV +CDRCQRTGNISR+HE+P   I
Subjt:  RFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPI

Query:  LEVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYH
        LEVELFDVWGIDFMGPF  SF + +YILV VDYVSKWVEA A   ND++ VV F+ KNIFTRFGTPRAIISD G+HFCN  F++++ KY V HKI+T YH
Subjt:  LEVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYH

Query:  PQTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFR
        PQT+G  E+SNREIK++LEK V + RKDW+ +LD+ALWAYRTA+KTPIG SPYRLVFGKACHLPVELEH AYWAI+KLN D + AGEKRLL+LNE++EFR
Subjt:  PQTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFR

Query:  AQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKVNVQRLKHYIGD
          AYENAK+YKE+  RWH+KKI  R F PGQ VLLFNSRL+LFPGKL++RWSGPF I +V PHGAVEL+  N    FKVN QR+KHY G+
Subjt:  AQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNG-TTFKVNVQRLKHYIGD

A0A4Y1RS99 Transposable element protein0.0e+0069.22Show/hide
Query:  MEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFID
        ME+  KPS+EHQRRLNP MKEVV+ EVLKLL+AGIIY ISDS WVSP QVVPKKGGMTVVKNE +E++ TRTVTGWRVCIDYRKLN ATRKDHFPLPFID
Subjt:  MEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFID

Query:  QMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVL
        QML+RLA +++YCFLDGYSGYNQI IAPEDQEKTTFTCP+GTFA++RMPFGLCNAPATFQRCMMSIF  ++E  +EVFMDDFSVFGSSFDSCL NL  VL
Subjt:  QMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVL

Query:  QRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAF
         RC++ NLVLNWEKCHFMV EGI+LGHK+S +G+EVDRAKI  IE+LPPP+ VK +RSFLGHAGFYRRFIKDFSKI KPL  LL  +++F FD  CL+AF
Subjt:  QRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAF

Query:  NTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL
        N LK +L  AP+I+ PDW   FEIMCDASDYA+GAVLGQR++ +   I+YASRTL++ Q  Y T+EKELLAVVFA+DKFRSYLLG+K++V+TDHAALK+L
Subjt:  NTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYL

Query:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFPDEQLYQVKDS----LPWFADIVNYLAGGHLPPDMNYQQKKR
          KK++KPRL+RW+LLLQEFD+EI+D+KG ENVVADHLSR+  E+E  +   PI+E FPDEQLY +  +     PW+AD VNYLA G LPPDM++ QKK+
Subjt:  FVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI--ENEEAKSWPPIVEKFPDEQLYQVKDS----LPWFADIVNYLAGGHLPPDMNYQQKKR

Query:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL
        FL  VK Y+W+DP L+K   D +IR+CVP+ E+  IL  CH    GGH+G ++T AKVLQS F+W +LFKD   FV  CD CQRTGNIS ++++P   IL
Subjt:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL

Query:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP
        EVELFDVWGIDFMGPFP S+ + LYILV VDYVSKWVEA A  TNDA+ VV+FL KNIFTRFG PRAIISD G+HFCN  F S++ KY + HK++T YHP
Subjt:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP

Query:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA
        QT+G  E+SNRE+K++LEK V  +RKDW+LKLDDALWAYRTAFK PIG SPYRLVFGKACHLPVELEH+A+WAIK LN D   AGEKR L+LNE+EE R 
Subjt:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA

Query:  QAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGN-NGTTFKVNVQRLKHYI
        ++YENAK+YK+RT +WHDK I  + F  GQ VLL+NSRL+LFPGKLR+RWSGPF ++ V P+G VE++ + +GTTFKVN  RLK Y+
Subjt:  QAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGN-NGTTFKVNVQRLKHYI

A0A5H2XID6 Reverse transcriptase0.0e+0068.6Show/hide
Query:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP
        MHRILME+  KPS + QRRLNP MKEVV+ EVLKLL+ GIIY ISDS WVSPVQVVPKK G+TVVKNEK+E++ TRT+TGWRVCIDYRKLN +TRKDHFP
Subjt:  MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFP

Query:  LPFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN
        LPFIDQMLDRL+ +++YCFLDG+SGYNQI IAPEDQEKTTFTCP+GTFA++RMPFGLCNAPATFQRCMM+IF  ++E  MEVFMDDFSVFGSSFD CL +
Subjt:  LPFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVN

Query:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA
        L+ VL RCQ+ NL+LNWEKCHFMV +GI+LGH VS KG++VD+AKI  I  LPPP++VK VRSFLGHAGFYRRFIK+FS I++PL NLL  +A F FD+ 
Subjt:  LTRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDA

Query:  CLQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHA
        C++AF TLK+ L  APIII PDWS  FEIMCDASD+A+GAVLGQ+++ +   I+YASRTL++ Q  Y+T+EKELLAVVFA++KFR YL+GSK++V++DHA
Subjt:  CLQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHA

Query:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI-ENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKR
        AL+YL  KKD+KPRL+RWILLLQEFDLEI+D+KGCENVVADHLSRI   E+ ++  P+ E FPDEQLY  +   PW+AD VNYLA G L  D+ YQ KK+
Subjt:  ALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRI-ENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKR

Query:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL
        F   VK Y W++P L+K C D +IR+CVP+EE  SIL   H    GGHFG  +TA K+LQS F+W +LFKD + F   CDRCQR GNISR++ELP K IL
Subjt:  FLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPIL

Query:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP
         VELFDVWGIDFMGPFP SF  Y YILV VDYVSKWVEAIAT+TND + V+KFL  NIFTRFGTPRA+ISD GSHFCN LF+++M+KYN+ H+++T YHP
Subjt:  EVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHP

Query:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA
        QT+G  E+SNREIKQ+LEKVV + RKDWA KL+DALWAYRTA+KTPIG SPYRLVFGKACHLP+ELEH A+WAIKKLN D +KAG  R  +LNE+EE R 
Subjt:  QTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRA

Query:  QAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQG-NNGTTFKVNVQRLKHYIGDEERGLE
        ++YENAKLYKERT  +HD+ I  + F  G  VLLFNSRLRLFPGKL++RW GPF +V VSP+GAVE+Q   +G+TFKVN QRLK +      G++
Subjt:  QAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQG-NNGTTFKVNVQRLKHYIGDEERGLE

A0A6P8CBX2 Reverse transcriptase0.0e+0070.27Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        HRI++E   KP ++ QRRLNPT+KEVVKKEVLKLL+AGIIY ISDS WVSPVQVVPKKGGMTVVKNE +++I TRTVTGWRVCIDYRKLN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQML++LA + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +E+FMDDFSVFG SF+SCL NL
Subjt:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC++ NL+LNWEKCHFMV EGI+LGHKVS+KG+EVDRAK+  IE+LPPPT+ K VRSFLGHAGFYRRFIKDFSKI++PL NLLE ++ F+F+D C
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA
        LQAFN LKE+L  AP+I+ P+W   FE+MCDASDYA+GAVLGQRR  +F AIYYASRTL+  Q+ Y T+EKELLAV+FA DKFR YL+GSKI+V+TDHAA
Subjt:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF
        LKYLF K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKFPDEQL+  +   LPW+ADIVNY+     P  ++ QQKK+F
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE
        LH+VK Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L   FYW  +F DC  ++ SC  CQRTGNISR+HE+P   IL 
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE

Query:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ
        +ELFDVWGIDFMGPFP SF++  YILV VDYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRAIISD GSHFCN  F+ ++ KY V HKIAT YHPQ
Subjt:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ
        T G  E+SNREIK++LEK V  +RKDW+LKLDDALWAYRTAFKTPIG SPY++V+GK+CHLPVELEH+AYWAIK LN D + AGEKRLL+LN+M E R +
Subjt:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ

Query:  AYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE
        AYENA++YKER  RWHD+ I  R FLPGQ+VLL+NSRL+LFPGKL++RWSGPFVI  V P+GAVEL+  +  TFKVN   LKHY   E
Subjt:  AYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE

A0A6P8DLJ8 Reverse transcriptase0.0e+0070.05Show/hide
Query:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL
        HRI++E   KP ++ QRRLNPT+KEVVKKEVLKLL+AGIIY ISDS WVSPVQVVPKKGGMTVVKNE +E+I TRTVTGWRVCIDYRKLN ATRKDHFPL
Subjt:  HRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPL

Query:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL
        PFIDQML++L  + +YCFLDGYSGYNQI IAPEDQEKTTFTCPYGTFAF+RMPFGLCNAPATFQRCMMSIF  ++E+ +E+FMDDFSVFG SF+SCL NL
Subjt:  PFIDQMLDRLAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNL

Query:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC
          VL+RC++ NL+LNWEKCHFMV EGI+LGHKVS+KG+EVDRAK+  IE+LPPPT+ K VRSFLGHAGFYRRFIKDFSKI++PL NLLE ++ F+F+D C
Subjt:  TRVLQRCQDANLVLNWEKCHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDAC

Query:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA
        LQAFN LKE+L  AP+I+ P+W   FE+MC ASDYA+GAVLGQRR  +F AIYYASRTL+  Q+ Y T+EKELLAV+FA DKFR YL+GSKI+V+TDHAA
Subjt:  LQAFNTLKERLIVAPIIIVPDWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAA

Query:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF
        LKYLF K D+KPRL+RWILLLQEFDLEI+D KG ENVVADHLSR+E++   S  PI EKFPDEQL+  +   LPW+ADIVNY+     P  ++ QQKK+F
Subjt:  LKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE
        LH+VK Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L   FYW  +F DC  ++ SC  CQRTGNISR+HE+P   IL 
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILE

Query:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ
        +ELFDVWGIDFMGPFP SF++  YILV VDYVSKWVEA+A ++NDAR V++FL KNIF+R G PRAIISD GSHFCN  F+ ++ KY V HKIAT YHPQ
Subjt:  VELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ
        T G  E+SNREIK++LEK V  +RKDW+LKLDDALWAYRTAFKTPIG SPY++V+GK+CHLPVELEH+AYWAIK LN D + AGEKRLL+LN+M E R +
Subjt:  TNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQ

Query:  AYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE
        AYENA++YKER  RWHD+ I  R FLPGQ+VLL+NSRL+LFPGKL++RWSGPFVI  V P+GAVEL+  +  TFKVN   LKHY   E
Subjt:  AYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQRLKHYIGDE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.7e-8140.75Show/hide
Query:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCFLDGYS
        ++ V+ ++  +LN GII   S+SP+ SP+ VVPKK   +  +              +R+ IDYRKLN+ T  D  P+P +D++L +L   +++  +D   
Subjt:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCFLDGYS

Query:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV
        G++QI + PE   KT F+  +G + + RMPFGL NAPATFQRCM  I + L+     V++DD  VF +S D  L +L  V ++   ANL L  +KC F+ 
Subjt:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV

Query:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKF-IFDDACLQAFNTLKERLIVAPIIIVPDW
         E   LGH ++  G++ +  KI AI++ P PT  K++++FLG  G+YR+FI +F+ IAKP++  L+   K    +     AF  LK  +   PI+ VPD+
Subjt:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKF-IFDDACLQAFNTLKERLIVAPIIIVPDW

Query:  SQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQ
        ++ F +  DASD ALGAVL Q        + Y SRTL+  +  Y+T EKELLA+V+A   FR YLLG    + +DH  L +L+  KD   +L RW + L 
Subjt:  SQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQ

Query:  EFDLEIKDRKGCENVVADHLSRIENEE
        EFD +IK  KG EN VAD LSRI+ EE
Subjt:  EFDLEIKDRKGCENVVADHLSRIENEE

P0CT34 Transposon Tf2-1 polyprotein2.4e-7727Show/hide
Query:  LNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCF
        L P   + +  E+ + L +GII   S +    PV  VPKK G                    R+ +DY+ LNK  + + +PLP I+Q+L ++   + +  
Subjt:  LNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCF

Query:  LDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEK
        LD  S Y+ I +   D+ K  F CP G F +  MP+G+  APA FQ  + +I     E  +  +MDD  +   S    + ++  VLQ+ ++ANL++N  K
Subjt:  LDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEK

Query:  CHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIII
        C F  ++   +G+ +S+KG    +  I  + Q   P N K++R FLG   + R+FI   S++  PL+NLL+ + ++ +     QA   +K+ L+  P++ 
Subjt:  CHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIII

Query:  VPDWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAALKYLFVKKDSKP---
          D+S+   +  DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ S+KE+LA++ ++  +R YL  +     + TDH  L    +  +S+P   
Subjt:  VPDWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAALKYLFVKKDSKP---

Query:  RLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQL---YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWE
        RL RW L LQ+F+ EI  R G  N +AD LSRI +E      PI +   D  +    Q+  +  +   +V          ++   + KR   N++    +
Subjt:  RLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQL---YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWE

Query:  DPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHEL--PTKPILEVEL-FDVW
        D LL  + + + I      +   +I+   H      H G       ++  RF W  + K    +V++C  CQ   N SR H+   P +PI   E  ++  
Subjt:  DPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHEL--PTKPILEVEL-FDVW

Query:  GIDFMGPFPISFNDYLYILVGVDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAE
         +DF+   P S + Y  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F +  +K    KYN   K +  Y PQT+G  E
Subjt:  GIDFMGPFPISFNDYLYILVGVDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAE

Query:  LSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAK
         +N+ ++++L  V   +   W   +     +Y  A  +    +P+ +V            HR   A+  L  +     +K      E  +      E+  
Subjt:  LSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAK

Query:  LYKERTARWHDKKITS-RTFLPGQRVLLFNSRLRLF--PGKLRTRWSGPFVIVKVSPHGAVEL
            +  ++ D KI     F PG  V++  ++        KL   ++GPF +++ S     EL
Subjt:  LYKERTARWHDKKITS-RTFLPGQRVLLFNSRLRLF--PGKLRTRWSGPFVIVKVSPHGAVEL

P0CT41 Transposon Tf2-12 polyprotein2.4e-7727Show/hide
Query:  LNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCF
        L P   + +  E+ + L +GII   S +    PV  VPKK G                    R+ +DY+ LNK  + + +PLP I+Q+L ++   + +  
Subjt:  LNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCF

Query:  LDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEK
        LD  S Y+ I +   D+ K  F CP G F +  MP+G+  APA FQ  + +I     E  +  +MDD  +   S    + ++  VLQ+ ++ANL++N  K
Subjt:  LDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEK

Query:  CHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIII
        C F  ++   +G+ +S+KG    +  I  + Q   P N K++R FLG   + R+FI   S++  PL+NLL+ + ++ +     QA   +K+ L+  P++ 
Subjt:  CHFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIII

Query:  VPDWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAALKYLFVKKDSKP---
          D+S+   +  DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ S+KE+LA++ ++  +R YL  +     + TDH  L    +  +S+P   
Subjt:  VPDWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAALKYLFVKKDSKP---

Query:  RLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQL---YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWE
        RL RW L LQ+F+ EI  R G  N +AD LSRI +E      PI +   D  +    Q+  +  +   +V          ++   + KR   N++    +
Subjt:  RLMRWILLLQEFDLEIKDRKGCENVVADHLSRIENEEAKSWPPIVEKFPDEQL---YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWE

Query:  DPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHEL--PTKPILEVEL-FDVW
        D LL  + + + I      +   +I+   H      H G       ++  RF W  + K    +V++C  CQ   N SR H+   P +PI   E  ++  
Subjt:  DPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHEL--PTKPILEVEL-FDVW

Query:  GIDFMGPFPISFNDYLYILVGVDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAE
         +DF+   P S + Y  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F +  +K    KYN   K +  Y PQT+G  E
Subjt:  GIDFMGPFPISFNDYLYILVGVDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAE

Query:  LSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAK
         +N+ ++++L  V   +   W   +     +Y  A  +    +P+ +V            HR   A+  L  +     +K      E  +      E+  
Subjt:  LSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAK

Query:  LYKERTARWHDKKITS-RTFLPGQRVLLFNSRLRLF--PGKLRTRWSGPFVIVKVSPHGAVEL
            +  ++ D KI     F PG  V++  ++        KL   ++GPF +++ S     EL
Subjt:  LYKERTARWHDKKITS-RTFLPGQRVLLFNSRLRLF--PGKLRTRWSGPFVIVKVSPHGAVEL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.9e-9129.7Show/hide
Query:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCFLDGYS
        ++ + K V KLL+   I   S SP  SPV +VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +S
Subjt:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCFLDGYS

Query:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV
        GY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F+ L    + V++DD  +F  S +    +L  VL+R ++ NL++  +KC F  
Subjt:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV

Query:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIIIVPDWS
         E   LG+ +  + +   + K  AI   P P  VK+ + FLG   +YRRFI + SKIA+P+   + ++++  + +   +A   LK  L  +P+++  +  
Subjt:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIIIVPDWS

Query:  QSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL
         ++ +  DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + TDH +L  L  K +   R+ RW+  L
Subjt:  QSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL

Query:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS
          +D  ++   G +NVVAD +SR    I  E     + +SW    +  P     L  +K+                 P DM    +YQ+K        K+
Subjt:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL
        Y  ED ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YW  L      ++++C +CQ    +  R H  L   PI E   
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL

Query:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN
         D+  +DF+   P + N+   ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD       + ++ + ++  +   +++A HPQT+
Subjt:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF
        G +E + + + ++L   V  N ++W + L    + Y +     +G SP+ +  G   + P      E+  R++ A+       E A   + L +   E+ 
Subjt:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF

Query:  RAQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLK
             E+A++  E       K +       G  VL+       +    K++  + GPF +VK     A EL  N+       +NVQ LK
Subjt:  RAQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLK

Q99315 Transposon Ty3-G Gag-Pol polyprotein9.9e-9229.48Show/hide
Query:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCFLDGYS
        ++ + K V KLL+   I   S SP  SPV +VPKK G                   +R+C+DYR LNKAT  D FPLP ID +L R+     +  LD +S
Subjt:  KEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDRLAVYSHYCFLDGYS

Query:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV
        GY+QI + P+D+ KT F  P G + +  MPFGL NAP+TF R M   F+ L    + V++DD  +F  S +    +L  VL+R ++ NL++  +KC F  
Subjt:  GYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKCHFMV

Query:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIIIVPDWS
         E   LG+ +  + +   + K  AI   P P  VK+ + FLG   +YRRFI + SKIA+P+   + ++++  + +   +A + LK+ L  +P+++  +  
Subjt:  TEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIIIVPDWS

Query:  QSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL
         ++ +  DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + TDH +L  L  K +   R+ RW+  L
Subjt:  QSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLL

Query:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS
          +D  ++   G +NVVAD +SR    I  E     + +SW    +  P     L  +K+                 P DM    +YQ+K        K+
Subjt:  QEFDLEIKDRKGCENVVADHLSR----IENE-----EAKSWPPIVEKFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHNV-KS

Query:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL
        Y  ED ++Y        R  VP ++  +++   H  + +GGHFG T T AK+    +YW  L      ++++C +CQ    +  R H  L   PI E   
Subjt:  YHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPTRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQR-TGNISRQHE-LPTKPILEVEL

Query:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN
         D+  +DF+   P + N+   ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD       + ++ + ++  +   +++A HPQT+
Subjt:  FDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF
        G +E + + + ++L      N ++W + L    + Y +     +G SP+ +  G   + P      E+  R++ A+       E A   + L +   E+ 
Subjt:  GLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIKKLNMDFEKAGEKRLLELNEMEEF

Query:  RAQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLKHYI
             E+A++  E       K +       G  VL+       +    K++  + GPF +VK     A EL  N+       +NVQ LK ++
Subjt:  RAQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRL--RLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFK--VNVQRLKHYI

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein1.7e-1460.71Show/hide
Query:  VLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEVELFDVWGIDFM
        VLQ+ FYW + FKD + FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM
Subjt:  VLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEVELFDVWGIDFM

ATMG00860.1 DNA/RNA polymerases superfamily protein2.0e-1535.38Show/hide
Query:  NLTRVLQRCQDANLVLNWEKCHFMVTEGIILGHK--VSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIF
        +L  VLQ  +      N +KC F   +   LGH+  +S +G+  D AK+ A+   P P N  ++R FLG  G+YRRF+K++ KI +PL+ LL+  +    
Subjt:  NLTRVLQRCQDANLVLNWEKCHFMVTEGIILGHK--VSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIF

Query:  DDACLQAFNTLKERLIVAPIIIVPDWSQSF
        + A L AF  LK  +   P++ +PD    F
Subjt:  DDACLQAFNTLKERLIVAPIIIVPDWSQSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAGAATCTTGATGGAAGATACCTTTAAACCATCAATAGAGCACCAAAGGAGACTTAATCCCACCATGAAGGAGGTAGTTAAGAAAGAAGTGCTCAAGCTTTTGAA
TGCAGGAATCATATATGCCATATCTGACAGTCCATGGGTAAGTCCTGTTCAAGTGGTACCCAAGAAAGGTGGCATGACTGTGGTTAAAAATGAAAAAGATGAAATGATAG
CCACTAGAACTGTGACAGGATGGCGTGTATGCATTGATTATAGGAAGTTGAACAAAGCTACTAGGAAAGATCATTTTCCCCTTCCTTTTATAGATCAAATGCTTGATAGA
CTTGCAGTTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTATAACCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAACGACATTTACATGTCCCTATGGGAC
ATTTGCTTTTAAAAGAATGCCTTTTGGTTTATGTAATGCCCCTGCTACTTTCCAACGCTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTA
TGGATGATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACTCGTGTTTTGCAGAGGTGCCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGT
CACTTCATGGTGACGGAAGGCATCATCCTGGGGCACAAAGTCTCTCAAAAGGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCCCCCACCAA
TGTGAAAAAAGTCAGAAGCTTCCTTGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAATGAAGCTA
AATTTATTTTTGATGATGCATGTCTACAAGCTTTTAATACTCTGAAGGAAAGACTAATTGTTGCACCTATTATTATTGTACCAGATTGGAGTCAGTCTTTTGAGATCATG
TGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATGCTAGTAGGACTCTTGATAATACTCAACAGAAATA
CACTACTTCTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGATCATGCTGCTTTAAAGT
ACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAACGTGGTTGCA
GACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGC
TGACATAGTTAATTATCTTGCAGGAGGACATTTGCCTCCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATGTTAAATCTTACCATTGGGAGGACCCACTTC
TCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCCCCTTATGGAGGTCATTTTGGACCC
ACCAGAACTGCAGCTAAGGTACTTCAGTCACGATTTTATTGGCTATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATAT
TTCTAGACAACATGAGCTTCCAACGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCTATTTCTTTTAATGATTACCTAT
ACATTCTAGTTGGAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGT
TTTGGTACACCTAGAGCTATTATTAGTGATGAGGGCTCTCACTTTTGCAATAATTTGTTTAAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAGCTTA
TCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGGTAGTCAAGAACAATAGGAAGGATTGGGCCCTTAAGCTCGATGATG
CACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCTTAT
TGGGCCATCAAGAAGCTGAACATGGATTTTGAGAAAGCTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACT
TTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACATCACGAACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGTA
AGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGTAACAATGGAACAACCTTCAAAGTGAATGTTCAACGA
TTAAAGCACTACATCGGTGATGAAGAACGTGGACTTGAGAACCTGACTTTCATTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAGAATCTTGATGGAAGATACCTTTAAACCATCAATAGAGCACCAAAGGAGACTTAATCCCACCATGAAGGAGGTAGTTAAGAAAGAAGTGCTCAAGCTTTTGAA
TGCAGGAATCATATATGCCATATCTGACAGTCCATGGGTAAGTCCTGTTCAAGTGGTACCCAAGAAAGGTGGCATGACTGTGGTTAAAAATGAAAAAGATGAAATGATAG
CCACTAGAACTGTGACAGGATGGCGTGTATGCATTGATTATAGGAAGTTGAACAAAGCTACTAGGAAAGATCATTTTCCCCTTCCTTTTATAGATCAAATGCTTGATAGA
CTTGCAGTTTATTCTCATTATTGCTTTCTAGATGGATATTCAGGTTATAACCAAATTGTCATTGCTCCTGAGGATCAAGAAAAAACGACATTTACATGTCCCTATGGGAC
ATTTGCTTTTAAAAGAATGCCTTTTGGTTTATGTAATGCCCCTGCTACTTTCCAACGCTGTATGATGTCTATTTTTCAGGGGCTCATCGAAGACATAATGGAGGTTTTTA
TGGATGATTTTTCTGTATTTGGGTCTTCATTTGATTCTTGTCTTGTTAATCTAACTCGTGTTTTGCAGAGGTGCCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGT
CACTTCATGGTGACGGAAGGCATCATCCTGGGGCACAAAGTCTCTCAAAAGGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCCCCCACCAA
TGTGAAAAAAGTCAGAAGCTTCCTTGGGCATGCAGGGTTTTACAGGAGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAATGAAGCTA
AATTTATTTTTGATGATGCATGTCTACAAGCTTTTAATACTCTGAAGGAAAGACTAATTGTTGCACCTATTATTATTGTACCAGATTGGAGTCAGTCTTTTGAGATCATG
TGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCAATTTATTATGCTAGTAGGACTCTTGATAATACTCAACAGAAATA
CACTACTTCTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGATCATGCTGCTTTAAAGT
ACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAACGTGGTTGCA
GACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCCCAATTGTTGAGAAGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTTTGC
TGACATAGTTAATTATCTTGCAGGAGGACATTTGCCTCCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAATGTTAAATCTTACCATTGGGAGGACCCACTTC
TCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCAAGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCCCCTTATGGAGGTCATTTTGGACCC
ACCAGAACTGCAGCTAAGGTACTTCAGTCACGATTTTATTGGCTATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATAT
TTCTAGACAACATGAGCTTCCAACGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCTATTTCTTTTAATGATTACCTAT
ACATTCTAGTTGGAGTAGATTATGTATCTAAGTGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGT
TTTGGTACACCTAGAGCTATTATTAGTGATGAGGGCTCTCACTTTTGCAATAATTTGTTTAAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAGCTTA
TCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGGTAGTCAAGAACAATAGGAAGGATTGGGCCCTTAAGCTCGATGATG
CACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCTTAT
TGGGCCATCAAGAAGCTGAACATGGATTTTGAGAAAGCTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAACT
TTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATCACATCACGAACCTTCCTTCCAGGACAAAGAGTATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGTA
AGCTTAGGACACGATGGTCGGGACCCTTTGTCATTGTCAAGGTATCCCCACACGGAGCCGTGGAACTACAAGGTAACAATGGAACAACCTTCAAAGTGAATGTTCAACGA
TTAAAGCACTACATCGGTGATGAAGAACGTGGACTTGAGAACCTGACTTTCATTGCATGA
Protein sequenceShow/hide protein sequence
MHRILMEDTFKPSIEHQRRLNPTMKEVVKKEVLKLLNAGIIYAISDSPWVSPVQVVPKKGGMTVVKNEKDEMIATRTVTGWRVCIDYRKLNKATRKDHFPLPFIDQMLDR
LAVYSHYCFLDGYSGYNQIVIAPEDQEKTTFTCPYGTFAFKRMPFGLCNAPATFQRCMMSIFQGLIEDIMEVFMDDFSVFGSSFDSCLVNLTRVLQRCQDANLVLNWEKC
HFMVTEGIILGHKVSQKGLEVDRAKIVAIEQLPPPTNVKKVRSFLGHAGFYRRFIKDFSKIAKPLSNLLENEAKFIFDDACLQAFNTLKERLIVAPIIIVPDWSQSFEIM
CDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTSEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVA
DHLSRIENEEAKSWPPIVEKFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGP
TRTAAKVLQSRFYWLSLFKDCYTFVKSCDRCQRTGNISRQHELPTKPILEVELFDVWGIDFMGPFPISFNDYLYILVGVDYVSKWVEAIATRTNDARTVVKFLHKNIFTR
FGTPRAIISDEGSHFCNNLFKSMMQKYNVNHKIATAYHPQTNGLAELSNREIKQVLEKVVKNNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAY
WAIKKLNMDFEKAGEKRLLELNEMEEFRAQAYENAKLYKERTARWHDKKITSRTFLPGQRVLLFNSRLRLFPGKLRTRWSGPFVIVKVSPHGAVELQGNNGTTFKVNVQR
LKHYIGDEERGLENLTFIA