; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G10690 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G10690
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionReverse transcriptase
Genome locationClcChr02:16598530..16603770
RNA-Seq ExpressionClc02G10690
SyntenyClc02G10690
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833156.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]1.2e-29482.84Show/hide
Query:  LEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYAL
        LEVDR  I AIEQLPPPTNVKGV SF+ H GFY RFIKDFSKIAKPLSNLLEK+AKFIFDDACLLAFNTLKERLI APIIVV +WSQSFEIMCDASDYAL
Subjt:  LEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYAL

Query:  GAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENV
        GAVLGQRRDNMFRAIYYASRTLDNTQ+KYTTTEKELLAVVFA+DKFRSYLL SKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENV
Subjt:  GAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENV

Query:  VADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNS
        VADH+SRIENE+AKSWPPIVEMFPDEQLYQVKDS+PWFADIVNYLAGGH PPDMNYQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNS
Subjt:  VADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNS

Query:  CHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEA
        CHASPYGGHFGP RTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFMGPFPIS                    
Subjt:  CHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEA

Query:  IATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAY
                                                        YNVNHKIATAYHPQTNGL ELSNREIKQVLEK VKTNRKDWALKLDDALWAY
Subjt:  IATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAY

Query:  RTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAYENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRL
        RTAFKTPIGTS YRLVFGKACHLPVELEHRAYWAIKKLNMDF KAGEKRLLELNEMEEFRAQAYENAK+YK+RT RWHDKKIT  TFLP QR+LLFNSRL
Subjt:  RTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAYENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRL

Query:  RLFPGVKRAAFA
        RLFPG  R  ++
Subjt:  RLFPGVKRAAFA

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata]1.1e-28271.45Show/hide
Query:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL
        L+L RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGH GFYRRFIKDFSKI KPL +LLEKEA F FD ACL
Subjt:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL

Query:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL
         AF  LKE+L  +PI++ PNW + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  QK Y+TTEKE+LAVV+A+DKFR Y+L S+++++TDHAA+
Subjt:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL

Query:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH
        +YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADH+SR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G  P D++Y QKK+FLH
Subjt:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH

Query:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
        + + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G  RTAAKVLQSGF+WP+LF+D Y FVK CDRCQRTGN+S + ++P+  + EVE
Subjt:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE

Query:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN
        LFDVWGIDFMGPFP S NG LYIL+A+DYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAI+SDEGSHFCNKL  ++  K  + HKIA AYHPQTN
Subjt:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY
        GL ELSNREIKQ+LEKTV TNRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AY
Subjt:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY

Query:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG
        ENAKIYKE+T +WHDK+IT R F  G ++LLFNSRLRLFPG
Subjt:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG

XP_012847037.1 PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata]1.1e-28271.45Show/hide
Query:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL
        L+L RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGH GFYRRFIKDFSKI KPL +LLEKEA F FD ACL
Subjt:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL

Query:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL
         AF  LKE+L  +PI++ PNW + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  QK Y+TTEKE+LAVV+A+DKFR Y+L S+++++TDHAA+
Subjt:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL

Query:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH
        +YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADH+SR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G  P D++Y QKK+FLH
Subjt:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH

Query:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
        + + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G  RTAAKVLQSGF+WP+LF+D Y FVK CDRCQRTGN+S + ++P+  + EVE
Subjt:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE

Query:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN
        LFDVWGIDFMGPFP S NG LYIL+A+DYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAI+SDEGSHFCNKL  ++  K  + HKIA AYHPQTN
Subjt:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY
        GL ELSNREIKQ+LEKTV TNRKDWALKLDDALWAYRTAFKTPIG SPY+LV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AY
Subjt:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY

Query:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG
        ENAKIYKE+T +WHDK+IT R F  G ++LLFNSRLRLFPG
Subjt:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG

XP_012858910.1 PREDICTED: uncharacterized protein LOC105978045 [Erythranthe guttata]8.3e-28371.61Show/hide
Query:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL
        L+L RC + NLVLNWEKCHFMV EGIVLGHKVSKKGLEVDRAKI  IE+LPPP +VKGVRSFLGH GFYRRFIKDFSKI KPL +LLEKEA F FD ACL
Subjt:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL

Query:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL
         AF  LKE+L  +PI++ PNW + FEIMCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  QK Y+TTEKE+LAVV+A+DKFR Y+L S+++++TDHAA+
Subjt:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL

Query:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH
        +YLF KKD+KPRL+RW+LLLQEFDLEI+D+KG ENVVADH+SR+  EE  +   I E FPDEQL  +    PW+AD+ N+LA G  P D+ Y QKK+FLH
Subjt:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH

Query:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
        + + Y W++PLL++   D +IR+CVP+ EV  IL  CH+SP GGH G  RTAAKVLQSGF+WP+LF+D Y FVK CDRCQRTGN+S + ++P+  + EVE
Subjt:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE

Query:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN
        LFDVWGIDFMGPFP S NG LYIL+A+DYVSKWVEAIAT TNDARTV+KF HKNIF+RFGTPRAI+SDEGSHFCNKL  ++  K  + HKIA AYHPQTN
Subjt:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY
        GLVELSNREIKQ+LEKTV TNRKDWALKLDDALWAYRTAFKTPIG SPY+LVFGKACHLPVELEHRAYWA+KKLN D    G++RLL+LNE+EEFR  AY
Subjt:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY

Query:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG
        ENAKIYKE+T +WHDK+IT R F  G ++LLFNSRLRLFPG
Subjt:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]1.6e-29473.49Show/hide
Query:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL
        L+L RC+D NLVLNWEKCHFMV EGIVLGH+VS KG+EVDRAKI  IE+LPPP NVKG+RSFLGH GFYRRFIKDFSK++KPL NLLEK + F FDD CL
Subjt:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL

Query:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL
         AFN +KE+LI AP++ VP+WSQ FE+MCDASD+ALGAVLGQRRD +FRAIYYASRTL+  Q  YTTTEKE+LAVVFA DKFRSYL+C+K++V TDHAAL
Subjt:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL

Query:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH
        +YLF KKD+KPRL+RWILLLQEFDLE++D+KG EN VADH+SR+E EE +    I E FPDEQL+  +  +PW+ADIVN+LA    PPD+ Y Q+K+FLH
Subjt:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH

Query:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
        +VK Y W++PLL+K C D +IR+CVP+EE+ +IL+ CH+S YGGHFG  RTAAKVLQSGF+WPS+F+D YT VK+CDRCQR GNISR+ ELP+K ILEVE
Subjt:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE

Query:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN
        LFDVWGIDFMGPFP S+ G++YIL+A+DYVSKWVEAIAT TNDA+ V+KFLHKNIFTRFGTPRAI+SDEG+HFCNKLF +++ KY V HKIA AYHPQTN
Subjt:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY
        G  E+SNREIK +LEKTV TNRKDWA KLDDALWAYRTAFKTPIG SPYRLVFGKACHLPVELEH+AYWA+KK N+D   AGEKRLL+LNEM+EFR  AY
Subjt:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY

Query:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPGVKRA
        ENAKIYKERT +WHDK+I  R F PGQ++LLFNSRL+LFPG  R+
Subjt:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPGVKRA

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase5.4e-27269.16Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL
        +L RC+D NL+LNWEKCHFMV EGIVLGHKVS +G+EVD+AK+  IE+LPPPT+VKGVRSFLGH GFYRRFIKDFSKI+KPL NLLEK+  F FDDAC  
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL

Query:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALK
        AFN LK RLI APII VP+WS  FE+MCDASD+A+GAVLGQR+D +FR+IYYAS+TL++ Q  YTTTEKELLAVVFA DKFRSYL+ +K++V+TDHAA++
Subjt:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALK

Query:  YLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIV-EMFPDEQLYQ-VKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFL
        YL  KKD+KPRL+RW+LLLQEFDLEI+DRKG EN +ADH+SR+E+      P ++ + FPDEQL   V   VPW+ADIVNYL  G  P D++ QQKK+FL
Subjt:  YLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIV-EMFPDEQLYQ-VKDSVPWFADIVNYLAGGHFPPDMNYQQKKRFL

Query:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEV
         + + Y W+DP L+K   DN++R+CVP+ E+  IL  CHASPYGGHF   RTAAK+LQSGF+WP+LFKD ++FV +CDRCQRTGNISR+HE+P+  ILEV
Subjt:  HNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEV

Query:  ELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQT
        ELFDVWGIDFMGPF  S+ G +YILVA+DYVSKWVEA A   ND++ VV F+ KNIFTRFGTPRAI+SD G+HFCN+ F++++ KY V HKI+T YHPQT
Subjt:  ELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQT

Query:  NGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQA
        +G VE+SNREIK++LEKTV + RKDW+ +LD+ALWAYRTA+KTPIG SPYRLVFGKACHLPVELEH AYWAI+KLN D   AGEKRLL+LNE++EFR  A
Subjt:  NGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQA

Query:  YENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG
        YENAKIYKE+  RWH+KKI  R F PGQ +LLFNSRL+LFPG
Subjt:  YENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG

A0A4Y1RS99 Transposable element protein3.8e-26566.06Show/hide
Query:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL
        L+L RC++ NLVLNWEKCHFMV EGIVLGHK+S +G+EVDRAKI  IE+LPPP+ VKG+RSFLGH GFYRRFIKDFSKI KPL  LL K+++F FD  CL
Subjt:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL

Query:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL
         AFN LK +L  AP+I+ P+W   FEIMCDASDYA+GAVLGQR++ +   I+YASRTL++ Q  Y TTEKELLAVVFA+DKFRSYLL +K++V+TDHAAL
Subjt:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL

Query:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRI--ENEEAKSWPPIVEMFPDEQLYQVKDS----VPWFADIVNYLAGGHFPPDMNYQQ
        K+L  KK++KPRL+RW+LLLQEFD+EI+D+KG ENVVADH+SR+  E+E  +   PI+E FPDEQLY +  +     PW+AD VNYLA G  PPDM++ Q
Subjt:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRI--ENEEAKSWPPIVEMFPDEQLYQVKDS----VPWFADIVNYLAGGHFPPDMNYQQ

Query:  KKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMK
        KK+FL  VK Y+W+DP L+K   D +IR+CVP+ E+  IL  CH    GGH+G  +T AKVLQSGF+WP+LFKD   FV  CD CQRTGNIS ++++P+ 
Subjt:  KKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMK

Query:  PILEVELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATA
         ILEVELFDVWGIDFMGPFP SY G LYILVA+DYVSKWVEA A  TNDA+ VV+FL KNIFTRFG PRAI+SD G+HFCN+ F S++ KY + HK++T 
Subjt:  PILEVELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATA

Query:  YHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEE
        YHPQT+G VE+SNRE+K++LEKTV  +RKDW+LKLDDALWAYRTAFK PIG SPYRLVFGKACHLPVELEH+A+WAIK LN D   AGEKR L+LNE+EE
Subjt:  YHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEE

Query:  FRAQAYENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPGVKRAAFA
         R ++YENAKIYK+RT +WHDK I  + F  GQ +LL+NSRL+LFPG  R+ ++
Subjt:  FRAQAYENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPGVKRAAFA

A0A540MQU7 Integrase catalytic domain-containing protein2.9e-26566.31Show/hide
Query:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL
        L+L RC++ NLVLNWEKC FMV +GIVLGH +S KG+EVD+AKI  I +LPPPT+VKGVRSFLGH GFYRRFIKDFSKI++PL NLL K+  F F+  CL
Subjt:  LILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACL

Query:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL
         AFN+LK+ L  APII  PNWS  FE+MCDASDYA+GAVLGQR++ +   IYYASRTL++ Q  YTTTEKE+LAV+FA++KFRSYL+ SK++V+TDH AL
Subjt:  LAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAAL

Query:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIEN--EEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRF
        KYL  KKD+KPRL+RW+LLLQEFDL+I+D+KG ENVVADH+SR+ +  +E +   P+ E FPDEQL+ + D VPW+ADI NYL  G  PPD++ Q +K+F
Subjt:  KYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIEN--EEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDMNYQQKKRF

Query:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILE
        L  VK Y W+DP LYK C+D +IR+CVP  E  SIL  CH+   GGHFGP +TAAKVLQSGF+WP+LFKD Y F  +CDRCQR GNIS+++E+P + +L 
Subjt:  LHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILE

Query:  VELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQ
        VELFDVWGIDFMGPFP S+N   YILVA+DYVSKWVEAIAT TND + V++FL   IF RFGTPR I+SD G HF NK F ++M KYN+NH++AT YHPQ
Subjt:  VELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQ

Query:  TNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQ
        T+G VE+SNREIK++LE TV  +RKDW+LKL DALWAYRTA+KTPIG SP+RLV+GKACH PVELEHRAYWAIK+LN ++  AGEKR L+LNE+EE+R +
Subjt:  TNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQ

Query:  AYENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPGVKRA
        AYENAKIYKERT ++HDK I  + F+PGQ++LL+NSRLRLFPG  R+
Subjt:  AYENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPGVKRA

A0A6P8CBX2 Reverse transcriptase2.3e-27068.33Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL
        +L RC++ NL+LNWEKCHFMV EGIVLGHKVSKKG+EVDRAK+  IE+LPPPT+ KGVRSFLGH GFYRRFIKDFSKI++PL NLLEK++ F+F+D CL 
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL

Query:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALK
        AFN LKE+L  AP+IV PNW   FE+MCDASDYA+GAVLGQRR  +F AIYYASRTL+  QK Y TTEKELLAV+FA DKFR YL+ SKI+V+TDHAALK
Subjt:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALK

Query:  YLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVK-DSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH
        YLF K D+KPRL+RWILLLQEFDLEI+D KG ENVVADH+SR+E++   S  PI E FPDEQL+  +   +PW+ADIVNY+     P  ++ QQKK+FLH
Subjt:  YLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVK-DSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH

Query:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
        +VK Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L  GFYWP +F DC  ++ SC  CQRTGNISR+HE+P   IL +E
Subjt:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE

Query:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN
        LFDVWGIDFMGPFP S++   YILVA+DYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRAI+SD GSHFCN+ F+ ++ KY V HKIAT YHPQT 
Subjt:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY
        G VE+SNREIK++LEKTV  +RKDW+LKLDDALWAYRTAFKTPIG SPY++V+GK+CHLPVELEH+AYWAIK LN D   AGEKRLL+LN+M E R +AY
Subjt:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY

Query:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG
        ENA+IYKER  RWHD+ I  R FLPGQ++LL+NSRL+LFPG
Subjt:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG

A0A6P8DLJ8 Reverse transcriptase9.6e-26968.02Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL
        +L RC++ NL+LNWEKCHFMV EGIVLGHKVSKKG+EVDRAK+  IE+LPPPT+ KGVRSFLGH GFYRRFIKDFSKI++PL NLLEK++ F+F+D CL 
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL

Query:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALK
        AFN LKE+L  AP+IV PNW   FE+MC ASDYA+GAVLGQRR  +F AIYYASRTL+  QK Y TTEKELLAV+FA DKFR YL+ SKI+V+TDHAALK
Subjt:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALK

Query:  YLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVK-DSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH
        YLF K D+KPRL+RWILLLQEFDLEI+D KG ENVVADH+SR+E++   S  PI E FPDEQL+  +   +PW+ADIVNY+     P  ++ QQKK+FLH
Subjt:  YLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVK-DSVPWFADIVNYLAGGHFPPDMNYQQKKRFLH

Query:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
        +VK Y W++P L+K CAD +IR+CVP+ E +SI+  CH+   GGHFG  RTA K+L  GFYWP +F DC  ++ SC  CQRTGNISR+HE+P   IL +E
Subjt:  NVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE

Query:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN
        LFDVWGIDFMGPFP S++   YILVA+DYVSKWVEA+A ++NDAR V++FL KNIF+R G PRAI+SD GSHFCN+ F+ ++ KY V HKIAT YHPQT 
Subjt:  LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTN

Query:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY
        G VE+SNREIK++LEKTV  +RKDW+LKLDDALWAYRTAFKTPIG SPY++V+GK+CHLPVELEH+AYWAIK LN D   AGEKRLL+LN+M E R +AY
Subjt:  GLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAY

Query:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG
        ENA+IYKER  RWHD+ I  R FLPGQ++LL+NSRL+LFPG
Subjt:  ENAKIYKERTTRWHDKKITSRTFLPGQRILLFNSRLRLFPG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.2e-5126.95Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL
        +L + ++ANL++N  KC F  ++   +G+ +S+KG    +  I  + Q   P N K +R FLG   + R+FI   S++  PL+NLL+K+ ++ +      
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL

Query:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYL--LCSKIVVHTDHA
        A   +K+ L+  P++   ++S+   +  DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL        + TDH 
Subjt:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYL--LCSKIVVHTDHA

Query:  ALKYLFVKKDSKP---RLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQL---YQVKDSVPWFADIVNYLAGGHFPPDMNY
         L    +  +S+P   RL RW L LQ+F+ EI  R G  N +AD +SRI +E      PI +   D  +    Q+  +  +   +V          ++  
Subjt:  ALKYLFVKKDSKP---RLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQL---YQVKDSVPWFADIVNYLAGGHFPPDMNY

Query:  QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHEL-
         + KR   N++    +D LL  + + + I      +   +I+   H      H G I     ++   F W  + K    +V++C  CQ   N SR H+  
Subjt:  QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHEL-

Query:  -PMKPILEVEL-FDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVN
         P++PI   E  ++   +DF+   P S +GY  + V +D  SK    +  T++  A    +   + +   FG P+ I++D    F ++ +K    KYN  
Subjt:  -PMKPILEVEL-FDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVN

Query:  HKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLV
         K +  Y PQT+G  E +N+ ++++L     T+   W   +     +Y  A  +    +P+ +V
Subjt:  HKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLV

P0CT41 Transposon Tf2-12 polyprotein3.2e-5126.95Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL
        +L + ++ANL++N  KC F  ++   +G+ +S+KG    +  I  + Q   P N K +R FLG   + R+FI   S++  PL+NLL+K+ ++ +      
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL

Query:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYL--LCSKIVVHTDHA
        A   +K+ L+  P++   ++S+   +  DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL        + TDH 
Subjt:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYL--LCSKIVVHTDHA

Query:  ALKYLFVKKDSKP---RLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQL---YQVKDSVPWFADIVNYLAGGHFPPDMNY
         L    +  +S+P   RL RW L LQ+F+ EI  R G  N +AD +SRI +E      PI +   D  +    Q+  +  +   +V          ++  
Subjt:  ALKYLFVKKDSKP---RLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQL---YQVKDSVPWFADIVNYLAGGHFPPDMNY

Query:  QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHEL-
         + KR   N++    +D LL  + + + I      +   +I+   H      H G I     ++   F W  + K    +V++C  CQ   N SR H+  
Subjt:  QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHEL-

Query:  -PMKPILEVEL-FDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVN
         P++PI   E  ++   +DF+   P S +GY  + V +D  SK    +  T++  A    +   + +   FG P+ I++D    F ++ +K    KYN  
Subjt:  -PMKPILEVEL-FDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVN

Query:  HKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLV
         K +  Y PQT+G  E +N+ ++++L     T+   W   +     +Y  A  +    +P+ +V
Subjt:  HKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLV

P10394 Retrovirus-related Pol polyprotein from transposon 4121.1e-6727.8Show/hide
Query:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN
        +C++ NL L+ EKC F + E   LGHK + KG+  D  K   I+  P P +    R F+    +YRRFIK+F+  ++ ++ L +K   F + D C  AF 
Subjt:  RCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFN

Query:  TLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALKYLF
         LK +LI   ++  P++S+ F I  DAS  A GAVL Q  +     + YASR     +   +TTE+EL A+ +AI  FR Y+      V TDH  L YLF
Subjt:  TLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIVVHTDHAALKYLF

Query:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRI------------------------------------ENEEAKSWPPIVEMFPDEQL----
           +   +L R  L L+E++  ++  KG +N VAD +SRI                                    + +E  S P + E+  ++++    
Subjt:  VKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRI------------------------------------ENEEAKSWPPIVEMFPDEQL----

Query:  -YQVKDSVPWFA---------DIVNYLAGGHFPPDMNYQQ------------------KKRFLHNV--KSYHWEDPLL--YKVCADNMIRKCVPQEEVVS
          Q+ DS+  F          D+ +    G    D   Q+                  KK F H    K  +  + +L   KV   N + +   ++E  +
Subjt:  -YQVKDSVPWFA---------DIVNYLAGGHFPPDMNYQQ------------------KKRFLHNV--KSYHWEDPLL--YKVCADNMIRKCVPQEEVVS

Query:  ILNSCHASP-YGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEV--ELFDVWGIDFMGPFPISYNGYLYILVAIDY
        IL++ H  P  GGH G  +T AKV +  +YW ++ K    +V+ C +CQ+    ++  + PM  I E     FD   +D +GP P S NG  Y +  I  
Subjt:  ILNSCHASP-YGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEV--ELFDVWGIDFMGPFPISYNGYLYILVAIDY

Query:  VSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKL
        ++K++ AI      A+TV K + ++   ++G  +  ++D G+ + N +   + +   + +  +TA+H QT G+VE S+R + + +   + T++ DW + L
Subjt:  VSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKL

Query:  DDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEH-RAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAYENAKIYKERTTRWHDKKITSRTFLPGQR
           ++ + T         PY LVFG+  +LP       +   I  ++ D+ K  + RL      E   A+A +  + +KE+    +D K+       G +
Subjt:  DDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEH-RAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAYENAKIYKERTTRWHDKKITSRTFLPGQR

Query:  ILLFN
        +LL N
Subjt:  ILLFN

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.2e-5528.83Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPL-------SNLLEKEAKFI
        +L R ++ NL++  +KC F   E   LG+ +  + +   + K  AI   P P  VK  + FLG   +YRRFI + SKIA+P+       S   EK+ K  
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPL-------SNLLEKEAKFI

Query:  FDDACLLAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIV
               A   LK  L  +P++V  N   ++ +  DAS   +GAVL +   ++ +   + Y S++L++ QK Y   E ELL ++ A+  FR  L      
Subjt:  FDDACLLAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYLLCSKIV

Query:  VHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSR-IENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYL----AGGHFP
        + TDH +L  L  K +   R+ RW+  L  +D  ++   G +NVVAD +SR I     ++  PI      E       S P  + ++ ++         P
Subjt:  VHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSR-IENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYL----AGGHFP

Query:  PDM----NYQQKKRFLHNV-KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQR
         DM    +YQ+K        K+Y  ED ++Y        R  VP ++  +++   H  + +GGHFG   T AK+    +YWP L      ++++C +CQ 
Subjt:  PDM----NYQQKKRFLHNV-KSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCH-ASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQR

Query:  -TGNISRQHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKL
           +  R H L  P+ PI E    D+  +DF+   P + N    ILV +D  SK    IATR T DA  ++  L + IF+  G PR I SD         
Subjt:  -TGNISRQHEL--PMKPILEVELFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATR-TNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKL

Query:  FKSMMQKYNVNHKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIK
        ++ + ++  +   +++A HPQT+G  E + + + ++L   V TN ++W + L    + Y +     +G SP+ +  G   + P      E+  R++ A++
Subjt:  FKSMMQKYNVNHKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPV-----ELEHRAYWAIK

Q9UR07 Transposon Tf2-11 polyprotein3.2e-5126.95Show/hide
Query:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL
        +L + ++ANL++N  KC F  ++   +G+ +S+KG    +  I  + Q   P N K +R FLG   + R+FI   S++  PL+NLL+K+ ++ +      
Subjt:  ILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLL

Query:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYL--LCSKIVVHTDHA
        A   +K+ L+  P++   ++S+   +  DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL        + TDH 
Subjt:  AFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAIDKFRSYL--LCSKIVVHTDHA

Query:  ALKYLFVKKDSKP---RLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQL---YQVKDSVPWFADIVNYLAGGHFPPDMNY
         L    +  +S+P   RL RW L LQ+F+ EI  R G  N +AD +SRI +E      PI +   D  +    Q+  +  +   +V          ++  
Subjt:  ALKYLFVKKDSKP---RLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQL---YQVKDSVPWFADIVNYLAGGHFPPDMNY

Query:  QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHEL-
         + KR   N++    +D LL  + + + I      +   +I+   H      H G I     ++   F W  + K    +V++C  CQ   N SR H+  
Subjt:  QQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHEL-

Query:  -PMKPILEVEL-FDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVN
         P++PI   E  ++   +DF+   P S +GY  + V +D  SK    +  T++  A    +   + +   FG P+ I++D    F ++ +K    KYN  
Subjt:  -PMKPILEVEL-FDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAI-ATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVN

Query:  HKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLV
         K +  Y PQT+G  E +N+ ++++L     T+   W   +     +Y  A  +    +P+ +V
Subjt:  HKIATAYHPQTNGLVELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLV

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein6.8e-1764.29Show/hide
Query:  VLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFM
        VLQ+GFYWP+ FKD + FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM
Subjt:  VLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVELFDVWGIDFM

ATMG00860.1 DNA/RNA polymerases superfamily protein2.4e-1435.96Show/hide
Query:  NWEKCHFMVTEGIVLGHK--VSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLI
        N +KC F   +   LGH+  +S +G+  D AK+ A+   P P N   +R FLG  G+YRRF+K++ KI +PL+ LL+K +   + +   LAF  LK  + 
Subjt:  NWEKCHFMVTEGIVLGHK--VSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYRRFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLI

Query:  VAPIIVVPNWSQSF
          P++ +P+    F
Subjt:  VAPIIVVPNWSQSF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCAGAGCATCAACGCGGTGCTGGGAAGGCGCTGCTTCGACGCGTACTTGGTGAACGCAAGACAGCGTCGCGGCGCTGGGATGGCGCTGCCGCGGCGCTAAAACC
CGAGAGCAACTTAATTTTGGTCAGGTGCCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGTCACTTTATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTA
AAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCCCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGGAGGGTTTTACAGG
AGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTAGCTTTTAATACTCTGAA
GGAAAGACTAATTGTTGCACCTATTATTGTTGTACCAAATTGGAGTCAGTCTTTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTA
GAGATAACATGTTTAGGGCAATTTATTATGCTAGTAGGACTCTTGATAATACTCAAAAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGAT
AAATTTAGATCATACTTGCTTTGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGAT
ATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCAGACCACGTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCC
CAATTGTTGAGATGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTGTGCCTTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTCCCACCTGACATG
AACTATCAACAAAAGAAAAGATTCCTGCACAATGTTAAGTCTTACCATTGGGAGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCA
AGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGTCATTTTGGACCCATCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCAT
CCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTGGAAGTGGAG
TTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATAATGGCTACCTATATATTCTAGTTGCAATAGATTATGTATCTAAGTGGGTAGAAGCCAT
AGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGTTTTGGTACACCTAGAGCTATTCTTAGTGATGAGGGCTCTCACTTTT
GCAATAAACTGTTTAAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAGCTTATCATCCTCAAACTAATGGCCTTGTTGAGTTATCTAACAGGGAAATC
AAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGATTGGGCCCTTAAGCTCGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTC
CCCGTACAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCTTATTGGGCCATCAAGAAGCTGAACATGGATTTTGGGAAAGCCGGTGAGA
AACGCCTTTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAAATTTACAAAGAGCGCACTACGAGATGGCATGACAAGAAGATCACATCA
CGAACCTTCCTTCCAGGACAAAGAATATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGGGTAAAAAGAGCTGCCTTTGCTGTGATCAGTAAGGACGCCTGGGAATT
GACTAGAAACGCATCAGCATGGACGCAGAGCAGCAACGCGGCGCTGGGAAGGCGCTGCTTCGACGCATACTTGGTGAACGCAAGACAGCGTCGCGGCGCTAAAACCCGAG
AGCAACTTAATTTTGGTCGCAACGCTAGCGTTGCGGCGCCAGCGTTAATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCAGAGCATCAACGCGGTGCTGGGAAGGCGCTGCTTCGACGCGTACTTGGTGAACGCAAGACAGCGTCGCGGCGCTGGGATGGCGCTGCCGCGGCGCTAAAACC
CGAGAGCAACTTAATTTTGGTCAGGTGCCAAGATGCTAACCTTGTTCTAAACTGGGAGAAGTGTCACTTTATGGTGACGGAAGGCATCGTCCTGGGGCACAAAGTCTCTA
AAAAAGGATTGGAGGTGGATAGGGCTAAAATAGTTGCTATTGAACAACTACCACCCCCCACCAATGTGAAAGGAGTCAGAAGCTTCCTAGGGCATGGAGGGTTTTACAGG
AGATTTATTAAAGATTTTTCTAAAATTGCTAAACCCTTGAGTAATTTGTTAGAAAAGGAAGCTAAATTTATTTTTGATGATGCATGTCTACTAGCTTTTAATACTCTGAA
GGAAAGACTAATTGTTGCACCTATTATTGTTGTACCAAATTGGAGTCAGTCTTTTGAGATCATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTA
GAGATAACATGTTTAGGGCAATTTATTATGCTAGTAGGACTCTTGATAATACTCAAAAGAAATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGAT
AAATTTAGATCATACTTGCTTTGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAAAGTACTTGTTTGTTAAGAAAGATTCTAAACCTAGGCTAATGAGGTGGAT
ATTATTGTTACAGGAATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTTGCAGACCACGTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCC
CAATTGTTGAGATGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTGTGCCTTGGTTTGCTGACATAGTTAATTATCTTGCAGGAGGACATTTCCCACCTGACATG
AACTATCAACAAAAGAAAAGATTCCTGCACAATGTTAAGTCTTACCATTGGGAGGACCCACTTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTACCTCA
AGAGGAAGTGGTAAGTATTTTAAATTCATGTCATGCTTCACCCTATGGAGGTCATTTTGGACCCATCAGAACTGCAGCTAAGGTACTTCAGTCAGGATTTTATTGGCCAT
CCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTACTGGCAATATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTGGAAGTGGAG
TTATTTGATGTTTGGGGTATTGACTTTATGGGACCTTTTCCAATTTCTTATAATGGCTACCTATATATTCTAGTTGCAATAGATTATGTATCTAAGTGGGTAGAAGCCAT
AGCTACTAGGACCAATGATGCTCGCACTGTTGTAAAATTCTTGCATAAAAACATTTTCACACGTTTTGGTACACCTAGAGCTATTCTTAGTGATGAGGGCTCTCACTTTT
GCAATAAACTGTTTAAATCCATGATGCAAAAATATAATGTTAATCATAAAATTGCTACAGCTTATCATCCTCAAACTAATGGCCTTGTTGAGTTATCTAACAGGGAAATC
AAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGATTGGGCCCTTAAGCTCGATGATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTC
CCCGTACAGGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCTTATTGGGCCATCAAGAAGCTGAACATGGATTTTGGGAAAGCCGGTGAGA
AACGCCTTTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCAAGCTTATGAGAATGCCAAAATTTACAAAGAGCGCACTACGAGATGGCATGACAAGAAGATCACATCA
CGAACCTTCCTTCCAGGACAAAGAATATTACTTTTTAACTCACGTTTACGCTTGTTTCCAGGGGTAAAAAGAGCTGCCTTTGCTGTGATCAGTAAGGACGCCTGGGAATT
GACTAGAAACGCATCAGCATGGACGCAGAGCAGCAACGCGGCGCTGGGAAGGCGCTGCTTCGACGCATACTTGGTGAACGCAAGACAGCGTCGCGGCGCTAAAACCCGAG
AGCAACTTAATTTTGGTCGCAACGCTAGCGTTGCGGCGCCAGCGTTAATTTAA
Protein sequenceShow/hide protein sequence
MDAEHQRGAGKALLRRVLGERKTASRRWDGAAAALKPESNLILVRCQDANLVLNWEKCHFMVTEGIVLGHKVSKKGLEVDRAKIVAIEQLPPPTNVKGVRSFLGHGGFYR
RFIKDFSKIAKPLSNLLEKEAKFIFDDACLLAFNTLKERLIVAPIIVVPNWSQSFEIMCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQKKYTTTEKELLAVVFAID
KFRSYLLCSKIVVHTDHAALKYLFVKKDSKPRLMRWILLLQEFDLEIKDRKGCENVVADHVSRIENEEAKSWPPIVEMFPDEQLYQVKDSVPWFADIVNYLAGGHFPPDM
NYQQKKRFLHNVKSYHWEDPLLYKVCADNMIRKCVPQEEVVSILNSCHASPYGGHFGPIRTAAKVLQSGFYWPSLFKDCYTFVKSCDRCQRTGNISRQHELPMKPILEVE
LFDVWGIDFMGPFPISYNGYLYILVAIDYVSKWVEAIATRTNDARTVVKFLHKNIFTRFGTPRAILSDEGSHFCNKLFKSMMQKYNVNHKIATAYHPQTNGLVELSNREI
KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYRLVFGKACHLPVELEHRAYWAIKKLNMDFGKAGEKRLLELNEMEEFRAQAYENAKIYKERTTRWHDKKITS
RTFLPGQRILLFNSRLRLFPGVKRAAFAVISKDAWELTRNASAWTQSSNAALGRRCFDAYLVNARQRRGAKTREQLNFGRNASVAAPALI