; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G17170 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G17170
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr1:12935847..12937982
RNA-Seq ExpressionCSPI01G17170
SyntenyCSPI01G17170
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7014963.1 unnamed protein product [Microthlaspi erraticum]4.8e-24257.53Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E +IL + IE+LL+KG I+ S+SPC VP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFK+ +GL+EW+                           F+VVYFDDIL+Y+   +EH+ HL+++ QVL E +LY+N KKCTF   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W  P S  E+++F GL +FYRRF+R+FS++                  QD SF  IK +L ++PVL LPDF   F+V  DA G GIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ +FSEKLS +RQ WSTY QE YA+ RAL+QWEHYL+ +EFIL TDH +LK+L S++ I++MHARW+SFLQ+F F+I+H+SGT NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR++SLL  L+ E+V F+ L +LYE D +F E+W KC     + DFHI +GFLFKG++LCIP +SLRE L+++ H GGL+GH G+DKT  ++ +RYYWP 
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        LRRD+   +KRC ICQ +KG S NTGLY PLP+P  IW+DLS+DFV+GLP TQR  DS+ V+VDRFSKM HFIACKKT DA  IA LFFRE+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        TI+SDRD KFLSHFW TLW+   TTLK ++TAHPQTDGQTEVTNRTLGN+IR + G + KQWDLAL Q EFA+N+  + AT + PF +VYT  P+   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

CAA7028195.1 unnamed protein product [Microthlaspi erraticum]4.8e-24257.53Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E +IL + IE+LL+KG I+ S+SPC VP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFK+ +GL+EW+                           F+VVYFDDIL+Y+   +EH+ HL+++ QVL E +LY+N KKCTF   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W  P S  E+++F GL +FYRRF+R+FS++                  QD SF  IK +L ++PVL LPDF   F+V  DA G GIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ +FSEKLS +RQ WSTY QE YA+ RAL+QWEHYL+ +EFIL TDH +LK+L S++ I++MHARW+SFLQ+F F+I+H+SGT NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR++SLL  L+ E+V F+ L +LYE D +F E+W KC     + DFHI +GFLFKG++LCIP +SLRE L+++ H GGL+GH G+DKT  ++ +RYYWP 
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        LRRD+   +KRC ICQ +KG S NTGLY PLP+P  IW+DLS+DFV+GLP TQR  DS+ V+VDRFSKM HFIACKKT DA  IA LFFRE+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        TI+SDRD KFLSHFW TLW+   TTLK ++TAHPQTDGQTEVTNRTLGN+IR + G + KQWDLAL Q EFA+N+  + AT + PF +VYT  P+   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

PWA81295.1 transposon Ty3-I Gag-Pol polyprotein [Artemisia annua]8.2e-24256.68Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E  IL + +EELLRKGHI+ S+SPC VPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKIDL+SGYHQIRI+ GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFKT +GL+EW+                           F+VVYFDDILVY+    EH+ HLRK+ + LTE EL++N KKCTF   ++ FLG+I+    
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++  KV+A+  W +P +  E+++F GLA+FYRRF+RNFSS+                   ++SF  IK RLT++PVL LP+F   FE+  DACGTGIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ + SEKL+ +RQ WSTY QELYA+V+A+K+WEHYL+ +EF++ +DH +LKY Q++R ++++HARW SFL++F++VIKH+SG  NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SRK++LL  +S++VV F+ +  LYE D DF   W +        +F +L+G+LFKG +LCIP TSLR  L+KE H+GGL+ H G+DKT  ++  R+YWPQ
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        L+RD  +F++RC +CQ  KG + NTGLY PLP+P S W D+S+DFV+GLP TQR  DS+ V+VDRFSKMAHFI CKKT+DA +IA LFF+E+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        +I SDRD KFL+HFW TLW++L T+L F++TAHPQTDGQTEV NRTLGN+IR L G K K WD++LAQAEFA+N+  + +T   PF+VVY   PR   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

VVA31129.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis]3.5e-24057.1Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E  IL + IEELL+KG I+ SLSPC VP LL PKKD +WRMCVDSRAIN+ITVKYRFPIPR+ D+LD L  + +FSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFK+ +GLFEW+                           F+VVYFDDIL+Y+T  +EH++HLR++  VL E +LY+N KKCTF   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +K++AI  W  P +  E+++F GLA+FYRRF+R+FSS+                   + SF DIK +L ++PVL LP+F   FEV  DA G G+G
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ +FSEKLS +RQ WSTY QE YA+VRALKQWEHYL+ KEF+L TDH +LKY+ S++ I +MHARW++FLQ+F FVIKH SG  N+VADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR++SLL  L+ EVV F+ L +LYE D DF E+W KCTN     D+ + EG+LFKG QLCIP +SLRE L+++ H GGL+GH G+DKT   + +R+YWPQ
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        L+RD    +++C  CQ +KG   NTGLY PLP+P  IW+DL++DFV+GLP TQR  DS+ V+VDRFSKMAHFIACKKT DA  IA LFFRE+VRLHGVP 
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        +I SDRD KFLSHFW TLW+   TTL  ++TAHPQTDGQTEVTNRTLGN++R + G K KQWD AL Q EFA+N+  + AT + PF +VYT  P    DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

XP_025979678.1 uncharacterized protein LOC112997809 [Glycine max]2.9e-23957.24Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E +IL + IE+LLRKG I+ S+SPC VP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D+LD+L  + +FSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFK+ +GL+EW+                           F+VVYFDDIL+Y+   +EH+ H+R + QVL E +LYIN KKCTF   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W  PTS  E+++F GLA+FYRRFIR+FS++                   + SF  IK +L ++PVL LPDF   F+V  DA G GIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   PI +FSEKLS +R+ WSTY QE YA+ RAL+QWEHYL+ +EFIL TDH +LK+L S++ I++MHARW+SFLQ+F F+I+H+SG  NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR+ SLL  L+ EVV F+ L +LYE D +F E+W KC  +   +DFH+ EGFLFKG +LCIP +SLRE L+++ H GGL+GH G+DKT  ++ +R+YWP 
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        LR+D+   +K+C  CQ +KG S NTGLY PLPIP  IW+DL++DFV+GLP TQR  DS+ V+VDRFSKM+HFIACKKT DA  IA LFFRE+V LHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        +I SDRD KFLSHFW TLWK  DT+L  ++TAHPQTDGQTEVTNRTLGN+IR + G K KQWDLAL Q EFA+N+  + AT + PF +VYT  PR   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

TrEMBL top hitse value%identityAlignment
A0A2U1P6A2 Transposon Ty3-I Gag-Pol polyprotein4.0e-24256.68Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E  IL + +EELLRKGHI+ S+SPC VPALLTPKKDGSWRMCVDSRAIN+ITV+YRFPIPR+ DLLDQL  A +FSKIDL+SGYHQIRI+ GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFKT +GL+EW+                           F+VVYFDDILVY+    EH+ HLRK+ + LTE EL++N KKCTF   ++ FLG+I+    
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++  KV+A+  W +P +  E+++F GLA+FYRRF+RNFSS+                   ++SF  IK RLT++PVL LP+F   FE+  DACGTGIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ + SEKL+ +RQ WSTY QELYA+V+A+K+WEHYL+ +EF++ +DH +LKY Q++R ++++HARW SFL++F++VIKH+SG  NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SRK++LL  +S++VV F+ +  LYE D DF   W +        +F +L+G+LFKG +LCIP TSLR  L+KE H+GGL+ H G+DKT  ++  R+YWPQ
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        L+RD  +F++RC +CQ  KG + NTGLY PLP+P S W D+S+DFV+GLP TQR  DS+ V+VDRFSKMAHFI CKKT+DA +IA LFF+E+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        +I SDRD KFL+HFW TLW++L T+L F++TAHPQTDGQTEV NRTLGN+IR L G K K WD++LAQAEFA+N+  + +T   PF+VVY   PR   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

A0A5B7BER3 Uncharacterized protein4.4e-24958.52Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E +IL   +E+L+ KG I+ S+SPC VPALLTPKKDGSWRMCVDSRAIN+ITVKYRFPIPR+ D+LD L  + IFSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFKT EGL+EW+                           F+VVYFDDIL+Y+ +  EH+ H+R++   L E++LYIN KKC F    + FLGFII    
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W TP +  +I++F GLA+FYRRFIRNFSS+                   + SF  IK +L+++PVL LP F   F+V  DA  TGIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+E+FSEKL+ +RQ W+TY  EL+A+VRALK WEHYL+ +EF++ +DH +LK++ ++ ++SRMH RWI+FLQRF FV+KH++G +NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR+++LL ++SSE+ +F+ L +LY+ED DF + W KC     + +FHI +G+LFKG QLCIP TSLRE +L++ HSGGL GH G+DKT   + +RYYWPQ
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        L+RD   F+++CPICQ AKG + NTGLY+PLP+P  IWEDL++DF++GLP TQR  DS+ V+VDRFSKMAHFI CKKT+DA ++ANLFFREIVRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        +I SDRDVKFLSHFWRTLW+K DT+L++++TAHPQTDGQTEVTNRTLGNLIR  SG + KQWD+ L Q EFA+N M NR+T + PFE+VYTK P+   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
        A LP
Subjt:  ATLP

A0A6D2HLB5 Reverse transcriptase2.3e-24257.53Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E +IL + IE+LL+KG I+ S+SPC VP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFK+ +GL+EW+                           F+VVYFDDIL+Y+   +EH+ HL+++ QVL E +LY+N KKCTF   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W  P S  E+++F GL +FYRRF+R+FS++                  QD SF  IK +L ++PVL LPDF   F+V  DA G GIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ +FSEKLS +RQ WSTY QE YA+ RAL+QWEHYL+ +EFIL TDH +LK+L S++ I++MHARW+SFLQ+F F+I+H+SGT NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR++SLL  L+ E+V F+ L +LYE D +F E+W KC     + DFHI +GFLFKG++LCIP +SLRE L+++ H GGL+GH G+DKT  ++ +RYYWP 
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        LRRD+   +KRC ICQ +KG S NTGLY PLP+P  IW+DLS+DFV+GLP TQR  DS+ V+VDRFSKM HFIACKKT DA  IA LFFRE+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        TI+SDRD KFLSHFW TLW+   TTLK ++TAHPQTDGQTEVTNRTLGN+IR + G + KQWDLAL Q EFA+N+  + AT + PF +VYT  P+   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

A0A6D2IKM3 Reverse transcriptase2.3e-24257.53Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP+E +IL + IE+LL+KG I+ S+SPC VP LL PKK   WRMCVDSRAIN+IT+KYRFPIPR+ D+LD+L  + IFSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFK+ +GL+EW+                           F+VVYFDDIL+Y+   +EH+ HL+++ QVL E +LY+N KKCTF   ++ FLGF++ +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W  P S  E+++F GL +FYRRF+R+FS++                  QD SF  IK +L ++PVL LPDF   F+V  DA G GIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL------------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ   P+ +FSEKLS +RQ WSTY QE YA+ RAL+QWEHYL+ +EFIL TDH +LK+L S++ I++MHARW+SFLQ+F F+I+H+SGT NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SR++SLL  L+ E+V F+ L +LYE D +F E+W KC     + DFHI +GFLFKG++LCIP +SLRE L+++ H GGL+GH G+DKT  ++ +RYYWP 
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        LRRD+   +KRC ICQ +KG S NTGLY PLP+P  IW+DLS+DFV+GLP TQR  DS+ V+VDRFSKM HFIACKKT DA  IA LFFRE+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        TI+SDRD KFLSHFW TLW+   TTLK ++TAHPQTDGQTEVTNRTLGN+IR + G + KQWDLAL Q EFA+N+  + AT + PF +VYT  P+   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLP
          LP
Subjt:  ATLP

A0A6N2LVR1 Uncharacterized protein1.2e-24156.48Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        MSP E  IL   +EEL++KG ++ S+SPC VPALL PKKDGSWRMC+DSRAIN+IT+KYRFPIPR+ D+LD L  + IFSKIDL+SGYHQIRIR GDEWK
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
        TAFKT EGL+EW+                           F+VVYFDDIL+Y+ +  +HI HLR++F VL   +L++N  KC F    + FLGF++    
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG
        I ++ +KV AI  W TP +  E+++F GLA+FYRRF+R+FS +                   + SF  IK +L S+PVL LPDF   FEV  DA   GIG
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL-------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIG

Query:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV
        AVLSQ + P+ ++SEKLS +R+ WSTY  ELYA+ RA+K WEHYL+ +EFIL +DH +LK++ ++  ++RMHARW++F+QRF+F +KH+SG  NKVADA+
Subjt:  AVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAV

Query:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ
        SRK SLLT L +EV+ F+ + DLY  D DF   W KC   +  E  H  +G+LF+G QLCIP +SLRE ++ E H GGL GH G+DKT     +RYYWPQ
Subjt:  SRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQDKTFETISKRYYWPQ

Query:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK
        L+RD  N +KRCP CQ +KG + NTGLY PLPIP   WEDLS+DF++GLP TQR  DS+ V+VDRFSKMAHFIACKKT+DA+++ANLFF+E+VRLHGVPK
Subjt:  LRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLHGVPK

Query:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL
        +I SDRD KFLSHFWRTLW++ DTTL F++T+HPQTDGQTEV NRTLGNLIR LSG + KQWDL LAQAEFA+N+M NR+T + PF+VVY + P+   DL
Subjt:  TIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDL

Query:  ATLPTVVDIN
          LP +  +N
Subjt:  ATLPTVVDIN

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.0e-9830.58Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        + P + + ++D I + L+ G I+ S +    P +  PKK+G+ RM VD + +N+      +P+P I  LL ++  +TIF+K+DLKS YH IR+R GDE K
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
         AF+   G+FE++                            +V Y DDIL+++ +  EH+ H++ + Q L    L IN  KC F + ++ F+G+ I +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL--------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGI
             + ++ +  W+ P + KE++ FLG  ++ R+FI   S L                      + ++IK+ L S PVL+  DF+    +  DA    +
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL--------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGI

Query:  GAVLSQRS-----HPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYL--LCKEFILLTDHFSL--KYLQSRRTISRMHARWISFLQRFDFVIKHQS
        GAVLSQ+      +P+ Y+S K+S ++ ++S   +E+ A++++LK W HYL    + F +LTDH +L  +        ++  ARW  FLQ F+F I ++ 
Subjt:  GAVLSQRS-----HPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYL--LCKEFILLTDHFSL--KYLQSRRTISRMHARWISFLQRFDFVIKHQS

Query:  GTENKVADAVSR------------KSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIK--AEDFHILEGFLFKG-EQLCIPH-TSLREALLKEA
        G+ N +ADA+SR            + + +  ++   +       +  E T+ +++     N  K   E+  + +G L    +Q+ +P+ T L   ++K+ 
Subjt:  GTENKVADAVSR------------KSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIK--AEDFHILEGFLFKG-EQLCIPH-TSLREALLKEA

Query:  HSGGLAGHFGQDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPL-PIPTS--IWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAH
        H  G   H G +     I +R+ W  +R+    +++ C  CQ  K  S N   Y PL PIP S   WE LS+DF+  LP +   Y+++ V+VDRFSKMA 
Subjt:  HSGGLAGHFGQDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPL-PIPTS--IWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAH

Query:  FIACKKTNDAIYIANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEF
         + C K+  A   A +F + ++   G PK I++D D  F S  W+    K +  +KF+    PQTDGQTE TN+T+  L+R +  T    W   ++  + 
Subjt:  FIACKKTNDAIYIANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEF

Query:  AFNNMKNRATNRCPFEVVYTKQPRLT
        ++NN  + AT   PFE+V+   P L+
Subjt:  AFNNMKNRATNRCPFEVVYTKQPRLT

P0CT35 Transposon Tf2-2 polyprotein4.0e-9830.58Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        + P + + ++D I + L+ G I+ S +    P +  PKK+G+ RM VD + +N+      +P+P I  LL ++  +TIF+K+DLKS YH IR+R GDE K
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
         AF+   G+FE++                            +V Y DDIL+++ +  EH+ H++ + Q L    L IN  KC F + ++ F+G+ I +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL--------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGI
             + ++ +  W+ P + KE++ FLG  ++ R+FI   S L                      + ++IK+ L S PVL+  DF+    +  DA    +
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL--------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGI

Query:  GAVLSQRS-----HPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYL--LCKEFILLTDHFSL--KYLQSRRTISRMHARWISFLQRFDFVIKHQS
        GAVLSQ+      +P+ Y+S K+S ++ ++S   +E+ A++++LK W HYL    + F +LTDH +L  +        ++  ARW  FLQ F+F I ++ 
Subjt:  GAVLSQRS-----HPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYL--LCKEFILLTDHFSL--KYLQSRRTISRMHARWISFLQRFDFVIKHQS

Query:  GTENKVADAVSR------------KSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIK--AEDFHILEGFLFKG-EQLCIPH-TSLREALLKEA
        G+ N +ADA+SR            + + +  ++   +       +  E T+ +++     N  K   E+  + +G L    +Q+ +P+ T L   ++K+ 
Subjt:  GTENKVADAVSR------------KSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIK--AEDFHILEGFLFKG-EQLCIPH-TSLREALLKEA

Query:  HSGGLAGHFGQDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPL-PIPTS--IWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAH
        H  G   H G +     I +R+ W  +R+    +++ C  CQ  K  S N   Y PL PIP S   WE LS+DF+  LP +   Y+++ V+VDRFSKMA 
Subjt:  HSGGLAGHFGQDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPL-PIPTS--IWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAH

Query:  FIACKKTNDAIYIANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEF
         + C K+  A   A +F + ++   G PK I++D D  F S  W+    K +  +KF+    PQTDGQTE TN+T+  L+R +  T    W   ++  + 
Subjt:  FIACKKTNDAIYIANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEF

Query:  AFNNMKNRATNRCPFEVVYTKQPRLT
        ++NN  + AT   PFE+V+   P L+
Subjt:  AFNNMKNRATNRCPFEVVYTKQPRLT

P0CT41 Transposon Tf2-12 polyprotein4.0e-9830.58Show/hide
Query:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK
        + P + + ++D I + L+ G I+ S +    P +  PKK+G+ RM VD + +N+      +P+P I  LL ++  +TIF+K+DLKS YH IR+R GDE K
Subjt:  MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWK

Query:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS
         AF+   G+FE++                            +V Y DDIL+++ +  EH+ H++ + Q L    L IN  KC F + ++ F+G+ I +  
Subjt:  TAFKTNEGLFEWM---------------------------FIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGS

Query:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL--------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGI
             + ++ +  W+ P + KE++ FLG  ++ R+FI   S L                      + ++IK+ L S PVL+  DF+    +  DA    +
Subjt:  IGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSL--------------------QDSFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGI

Query:  GAVLSQRS-----HPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYL--LCKEFILLTDHFSL--KYLQSRRTISRMHARWISFLQRFDFVIKHQS
        GAVLSQ+      +P+ Y+S K+S ++ ++S   +E+ A++++LK W HYL    + F +LTDH +L  +        ++  ARW  FLQ F+F I ++ 
Subjt:  GAVLSQRS-----HPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYL--LCKEFILLTDHFSL--KYLQSRRTISRMHARWISFLQRFDFVIKHQS

Query:  GTENKVADAVSR------------KSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIK--AEDFHILEGFLFKG-EQLCIPH-TSLREALLKEA
        G+ N +ADA+SR            + + +  ++   +       +  E T+ +++     N  K   E+  + +G L    +Q+ +P+ T L   ++K+ 
Subjt:  GTENKVADAVSR------------KSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIK--AEDFHILEGFLFKG-EQLCIPH-TSLREALLKEA

Query:  HSGGLAGHFGQDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPL-PIPTS--IWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAH
        H  G   H G +     I +R+ W  +R+    +++ C  CQ  K  S N   Y PL PIP S   WE LS+DF+  LP +   Y+++ V+VDRFSKMA 
Subjt:  HSGGLAGHFGQDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPL-PIPTS--IWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAH

Query:  FIACKKTNDAIYIANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEF
         + C K+  A   A +F + ++   G PK I++D D  F S  W+    K +  +KF+    PQTDGQTE TN+T+  L+R +  T    W   ++  + 
Subjt:  FIACKKTNDAIYIANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEF

Query:  AFNNMKNRATNRCPFEVVYTKQPRLT
        ++NN  + AT   PFE+V+   P L+
Subjt:  AFNNMKNRATNRCPFEVVYTKQPRLT

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.9e-10934.47Show/hide
Query:  IEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWKTAFKTNEGLFEW
        +++LL    I PS SPC  P +L PKKDG++R+CVD R +N+ T+   FP+PRI +LL ++G A IF+ +DL SGYHQI +   D +KTAF T  G +E+
Subjt:  IEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWKTAFKTNEGLFEW

Query:  -------------------------MFIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGSIGMEPKKVEAIHTW
                                  F+ VY DDIL+++ + +EH  HL  + + L    L +  KKC F   E  FLG+ I    I     K  AI  +
Subjt:  -------------------------MFIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGSIGMEPKKVEAIHTW

Query:  RTPTSFKEIQAFLGLASFYRRFIRNFSSL-----------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIGAVLSQRSHP------
         TP + K+ Q FLG+ ++YRRFI N S +                 QD + + +K  L +SPVL   +    + +  DA   GIGAVL +  +       
Subjt:  RTPTSFKEIQAFLGLASFYRRFIRNFSSL-----------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIGAVLSQRSHP------

Query:  IEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAVSRKSSLLTL
        + YFS+ L S+++++     EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ +G +N VADA+SR    +T 
Subjt:  IEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAVSRKSSLLTL

Query:  LSSEVVAFKHLPDLYEEDTDFSEVWYKC----------------TNYIK--------AEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGL-AGHFG
         +S  +  +     Y+ D   S V                     +Y K         +++ + +  ++  ++L +P    + A+++  H   L  GHFG
Subjt:  LSSEVVAFKHLPDLYEEDTDFSEVWYKC----------------TNYIK--------AEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGL-AGHFG

Query:  QDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNT-GLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIY
           T   IS  YYWP+L+     +I+ C  CQ  K       GL  PLPI    W D+S+DFV GLP T    + I+V+VDRFSK AHFIA +KT DA  
Subjt:  QDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNT-GLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIY

Query:  IANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNR
        + +L FR I   HG P+TI SDRDV+  +  ++ L K+L      ++  HPQTDGQ+E T +TL  L+R    T ++ W + L Q EF +N+   R   +
Subjt:  IANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNR

Query:  CPFEV
         PFE+
Subjt:  CPFEV

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.9e-11034.61Show/hide
Query:  IEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWKTAFKTNEGLFEW
        +++LL    I PS SPC  P +L PKKDG++R+CVD R +N+ T+   FP+PRI +LL ++G A IF+ +DL SGYHQI +   D +KTAF T  G +E+
Subjt:  IEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWKTAFKTNEGLFEW

Query:  -------------------------MFIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGSIGMEPKKVEAIHTW
                                  F+ VY DDIL+++ + +EH  HL  + + L    L +  KKC F   E  FLG+ I    I     K  AI  +
Subjt:  -------------------------MFIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGSIGMEPKKVEAIHTW

Query:  RTPTSFKEIQAFLGLASFYRRFIRNFSSL-----------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIGAVLSQRSHP------
         TP + K+ Q FLG+ ++YRRFI N S +                 QD + D +K  L +SPVL   +    + +  DA   GIGAVL +  +       
Subjt:  RTPTSFKEIQAFLGLASFYRRFIRNFSSL-----------------QD-SFDDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIGAVLSQRSHP------

Query:  IEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAVSRKSSLLTL
        + YFS+ L S+++++     EL  +++AL  + + L  K F L TDH SL  LQ++   +R   RW+  L  +DF +++ +G +N VADA+SR    +T 
Subjt:  IEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWISFLQRFDFVIKHQSGTENKVADAVSRKSSLLTL

Query:  LSSEVVAFKHLPDLYEEDTDFSEVWYKC----------------TNYIK--------AEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGL-AGHFG
         +S  +  +     Y+ D   S V                     +Y K         +++ + +  ++  ++L +P    + A+++  H   L  GHFG
Subjt:  LSSEVVAFKHLPDLYEEDTDFSEVWYKC----------------TNYIK--------AEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGL-AGHFG

Query:  QDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNT-GLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIY
           T   IS  YYWP+L+     +I+ C  CQ  K       GL  PLPI    W D+S+DFV GLP T    + I+V+VDRFSK AHFIA +KT DA  
Subjt:  QDKTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNT-GLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIY

Query:  IANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNR
        + +L FR I   HG P+TI SDRDV+  +  ++ L K+L      ++  HPQTDGQ+E T +TL  L+R  + T ++ W + L Q EF +N+   R   +
Subjt:  IANLFFREIVRLHGVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNR

Query:  CPFEV
         PFE+
Subjt:  CPFEV

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.3e-1433.08Show/hide
Query:  HLRKLFQVLTETELYINSKKCTFFRREIAFLG--FIIKQGSIGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSLQD--------------
        HL  + Q+  + + Y N KKC F + +IA+LG   II    +  +P K+EA+  W  P +  E++ FLGL  +YRRF++N+  +                
Subjt:  HLRKLFQVLTETELYINSKKCTFFRREIAFLG--FIIKQGSIGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSLQD--------------

Query:  -----SFDDIKRRLTSSPVLQLPDFTLPFEVAV
             +F  +K  +T+ PVL LPD  LPF   V
Subjt:  -----SFDDIKRRLTSSPVLQLPDFTLPFEVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTCCAGAAGAATACAAAATCCTACACGATCACATAGAAGAGCTGCTGAGAAAGGGCCATATCAAGCCAAGCCTTAGCCCATGTGTTGTGCCTGCATTACTTACACC
AAAGAAGGATGGAAGCTGGAGAATGTGTGTAGACAGCAGGGCTATCAACCGAATTACTGTAAAGTACCGATTCCCCATCCCTCGAATTGGAGACTTGCTGGACCAACTAG
GCAAGGCTACCATCTTTTCAAAGATTGACTTGAAGAGCGGCTATCACCAAATAAGGATCAGACTAGGGGACGAGTGGAAAACAGCCTTCAAAACCAATGAAGGCTTATTC
GAATGGATGTTTATAGTTGTCTATTTCGATGACATACTTGTATACAACACCAACTATGATGAGCACATACTGCACTTAAGGAAACTATTCCAAGTCCTAACGGAGACAGA
ACTATACATCAATTCCAAAAAGTGCACATTCTTTAGAAGGGAAATTGCCTTTCTTGGCTTTATAATCAAGCAAGGGAGCATAGGCATGGAACCAAAGAAAGTAGAGGCTA
TCCATACTTGGCGCACACCAACCTCTTTTAAGGAGATACAAGCCTTCCTTGGCTTGGCTTCCTTTTACAGAAGATTTATAAGAAATTTCAGCTCATTGCAAGATAGCTTT
GACGATATTAAAAGGAGGTTGACTTCCAGCCCCGTACTTCAACTACCAGACTTCACTTTACCATTTGAAGTGGCTGTGGATGCGTGCGGAACAGGAATTGGGGCTGTTCT
CTCTCAACGAAGTCATCCCATCGAATATTTTAGTGAAAAGCTAAGCTCATCTAGACAGTCTTGGAGTACGTATGTGCAGGAATTATATGCTCTCGTTCGGGCACTCAAAC
AGTGGGAGCACTACCTATTATGCAAAGAATTTATACTGCTAACTGACCATTTTTCACTAAAATACCTCCAGTCTCGAAGAACTATCAGTCGAATGCACGCACGATGGATC
TCTTTCCTACAAAGATTTGACTTTGTGATCAAACACCAAAGTGGCACAGAAAATAAAGTGGCAGATGCCGTCAGCAGAAAGAGTTCCTTACTCACACTCCTCTCTTCAGA
AGTCGTGGCATTTAAACATCTTCCCGACCTATACGAGGAAGATACTGACTTTTCAGAAGTATGGTACAAATGCACTAATTATATTAAAGCTGAAGACTTCCATATTTTGG
AAGGTTTTCTATTCAAAGGGGAACAACTATGCATACCTCACACCTCATTACGAGAAGCCCTATTAAAGGAAGCTCACTCGGGTGGTCTGGCTGGACATTTCGGGCAAGAC
AAAACATTCGAGACAATTTCCAAGAGGTATTATTGGCCACAACTCAGGAGAGATTCAAACAATTTCATTAAAAGATGTCCTATATGCCAAAGAGCTAAAGGCTCAAGTAC
TAATACTGGGTTATACTCTCCACTGCCCATCCCAACTTCTATTTGGGAAGACTTATCAGTAGACTTCGTGGTAGGATTACCAAATACTCAAAGGCTGTATGATTCAATCA
TGGTTTTAGTAGACAGATTCAGTAAAATGGCCCACTTTATTGCGTGCAAAAAGACGAATGATGCAATCTATATAGCCAATCTGTTTTTCAGAGAAATAGTACGCCTACAT
GGAGTACCAAAAACAATTGTGTCTGATCGGGATGTGAAATTCTTAAGCCACTTTTGGAGGACCCTATGGAAGAAACTTGACACGACTCTGAAATTCAATACTACTGCACA
CCCACAAACAGATGGACAGACCGAGGTCACGAATAGAACACTAGGCAACCTAATACGTTTTCTTAGTGGGACTAAACTGAAGCAGTGGGATTTGGCACTTGCCCAGGCCG
AGTTTGCATTCAACAATATGAAGAATAGAGCAACTAACAGATGCCCATTCGAAGTTGTATATACTAAACAACCACGATTAACGTTTGACCTTGCCACACTCCCTACAGTC
GTGGACATTAATGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTCCAGAAGAATACAAAATCCTACACGATCACATAGAAGAGCTGCTGAGAAAGGGCCATATCAAGCCAAGCCTTAGCCCATGTGTTGTGCCTGCATTACTTACACC
AAAGAAGGATGGAAGCTGGAGAATGTGTGTAGACAGCAGGGCTATCAACCGAATTACTGTAAAGTACCGATTCCCCATCCCTCGAATTGGAGACTTGCTGGACCAACTAG
GCAAGGCTACCATCTTTTCAAAGATTGACTTGAAGAGCGGCTATCACCAAATAAGGATCAGACTAGGGGACGAGTGGAAAACAGCCTTCAAAACCAATGAAGGCTTATTC
GAATGGATGTTTATAGTTGTCTATTTCGATGACATACTTGTATACAACACCAACTATGATGAGCACATACTGCACTTAAGGAAACTATTCCAAGTCCTAACGGAGACAGA
ACTATACATCAATTCCAAAAAGTGCACATTCTTTAGAAGGGAAATTGCCTTTCTTGGCTTTATAATCAAGCAAGGGAGCATAGGCATGGAACCAAAGAAAGTAGAGGCTA
TCCATACTTGGCGCACACCAACCTCTTTTAAGGAGATACAAGCCTTCCTTGGCTTGGCTTCCTTTTACAGAAGATTTATAAGAAATTTCAGCTCATTGCAAGATAGCTTT
GACGATATTAAAAGGAGGTTGACTTCCAGCCCCGTACTTCAACTACCAGACTTCACTTTACCATTTGAAGTGGCTGTGGATGCGTGCGGAACAGGAATTGGGGCTGTTCT
CTCTCAACGAAGTCATCCCATCGAATATTTTAGTGAAAAGCTAAGCTCATCTAGACAGTCTTGGAGTACGTATGTGCAGGAATTATATGCTCTCGTTCGGGCACTCAAAC
AGTGGGAGCACTACCTATTATGCAAAGAATTTATACTGCTAACTGACCATTTTTCACTAAAATACCTCCAGTCTCGAAGAACTATCAGTCGAATGCACGCACGATGGATC
TCTTTCCTACAAAGATTTGACTTTGTGATCAAACACCAAAGTGGCACAGAAAATAAAGTGGCAGATGCCGTCAGCAGAAAGAGTTCCTTACTCACACTCCTCTCTTCAGA
AGTCGTGGCATTTAAACATCTTCCCGACCTATACGAGGAAGATACTGACTTTTCAGAAGTATGGTACAAATGCACTAATTATATTAAAGCTGAAGACTTCCATATTTTGG
AAGGTTTTCTATTCAAAGGGGAACAACTATGCATACCTCACACCTCATTACGAGAAGCCCTATTAAAGGAAGCTCACTCGGGTGGTCTGGCTGGACATTTCGGGCAAGAC
AAAACATTCGAGACAATTTCCAAGAGGTATTATTGGCCACAACTCAGGAGAGATTCAAACAATTTCATTAAAAGATGTCCTATATGCCAAAGAGCTAAAGGCTCAAGTAC
TAATACTGGGTTATACTCTCCACTGCCCATCCCAACTTCTATTTGGGAAGACTTATCAGTAGACTTCGTGGTAGGATTACCAAATACTCAAAGGCTGTATGATTCAATCA
TGGTTTTAGTAGACAGATTCAGTAAAATGGCCCACTTTATTGCGTGCAAAAAGACGAATGATGCAATCTATATAGCCAATCTGTTTTTCAGAGAAATAGTACGCCTACAT
GGAGTACCAAAAACAATTGTGTCTGATCGGGATGTGAAATTCTTAAGCCACTTTTGGAGGACCCTATGGAAGAAACTTGACACGACTCTGAAATTCAATACTACTGCACA
CCCACAAACAGATGGACAGACCGAGGTCACGAATAGAACACTAGGCAACCTAATACGTTTTCTTAGTGGGACTAAACTGAAGCAGTGGGATTTGGCACTTGCCCAGGCCG
AGTTTGCATTCAACAATATGAAGAATAGAGCAACTAACAGATGCCCATTCGAAGTTGTATATACTAAACAACCACGATTAACGTTTGACCTTGCCACACTCCCTACAGTC
GTGGACATTAATGATTAA
Protein sequenceShow/hide protein sequence
MSPEEYKILHDHIEELLRKGHIKPSLSPCVVPALLTPKKDGSWRMCVDSRAINRITVKYRFPIPRIGDLLDQLGKATIFSKIDLKSGYHQIRIRLGDEWKTAFKTNEGLF
EWMFIVVYFDDILVYNTNYDEHILHLRKLFQVLTETELYINSKKCTFFRREIAFLGFIIKQGSIGMEPKKVEAIHTWRTPTSFKEIQAFLGLASFYRRFIRNFSSLQDSF
DDIKRRLTSSPVLQLPDFTLPFEVAVDACGTGIGAVLSQRSHPIEYFSEKLSSSRQSWSTYVQELYALVRALKQWEHYLLCKEFILLTDHFSLKYLQSRRTISRMHARWI
SFLQRFDFVIKHQSGTENKVADAVSRKSSLLTLLSSEVVAFKHLPDLYEEDTDFSEVWYKCTNYIKAEDFHILEGFLFKGEQLCIPHTSLREALLKEAHSGGLAGHFGQD
KTFETISKRYYWPQLRRDSNNFIKRCPICQRAKGSSTNTGLYSPLPIPTSIWEDLSVDFVVGLPNTQRLYDSIMVLVDRFSKMAHFIACKKTNDAIYIANLFFREIVRLH
GVPKTIVSDRDVKFLSHFWRTLWKKLDTTLKFNTTAHPQTDGQTEVTNRTLGNLIRFLSGTKLKQWDLALAQAEFAFNNMKNRATNRCPFEVVYTKQPRLTFDLATLPTV
VDIND