; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0039515 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0039515
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationchr2:45535834..45549781
RNA-Seq ExpressionLag0039515
SyntenyLag0039515
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0016779 - nucleotidyltransferase activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
AAU90333.1 Putative gag and pol polyprotein, identical [Solanum demissum]9.0e-17153.72Show/hide
Query:  STIEKPNE-NVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQE-------KINKHEKND----------------
        +TIEK  +  + DLNKPFRF +     WK       +  K   V T      +E E+   +  ++ +E       K  K +KN+                
Subjt:  STIEKPNE-NVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQE-------KINKHEKND----------------

Query:  --------------------FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGE
                            F  FRK+ P PQANVTEEPFVA+ITDI MV++V+GWW DSGANRHVCYDKDWFK YT F+EPK IMLGD+HTTQ++G G+
Subjt:  --------------------FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGE

Query:  VELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK-------------------------------------------
        VEL  +SGR L LKDVL+TPSMRKNLMSS+L NK GFKQ IESDQYVI KKGIFVGK                                           
Subjt:  VELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK-------------------------------------------

Query:  -------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAE
                                                         VENQF RKIKRIRSDRGREYES EFNS+V SLGIIHETTPPYSP+SNG AE
Subjt:  -------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAE

Query:  RKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYR
        RKNRTL+ LTNAML+ES A  N WGE +LTAC+VLNRVPHKK+K+T F+LWKGYKP+LGYLRVWGCLAFVRL DPK  KLG K TTC FLGYA NSTAYR
Subjt:  RKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYR

Query:  FLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKF-GSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFWK
        F ++E N++ ES DAIFHE K PF S+NSGGQ  EQ       SST +  N+  +  ELRRSKRAR+ KDFGP+FYV+N+ + P++L+EALSS D+IFWK
Subjt:  FLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKF-GSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFWK

Query:  EVVNDEMESLISNKTWKLVDLPPGCKTIGCK
        E VNDEMESLISNKTWKLVDLPPGCKTIGCK
Subjt:  EVVNDEMESLISNKTWKLVDLPPGCKTIGCK

ABI34306.1 Polyprotein, putative [Solanum demissum]1.2e-22061.46Show/hide
Query:  FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTP
        F  FRK+ P PQANVTEEPFVA+ITDI MV++V+GWWADSGANRHVCYDKDWFK YT F+EPK IMLGDSHTTQ++GTG+VEL  TSGRVL LKDVL+TP
Subjt:  FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTP

Query:  SMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK---------------------------------------------------------------
        SMRK LMSS+LLNKAGFKQ IES+QYVI KKGIFVGK                                                               
Subjt:  SMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK---------------------------------------------------------------

Query:  ---------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEF
                                                                                   VENQF RKIKRIRSDRGREYES EF
Subjt:  ---------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEF

Query:  NSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTD
        NS+V SLGIIHETTPPYSP+SNGVAERKNRTL+ LTNAML+ES A  N WGEA+LTAC+VLNRVPHKK+K+TPF+LWKGYKP+LGYLRVWGCLAFVRL D
Subjt:  NSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTD

Query:  PKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQT-STKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPD
        PK  KLG K TTC FLGYA NSTAYRF ++E N++ ES DAIFHE K PF S+NSGGQ  EQ   T   SST +  N+  +  ELRRSKRARV KDFGPD
Subjt:  PKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQT-STKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPD

Query:  FYVYNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRI
        FYV+N+ +  ++L+EALSS D+IFWKE VNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQ E L+FFDTFSPVTRI
Subjt:  FYVYNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRI

Query:  TSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE
        TSIRLLIA+AAIFD+ IHQMDVKTAFLNGDL EEIYMDQPE F+E GQE+KV     ++  +K+   K+W E
Subjt:  TSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE

GEV61159.1 retrotransposon protein, putative, Ty1-copia subclass [Tanacetum cinerariifolium]5.9e-15437.91Show/hide
Query:  RKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYK
        +KIK +RS+RG EY   EF+++ E  GI HE T PY+P  NG+AERKNRTL+ + N ML +SG   NLWG+A+L ACH+ NR+  +   ++P++LW G K
Subjt:  RKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYK

Query:  PNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQ
        PNL YLRVWGCLA+ R   PK  KLG++    VF+GY  NS A   LD    VI ES D    E+K     +N         ++   +ST S   E S++
Subjt:  PNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQ

Query:  V-ELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKG
        + E RRS RAR  K                   +++SS DA  WKEA+NDE +S++ N+TWKL +LP G K IG KWV +KKL PDGSI  +KARLV KG
Subjt:  V-ELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKG

Query:  FKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFELHNVSVGLL
        + Q+E L++FDT++PVTRI+SI+ LIAI+AI D+ IHQMDVKTAFLNG L EE+YM+QPE F+  G+ENKV     ++  +K++  K+W E   +     
Subjt:  FKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFELHNVSVGLL

Query:  YPGGDRLENTCCTGDSNMESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKNDFTEFR
            + +  T C   SN +       +++G+++     K    KR   ++          Y++     KKI T+  ++                      
Subjt:  YPGGDRLENTCCTGDSNMESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKNDFTEFR

Query:  KQEPVPQANVTEEPFVAMITD----IYMVQSVEGWWADSGANRHVCYDKD-----------WFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRV
            + ++NV  EP V +  +    +  ++ +EG+   S  N   C + +           W     P   P I M  DS  T                 
Subjt:  KQEPVPQANVTEEPFVAMITD----IYMVQSVEGWWADSGANRHVCYDKD-----------WFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRV

Query:  LMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGKVENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERK
          L    ++   RK        N   +   + S         I+  +VENQ  +KIK +RSDRG EY + EF+++ E  GI HE T PY+P  NG+AERK
Subjt:  LMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGKVENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERK

Query:  NRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFL
        NRTL+ + N ML +SG   +LWGEA+L ACH+ NR+  +   ++P++LW G KPNL YLRVWGCLA+ R  +PK  KLG++    VF+GY  NS AY  L
Subjt:  NRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFL

Query:  DIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQV-ELRRSKRARVVKDFGPDFYVYNIQETP----------ISLQEALS
        D    VI ES D  F E+K     +N         ++   +ST S   E S+++ E RRS RAR  K  G  F+ Y ++ T           I++ + L 
Subjt:  DIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQV-ELRRSKRARVVKDFGPDFYVYNIQETP----------ISLQEALS

Query:  SPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCK
                  VND+ +S++ N+TWKL +LP G K IG K
Subjt:  SPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCK

PPD96412.1 hypothetical protein GOBAR_DD06575 [Gossypium barbadense]3.6e-14348.67Show/hide
Query:  HEKNDFTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKD
        H KN+   F K++   +A+   E FVAMI++I M Q    WW D+GA +HVC DK  F  +T  +   ++ +G+S T  I G G VEL+ TSG+VL L D
Subjt:  HEKNDFTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKD

Query:  VLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGI
        V + P +RKNL+S  LLNK GFK   E+D+++++K GIFVGK                        VE Q    IK +RSDRG EY +    SY E  GI
Subjt:  VLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGI

Query:  IHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSK
        +H+ + PY+P  NGVAERKNR LI                WGEAVLTACH+LNRVP+K+TKITP++ WK  KPNL YL+VWGC A V++  PKR KLG +
Subjt:  IHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSK

Query:  ATTCVFLGYAINSTAYRFLDIEH------NVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDF---
           C+F+GYA NS AYRF+ IE       N + ES DAIF E +    SR    Q    +S +        +N + S  ELRRSKR + VKDFGPDF   
Subjt:  ATTCVFLGYAINSTAYRFLDIEH------NVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDF---

Query:  --------------YVYNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKED
                      Y YN +  PI+ +EA+ S D+ FWKE +NDEM+S++ N+TW LVDLPPG K IGCKW+ +KK+K DG+IDK+KARLVAKGF Q++ 
Subjt:  --------------YVYNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKED

Query:  LDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKV
        +D+FDT++PV RI +IRLLI++ +I+++ +HQMDVKTAFLNG+LEEE+YM+QPE F+ PGQE+KV
Subjt:  LDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKV

RVW67328.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]3.6e-14349.46Show/hide
Query:  QSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITK
        ++ + WW D+GA RH+C +K  F  Y P ++ + + +G+S ++++ G G+V LK T G+ L L DVLH P +RKNL+S  LL+K GFK    SD++V+TK
Subjt:  QSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITK

Query:  KGIFVGK---------------------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKN
          +FVGK                                       VENQ S+KIK IRSDRG EYES  F  +    GIIH+TT PYSP SNG+A+RKN
Subjt:  KGIFVGK---------------------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKN

Query:  RTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFL-
        RTL  + NAMLL SG   NLWGEA+L+A ++LN++PHKKT  TP++LWKG+KP   YL+VWGCLA V +  PK+ K+  K   C+F+GYA NS+AYRFL 
Subjt:  RTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFL-

Query:  ------DIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQ---VELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSP
              D+  N I ES +A F E   P+   +            FG++    N E+  +    + RR KRAR    FGPDF  Y ++  P + +EA+SSP
Subjt:  ------DIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQ---VELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSP

Query:  DAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQM
        +A +WKE +N E+ES++ N TW+LVDLPPG KT+GCKW+ +KK+K DGSIDKYKARLVAKG+KQ+E LD+FDT+SPV+RITSIR+LIAIAAI +  IHQM
Subjt:  DAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQM

Query:  DVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE
        DVKT FLNG+L+EEIYMDQPE FI PGQE KV     ++  +K+   K+W E
Subjt:  DVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE

TrEMBL top hitse value%identityAlignment
A0A7N2L531 Uncharacterized protein1.4e-23048.3Show/hide
Query:  MESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKND----------------------
        MES+ EKPNE VGDLNKPFRFK AHFKRWKGKVLF L+LLK++Y+LT+KNP K+ T+ M+ EE++ HQEKI+K+ K++                      
Subjt:  MESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKND----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------------FTE
                                                                                                         F +
Subjt:  -------------------------------------------------------------------------------------------------FTE

Query:  FRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMR
        FRK+E VPQANVTEEP VAMITDI MVQ VEGWWADSGANRHVCYDK+WFK+YTPF+E K +MLGDS  T+++G+GEVELK TSGRVL LKDVL+TPSMR
Subjt:  FRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMR

Query:  KNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK------------------------------------------------------------------
        KNLMSS+LLNKAGFKQT+ESD YVITKKG+FVGK                                                                  
Subjt:  KNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK------------------------------------------------------------------

Query:  -------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNS
                                                                                 VENQF RKIKRIRSDRGREYES  FNS
Subjt:  -------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNS

Query:  YVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPK
        + +SLGIIHETT PYSPASNGVAERKNRTLI LTNAML+ESGA  + WGEA+LTACHVLNRVPHKK+  TPF++WKG+KPNLGYLR W CLA+VRLTDPK
Subjt:  YVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPK

Query:  RPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYV
         PKLG +ATTC FLGYAINS AYRF D+E+ +IFES DAIFHEEK PFK +NSGG+  E   ++  SST    N+ + ++E RRSKRARV KDFGPD+YV
Subjt:  RPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYV

Query:  YNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSI
        +NI+E P +L+EAL+SPDAIFWKE VNDEMESLISN+TWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDK+KARLVAKGFKQK DLDFFDTFSPVTRITSI
Subjt:  YNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSI

Query:  RLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE
        RLLIAIAAIFD+ IHQMDVKTAFLNGDLEEEIYMDQPE F+EPGQE+KV     ++  +K+   K+W E
Subjt:  RLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE

A0A7N2N1S1 Integrase catalytic domain-containing protein5.2e-24961.15Show/hide
Query:  SNMESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKND--------------------
        ++MES+ EKPNE VGDLNKPFRFK AHFKRWKGKVLF L+LLK++Y+LT KNP K+ T+ M+ EE++ HQEKI+K+ K++                    
Subjt:  SNMESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKND--------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIM
                                          F +FRK+E VPQANVTEEP VAMITDI MVQ VEGWWADSGANRHVCYDK+WFK+YTPF+E K IM
Subjt:  ----------------------------------FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIM

Query:  LGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGKVENQFSRKIKRIRSDRGREYESFEFNSYV
        LG+S  T+++G+GEVELK TSGRVL LKDVL+TPSMRKNLMSS+LLNKAGFKQT+ESD YVITKKG+FVGK       KIKRIRSDRG EYES  FNS+ 
Subjt:  LGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGKVENQFSRKIKRIRSDRGREYESFEFNSYV

Query:  ESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRP
        +SLGIIHETT PYSPASNGVAERKNRTLI LTNAML+ESGA  + WGEA+LTACHVLNRVPHKK+  TPF++WKG+KPNLGYLRVWGCLA+VRLTDPK P
Subjt:  ESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRP

Query:  KLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYN
        KLG +ATTC FLGYAINS AYRF D+E+ +IFES DAIFHEEK PFK +NSGG+  E   ++  SST    N+ + ++E RRSKRARV KDFGPD+YV+N
Subjt:  KLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYN

Query:  IQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRL
        I+E P +L+EAL+SPDAIFWKE VNDEMESLISN+TWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDK+KARLVAKGFKQK DLDFFDTFSPVTRITSIRL
Subjt:  IQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRL

Query:  LIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFS
        LIAIAAIFD+ IHQMDVKTAFLNGDLEEEIYMDQPE F+EPGQE+K ++
Subjt:  LIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFS

A0A7N2R9F3 Uncharacterized protein5.6e-22748.47Show/hide
Query:  MESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKND----------------------
        ME T EKPNE +GDLNKPFRFK AHFKRWKGKVLF L+LLK+AY+LT+KNP K+ T+ M+ EE++ HQEKI+K+ K++                      
Subjt:  MESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKND----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------------------------------------------FTE
                                                                                                         F +
Subjt:  -------------------------------------------------------------------------------------------------FTE

Query:  FRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMR
        FRK+E VPQ NVTEEP VA+ITDI MVQ VEGWWAD GANRHVCYDK+WFK+YTPF+E K IMLGDS  T+++G+GEVELK TSGRVL LKDV +TPSMR
Subjt:  FRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMR

Query:  KNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK------------------------------------------------------------------
        KNLMSS+LLNKAGFKQT+ESD YVITKKG+FVGK                                                                  
Subjt:  KNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK------------------------------------------------------------------

Query:  -------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNS
                                                                                 VENQF RKIKRIRSDRGREYES  FNS
Subjt:  -------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNS

Query:  YVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPK
        +V+SLGIIHETT PYSPASNGV ERKNRTLI LTNAML+ESGA  + WGEA+LTACHVLNRVPHKK+  TPF++WKG+KPNLGYLRVWGCLA+VRLTDPK
Subjt:  YVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPK

Query:  RPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYV
         PKLG +ATTC FLGYAINS AYRF D+E+ +IFES DAIFHEEK PFK +NSGG+  E    +  SST    N+ + ++ELRRSKRARV KDFGPD+YV
Subjt:  RPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYV

Query:  YNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSI
        +NI+E P +L+EAL+S DAIFWKE VNDEMESLISN+TWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDK+KARLVA GFKQK DLDFFDTFSPVTRITSI
Subjt:  YNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSI

Query:  RLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKV
        RLLIAIAAIFD+ IHQMDVKTAFLNG++EEEIYMDQPE F+EPGQE+KV
Subjt:  RLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKV

Q0KIN7 Polyprotein, putative6.0e-22161.46Show/hide
Query:  FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTP
        F  FRK+ P PQANVTEEPFVA+ITDI MV++V+GWWADSGANRHVCYDKDWFK YT F+EPK IMLGDSHTTQ++GTG+VEL  TSGRVL LKDVL+TP
Subjt:  FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTP

Query:  SMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK---------------------------------------------------------------
        SMRK LMSS+LLNKAGFKQ IES+QYVI KKGIFVGK                                                               
Subjt:  SMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK---------------------------------------------------------------

Query:  ---------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEF
                                                                                   VENQF RKIKRIRSDRGREYES EF
Subjt:  ---------------------------------------------------------------------------VENQFSRKIKRIRSDRGREYESFEF

Query:  NSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTD
        NS+V SLGIIHETTPPYSP+SNGVAERKNRTL+ LTNAML+ES A  N WGEA+LTAC+VLNRVPHKK+K+TPF+LWKGYKP+LGYLRVWGCLAFVRL D
Subjt:  NSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTD

Query:  PKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQT-STKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPD
        PK  KLG K TTC FLGYA NSTAYRF ++E N++ ES DAIFHE K PF S+NSGGQ  EQ   T   SST +  N+  +  ELRRSKRARV KDFGPD
Subjt:  PKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQT-STKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPD

Query:  FYVYNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRI
        FYV+N+ +  ++L+EALSS D+IFWKE VNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQ E L+FFDTFSPVTRI
Subjt:  FYVYNIQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRI

Query:  TSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE
        TSIRLLIA+AAIFD+ IHQMDVKTAFLNGDL EEIYMDQPE F+E GQE+KV     ++  +K+   K+W E
Subjt:  TSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFE

Q60D13 Putative gag and pol polyprotein, identical4.4e-17153.72Show/hide
Query:  STIEKPNE-NVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQE-------KINKHEKND----------------
        +TIEK  +  + DLNKPFRF +     WK       +  K   V T      +E E+   +  ++ +E       K  K +KN+                
Subjt:  STIEKPNE-NVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQE-------KINKHEKND----------------

Query:  --------------------FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGE
                            F  FRK+ P PQANVTEEPFVA+ITDI MV++V+GWW DSGANRHVCYDKDWFK YT F+EPK IMLGD+HTTQ++G G+
Subjt:  --------------------FTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGE

Query:  VELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK-------------------------------------------
        VEL  +SGR L LKDVL+TPSMRKNLMSS+L NK GFKQ IESDQYVI KKGIFVGK                                           
Subjt:  VELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGK-------------------------------------------

Query:  -------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAE
                                                         VENQF RKIKRIRSDRGREYES EFNS+V SLGIIHETTPPYSP+SNG AE
Subjt:  -------------------------------------------------VENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAE

Query:  RKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYR
        RKNRTL+ LTNAML+ES A  N WGE +LTAC+VLNRVPHKK+K+T F+LWKGYKP+LGYLRVWGCLAFVRL DPK  KLG K TTC FLGYA NSTAYR
Subjt:  RKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYR

Query:  FLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKF-GSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFWK
        F ++E N++ ES DAIFHE K PF S+NSGGQ  EQ       SST +  N+  +  ELRRSKRAR+ KDFGP+FYV+N+ + P++L+EALSS D+IFWK
Subjt:  FLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKF-GSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFWK

Query:  EVVNDEMESLISNKTWKLVDLPPGCKTIGCK
        E VNDEMESLISNKTWKLVDLPPGCKTIGCK
Subjt:  EVVNDEMESLISNKTWKLVDLPPGCKTIGCK

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.1e-4127.03Show/hide
Query:  FVGKVENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHK---K
        FV K E  F+ K+  +  D GREY S E   +    GI +  T P++P  NGV+ER  RT+      M+  +    + WGEAVLTA +++NR+P +    
Subjt:  FVGKVENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHK---K

Query:  TKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGY---------------------------AINSTAYRFLDI-------EHNV
        +  TP+++W   KP L +LRV+G   +V + + K+ K   K+   +F+GY                            +NS A +F  +         N 
Subjt:  TKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGY---------------------------AINSTAYRFLDI-------EHNV

Query:  IFESSDAIFHEEKLPFK------------SRNSGGQNSEQTSTKF---------------------------------------------GSSTPSDNNE
         F +      + + P +            S+ S  +N    S K                                              GS  P+++ E
Subjt:  IFESSDAIFHEEKLPFK------------SRNSGGQNSEQTSTKF---------------------------------------------GSSTPSDNNE

Query:  NSSQVEL------------------RRSKRARV-------VKDFGPDFYVYN----IQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPP
        + +   L                  RRS+R +         +D   +  V N      + P S  E     D   W+E +N E+ +   N TW +   P 
Subjt:  NSSQVEL------------------RRSKRARV-------VKDFGPDFYVYN----IQETPISLQEALSSPDAIFWKEVVNDEMESLISNKTWKLVDLPP

Query:  GCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPE
            +  +WV   K    G+  +YKARLVA+GF QK  +D+ +TF+PV RI+S R ++++   +++ +HQMDVKTAFLNG L+EEIYM  P+
Subjt:  GCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMDQPE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.2e-6936.28Show/hide
Query:  FVGKVENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI
        F   VE +  RK+KR+RSD G EY S EF  Y  S GI HE T P +P  NGVAER NRT++    +ML  +    + WGEAV TAC+++NR P      
Subjt:  FVGKVENQFSRKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI

Query:  -TPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEK---------------------LP
          P ++W   + +  +L+V+GC AF  +   +R KL  K+  C+F+GY      YR  D     +  S D +F E +                     +P
Subjt:  -TPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEK---------------------LP

Query:  FKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVE------------------LRRSKRARVVKDFGP--DFYVYNIQETPISLQEALSSPDAIFWKEVVN
          S N     S           P +  E   Q++                  LRRS+R RV     P  ++ + +    P SL+E LS P+     + + 
Subjt:  FKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVE------------------LRRSKRARVVKDFGP--DFYVYNIQETPISLQEALSSPDAIFWKEVVN

Query:  DEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGD
        +EMESL  N T+KLV+LP G + + CKWV + K   D  + +YKARLV KGF+QK+ +DF + FSPV ++TSIR ++++AA  D+ + Q+DVKTAFL+GD
Subjt:  DEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGD

Query:  LEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWF
        LEEEIYM+QPE F   G+++ V     ++  +K+   ++W+
Subjt:  LEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-0628.43Show/hide
Query:  WWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFV
        W  D+ A+ H    +D F  Y    +   + +G++  ++I G G++ +K   G  L+LKDV H P +R NL+S   L++ G++    + ++ +TK  + +
Subjt:  WWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFV

Query:  GK
         K
Subjt:  GK

P92520 Uncharacterized mitochondrial protein AtMg008204.8e-1844.76Show/hide
Query:  IQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRL
        I++ P S+  AL  P    W +A+ +E+++L  NKTW LV  P     +GCKWV + KL  DG++D+ KARLVAKGF Q+E + F +T+SPV R  +IR 
Subjt:  IQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRL

Query:  LIAIA
        ++ +A
Subjt:  LIAIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-3925.88Show/hide
Query:  KIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI-TPFQLWKGYK
        +I    SD G E+ +     Y    GI H T+PP++P  NG++ERK+R ++     +L  +      W  A   A +++NR+P    ++ +PFQ   G  
Subjt:  KIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI-TPFQLWKGYK

Query:  PNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKS-----------------------------
        PN   LRV+GC  +  L    + KL  K+  CVFLGY++  +AY  L ++ + ++ S    F E   PF +                             
Subjt:  PNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKS-----------------------------

Query:  -------------------------RNSGGQNSE--------------------------------QTSTKFGSSTPSDNNENSSQVELRR---------
                                 RNS   +S                                 QT T    +T  +N  N S  +L +         
Subjt:  -------------------------RNSGGQNSE--------------------------------QTSTKFGSSTPSDNNENSSQVELRR---------

Query:  --------SKRARVVKDFGPDFYVY----------NIQETPISLQE-----------------------ALSSPDAIF-------WKEAVNDEMESLISN
                S  +       P   ++          N  + P++                          A S P           W+ A+  E+ + I N
Subjt:  --------SKRARVVKDFGPDFYVY----------NIQETPISLQE-----------------------ALSSPDAIF-------WKEAVNDEMESLISN

Query:  KTWKLVDLPPGCKTI-GCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMD
         TW LV  PP   TI GC+W+  KK   DGS+++YKARLVAKG+ Q+  LD+ +TFSPV + TSIR+++ +A      I Q+DV  AFL G L +++YM 
Subjt:  KTWKLVDLPPGCKTI-GCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEIYMD

Query:  QPECFIEPGQENKVFSPRVTVKIMKKINTKEWFELHN
        QP  FI+  + N V   R  +  +K+     + EL N
Subjt:  QPECFIEPGQENKVFSPRVTVKIMKKINTKEWFELHN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-3925.23Show/hide
Query:  KIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI-TPFQLWKGYK
        +I  + SD G E+       Y+   GI H T+PP++P  NG++ERK+R ++ +   +L  +      W  A   A +++NR+P    ++ +PFQ   G  
Subjt:  KIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI-TPFQLWKGYK

Query:  PNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTP---------
        PN   L+V+GC  +  L    R KL  K+  C F+GY++  +AY  L I    ++ S    F E   PF + N G   S++  +    + P         
Subjt:  PNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTP---------

Query:  -------------------------------SDNNENSSQVELRRSKRARVVKDFGP--------------DFYVYN-----------------IQETPI
                                       S +N  SS +    S         GP              +  + N                 + ++PI
Subjt:  -------------------------------SDNNENSSQVELRRSKRARVVKDFGP--------------DFYVYN-----------------IQETPI

Query:  S--------------------------LQEALSSPDAI--------------------------------------------------FWKEAVNDEMES
        S                          L   L +P  I                                                   W++A+  E+ +
Subjt:  S--------------------------LQEALSSPDAI--------------------------------------------------FWKEAVNDEMES

Query:  LISNKTWKLVDLPPGCKTI-GCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEE
         I N TW LV  PP   TI GC+W+  KK   DGS+++YKARLVAKG+ Q+  LD+ +TFSPV + TSIR+++ +A      I Q+DV  AFL G L +E
Subjt:  LISNKTWKLVDLPPGCKTI-GCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEE

Query:  IYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFEL
        +YM QP  F++  + + V   R  +  +K+     + EL
Subjt:  IYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFEL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.4e-3441.61Show/hide
Query:  IFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDV
        + W  A++DE+ ++ +  TW++  LPP  K IGCKWV + K   DG+I++YKARLVAKG+ Q+E +DF +TFSPV ++TS++L++AI+AI++  +HQ+D+
Subjt:  IFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDV

Query:  KTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKI-----NTKEWFELHNVSV
          AFLNGDL+EEIYM  P  +    ++     P     + K I      +++WF   +V++
Subjt:  KTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKI-----NTKEWFELHNVSV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.2e-0532Show/hide
Query:  NRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI-TPFQLWKGYKPNLGYLRVWGCLAFVRLTDPK
        NRT+I    +ML E G       +A  TA H++N+ P        P ++W    P   YLR +GC+A++   + K
Subjt:  NRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKI-TPFQLWKGYKPNLGYLRVWGCLAFVRLTDPK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.4e-1944.76Show/hide
Query:  IQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRL
        I++ P S+  AL  P    W +A+ +E+++L  NKTW LV  P     +GCKWV + KL  DG++D+ KARLVAKGF Q+E + F +T+SPV R  +IR 
Subjt:  IQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRL

Query:  LIAIA
        ++ +A
Subjt:  LIAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
AGAAAAATTAAGAGAATTAGAAGTGACAGAGGTCGAGAATATGAATCATTTGAATTCAATTCATATGTTGAATCATTGGGAATAATTCATGAGACTACTCCACCATATTC
ACCTGCTTCTAATGGAGTAGCTGAGAGGAAGAATAGAACCTTAATTGGTCTAACTAATGCTATGCTATTAGAATCAGGTGCATCTTTTAATCTTTGGGGTGAAGCTGTTT
TAACTGCTTGTCATGTGTTAAATAGAGTGCCACATAAAAAGACAAAGATTACACCTTTTCAATTGTGGAAAGGGTATAAGCCAAATTTGGGATATTTAAGGGTTTGGGGA
TGTTTAGCTTTTGTAAGGCTAACAGATCCAAAAAGGCCTAAGTTAGGGTCTAAAGCTACCACTTGTGTTTTTCTTGGATATGCAATTAATAGTACAGCCTATCGATTTCT
TGATATTGAACATAACGTGATTTTTGAATCAAGTGATGCAATTTTTCATGAAGAAAAATTGCCTTTCAAGTCAAGAAATAGTGGGGGACAAAATTCTGAACAAACTTCTA
CTAAGTTTGGTTCTTCTACTCCTAGTGATAATAATGAAAATAGTTCTCAAGTAGAATTAAGGAGGAGTAAAAGAGCTAGAGTTGTGAAAGATTTTGGTCCTGATTTTTAT
GTTTATAATATTCAGGAAACACCAATTTCTTTGCAAGAAGCTTTATCATCTCCAGATGCAATTTTCTGGAAAGAGGCTGTTAATGATGAAATGGAATCACTAATTTCTAA
CAAAACTTGGAAACTAGTTGATCTACCACCAGGATGTAAAACTATAGGGTGTAAATGGGTCCTGAGAAAAAAACTGAAACCGGATGGTAGTATAGACAAGTATAAAGCTA
GACTTGTTGCAAAAGGTTTTAAACAAAAGGAAGATTTAGACTTCTTTGATACCTTTTCTCCGGTAACGCGAATCACATCCATTAGATTGTTAATTGCAATTGCTGCAATT
TTTGATATGTGCATTCATCAAATGGATGTGAAAACAGCTTTTCTCAATGGTGATTTAGAAGAAGAAATATATATGGACCAACCTGAATGTTTTATAGAACCTGGGCAAGA
GAACAAAGTGTTCTCACCAAGGGTTACAGTGAAGATAATGAAGAAGATCAACACCAAAGAGTGGTTTGAGCTACACAACGTTTCAGTTGGGTTGTTGTATCCTGGAGGGG
ACAGGCTTGAGAATACCTGCTGCACCGGAGATTCAAACATGGAGTCTACTATTGAGAAACCAAATGAGAATGTTGGAGATCTCAACAAGCCCTTTCGATTTAAGGAAGCA
CACTTTAAGAGATGGAAAGGAAAAGTGTTGTTCAACCTTAATCTTCTCAAGCTGGCCTATGTCCTCACAGAAAAGAACCCGAAGAAGATCGAGACTGAAAGCATGAACGT
TGAAGAGTTCATGGAACACCAAGAGAAAATTAATAAACATGAGAAAAATGATTTTACAGAATTCCGGAAACAAGAACCTGTACCTCAGGCCAATGTTACCGAAGAGCCAT
TTGTGGCTATGATTACTGATATTTATATGGTTCAAAGTGTTGAAGGATGGTGGGCTGATTCTGGAGCCAATAGACATGTCTGTTATGACAAAGATTGGTTCAAAGTTTAT
ACTCCTTTTAAGGAGCCTAAAATAATAATGCTTGGTGATTCACATACTACCCAAATTATGGGAACTGGTGAGGTTGAATTAAAATGCACCTCTGGAAGGGTTTTAATGTT
GAAAGATGTGCTTCATACACCCTCCATGAGGAAGAATTTAATGTCGAGTTATCTTCTTAATAAAGCTGGCTTTAAGCAAACTATAGAATCTGATCAATATGTGATAACTA
AAAAGGGAATTTTTGTTGGAAAAGTTGAAAATCAATTCAGTAGAAAAATTAAGAGAATTAGAAGTGACAGAGGTCGAGAATATGAATCATTTGAATTCAATTCATATGTT
GAATCATTGGGAATAATTCATGAGACTACTCCACCATATTCACCTGCTTCTAATGGAGTAGCTGAGAGGAAGAATAGAACCTTAATTGGTCTAACTAATGCTATGCTATT
AGAATCAGGTGCATCTTTTAATCTTTGGGGTGAAGCTGTTTTAACTGCTTGTCATGTGTTAAATAGAGTGCCACATAAAAAGACAAAGATTACACCTTTTCAATTGTGGA
AAGGGTATAAGCCAAATTTGGGATATTTAAGGGTTTGGGGATGTTTAGCTTTTGTAAGGCTAACAGATCCAAAAAGGCCTAAGTTAGGGTCTAAAGCTACCACTTGTGTT
TTTCTTGGATATGCAATTAATAGTACAGCCTATCGATTTCTTGATATTGAACATAACGTGATTTTTGAATCAAGTGATGCAATTTTTCATGAAGAAAAATTGCCTTTCAA
GTCAAGAAATAGTGGGGGACAAAATTCTGAACAAACTTCTACTAAGTTTGGTTCTTCTACTCCTAGTGATAATAATGAAAATAGTTCTCAAGTAGAATTAAGGAGGAGTA
AAAGAGCTAGAGTTGTGAAAGATTTTGGTCCTGATTTTTATGTTTATAATATTCAGGAAACACCAATTTCTTTGCAAGAAGCTTTATCATCTCCAGATGCAATTTTCTGG
AAAGAGGTTGTTAATGATGAAATGGAATCACTAATTTCTAACAAAACTTGGAAACTAGTTGATCTACCACCAGGATGTAAAACTATAGGGTGTAAATGGGTCCTGAGAAA
AAAACTGAAACCGGATGGTAGTATAGACAAGTATAAAGCTAGACTTGTTGCAAAAGGTTTTAAACAAAAGGAAGATTTAGACTTCTTTGATACCTTTTCTCCGGTAACGC
GAATCACATCCATTAGATTGTTAATTGCAATTGCTGCAATTTTTGATATGTGCATTCATCAAATGGATGTGAAAACAGCTTTTCTCAATGGTGATTTAGAAGAAGAAATA
TATATGGACCAACCTGAATGTTTTATAGAACCTGGGCAAGAGAACAAAGTGTTCTCACCAAGGGTTACAGTGAAGATAATGAAGAAGATCAACACCAAAGAGTGGTTTGA
GCTACACAACGTTTCAGTTGGGTTGTTGTATCCTGGAGGGGACAGGCTTGAGAATACCTGCTGCACCGGAGATTCAAACATGGAGTCTACTATTGAGAAACCAAATGAGA
ATGTTGGAGATCTCAACAAGCCCTTTCGATTTAAGGAAGCACACTTTAAGAGATGGAAAGGAAAAGTGTTGTTCAACCTTAATCTTCTCAAGCTGGCCTATGTCCTCACA
GAAAAGAACCCGAAGAAGATCGAGACTGAAAGCATGAACGTTGAAGAGTTCATGGAACACCAAGAGAAAATTAATAAACATGAGAAAAATGATTTTACAGAATTCCGGAA
ACAAGAACCTGTACCTCAGGCCAATGTTACCGAAGAGCCATTTGTGGCTATGATTACTGATATTTATATGGTTCAAAGTGTTGAAGGATGGTGGGCTGATTCTGGAGCCA
ATAGACATGTCTGTTATGACAAAGATTGGTTCAAAGTTTATACTCCTTTTAAGGAGCCTAAAATAATAATGCTTGGTGATTCACATACTACCCAAATTATGGGAACTGGT
GAGGTTGAATTAAAATGCACCTCTGGAAGGGTTTTAATGTTGAAAGATGTGCTTCATACACCCTCCATGAGGAAGAATTTAATGTCGAGTTATCTTCTTAATAAAGCTGG
CTTTAAGCAAACTATAGAATCTGATCAATATGTGATAACTAAAAAGGGAATTTTTGTTGGAAAAGGTTATGCTTGTGATGGAATAGCTTGGCAAAGGGGACTTATCATGA
CGACGGTTTGCTCAAAAGGCATTGAAGGATTTACTCTTTGGGCAAAGAAGATCAAGATTATCTTGATTGCTCAAAAGGCATTGAAGGCTTTGGATGATCCTAAAACACTC
CCGGACACTTTAACAATGAACAAAAACAGACCATGGAAGAAATTGCATTCAAAGCCCTCATACTCAACATTCCAGATAACATAA
mRNA sequenceShow/hide mRNA sequence
AGAAAAATTAAGAGAATTAGAAGTGACAGAGGTCGAGAATATGAATCATTTGAATTCAATTCATATGTTGAATCATTGGGAATAATTCATGAGACTACTCCACCATATTC
ACCTGCTTCTAATGGAGTAGCTGAGAGGAAGAATAGAACCTTAATTGGTCTAACTAATGCTATGCTATTAGAATCAGGTGCATCTTTTAATCTTTGGGGTGAAGCTGTTT
TAACTGCTTGTCATGTGTTAAATAGAGTGCCACATAAAAAGACAAAGATTACACCTTTTCAATTGTGGAAAGGGTATAAGCCAAATTTGGGATATTTAAGGGTTTGGGGA
TGTTTAGCTTTTGTAAGGCTAACAGATCCAAAAAGGCCTAAGTTAGGGTCTAAAGCTACCACTTGTGTTTTTCTTGGATATGCAATTAATAGTACAGCCTATCGATTTCT
TGATATTGAACATAACGTGATTTTTGAATCAAGTGATGCAATTTTTCATGAAGAAAAATTGCCTTTCAAGTCAAGAAATAGTGGGGGACAAAATTCTGAACAAACTTCTA
CTAAGTTTGGTTCTTCTACTCCTAGTGATAATAATGAAAATAGTTCTCAAGTAGAATTAAGGAGGAGTAAAAGAGCTAGAGTTGTGAAAGATTTTGGTCCTGATTTTTAT
GTTTATAATATTCAGGAAACACCAATTTCTTTGCAAGAAGCTTTATCATCTCCAGATGCAATTTTCTGGAAAGAGGCTGTTAATGATGAAATGGAATCACTAATTTCTAA
CAAAACTTGGAAACTAGTTGATCTACCACCAGGATGTAAAACTATAGGGTGTAAATGGGTCCTGAGAAAAAAACTGAAACCGGATGGTAGTATAGACAAGTATAAAGCTA
GACTTGTTGCAAAAGGTTTTAAACAAAAGGAAGATTTAGACTTCTTTGATACCTTTTCTCCGGTAACGCGAATCACATCCATTAGATTGTTAATTGCAATTGCTGCAATT
TTTGATATGTGCATTCATCAAATGGATGTGAAAACAGCTTTTCTCAATGGTGATTTAGAAGAAGAAATATATATGGACCAACCTGAATGTTTTATAGAACCTGGGCAAGA
GAACAAAGTGTTCTCACCAAGGGTTACAGTGAAGATAATGAAGAAGATCAACACCAAAGAGTGGTTTGAGCTACACAACGTTTCAGTTGGGTTGTTGTATCCTGGAGGGG
ACAGGCTTGAGAATACCTGCTGCACCGGAGATTCAAACATGGAGTCTACTATTGAGAAACCAAATGAGAATGTTGGAGATCTCAACAAGCCCTTTCGATTTAAGGAAGCA
CACTTTAAGAGATGGAAAGGAAAAGTGTTGTTCAACCTTAATCTTCTCAAGCTGGCCTATGTCCTCACAGAAAAGAACCCGAAGAAGATCGAGACTGAAAGCATGAACGT
TGAAGAGTTCATGGAACACCAAGAGAAAATTAATAAACATGAGAAAAATGATTTTACAGAATTCCGGAAACAAGAACCTGTACCTCAGGCCAATGTTACCGAAGAGCCAT
TTGTGGCTATGATTACTGATATTTATATGGTTCAAAGTGTTGAAGGATGGTGGGCTGATTCTGGAGCCAATAGACATGTCTGTTATGACAAAGATTGGTTCAAAGTTTAT
ACTCCTTTTAAGGAGCCTAAAATAATAATGCTTGGTGATTCACATACTACCCAAATTATGGGAACTGGTGAGGTTGAATTAAAATGCACCTCTGGAAGGGTTTTAATGTT
GAAAGATGTGCTTCATACACCCTCCATGAGGAAGAATTTAATGTCGAGTTATCTTCTTAATAAAGCTGGCTTTAAGCAAACTATAGAATCTGATCAATATGTGATAACTA
AAAAGGGAATTTTTGTTGGAAAAGTTGAAAATCAATTCAGTAGAAAAATTAAGAGAATTAGAAGTGACAGAGGTCGAGAATATGAATCATTTGAATTCAATTCATATGTT
GAATCATTGGGAATAATTCATGAGACTACTCCACCATATTCACCTGCTTCTAATGGAGTAGCTGAGAGGAAGAATAGAACCTTAATTGGTCTAACTAATGCTATGCTATT
AGAATCAGGTGCATCTTTTAATCTTTGGGGTGAAGCTGTTTTAACTGCTTGTCATGTGTTAAATAGAGTGCCACATAAAAAGACAAAGATTACACCTTTTCAATTGTGGA
AAGGGTATAAGCCAAATTTGGGATATTTAAGGGTTTGGGGATGTTTAGCTTTTGTAAGGCTAACAGATCCAAAAAGGCCTAAGTTAGGGTCTAAAGCTACCACTTGTGTT
TTTCTTGGATATGCAATTAATAGTACAGCCTATCGATTTCTTGATATTGAACATAACGTGATTTTTGAATCAAGTGATGCAATTTTTCATGAAGAAAAATTGCCTTTCAA
GTCAAGAAATAGTGGGGGACAAAATTCTGAACAAACTTCTACTAAGTTTGGTTCTTCTACTCCTAGTGATAATAATGAAAATAGTTCTCAAGTAGAATTAAGGAGGAGTA
AAAGAGCTAGAGTTGTGAAAGATTTTGGTCCTGATTTTTATGTTTATAATATTCAGGAAACACCAATTTCTTTGCAAGAAGCTTTATCATCTCCAGATGCAATTTTCTGG
AAAGAGGTTGTTAATGATGAAATGGAATCACTAATTTCTAACAAAACTTGGAAACTAGTTGATCTACCACCAGGATGTAAAACTATAGGGTGTAAATGGGTCCTGAGAAA
AAAACTGAAACCGGATGGTAGTATAGACAAGTATAAAGCTAGACTTGTTGCAAAAGGTTTTAAACAAAAGGAAGATTTAGACTTCTTTGATACCTTTTCTCCGGTAACGC
GAATCACATCCATTAGATTGTTAATTGCAATTGCTGCAATTTTTGATATGTGCATTCATCAAATGGATGTGAAAACAGCTTTTCTCAATGGTGATTTAGAAGAAGAAATA
TATATGGACCAACCTGAATGTTTTATAGAACCTGGGCAAGAGAACAAAGTGTTCTCACCAAGGGTTACAGTGAAGATAATGAAGAAGATCAACACCAAAGAGTGGTTTGA
GCTACACAACGTTTCAGTTGGGTTGTTGTATCCTGGAGGGGACAGGCTTGAGAATACCTGCTGCACCGGAGATTCAAACATGGAGTCTACTATTGAGAAACCAAATGAGA
ATGTTGGAGATCTCAACAAGCCCTTTCGATTTAAGGAAGCACACTTTAAGAGATGGAAAGGAAAAGTGTTGTTCAACCTTAATCTTCTCAAGCTGGCCTATGTCCTCACA
GAAAAGAACCCGAAGAAGATCGAGACTGAAAGCATGAACGTTGAAGAGTTCATGGAACACCAAGAGAAAATTAATAAACATGAGAAAAATGATTTTACAGAATTCCGGAA
ACAAGAACCTGTACCTCAGGCCAATGTTACCGAAGAGCCATTTGTGGCTATGATTACTGATATTTATATGGTTCAAAGTGTTGAAGGATGGTGGGCTGATTCTGGAGCCA
ATAGACATGTCTGTTATGACAAAGATTGGTTCAAAGTTTATACTCCTTTTAAGGAGCCTAAAATAATAATGCTTGGTGATTCACATACTACCCAAATTATGGGAACTGGT
GAGGTTGAATTAAAATGCACCTCTGGAAGGGTTTTAATGTTGAAAGATGTGCTTCATACACCCTCCATGAGGAAGAATTTAATGTCGAGTTATCTTCTTAATAAAGCTGG
CTTTAAGCAAACTATAGAATCTGATCAATATGTGATAACTAAAAAGGGAATTTTTGTTGGAAAAGGTTATGCTTGTGATGGAATAGCTTGGCAAAGGGGACTTATCATGA
CGACGGTTTGCTCAAAAGGCATTGAAGGATTTACTCTTTGGGCAAAGAAGATCAAGATTATCTTGATTGCTCAAAAGGCATTGAAGGCTTTGGATGATCCTAAAACACTC
CCGGACACTTTAACAATGAACAAAAACAGACCATGGAAGAAATTGCATTCAAAGCCCTCATACTCAACATTCCAGATAACATAA
Protein sequenceShow/hide protein sequence
RKIKRIRSDRGREYESFEFNSYVESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWG
CLAFVRLTDPKRPKLGSKATTCVFLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFY
VYNIQETPISLQEALSSPDAIFWKEAVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAI
FDMCIHQMDVKTAFLNGDLEEEIYMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFELHNVSVGLLYPGGDRLENTCCTGDSNMESTIEKPNENVGDLNKPFRFKEA
HFKRWKGKVLFNLNLLKLAYVLTEKNPKKIETESMNVEEFMEHQEKINKHEKNDFTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVY
TPFKEPKIIMLGDSHTTQIMGTGEVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGKVENQFSRKIKRIRSDRGREYESFEFNSYV
ESLGIIHETTPPYSPASNGVAERKNRTLIGLTNAMLLESGASFNLWGEAVLTACHVLNRVPHKKTKITPFQLWKGYKPNLGYLRVWGCLAFVRLTDPKRPKLGSKATTCV
FLGYAINSTAYRFLDIEHNVIFESSDAIFHEEKLPFKSRNSGGQNSEQTSTKFGSSTPSDNNENSSQVELRRSKRARVVKDFGPDFYVYNIQETPISLQEALSSPDAIFW
KEVVNDEMESLISNKTWKLVDLPPGCKTIGCKWVLRKKLKPDGSIDKYKARLVAKGFKQKEDLDFFDTFSPVTRITSIRLLIAIAAIFDMCIHQMDVKTAFLNGDLEEEI
YMDQPECFIEPGQENKVFSPRVTVKIMKKINTKEWFELHNVSVGLLYPGGDRLENTCCTGDSNMESTIEKPNENVGDLNKPFRFKEAHFKRWKGKVLFNLNLLKLAYVLT
EKNPKKIETESMNVEEFMEHQEKINKHEKNDFTEFRKQEPVPQANVTEEPFVAMITDIYMVQSVEGWWADSGANRHVCYDKDWFKVYTPFKEPKIIMLGDSHTTQIMGTG
EVELKCTSGRVLMLKDVLHTPSMRKNLMSSYLLNKAGFKQTIESDQYVITKKGIFVGKGYACDGIAWQRGLIMTTVCSKGIEGFTLWAKKIKIILIAQKALKALDDPKTL
PDTLTMNKNRPWKKLHSKPSYSTFQIT