; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh06G009190 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh06G009190
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionReverse transcriptase
Genome locationCmo_Chr06:6530444..6533332
RNA-Seq ExpressionCmoCh06G009190
SyntenyCmoCh06G009190
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7615234.1 Reverse transcriptase domain [Arabidopsis thaliana x Arabidopsis arenosa]6.5e-24258.36Show/hide
Query:  SLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCR
        ++PS    LLQ++ D+F +E P  LPP+ GIEH+IDF+PG  +PNRPAYRTNP E +E+++QV+EL+ +G++RES+S C+VPV+LVPKKDGSWRMC+DCR
Subjt:  SLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCR

Query:  AINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------------
        AIN IT+KYRHPIPRLDDMLDELHG S+F+K+DLKSGYHQI M  GDEWKTAFKT+ GLYEWLVMPFGLTNAPSTFMRLMNH+LR ++            
Subjt:  AINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------------

Query:  ---------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------------
                                                     VS++GV+VDEEKVK I++WP+ K++                              
Subjt:  ---------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------------

Query:  ---NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKE
           NV F W + QE AF  LKEKL+ AP+L+LP+F  TFEIECDASGVGIGAVLMQ+++P+ +FSEKL GA+L YPTYDKE YALVRALQTWQHYLWPKE
Subjt:  ---NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKE

Query:  FIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYS
        F+IHTDHESLKHL+ Q KLN++HA+W+EFIETF YVIKYK+ K+N+VADALSR       REVVRLHG+PK+IVSDRD KFLS+FW+ LW KLGTKL++S
Subjt:  FIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYS

Query:  TTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKK
        TTCHPQTDGQTEVV RT++ +LRA+I KN KTWEDCLP +EFAYN  +HS +K +PF+IVYGFNP TP+DL+P+   E V+ D   K E V ++H+Q KK
Subjt:  TTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKK

Query:  QIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQ
         IE++  + A    K RK  IF  GD VW+H RKERFP  RKSKL+ R DGPF+VL+ IN+NAY +DL GKY VS +FNV DL  F + D  D R+NP Q
Subjt:  QIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQ

Query:  EGENDM
         GE+D+
Subjt:  EGENDM

OWM74668.1 hypothetical protein CDL15_Pgr005248 [Punica granatum]1.1e-24950.11Show/hide
Query:  YFTNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGS
        Y +N  N  LPS    LLQEF+D+F +  P  LPP+RGIEH+IDFIPG PIPNRPAYR NP+EA+E+Q+QV ELL KGYVRES+S CSVPV+LVPKKDG+
Subjt:  YFTNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGS

Query:  WRMCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL----
        WRMC+DCRA+NKIT+KYR+PIPRLDDMLDELHG ++F+KIDLKSGYHQI M  GDEWKTAFKTK GLYEWLVMPFGLTNAPSTFMRLMNH+LR Y+    
Subjt:  WRMCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL----

Query:  -----------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI----------------------
                                                             VSS GVEVDEEKVKAI++WPT   I                      
Subjt:  -----------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI----------------------

Query:  -----------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTW
                    V F WGK+QE AFNTLKEKLSSAPLL LP+F   FEIECDASG+GIGAVLMQ +RP+ +FSEKL GA+L Y TYDKE YALVRAL+TW
Subjt:  -----------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTW

Query:  QHYLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRR--------------------------------------
        QHYLW KEFIIHTDHESLKHL+ Q+KLNR+H +W+EFIE F YVI+YK+ KEN+VADALSRR                                      
Subjt:  QHYLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRR--------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------EVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEV
                                                        EVVRLHGIP++IVSDRDVKFLSHFWRVLWGKLGTKL++STTCHPQTDGQTEV
Subjt:  ------------------------------------------------EVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEV

Query:  VKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRI
        V RT+  +LRA+I +N K+WEDC+PFIEFAYNR +HS+TK +PFE+VYGFNPLTP+DL P+   E V+ D   K E V K+H++ +  I  +N + ATR 
Subjt:  VKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRI

Query:  KKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND-----MNH
         KGRK   F+PGDWVWVHFRKERF ++RKSKL PRGDGPFQVLE INDNAYK+DLPG+Y VS+TFNV DLS FDV  G DSRTNP +EG ND       H
Subjt:  KKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND-----MNH

Query:  DQRISIPQGPITRTRAKKLQ---QTLYIYIQAMVSSSKKILEDAGDLL
        + ++ I  GPITR RAKKL+   Q L+  I     SS+    D+G +L
Subjt:  DQRISIPQGPITRTRAKKLQ---QTLYIYIQAMVSSSKKILEDAGDLL

PSS05945.1 Integrase [Actinidia chinensis var. chinensis]1.1e-24648.64Show/hide
Query:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR
        TN L+  +PS    LLQEFED+F  EMP  LPP+RGIEH+IDF+PG  IPNRPAYR+NP E +E+QRQVSELL KGYVRES+S C+VPV+LVPKKDG+WR
Subjt:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR

Query:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------
        MC+DCRAIN IT+KYRHPIPRLDDMLDELHG  +F+KIDLKSGYHQI M  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILR ++      
Subjt:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------

Query:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWP---TLKNI---------------------
                                                           VS+ G+ VD EK++AI++WP   T+ N+                     
Subjt:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWP---TLKNI---------------------

Query:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH
                 NV F WG +QE AF  +K++L++APLL+LPNF   FEIECDASG+GIGAVLMQ  RP+ +FSEKL+GA+L YPTYDKE YALVRAL+TW+H
Subjt:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH

Query:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRR----------------------------------------
        YLW +EF+IHTDHESLKHL+ Q+KLN++HA+W+EFIETF YVI+YKQ KEN+VADALSRR                                        
Subjt:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRR----------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------EVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVK
                                                      E+VRLHG+P++IVSDRD KFLS+FW+ LWGKLGTKL++STTCHPQTDGQTEVV 
Subjt:  ----------------------------------------------EVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVK

Query:  RTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKK
        RT++ +LRAII KN KTWEDCLP +EFAYNR VHS TK +PFEIVYGFNPLTP+DL P+   E VN D   K E V ++H++ K  IE++  + A +  K
Subjt:  RTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKK

Query:  GRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND-------MNH
        GRK  +F+PGDWVW+H RKERFPT+RKSKLLPRGDGPFQVLE INDNAYK+DLPG+Y VSATFN+ DLS F VGD LD RTNP QE END         +
Subjt:  GRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND-------MNH

Query:  DQRISIPQGPITRTRAKKLQQTLYIYIQ---AMVSSSKKILEDAGDLLYMLCKVEVQE
           I +P GPITR RAKK Q  L   IQ   A  S  + I  D  ++  ++  ++VQE
Subjt:  DQRISIPQGPITRTRAKKLQQTLYIYIQ---AMVSSSKKILEDAGDLLYMLCKVEVQE

TYK02449.1 F15O4.13 [Cucumis melo var. makuwa]3.4e-24348.35Show/hide
Query:  QEFEDLFSKE-MPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKITIKY
        +EF D+F  E  P+ LPPLRGIEH+IDFIPG  +PN  AYRTNP E +EIQRQV EL+ KGY+RES+S CSVPVILVPKKDG+WRMC+DCRAINKIT+KY
Subjt:  QEFEDLFSKE-MPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKITIKY

Query:  RHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREY----------------------
        RHPIPRLDDMLDELHG +LF+KIDLKSGYHQI MH+GDEWKTAFKTK+GLYEWLVMPFGLTNAPSTFMRLMNH+L+EY                      
Subjt:  RHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREY----------------------

Query:  -----------------------------------LVSSNGVEVDEEKVKAIKDWPTLKNI---------------------------------NVSFIW
                                           +V  +GV+VDEEKVKAI++WPT  N                                  +V F W
Subjt:  -----------------------------------LVSSNGVEVDEEKVKAIKDWPTLKNI---------------------------------NVSFIW

Query:  GKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIHTDHES
         + QE AFN LKEKL  AP LALPNF+ +FEIECDASG+GIGAVLMQ ++P+MFFSEKL GA L Y TYDKE +ALVRAL+ WQHYLWPKEF+IHTDHES
Subjt:  GKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIHTDHES

Query:  LKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR--------------------------------------------------------
        LKHL+ Q KLN++HAKW+EFIETF YVI YK+ K+N+VADALSR                                                        
Subjt:  LKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR--------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMTAMLRAIIDK
                                        REVVRLHGIPK+IVSDRDVKFLSHFW+VLWGKLGTKL++STTCHPQTDGQTEVV RT+ A+LR++I K
Subjt:  --------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMTAMLRAIIDK

Query:  NHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKIAIFKPGDWV
        N K+WE+ LPF+EFAYNR +HSTT C+PFE+VYGFNPLTP+DL P+    F +  A+++VE++  LHK++K++IEK+N K+ TR  +GRK  IFKPGDWV
Subjt:  NHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKIAIFKPGDWV

Query:  WVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVG-DGLDSRTNPSQEGENDMNH-DQRISIPQGPITRTRAKK
        WVH RKERFP +RKSKL  +GDGPFQVLE IN+NAYK+DL GKY VS+TFNV DL+ FDVG + LD RTNPS+E  +D++  +  + +P+GPIT+ +AKK
Subjt:  WVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVG-DGLDSRTNPSQEGENDMNH-DQRISIPQGPITRTRAKK

Query:  LQQTLYIYIQAMVSSSKKILEDAGDLLYMLCKVEVQE
        +Q+   +++Q + ++ ++        LY +     +E
Subjt:  LQQTLYIYIQAMVSSSKKILEDAGDLLYMLCKVEVQE

TYK22420.1 Transposon Ty3-I Gag-Pol polyprotein [Cucumis melo var. makuwa]2.6e-24348.5Show/hide
Query:  EFEDLFSKE-MPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKITIKYR
        EF D+F  E  P+ LPPLRGIEH+IDFIPG  +PN  AYRTNP E +EIQRQV EL+ KGY+RES+S CSVPVILVPKKDG+WRMC+DCRAINKIT+KYR
Subjt:  EFEDLFSKE-MPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKITIKYR

Query:  HPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREY-----------------------
        HPIPRLDDMLDELHG +LF+KIDLKSGYHQI MH+GDEWKTAFKTK+GLYEWLVMPFGLTNAPSTFMRLMNH+L+EY                       
Subjt:  HPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREY-----------------------

Query:  ----------------------------------LVSSNGVEVDEEKVKAIKDWPTLKNI---------------------------------NVSFIWG
                                          +V  +GV+VDEEKVKAI++WPT  N                                  +V F W 
Subjt:  ----------------------------------LVSSNGVEVDEEKVKAIKDWPTLKNI---------------------------------NVSFIWG

Query:  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIHTDHESL
        + QE AFN LKEKL  AP LALPNF+ +FEIECDASG+GIGAVLMQ ++P+MFFSEKL GA L Y TYDKE +ALVRAL+ WQHYLWPKEF+IHTDHESL
Subjt:  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIHTDHESL

Query:  KHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------------------
        KHL+ Q KLN++HAKW+EFIETF YVI YK+ K+N+VADALSR                                                         
Subjt:  KHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMTAMLRAIIDKN
                                       REVVRLHGIPK+IVSDRDVKFLSHFW+VLWGKLGTKL++STTCHPQTDGQTEVV RT+ A+LR++I KN
Subjt:  -------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMTAMLRAIIDKN

Query:  HKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVW
         K+WE+ LPF+EFAYNR +HSTT C+PFE+VYGFNPLTP+DL P+    F +  A+++VE++  LHK++K++IEK+N K+ TR  +GRK  IFKPGDWVW
Subjt:  HKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVW

Query:  VHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVG-DGLDSRTNPSQEGENDMNH-DQRISIPQGPITRTRAKKL
        VH RKERFP +RKSKL  RGDGPFQVLE IN+NAYK+DL GKY VS+TFNV DL+ FDVG + LD RTNPS+E  +D++  +  + +P+GPIT+ +AKK+
Subjt:  VHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVG-DGLDSRTNPSQEGENDMNH-DQRISIPQGPITRTRAKKL

Query:  QQTLYIYIQAMVSSSKKILEDAGDLLYMLCKVEVQE
        Q+   +++Q + ++ ++        LY +     +E
Subjt:  QQTLYIYIQAMVSSSKKILEDAGDLLYMLCKVEVQE

TrEMBL top hitse value%identityAlignment
A0A2N9G001 Integrase catalytic domain-containing protein1.2e-25754.75Show/hide
Query:  NPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCID
        N SL S    LLQEFED+F KEMP+ LPP+RGIEH+IDF+PG  IPNRPAYR+NP+E +E+QRQV +L++KGYVRES+S C+VPV+LVPKKDG+WRMC+D
Subjt:  NPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCID

Query:  CRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYLVSSNGVEVDE
        CRAIN IT+KYRHPIPRLDDMLDELHG  +F+KIDLKSGYHQI M  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH+LR        +EVDE
Subjt:  CRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYLVSSNGVEVDE

Query:  EKVKAIKDWPTLKNI--NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALV
        EKVKAIK+WPT K+I  NV F WG DQ+ AF T+KE+L SAP+LALPNF  TFEIECDASG+GIGAVLMQ++RP+ FFSEKL+GASL+YPTYDKE YALV
Subjt:  EKVKAIKDWPTLKNI--NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALV

Query:  RALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------
        RAL+TWQHYLWP+EF+IHTDHESLKHL+ Q KLN++HA+WLE+IETF YVI+YKQ KENIVADALSR                                 
Subjt:  RALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQT
                                                             RE+VRLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL++STTCHPQT
Subjt:  -----------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQT

Query:  DGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNF
        DGQTEVV RT+T +LR ++ KN KTWEDCLPFIEFAYNR +HSTT  +PFEIVYGFNPLTP+DL+P+      + D   K E V  LH++V+ QI ++N 
Subjt:  DGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNF

Query:  KVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGENDMN
        +VA++  KGR+  IF+PGDWVWVH RKERFP  RK+KL PRGDGPFQ+LE INDNAYK+DLPG+Y VSATFNV DLS FDVG+  DS +NP +E  ND N
Subjt:  KVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGENDMN

Query:  H-----DQRISIPQGPITRTRAKKLQQTLYIYIQAMVSSSKK
                 + +P GPITR+RAKK+++ +   +Q+    + K
Subjt:  H-----DQRISIPQGPITRTRAKKLQQTLYIYIQAMVSSSKK

A0A2N9GQ39 Uncharacterized protein9.7e-26054.41Show/hide
Query:  NPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCID
        N SLPS    LLQEFED+F +EMP+ LPP+RGIEH+IDF+PG  IPNRPAYR+NP+E +E+QRQV +L++KGYVRES+S C+VPV+LVPKKDG+WRMC+D
Subjt:  NPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCID

Query:  CRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILRE------------
        CRAIN IT+KYRHPIPRLDDMLDELHG  +F+KIDLKSGYHQI M  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH+LR             
Subjt:  CRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILRE------------

Query:  ---------------------------------------------YLVSSNGVEVDEEKVKAIKDWPTLKNI----------------------------
                                                     Y+V++ G+EVDEEKVKAIK+WPT K+I                            
Subjt:  ---------------------------------------------YLVSSNGVEVDEEKVKAIKDWPTLKNI----------------------------

Query:  -----NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWP
             NV F WG DQ+ AF T+KE+L SAP+LALPNF   FEIECDASG+GIGAVLMQ++RP+ FFSEKL+GASL+YPTYDKE YALVRAL+TWQHYLWP
Subjt:  -----NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWP

Query:  KEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------
        +EF+IHTDHESLKHL+ Q KLN++HA+WLE+IETF YVI+YKQ KENIVADALSR                                             
Subjt:  KEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------

Query:  --------------------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLV
                                                                      RE+VRLHG+P+SIVSDRDVKFLS+FW+VLWGKLGTKL+
Subjt:  --------------------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLV

Query:  YSTTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQV
        +STTCHPQTDGQTEVV RT+T +LR ++ KN KTWEDCLPFIEFAYNR +HSTT  +PFEIVYGFNPLTP+DL+P+      + D   K E V  LH++V
Subjt:  YSTTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQV

Query:  KKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNP
        + QI ++N +VA++  KGR+  IF+PGDWVWVH RKERFP  RK+KL PRGDGPFQ+LE INDNAYK+DLPG+Y VSATFNV DLS FDVG+  DS +NP
Subjt:  KKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNP

Query:  SQEGENDMNH-----DQRISIPQGPITRTRAKKLQQTLYIYIQAMVSSSKK
         +E  ND N         + +P GPITR+RAKK+++ +   +Q+    + K
Subjt:  SQEGENDMNH-----DQRISIPQGPITRTRAKKLQQTLYIYIQAMVSSSKK

A0A2N9HVZ7 Reverse transcriptase4.1e-25855.16Show/hide
Query:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR
        TN L+ +LPS    LLQE+ED+F +E P  LPP+RGIEH+IDF+PG  IPNRPAYR+NP+E +E+QRQVSELL KG+VRES+S C+VPV+LVPKKDG+WR
Subjt:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR

Query:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------
        MC+DCRAIN IT+KYRHPIPRLDDMLDELHG  +F+KIDLKSGYHQI +  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH+LR ++      
Subjt:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------

Query:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------
                                                           VS+ G++VDEEKV+AI+DWP+  ++                        
Subjt:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------

Query:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH
                 NV F WG++QE AF  +KEKL++APLL+LPNF  TFEIECDASGVGIGAVLMQ  RP+ +FSEKL+GA+L YPTYDKE YALVRAL+TWQH
Subjt:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH

Query:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-----------------------------------------
        YLWPKEF+IHTDHESLKHL+ Q+KLN++HA+W+EFIETF YVI+YKQ KEN+VADALSR                                         
Subjt:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-----------------------------------------

Query:  -----------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMT
                                                 RE+VRLHG+P++IVSDRD KFLS+FW+ LW KLGTKL++STTCHPQTDGQTEVV RT++
Subjt:  -----------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMT

Query:  AMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKI
         +LRAII KN KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPLTP+DL P+   E VN D   K +FV ++H++ +  IE++  + AT+  KGR+ 
Subjt:  AMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKI

Query:  AIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND----MNHDQRISIP
         +F+PGDWVW+H RKERFP +R+SKLLPRGDGPFQVLE INDNAYK+DLPG+Y VSATFNV DLS FDVGD  D R NP QE  ND          + +P
Subjt:  AIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND----MNHDQRISIP

Query:  QGPITRTRAKKLQQTLYIYIQAM
         GP+TR RAKK +  L   IQ +
Subjt:  QGPITRTRAKKLQQTLYIYIQAM

A0A2N9IKQ6 Reverse transcriptase4.1e-25855.16Show/hide
Query:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR
        TN L+ +LPS    LLQE+ED+F +E P  LPP+RGIEH+IDF+PG  IPNRPAYR+NP+E +E+QRQVSELL KG+VRES+S C+VPV+LVPKKDG+WR
Subjt:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR

Query:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------
        MC+DCRAIN IT+KYRHPIPRLDDMLDELHG  +F+KIDLKSGYHQI +  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH+LR ++      
Subjt:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------

Query:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------
                                                           VS+ G++VDEEKV+AI+DWP+  ++                        
Subjt:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------

Query:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH
                 NV F WG++QE AF  +KEKL++APLL+LPNF  TFEIECDASGVGIGAVLMQ  RP+ +FSEKL+GA+L YPTYDKE YALVRAL+TWQH
Subjt:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH

Query:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-----------------------------------------
        YLWPKEF+IHTDHESLKHL+ Q+KLN++HA+W+EFIETF YVI+YKQ KEN+VADALSR                                         
Subjt:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-----------------------------------------

Query:  -----------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMT
                                                 RE+VRLHG+P++IVSDRD KFLS+FW+ LW KLGTKL++STTCHPQTDGQTEVV RT++
Subjt:  -----------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMT

Query:  AMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKI
         +LRAII KN KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPLTP+DL P+   E VN D   K +FV ++H++ +  IE++  + AT+  KGR+ 
Subjt:  AMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKI

Query:  AIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND----MNHDQRISIP
         +F+PGDWVW+H RKERFP +R+SKLLPRGDGPFQVLE INDNAYK+DLPG+Y VSATFNV DLS FDVGD  D R NP QE  ND          + +P
Subjt:  AIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND----MNHDQRISIP

Query:  QGPITRTRAKKLQQTLYIYIQAM
         GP+TR RAKK +  L   IQ +
Subjt:  QGPITRTRAKKLQQTLYIYIQAM

A0A2N9IM79 Reverse transcriptase4.1e-25855.16Show/hide
Query:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR
        TN L+ +LPS    LLQE+ED+F +E P  LPP+RGIEH+IDF+PG  IPNRPAYR+NP+E +E+QRQVSELL KG+VRES+S C+VPV+LVPKKDG+WR
Subjt:  TNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWR

Query:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------
        MC+DCRAIN IT+KYRHPIPRLDDMLDELHG  +F+KIDLKSGYHQI +  GDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNH+LR ++      
Subjt:  MCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYL------

Query:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------
                                                           VS+ G++VDEEKV+AI+DWP+  ++                        
Subjt:  ---------------------------------------------------VSSNGVEVDEEKVKAIKDWPTLKNI------------------------

Query:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH
                 NV F WG++QE AF  +KEKL++APLL+LPNF  TFEIECDASGVGIGAVLMQ  RP+ +FSEKL+GA+L YPTYDKE YALVRAL+TWQH
Subjt:  ---------NVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQH

Query:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-----------------------------------------
        YLWPKEF+IHTDHESLKHL+ Q+KLN++HA+W+EFIETF YVI+YKQ KEN+VADALSR                                         
Subjt:  YLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR-----------------------------------------

Query:  -----------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMT
                                                 RE+VRLHG+P++IVSDRD KFLS+FW+ LW KLGTKL++STTCHPQTDGQTEVV RT++
Subjt:  -----------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMT

Query:  AMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKI
         +LRAII KN KTWE+CLP +EFAYNR VHS TK +PFEIVYGFNPLTP+DL P+   E VN D   K +FV ++H++ +  IE++  + AT+  KGR+ 
Subjt:  AMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKI

Query:  AIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND----MNHDQRISIP
         +F+PGDWVW+H RKERFP +R+SKLLPRGDGPFQVLE INDNAYK+DLPG+Y VSATFNV DLS FDVGD  D R NP QE  ND          + +P
Subjt:  AIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGEND----MNHDQRISIP

Query:  QGPITRTRAKKLQQTLYIYIQAM
         GP+TR RAKK +  L   IQ +
Subjt:  QGPITRTRAKKLQQTLYIYIQAM

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.4e-5130.21Show/hide
Query:  NMLNPSLPSDFF--------------VLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAE-EIQRQVSELLAKGYVRESLSLCS
        N ++P L SD +               LLQ++ D+   E    L      +H I+     P+ ++ +Y   P+  E E++ Q+ ++L +G +R S S  +
Subjt:  NMLNPSLPSDFF--------------VLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAE-EIQRQVSELLAKGYVRESLSLCS

Query:  VPVILVPKKDGS-----WRMCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPST
         P+ +VPKK  +     +R+ ID R +N+IT+  RHPIP +D++L +L  C+ FT IDL  G+HQI M      KTAF TK+G YE+L MPFGL NAP+T
Subjt:  VPVILVPKKDGS-----WRMCIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPST

Query:  FMRLMNHILRE---------------------------------------------------------YLVSSNGVEVDEEKVKAIKDWP----------
        F R MN ILR                                                          ++++ +G++ + EK++AI+ +P          
Subjt:  FMRLMNHILRE---------------------------------------------------------YLVSSNGVEVDEEKVKAIKDWP----------

Query:  -------------------------TLKNINVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASL
                                   KN+ +      + + AF  LK  +S  P+L +P+F   F +  DAS V +GAVL Q+  PL + S  L    +
Subjt:  -------------------------TLKNINVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASL

Query:  RYPTYDKEFYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR
         Y T +KE  A+V A +T++HYL  + F I +DH+ L  L      N +  +W   +  F + IKY + KEN VADALSR
Subjt:  RYPTYDKEFYALVRALQTWQHYLWPKEFIIHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR

P0CT41 Transposon Tf2-12 polyprotein5.6e-4727.9Show/hide
Query:  PSLPSDFFVLLQEFEDLFSKEMPSSLP-PLRGIEHKIDFIPGE---PIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRM
        P LP     + +EF+D+ ++     LP P++G+E +++        PI N   Y   P + + +  ++++ L  G +RES ++ + PV+ VPKK+G+ RM
Subjt:  PSLPSDFFVLLQEFEDLFSKEMPSSLP-PLRGIEHKIDFIPGE---PIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRM

Query:  CIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILRE---------
         +D + +NK      +P+P ++ +L ++ G ++FTK+DLKS YH I +  GDE K AF+   G++E+LVMP+G++ AP+ F   +N IL E         
Subjt:  CIDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILRE---------

Query:  ------------------------------------------------YLVSSNGVEVDEEKVKAIKDWPTLKN--------------------------
                                                        Y +S  G    +E +  +  W   KN                          
Subjt:  ------------------------------------------------YLVSSNGVEVDEEKVKAIKDWPTLKN--------------------------

Query:  -------INVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ-----RPLMFFSEKLTGASLRYPTYDKEFYALVRALQ
                +V + W   Q  A   +K+ L S P+L   +F     +E DAS V +GAVL Q        P+ ++S K++ A L Y   DKE  A++++L+
Subjt:  -------INVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQ-----RPLMFFSEKLTGASLRYPTYDKEFYALVRALQ

Query:  TWQHYLWP--KEFIIHTDHESLKHLRVQNKL---NRQHAKWLEFIETFQYVIKYKQDKENIVADALSRREVVRLHGIPKSIVSDRDVKFLS
         W+HYL    + F I TDH +L   R+ N+    N++ A+W  F++ F + I Y+    N +ADALS R V     IPK    D  + F++
Subjt:  TWQHYLWP--KEFIIHTDHESLKHLRVQNKL---NRQHAKWLEFIETFQYVIKYKQDKENIVADALSRREVVRLHGIPKSIVSDRDVKFLS

P20825 Retrovirus-related Pol polyprotein from transposon 2974.9e-5132.1Show/hide
Query:  LLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKD-----GSWRMCIDCRAIN
        LL +F +L  KE    L     I+H ++     PI ++  Y        E++ QV E+L +G +RES S  + P  +VPKK        +R+ ID R +N
Subjt:  LLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKD-----GSWRMCIDCRAIN

Query:  KITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILRE-----------------
        +ITI  R+PIP +D++L +L  C  FT IDL  G+HQI M      KTAF TK G YE+L MPFGL NAP+TF R MN+ILR                  
Subjt:  KITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILRE-----------------

Query:  ----------------------------------------YLVSSNGVEVDEEKVKAIKDWP-TLKNINVSFIWG-------------------------
                                                ++V+ +G++ +  KVKAI  +P   K+  +    G                         
Subjt:  ----------------------------------------YLVSSNGVEVDEEKVKAIKDWP-TLKNINVSFIWG-------------------------

Query:  ----KDQEL----AFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFI
              Q+L    AF  LK  +   P+L LP+FE  F +  DAS + +GAVL QN  P+ F S  L    L Y   +KE  A+V A +T++HYL  ++F+
Subjt:  ----KDQEL----AFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFI

Query:  IHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRREVVRLH
        I +DH+ L+ L    +   +  +W   +  +Q+ I Y + KEN VADALSR ++   H
Subjt:  IHTDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRREVVRLH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.2e-6625.34Show/hide
Query:  LLQEFEDLFSKEMPSSLPPLRGI--EHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKIT
        L Q++ ++   ++P     +  I  +H I+  PG  +P    Y    K  +EI + V +LL   ++  S S CS PV+LVPKKDG++R+C+D R +NK T
Subjt:  LLQEFEDLFSKEMPSSLPPLRGI--EHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKIT

Query:  IKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILR---------------------
        I    P+PR+D++L  +    +FT +DL SGYHQI M   D +KTAF T  G YE+ VMPFGL NAPSTF R M    R                     
Subjt:  IKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILR---------------------

Query:  -------------------------------EYLVSSNGVEVD---EEKVKAIKDWPTLKNI------------------NVSFI-------------WG
                                       E+L  S G++     + K  AI+D+PT K +                  N S I             W 
Subjt:  -------------------------------EYLVSSNGVEVD---EEKVKAIKDWPTLKNI------------------NVSFI-------------WG

Query:  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIH
        + Q+ A   LK  L ++P+L   N ++ + +  DAS  GIGAVL  + N+  L+    +FS+ L  A   YP  + E   +++AL  +++ L  K F + 
Subjt:  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIH

Query:  TDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------------
        TDH SL  L+ +N+  R+  +WL+ + T+ + ++Y    +N+VADA+SR                                                   
Subjt:  TDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVY
                                                                     R +   HG P++I SDRDV+  +  ++ L  +LG K   
Subjt:  -------------------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVY

Query:  STTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHK---
        S+  HPQTDGQ+E   +T+  +LRA +  N + W   LP IEF YN     T   +PFEI  G+ P TP     I S + VN  +   VE    L     
Subjt:  STTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHK---

Query:  QVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDL
        Q K+Q+E    ++ T   + RK  +   GD V VH R   F      K+     GPF+V++ INDNAY++DL
Subjt:  QVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDL

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.4e-6725.28Show/hide
Query:  LLQEFEDLFSKEMPSSLPPLRGI--EHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKIT
        L Q++ ++   ++P     +  I  +H I+  PG  +P    Y    K  +EI + V +LL   ++  S S CS PV+LVPKKDG++R+C+D R +NK T
Subjt:  LLQEFEDLFSKEMPSSLPPLRGI--EHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMCIDCRAINKIT

Query:  IKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILR---------------------
        I    P+PR+D++L  +    +FT +DL SGYHQI M   D +KTAF T  G YE+ VMPFGL NAPSTF R M    R                     
Subjt:  IKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILR---------------------

Query:  -------------------------------EYLVSSNGVEVD---EEKVKAIKDWPTLKNI------------------NVSFI-------------WG
                                       E+L  S G++     + K  AI+D+PT K +                  N S I             W 
Subjt:  -------------------------------EYLVSSNGVEVD---EEKVKAIKDWPTLKNI------------------NVSFI-------------WG

Query:  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIH
        + Q+ A + LK+ L ++P+L   N ++ + +  DAS  GIGAVL  + N+  L+    +FS+ L  A   YP  + E   +++AL  +++ L  K F + 
Subjt:  KDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVL--MQNQRPLM----FFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIH

Query:  TDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------------
        TDH SL  L+ +N+  R+  +WL+ + T+ + ++Y    +N+VADA+SR                                                   
Subjt:  TDHESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSR---------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVY
                                                                     R +   HG P++I SDRDV+  +  ++ L  +LG K   
Subjt:  -------------------------------------------------------------REVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVY

Query:  STTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHK---
        S+  HPQTDGQ+E   +T+  +LRA    N + W   LP IEF YN     T   +PFEI  G+ P TP     I S + VN  +   VE    L     
Subjt:  STTCHPQTDGQTEVVKRTMTAMLRAIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHK---

Query:  QVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHF
        Q K+Q+E    ++ T   + RK  +   GD V VH R   F      K+     GPF+V++ INDNAY++DL          NV  L  F
Subjt:  QVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFRKERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHF

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTTTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCAAGGAGATGCCTAG
TAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGAGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATAC
AAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGTACGTGAAAGTTTGAGTCTTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTCTTGGCGTATGTGT
ATTGATTGTAGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGA
TTTAAAATCGGGTTATCATCAAATTCACATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAA
CTAATGCACCTAGTACATTCATGAGACTAATGAACCATATCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGAT
TGGCCTACACTAAAAAATATAAATGTATCTTTTATATGGGGAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACC
TAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCCTTAATGTTCTTTAGTGAGAAGTTGACTG
GTGCATCTTTGAGGTATCCAACTTATGACAAAGAGTTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGAT
CATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACAACATGCTAAGTGGTTAGAATTTATTGAAACATTCCAGTATGTCATAAAATATAAACAAGATAA
GGAGAACATTGTAGCAGATGCTTTATCCCGAAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGC
GTGTTTTATGGGGTAAGTTAGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAAAAGAACCATGACTGCTATGCTTAGG
GCTATTATTGATAAGAATCATAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGT
TTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATATCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAACTGCACAAGC
AAGTGAAAAAACAAATTGAGAAACAAAATTTCAAGGTTGCCACCCGAATTAAAAAAGGACGTAAGATTGCCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGA
AAAGAAAGATTTCCTACTCGAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCACATCAACGACAATGCTTATAAAATTGATTTACCAGG
TAAGTATGGTGTTAGTGCAACTTTTAATGTTGTTGATTTGAGCCATTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGA
ACCACGACCAAAGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTTTATACATTTATATTCAAGCTATGGTGAGCTCATCAAAG
AAAATTCTAGAAGACGCTGGAGACCTCCTTTATATGTTGTGCAAAGTTGAGGTTCAAGAAAGAGATGAATTAAATGCACTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCAAGGGGTCTTGTTACTTTACTAACATGCTTAACCCTTCTTTGCCTAGTGATTTTTTTGTGCTTTTGCAAGAGTTTGAAGATTTATTTTCCAAGGAGATGCCTAG
TAGTTTGCCACCACTTAGAGGGATTGAACACAAGATTGACTTCATTCCTGGCGAGCCCATTCCAAACCGACCAGCTTATAGGACTAATCCAAAGGAGGCTGAAGAGATAC
AAAGGCAAGTAAGTGAACTCCTTGCTAAAGGGTATGTACGTGAAAGTTTGAGTCTTTGTTCTGTTCCAGTTATTCTTGTACCTAAGAAAGATGGTTCTTGGCGTATGTGT
ATTGATTGTAGGGCTATAAACAAGATAACTATAAAGTATAGGCATCCAATTCCCAGATTAGATGATATGCTTGATGAATTGCATGGATGTAGTCTTTTTACTAAGATTGA
TTTAAAATCGGGTTATCATCAAATTCACATGCATATTGGGGATGAGTGGAAAACAGCTTTTAAAACCAAGTATGGTCTTTATGAATGGTTGGTTATGCCTTTTGGATTAA
CTAATGCACCTAGTACATTCATGAGACTAATGAACCATATCTTACGAGAATACTTAGTTTCATCTAATGGTGTTGAGGTTGATGAGGAGAAAGTGAAGGCTATAAAAGAT
TGGCCTACACTAAAAAATATAAATGTATCTTTTATATGGGGAAAAGATCAAGAACTTGCTTTTAATACTTTGAAAGAAAAATTGAGTTCTGCTCCCTTGCTTGCATTACC
TAATTTTGAGTCTACTTTTGAAATTGAATGTGATGCTAGTGGAGTAGGGATAGGTGCTGTATTAATGCAAAATCAAAGACCCTTAATGTTCTTTAGTGAGAAGTTGACTG
GTGCATCTTTGAGGTATCCAACTTATGACAAAGAGTTTTATGCTTTGGTTCGTGCATTGCAAACCTGGCAACATTATCTTTGGCCTAAGGAGTTCATTATTCATACGGAT
CATGAAAGTTTAAAGCATTTGAGAGTACAAAATAAACTCAACAGACAACATGCTAAGTGGTTAGAATTTATTGAAACATTCCAGTATGTCATAAAATATAAACAAGATAA
GGAGAACATTGTAGCAGATGCTTTATCCCGAAGGGAAGTTGTACGATTGCATGGCATTCCTAAAAGCATCGTTAGTGATCGTGATGTAAAATTTTTAAGCCACTTTTGGC
GTGTTTTATGGGGTAAGTTAGGAACTAAGCTAGTATATTCAACTACTTGTCATCCTCAAACGGATGGACAAACTGAAGTTGTTAAAAGAACCATGACTGCTATGCTTAGG
GCTATTATTGATAAGAATCATAAGACTTGGGAGGATTGTTTGCCATTTATAGAATTTGCATATAATAGGGTTGTTCATAGCACTACTAAATGCACACCTTTTGAAATTGT
TTATGGCTTTAATCCTTTAACCCCTATTGACTTGTTACCCATATCGTCAAAAGAGTTTGTGAATTTTGATGCAAATGCCAAGGTTGAGTTTGTTCATAAACTGCACAAGC
AAGTGAAAAAACAAATTGAGAAACAAAATTTCAAGGTTGCCACCCGAATTAAAAAAGGACGTAAGATTGCCATCTTCAAGCCAGGAGATTGGGTTTGGGTGCATTTCCGA
AAAGAAAGATTTCCTACTCGAAGAAAATCTAAGCTTTTACCACGAGGAGATGGACCTTTTCAAGTTCTTGAGCACATCAACGACAATGCTTATAAAATTGATTTACCAGG
TAAGTATGGTGTTAGTGCAACTTTTAATGTTGTTGATTTGAGCCATTTTGATGTAGGTGATGGCTTGGATTCGAGGACGAATCCTTCTCAAGAGGGGGAGAATGATATGA
ACCACGACCAAAGAATTTCCATACCTCAAGGTCCAATTACAAGGACGAGAGCCAAGAAGCTACAACAAACTTTATACATTTATATTCAAGCTATGGTGAGCTCATCAAAG
AAAATTCTAGAAGACGCTGGAGACCTCCTTTATATGTTGTGCAAAGTTGAGGTTCAAGAAAGAGATGAATTAAATGCACTTTAA
Protein sequenceShow/hide protein sequence
MCKGSCYFTNMLNPSLPSDFFVLLQEFEDLFSKEMPSSLPPLRGIEHKIDFIPGEPIPNRPAYRTNPKEAEEIQRQVSELLAKGYVRESLSLCSVPVILVPKKDGSWRMC
IDCRAINKITIKYRHPIPRLDDMLDELHGCSLFTKIDLKSGYHQIHMHIGDEWKTAFKTKYGLYEWLVMPFGLTNAPSTFMRLMNHILREYLVSSNGVEVDEEKVKAIKD
WPTLKNINVSFIWGKDQELAFNTLKEKLSSAPLLALPNFESTFEIECDASGVGIGAVLMQNQRPLMFFSEKLTGASLRYPTYDKEFYALVRALQTWQHYLWPKEFIIHTD
HESLKHLRVQNKLNRQHAKWLEFIETFQYVIKYKQDKENIVADALSRREVVRLHGIPKSIVSDRDVKFLSHFWRVLWGKLGTKLVYSTTCHPQTDGQTEVVKRTMTAMLR
AIIDKNHKTWEDCLPFIEFAYNRVVHSTTKCTPFEIVYGFNPLTPIDLLPISSKEFVNFDANAKVEFVHKLHKQVKKQIEKQNFKVATRIKKGRKIAIFKPGDWVWVHFR
KERFPTRRKSKLLPRGDGPFQVLEHINDNAYKIDLPGKYGVSATFNVVDLSHFDVGDGLDSRTNPSQEGENDMNHDQRISIPQGPITRTRAKKLQQTLYIYIQAMVSSSK
KILEDAGDLLYMLCKVEVQERDELNAL