; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG10G011900 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG10G011900
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionReverse transcriptase
Genome locationCG_Chr10:25798393..25801600
RNA-Seq ExpressionClCG10G011900
SyntenyClCG10G011900
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054966.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.9e-12336.21Show/hide
Query:  KDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNR--------------------RF----
        ++ +  R  +++KMKID+P+Y+GK NIE FLD +K+ E  F YMGT ++KKV L++LKLK GASAWWDQI  NR                    RF    
Subjt:  KDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNR--------------------RF----

Query:  ------------------------------ARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSAT
                                       R NL E +   ++WF+ GL+ D+KEKV +QP   L+EA++ A T+E  E ++++  + ++  W+   + 
Subjt:  ------------------------------ARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSAT

Query:  SRKSTAANSK-----GQSPMRHLSKTRGKKLN------------------LRCGQTGHLSNDCPQRKNLTIQEEHED-ENYS----EEDINVAQPDEGDT
        S+K+TA NSK      + P+     +  K++                    RCGQ GH SN CPQRK + + ++++D  N S    +E+  V + DEGD+
Subjt:  SRKSTAANSK-----GQSPMRHLSKTRGKKLN------------------LRCGQTGHLSNDCPQRKNLTIQEEHED-ENYS----EEDINVAQPDEGDT

Query:  LSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVT---FYLEDHGN--------
        LSC++Q+VL++P  E   QRH+LF+  CTI GKVCNVII+SGSSEN V KKLV  L LK   H  PYKI W      TL++   +     GN        
Subjt:  LSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVT---FYLEDHGN--------

Query:  ----------MITIP---------------------NTMVMPTPM-------------------SLNGWALLPTPVN--------------------SKV
                  ++  P                     N  V+  P+                   +++G   L    N                      +
Subjt:  ----------MITIP---------------------NTMVMPTPM-------------------SLNGWALLPTPVN--------------------SKV

Query:  QNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIK
        + L  K+P I+K P  LPPLRDI HNI+ + G+SFPHLPHY MSP+EY+ L   I++LL+K HI+P  S C VP+LL  +KDG+WR+ +DSRA+N+IT+K
Subjt:  QNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIK

Query:  YRFPIPRISDLLDQLAGAKVFSK-----------------------------------------------------------------------------
        YRFPIPR+SDLLDQL GA +FSK                                                                             
Subjt:  YRFPIPRISDLLDQLAGAKVFSK-----------------------------------------------------------------------------

Query:  ------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFAR
              ++F +L  N LY+N KKC+F  + I F GFII +  V +D +K+EAIK W TPT++ +VQ+FLGLASFYRKFI+N SS AAP+T CLKKG F  
Subjt:  ------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFAR

Query:  SKETLDSFNTLKEKLVNPPVLALPDFTKIFEVAL
          +  DSFN LKE L N  VL LPDF + FEVA+
Subjt:  SKETLDSFNTLKEKLVNPPVLALPDFTKIFEVAL

KAA0062943.1 Retrovirus-related Pol polyprotein from transposon 17.6 [Cucumis melo var. makuwa]5.3e-11039.11Show/hide
Query:  KKVKLISLKLKSGASAWWDQIQGNRRFARNNL----AESDSQRVAWFID-GLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNS
        KKV L++LKLK GASAWWDQ++ NR+ +   +     ++ ++R  W  +   +Q   +K   QP           S +++ +A+  +             
Subjt:  KKVKLISLKLKSGASAWWDQIQGNRRFARNNL----AESDSQRVAWFID-GLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNS

Query:  ATSRKSTAANSKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCPQRKNLTIQEEH-----EDENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRH
         T++K +    K Q+   +   + GK    RCG+  HLSN+CPQRK + + E+      E +   +E+I + + D GD +SC++Q+VL+T   E NPQRH
Subjt:  ATSRKSTAANSKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCPQRKNLTIQEEH-----EDENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRH

Query:  ALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKIWTLVTFYLEDHGNMITIPNTMV----------------------MPTPMSLN
        +LF+  CTI+GKVC+VII+SGSSEN V +KLV  L LK++ HP PYKI  +          + TIP ++V                       P    L 
Subjt:  ALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKIWTLVTFYLEDHGNMITIPNTMV----------------------MPTPMSLN

Query:  GWALLPTP-------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASR
        G  +           V  +++ L A+FP + K P  LPPL DIQH ID +PG+S P LPHYRMSP EYQ L   I++LL+K HI+P LSPC VP+LL  +
Subjt:  GWALLPTP-------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASR

Query:  KDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------------------------------------
        KD SWR+ +DSRA+NRIT+KY FPIP++ DLLDQL  A VFSK                                                         
Subjt:  KDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------------------------------------

Query:  -EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETL
         ++F +L+   LYIN KKC FL   I F GF+I +G + ++P+K+EAI+ WP PTSIKEVQ+FLGLASFY++FI+NFSS   PLT  LKK  F       
Subjt:  -EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETL

Query:  DSFNTLKEKLVNPPVLALPDFTKIFEVAL
         SF  +K +L + P+L LPDFT  FEV +
Subjt:  DSFNTLKEKLVNPPVLALPDFTKIFEVAL

TYK30863.1 transposon Ty3-I Gag-Pol polyprotein isoform X1 [Cucumis melo var. makuwa]1.2e-10637.91Show/hide
Query:  GKDNQNWRRK--NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAW-----------WDQIQG--------------
        G   Q  RR+  +D+KMKID+PTYNGK +IE FLD +K+ E  F YM   + KKV L++LKLK GASAW           + Q Q               
Subjt:  GKDNQNWRRK--NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAW-----------WDQIQG--------------

Query:  -NRRFARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSRK---------STAANSKG-----
         +R  AR NL+E++  ++A FI GL+ DIKEKV +     L+EA+S+A T+EE   ++ K  + +R+ W+ N +  +          ST+   KG     
Subjt:  -NRRFARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSRK---------STAANSKG-----

Query:  -QSPMRHLSKTRGKKLN----------LRCGQTGHLSNDCPQRKNLTIQEEHE-----DENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHAL
         ++  +  S  RGK  N           RCG+ GHLSN+C QRK + + E+ +      +   EE+  + + D+GD +SC++Q+VL+TP  ETNPQ H+L
Subjt:  -QSPMRHLSKTRGKKLN----------LRCGQTGHLSNDCPQRKNLTIQEEHE-----DENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHAL

Query:  FRICCTINGKV-----------------------CNVIIESGSS--ENMVPKKLVQHLKLKVNAHPSPYKIWTL-----VTFYLEDHGNMITI-----PN
        F+  CTINGKV                       C + +  G+S  + +V   +   +   +   P  +   TL      T+  +  G  + +      N
Subjt:  FRICCTINGKV-----------------------CNVIIESGSS--ENMVPKKLVQHLKLKVNAHPSPYKIWTL-----VTFYLEDHGNMITI-----PN

Query:  TMVMPTP------MSLNGWALLPTP--------------------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQA
        T  +         ++++G  LL                       V  +++ L A+FP + K P  LPPLRDIQH ID +P +S P+LPHYRMSP EYQ 
Subjt:  TMVMPTP------MSLNGWALLPTP--------------------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQA

Query:  LQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------
        L   I+DLL+K HI+P LSPCAVP+LL   KDGSWR+ +DSRA+NR+T KYRFPIPRI DLLDQL  A +FSK                           
Subjt:  LQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------

Query:  ---------------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLT
                       ++F +L    LYIN KKC +L   I F GF+I +G ++++P+KIEAI+  PTPTSIKEVQ+FLGLASFYR+FI+NFS   APLT
Subjt:  ---------------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLT

XP_024932991.1 uncharacterized protein LOC112492819 [Ziziphus jujuba]2.6e-10934.02Show/hide
Query:  NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNRR----------------------------------
        +D+++K+DIP ++G +NIE FLD V++VE  F+YM   E K+V+L++ K + GASAWW+Q+  NRR                                  
Subjt:  NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNRR----------------------------------

Query:  --------------------FARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKH-----KRWAGQRSNWDRN------S
                             AR NL E++ Q VA ++ GL   I+E++ + P+  L+EAV++A  IE+ +  +H      +W      +         +
Subjt:  --------------------FARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKH-----KRWAGQRSNWDRN------S

Query:  ATSRKSTAAN------SKGQSPMRHLSKTRGKKLNL---RCGQTGHLSNDCPQRKNLTIQEEHED--ENYSE--EDINVAQPDEGDTLSCVIQKVLLTPV
        A  +K+T A+      SK Q+     S    +   L   +CGQ GH SN+CP RK + I E  +D  E ++   ++  +   D+G+ + C+IQK+L +P 
Subjt:  ATSRKSTAAN------SKGQSPMRHLSKTRGKKLNL---RCGQTGHLSNDCPQRKNLTIQEEHED--ENYSE--EDINVAQPDEGDTLSCVIQKVLLTPV

Query:  TETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVTFYLEDH--------------------------
             QRH++F+  CTIN KVC VII+SGSSEN+V K LV+ LKL   +HP+PYK+ W      T VT   + H                          
Subjt:  TETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVTFYLEDH--------------------------

Query:  -----GNMIT------------------------------IPNTMVMPTPMSLNGWALLPT------------------------PVNSK----VQNLLA
              N IT                              +P + V P  +S  G  LL T                        P +S+    +  LL 
Subjt:  -----GNMIT------------------------------IPNTMVMPTPMSLNGWALLPT------------------------PVNSK----VQNLLA

Query:  KFPSI--TKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRF
        +F  I  +++P+ LPP+RDIQH ID +PG+  P+LPHYRM P E Q LQ +++DLL+K  I+  LSPCAVP+LL  +K+G WR+ IDSRA+N+IT KYRF
Subjt:  KFPSI--TKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRF

Query:  PIPRISDLLDQLAGAKVFSK--------------------------------------------------------------------------------
        PIPR+ D+LD+L+GA+VFSK                                                                                
Subjt:  PIPRISDLLDQLAGAKVFSK--------------------------------------------------------------------------------

Query:  ---EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQ-GVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKE
            VF IL  N LY+N KKCVF+   + F GFI+G+ G++ DP K+ AI++W TP+++ EV+SF GLA+FYR+F++NFSS A PLT CLKKGKF     
Subjt:  ---EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQ-GVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKE

Query:  TLDSFNTLKEKLVNPPVLALPDFTKIFEV
           SF TLK+ L   PVLALPDF KIFEV
Subjt:  TLDSFNTLKEKLVNPPVLALPDFTKIFEV

XP_040994264.1 uncharacterized protein LOC121240799 [Juglans microcarpa x Juglans regia]5.9e-10933.87Show/hide
Query:  RMMHPPQRNNPLFNNKFLLNKHKGVEIKIVINIGKDNQNWRRKN-DFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASA
        R   PPQ+++        +  +   E ++   +G  NQ  R  N +FK+KID+P +NG +++E FLD +  VE  FDYM   E ++VKL++ KL+ GASA
Subjt:  RMMHPPQRNNPLFNNKFLLNKHKGVEIKIVINIGKDNQNWRRKN-DFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASA

Query:  WWDQIQGNRR------------------------------------------------------FARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPC
        WW+Q Q NRR                                                       +RNNLAE++ Q+VA +I GL+  I++KV +  +  
Subjt:  WWDQIQGNRR------------------------------------------------------FARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPC

Query:  LAEAVSIASTIEEDEALKHKRWAG-----------QRSNWDRNSATSRKSTAAN-------------SKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCP
        L+EAV++A  IE   +    R               +SN     ++SR     N             ++G +     +K    K   RC Q GH SN+CP
Subjt:  LAEAVSIASTIEEDEALKHKRWAG-----------QRSNWDRNSATSRKSTAAN-------------SKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCP

Query:  QRKNLTIQE----EHEDENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPS
         RK++ + +      E ++ SEED    + DEGD ++CVIQ++LL P  E + QRH +F+  CT+N KVCN+II+SGS EN+V + LV  L+L    HP 
Subjt:  QRKNLTIQE----EHEDENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPS

Query:  PYKI-W------TLVT------FYLEDHGNMITIPNTMVMPTPMSLNG---------------------W-----ALLPT-------PVNSK--------
        PYKI W      T VT      F +    N +   + + M     L G                     W      LLPT        V+ K        
Subjt:  PYKI-W------TLVT------FYLEDHGNMITIPNTMVMPTPMSLNG---------------------W-----ALLPT-------PVNSK--------

Query:  ---------------------------------VQNLLAKFPSIT--KVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHI
                                         V+ LL +F  I   ++P  LPPLRDIQH ID +PG S P+LPHYRMSP+E++ LQ  ++DL++K  I
Subjt:  ---------------------------------VQNLLAKFPSIT--KVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHI

Query:  QPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK-----------------------------------------
        +  +SPCAVP+LL  +KDGSWR+ +DSRA+N+IT+KYRFPIPR++D+LD LAG+KVFSK                                         
Subjt:  QPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK-----------------------------------------

Query:  ------------------------------------------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKE
                                                  EV S+L  N LYIN KKC FL S + F GF++G +GV VD +K+  I+EWP P++I +
Subjt:  ------------------------------------------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKE

Query:  VQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETLDSFNTLKEKLVNPPVLALPDFTKIFEV
        V+SF GLA+FYR+FI+NFS+ AAP+T C+KKG+    ++   SF  +KEKL   PVLALP F K+FEV
Subjt:  VQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETLDSFNTLKEKLVNPPVLALPDFTKIFEV

TrEMBL top hitse value%identityAlignment
A0A5A7V4G7 Retrovirus-related Pol polyprotein from transposon 17.62.6e-11039.11Show/hide
Query:  KKVKLISLKLKSGASAWWDQIQGNRRFARNNL----AESDSQRVAWFID-GLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNS
        KKV L++LKLK GASAWWDQ++ NR+ +   +     ++ ++R  W  +   +Q   +K   QP           S +++ +A+  +             
Subjt:  KKVKLISLKLKSGASAWWDQIQGNRRFARNNL----AESDSQRVAWFID-GLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNS

Query:  ATSRKSTAANSKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCPQRKNLTIQEEH-----EDENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRH
         T++K +    K Q+   +   + GK    RCG+  HLSN+CPQRK + + E+      E +   +E+I + + D GD +SC++Q+VL+T   E NPQRH
Subjt:  ATSRKSTAANSKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCPQRKNLTIQEEH-----EDENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRH

Query:  ALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKIWTLVTFYLEDHGNMITIPNTMV----------------------MPTPMSLN
        +LF+  CTI+GKVC+VII+SGSSEN V +KLV  L LK++ HP PYKI  +          + TIP ++V                       P    L 
Subjt:  ALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKIWTLVTFYLEDHGNMITIPNTMV----------------------MPTPMSLN

Query:  GWALLPTP-------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASR
        G  +           V  +++ L A+FP + K P  LPPL DIQH ID +PG+S P LPHYRMSP EYQ L   I++LL+K HI+P LSPC VP+LL  +
Subjt:  GWALLPTP-------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASR

Query:  KDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------------------------------------
        KD SWR+ +DSRA+NRIT+KY FPIP++ DLLDQL  A VFSK                                                         
Subjt:  KDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------------------------------------

Query:  -EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETL
         ++F +L+   LYIN KKC FL   I F GF+I +G + ++P+K+EAI+ WP PTSIKEVQ+FLGLASFY++FI+NFSS   PLT  LKK  F       
Subjt:  -EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETL

Query:  DSFNTLKEKLVNPPVLALPDFTKIFEVAL
         SF  +K +L + P+L LPDFT  FEV +
Subjt:  DSFNTLKEKLVNPPVLALPDFTKIFEVAL

A0A5B7BER3 Uncharacterized protein8.6e-11432.48Show/hide
Query:  LGLESYSSKDEDFVGPQPHYRARMMHPP--QRNNPLFNNKFLLNKHKGVEIKIVINIGKDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFD
        LG      +D   V   P  +  M + P  +RNNP++ N    +  +  +       G+D +  +   +++MKID+P++NG ++IE FLD +  VE  FD
Subjt:  LGLESYSSKDEDFVGPQPHYRARMMHPP--QRNNPLFNNKFLLNKHKGVEIKIVINIGKDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFD

Query:  YMGTLEHKKVKLISLKLKSGASAWWDQIQGNRR------------------------------------------------------FARNNLAESDSQR
         M   + K+VKL++ KLK GASAWWDQ+Q NRR                                                       +RNNL E+++Q+
Subjt:  YMGTLEHKKVKLISLKLKSGASAWWDQIQGNRR------------------------------------------------------FARNNLAESDSQR

Query:  VAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSR------------------KSTAANSKGQ-SPMRHLSKTRG
        VA ++ GL+  I+++++++ +  L EA S+A  +E  ++ +  R      ++  +S   +                  +  A++SK Q +P+    K+  
Subjt:  VAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSR------------------KSTAANSKGQ-SPMRHLSKTRG

Query:  KKLN------LRCGQTGHLSNDCPQRKNLTIQ----------EEHEDENYSEE--DINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHALFRICCTINGK
                   RC Q GH SN+CP R+ + +           E  E+  Y +E     + + DEG+ +SCV+Q++LL P  E +PQRH +FR  CTIN K
Subjt:  KKLN------LRCGQTGHLSNDCPQRKNLTIQ----------EEHEDENYSEE--DINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHALFRICCTINGK

Query:  VCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVT---------------------------------------------------F
        VC+VII+SGSSEN+V K LV+ L+LK   HP+PYKI W      T VT                                                   F
Subjt:  VCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVT---------------------------------------------------F

Query:  YLEDHGNMITIPNT--MVMPTPMSLNGWALL-----------------------------PTPVNSKVQNLLAKFPSIT--KVPNELPPLRDIQHNIDFI
        +  D   ++ +PN     +P    + G +LL                             P  V   +Q LLA+F  IT  ++P+ LPP+RDIQH+ID +
Subjt:  YLEDHGNMITIPNT--MVMPTPMSLNGWALL-----------------------------PTPVNSKVQNLLAKFPSIT--KVPNELPPLRDIQHNIDFI

Query:  PGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK-------
        PG+S P+LPHYRMSP E + LQ  ++DL+ K  IQ  +SPCAVP+LL  +KDGSWR+ +DSRA+N+IT+KYRFPIPR++D+LD L G+K+FSK       
Subjt:  PGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK-------

Query:  ----------------------------------------------------------------------------EVFSILVSNTLYINSKKCVFLCSN
                                                                                    EV   L  + LYIN KKC FL + 
Subjt:  ----------------------------------------------------------------------------EVFSILVSNTLYINSKKCVFLCSN

Query:  IKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETLDSFNTLKEKLVNPPVLALPDFTKIF
        + F GFIIG +G+QVD +K+ AI++WPTP ++ +++SF GLA+FYR+FI+NFSS  AP+T C+KKGKF    +   SF  +KEKL   PVLALP F K+F
Subjt:  IKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETLDSFNTLKEKLVNPPVLALPDFTKIF

Query:  EV
        +V
Subjt:  EV

A0A5D3DGR0 Reverse transcriptase9.1e-12436.21Show/hide
Query:  KDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNR--------------------RF----
        ++ +  R  +++KMKID+P+Y+GK NIE FLD +K+ E  F YMGT ++KKV L++LKLK GASAWWDQI  NR                    RF    
Subjt:  KDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNR--------------------RF----

Query:  ------------------------------ARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSAT
                                       R NL E +   ++WF+ GL+ D+KEKV +QP   L+EA++ A T+E  E ++++  + ++  W+   + 
Subjt:  ------------------------------ARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSAT

Query:  SRKSTAANSK-----GQSPMRHLSKTRGKKLN------------------LRCGQTGHLSNDCPQRKNLTIQEEHED-ENYS----EEDINVAQPDEGDT
        S+K+TA NSK      + P+     +  K++                    RCGQ GH SN CPQRK + + ++++D  N S    +E+  V + DEGD+
Subjt:  SRKSTAANSK-----GQSPMRHLSKTRGKKLN------------------LRCGQTGHLSNDCPQRKNLTIQEEHED-ENYS----EEDINVAQPDEGDT

Query:  LSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVT---FYLEDHGN--------
        LSC++Q+VL++P  E   QRH+LF+  CTI GKVCNVII+SGSSEN V KKLV  L LK   H  PYKI W      TL++   +     GN        
Subjt:  LSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVT---FYLEDHGN--------

Query:  ----------MITIP---------------------NTMVMPTPM-------------------SLNGWALLPTPVN--------------------SKV
                  ++  P                     N  V+  P+                   +++G   L    N                      +
Subjt:  ----------MITIP---------------------NTMVMPTPM-------------------SLNGWALLPTPVN--------------------SKV

Query:  QNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIK
        + L  K+P I+K P  LPPLRDI HNI+ + G+SFPHLPHY MSP+EY+ L   I++LL+K HI+P  S C VP+LL  +KDG+WR+ +DSRA+N+IT+K
Subjt:  QNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIK

Query:  YRFPIPRISDLLDQLAGAKVFSK-----------------------------------------------------------------------------
        YRFPIPR+SDLLDQL GA +FSK                                                                             
Subjt:  YRFPIPRISDLLDQLAGAKVFSK-----------------------------------------------------------------------------

Query:  ------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFAR
              ++F +L  N LY+N KKC+F  + I F GFII +  V +D +K+EAIK W TPT++ +VQ+FLGLASFYRKFI+N SS AAP+T CLKKG F  
Subjt:  ------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFAR

Query:  SKETLDSFNTLKEKLVNPPVLALPDFTKIFEVAL
          +  DSFN LKE L N  VL LPDF + FEVA+
Subjt:  SKETLDSFNTLKEKLVNPPVLALPDFTKIFEVAL

A0A5D3E417 Transposon Ty3-I Gag-Pol polyprotein isoform X15.9e-10737.91Show/hide
Query:  GKDNQNWRRK--NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAW-----------WDQIQG--------------
        G   Q  RR+  +D+KMKID+PTYNGK +IE FLD +K+ E  F YM   + KKV L++LKLK GASAW           + Q Q               
Subjt:  GKDNQNWRRK--NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAW-----------WDQIQG--------------

Query:  -NRRFARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSRK---------STAANSKG-----
         +R  AR NL+E++  ++A FI GL+ DIKEKV +     L+EA+S+A T+EE   ++ K  + +R+ W+ N +  +          ST+   KG     
Subjt:  -NRRFARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSRK---------STAANSKG-----

Query:  -QSPMRHLSKTRGKKLN----------LRCGQTGHLSNDCPQRKNLTIQEEHE-----DENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHAL
         ++  +  S  RGK  N           RCG+ GHLSN+C QRK + + E+ +      +   EE+  + + D+GD +SC++Q+VL+TP  ETNPQ H+L
Subjt:  -QSPMRHLSKTRGKKLN----------LRCGQTGHLSNDCPQRKNLTIQEEHE-----DENYSEEDINVAQPDEGDTLSCVIQKVLLTPVTETNPQRHAL

Query:  FRICCTINGKV-----------------------CNVIIESGSS--ENMVPKKLVQHLKLKVNAHPSPYKIWTL-----VTFYLEDHGNMITI-----PN
        F+  CTINGKV                       C + +  G+S  + +V   +   +   +   P  +   TL      T+  +  G  + +      N
Subjt:  FRICCTINGKV-----------------------CNVIIESGSS--ENMVPKKLVQHLKLKVNAHPSPYKIWTL-----VTFYLEDHGNMITI-----PN

Query:  TMVMPTP------MSLNGWALLPTP--------------------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQA
        T  +         ++++G  LL                       V  +++ L A+FP + K P  LPPLRDIQH ID +P +S P+LPHYRMSP EYQ 
Subjt:  TMVMPTP------MSLNGWALLPTP--------------------VNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQA

Query:  LQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------
        L   I+DLL+K HI+P LSPCAVP+LL   KDGSWR+ +DSRA+NR+T KYRFPIPRI DLLDQL  A +FSK                           
Subjt:  LQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVFSK---------------------------

Query:  ---------------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLT
                       ++F +L    LYIN KKC +L   I F GF+I +G ++++P+KIEAI+  PTPTSIKEVQ+FLGLASFYR+FI+NFS   APLT
Subjt:  ---------------EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQG-VQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLT

A0A6P6GFU0 uncharacterized protein LOC1124928191.3e-10934.02Show/hide
Query:  NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNRR----------------------------------
        +D+++K+DIP ++G +NIE FLD V++VE  F+YM   E K+V+L++ K + GASAWW+Q+  NRR                                  
Subjt:  NDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNRR----------------------------------

Query:  --------------------FARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKH-----KRWAGQRSNWDRN------S
                             AR NL E++ Q VA ++ GL   I+E++ + P+  L+EAV++A  IE+ +  +H      +W      +         +
Subjt:  --------------------FARNNLAESDSQRVAWFIDGLQQDIKEKVHVQPLPCLAEAVSIASTIEEDEALKH-----KRWAGQRSNWDRN------S

Query:  ATSRKSTAAN------SKGQSPMRHLSKTRGKKLNL---RCGQTGHLSNDCPQRKNLTIQEEHED--ENYSE--EDINVAQPDEGDTLSCVIQKVLLTPV
        A  +K+T A+      SK Q+     S    +   L   +CGQ GH SN+CP RK + I E  +D  E ++   ++  +   D+G+ + C+IQK+L +P 
Subjt:  ATSRKSTAAN------SKGQSPMRHLSKTRGKKLNL---RCGQTGHLSNDCPQRKNLTIQEEHED--ENYSE--EDINVAQPDEGDTLSCVIQKVLLTPV

Query:  TETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVTFYLEDH--------------------------
             QRH++F+  CTIN KVC VII+SGSSEN+V K LV+ LKL   +HP+PYK+ W      T VT   + H                          
Subjt:  TETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKI-W------TLVTFYLEDH--------------------------

Query:  -----GNMIT------------------------------IPNTMVMPTPMSLNGWALLPT------------------------PVNSK----VQNLLA
              N IT                              +P + V P  +S  G  LL T                        P +S+    +  LL 
Subjt:  -----GNMIT------------------------------IPNTMVMPTPMSLNGWALLPT------------------------PVNSK----VQNLLA

Query:  KFPSI--TKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRF
        +F  I  +++P+ LPP+RDIQH ID +PG+  P+LPHYRM P E Q LQ +++DLL+K  I+  LSPCAVP+LL  +K+G WR+ IDSRA+N+IT KYRF
Subjt:  KFPSI--TKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRF

Query:  PIPRISDLLDQLAGAKVFSK--------------------------------------------------------------------------------
        PIPR+ D+LD+L+GA+VFSK                                                                                
Subjt:  PIPRISDLLDQLAGAKVFSK--------------------------------------------------------------------------------

Query:  ---EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQ-GVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKE
            VF IL  N LY+N KKCVF+   + F GFI+G+ G++ DP K+ AI++W TP+++ EV+SF GLA+FYR+F++NFSS A PLT CLKKGKF     
Subjt:  ---EVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQ-GVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKE

Query:  TLDSFNTLKEKLVNPPVLALPDFTKIFEV
           SF TLK+ L   PVLALPDF KIFEV
Subjt:  TLDSFNTLKEKLVNPPVLALPDFTKIFEV

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.67.7e-1943.2Show/hide
Query:  VFSILVSNTLYINSKKCVFLCSNIKFFGFII-GQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKG-KFARSKETLD
        VF  L    L +   KC FL     F G ++   G++ +P+KIEAI+++P PT  KE+++FLGL  +YRKFI NF+  A P+T CLKK  K   +    D
Subjt:  VFSILVSNTLYINSKKCVFLCSNIKFFGFII-GQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKG-KFARSKETLD

Query:  S-FNTLKEKLVNPPVLALPDFTKIF
        S F  LK  +   P+L +PDFTK F
Subjt:  S-FNTLKEKLVNPPVLALPDFTKIF

P10394 Retrovirus-related Pol polyprotein from transposon 4129.1e-2023.19Show/hide
Query:  PTPVNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDG------SWR
        P    S+++N+ +++  I  + +E   + ++      +      +  +YR   S+ + +QA +Q L++ K ++P +S    P LL  +K         WR
Subjt:  PTPVNSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDG------SWR

Query:  LGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVF-----------------SKEVFSILVSNTLY----------------------------------
        L ID R +N+  +  +FP+PRI D+LDQL  AK F                 S+++ S   SN  Y                                  
Subjt:  LGIDSRAVNRITIKYRFPIPRISDLLDQLAGAKVF-----------------SKEVFSILVSNTLY----------------------------------

Query:  --------------------------------INSKKCVFLCSNIKFFGF-IIGQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTA
                                        ++ +KC F    + F G     +G+  D +K + I+ +P P      + F+   ++YR+FIKNF+  +
Subjt:  --------------------------------INSKKCVFLCSNIKFFGF-IIGQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTA

Query:  APLTH-CLKKGKFARSKETLDSFNTLKEKLVNPPVLALPDFTKIF
          +T  C K   F  + E   +F  LK +L+NP +L  PDF+K F
Subjt:  APLTH-CLKKGKFARSKETLDSFNTLKEKLVNPPVLALPDFTKIF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.7e-2426.11Show/hide
Query:  VPNELPPLR------DIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIP
        + N+LPP         ++H+I+  PG+  P L  Y ++    Q +  I+Q LL  K I P  SPC+ P +L  +KDG++RL +D R +N+ TI   FP+P
Subjt:  VPNELPPLR------DIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIP

Query:  RISDLLDQLAGAKVFS---------------------------------------------------------------------------------KEV
        RI +LL ++  A++F+                                                                                   V
Subjt:  RISDLLDQLAGAKVFS---------------------------------------------------------------------------------KEV

Query:  FSILVSNTLYINSKKCVFLCSNIKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPL-THCLKKGKFARSKETLDS
           L +  L +  KKC F     +F G+ IG Q +     K  AI+++PTP ++K+ Q FLG+ ++YR+FI N S  A P+      K ++   ++   +
Subjt:  FSILVSNTLYINSKKCVFLCSNIKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPL-THCLKKGKFARSKETLDS

Query:  FNTLKEKLVNPPVL
           LK  L N PVL
Subjt:  FNTLKEKLVNPPVL

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.4e-1926.1Show/hide
Query:  PTPVNSKVQNLLAKFPSITKVP-----NELPPLRDIQHNI-DFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRK-----
        P      + +LL +FP I + P      E     +I+ N  D I   S+P+  + R        ++  I +LLQ   I+P  SP   P  +  +K     
Subjt:  PTPVNSKVQNLLAKFPSITKVP-----NELPPLRDIQHNI-DFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRK-----

Query:  DGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAK--------------------------------------------------------------
        +  +R+ +D + +N +TI   +PIP I+  L  L  AK                                                              
Subjt:  DGSWRLGIDSRAVNRITIKYRFPIPRISDLLDQLAGAK--------------------------------------------------------------

Query:  -----------VFSKE----------VFSILVSNTLYINSKKCVFLCSNIKFFGFII-GQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKN
                   VFS++          V + L    L +N +K  FL + ++F G+I+   G++ DP+K+ AI E P PTS+KE++ FLG+ S+YRKFI++
Subjt:  -----------VFSKE----------VFSILVSNTLYINSKKCVFLCSNIKFFGFII-GQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKN

Query:  FSSTAAPLTHCLKKGKFARSKET-------------LDSFNTLKEKLVNPPVLALPDFTKIFEV
        ++  A PLT+ L +G +A  K +             L SFN LK  L +  +LA P FTK F +
Subjt:  FSSTAAPLTHCLKKGKFARSKET-------------LDSFNTLKEKLVNPPVLALPDFTKIFEV

Q99315 Transposon Ty3-G Gag-Pol polyprotein5.5e-2526.98Show/hide
Query:  VPNELPPLR------DIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIP
        + N+LPP         ++H+I+  PG+  P L  Y ++    Q +  I+Q LL  K I P  SPC+ P +L  +KDG++RL +D R +N+ TI   FP+P
Subjt:  VPNELPPLR------DIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIP

Query:  RISDLLDQLAGAKVFS---------------------------------------------------------------------------------KEV
        RI +LL ++  A++F+                                                                                   V
Subjt:  RISDLLDQLAGAKVFS---------------------------------------------------------------------------------KEV

Query:  FSILVSNTLYINSKKCVFLCSNIKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAP--LTHCLKKGKFARSKETLD
           L +  L +  KKC F     +F G+ IG Q +     K  AI+++PTP ++K+ Q FLG+ ++YR+FI N S  A P  L  C K     +  + +D
Subjt:  FSILVSNTLYINSKKCVFLCSNIKFFGFIIG-QGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAP--LTHCLKKGKFARSKETLD

Query:  SFNTLKEKLVNPPVL
            LK+ L N PVL
Subjt:  SFNTLKEKLVNPPVL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.3e-1840Show/hide
Query:  VFSILVSNTLYINSKKCVFLCSNIKFFG---FIIGQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETL
        V  I   +  Y N KKC F    I + G    I G+GV  DP K+EA+  WP P +  E++ FLGL  +YR+F+KN+     PLT  LKK     ++   
Subjt:  VFSILVSNTLYINSKKCVFLCSNIKFFG---FIIGQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFARSKETL

Query:  DSFNTLKEKLVNPPVLALPD
         +F  LK  +   PVLALPD
Subjt:  DSFNTLKEKLVNPPVLALPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAGTAAAGGGCTTAGTAAACTCCCCACTCTTCCTTCTCTCCCTCCTAATGAAGCGGAAGCAACCTTCTCCACCTATCCTCAAACCATCAGCCGAAGATTAGACAA
TTTGGAAACTTCTGTCAACGGCATCATGACTAATATGATTGCTAGGCAAATTTCTATGGCGGGATTACAACAAGCAATCGAGAAGCTGACTTTGGATGTGGGTCGACTGG
TTGAAAATCAACCAAGAACTCATCAAGAAGCTTTGGCGGGGAACCAAACAGCAGGTAACAACCAGAATTTACAGCAAGAAATTCAACAAAGGCTTGAAAACCAACAACAA
ATGCTTCAAGGAATCAAGATTTGCCTGAGGGTCCAAGAAATTCCTCCAAGAATCCAAGAACCAGCAAGACTTCATCAAAGAATTGAAGATAGAGGGCTGCAGCTGCCCAA
TCCAGCCCAATATCATCAACATATGCCAGTTTATCCACCGTTTAGACCCATGCCACCAAATTTGTTGGGGTTGGAATCTTATTCATCCAAAGATGAAGATTTTGTTGGTC
CTCAACCACATTACCGAGCAAGAATGATGCATCCACCTCAAAGGAATAACCCACTTTTTAATAACAAATTCCTTTTGAACAAGCACAAGGGTGTGGAAATCAAGATTGTT
ATCAACATTGGCAAGGACAACCAAAATTGGCGGCGGAAAAATGACTTCAAGATGAAGATTGACATCCCAACATATAATGGCAAGATGAATATTGAAATCTTTTTAGATCG
GGTGAAAAGTGTAGAATTTTCTTTTGATTACATGGGAACCCTGGAACACAAAAAGGTAAAACTTATTTCCTTAAAACTTAAAAGTGGTGCATCGGCATGGTGGGATCAGA
TTCAAGGCAATCGCCGCTTTGCTAGGAACAATTTGGCCGAAAGTGACAGCCAACGAGTGGCTTGGTTCATTGATGGATTGCAACAAGACATCAAAGAAAAGGTTCACGTT
CAACCCCTACCTTGCTTGGCAGAAGCGGTGTCCATAGCCTCCACAATTGAAGAAGATGAGGCCTTAAAACATAAACGGTGGGCTGGCCAAAGGAGCAATTGGGACAGAAA
TTCAGCAACATCAAGAAAGTCCACAGCAGCAAATTCAAAAGGGCAGTCACCGATGAGGCACCTATCAAAAACAAGGGGAAAGAAGTTGAATCTAAGGTGTGGTCAAACAG
GACACCTTTCTAACGATTGTCCCCAAAGGAAGAACCTCACAATTCAAGAAGAACATGAGGATGAGAACTATTCTGAAGAAGACATCAATGTGGCCCAACCAGATGAGGGA
GATACTCTTTCTTGTGTGATCCAAAAGGTCCTCTTGACTCCGGTAACCGAAACTAATCCGCAGAGACACGCATTGTTTAGAATCTGTTGTACCATCAATGGGAAGGTGTG
CAACGTTATTATTGAGAGTGGTAGCTCGGAAAACATGGTGCCTAAGAAACTTGTGCAACATCTTAAATTGAAAGTTAACGCTCATCCAAGCCCCTATAAAATATGGACGC
TTGTCACATTCTACTTGGAAGACCATGGCAATATGATAACCATACCAAACACAATGGTAATGCCAACACCTATGAGTTTGAATGGATGGGCCTTGTTACCAACCCCTGTG
AACAGCAAGGTGCAGAACCTTTTGGCCAAATTTCCTTCCATTACCAAGGTCCCAAATGAGCTACCTCCACTCCGAGACATCCAACATAACATAGATTTCATCCCTGGATC
GTCTTTTCCTCACTTGCCCCACTATCGAATGAGCCCTAGTGAGTACCAAGCCCTCCAAGCTATCATCCAAGATCTTTTACAAAAGAAACATATCCAACCTAGGCTTAGTC
CTTGTGCTGTCCCATCCCTTTTAGCCTCAAGGAAAGATGGTAGTTGGCGCTTAGGCATAGATAGTCGTGCAGTCAATAGGATAACAATCAAATATAGATTCCCAATCCCA
AGAATTTCAGACCTACTCGATCAATTAGCCGGTGCAAAGGTGTTTTCCAAGGAAGTATTTTCTATCCTTGTATCTAATACTTTATACATCAATTCTAAGAAATGTGTTTT
CCTATGCAGTAACATCAAATTCTTTGGCTTCATCATTGGACAAGGTGTCCAAGTGGATCCTCAAAAAATTGAAGCTATCAAAGAGTGGCCAACACCTACATCAATCAAAG
AGGTTCAATCTTTCCTTGGCTTAGCATCTTTTTACCGCAAATTTATAAAGAATTTTAGCTCCACAGCTGCCCCTTTAACTCATTGCTTAAAGAAAGGTAAATTTGCAAGG
TCCAAAGAGACTTTAGATAGTTTTAATACCTTAAAGGAAAAACTAGTCAACCCACCAGTTTTAGCCCTTCCAGACTTTACAAAAATATTTGAAGTAGCACTTGTGATGCA
AGTTCTATAG
mRNA sequenceShow/hide mRNA sequence
ATGACAAGTAAAGGGCTTAGTAAACTCCCCACTCTTCCTTCTCTCCCTCCTAATGAAGCGGAAGCAACCTTCTCCACCTATCCTCAAACCATCAGCCGAAGATTAGACAA
TTTGGAAACTTCTGTCAACGGCATCATGACTAATATGATTGCTAGGCAAATTTCTATGGCGGGATTACAACAAGCAATCGAGAAGCTGACTTTGGATGTGGGTCGACTGG
TTGAAAATCAACCAAGAACTCATCAAGAAGCTTTGGCGGGGAACCAAACAGCAGGTAACAACCAGAATTTACAGCAAGAAATTCAACAAAGGCTTGAAAACCAACAACAA
ATGCTTCAAGGAATCAAGATTTGCCTGAGGGTCCAAGAAATTCCTCCAAGAATCCAAGAACCAGCAAGACTTCATCAAAGAATTGAAGATAGAGGGCTGCAGCTGCCCAA
TCCAGCCCAATATCATCAACATATGCCAGTTTATCCACCGTTTAGACCCATGCCACCAAATTTGTTGGGGTTGGAATCTTATTCATCCAAAGATGAAGATTTTGTTGGTC
CTCAACCACATTACCGAGCAAGAATGATGCATCCACCTCAAAGGAATAACCCACTTTTTAATAACAAATTCCTTTTGAACAAGCACAAGGGTGTGGAAATCAAGATTGTT
ATCAACATTGGCAAGGACAACCAAAATTGGCGGCGGAAAAATGACTTCAAGATGAAGATTGACATCCCAACATATAATGGCAAGATGAATATTGAAATCTTTTTAGATCG
GGTGAAAAGTGTAGAATTTTCTTTTGATTACATGGGAACCCTGGAACACAAAAAGGTAAAACTTATTTCCTTAAAACTTAAAAGTGGTGCATCGGCATGGTGGGATCAGA
TTCAAGGCAATCGCCGCTTTGCTAGGAACAATTTGGCCGAAAGTGACAGCCAACGAGTGGCTTGGTTCATTGATGGATTGCAACAAGACATCAAAGAAAAGGTTCACGTT
CAACCCCTACCTTGCTTGGCAGAAGCGGTGTCCATAGCCTCCACAATTGAAGAAGATGAGGCCTTAAAACATAAACGGTGGGCTGGCCAAAGGAGCAATTGGGACAGAAA
TTCAGCAACATCAAGAAAGTCCACAGCAGCAAATTCAAAAGGGCAGTCACCGATGAGGCACCTATCAAAAACAAGGGGAAAGAAGTTGAATCTAAGGTGTGGTCAAACAG
GACACCTTTCTAACGATTGTCCCCAAAGGAAGAACCTCACAATTCAAGAAGAACATGAGGATGAGAACTATTCTGAAGAAGACATCAATGTGGCCCAACCAGATGAGGGA
GATACTCTTTCTTGTGTGATCCAAAAGGTCCTCTTGACTCCGGTAACCGAAACTAATCCGCAGAGACACGCATTGTTTAGAATCTGTTGTACCATCAATGGGAAGGTGTG
CAACGTTATTATTGAGAGTGGTAGCTCGGAAAACATGGTGCCTAAGAAACTTGTGCAACATCTTAAATTGAAAGTTAACGCTCATCCAAGCCCCTATAAAATATGGACGC
TTGTCACATTCTACTTGGAAGACCATGGCAATATGATAACCATACCAAACACAATGGTAATGCCAACACCTATGAGTTTGAATGGATGGGCCTTGTTACCAACCCCTGTG
AACAGCAAGGTGCAGAACCTTTTGGCCAAATTTCCTTCCATTACCAAGGTCCCAAATGAGCTACCTCCACTCCGAGACATCCAACATAACATAGATTTCATCCCTGGATC
GTCTTTTCCTCACTTGCCCCACTATCGAATGAGCCCTAGTGAGTACCAAGCCCTCCAAGCTATCATCCAAGATCTTTTACAAAAGAAACATATCCAACCTAGGCTTAGTC
CTTGTGCTGTCCCATCCCTTTTAGCCTCAAGGAAAGATGGTAGTTGGCGCTTAGGCATAGATAGTCGTGCAGTCAATAGGATAACAATCAAATATAGATTCCCAATCCCA
AGAATTTCAGACCTACTCGATCAATTAGCCGGTGCAAAGGTGTTTTCCAAGGAAGTATTTTCTATCCTTGTATCTAATACTTTATACATCAATTCTAAGAAATGTGTTTT
CCTATGCAGTAACATCAAATTCTTTGGCTTCATCATTGGACAAGGTGTCCAAGTGGATCCTCAAAAAATTGAAGCTATCAAAGAGTGGCCAACACCTACATCAATCAAAG
AGGTTCAATCTTTCCTTGGCTTAGCATCTTTTTACCGCAAATTTATAAAGAATTTTAGCTCCACAGCTGCCCCTTTAACTCATTGCTTAAAGAAAGGTAAATTTGCAAGG
TCCAAAGAGACTTTAGATAGTTTTAATACCTTAAAGGAAAAACTAGTCAACCCACCAGTTTTAGCCCTTCCAGACTTTACAAAAATATTTGAAGTAGCACTTGTGATGCA
AGTTCTATAG
Protein sequenceShow/hide protein sequence
MTSKGLSKLPTLPSLPPNEAEATFSTYPQTISRRLDNLETSVNGIMTNMIARQISMAGLQQAIEKLTLDVGRLVENQPRTHQEALAGNQTAGNNQNLQQEIQQRLENQQQ
MLQGIKICLRVQEIPPRIQEPARLHQRIEDRGLQLPNPAQYHQHMPVYPPFRPMPPNLLGLESYSSKDEDFVGPQPHYRARMMHPPQRNNPLFNNKFLLNKHKGVEIKIV
INIGKDNQNWRRKNDFKMKIDIPTYNGKMNIEIFLDRVKSVEFSFDYMGTLEHKKVKLISLKLKSGASAWWDQIQGNRRFARNNLAESDSQRVAWFIDGLQQDIKEKVHV
QPLPCLAEAVSIASTIEEDEALKHKRWAGQRSNWDRNSATSRKSTAANSKGQSPMRHLSKTRGKKLNLRCGQTGHLSNDCPQRKNLTIQEEHEDENYSEEDINVAQPDEG
DTLSCVIQKVLLTPVTETNPQRHALFRICCTINGKVCNVIIESGSSENMVPKKLVQHLKLKVNAHPSPYKIWTLVTFYLEDHGNMITIPNTMVMPTPMSLNGWALLPTPV
NSKVQNLLAKFPSITKVPNELPPLRDIQHNIDFIPGSSFPHLPHYRMSPSEYQALQAIIQDLLQKKHIQPRLSPCAVPSLLASRKDGSWRLGIDSRAVNRITIKYRFPIP
RISDLLDQLAGAKVFSKEVFSILVSNTLYINSKKCVFLCSNIKFFGFIIGQGVQVDPQKIEAIKEWPTPTSIKEVQSFLGLASFYRKFIKNFSSTAAPLTHCLKKGKFAR
SKETLDSFNTLKEKLVNPPVLALPDFTKIFEVALVMQVL