; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008438 (gene) of Snake gourd v1 genome

Gene IDTan0008438
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG02:58246500..58249474
RNA-Seq ExpressionTan0008438
SyntenyTan0008438
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]5.4e-17640.08Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        M++SI+ LL ++KL G+N   WK+ LNTILVVDDL+FV+TEE PQ P  NA+R VR+AY+RWVKANDKA+VY+LASM+D+LAKKH+ +  AK IMDS+R 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA-----------------------------------------------GRMVLPSMSRS
        MFGQ S   RH  +K+I+  RM EGTSV +HVLDMM+HFNIA                                                R    ++S+ 
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA-----------------------------------------------GRMVLPSMSRS

Query:  RKVRQMLPV---------------------------GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKE-----------------------
        ++V   + V                           G      KVK  ADKGKCFH N+DGHWKRNCPKY+AEK  E                       
Subjt:  RKVRQMLPV---------------------------GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKE-----------------------

Query:  -----GATNHVCYSFRGIDSGGRADKEGEITLGLEMEKSSHQRAEGH------------------------------------------NEALFWC----
             GATNH+C+SF+   S  +  KEGEITL +   +     A G                                           NE    C    
Subjt:  -----GATNHVCYSFRGIDSGGRADKEGEITLGLEMEKSSHQRAEGH------------------------------------------NEALFWC----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTP
                                      +SR+GH+YL+HHKSE+ EKFKE+K EVEN++                                  AP TP
Subjt:  ------------------------------YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTP

Query:  QQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKE
        QQN V ERRNRTLLDMVRSMMSYA L  SFWGY +E A++ILNNVPSKSV ETP+ELWKGRK                                 GYPKE
Subjt:  QQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKE

Query:  IRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSEL------------------------------------DGTIAKPDRYMGLVEIQVVIPD
         RGGLFY P+E+KVF+S N  FL EDH R+H+P+SK+VL E+                                       + +P+RY+GLVE Q++IPD
Subjt:  IRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSEL------------------------------------DGTIAKPDRYMGLVEIQVVIPD

Query:  DNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIR
        D  EDPLTY QAM D D+D+W+  M+ EMESMYFNSVW LVD    VKPIGCKWIYKRKR    KVQTFKARLVAKGYTQ EGVDYEETFSPVAM+KSIR
Subjt:  DNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIR

Query:  ILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        ILL+IA +Y++E+W+MDVKTAFLN NLEE+IYM Q +GFI   +EQKVC+L++SIYGLKQASRSWNIRFD AI+   F
Subjt:  ILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-18647.6Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        +SS+ + +L A KL G N  +WKN +N +L++DDL+FV+ E+ PQVP +NA+R VR+ Y RW KAN+KA+ Y+LAS+S++LAKKHE M+ A+EIMDS++ 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLPVGVI---------------------------------
        MFGQ S Q +H+ LKYI+N+RM EG SV +HVL+MMVHFN+A     V+   S+   + + LP   +                                 
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLPVGVI---------------------------------

Query:  ------------------TEGKK------------------------------VKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCYSFRGID
                          T G K                               K  A KG CF  N++GHWKRNCPKY+A+K K          F G  
Subjt:  ------------------TEGKK------------------------------VKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCYSFRGID

Query:  SGGRADKEGEITLGLEMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL-------------------------------
         G RA +  E+ +  ++    + +A G  E        YSR+G++YLM HKSEALEKFKE+K EVEN L                               
Subjt:  SGGRADKEGEITLGLEMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL-------------------------------

Query:  ---APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK------------------------------
           APDTPQQN V ERRNRTLLDMVRSMMSYAHL +SFWGY V+ AVYILN VPSKSVSETP +LW G K                              
Subjt:  ---APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK------------------------------

Query:  ---GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------
           GYPK  RGG FYDPK++KVF+S N  FL EDHIR+HKP+SK+VL+EL     +P                                           
Subjt:  ---GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------

Query:  RYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDY
        RYM L E   VI D + EDPLT+ +AM D DKD+W+  M+ E+ESMYFNSVWDLVD+ DGVKPIGCKWIYKRKRG D KVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        EETFSPVAM+KSIRILL+IAAY+D+E+W+MDVKTAFLN NLEE IYM Q +GFI PG+EQK+C+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  EETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

KAA0058278.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-17850.33Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        M SSII LL   +LTGEN  TWK+KLN ILV+ DL FV+ EE P  P   AS++VRDAY+RW KANDKA++++LASM DIL+KKHE M+ A +IMDS+R 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC
        MFGQ S Q +                             N+A   R  +PS S S K+++          +  E K    VA K KCFH N D HWK NC
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC

Query:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------
        PKY + +K KEGATNHVC S +                                     +D  GR  K G             E  L  +M K       
Subjt:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------

Query:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSS
                          + +A G  E   +    YSR+G++YLM HKSEALEKFKE+KTEVEN L P TPQQN V ERRNRTLLDMVRSMM YA L SS
Subjt:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSS

Query:  FWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIR
        FWGY VE AV+ILNNVPSKSVSE PFELW+ RK                                 GYPKE RGGLF+DP+E++VF+S N  FL EDH+R
Subjt:  FWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIR

Query:  DHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQ
        +HKP+SK+VLSE                 +D T                     +++P+RY+GL E QVVIPDD  EDPL+Y QAM D DKD+WV  MD 
Subjt:  DHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQ

Query:  EMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNL
        EMESMYFNSVW+LVD  +GVKPIGCKWIYKRKR    KVQTFKARLVAKGYTQ EGVDYEETFS VAM+KSIRILL+IA +YD+E+W+MDVKTAFLN NL
Subjt:  EMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNL

Query:  EENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        EE+I+M Q +GFI  G+EQKVC+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  EENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]4.0e-17949.04Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        MSSSII LL   +LTGEN  TWK+KLN ILV+ DL FV+ EE P  P  +AS++VRDAY+RW KANDKA++++LASMSDIL+KKHE M+ A++IMDS+R 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC
        MFGQ S Q +                             N+A   R  +PS S S K+++          +  E K    VA K KCFH N D HWK NC
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC

Query:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------
        PKY + +K KEGATNHVC S +                                     +D  GR  K G             E  L  +M K       
Subjt:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------

Query:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------
                          + +A G  E   +    YSR+G++YLM HKSEALEKFKE+KTEVEN L                                  
Subjt:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------

Query:  APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------
        AP TPQQN V ERRNRTLLDMVRSMMSYA L SSFWGY VE AV+ILNNVPSKSVSETPFELW+GRK                                 
Subjt:  APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------

Query:  GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVE
        GYPKE RGGLF+DP+E++VF+S N  FL EDH+R+HKP+SK+VLSE                 +D T                     +++P+RY+GL E
Subjt:  GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVE

Query:  IQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPV
         QVVIPDD  EDPL+Y QAM D DKD+WV  MD EMESMYFNSVW+LVD  +GVKPIGCKWIYKRKR    KVQTFKARLVAKGYTQ EGVDYEETFSPV
Subjt:  IQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPV

Query:  AMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        AM+KSIRILL+IA +YD+E+W+MDVKTAFLN NLEE+I+M Q +GFI  G+EQKVC+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  AMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]1.9e-18948.65Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        M+++ + +L A KL G N  +WKN +N +L++DDLKFV+ EE PQVP +NA++ VR+ Y RW K N+K + Y+LAS+S++LAKKHE M+ A+EIMDS++ 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA---------------------------------GRMVLPSMSRSRKVR-----QMLPV
        MFGQ S Q  H+ LKYI+N+RM EG SV +HVL+MMVHFN+A                                 G   +PS S ++K +     Q    
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA---------------------------------GRMVLPSMSRSRKVR-----QMLPV

Query:  GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKN--KEGATN---------------------HVCYSF-------RGIDSGGRADKEGEITLGL
         +       K  A KG CFHYN++GHWKRNCPKY+AEK   K+G  N                      +C S        R     G   KE    +  
Subjt:  GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKN--KEGATN---------------------HVCYSF-------RGIDSGGRADKEGEITLGL

Query:  EMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTPQQNNVLE
        ++    + +A G  E        YSR+G++YLM HKSEALEKFKE+K EVEN L                                  AP TPQQN V E
Subjt:  EMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTPQQNNVLE

Query:  RRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFY
        RRNRTLLDMVRSM+SYAHL +SFWGY V+ AVYILN VPSKSVSETP +LW GRK                                 GYPK  RGG FY
Subjt:  RRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFY

Query:  DPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------RYMGLVEIQVVIPDD
        DPK++KVF+S N  FL EDHIR+HKP+SK+VL+EL     +P                                           RYM L E   VI D 
Subjt:  DPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------RYMGLVEIQVVIPDD

Query:  NCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRI
        + EDPLT+ +AM D DKD+W+  M+ E+ESMYFNSVWDLVD+ DGVKPIGCKWIYKRKRG D KVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRI
Subjt:  NCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRI

Query:  LLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        LL+IAAY+D+E+W+MDVKTAFLN NLEE IYM Q +GFI PG+EQK+C+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  LLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

TrEMBL top hitse value%identityAlignment
A0A5A7SNP8 Gag/pol protein3.6e-18647.6Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        +SS+ + +L A KL G N  +WKN +N +L++DDL+FV+ E+ PQVP +NA+R VR+ Y RW KAN+KA+ Y+LAS+S++LAKKHE M+ A+EIMDS++ 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLPVGVI---------------------------------
        MFGQ S Q +H+ LKYI+N+RM EG SV +HVL+MMVHFN+A     V+   S+   + + LP   +                                 
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLPVGVI---------------------------------

Query:  ------------------TEGKK------------------------------VKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCYSFRGID
                          T G K                               K  A KG CF  N++GHWKRNCPKY+A+K K          F G  
Subjt:  ------------------TEGKK------------------------------VKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCYSFRGID

Query:  SGGRADKEGEITLGLEMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL-------------------------------
         G RA +  E+ +  ++    + +A G  E        YSR+G++YLM HKSEALEKFKE+K EVEN L                               
Subjt:  SGGRADKEGEITLGLEMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL-------------------------------

Query:  ---APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK------------------------------
           APDTPQQN V ERRNRTLLDMVRSMMSYAHL +SFWGY V+ AVYILN VPSKSVSETP +LW G K                              
Subjt:  ---APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK------------------------------

Query:  ---GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------
           GYPK  RGG FYDPK++KVF+S N  FL EDHIR+HKP+SK+VL+EL     +P                                           
Subjt:  ---GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------

Query:  RYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDY
        RYM L E   VI D + EDPLT+ +AM D DKD+W+  M+ E+ESMYFNSVWDLVD+ DGVKPIGCKWIYKRKRG D KVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        EETFSPVAM+KSIRILL+IAAY+D+E+W+MDVKTAFLN NLEE IYM Q +GFI PG+EQK+C+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  EETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

A0A5A7USZ2 Gag/pol protein7.3e-17950.33Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        M SSII LL   +LTGEN  TWK+KLN ILV+ DL FV+ EE P  P   AS++VRDAY+RW KANDKA++++LASM DIL+KKHE M+ A +IMDS+R 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC
        MFGQ S Q +                             N+A   R  +PS S S K+++          +  E K    VA K KCFH N D HWK NC
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC

Query:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------
        PKY + +K KEGATNHVC S +                                     +D  GR  K G             E  L  +M K       
Subjt:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------

Query:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSS
                          + +A G  E   +    YSR+G++YLM HKSEALEKFKE+KTEVEN L P TPQQN V ERRNRTLLDMVRSMM YA L SS
Subjt:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSS

Query:  FWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIR
        FWGY VE AV+ILNNVPSKSVSE PFELW+ RK                                 GYPKE RGGLF+DP+E++VF+S N  FL EDH+R
Subjt:  FWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIR

Query:  DHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQ
        +HKP+SK+VLSE                 +D T                     +++P+RY+GL E QVVIPDD  EDPL+Y QAM D DKD+WV  MD 
Subjt:  DHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQ

Query:  EMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNL
        EMESMYFNSVW+LVD  +GVKPIGCKWIYKRKR    KVQTFKARLVAKGYTQ EGVDYEETFS VAM+KSIRILL+IA +YD+E+W+MDVKTAFLN NL
Subjt:  EMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNL

Query:  EENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        EE+I+M Q +GFI  G+EQKVC+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  EENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

A0A5A7UYE8 Gag/pol protein1.9e-17949.04Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        MSSSII LL   +LTGEN  TWK+KLN ILV+ DL FV+ EE P  P  +AS++VRDAY+RW KANDKA++++LASMSDIL+KKHE M+ A++IMDS+R 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC
        MFGQ S Q +                             N+A   R  +PS S S K+++          +  E K    VA K KCFH N D HWK NC
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA--GRMVLPSMSRSRKVRQMLP-----VGVITEGKKVKDVADKGKCFHYNEDGHWKRNC

Query:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------
        PKY + +K KEGATNHVC S +                                     +D  GR  K G             E  L  +M K       
Subjt:  PKY-IAEKNKEGATNHVCYSFR------------------------------------GIDSGGRADKEG-------------EITLGLEMEK-------

Query:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------
                          + +A G  E   +    YSR+G++YLM HKSEALEKFKE+KTEVEN L                                  
Subjt:  ----------------SSHQRAEGHNE---ALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------

Query:  APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------
        AP TPQQN V ERRNRTLLDMVRSMMSYA L SSFWGY VE AV+ILNNVPSKSVSETPFELW+GRK                                 
Subjt:  APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------

Query:  GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVE
        GYPKE RGGLF+DP+E++VF+S N  FL EDH+R+HKP+SK+VLSE                 +D T                     +++P+RY+GL E
Subjt:  GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSE-----------------LDGT---------------------IAKPDRYMGLVE

Query:  IQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPV
         QVVIPDD  EDPL+Y QAM D DKD+WV  MD EMESMYFNSVW+LVD  +GVKPIGCKWIYKRKR    KVQTFKARLVAKGYTQ EGVDYEETFSPV
Subjt:  IQVVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPV

Query:  AMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        AM+KSIRILL+IA +YD+E+W+MDVKTAFLN NLEE+I+M Q +GFI  G+EQKVC+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  AMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

A0A5D3BHG7 Gag/pol protein9.2e-19048.65Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        M+++ + +L A KL G N  +WKN +N +L++DDLKFV+ EE PQVP +NA++ VR+ Y RW K N+K + Y+LAS+S++LAKKHE M+ A+EIMDS++ 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA---------------------------------GRMVLPSMSRSRKVR-----QMLPV
        MFGQ S Q  H+ LKYI+N+RM EG SV +HVL+MMVHFN+A                                 G   +PS S ++K +     Q    
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA---------------------------------GRMVLPSMSRSRKVR-----QMLPV

Query:  GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKN--KEGATN---------------------HVCYSF-------RGIDSGGRADKEGEITLGL
         +       K  A KG CFHYN++GHWKRNCPKY+AEK   K+G  N                      +C S        R     G   KE    +  
Subjt:  GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKN--KEGATN---------------------HVCYSF-------RGIDSGGRADKEGEITLGL

Query:  EMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTPQQNNVLE
        ++    + +A G  E        YSR+G++YLM HKSEALEKFKE+K EVEN L                                  AP TPQQN V E
Subjt:  EMEKSSHQRAEGHNEALFWC---YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTPQQNNVLE

Query:  RRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFY
        RRNRTLLDMVRSM+SYAHL +SFWGY V+ AVYILN VPSKSVSETP +LW GRK                                 GYPK  RGG FY
Subjt:  RRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKEIRGGLFY

Query:  DPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------RYMGLVEIQVVIPDD
        DPK++KVF+S N  FL EDHIR+HKP+SK+VL+EL     +P                                           RYM L E   VI D 
Subjt:  DPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPD------------------------------------------RYMGLVEIQVVIPDD

Query:  NCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRI
        + EDPLT+ +AM D DKD+W+  M+ E+ESMYFNSVWDLVD+ DGVKPIGCKWIYKRKRG D KVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRI
Subjt:  NCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRI

Query:  LLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        LL+IAAY+D+E+W+MDVKTAFLN NLEE IYM Q +GFI PG+EQK+C+L RSIYGLKQASRSWNIRFD AI+   F
Subjt:  LLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

E2GK51 Gag/pol protein (Fragment)2.6e-17640.08Show/hide
Query:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG
        M++SI+ LL ++KL G+N   WK+ LNTILVVDDL+FV+TEE PQ P  NA+R VR+AY+RWVKANDKA+VY+LASM+D+LAKKH+ +  AK IMDS+R 
Subjt:  MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRG

Query:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA-----------------------------------------------GRMVLPSMSRS
        MFGQ S   RH  +K+I+  RM EGTSV +HVLDMM+HFNIA                                                R    ++S+ 
Subjt:  MFGQQSTQARHNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIA-----------------------------------------------GRMVLPSMSRS

Query:  RKVRQMLPV---------------------------GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKE-----------------------
        ++V   + V                           G      KVK  ADKGKCFH N+DGHWKRNCPKY+AEK  E                       
Subjt:  RKVRQMLPV---------------------------GVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKE-----------------------

Query:  -----GATNHVCYSFRGIDSGGRADKEGEITLGLEMEKSSHQRAEGH------------------------------------------NEALFWC----
             GATNH+C+SF+   S  +  KEGEITL +   +     A G                                           NE    C    
Subjt:  -----GATNHVCYSFRGIDSGGRADKEGEITLGLEMEKSSHQRAEGH------------------------------------------NEALFWC----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTP
                                      +SR+GH+YL+HHKSE+ EKFKE+K EVEN++                                  AP TP
Subjt:  ------------------------------YSRFGHIYLMHHKSEALEKFKEFKTEVENQL----------------------------------APDTP

Query:  QQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKE
        QQN V ERRNRTLLDMVRSMMSYA L  SFWGY +E A++ILNNVPSKSV ETP+ELWKGRK                                 GYPKE
Subjt:  QQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVSETPFELWKGRK---------------------------------GYPKE

Query:  IRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSEL------------------------------------DGTIAKPDRYMGLVEIQVVIPD
         RGGLFY P+E+KVF+S N  FL EDH R+H+P+SK+VL E+                                       + +P+RY+GLVE Q++IPD
Subjt:  IRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSEL------------------------------------DGTIAKPDRYMGLVEIQVVIPD

Query:  DNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIR
        D  EDPLTY QAM D D+D+W+  M+ EMESMYFNSVW LVD    VKPIGCKWIYKRKR    KVQTFKARLVAKGYTQ EGVDYEETFSPVAM+KSIR
Subjt:  DNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIR

Query:  ILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF
        ILL+IA +Y++E+W+MDVKTAFLN NLEE+IYM Q +GFI   +EQKVC+L++SIYGLKQASRSWNIRFD AI+   F
Subjt:  ILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.1e-2923.44Show/hide
Query:  RAEGH-NEALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSV-
        ++E H N  + + Y   G  YL    S  + +F   K    +   P TPQ N V ER  RT+ +  R+M+S A L  SFWG  V  A Y++N +PS+++ 
Subjt:  RAEGH-NEALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSV-

Query:  --SETPFELWKGRKGYPKEIR-------------GGLF-----------YDPKEDKVFMSIN-------------------------VIFLGEDHIRDHK
          S+TP+E+W  +K Y K +R              G F           Y+P   K++ ++N                          +FL +    ++K
Subjt:  --SETPFELWKGRKGYPKEIR-------------GGLF-----------YDPKEDKVFMSIN-------------------------VIFLGEDHIRDHK

Query:  ----PKSKVVLSELDGTIAKPDRYMGLVE----------------IQVVIPDD-----------------------------------------------
               K++ +E      + D    L +                IQ   P++                                               
Subjt:  ----PKSKVVLSELDGTIAKPDRYMGLVE----------------IQVVIPDD-----------------------------------------------

Query:  ---------------------------------------NCED-----------------PLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSD
                                               N ED                 P ++++     DK  W   ++ E+ +   N+ W +  + +
Subjt:  ---------------------------------------NCED-----------------PLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSD

Query:  GVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKE
            +  +W++  K         +KARLVA+G+TQ   +DYEETF+PVA + S R +L++   Y+ +V +MDVKTAFLN  L+E IYM   +G       
Subjt:  GVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKE

Query:  QKVCRLKRSIYGLKQASRSWNIRFDEAIE
          VC+L ++IYGLKQA+R W   F++A++
Subjt:  QKVCRLKRSIYGLKQASRSWNIRFDEAIE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-4831.76Show/hide
Query:  PDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVS-ETPFELWKGRK---------------------------------
        P TPQ N V ER NRT+++ VRSM+  A L  SFWG  V+ A Y++N  PS  ++ E P  +W  ++                                 
Subjt:  PDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVS-ETPFELWKGRK---------------------------------

Query:  --GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIR-----DHKPKSKVV-------------------------LSELDGTIAKPDRYM--GLVEIQ--
          GY  E  G   +DP + KV  S +V+F  E  +R       K K+ ++                           E  G + +    +  G+ E++  
Subjt:  --GYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIR-----DHKPKSKVV-------------------------LSELDGTIAKPDRYM--GLVEIQ--

Query:  ----------------------------VVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQ
                                    V+I DD   +P +  + +   +K++ +  M +EMES+  N  + LV+   G +P+ CKW++K K+  D K+ 
Subjt:  ----------------------------VVIPDDNCEDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQ

Query:  TFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNI
         +KARLV KG+ Q +G+D++E FSPV  + SIR +L++AA  D EV ++DVKTAFL+ +LEE IYM+Q +GF   GK+  VC+L +S+YGLKQA R W +
Subjt:  TFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNI

Query:  RFD
        +FD
Subjt:  RFD

P92520 Uncharacterized mitochondrial protein AtMg008201.5e-1139.53Show/hide
Query:  WVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIA
        W   M +E++++  N  W LV        +GCKW++K K   D  +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L +A
Subjt:  WVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-3041.46Show/hide
Query:  DPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDG-VKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL
        +P T  QA+ D   ++W   M  E+ +   N  WDLV      V  +GC+WI+ +K   D  +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L
Subjt:  DPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDG-VKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNI
         +A    + + ++DV  AFL   L +++YM Q  GFI+  +   VC+L++++YGLKQA R+W +
Subjt:  AIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.0e-0431.15Show/hide
Query:  PDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVS-ETPFE
        P TP+ N + ER++R +++   +++S+A +  ++W Y   +AVY++N +P+  +  E+PF+
Subjt:  PDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYILNNVPSKSVS-ETPFE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-3143.29Show/hide
Query:  DPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLV-DKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL
        +P T  QAM D   D+W   M  E+ +   N  WDLV      V  +GC+WI+ +K   D  +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L
Subjt:  DPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLV-DKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNI
         +A    + + ++DV  AFL   L + +YM Q  GF++  +   VCRL+++IYGLKQA R+W +
Subjt:  AIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-0423.2Show/hide
Query:  YSRFGHIYLMHHKSEALEKFKEFKTEVENQL--------------------------------APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWG
        ++R+  +Y +  KS+  + F  FK+ VEN+                                  P TP+ N + ER++R +++M  +++S+A +  ++W 
Subjt:  YSRFGHIYLMHHKSEALEKFKEFKTEVENQL--------------------------------APDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWG

Query:  YTVEIAVYILNNVPSKSVS-ETPFE
        Y   +AVY++N +P+  +  ++PF+
Subjt:  YTVEIAVYILNNVPSKSVS-ETPFE

Arabidopsis top hitse value%identityAlignment
AT4G05360.1 Zinc knuckle (CCHC-type) family protein1.8e-0438.03Show/hide
Query:  KCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCY----SFRGIDSGGRADKE-GEITLGLEMEKSSHQRAEG
        KC+HY   GH KRNC ++I E + EG   +  +     F G  SGG  D   G++ L LE+  S H +  G
Subjt:  KCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCY----SFRGIDSGGRADKE-GEITLGLEMEKSSHQRAEG

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.2e-3745.29Show/hide
Query:  EDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL
        ++P TYN+A   K+   W   MD E+ +M     W++       KPIGCKW+YK K   D  ++ +KARLVAKGYTQ EG+D+ ETFSPV  + S++++L
Subjt:  EDPLTYNQAMVDKDKDKWVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFI----EPGKEQKVCRLKRSIYGLKQASRSWNIRF
        AI+A Y+F + ++D+  AFLN +L+E IYM    G+     +      VC LK+SIYGLKQASR W ++F
Subjt:  AIAAYYDFEVWKMDVKTAFLNCNLEENIYMDQLKGFI----EPGKEQKVCRLKRSIYGLKQASRSWNIRF

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.1e-1239.53Show/hide
Query:  WVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIA
        W   M +E++++  N  W LV        +GCKW++K K   D  +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L +A
Subjt:  WVIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCTCTATAATTTGCTTATTGGGTGCGAAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGATGATCTGAAGTT
TGTGGTAACTGAGGAGTATCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATAATCGATGGGTCAAGGCCAATGATAAGGCGAAGGTCTACATGC
TGGCAAGTATGTCTGACATATTAGCCAAGAAGCATGAGGGCATGATTATCGCCAAGGAAATCATGGATTCTGTGCGGGGTATGTTTGGACAACAGTCCACACAGGCCCGA
CATAACGTCCTAAAGTACATATTCAACTCGAGGATGCCAGAGGGTACATCTGTTTGGGATCATGTCCTGGATATGATGGTGCACTTTAACATCGCGGGTCGAATGGTGCT
TCCATCGATGAGTCGATCCAGAAAGGTGAGGCAAATGTTGCCAGTAGGAGTTATCACAGAGGGCAAGAAAGTCAAAGATGTTGCTGACAAAGGAAAATGTTTCCACTACA
ACGAAGACGGGCATTGGAAACGAAACTGTCCGAAGTACATTGCAGAAAAGAATAAGGAAGGCGCCACTAATCATGTTTGCTATTCTTTTCGTGGAATTGATTCTGGCGGC
AGAGCTGACAAAGAAGGAGAGATAACGCTCGGGTTGGAAATGGAAAAGTCGTCTCACCAGAGAGCAGAGGGGCACAATGAAGCTTTATTTTGGTGTTATTCTAGATTTGG
GCATATTTACCTAATGCACCACAAGTCTGAAGCACTTGAAAAGTTCAAGGAATTCAAGACTGAGGTTGAAAATCAATTAGCTCCCGACACACCACAACAAAACAATGTAT
TGGAAAGGAGAAATAGAACCTTGTTGGACATGGTTCGATCAATGATGAGTTATGCTCATCTCCATAGTTCCTTTTGGGGTTACACAGTGGAGATTGCAGTATACATTTTG
AATAATGTGCCCTCCAAAAGTGTTTCTGAAACACCTTTTGAACTCTGGAAAGGACGTAAAGGATACCCAAAAGAAATAAGAGGTGGTTTATTCTATGATCCTAAGGAAGA
CAAGGTCTTTATGTCGATAAATGTCATTTTCTTGGGGGAGGACCACATAAGGGACCACAAACCAAAAAGTAAAGTAGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGC
CTGACCGTTACATGGGTTTAGTTGAAATCCAAGTTGTTATACCAGATGACAACTGCGAAGATCCATTGACTTATAATCAAGCAATGGTTGACAAAGACAAAGACAAATGG
GTCATACCCATGGACCAAGAAATGGAGTCGATGTACTTCAATTCTGTTTGGGATCTTGTAGATAAATCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAG
AAAACGTGGTGTAGATGAGAAGGTGCAAACCTTTAAAGCTAGACTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTA
TGGTAAAGTCTATCCGTATCCTTCTTGCCATTGCCGCATATTATGACTTCGAGGTATGGAAAATGGACGTCAAGACTGCCTTTTTGAATTGCAATCTTGAGGAAAACATC
TACATGGACCAACTCAAAGGGTTCATTGAACCAGGAAAAGAGCAAAAGGTTTGCAGGCTTAAAAGGTCAATTTATGGACTGAAACAAGCCTCTAGGTCCTGGAATATAAG
ATTTGATGAGGCAATCGAGATCTTATGGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCTCTATAATTTGCTTATTGGGTGCGAAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGATGATCTGAAGTT
TGTGGTAACTGAGGAGTATCCTCAGGTGCCAGGCTCGAATGCGTCACGAAATGTTCGTGATGCATATAATCGATGGGTCAAGGCCAATGATAAGGCGAAGGTCTACATGC
TGGCAAGTATGTCTGACATATTAGCCAAGAAGCATGAGGGCATGATTATCGCCAAGGAAATCATGGATTCTGTGCGGGGTATGTTTGGACAACAGTCCACACAGGCCCGA
CATAACGTCCTAAAGTACATATTCAACTCGAGGATGCCAGAGGGTACATCTGTTTGGGATCATGTCCTGGATATGATGGTGCACTTTAACATCGCGGGTCGAATGGTGCT
TCCATCGATGAGTCGATCCAGAAAGGTGAGGCAAATGTTGCCAGTAGGAGTTATCACAGAGGGCAAGAAAGTCAAAGATGTTGCTGACAAAGGAAAATGTTTCCACTACA
ACGAAGACGGGCATTGGAAACGAAACTGTCCGAAGTACATTGCAGAAAAGAATAAGGAAGGCGCCACTAATCATGTTTGCTATTCTTTTCGTGGAATTGATTCTGGCGGC
AGAGCTGACAAAGAAGGAGAGATAACGCTCGGGTTGGAAATGGAAAAGTCGTCTCACCAGAGAGCAGAGGGGCACAATGAAGCTTTATTTTGGTGTTATTCTAGATTTGG
GCATATTTACCTAATGCACCACAAGTCTGAAGCACTTGAAAAGTTCAAGGAATTCAAGACTGAGGTTGAAAATCAATTAGCTCCCGACACACCACAACAAAACAATGTAT
TGGAAAGGAGAAATAGAACCTTGTTGGACATGGTTCGATCAATGATGAGTTATGCTCATCTCCATAGTTCCTTTTGGGGTTACACAGTGGAGATTGCAGTATACATTTTG
AATAATGTGCCCTCCAAAAGTGTTTCTGAAACACCTTTTGAACTCTGGAAAGGACGTAAAGGATACCCAAAAGAAATAAGAGGTGGTTTATTCTATGATCCTAAGGAAGA
CAAGGTCTTTATGTCGATAAATGTCATTTTCTTGGGGGAGGACCACATAAGGGACCACAAACCAAAAAGTAAAGTAGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGC
CTGACCGTTACATGGGTTTAGTTGAAATCCAAGTTGTTATACCAGATGACAACTGCGAAGATCCATTGACTTATAATCAAGCAATGGTTGACAAAGACAAAGACAAATGG
GTCATACCCATGGACCAAGAAATGGAGTCGATGTACTTCAATTCTGTTTGGGATCTTGTAGATAAATCTGATGGGGTAAAACCTATAGGTTGTAAGTGGATCTACAAGAG
AAAACGTGGTGTAGATGAGAAGGTGCAAACCTTTAAAGCTAGACTAGTAGCAAAGGGTTATACCCAGGTTGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGTTGCTA
TGGTAAAGTCTATCCGTATCCTTCTTGCCATTGCCGCATATTATGACTTCGAGGTATGGAAAATGGACGTCAAGACTGCCTTTTTGAATTGCAATCTTGAGGAAAACATC
TACATGGACCAACTCAAAGGGTTCATTGAACCAGGAAAAGAGCAAAAGGTTTGCAGGCTTAAAAGGTCAATTTATGGACTGAAACAAGCCTCTAGGTCCTGGAATATAAG
ATTTGATGAGGCAATCGAGATCTTATGGTTTTGA
Protein sequenceShow/hide protein sequence
MSSSIICLLGAKKLTGENMMTWKNKLNTILVVDDLKFVVTEEYPQVPGSNASRNVRDAYNRWVKANDKAKVYMLASMSDILAKKHEGMIIAKEIMDSVRGMFGQQSTQAR
HNVLKYIFNSRMPEGTSVWDHVLDMMVHFNIAGRMVLPSMSRSRKVRQMLPVGVITEGKKVKDVADKGKCFHYNEDGHWKRNCPKYIAEKNKEGATNHVCYSFRGIDSGG
RADKEGEITLGLEMEKSSHQRAEGHNEALFWCYSRFGHIYLMHHKSEALEKFKEFKTEVENQLAPDTPQQNNVLERRNRTLLDMVRSMMSYAHLHSSFWGYTVEIAVYIL
NNVPSKSVSETPFELWKGRKGYPKEIRGGLFYDPKEDKVFMSINVIFLGEDHIRDHKPKSKVVLSELDGTIAKPDRYMGLVEIQVVIPDDNCEDPLTYNQAMVDKDKDKW
VIPMDQEMESMYFNSVWDLVDKSDGVKPIGCKWIYKRKRGVDEKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIAAYYDFEVWKMDVKTAFLNCNLEENI
YMDQLKGFIEPGKEQKVCRLKRSIYGLKQASRSWNIRFDEAIEILWF