; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002337 (gene) of Snake gourd v1 genome

Gene IDTan0002337
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG09:7454357..7458134
RNA-Seq ExpressionTan0002337
SyntenyTan0002337
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-26449.67Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        +SS+ +++L A+KL G N  +WKN +N +L++DDL+FVL E+C QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHFN+   N A IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKV--------------------------KKVADKGPLQRR---
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK                           KK A +G + +R   
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKV--------------------------KKVADKGPLQRR---

Query:  ----RALETELSEVHC--------------------RKEEGRYGVV------------------------------------------------------
            RA E  L  VH                       +  RYG V                                                      
Subjt:  ----RALETELSEVHC--------------------RKEEGRYGVV------------------------------------------------------

Query:  ----------------------------------------------------------------------------------QRHVLVTNPKKLDSRSKL
                                                                                            HVL  NPKKL+ RSKL
Subjt:  ----------------------------------------------------------------------------------QRHVLVTNPKKLDSRSKL

Query:  CLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK---------------------------------------
        CLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +                                       
Subjt:  CLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK---------------------------------------

Query:  -------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEGVD
                     ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDLVD+P+G           RKRG DGKVQTFKARLVAKGYTQVEGVD
Subjt:  -------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEGVD

Query:  YEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEP
        YEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  DEP
Subjt:  YEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEP

Query:  CVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------
        CVYK+IIN S+AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ SYI+K++                     
Subjt:  CVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------

Query:  ---------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRK
                                         +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRRTR+Y LVYG+KDLILTGYTDSDFQT++DSRK
Subjt:  ---------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRK

Query:  STSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        STSGSVFTLN   VVW+S+KQGCIA+STMEA+YVAACE AKE VWLR F+++LEVV NM+  ITLYCDNSGAVAN
Subjt:  STSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-24442.13Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N  +WKN +NT+L++DDL+FVL EEC QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHFN+ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K                                        
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------

Query:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------
                   + D G           +   R LET                                            L  V C  E+          
Subjt:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------GRYGVV----------------------------------------------------
                                                   RYG V                                                    
Subjt:  ------------------------------------------GRYGVV----------------------------------------------------

Query:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS
                                                                                              HVL  NPKKL+ RS
Subjt:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS

Query:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------
        KLCLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +                                     
Subjt:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------

Query:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG
                       ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDLVD+P+G           RKRG DGKVQTFKARLVAKGYTQVEG
Subjt:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG

Query:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD
        VDYEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  D
Subjt:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD

Query:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------
        EPCVYK+IIN S+AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ SYI+K++                   
Subjt:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------

Query:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS
                                           +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRRTR+Y LVYG+KDLILTGYTDSDFQT++DS
Subjt:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS

Query:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        RKSTSGSVFTLN   VVW+S+KQGCIA+STMEA+YVAACE AKE VWLR F+++LEVV NM+  ITLYCDNSGAVAN
Subjt:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]3.6e-27357.3Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N   WKN +NT+L++DDL+FVL EEC QVP++NA++ VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL++MVHFN+ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKKVADKGP---------LQRRRAL----ETELSEVHCRKEE
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K      P          +R R L     + +S  H     
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKKVADKGP---------LQRRRAL----ETELSEVHCRKEE

Query:  GRYGVVQRHVLVTN--PKKLDSRSKLCLF---VGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK----------
          Y  VQ  V + N  P K  S + L L+    GYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     K + +          
Subjt:  GRYGVVQRHVLVTN--PKKLDSRSKLCLF---VGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK----------

Query:  ------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG----------
                                                  ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDL+D+P+G          
Subjt:  ------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG----------

Query:  -RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIY
         RKRG DGKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIY
Subjt:  -RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIY

Query:  GLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQ
        GLKQASRSWNI FD AI+SYGFDQ  DEPCVYK+IIN  +AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ
Subjt:  GLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQ

Query:  TSYINKVL------------------------------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTR
         SYI+K++                                                      +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRR R
Subjt:  TSYINKVL------------------------------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTR

Query:  NYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSG
        +Y LVYG+KDLILTGYTDSDFQT++DSRKST GSVFTLN   VVW+S+KQGCIA+STMEA+YV ACE AKE VWLR F+++LEVV NM+  ITLYCDNSG
Subjt:  NYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSG

Query:  AVAN
        AVAN
Subjt:  AVAN

KAA0066490.1 gag/pol protein [Cucumis melo var. makuwa]2.3e-25652.42Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N  +WKN +NT+L++DDL+FVL EEC QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE ++TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHF++ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------VADK--------------
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K                      +A+K              
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------VADK--------------

Query:  -------------------------------------------------GPLQRRR-----------------------ALETELSEVHC----------
                                                         G  +RR                        A++T +  ++C          
Subjt:  -------------------------------------------------GPLQRRR-----------------------ALETELSEVHC----------

Query:  -RKEEGRYGVVQR--------HVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK--
         +   GR G ++         HVL  NPKKL+ RSKLCLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +  
Subjt:  -RKEEGRYGVVQR--------HVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK--

Query:  --------------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG--
                                                          ++ + EDPLT+ +AM D DKD+W+ AM+ E++ MYFNSVWDLVD+P+G  
Subjt:  --------------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG--

Query:  ---------RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKV
                 RKRG DGKVQTF+ARLVAKGYTQVEGVDYE+TFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+
Subjt:  ---------RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKV

Query:  CRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRK
        C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  DEPCVYK+IIN S+AFL+L VDDILLI ND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RK
Subjt:  CRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRK

Query:  NKTLALSQTSYINKVL--------SRPDICFAVGM-VSRYQ---SNQGLEHWTT-VKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSG
        NK LALSQ SYI+K++         R  + F  G+ +S+ Q   + Q +E     V  +   +RRTR+YMLVYG+KDLILTGYTDSDFQT++DSRKSTSG
Subjt:  NKTLALSQTSYINKVL--------SRPDICFAVGM-VSRYQ---SNQGLEHWTT-VKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSG

Query:  SVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        SVFTLN   +VW+S+KQGCIA+STMEA+Y AACEVAKE VWLRKF+++LEVV NM   ITLYCDNSG VAN
Subjt:  SVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]4.5e-24442.13Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N  +WKN +NT+L++DDL+FVL EEC QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHFN+ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K                                        
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------

Query:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------
                   + D G           +   R LET                                            L  V C  E+          
Subjt:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------GRYGVV----------------------------------------------------
                                                   RYG V                                                    
Subjt:  ------------------------------------------GRYGVV----------------------------------------------------

Query:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS
                                                                                              HVL  NPKKL+ RS
Subjt:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS

Query:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------
        KLCLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +                                     
Subjt:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------

Query:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG
                       ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDLVD+P+G           RKRG DGKVQTFKARLVAKGYTQVEG
Subjt:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG

Query:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD
        VDYEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  D
Subjt:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD

Query:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------
        EPCVYK+IIN S+AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ SYI+K++                   
Subjt:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------

Query:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS
                                           +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRRTR+Y LVYG+KDLILTGYTDSDFQT++DS
Subjt:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS

Query:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        RKSTSGSVFTLN   VVW+S+KQGCIA+STMEA+YVAACE AKE VWLR F+++LEVV NM+  ITLYCDNSGAVAN
Subjt:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein2.2e-24442.13Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N  +WKN +NT+L++DDL+FVL EEC QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHFN+ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K                                        
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------

Query:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------
                   + D G           +   R LET                                            L  V C  E+          
Subjt:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------GRYGVV----------------------------------------------------
                                                   RYG V                                                    
Subjt:  ------------------------------------------GRYGVV----------------------------------------------------

Query:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS
                                                                                              HVL  NPKKL+ RS
Subjt:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS

Query:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------
        KLCLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +                                     
Subjt:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------

Query:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG
                       ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDLVD+P+G           RKRG DGKVQTFKARLVAKGYTQVEG
Subjt:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG

Query:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD
        VDYEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  D
Subjt:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD

Query:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------
        EPCVYK+IIN S+AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ SYI+K++                   
Subjt:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------

Query:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS
                                           +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRRTR+Y LVYG+KDLILTGYTDSDFQT++DS
Subjt:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS

Query:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        RKSTSGSVFTLN   VVW+S+KQGCIA+STMEA+YVAACE AKE VWLR F+++LEVV NM+  ITLYCDNSGAVAN
Subjt:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

A0A5A7SNP8 Gag/pol protein1.9e-26449.67Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        +SS+ +++L A+KL G N  +WKN +N +L++DDL+FVL E+C QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHFN+   N A IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKV--------------------------KKVADKGPLQRR---
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK                           KK A +G + +R   
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKV--------------------------KKVADKGPLQRR---

Query:  ----RALETELSEVHC--------------------RKEEGRYGVV------------------------------------------------------
            RA E  L  VH                       +  RYG V                                                      
Subjt:  ----RALETELSEVHC--------------------RKEEGRYGVV------------------------------------------------------

Query:  ----------------------------------------------------------------------------------QRHVLVTNPKKLDSRSKL
                                                                                            HVL  NPKKL+ RSKL
Subjt:  ----------------------------------------------------------------------------------QRHVLVTNPKKLDSRSKL

Query:  CLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK---------------------------------------
        CLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +                                       
Subjt:  CLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK---------------------------------------

Query:  -------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEGVD
                     ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDLVD+P+G           RKRG DGKVQTFKARLVAKGYTQVEGVD
Subjt:  -------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEGVD

Query:  YEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEP
        YEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  DEP
Subjt:  YEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEP

Query:  CVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------
        CVYK+IIN S+AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ SYI+K++                     
Subjt:  CVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------

Query:  ---------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRK
                                         +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRRTR+Y LVYG+KDLILTGYTDSDFQT++DSRK
Subjt:  ---------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRK

Query:  STSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        STSGSVFTLN   VVW+S+KQGCIA+STMEA+YVAACE AKE VWLR F+++LEVV NM+  ITLYCDNSGAVAN
Subjt:  STSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

A0A5A7V6N0 Gag/pol protein1.7e-27357.3Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N   WKN +NT+L++DDL+FVL EEC QVP++NA++ VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL++MVHFN+ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKKVADKGP---------LQRRRAL----ETELSEVHCRKEE
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K      P          +R R L     + +S  H     
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKKVADKGP---------LQRRRAL----ETELSEVHCRKEE

Query:  GRYGVVQRHVLVTN--PKKLDSRSKLCLF---VGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK----------
          Y  VQ  V + N  P K  S + L L+    GYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     K + +          
Subjt:  GRYGVVQRHVLVTN--PKKLDSRSKLCLF---VGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK----------

Query:  ------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG----------
                                                  ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDL+D+P+G          
Subjt:  ------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG----------

Query:  -RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIY
         RKRG DGKVQTFKARLVAKGYTQVEGVDYEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIY
Subjt:  -RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIY

Query:  GLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQ
        GLKQASRSWNI FD AI+SYGFDQ  DEPCVYK+IIN  +AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ
Subjt:  GLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQ

Query:  TSYINKVL------------------------------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTR
         SYI+K++                                                      +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRR R
Subjt:  TSYINKVL------------------------------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTR

Query:  NYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSG
        +Y LVYG+KDLILTGYTDSDFQT++DSRKST GSVFTLN   VVW+S+KQGCIA+STMEA+YV ACE AKE VWLR F+++LEVV NM+  ITLYCDNSG
Subjt:  NYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSG

Query:  AVAN
        AVAN
Subjt:  AVAN

A0A5A7VH46 Gag/pol protein1.1e-25652.42Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N  +WKN +NT+L++DDL+FVL EEC QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE ++TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHF++ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------VADK--------------
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K                      +A+K              
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------VADK--------------

Query:  -------------------------------------------------GPLQRRR-----------------------ALETELSEVHC----------
                                                         G  +RR                        A++T +  ++C          
Subjt:  -------------------------------------------------GPLQRRR-----------------------ALETELSEVHC----------

Query:  -RKEEGRYGVVQR--------HVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK--
         +   GR G ++         HVL  NPKKL+ RSKLCLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +  
Subjt:  -RKEEGRYGVVQR--------HVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK--

Query:  --------------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG--
                                                          ++ + EDPLT+ +AM D DKD+W+ AM+ E++ MYFNSVWDLVD+P+G  
Subjt:  --------------------------------------------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG--

Query:  ---------RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKV
                 RKRG DGKVQTF+ARLVAKGYTQVEGVDYE+TFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+
Subjt:  ---------RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKV

Query:  CRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRK
        C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  DEPCVYK+IIN S+AFL+L VDDILLI ND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RK
Subjt:  CRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRK

Query:  NKTLALSQTSYINKVL--------SRPDICFAVGM-VSRYQ---SNQGLEHWTT-VKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSG
        NK LALSQ SYI+K++         R  + F  G+ +S+ Q   + Q +E     V  +   +RRTR+YMLVYG+KDLILTGYTDSDFQT++DSRKSTSG
Subjt:  NKTLALSQTSYINKVL--------SRPDICFAVGM-VSRYQ---SNQGLEHWTT-VKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSG

Query:  SVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        SVFTLN   +VW+S+KQGCIA+STMEA+Y AACEVAKE VWLRKF+++LEVV NM   ITLYCDNSG VAN
Subjt:  SVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

A0A5D3CPJ6 Gag/pol protein2.2e-24442.13Show/hide
Query:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG
        M+S+ +++L A+KL G N  +WKN +NT+L++DDL+FVL EEC QVP++NA+R VR+ Y+RW KAN+K + Y+LAS+S++LAKKHE M+TA+EIMDS+Q 
Subjt:  MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQG

Query:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG
        MFGQ S Q +H+ALKYI+N+RM +  SVR+HVL+MMVHFN+ E NGA IDE+SQVSFILE LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SLMKI+G
Subjt:  MFGQQSTQARHNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQG

Query:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------
         KGE NVA  +R +HRGSTS TK++ SS  N K KK   KK G+G +A+ AAA+  KK K                                        
Subjt:  SKGETNVA--SRSYHRGSTSRTKTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKK---------------------------------------

Query:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------
                   + D G           +   R LET                                            L  V C  E+          
Subjt:  -----------VADKGP----------LQRRRALET-------------------------------------------ELSEVHCRKEE----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------GRYGVV----------------------------------------------------
                                                   RYG V                                                    
Subjt:  ------------------------------------------GRYGVV----------------------------------------------------

Query:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS
                                                                                              HVL  NPKKL+ RS
Subjt:  ------------------------------------------------------------------------------------QRHVLVTNPKKLDSRS

Query:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------
        KLCLFVGYPK TRGG FYDPK++KVFVSTNA FLEEDHIR+HKP+SK+VL+EL     + + +                                     
Subjt:  KLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANK-------------------------------------

Query:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG
                       ++ + EDPLT+ +AM D DKD+W+ AM+ E+E MYFNSVWDLVD+P+G           RKRG DGKVQTFKARLVAKGYTQVEG
Subjt:  ---------------TNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG-----------RKRGVDGKVQTFKARLVAKGYTQVEG

Query:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD
        VDYEETFSPVAM+KSIRILL+IA Y+DYE+WQMDVKT FLNGNLE+ IYM QP+GFI PGQEQK+C+L RSIYGLKQASRSWNI FD AI+SYGFDQ  D
Subjt:  VDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDD

Query:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------
        EPCVYK+IIN S+AFL+L VDDILLIGND+G L  IK+WLAT+FQMKDLG+  FVLGIQI  +RKNK LALSQ SYI+K++                   
Subjt:  EPCVYKKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL-------------------

Query:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS
                                           +RPDIC+AVG+VSRYQSN GL HWT VKTILKYLRRTR+Y LVYG+KDLILTGYTDSDFQT++DS
Subjt:  -----------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAKDLILTGYTDSDFQTNKDS

Query:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN
        RKSTSGSVFTLN   VVW+S+KQGCIA+STMEA+YVAACE AKE VWLR F+++LEVV NM+  ITLYCDNSGAVAN
Subjt:  RKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-5830.02Show/hide
Query:  KPKSKVVLSELDGTIAKVANKTNDNCED-PLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGR-----------KRGVDGKVQTFKARLVAK
        K K ++  +E D ++ KV    +    D P ++++     DK  W  A++ E+     N+ W +  +P  +           K    G    +KARLVA+
Subjt:  KPKSKVVLSELDGTIAKVANKTNDNCED-PLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGR-----------KRGVDGKVQTFKARLVAK

Query:  GYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSY
        G+TQ   +DYEETF+PVA + S R +L++   Y+ +V QMDVKT FLNG L++ IYM  P+G         VC+L ++IYGLKQA+R W   F++A++  
Subjt:  GYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSY

Query:  GFDQNDDEPCVY---KKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVLS--------
         F  +  + C+Y   K  IN +I +++L VDD+++   D+  +   K +L  +F+M DL ++   +GI+I    +   + LSQ++Y+ K+LS        
Subjt:  GFDQNDDEPCVY---KKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVLS--------

Query:  ---------------------------------------RPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYG---AKDLILTGYTDSDF
                                               RPD+  AV ++SRY S    E W  +K +L+YL+ T +  L++    A +  + GY DSD+
Subjt:  ---------------------------------------RPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYG---AKDLILTGYTDSDF

Query:  QTNKDSRKSTSGSVFTL-NREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVA
          ++  RKST+G +F + +  ++ W + +Q  +A S+ EA+Y+A  E  +E +WL+  + ++ +   +   I +Y DN G ++
Subjt:  QTNKDSRKSTSGSVFTL-NREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVA

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.1e-8232.13Show/hide
Query:  HVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEE-------------------------------------DHIRDHKPKSKVV
        HV      KLD +S  C+F+GY  E  G   +DP + KV  S + +F E                                      D + +   +   V
Subjt:  HVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKEDKVFVSTNAIFLEE-------------------------------------DHIRDHKPKSKVV

Query:  L---SELDGTIAKVANKTN-----------------------------DNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGR-----
        +    +LD  + +V + T                               +  +P +  + +   +K++ + AM +EME +  N  + LV+ P G+     
Subjt:  L---SELDGTIAKVANKTN-----------------------------DNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGR-----

Query:  ------KRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLK
              K+  D K+  +KARLV KG+ Q +G+D++E FSPV  + SIR +L++A   D EV Q+DVKT FL+G+LE+ IYM+QP+GF   G++  VC+L 
Subjt:  ------KRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLK

Query:  RSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVY-KKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKT
        +S+YGLKQA R W + FD  ++S  + +   +PCVY K+   ++   L+L VDD+L++G D G +  +K  L+  F MKDLG    +LG++IV  R ++ 
Subjt:  RSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVY-KKIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKT

Query:  LALSQTSYINKVL------------------------------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKY
        L LSQ  YI +VL                                                      +RPDI  AVG+VSR+  N G EHW  VK IL+Y
Subjt:  LALSQTSYINKVL------------------------------------------------------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKY

Query:  LRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLY
        LR T    L +G  D IL GYTD+D   + D+RKS++G +FT +   + WQS  Q C+A ST EA+Y+AA E  KE++WL++F+  L +         +Y
Subjt:  LRRTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLY

Query:  CDNSGAV
        CD+  A+
Subjt:  CDNSGAV

P25600 Putative transposon Ty5-1 protein YCL074W9.8e-2428.16Show/hide
Query:  MDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGY
        MDV T FLN  +++ IY+ QP GF+       V  L   +YGLKQA   WN   +  ++  GF +++ E  +Y +  +    ++ + VDD+L+       
Subjt:  MDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSVDDILLIGNDVGY

Query:  LIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVLS------------------------------------------------RP
           +K+ L   + MKDLG V   LG+ I     N  + LS   YI K  S                                                RP
Subjt:  LIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVLS------------------------------------------------RP

Query:  DICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVY-GAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVK-QGCIANSTMEAKYV
        DI + V ++SR+       H  + + +L+YL  TR+  L Y     L LT Y D+      D   ST G V  L    V W S K +G I   + EA+Y+
Subjt:  DICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVY-GAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVK-QGCIANSTMEAKYV

Query:  AACEVAKEV
         A E   E+
Subjt:  AACEVAKEV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.7e-5531.7Show/hide
Query:  DPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG------------RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL
        +P T  QA+ D   ++W  AM  E+     N  WDLV  P              +K   DG +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L
Subjt:  DPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNG------------RKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSV
         +A    + + Q+DV   FL G L  ++YM QP GFI+  +   VC+L++++YGLKQA R+W +     + + GF  +  +  ++      SI ++++ V
Subjt:  AIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSV

Query:  DDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------------------------
        DDIL+ GND   L    + L+  F +KD  ++H+ LGI+    R    L LSQ  YI  +L                                       
Subjt:  DDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------------------------

Query:  --------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCI
                +RPDI +AV  +S++      EH   +K IL+YL  T N+ + +     L L  Y+D+D+  +KD   ST+G +  L    + W S KQ  +
Subjt:  --------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCI

Query:  ANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGA
          S+ EA+Y +    + E+ W+   +  L +   +T    +YCDN GA
Subjt:  ANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGA

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-5531.25Show/hide
Query:  DPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPN------------GRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL
        +P T  QAM D   D+W  AM  E+     N  WDLV  P              +K   DG +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L
Subjt:  DPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPN------------GRKRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSV
         +A    + + Q+DV   FL G L   +YM QP GF++  +   VCRL+++IYGLKQA R+W +     + + GF  +  +  ++      SI ++++ V
Subjt:  AIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFLILSV

Query:  DDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------------------------
        DDIL+ GND   L +  + L+  F +K+  D+H+ LGI+    R  + L LSQ  Y   +L                                       
Subjt:  DDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVL---------------------------------------

Query:  --------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCI
                +RPD+ +AV  +S+Y      +HW  +K +L+YL  T ++ + +     L L  Y+D+D+  + D   ST+G +  L    + W S KQ  +
Subjt:  --------SRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCI

Query:  ANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGA
          S+ EA+Y +    + E+ W+   +  L +   ++    +YCDN GA
Subjt:  ANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGA

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.1e-5731.42Show/hide
Query:  EDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGR-----------KRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL
        ++P TYN+A   K+   W  AMD E+  M     W++   P  +           K   DG ++ +KARLVAKGYTQ EG+D+ ETFSPV  + S++++L
Subjt:  EDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGR-----------KRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILL

Query:  AIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFI----EPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFL
        AI+  Y++ + Q+D+   FLNG+L++ IYM  P G+     +      VC LK+SIYGLKQASR W + F   +  +GF Q+  +   + KI  +    +
Subjt:  AIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFI----EPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYKKIINSSIAFL

Query:  ILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQI--------VWNRKNKTLALSQTSYIN------------------------------
        ++ VDDI++  N+   +  +K  L + F+++DLG + + LG++I        +  RK     L +T  +                               
Subjt:  ILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQI--------VWNRKNKTLALSQTSYIN------------------------------

Query:  -------KVLSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAK-DLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQG
                 ++R DI FAV  +S++     L H   V  IL Y++ T    L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L   ++ W+S KQ 
Subjt:  -------KVLSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGAK-DLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQG

Query:  CIANSTMEAKYVAACEVAKEVVWLRKFMLNLEV-VSNMTLSITLYCDNSGAV
         ++ S+ EA+Y A      E++WL +F   L++ +S  TL   L+CDN+ A+
Subjt:  CIANSTMEAKYVAACEVAKEVVWLRKFMLNLEV-VSNMTLSITLYCDNSGAV

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.3e-0434.25Show/hide
Query:  LSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGA-KDLILTGYTDSDFQTNKDSRKSTSG
        ++RPD+ FAV  +S++ S         V  +L Y++ T    L Y A  DL L  + DSD+ +  D+R+S +G
Subjt:  LSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNYMLVYGA-KDLILTGYTDSDFQTNKDSRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.9e-1426.99Show/hide
Query:  FLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKV----------------------------------
        +L+L VDDILL G+    L  +   L++ F MKDLG VH+ LGIQI  +     L LSQT Y  ++                                  
Subjt:  FLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKV----------------------------------

Query:  ------------LSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSV
                    L+RPDI +AV +V +      L  +  +K +L+Y++ T  + + ++    L +  + DSD+     +R+ST+G    L   ++ W + 
Subjt:  ------------LSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLRRTRNY-MLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSV

Query:  KQGCIANSTMEAKYVAACEVAKEVVW
        +Q  ++ S+ E +Y A    A E+ W
Subjt:  KQGCIANSTMEAKYVAACEVAKEVVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)7.7e-0837.21Show/hide
Query:  WVIAMDQEMEYMYFNSVWDLVDKPNGR-----------KRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIA
        W  AM +E++ +  N  W LV  P  +           K   DG +   KARLVAKG+ Q EG+ + ET+SPV    +IR +L +A
Subjt:  WVIAMDQEMEYMYFNSVWDLVDKPNGR-----------KRGVDGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMVKSIRILLAIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGGAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGATGATCTGAAGTT
TGTACTAACTGAGGAGTGTCTTCAGGTGCCAAGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGTCAAGGTCTACATGC
TGGCAAGTATGTCTGACATATTAGCCAAGAAGCATGAGGGCATGATAACCGCGAAGGAAATCATGGATTCTGTGCAGGGTATGTTTGGACAACAGTCCACACAGGCCCGA
CACAATGCCCTAAAGTACATATTCAACTCAAGGATGCCAAAGAGTACATCTGTTCGGGATCATGTCTTGGATATGATGGTGCACTTCAACATCGTTGAGTCGAATGGTGC
TTCCATTGATGAGTCGAGCCAGGTCAGCTTCATTCTGGAAATCCTTCCAGATAGTTTCCTACAGTTTAGAAGTAATGTTGTTATGAACAAGCTTACTTTTAACCTTACCT
CCCTTCTCAATGAACTCCAGACTTTTCAATCTTTGATGAAAATTCAGGGATCGAAAGGTGAGACAAATGTTGCCAGTAGGAGTTATCACAGAGGTTCGACCTCTAGGACG
AAAACAGTTGCTTCATCTTATTCTAACCTGAAAGGGAAGAAGAAGAACATGAAGAAGGTTGGTAAAGGAAAACAAGCTGACAGAGCTGCCGCCCAGAAGGGCAAGAAAGT
CAAAAAAGTTGCTGACAAAGGTCCACTGCAACGAAGACGGGCATTGGAAACGGAACTGTCCGAAGTACATTGTAGAAAAGAAGAAGGAAGATATGGGGTTGTCCAAAGAC
ATGTGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTCAAAGTTATGCCTATTCGTAGGATACCCAAAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGAT
AAGGTTTTTGTGTCGACAAATGCCATTTTCTTAGAGGAGGACCACATAAGGGACCACAAACCAAAAAGTAAAGTTGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGGT
TGCAAATAAAACTAATGACAACTGCGAGGATCCATTGACTTATAATCAAGCAATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTATA
TGTACTTCAATTCTGTTTGGGATCTTGTAGATAAGCCTAATGGGAGAAAGCGTGGTGTAGATGGAAAGGTGCAAACCTTTAAAGCTAGACTAGTAGCAAAGGGTTATACC
CAGGTTGAAGGGGTTGACTATGAGGAGACCTTTTCACCTGTTGCTATGGTAAAGTCTATCCGCATCCTGCTTGCCATTGCCACATATTATGACTATGAGGTATGGCAAAT
GGATGTAAAGACAACCTTTTTGAATGGCAATCTTGAGAAAAATATCTACATGGACCAACCCAAAGGGTTCATTGAACCAGGACAAGAGCAAAAGGTTTGCAGGCTTAAAA
GGTCAATTTATGGACTTAAACAAGCCTCTAGGTCCTGGAATATAAGTTTTGATGAGGCAATCAGATCTTATGGTTTTGACCAGAATGATGATGAACCTTGTGTCTACAAG
AAAATCATCAATAGTTCTATCGCATTCCTAATTCTCTCTGTGGATGATATCCTACTCATTGGGAATGATGTAGGTTATCTTATTTACATCAAGGAATGGCTAGCTACGGA
ATTCCAAATGAAAGATTTGGGTGATGTGCACTTTGTTCTTGGGATCCAGATTGTCTGGAACCGCAAGAATAAAACACTAGCCTTGTCTCAAACATCTTACATAAACAAAG
TGTTGTCAAGGCCTGACATCTGTTTTGCAGTTGGAATGGTCAGTAGGTATCAATCCAATCAAGGACTTGAACACTGGACAACGGTTAAAACAATCCTTAAGTATCTTCGG
AGAACAAGGAACTACATGCTTGTGTATGGGGCTAAGGATCTGATCCTTACAGGATACACGGATTCTGACTTTCAGACTAATAAAGATTCTCGGAAATCTACATCAGGGTC
AGTATTTACTCTTAACCGAGAAGTTGTAGTATGGCAAAGCGTCAAGCAAGGATGCATCGCTAATTCCACCATGGAGGCCAAATATGTAGCAGCTTGTGAAGTTGCTAAAG
AGGTCGTTTGGTTAAGGAAATTCATGCTAAATTTGGAAGTTGTTTCAAATATGACTTTGTCCATCACGTTGTATTGCGATAACAGTGGTGCAGTGGCAAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTAGCTCTATAATTTCCTTATTGGGTGCGGAAAAGTTAACCGGAGAGAATATGATGACATGGAAAAACAAACTCAACACTATATTGGTAGTGGATGATCTGAAGTT
TGTACTAACTGAGGAGTGTCTTCAGGTGCCAAGCTCGAATGCGTCACGAAATGTTCGTGATGCATATGATCGATGGATCAAGGCCAATGATAAGGTCAAGGTCTACATGC
TGGCAAGTATGTCTGACATATTAGCCAAGAAGCATGAGGGCATGATAACCGCGAAGGAAATCATGGATTCTGTGCAGGGTATGTTTGGACAACAGTCCACACAGGCCCGA
CACAATGCCCTAAAGTACATATTCAACTCAAGGATGCCAAAGAGTACATCTGTTCGGGATCATGTCTTGGATATGATGGTGCACTTCAACATCGTTGAGTCGAATGGTGC
TTCCATTGATGAGTCGAGCCAGGTCAGCTTCATTCTGGAAATCCTTCCAGATAGTTTCCTACAGTTTAGAAGTAATGTTGTTATGAACAAGCTTACTTTTAACCTTACCT
CCCTTCTCAATGAACTCCAGACTTTTCAATCTTTGATGAAAATTCAGGGATCGAAAGGTGAGACAAATGTTGCCAGTAGGAGTTATCACAGAGGTTCGACCTCTAGGACG
AAAACAGTTGCTTCATCTTATTCTAACCTGAAAGGGAAGAAGAAGAACATGAAGAAGGTTGGTAAAGGAAAACAAGCTGACAGAGCTGCCGCCCAGAAGGGCAAGAAAGT
CAAAAAAGTTGCTGACAAAGGTCCACTGCAACGAAGACGGGCATTGGAAACGGAACTGTCCGAAGTACATTGTAGAAAAGAAGAAGGAAGATATGGGGTTGTCCAAAGAC
ATGTGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTCAAAGTTATGCCTATTCGTAGGATACCCAAAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGAT
AAGGTTTTTGTGTCGACAAATGCCATTTTCTTAGAGGAGGACCACATAAGGGACCACAAACCAAAAAGTAAAGTTGTGTTGAGTGAGTTAGACGGAACAATAGCAAAGGT
TGCAAATAAAACTAATGACAACTGCGAGGATCCATTGACTTATAATCAAGCAATGGTTGACAAAGACAAAGACAAATGGGTCATAGCCATGGACCAAGAAATGGAGTATA
TGTACTTCAATTCTGTTTGGGATCTTGTAGATAAGCCTAATGGGAGAAAGCGTGGTGTAGATGGAAAGGTGCAAACCTTTAAAGCTAGACTAGTAGCAAAGGGTTATACC
CAGGTTGAAGGGGTTGACTATGAGGAGACCTTTTCACCTGTTGCTATGGTAAAGTCTATCCGCATCCTGCTTGCCATTGCCACATATTATGACTATGAGGTATGGCAAAT
GGATGTAAAGACAACCTTTTTGAATGGCAATCTTGAGAAAAATATCTACATGGACCAACCCAAAGGGTTCATTGAACCAGGACAAGAGCAAAAGGTTTGCAGGCTTAAAA
GGTCAATTTATGGACTTAAACAAGCCTCTAGGTCCTGGAATATAAGTTTTGATGAGGCAATCAGATCTTATGGTTTTGACCAGAATGATGATGAACCTTGTGTCTACAAG
AAAATCATCAATAGTTCTATCGCATTCCTAATTCTCTCTGTGGATGATATCCTACTCATTGGGAATGATGTAGGTTATCTTATTTACATCAAGGAATGGCTAGCTACGGA
ATTCCAAATGAAAGATTTGGGTGATGTGCACTTTGTTCTTGGGATCCAGATTGTCTGGAACCGCAAGAATAAAACACTAGCCTTGTCTCAAACATCTTACATAAACAAAG
TGTTGTCAAGGCCTGACATCTGTTTTGCAGTTGGAATGGTCAGTAGGTATCAATCCAATCAAGGACTTGAACACTGGACAACGGTTAAAACAATCCTTAAGTATCTTCGG
AGAACAAGGAACTACATGCTTGTGTATGGGGCTAAGGATCTGATCCTTACAGGATACACGGATTCTGACTTTCAGACTAATAAAGATTCTCGGAAATCTACATCAGGGTC
AGTATTTACTCTTAACCGAGAAGTTGTAGTATGGCAAAGCGTCAAGCAAGGATGCATCGCTAATTCCACCATGGAGGCCAAATATGTAGCAGCTTGTGAAGTTGCTAAAG
AGGTCGTTTGGTTAAGGAAATTCATGCTAAATTTGGAAGTTGTTTCAAATATGACTTTGTCCATCACGTTGTATTGCGATAACAGTGGTGCAGTGGCAAATTAA
Protein sequenceShow/hide protein sequence
MSSSIISLLGAEKLTGENMMTWKNKLNTILVVDDLKFVLTEECLQVPSSNASRNVRDAYDRWIKANDKVKVYMLASMSDILAKKHEGMITAKEIMDSVQGMFGQQSTQAR
HNALKYIFNSRMPKSTSVRDHVLDMMVHFNIVESNGASIDESSQVSFILEILPDSFLQFRSNVVMNKLTFNLTSLLNELQTFQSLMKIQGSKGETNVASRSYHRGSTSRT
KTVASSYSNLKGKKKNMKKVGKGKQADRAAAQKGKKVKKVADKGPLQRRRALETELSEVHCRKEEGRYGVVQRHVLVTNPKKLDSRSKLCLFVGYPKETRGGLFYDPKED
KVFVSTNAIFLEEDHIRDHKPKSKVVLSELDGTIAKVANKTNDNCEDPLTYNQAMVDKDKDKWVIAMDQEMEYMYFNSVWDLVDKPNGRKRGVDGKVQTFKARLVAKGYT
QVEGVDYEETFSPVAMVKSIRILLAIATYYDYEVWQMDVKTTFLNGNLEKNIYMDQPKGFIEPGQEQKVCRLKRSIYGLKQASRSWNISFDEAIRSYGFDQNDDEPCVYK
KIINSSIAFLILSVDDILLIGNDVGYLIYIKEWLATEFQMKDLGDVHFVLGIQIVWNRKNKTLALSQTSYINKVLSRPDICFAVGMVSRYQSNQGLEHWTTVKTILKYLR
RTRNYMLVYGAKDLILTGYTDSDFQTNKDSRKSTSGSVFTLNREVVVWQSVKQGCIANSTMEAKYVAACEVAKEVVWLRKFMLNLEVVSNMTLSITLYCDNSGAVAN