; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017438 (gene) of Snake gourd v1 genome

Gene IDTan0017438
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:33266049..33268987
RNA-Seq ExpressionTan0017438
SyntenyTan0017438
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-26252.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-26252.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-26252.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-26252.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-26252.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.0e-26352.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

A0A5A7TWB9 Gag/pol protein6.0e-26352.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

A0A5A7UGV2 Gag/pol protein6.0e-26352.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

A0A5D3CPJ6 Gag/pol protein6.0e-26352.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

A0A5D3CSZ6 Gag/pol protein6.0e-26352.18Show/hide
Query:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG
        +NT+L++DDLRFVL E+CP +PA NA ++V++ Y+RW KAN+KA+ YILAS+SEVLAKKHE M++ REIM SLQEMFGQ S QI+H++LKY+YN+RM EG
Subjt:  MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEG

Query:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP
        +SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESL +SFLQFRSNAVMNKI Y LTTLLNELQTF+S MK KGQ  GEAN+ + + +F R    G   
Subjt:  SSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSP

Query:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------
          SSSG KK +KKK G   KA  A     K   A KG CFHCN  GHWKRNCPKYLAE K+ K                                     
Subjt:  RSSSSGLKKTQKKKIGGKGKAPAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKK-------------------------------------

Query:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT
                                                                     E  Y ++F++N+ FI K  V ICSAKL+N LYVL+   +
Subjt:  -------------------------------------------------------------ELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKT

Query:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK
        K + N EMFK A TQNKR +ISP  N +LWHLRLG+INLNRIERL KNGLL++LE++SLP CESCL+GKMTKRPFTGKG+RA EPLEL+HSDLCG +N+K
Subjt:  KTIFNHEMFKNANTQNKRQRISPNNNTYLWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIK

Query:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------
        ARGG+EYFI+F DDYS                                                                                    
Subjt:  ARGGYEYFISFIDDYS------------------------------------------------------------------------------------

Query:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV
                                                            RIWGC  HVL  NPKKLEPRS+LC FVGYPK T+GG FYDP++NKV V
Subjt:  ----------------------------------------------------RIWGCTTHVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIV

Query:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY
        STNATFLEEDH+R HKP SK+VLN    E T   TRVV++    +RV   V +S+++   QSL   RRSGRV + P RY  L ET  VI D  +EDPL +
Subjt:  STNATFLEEDHMRNHKPHSKLVLN----EATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPY

Query:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY
        K+AM DVDKD+WIK M+LE+ES+ FNSVW LVDQ +GV+PIGCKWIYKRKR A  KVQTFKARLVAKG+TQ EGVDYEETFSP   LKSIRILLSI  ++
Subjt:  KQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFY

Query:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
        DYEIWQMDVK AFLNGNL+E+IYM Q EGFI   QE+K+CKLNRSIYGLKQAS SWNIRFDTAIKSYGFDQ VDEP VY +IIN+
Subjt:  DYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-2929.01Show/hide
Query:  QFVGYPKETKGGLFYDPEENKVIVSTNATFLEED-----HMRNHKPHSKLVLNEATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLG-----------
        QF+   KE++   F  P +++ I+ T      ++      +++ K  +K  LNE+     R  D     S+     N S +S  ++ L            
Subjt:  QFVGYPKETKGGLFYDPEENKVIVSTNATFLEED-----HMRNHKPHSKLVLNEATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLG-----------

Query:  ----MSRRSGRVVSQPDRYFGLAE---TQVVIPDDGVEDPLP--YKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAAR
            ++RRS R+ ++P   +   +    +VV+    + + +P  + +     DK  W + ++ E+ +   N+ W +  + E    +  +W++  K +   
Subjt:  ----MSRRSGRVVSQPDRYFGLAE---TQVVIPDDGVEDPLP--YKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAAR

Query:  KVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTS
            +KARLVA+GFTQ+  +DYEETF+P   + S R +LS+V  Y+ ++ QMDVK AFLNG L E IYM   +G         VCKLN++IYGLKQA+  
Subjt:  KVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTS

Query:  WNIRFDTAIKSYGFDQNVDEPSVY
        W   F+ A+K   F  +  +  +Y
Subjt:  WNIRFDTAIKSYGFDQNVDEPSVY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-5627.4Show/hide
Query:  LWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIKARGGYEYFISFIDDYSR-----------
        LWH R+G+++   ++ L+K  L++  +  ++ PC+ CL GK  +  F     R +  L+L++SD+CG + I++ GG +YF++FIDD SR           
Subjt:  LWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIKARGGYEYFISFIDDYSR-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------IWGCT--THVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIVSTNATFLEEDHMRNHKPHSKLVLN-
                                  ++GC    HV      KL+ +S  C F+GY  E  G   +DP + KVI S +  F  E  +R     S+ V N 
Subjt:  --------------------------IWGCT--THVLMTNPKKLEPRSRLCQFVGYPKETKGGLFYDPEENKVIVSTNATFLEEDHMRNHKPHSKLVLN-

Query:  --------------------------EATHEPTRVVDQARPSSRVDRGV-NTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPYK
                                  E   +P  V++Q     ++D GV      ++  +     RRS R   +  RY   +   V+I DD   +P   K
Subjt:  --------------------------EATHEPTRVVDQARPSSRVDRGV-NTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPYK

Query:  QAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYD
        + ++  +K+Q +K M  EMES+  N  +KLV+  +G RP+ CKW++K K+D   K+  +KARLV KGF Q++G+D++E FSP + + SIR +LS+    D
Subjt:  QAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYD

Query:  YEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE
         E+ Q+DVK AFL+G+L+E IYM Q EGF    ++  VCKLN+S+YGLKQA   W ++FD+ +KS  + +   +P VY K  +E
Subjt:  YEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE

P92520 Uncharacterized mitochondrial protein AtMg008203.2e-1137.65Show/hide
Query:  WIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSI
        W + M  E++++  N  W LV        +GCKW++K K  +   +   KARLVAKGF Q EG+ + ET+SP +   +IR +L++
Subjt:  WIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.1e-3135.22Show/hide
Query:  VNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEG-VRPIGCKWIYKR
        VN ++Q+ P  +  M  R+   + +P+  + LA +          +P    QA+ D   ++W   M  E+ +   N  W LV      V  +GC+WI+ +
Subjt:  VNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEG-VRPIGCKWIYKR

Query:  KRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGL
        K ++   +  +KARLVAKG+ QR G+DY ETFSP I   SIRI+L +     + I Q+DV  AFL G L + +YMSQ  GFI + +   VCKL +++YGL
Subjt:  KRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGL

Query:  KQASTSWNIRFDTAIKSYGFDQNVDEPSVY
        KQA  +W +     + + GF  +V + S++
Subjt:  KQASTSWNIRFDTAIKSYGFDQNVDEPSVY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.3e-3235.14Show/hide
Query:  PSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLV-DQSEGVRPIGCKWIYKRKRDAARKV
        P  +  M+ R+   + +P++ +  A +          +P    QAM D   D+W + M  E+ +   N  W LV      V  +GC+WI+ +K ++   +
Subjt:  PSQSLGMSRRSGRVVSQPDRYFGLAETQVVIPDDGVEDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLV-DQSEGVRPIGCKWIYKRKRDAARKV

Query:  QTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWN
          +KARLVAKG+ QR G+DY ETFSP I   SIRI+L +     + I Q+DV  AFL G L + +YMSQ  GF+ + +   VC+L ++IYGLKQA  +W 
Subjt:  QTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWN

Query:  IRFDTAIKSYGFDQNVDEPSVY
        +   T + + GF  ++ + S++
Subjt:  IRFDTAIKSYGFDQNVDEPSVY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.5e-3739.06Show/hide
Query:  EDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILL
        ++P  Y +A   +    W   MD E+ +++    W++       +PIGCKW+YK K ++   ++ +KARLVAKG+TQ+EG+D+ ETFSP   L S++++L
Subjt:  EDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILL

Query:  SIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQE----KKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKI
        +I   Y++ + Q+D+  AFLNG+LDE IYM    G+ AR+ +      VC L +SIYGLKQAS  W ++F   +  +GF Q+  + + + KI
Subjt:  SIVTFYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQE----KKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKI

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)2.3e-1237.65Show/hide
Query:  WIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSI
        W + M  E++++  N  W LV        +GCKW++K K  +   +   KARLVAKGF Q EG+ + ET+SP +   +IR +L++
Subjt:  WIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATACGATTCTCGTTGTTGATGACCTTCGGTTCGTACTAACTGAGGATTGTCCTCATATTCCTGCTCGTAACGCTCCTCAATCTGTTAAGAAGGCGTACGACCGCTG
GATCAAGGCCAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAACTCGTGAGATCATGAGTTCACTGC
AGGAAATGTTTGGACAACCGTCTGGACAGATTCGACACGAATCCCTCAAATATGTTTATAACTCTCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTAGATCTG
ATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAGAGTCAGGTATCATTCATCCTGGAATCTCTTTCGAAGAGTTTCCTGCAATTTCGCAGCAA
TGCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTTCCAGTCTCATATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGT
TTTCCCATTCCGTAAGGTTCCGTAGGGTTCATCCTCCGGGACTAAGTCCTCGGAGCTCATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAGGGAAAGGGAAG
GCACCTGCTGCTGACAAAGGCAAGGGAAAAGTCAAGCTTGCAGATAAAGGAAAGTGTTTCCACTGCAACTTGAATGGGCATTGGAAGAGAAATTGCCCAAAATACCTTGC
TGAGCTCAAAGAGAAGAAAGAACTTGGTTATTATATTTCTTTTTCAATCAATGAAGCGTTCATTTTGAAAGAGAGTGTCAATATTTGTTCTGCTAAATTGAAAAATGGCT
TATATGTACTGAAACTGAATAAAACCAAAACAATTTTTAATCATGAGATGTTTAAGAATGCTAACACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAATAATACCTAT
CTGTGGCATCTTAGACTTGGTTATATTAATCTCAACCGAATTGAGAGACTTTCTAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCTTTACCGCCTTGCGAGTCTTG
CTTGGATGGTAAAATGACCAAACGACCTTTTACTGGAAAAGGTTATAGAGCCATAGAACCCTTAGAACTTATACATTCGGATCTTTGTGGTTCGGTGAATATTAAAGCTC
GAGGAGGGTACGAATATTTCATCTCTTTCATAGATGATTATTCGAGGATTTGGGGATGTACGACACACGTACTCATGACAAACCCAAAGAAACTGGAACCTCGTTCAAGA
TTGTGCCAATTTGTTGGCTATCCCAAAGAAACGAAAGGTGGTCTTTTCTACGACCCAGAAGAAAATAAGGTGATTGTATCAACAAACGCCACGTTCTTGGAGGAAGATCA
CATGAGAAACCATAAACCGCATAGTAAATTAGTACTAAATGAAGCTACACATGAACCAACAAGAGTTGTTGATCAAGCTAGACCCTCATCAAGAGTTGATAGAGGAGTCA
ACACCTCAAGTCAGTCTCGTCCTTCTCAATCGTTGGGAATGTCTCGACGTAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTCGGTTTAGCTGAAACTCAAGTTGTC
ATACCTGATGACGGCGTAGAAGATCCATTGCCTTATAAACAGGCAATGAATGACGTAGACAAGGACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAGTGGACTT
CAATTCAGTATGGAAACTTGTAGACCAATCTGAAGGGGTTAGACCCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAGCCAGAAAGGTACAGACCTTTAAGG
CTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCCCTGTATATCTTTGAAGTCTATAAGGATACTCTTGTCCATAGTCACG
TTTTATGATTATGAAATTTGGCAAATGGACGTCAAGATTGCCTTTTTGAATGGTAATCTTGACGAGAGCATCTATATGTCTCAGTCCGAAGGGTTCATAGCACGACGTCA
GGAGAAAAAAGTTTGCAAGCTTAATCGATCCATTTATGGATTGAAACAAGCATCAACATCTTGGAATATAAGATTTGATACTGCGATCAAGTCTTACGGCTTTGACCAAA
ACGTTGACGAGCCTTCTGTTTACAATAAGATCATCAACGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGAATACGATTCTCGTTGTTGATGACCTTCGGTTCGTACTAACTGAGGATTGTCCTCATATTCCTGCTCGTAACGCTCCTCAATCTGTTAAGAAGGCGTACGACCGCTG
GATCAAGGCCAATGATAAGGCCAAGGTCTACATTTTGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAACTCGTGAGATCATGAGTTCACTGC
AGGAAATGTTTGGACAACCGTCTGGACAGATTCGACACGAATCCCTCAAATATGTTTATAACTCTCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTAGATCTG
ATGGTCCACTTCAACGTGGCTGAGATGAACGGAGCGGTCATTGACGAGCAGAGTCAGGTATCATTCATCCTGGAATCTCTTTCGAAGAGTTTCCTGCAATTTCGCAGCAA
TGCGGTGATGAACAAGATAGAGTATAACCTGACTACTCTCCTTAATGAACTACAGACTTTCCAGTCTCATATGAAGAATAAGGGACAGGCTGATGGAGAGGCAAATCTGT
TTTCCCATTCCGTAAGGTTCCGTAGGGTTCATCCTCCGGGACTAAGTCCTCGGAGCTCATCTTCTGGGCTTAAGAAGACCCAAAAGAAGAAAATAGGAGGGAAAGGGAAG
GCACCTGCTGCTGACAAAGGCAAGGGAAAAGTCAAGCTTGCAGATAAAGGAAAGTGTTTCCACTGCAACTTGAATGGGCATTGGAAGAGAAATTGCCCAAAATACCTTGC
TGAGCTCAAAGAGAAGAAAGAACTTGGTTATTATATTTCTTTTTCAATCAATGAAGCGTTCATTTTGAAAGAGAGTGTCAATATTTGTTCTGCTAAATTGAAAAATGGCT
TATATGTACTGAAACTGAATAAAACCAAAACAATTTTTAATCATGAGATGTTTAAGAATGCTAACACTCAAAATAAAAGGCAAAGAATTTCTCCAAATAATAATACCTAT
CTGTGGCATCTTAGACTTGGTTATATTAATCTCAACCGAATTGAGAGACTTTCTAAGAATGGACTTCTAAACAAGTTAGAAGATGATTCTTTACCGCCTTGCGAGTCTTG
CTTGGATGGTAAAATGACCAAACGACCTTTTACTGGAAAAGGTTATAGAGCCATAGAACCCTTAGAACTTATACATTCGGATCTTTGTGGTTCGGTGAATATTAAAGCTC
GAGGAGGGTACGAATATTTCATCTCTTTCATAGATGATTATTCGAGGATTTGGGGATGTACGACACACGTACTCATGACAAACCCAAAGAAACTGGAACCTCGTTCAAGA
TTGTGCCAATTTGTTGGCTATCCCAAAGAAACGAAAGGTGGTCTTTTCTACGACCCAGAAGAAAATAAGGTGATTGTATCAACAAACGCCACGTTCTTGGAGGAAGATCA
CATGAGAAACCATAAACCGCATAGTAAATTAGTACTAAATGAAGCTACACATGAACCAACAAGAGTTGTTGATCAAGCTAGACCCTCATCAAGAGTTGATAGAGGAGTCA
ACACCTCAAGTCAGTCTCGTCCTTCTCAATCGTTGGGAATGTCTCGACGTAGTGGGAGGGTTGTTTCCCAACCTGACCGCTACTTCGGTTTAGCTGAAACTCAAGTTGTC
ATACCTGATGACGGCGTAGAAGATCCATTGCCTTATAAACAGGCAATGAATGACGTAGACAAGGACCAATGGATCAAAGTCATGGACCTTGAAATGGAGTCAGTGGACTT
CAATTCAGTATGGAAACTTGTAGACCAATCTGAAGGGGTTAGACCCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAGCCAGAAAGGTACAGACCTTTAAGG
CTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCCCTGTATATCTTTGAAGTCTATAAGGATACTCTTGTCCATAGTCACG
TTTTATGATTATGAAATTTGGCAAATGGACGTCAAGATTGCCTTTTTGAATGGTAATCTTGACGAGAGCATCTATATGTCTCAGTCCGAAGGGTTCATAGCACGACGTCA
GGAGAAAAAAGTTTGCAAGCTTAATCGATCCATTTATGGATTGAAACAAGCATCAACATCTTGGAATATAAGATTTGATACTGCGATCAAGTCTTACGGCTTTGACCAAA
ACGTTGACGAGCCTTCTGTTTACAATAAGATCATCAACGAATGA
Protein sequenceShow/hide protein sequence
MNTILVVDDLRFVLTEDCPHIPARNAPQSVKKAYDRWIKANDKAKVYILASVSEVLAKKHEGMVSTREIMSSLQEMFGQPSGQIRHESLKYVYNSRMKEGSSVREHVLDL
MVHFNVAEMNGAVIDEQSQVSFILESLSKSFLQFRSNAVMNKIEYNLTTLLNELQTFQSHMKNKGQADGEANLFSHSVRFRRVHPPGLSPRSSSSGLKKTQKKKIGGKGK
APAADKGKGKVKLADKGKCFHCNLNGHWKRNCPKYLAELKEKKELGYYISFSINEAFILKESVNICSAKLKNGLYVLKLNKTKTIFNHEMFKNANTQNKRQRISPNNNTY
LWHLRLGYINLNRIERLSKNGLLNKLEDDSLPPCESCLDGKMTKRPFTGKGYRAIEPLELIHSDLCGSVNIKARGGYEYFISFIDDYSRIWGCTTHVLMTNPKKLEPRSR
LCQFVGYPKETKGGLFYDPEENKVIVSTNATFLEEDHMRNHKPHSKLVLNEATHEPTRVVDQARPSSRVDRGVNTSSQSRPSQSLGMSRRSGRVVSQPDRYFGLAETQVV
IPDDGVEDPLPYKQAMNDVDKDQWIKVMDLEMESVDFNSVWKLVDQSEGVRPIGCKWIYKRKRDAARKVQTFKARLVAKGFTQREGVDYEETFSPCISLKSIRILLSIVT
FYDYEIWQMDVKIAFLNGNLDESIYMSQSEGFIARRQEKKVCKLNRSIYGLKQASTSWNIRFDTAIKSYGFDQNVDEPSVYNKIINE