; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0023209 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0023209
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGag/pol protein
Genome locationchr04:22780726..22786601
RNA-Seq ExpressionIVF0023209
SyntenyIVF0023209
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]0.097.77Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
        GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
Subjt:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]0.097.69Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
        GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
Subjt:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]0.097.74Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVF LNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAK
        GDVIVTQIASTHNVADPFTKPLTAK
Subjt:  GDVIVTQIASTHNVADPFTKPLTAK

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]0.097.84Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEM
        MTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEM
Subjt:  MTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEM

Query:  FGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ
        FGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ
Subjt:  FGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ

Query:  KGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLV
        KGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLV
Subjt:  KGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLV

Query:  ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIY
        ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIY
Subjt:  ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIY

Query:  KNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTG
        KNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFTG
Subjt:  KNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTG

Query:  KGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ
        KGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ
Subjt:  KGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ

Query:  LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCL
        LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCL
Subjt:  LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCL

Query:  FVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIR
        FVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIR
Subjt:  FVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIR

Query:  YMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE
        YMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE
Subjt:  YMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE

Query:  ETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCV
        ETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCV
Subjt:  ETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCV

Query:  YKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV-----------------------
        YKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                       
Subjt:  YKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV-----------------------

Query:  --CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKST
          CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKST
Subjt:  --CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKST

Query:  SGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRG
        SGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRG
Subjt:  SGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRG

Query:  DVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
        DVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
Subjt:  DVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]0.097.74Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAK
        GDVIVTQIASTHNVADPFTKPLTAK
Subjt:  GDVIVTQIASTHNVADPFTKPLTAK

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein0.0e+0097.77Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
        GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
Subjt:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT

A0A5A7TWB9 Gag/pol protein0.0e+0097.74Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVF LNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAK
        GDVIVTQIASTHNVADPFTKPLTAK
Subjt:  GDVIVTQIASTHNVADPFTKPLTAK

A0A5A7TZD7 Gag/pol protein0.0e+0097.84Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEM
        MTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEM
Subjt:  MTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEM

Query:  FGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ
        FGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ
Subjt:  FGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQ

Query:  KGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLV
        KGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLV
Subjt:  KGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLV

Query:  ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIY
        ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIY
Subjt:  ENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIY

Query:  KNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTG
        KNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFTG
Subjt:  KNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTG

Query:  KGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ
        KGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ
Subjt:  KGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQ

Query:  LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCL
        LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCL
Subjt:  LSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCL

Query:  FVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIR
        FVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIR
Subjt:  FVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIR

Query:  YMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE
        YMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE
Subjt:  YMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYE

Query:  ETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCV
        ETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCV
Subjt:  ETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCV

Query:  YKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV-----------------------
        YKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                       
Subjt:  YKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV-----------------------

Query:  --CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKST
          CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKST
Subjt:  --CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKST

Query:  SGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRG
        SGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRG
Subjt:  SGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRG

Query:  DVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
        DVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
Subjt:  DVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT

A0A5A7UGV2 Gag/pol protein0.0e+0097.74Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAK
        GDVIVTQIASTHNVADPFTKPLTAK
Subjt:  GDVIVTQIASTHNVADPFTKPLTAK

A0A5D3CPJ6 Gag/pol protein0.0e+0097.69Show/hide
Query:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNT NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNT-NTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKF+RGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIER VKNGLLSELEENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPI

Query:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
        RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY
Subjt:  RYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDY

Query:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
        EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC
Subjt:  EETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPC

Query:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------
        VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV                      
Subjt:  VYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIV----------------------

Query:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS
           CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY LVYGSKDLILTGYTDSDFQTDRDSRKS
Subjt:  ---CPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKS

Query:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
        TSGSVFTLNGGAVVWRSIKQGCI DSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR
Subjt:  TSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHR

Query:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
        GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT
Subjt:  GDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMPHLT

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.0e-14829.16Show/hide
Query:  NGNNYASWK-NTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDAL
        +G  YA WK     +L   D+  V+    P            E  + W KA   A++ I+  LS+       S +TAR+I+++L  ++ + S   +    
Subjt:  NGNNYASWK-NTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKHDAL

Query:  KYIYNARMNEGASVRE--HVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTR
        K + + +++   S+    H+ + ++   +A   GA I+E  ++S +L +LP  +      A+       LT    + +  +  +KIK    + +      
Subjt:  KYIYNARMNEGASVRE--HVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTR

Query:  KFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKY---LAEKKKAKQGKYDL-----LVLETCLVEN-
          +  + +   ++     N+  K KK  +GN        +  K K     C HC +EGH K++C  Y   L  K K  + +        +      V N 
Subjt:  KFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKY---LAEKKKAKQGKYDL-----LVLETCLVEN-

Query:  ---DDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
           D+  +++DSGA++H+ +     +   ++           G  + A   G +RL      + LE+V    +   NL+SVK L E   S+ F+ + V I
Subjt:  ---DDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEIC-SAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHIN------LNRIERFVKNGLLSELEENSLPVCESCLEGK
         KNG+ +  ++ + NN+ V+                   Q   +    K N  LWH R GHI+      + R   F    LL+ L E S  +CE CL GK
Subjt:  YKNGVEIC-SAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHIN------LNRIERFVKNGLLSELEENSLPVCESCLEGK

Query:  MTKRPF---TGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQ
          + PF     K H  K PL +VHSD+CGP+         YF+ F D ++ Y   YL+++KS+    F+++ A+ E   +  +     D G EY+  + +
Subjt:  MTKRPF---TGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQ

Query:  NYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSV---SETPLKLWNGRKGSLRHFRIWGCPAHV-L
         + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A L  SFWG AV TA Y++N +PS+++   S+TP ++W+ +K  L+H R++G   +V +
Subjt:  NYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSV---SETPLKLWNGRKGSLRHFRIWGCPAHV-L

Query:  ENNPKKLEPRSKLCLFVGY-PKG--------------------------TRGGYFYD--------------PKDNKVFVST----------NATFLEEDH
        +N   K + +S   +FVGY P G                          +R   F                P D++  + T          N  FL++  
Subjt:  ENNPKKLEPRSKLCLFVGY-PKG--------------------------TRGGYFYD--------------PKDNKVFVST----------NATFLEEDH

Query:  IREHK----PRSKIVLNELS------------KETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREP------------------------RRSGRV
          E+K       KI+  E              K++ E +   + E     R  H+  S  +  P   RE                         RRS R+
Subjt:  IREHK----PRSKIVLNELS------------KETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREP------------------------RRSGRV

Query:  TNLP-IRYMSLTETL--TVISDGDI--EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAK
           P I Y     +L   V++   I  + P +F +     DK  W +A+N EL +   N+ W +  +P+    +  +W++  K    G    +KARLVA+
Subjt:  TNLP-IRYMSLTETL--TVISDGDI--EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAK

Query:  GYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSY
        G+TQ   +DYEETF+PVA + S R +LS+   ++ ++ QMDVKTAFLNG L+E IYM+ P+G  I      +CKLN++IYGLKQA+R W   F+ A+K  
Subjt:  GYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSY

Query:  GFDQIVDEPCVY---KRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKI----------
         F     + C+Y   K  IN+++ +++LYVDD+++   D+  + + K++L  +F+M DL E +  +GI+I  + +   + LSQ++Y+ KI          
Subjt:  GFDQIVDEPCVY---KRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKI----------

Query:  -VCPKTPQDV-------EEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLI----LTGYTDSD
         V    P  +       +E  + P  S +G LMY MLCTRPD+  AV I+SRY S      W  +K +L+YL+ T D  L++  K+L     + GY DSD
Subjt:  -VCPKTPQDV-------EEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLI----LTGYTDSD

Query:  FQTDRDSRKSTSGSVFTL-NGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIER
        +      RKST+G +F + +   + W + +Q  +  S+ EAEY+A  EA +EA+WL+  L  + +   +  PI +Y DN G ++ +  P  HKR KHI+ 
Subjt:  FQTDRDSRKSTSGSVFTL-NGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIER

Query:  KYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGL
        KYH  RE V    + +  I + + +AD FTKPL A  F    + LGL
Subjt:  KYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-21034.18Show/hide
Query:  KLNGNN-YASW-KNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKH
        K NG+N +++W +    +LI   L  VL  +  +     A        E WA  +E+A + I   LS+ +        TAR I   L+ ++   +   K 
Subjt:  KLNGNN-YASW-KNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMFGQASYQIKH

Query:  DALKYIYNARMNEGASVREH--VLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVAT
           K +Y   M+EG +   H  V N ++   +A + G  I+E  +   +L SLP S+    +  +  K    L  + + L   E + K    +G+A +  
Subjt:  DALKYIYNARMNEGASVREH--VLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVAT

Query:  STRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYD-------------LLVL
           + Y+ S++                  G  G     A  K+  ++K+    C++CNQ GH+KR+CP     K +    K D             +L +
Subjt:  STRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYD-------------LLVL

Query:  ---ETCL-VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSF---LLLENVYVVPDLKRNLISVKCLLEQSY
           E C+ +   +S W++D+ A++H          +   + G  T+++G         +G   +C++ +    L+L++V  VPDL+ NLIS   L    Y
Subjt:  ---ETCL-VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSF---LLLENVYVVPDLKRNLISVKCLLEQSY

Query:  SLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESC
           F   K  + K  + I        LY   +   +  LN    + ++               LWH R+GH++   ++   K  L+S  +  ++  C+ C
Subjt:  SLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESC

Query:  LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKF
        L GK  +  F     R    L+LV+SD+CGPM +++ GG +YF+TF DD SR  +VY+++ K +  + F+++ A VE    + +K  RSD GGEY   +F
Subjt:  LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKF

Query:  QNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGCP--AHVL
        + Y    GI  + + PGTPQ NGV+ER NRT+++ VRSM+  A LP SFWG AVQTA Y++N  PS  ++ E P ++W  ++ S  H +++GC   AHV 
Subjt:  QNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGCP--AHVL

Query:  ENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNEL----------------SKETTEPSTRVVEEP-----
        +    KL+ +S  C+F+GY     G   +DP   KV  S +  F  E  +R     S+ V N +                ++ TT+  +   E+P     
Subjt:  ENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNEL----------------SKETTEPSTRVVEEP-----

Query:  ------SALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPD
                +  V H       HQP      RRS R      RY S TE + +  D    +P + K+ +   +K++ +KAM  E+ES+  N  + LV+ P 
Subjt:  ------SALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPD

Query:  GVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQE
        G +P+ CKW++K K+  D K+  +KARLV KG+ Q +G+D++E FSPV  + SIR +LS+AA  D E+ Q+DVKTAFL+G+LEE IYM+QPEGF + G++
Subjt:  GVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQE

Query:  QKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVY-KRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIF
          +CKLN+S+YGLKQA R W ++FD+ +KS  + +   +PCVY KR    +   L+LYVDD+L++G D GL+  +K  L+  F MKDLG AQ +LG++I 
Subjt:  QKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVY-KRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIF

Query:  RDRKNKMLALSQASYIDKI-------------------------VCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA
        R+R ++ L LSQ  YI+++                         +CP T ++   M  +PY+SAVGSLMYAM+CTRPDI +AVG+VSR+  NPG  HW A
Subjt:  RDRKNKMLALSQASYIDKI-------------------------VCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTA

Query:  VKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNM
        VK IL+YLR T    L +G  D IL GYTD+D   D D+RKS++G +FT +GGA+ W+S  Q C+  ST EAEY+AA E  KE +WL+ FL +L +    
Subjt:  VKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNM

Query:  SKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGL
         K   +YCD+  A+  S+    H R KHI+ +YH IRE+V    + V +I++  N AD  TK +    FE   E +G+
Subjt:  SKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGL

P25600 Putative transposon Ty5-1 protein YCL074W6.0e-3633.77Show/hide
Query:  MDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGL
        MDV TAFLN  ++E IY++QP GF+       + +L   +YGLKQA   WN   +  +K  GF +   E  +Y R  +    ++ +YVDD+L+      +
Subjt:  MDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGL

Query:  LTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKI---------------VCPKTPQDVEEMRHI----PYASAVGSLMYAMLCTRP
           +KQ L   + MKDLG+    LG+ I +   N  + LS   YI K                +C   P       H+    PY S VG L++     RP
Subjt:  LTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKI---------------VCPKTPQDVEEMRHI----PYASAVGSLMYAMLCTRP

Query:  DICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGS-KDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIK-QGCIVDSTMEAEYV
        DI Y V ++SR+   P   H  + + +L+YL  TR   L Y S   L LT Y D+      D   ST G V  L G  V W S K +G I   + EAEY+
Subjt:  DICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGS-KDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIK-QGCIVDSTMEAEYV

Query:  AACEAAKE
         A E   E
Subjt:  AACEAAKE

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-13226.65Show/hide
Query:  LNMLAADKLNGNNYASW-KNTNTVLIIDDLRFVLVEECPQVPA---ANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMF
        +NM    KL   NY  W +  + +    +L   L       PA    +A   V   Y RW + ++   + +L ++S  +        TA +I ++L++++
Subjt:  LNMLAADKLNGNNYASW-KNTNTVLIIDDLRFVLVEECPQVPA---ANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMF

Query:  GQASY----QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKI
           SY    Q++    ++    +     ++ +++  ++  F+   + G  +D   QV  +LE+LPE +              TLT +   L   ES  KI
Subjt:  GQASY----QIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKI

Query:  KGQKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNK----ANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCP--KYLAEKKKAKQGKYD
                +  +       +T+ T +  + + N ++  +     +K    ++        ++K   G C  C  +GH  + C   ++      ++Q    
Subjt:  KGQKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNK----ANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCP--KYLAEKKKAKQGKYD

Query:  L--------LVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCL
                 L L +    N+   W++DSGAT+H+ S F  + S  Q  TG   + V  G  +     G   L  +   L L N+  VP++ +NLISV  L
Subjt:  L--------LVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCL

Query:  LE------QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENA--HLWHLRLGHINLNRIERFVKNGLLS
                + +  +F V  +     GV +   K ++ LY               +  A +Q   L  SP   A    WH RLGH   + +   + N  LS
Subjt:  LE------QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENA--HLWHLRLGHINLNRIERFVKNGLLS

Query:  ELE-ENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT
         L   +    C  CL  K  K PF+     +  PLE ++SD+     + +   + Y++ F D ++RY ++Y ++ KS+  E F  +K  +EN     I T
Subjt:  ELE-ENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT

Query:  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLR
        F SD GGE++ L    Y  + GI    S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+P +   G   +  
Subjt:  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLR

Query:  HFRIWGCPAH--VLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEE-----------DHIREHKPRSKIVLNELSKETTEPSTRV
          R++GC  +  +   N  KL+ +S+ C+F+GY            + +++++S +  F E              ++E +  S  V    S  TT P+   
Subjt:  HFRIWGCPAH--VLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEE-----------DHIREHKPRSKIVLNELSKETTEPSTRV

Query:  V----------------EEPSALTRVVHVGSS----------------------------------TRTHQ-----------------PQSLREPRRSGR
        V                  PSA  R   V SS                                  T+TH                   QSL  P +S  
Subjt:  V----------------EEPSALTRVVHVGSS----------------------------------TRTHQ-----------------PQSLREPRRSGR

Query:  VTNLPIRYMSLTET--------------LTVISDGDIEDPLTF------------------------------KKAMEDVDKDEWIKAMNLELESMYFNS
         +  P    S + T              L  I + + + PL                                + A++ +  + W  AM  E+ +   N 
Subjt:  VTNLPIRYMSLTET--------------LTVISDGDIEDPLTF------------------------------KKAMEDVDKDEWIKAMNLELESMYFNS

Query:  VWDLVDQPDG-VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQ
         WDLV  P   V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YM Q
Subjt:  VWDLVDQPDG-VKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQ

Query:  PEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEA
        P GFI   +   +CKL +++YGLKQA R+W +     + + GF   V +  ++     KS+ ++++YVDDIL+ GND  LL +    L+ +F +KD  E 
Subjt:  PEGFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEA

Query:  QFVLGIQIFRDRKNKMLALSQASYI------------DKIVCPKTPQDVEEMRH-------IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAH
         + LGI+    R    L LSQ  YI              +  P  P     +           Y   VGSL Y +  TRPDI YAV  +S++   P   H
Subjt:  QFVLGIQIFRDRKNKMLALSQASYI------------DKIVCPKTPQDVEEMRH-------IPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAH

Query:  WTAVKTILKYLRRTRDY-MLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEV
          A+K IL+YL  T ++ + +     L L  Y+D+D+  D+D   ST+G +  L    + W S KQ  +V S+ EAEY +    + E  W+ + L +L +
Subjt:  WTAVKTILKYLRRTRDY-MLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEV

Query:  VPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMP
           +++P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +++   +AD  TKPL+   F+     +G+  +P
Subjt:  VPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.0e-12826.35Show/hide
Query:  LNMLAADKLNGNNYASW-KNTNTVLIIDDLRFVLVEECPQVPA---ANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMF
        +NM    KL   NY  W +  + +    +L   L    P  PA    +A   V   Y RW + ++   + IL ++S  +        TA +I ++L++++
Subjt:  LNMLAADKLNGNNYASW-KNTNTVLIIDDLRFVLVEECPQVPA---ANATRTVREPYERWAKANEKARAYILASLSEVLAKKHESMLTAREIMDSLQEMF

Query:  GQASYQIKHDALKYIYNARMNEGASVREHV--LNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
           SY                       HV  L  +  F+   + G  +D   QV  +LE+LP+ +              +LT +   L   ES +    
Subjt:  GQASYQIKHDALKYIYNARMNEGASVREHV--LNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAK-----------QG
              +  +       +T+  ++    + N      +      ++  +    ++ K   G C  C+ +GH  + CP+    +               Q 
Subjt:  QKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAK-----------QG

Query:  KYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLE---
        + +L V       N    W++DSGAT+H+ S F  + S+ Q  TG   + +  G  +     G   L      L L  V  VP++ +NLISV  L     
Subjt:  KYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLE---

Query:  ---QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELE-ENS
           + +  +F V  +     GV +   K ++ LY     +S+A+    MF +  +         K     WH RLGH +L  +   + N  L  L   + 
Subjt:  ---QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELE-ENS

Query:  LPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGG
        L  C  C   K  K PF+     + +PLE ++SD+     + +   + Y++ F D ++RY ++Y ++ KS+  + F  +K+ VEN     I T  SD GG
Subjt:  LPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGG

Query:  EYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGC
        E++ L+  +YL + GI    S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++N +P+  +  ++P +   G+  +    +++GC
Subjt:  EYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGC

Query:  PAH--VLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEE---------DHIREHKPRSKIVLNELSKETTEPSTRVV--------
          +  +   N  KLE +SK C F+GY               +++ S +  F E                + RS    N  S  TT P+T +V        
Subjt:  PAH--VLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEE---------DHIREHKPRSKIVLNELSKETTEPSTRVV--------

Query:  ----------EEPSAL-----------------------TRVVHVG--SSTRTHQPQ-----------------SLREPRRSGRVTNLPIRYMSLTETLT
                    PS L                       T   H G   + + HQ Q                 S   P ++  +   PI    +    T
Subjt:  ----------EEPSAL-----------------------TRVVHVG--SSTRTHQPQ-----------------SLREPRRSGRVTNLPIRYMSLTETLT

Query:  VISDGDI----------------------------------------------------------EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVW
         IS+ +                                                            +P T  +AM+D   D W +AM  E+ +   N  W
Subjt:  VISDGDI----------------------------------------------------------EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVW

Query:  DLV-DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPE
        DLV   P  V  +GC+WI+ +K  +DG +  +KARLVAKGY Q  G+DY ETFSPV    SIRI+L +A    + I Q+DV  AFL G L + +YM QP 
Subjt:  DLV-DQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPE

Query:  GFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQF
        GF+   +   +C+L ++IYGLKQA R+W +   T + + GF   + +  ++     +S+ ++++YVDDIL+ GND  LL      L+ +F +K+  +  +
Subjt:  GFIIPGQEQKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQF

Query:  VLGIQIFRDRKNKMLALSQASYIDKIVC---------PKTPQDVEEMRHI----------PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWT
         LGI+    R  + L LSQ  Y   ++            TP        +           Y   VGSL Y +  TRPD+ YAV  +S+Y   P   HW 
Subjt:  VLGIQIFRDRKNKMLALSQASYIDKIVC---------PKTPQDVEEMRHI----------PYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWT

Query:  AVKTILKYLRRTRDY-MLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVP
        A+K +L+YL  T D+ + +     L L  Y+D+D+  D D   ST+G +  L    + W S KQ  +V S+ EAEY +    + E  W+ + L +L +  
Subjt:  AVKTILKYLRRTRDY-MLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVP

Query:  NMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMP
         +S P  +YCDN GA      P  H R KHI   YH IR  V  G + V  +++   +AD  TKPL+   F+     +G+  +P
Subjt:  NMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADPFTKPLTAKVFEGHLESLGLRDMP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 86.9e-8035.5Show/hide
Query:  EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILL
        ++P T+ +A E +    W  AM+ E+ +M     W++   P   KPIGCKW+YK K  +DG ++ +KARLVAKGYTQ EG+D+ ETFSPV  L S++++L
Subjt:  EDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILL

Query:  SIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQE----QKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFL
        +I+A +++ + Q+D+  AFLNG+L+E IYM+ P G+     +      +C L +SIYGLKQASR W ++F   +  +GF Q   +   + +I       +
Subjt:  SIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQE----QKICKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFL

Query:  VLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFR--------DRKNKMLALSQASYI---------DKIVCPKTPQDVEEMRHIPYAS
        ++YVDDI++  N+   + ++K  L + F+++DLG  ++ LG++I R         RK  +  L +   +         D  V        + +    Y  
Subjt:  VLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFR--------DRKNKMLALSQASYI---------DKIVCPKTPQDVEEMRHIPYAS

Query:  AVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSK-DLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQ
         +G LMY  + TR DI +AV  +S++   P LAH  AV  IL Y++ T    L Y S+ ++ L  ++D+ FQ+ +D+R+ST+G    L    + W+S KQ
Subjt:  AVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSK-DLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRSIKQ

Query:  GCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIRE
          +  S+ EAEY A   A  E +WL  F  +L++   +SKP  L+CDN+ A+  +     H+R KHIE   H +RE
Subjt:  GCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIRE

AT5G27260.1 unknown protein3.5e-0725.66Show/hide
Query:  WKSDNGTFRPGYLAQLVRMMAE-KLPGCQVRATTVIDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPFPYYDE
        W+  NGT     L    + M E     C+ +       R+K LK  +Q+  +++    SGFGW+   K   A  E++ +++++HP  K L    F ++DE
Subjt:  WKSDNGTFRPGYLAQLVRMMAE-KLPGCQVRATTVIDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPFPYYDE

Query:  LTYVFGRDRATGRQGVDISQDDVRASRPSRASEGRTGSSGSKRKRGSQRDFE
        L  +FG   ATG+  + +            +++G T  +G   ++    DF+
Subjt:  LTYVFGRDRATGRQGVDISQDDVRASRPSRASEGRTGSSGSKRKRGSQRDFE

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.5e-1038.37Show/hide
Query:  TQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNV
        T    L  + K+   LWH RL H++   +E  VK G L   + +SL  CE C+ GK  +  F+   H  K PL+ VHSDL G  +V
Subjt:  TQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.1e-2033.04Show/hide
Query:  FLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVCPKTPQDVEEM-RHIP-----------------
        +L+LYVDDILL G+   LL  +   L++ F MKDLG   + LGIQI        L LSQ  Y ++I+      D + M   +P                 
Subjt:  FLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQASYIDKIVCPKTPQDVEEM-RHIP-----------------

Query:  YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY-MLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRS
        + S VG+L Y  L TRPDI YAV IV +    P LA +  +K +L+Y++ T  + + ++ +  L +  + DSD+     +R+ST+G    L    + W +
Subjt:  YASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDY-MLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFTLNGGAVVWRS

Query:  IKQGCIVDSTMEAEYVAACEAAKEAVW
         +Q  +  S+ E EY A    A E  W
Subjt:  IKQGCIVDSTMEAEYVAACEAAKEAVW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.2e-1537.4Show/hide
Query:  RRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVA
        R    +  L  +Y SLT T T+      ++P +   A++D     W +AM  EL+++  N  W LV  P     +GCKW++K K  +DG +   KARLVA
Subjt:  RRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVWDLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVA

Query:  KGYTQVEGVDYEETFSPVAMLKSIRILLSIA
        KG+ Q EG+ + ET+SPV    +IR +L++A
Subjt:  KGYTQVEGVDYEETFSPVAMLKSIRILLSIA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGGGCTGGAAATCAGATAATGGTACATTCCGCCCAGGATACCTTGCGCAGTTGGTACGCATGATGGCAGAGAAACTACCTGGATGTCAGGTACGAGCAACAACTGT
AATTGATTGTAGGATCAAGACACTGAAGCGAACGTTCCAGGCCATTGCCGAAATGCGGGGCCCAGCGTGCAGTGGCTTTGGGTGGAACGATGAAGAGAAATGTATCGTTG
CGGAGAAGGAATTATTTGATAATTGGGTTAGGTCGCATCCTGCAGCGAAAGGACTCCTGAACAAACCATTCCCGTATTACGACGAGCTTACATATGTATTTGGTCGCGAT
AGGGCGACCGGTCGCCAGGGAGTTGACATCTCGCAGGACGATGTACGTGCATCCCGACCTTCTCGCGCTTCAGAGGGTAGGACCGGATCAAGTGGATCGAAGAGGAAGAG
GGGAAGTCAGCGAGACTTCGAGCTTGAAGCGATACATCTGGCGCTCGACCAAACAAACGAGCAACTCAGGCAGATTGCGGAGTGGCCAGCACGCAACCTTGCAAATGACA
ACCACGTGCGCACGGAATTTTTCCGCATATTGCGTGAGATGCCAGAACTTACGAGTTTGGATAGGGCGTTATTGCAAAGGCATCTTTTGTCTCGTATGGACGACCTTCGG
GGTTTCGTACTCATGCCTGAGGATGAAAGGGAGGGATTCTGTAGAGTCCTCCTACGAGACATAGAGAGAGAGTTTCTTAGTTTGATAATGACGAGTGCTACTTTGAATAT
GCTGGCTGCTGATAAACTTAATGGCAATAATTATGCATCTTGGAAAAATACTAACACTGTGCTAATCATCGATGACCTTAGATTTGTCCTAGTTGAGGAGTGTCCTCAAG
TCCCAGCTGCTAATGCAACTCGAACTGTTCGAGAACCATATGAGCGTTGGGCCAAGGCAAATGAAAAAGCCCGAGCATACATCTTGGCAAGCTTATCTGAAGTATTGGCC
AAGAAACATGAATCAATGCTCACTGCTCGTGAGATTATGGACTCCTTGCAGGAGATGTTTGGTCAGGCCTCTTATCAGATCAAGCATGATGCTCTGAAATACATTTATAA
TGCCCGTATGAATGAGGGAGCCTCAGTGCGAGAACATGTTCTCAATATGATGGTTCATTTCAACGTGGCAGAAATGAATGGGGCTGTCATCGATGAAGCCAGTCAGGTTA
GCTTTATTTTGGAATCTCTGCCAGAGAGTTTCCTACAATTTAGAAGCAATGCTGTTATGAATAAGATTGCTTATACCCTTACCACCCTTCTCAACGAGCTACAGACTTTT
GAGTCTCTGATGAAAATCAAGGGACAGAAGGGAGAGGCAAATGTTGCTACTTCCACAAGAAAGTTCTATAGGGGTTCGACCTCTGGAACTAAGTCTATGCCTTCTTCATC
TGGCAATAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCTAACCTCGCTGCTGCTAAAACGACCAAGAAAGCCAAAGCTGCAAAGGGAATATGTTTCCATT
GCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTACTTGGCAGAAAAGAAGAAGGCTAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAA
AATGATGATTCAGCCTGGATAATAGATTCAGGTGCCACTAATCATGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGCGGCAGTTGGAGACTGGAGAGATGACGATGCG
AGTTGGAACTGGGCATGTCGTCTCAGCAATTGCAGTGGGAGGGCTTCGACTTTGTTTACAGAAATCTTTTCTTTTATTAGAAAATGTATATGTTGTTCCTGATTTAAAAA
GGAATTTGATTTCTGTAAAGTGCTTACTAGAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTTGTTCTGCAAAGTTA
GAAAATAATCTTTATGTGTTAAGATCATTAACATCTAAAGCTCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAAATTTCTCCAAAAGA
AAATGCTCATCTTTGGCACCTAAGATTAGGGCACATAAATCTCAATAGGATTGAGAGATTTGTAAAGAATGGACTTCTAAGTGAGTTAGAAGAAAATTCTTTACCTGTAT
GTGAGTCATGTCTTGAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATAGGGCCAAAGAACCTCTAGAACTTGTACATTCAGATCTATGTGGTCCTATGAAT
GTTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGAAGCCCTTGAAAAGTTCAA
GGAATACAAGGCTGAAGTTGAAAACGCATTAAGTAAAACTATTAAAACATTTCGATCGGATCGAGGTGGAGAGTATATGGATTTGAAATTCCAAAACTATTTGATGGAAT
GTGGAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAGAATGGTGTATCAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTCGGTCTATGATGAGTTACGCT
CACTTACCTAATTCGTTTTGGGGTTATGCAGTGCAAACTGCAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTGTTTCTGAAACACCTTTAAAATTATGGAATGGTCG
TAAAGGTAGTTTACGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTGCTTGAGAATAACCCTAAGAAATTGGAACCTCGTTCAAAATTATGTTTATTTGTAGGCTACC
CCAAAGGAACTAGAGGTGGTTACTTCTATGATCCTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATAAGGGAGCACAAACCGCGT
AGTAAGATAGTATTAAATGAACTTTCCAAAGAAACTACTGAACCTTCAACAAGAGTTGTTGAAGAGCCTAGTGCATTAACAAGAGTTGTTCATGTCGGTTCATCTACTAG
GACACATCAACCTCAATCGTTGAGGGAACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCGTTATATGAGTTTAACTGAAACCTTAACTGTCATATCTGATGGCG
ACATTGAGGATCCATTGACTTTTAAGAAGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAGTCTATGTACTTCAATTCGGTCTGG
GATCTTGTAGATCAACCTGATGGGGTAAAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTTTAAAGCTAGACTAGTGGC
AAAGGGTTATACCCAAGTTGAGGGAGTTGACTATGAGGAGACTTTCTCACCTGTTGCCATGTTAAAGTCTATTCGAATACTTTTGTCCATTGCTGCATATTTTGACTATG
AGATTTGGCAAATGGATGTAAAGACTGCCTTTTTGAATGGCAATCTTGAGGAGACCATCTATATGCAACAACCAGAAGGATTCATAATTCCAGGTCAAGAGCAAAAGATT
TGCAAGCTTAATCGTTCTATTTATGGATTAAAACAAGCTTCTCGATCTTGGAACATAAGATTTGATACCGCAATAAAATCTTATGGATTTGATCAAATCGTTGATGAACC
TTGTGTCTACAAAAGAATCATCAACAAATCAGTAGCTTTCTTAGTTTTGTACGTAGATGATATCCTGCTCATTGGGAATGATATAGGTTTACTAACTGACATCAAACAAT
GGCTAGCAACCCAATTTCAAATGAAAGATTTGGGAGAGGCACAATTTGTTCTGGGTATTCAGATCTTTAGAGATCGTAAGAACAAAATGCTAGCTTTGTCTCAAGCATCG
TATATTGATAAAATAGTTTGTCCTAAGACACCTCAAGACGTTGAGGAAATGAGACATATCCCCTATGCATCAGCTGTTGGCAGCTTGATGTATGCGATGTTATGTACTAG
ACCTGACATCTGTTATGCGGTGGGGATAGTCAGTAGATATCAATCTAATCCAGGATTAGCTCATTGGACTGCCGTTAAAACTATCCTCAAGTATCTTAGGAGAACGAGGG
ACTACATGCTTGTGTATGGTTCTAAGGATTTGATTCTTACAGGATACACAGACTCTGACTTTCAGACTGATAGAGATTCTAGGAAATCTACTTCAGGTTCAGTGTTCACT
CTTAACGGAGGAGCTGTAGTTTGGAGAAGTATCAAGCAAGGATGTATTGTTGACTCCACTATGGAGGCAGAGTACGTTGCAGCTTGTGAAGCTGCCAAAGAGGCTGTTTG
GCTTAGAAATTTCTTGATTGATTTGGAAGTAGTTCCAAACATGTCAAAGCCAATTACTCTTTACTGTGATAATAGTGGGGCTGTGGCTAATTCTAGGGAGCCCAGAAGCC
ACAAGCGTGGAAAGCATATTGAGCGCAAGTATCACTTGATTCGAGAGATAGTGCATCGAGGGGACGTGATCGTCACACAGATAGCTTCGACACACAATGTTGCTGATCCG
TTTACAAAGCCCCTCACGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTGGGTCTACGTGACATGCCACATTTAACCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGGGCTGGAAATCAGATAATGGTACATTCCGCCCAGGATACCTTGCGCAGTTGGTACGCATGATGGCAGAGAAACTACCTGGATGTCAGGTACGAGCAACAACTGT
AATTGATTGTAGGATCAAGACACTGAAGCGAACGTTCCAGGCCATTGCCGAAATGCGGGGCCCAGCGTGCAGTGGCTTTGGGTGGAACGATGAAGAGAAATGTATCGTTG
CGGAGAAGGAATTATTTGATAATTGGGTTAGGTCGCATCCTGCAGCGAAAGGACTCCTGAACAAACCATTCCCGTATTACGACGAGCTTACATATGTATTTGGTCGCGAT
AGGGCGACCGGTCGCCAGGGAGTTGACATCTCGCAGGACGATGTACGTGCATCCCGACCTTCTCGCGCTTCAGAGGGTAGGACCGGATCAAGTGGATCGAAGAGGAAGAG
GGGAAGTCAGCGAGACTTCGAGCTTGAAGCGATACATCTGGCGCTCGACCAAACAAACGAGCAACTCAGGCAGATTGCGGAGTGGCCAGCACGCAACCTTGCAAATGACA
ACCACGTGCGCACGGAATTTTTCCGCATATTGCGTGAGATGCCAGAACTTACGAGTTTGGATAGGGCGTTATTGCAAAGGCATCTTTTGTCTCGTATGGACGACCTTCGG
GGTTTCGTACTCATGCCTGAGGATGAAAGGGAGGGATTCTGTAGAGTCCTCCTACGAGACATAGAGAGAGAGTTTCTTAGTTTGATAATGACGAGTGCTACTTTGAATAT
GCTGGCTGCTGATAAACTTAATGGCAATAATTATGCATCTTGGAAAAATACTAACACTGTGCTAATCATCGATGACCTTAGATTTGTCCTAGTTGAGGAGTGTCCTCAAG
TCCCAGCTGCTAATGCAACTCGAACTGTTCGAGAACCATATGAGCGTTGGGCCAAGGCAAATGAAAAAGCCCGAGCATACATCTTGGCAAGCTTATCTGAAGTATTGGCC
AAGAAACATGAATCAATGCTCACTGCTCGTGAGATTATGGACTCCTTGCAGGAGATGTTTGGTCAGGCCTCTTATCAGATCAAGCATGATGCTCTGAAATACATTTATAA
TGCCCGTATGAATGAGGGAGCCTCAGTGCGAGAACATGTTCTCAATATGATGGTTCATTTCAACGTGGCAGAAATGAATGGGGCTGTCATCGATGAAGCCAGTCAGGTTA
GCTTTATTTTGGAATCTCTGCCAGAGAGTTTCCTACAATTTAGAAGCAATGCTGTTATGAATAAGATTGCTTATACCCTTACCACCCTTCTCAACGAGCTACAGACTTTT
GAGTCTCTGATGAAAATCAAGGGACAGAAGGGAGAGGCAAATGTTGCTACTTCCACAAGAAAGTTCTATAGGGGTTCGACCTCTGGAACTAAGTCTATGCCTTCTTCATC
TGGCAATAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCTAACCTCGCTGCTGCTAAAACGACCAAGAAAGCCAAAGCTGCAAAGGGAATATGTTTCCATT
GCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTACTTGGCAGAAAAGAAGAAGGCTAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAA
AATGATGATTCAGCCTGGATAATAGATTCAGGTGCCACTAATCATGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGCGGCAGTTGGAGACTGGAGAGATGACGATGCG
AGTTGGAACTGGGCATGTCGTCTCAGCAATTGCAGTGGGAGGGCTTCGACTTTGTTTACAGAAATCTTTTCTTTTATTAGAAAATGTATATGTTGTTCCTGATTTAAAAA
GGAATTTGATTTCTGTAAAGTGCTTACTAGAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTTGTTCTGCAAAGTTA
GAAAATAATCTTTATGTGTTAAGATCATTAACATCTAAAGCTCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAAATTTCTCCAAAAGA
AAATGCTCATCTTTGGCACCTAAGATTAGGGCACATAAATCTCAATAGGATTGAGAGATTTGTAAAGAATGGACTTCTAAGTGAGTTAGAAGAAAATTCTTTACCTGTAT
GTGAGTCATGTCTTGAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATAGGGCCAAAGAACCTCTAGAACTTGTACATTCAGATCTATGTGGTCCTATGAAT
GTTAAAGCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGAAGCCCTTGAAAAGTTCAA
GGAATACAAGGCTGAAGTTGAAAACGCATTAAGTAAAACTATTAAAACATTTCGATCGGATCGAGGTGGAGAGTATATGGATTTGAAATTCCAAAACTATTTGATGGAAT
GTGGAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAGAATGGTGTATCAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTCGGTCTATGATGAGTTACGCT
CACTTACCTAATTCGTTTTGGGGTTATGCAGTGCAAACTGCAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTGTTTCTGAAACACCTTTAAAATTATGGAATGGTCG
TAAAGGTAGTTTACGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTGCTTGAGAATAACCCTAAGAAATTGGAACCTCGTTCAAAATTATGTTTATTTGTAGGCTACC
CCAAAGGAACTAGAGGTGGTTACTTCTATGATCCTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCACATAAGGGAGCACAAACCGCGT
AGTAAGATAGTATTAAATGAACTTTCCAAAGAAACTACTGAACCTTCAACAAGAGTTGTTGAAGAGCCTAGTGCATTAACAAGAGTTGTTCATGTCGGTTCATCTACTAG
GACACATCAACCTCAATCGTTGAGGGAACCTCGACGAAGTGGGAGGGTTACAAACTTACCTATTCGTTATATGAGTTTAACTGAAACCTTAACTGTCATATCTGATGGCG
ACATTGAGGATCCATTGACTTTTAAGAAGGCAATGGAGGATGTGGATAAAGATGAATGGATCAAAGCTATGAATCTTGAATTGGAGTCTATGTACTTCAATTCGGTCTGG
GATCTTGTAGATCAACCTGATGGGGTAAAACCTATAGGTTGTAAATGGATCTACAAGAGAAAAAGAGGTGCAGATGGTAAGGTACAAACTTTTAAAGCTAGACTAGTGGC
AAAGGGTTATACCCAAGTTGAGGGAGTTGACTATGAGGAGACTTTCTCACCTGTTGCCATGTTAAAGTCTATTCGAATACTTTTGTCCATTGCTGCATATTTTGACTATG
AGATTTGGCAAATGGATGTAAAGACTGCCTTTTTGAATGGCAATCTTGAGGAGACCATCTATATGCAACAACCAGAAGGATTCATAATTCCAGGTCAAGAGCAAAAGATT
TGCAAGCTTAATCGTTCTATTTATGGATTAAAACAAGCTTCTCGATCTTGGAACATAAGATTTGATACCGCAATAAAATCTTATGGATTTGATCAAATCGTTGATGAACC
TTGTGTCTACAAAAGAATCATCAACAAATCAGTAGCTTTCTTAGTTTTGTACGTAGATGATATCCTGCTCATTGGGAATGATATAGGTTTACTAACTGACATCAAACAAT
GGCTAGCAACCCAATTTCAAATGAAAGATTTGGGAGAGGCACAATTTGTTCTGGGTATTCAGATCTTTAGAGATCGTAAGAACAAAATGCTAGCTTTGTCTCAAGCATCG
TATATTGATAAAATAGTTTGTCCTAAGACACCTCAAGACGTTGAGGAAATGAGACATATCCCCTATGCATCAGCTGTTGGCAGCTTGATGTATGCGATGTTATGTACTAG
ACCTGACATCTGTTATGCGGTGGGGATAGTCAGTAGATATCAATCTAATCCAGGATTAGCTCATTGGACTGCCGTTAAAACTATCCTCAAGTATCTTAGGAGAACGAGGG
ACTACATGCTTGTGTATGGTTCTAAGGATTTGATTCTTACAGGATACACAGACTCTGACTTTCAGACTGATAGAGATTCTAGGAAATCTACTTCAGGTTCAGTGTTCACT
CTTAACGGAGGAGCTGTAGTTTGGAGAAGTATCAAGCAAGGATGTATTGTTGACTCCACTATGGAGGCAGAGTACGTTGCAGCTTGTGAAGCTGCCAAAGAGGCTGTTTG
GCTTAGAAATTTCTTGATTGATTTGGAAGTAGTTCCAAACATGTCAAAGCCAATTACTCTTTACTGTGATAATAGTGGGGCTGTGGCTAATTCTAGGGAGCCCAGAAGCC
ACAAGCGTGGAAAGCATATTGAGCGCAAGTATCACTTGATTCGAGAGATAGTGCATCGAGGGGACGTGATCGTCACACAGATAGCTTCGACACACAATGTTGCTGATCCG
TTTACAAAGCCCCTCACGGCTAAGGTGTTTGAGGGTCACCTAGAGAGTCTGGGTCTACGTGACATGCCACATTTAACCTAG
Protein sequenceShow/hide protein sequence
MGGWKSDNGTFRPGYLAQLVRMMAEKLPGCQVRATTVIDCRIKTLKRTFQAIAEMRGPACSGFGWNDEEKCIVAEKELFDNWVRSHPAAKGLLNKPFPYYDELTYVFGRD
RATGRQGVDISQDDVRASRPSRASEGRTGSSGSKRKRGSQRDFELEAIHLALDQTNEQLRQIAEWPARNLANDNHVRTEFFRILREMPELTSLDRALLQRHLLSRMDDLR
GFVLMPEDEREGFCRVLLRDIEREFLSLIMTSATLNMLAADKLNGNNYASWKNTNTVLIIDDLRFVLVEECPQVPAANATRTVREPYERWAKANEKARAYILASLSEVLA
KKHESMLTAREIMDSLQEMFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNVAEMNGAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTF
ESLMKIKGQKGEANVATSTRKFYRGSTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTTKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLVE
NDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKL
ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERFVKNGLLSELEENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMN
VKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYA
HLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPR
SKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQPQSLREPRRSGRVTNLPIRYMSLTETLTVISDGDIEDPLTFKKAMEDVDKDEWIKAMNLELESMYFNSVW
DLVDQPDGVKPIGCKWIYKRKRGADGKVQTFKARLVAKGYTQVEGVDYEETFSPVAMLKSIRILLSIAAYFDYEIWQMDVKTAFLNGNLEETIYMQQPEGFIIPGQEQKI
CKLNRSIYGLKQASRSWNIRFDTAIKSYGFDQIVDEPCVYKRIINKSVAFLVLYVDDILLIGNDIGLLTDIKQWLATQFQMKDLGEAQFVLGIQIFRDRKNKMLALSQAS
YIDKIVCPKTPQDVEEMRHIPYASAVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLAHWTAVKTILKYLRRTRDYMLVYGSKDLILTGYTDSDFQTDRDSRKSTSGSVFT
LNGGAVVWRSIKQGCIVDSTMEAEYVAACEAAKEAVWLRNFLIDLEVVPNMSKPITLYCDNSGAVANSREPRSHKRGKHIERKYHLIREIVHRGDVIVTQIASTHNVADP
FTKPLTAKVFEGHLESLGLRDMPHLT