CuGenDBv2

Cmc09g0252271 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0252271
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag/pol protein
Genome location: CMiso1.1chr09:18381344..18383695
RNA-Seq Expression: Cmc09g0252271
Synteny: Cmc09g0252271
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 98.85)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 98.72)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 98.85)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 98.85)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 98.72)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein (e value: 0.0e+00; % identity: 98.85)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

A0A5A7TWB9 Gag/pol protein (e value: 0.0e+00; % identity: 98.85)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

A0A5A7UGV2 Gag/pol protein (e value: 0.0e+00; % identity: 98.85)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

A0A5A7V4M1 Gag/pol protein (e value: 0.0e+00; % identity: 98.72)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLV+ECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

A0A5D3CPJ6 Gag/pol protein (e value: 0.0e+00; % identity: 98.72)
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVP ANATRTVREPYERWAKANEKARAYILASLSEVLA KHESMLTAREIMDSLQE
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQE

Query:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFN+AEMNGAVIDEASQVSFILESL ESFLQFRSNAVMNKIAYTLTT+LNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
        QKGEANVATSTRKFHRG TSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKT KK KAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
        VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHV+SAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFI

Query:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT
        YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSEL+ENSLPVCESCLEGKMTKRPFT
Subjt:  YKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFT

Query:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
        GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS
Subjt:  GKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVS

Query:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
        QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC
Subjt:  QLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLC

Query:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
        LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
Subjt:  LFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 2.5e-59; % identity: 26.79)
Query:  NGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMFGQASYQIKHDAL
        +G  YA WK  I  +L   D+  V+    P            E  + W KA   A++ I+  LS+       S +TAR+I+++L  ++ + S   +    
Subjt:  NGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMFGQASYQIKHDAL

Query:  KYIYNARMNEGASVRE--HVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQ-FRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVATST
        K + + +++   S+    H+ + ++   +A   GA I+E  ++S +L +L   +     +   +++   TL  + N L   +  +KIK    + +     
Subjt:  KYIYNARMNEGASVRE--HVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQ-FRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVATST

Query:  RKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKY---LAEKKKAKQGKYDL-----LVLETCLVEN
           H    +   ++     N+  K KK  +GN        +  K K     C HC +EGH K++C  Y   L  K K  + +        +      V N
Subjt:  RKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKY---LAEKKKAKQGKYDL-----LVLETCLVEN

Query:  ----DDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVF
            D+  +++DSGA++H+ +     +   ++           G  I A   G +RL      + LE+V    +   NL+SVK L E   S+ F+ + V 
Subjt:  ----DDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVF

Query:  IYKNGVEIC-SAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELK-----ENSLPVCESCLEGK
        I KNG+ +  ++ + NN+ V+                   Q   +    K N  LWH R GHI+  ++  + +  + S+       E S  +CE CL GK
Subjt:  IYKNGVEIC-SAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELK-----ENSLPVCESCLEGK

Query:  MTKRPF---TGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQ
          + PF     K H  K PL +VHSD+CGP+         YF+ F D ++ Y   YL+++KS+    F+++ A+ E   +  +     D G EY+  + +
Subjt:  MTKRPF---TGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQ

Query:  NYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSV---SETPLKLWNGRKGSLRHFRIWGCPAHV-L
         + ++ GI   L+ P TPQ NGVSER  RT+ +  R+M+S A L  SFWG AV TA Y++N +PS+++   S+TP ++W+ +K  L+H R++G   +V +
Subjt:  NYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSV---SETPLKLWNGRKGSLRHFRIWGCPAHV-L

Query:  ENNPKKLEPRSKLCLFVGY-PKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKET
        +N   K + +S   +FVGY P G +    +D  + K  V+ +    E + +     + + V  + SKE+
Subjt:  ENNPKKLEPRSKLCLFVGY-PKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKET

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 3.5e-77; % identity: 29.59)
Query:  KLNGNN-YASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMFGQASYQIKH
        K NG+N +++W+  +  +LI   L  VL  +  +  T  A        E WA  +E+A + I   LS+ +        TAR I   L+ ++   +   K 
Subjt:  KLNGNN-YASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMFGQASYQIKH

Query:  DALKYIYNARMNEGASVREH--VLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVAT
           K +Y   M+EG +   H  V N ++   +A + G  I+E  +   +L SL  S+    +  +  K    L  + + L   E + K    +G+A +  
Subjt:  DALKYIYNARMNEGASVREH--VLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVAT

Query:  STRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYD-------------LLVL
              RG     +S   SS N       G  G     A  K+  ++K+    C++CNQ GH+KR+CP     K +    K D             +L +
Subjt:  STRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYD-------------LLVL

Query:  ---ETCL-VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSF---LLLENVYVVPDLKRNLISVKCLLEQSY
           E C+ +   +S W++D+ A++H          +   + G  T+++G         +G   +C++ +    L+L++V  VPDL+ NLIS   L    Y
Subjt:  ---ETCL-VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSF---LLLENVYVVPDLKRNLISVKCLLEQSY

Query:  SLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESC
           F   K  + K  + I        LY   +   +  LN    + ++               LWH R+GH++   ++ L K  L+S  K  ++  C+ C
Subjt:  SLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESC

Query:  LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKF
        L GK  +  F     R    L+LV+SD+CGPM +++ GG +YF+TF DD SR  +VY+++ K +  + F+++ A VE    + +K  RSD GGEY   +F
Subjt:  LEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKF

Query:  QNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGCP--AHVL
        + Y    GI  + + PGTPQ NGV+ER NRT+++ VRSM+  A LP SFWG AVQTA Y++N  PS  ++ E P ++W  ++ S  H +++GC   AHV 
Subjt:  QNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGCP--AHVL

Query:  ENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKE-TTEPST
        +    KL+ +S  C+F+GY     G   +DP   KV  S +  F  E  +R     S+ V N +     T PST
Subjt:  ENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDHIREHKPRSKIVLNELSKE-TTEPST

Q12491 Transposon Ty2-B Gag-Pol polyprotein | 4.3e-27 | 23.4
Query:  NGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNK
        N   + +      IL+ LS  F   R N    K    L+ +  E+Q      KI           S  K H    + +++ P+++  K   +      ++
Subjt:  NGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNK

Query:  ANLAAAKTAKKAKAAKGICFHCNQEGHWKRN--CPKYLAEKKKAKQGKYDLLVLETCLVENDDSA---WIIDSGATNHVCSSFQGISSWRQLETGEMTMR
         N +  + AK    A    F      H   +    +YL++  +   G+       T  ++++D      +IDSGA+  +  S   +         E+ + 
Subjt:  ANLAAAKTAKKAKAAKGICFHCNQEGHWKRN--CPKYLAEKKKAKQGKYDLLVLETCLVENDDSA---WIIDSGATNHVCSSFQGISSWRQLETGEMTMR

Query:  VGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQ
              I   A+G L    Q            P++  +L+S+  L  Q+ +  F  N      +G  +       + Y    L+ K L+ + + K  I  
Subjt:  VGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQ

Query:  NKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLP-------VCESCLEGKMTKRPFTGKGHRAK-----EPLELVHSDLCGPMNVKARG
          + K   K    L H  LGH N   I++ +K   ++ LKE+ +         C  CL GK TK     KG R K     EP + +H+D+ GP++   + 
Subjt:  NKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLP-------VCESCLEGKMTKRPFTGKGHRAK-----EPLELVHSDLCGPMNVKARG

Query:  GFEYFITFTDDYSRYGYVYLMQHKSE--ALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMV
           YFI+FTD+ +R+ +VY +  + E   L  F    A ++N  +  +   + DRG EY +     +    GI +  +     + +GV+ER NRTLL+  
Subjt:  GFEYFITFTDDYSRYGYVYLMQHKSE--ALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMV

Query:  RSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKG-SLRHFRIWGCPAHVLENNP-KKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVF
        R+++  + LPN  W  AV+ +  I N + S   ++   +   G  G  +     +G P  V  +NP  K+ PR      +   + + G   Y P   K  
Subjt:  RSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKG-SLRHFRIWGCPAHVLENNP-KKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVF

Query:  VSTNATFLEED
         +TN   L+++
Subjt:  VSTNATFLEED

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 1.0e-52 | 24.21
Query:  LNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPT---ANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMF
        +NM    KL   NY  W   ++ +    +L   L       P     +A   V   Y RW + ++   + +L ++S  +        TA +I ++L++++
Subjt:  LNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPT---ANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMF

Query:  GQASY----QIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKI
           SY    Q++    ++    +     ++ +++  ++  F+   + G  +D   QV  +LE+L E +              TLT I   L   ES  KI
Subjt:  GQASY----QIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKI

Query:  KGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNK----ANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCP--KYLAEKKKAKQGKYD
                +  +        T+ T +  + + N ++  +     +K    ++        ++K   G C  C  +GH  + C   ++      ++Q    
Subjt:  KGQKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNK----ANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCP--KYLAEKKKAKQGKYD

Query:  L--------LVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCL
                 L L +    N+   W++DSGAT+H+ S F  + S  Q  TG   + V  G  I     G   L  +   L L N+  VP++ +NLISV  L
Subjt:  L--------LVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCL

Query:  LE------QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENA--HLWHLRLGHINLNRIERLVKNGLLS
                + +  +F V  +     GV +   K ++ LY               +  A +Q   L  SP   A    WH RLGH   + +  ++ N  LS
Subjt:  LE------QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENA--HLWHLRLGHINLNRIERLVKNGLLS

Query:  ELK-ENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT
         L   +    C  CL  K  K PF+     +  PLE ++SD+     + +   + Y++ F D ++RY ++Y ++ KS+  E F  +K  +EN     I T
Subjt:  ELK-ENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKT

Query:  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLR
        F SD GGE++ L    Y  + GI    S P TP+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+P +   G   +  
Subjt:  FRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLR

Query:  HFRIWGCPAH--VLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLE
          R++GC  +  +   N  KL+ +S+ C+F+GY            + +++++S +  F E
Subjt:  HFRIWGCPAH--VLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 3.9e-52 | 25.14
Query:  LNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPT---ANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMF
        +NM    KL   NY  W   ++ +    +L   L    P  P     +A   V   Y RW + ++   + IL ++S  +        TA +I ++L++++
Subjt:  LNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPT---ANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMF

Query:  GQASYQIKHDALKYIYNARMNEGASVREHV--LNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG
           SY                       HV  L  +  F+   + G  +D   QV  +LE+L + +              +LT I   L   ES + +  
Subjt:  GQASYQIKHDALKYIYNARMNEGASVREHV--LNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKG

Query:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKW--KKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAK-----------
           E    T+    HR  T+  ++  +   N+ +     +      ++  +    ++ K   G C  C+ +GH  + CP+    +               
Subjt:  QKGEANVATSTRKFHRGLTSGTKSMPSSSGNKKW--KKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAK-----------

Query:  QGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLE-
        Q + +L V       N    W++DSGAT+H+ S F  + S+ Q  TG   + +  G  I     G   L      L L  V  VP++ +NLISV  L   
Subjt:  QGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLE-

Query:  -----QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELK-E
             + +  +F V  +     GV +   K ++ LY     +S+A+    MF +  +         K     WH RLGH +L  +  ++ N  L  L   
Subjt:  -----QSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELK-E

Query:  NSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDR
        + L  C  C   K  K PF+     + +PLE ++SD+     + +   + Y++ F D ++RY ++Y ++ KS+  + F  +K+ VEN     I T  SD 
Subjt:  NSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDR

Query:  GGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIW
        GGE++ L+  +YL + GI    S P TP+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++N +P+  +  ++P +   G+  +    +++
Subjt:  GGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIW

Query:  GCPAH--VLENNPKKLEPRSKLCLFVGY
        GC  +  +   N  KLE +SK C F+GY
Subjt:  GCPAH--VLENNPKKLEPRSKLCLFVGY

Arabidopsis top hits | e value | %identity | Alignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein | 1.2e-11 | 40.7
Query:  TQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNV
        T    L  + K+   LWH RL H++   +E LVK G L   K +SL  CE C+ GK  +  F+   H  K PL+ VHSDL G  +V
Subjt:  TQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 3.9e-07 | 35.37
Query:  NRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSK
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS +++   P ++W     +  + R +GC A++   +  KL+PR+K
Subjt:  NRTLLDMVRSMMSYAHLPNSFWGYAVQTAVYILNCVPSKSVS-ETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSK
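
The %identity values listed for each hit can be approximated from the middle row of an alignment, where a letter marks an identical residue and "+" marks a conservative substitution. The sketch below assumes identity is defined as identical columns divided by all aligned columns (gaps included); the function name `percent_identity` is hypothetical, and this record does not state how the database computes the figure.

```python
def percent_identity(query_row: str, midline_row: str) -> float:
    """Estimate percent identity from one aligned query row and its midline.

    Assumption: a letter in the midline means the subject residue is
    identical to the query residue; '+' and blanks are non-identical.
    """
    aligned_columns = len(query_row)  # includes gap characters ('-')
    if aligned_columns == 0:
        raise ValueError("empty alignment row")
    # Trailing blanks in the midline may have been stripped, so only
    # the letters are counted rather than comparing row lengths.
    identical = sum(1 for ch in midline_row if ch.isalpha())
    return round(100.0 * identical / aligned_columns, 2)


# Toy example (not taken from this record): 3 identical columns out of 6.
toy_query = "MKV-LA"
toy_midline = "MK  +A"
toy_identity = percent_identity(toy_query, toy_midline)
```

For a multi-block alignment, the query rows and midline rows would need to be concatenated across blocks before calling the function.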


Sequences
CDS sequence
ATGACGAGTGCTACTTTGAATATGCTGGCTGCTGATAAACTTAATGGCAATAATTATGCATCTTGGAAAAATACTATCAACACTGTGCTAATCATCGATGACCTT
AGATTTGTCCTAGTTGAGGAGTGTCCTCAAGTCCCAACTGCTAATGCAACTCGAACTGTTCGAGAACCATATGAGCGTTGGGCCAAGGCAAATGAAAAAGCCCGA
GCATACATCTTGGCAAGCTTATCTGAAGTATTGGCCATGAAACATGAATCAATGCTCACTGCTCGTGAGATTATGGACTCCTTGCAGGAGATGTTTGGTCAGGCC
TCTTATCAGATCAAGCATGATGCTCTGAAATACATTTATAATGCCCGTATGAATGAGGGAGCCTCAGTGCGAGAACATGTTCTCAATATGATGGTTCATTTCAAC
ATGGCAGAAATGAATGGGGCTGTCATCGATGAAGCCAGTCAGGTTAGCTTTATTTTGGAATCTCTGTCAGAGAGTTTCCTACAATTTAGAAGCAATGCTGTTATG
AATAAGATTGCTTATACCCTTACCACCATTCTCAACGAGCTACAGACTTTTGAGTCTCTGATGAAAATCAAGGGACAGAAGGGAGAGGCAAATGTTGCTACTTCC
ACAAGAAAGTTCCATAGGGGTTTGACCTCTGGAACTAAGTCTATGCCTTCTTCATCTGGCAATAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCT
AACCTCGCTGCTGCTAAAACGGCCAAGAAAGCCAAAGCTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTACTTG
GCAGAAAAGAAGAAGGCTAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAAAATGATGATTCAGCCTGGATAATAGATTCAGGTGCCACT
AATCATGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGCGGCAGTTGGAGACTGGAGAGATGACGATGCGAGTTGGAACTGGGCATGTCATCTCAGCAATTGCA
GTGGGAGGGCTTCGACTTTGTTTACAGAAATCTTTTCTTTTATTAGAAAATGTATATGTTGTTCCTGATTTAAAAAGGAATTTGATTTCTGTAAAGTGCTTACTA
GAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTTGTTCTGCAAAGTTAGAAAATAATCTTTATGTGTTAAGA
TCATTAACATCTAAAGCTCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAAATTTCTCCAAAAGAAAATGCTCATCTTTGGCAC
CTAAGATTAGGGCACATAAATCTCAATAGGATTGAGAGATTAGTAAAGAATGGACTTTTAAGTGAGTTAAAAGAAAATTCTTTACCTGTATGTGAGTCATGCCTT
GAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATAGGGCCAAAGAACCTTTAGAACTTGTACATTCAGATCTATGTGGTCCTATGAATGTTAAAGCA
AGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGAAGCCCTTGAAAAGTTCAAGGAA
TACAAGGCTGAAGTTGAAAACGCATTAAGTAAAACTATTAAAACATTTCGATCGGATCGAGGTGGAGAGTATATGGATTTGAAATTCCAAAACTATTTGATGGAA
TGTGGAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTCGGTCTATGATGAGT
TACGCTCACTTACCTAATTCGTTTTGGGGTTATGCAGTGCAAACTGCAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTGTTTCTGAAACACCTTTAAAATTA
TGGAATGGTCGTAAAGGTAGTTTACGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTGCTTGAGAATAACCCTAAGAAATTGGAACCTCGTTCAAAATTATGT
TTATTTGTAGGCTACCCCAAAGGAACTAGAGGTGGTTACTTCTATGATCCTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCAC
ATAAGGGAGCACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCCAAAGAAACTACTGAACCTTCAACAAGAGTTGTTGAAGAGCCTAGTGCATTAACAAGA
GTTGTTCATGTCGGCTCATCTACTAGGACACATCAACCTTAA
mRNA sequence
ATGACGAGTGCTACTTTGAATATGCTGGCTGCTGATAAACTTAATGGCAATAATTATGCATCTTGGAAAAATACTATCAACACTGTGCTAATCATCGATGACCTT
AGATTTGTCCTAGTTGAGGAGTGTCCTCAAGTCCCAACTGCTAATGCAACTCGAACTGTTCGAGAACCATATGAGCGTTGGGCCAAGGCAAATGAAAAAGCCCGA
GCATACATCTTGGCAAGCTTATCTGAAGTATTGGCCATGAAACATGAATCAATGCTCACTGCTCGTGAGATTATGGACTCCTTGCAGGAGATGTTTGGTCAGGCC
TCTTATCAGATCAAGCATGATGCTCTGAAATACATTTATAATGCCCGTATGAATGAGGGAGCCTCAGTGCGAGAACATGTTCTCAATATGATGGTTCATTTCAAC
ATGGCAGAAATGAATGGGGCTGTCATCGATGAAGCCAGTCAGGTTAGCTTTATTTTGGAATCTCTGTCAGAGAGTTTCCTACAATTTAGAAGCAATGCTGTTATG
AATAAGATTGCTTATACCCTTACCACCATTCTCAACGAGCTACAGACTTTTGAGTCTCTGATGAAAATCAAGGGACAGAAGGGAGAGGCAAATGTTGCTACTTCC
ACAAGAAAGTTCCATAGGGGTTTGACCTCTGGAACTAAGTCTATGCCTTCTTCATCTGGCAATAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCT
AACCTCGCTGCTGCTAAAACGGCCAAGAAAGCCAAAGCTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTACTTG
GCAGAAAAGAAGAAGGCTAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAAAATGATGATTCAGCCTGGATAATAGATTCAGGTGCCACT
AATCATGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGCGGCAGTTGGAGACTGGAGAGATGACGATGCGAGTTGGAACTGGGCATGTCATCTCAGCAATTGCA
GTGGGAGGGCTTCGACTTTGTTTACAGAAATCTTTTCTTTTATTAGAAAATGTATATGTTGTTCCTGATTTAAAAAGGAATTTGATTTCTGTAAAGTGCTTACTA
GAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTTGTTCTGCAAAGTTAGAAAATAATCTTTATGTGTTAAGA
TCATTAACATCTAAAGCTCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAAATTTCTCCAAAAGAAAATGCTCATCTTTGGCAC
CTAAGATTAGGGCACATAAATCTCAATAGGATTGAGAGATTAGTAAAGAATGGACTTTTAAGTGAGTTAAAAGAAAATTCTTTACCTGTATGTGAGTCATGCCTT
GAAGGTAAGATGACCAAAAGACCTTTTACTGGAAAAGGTCATAGGGCCAAAGAACCTTTAGAACTTGTACATTCAGATCTATGTGGTCCTATGAATGTTAAAGCA
AGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTGAAGCCCTTGAAAAGTTCAAGGAA
TACAAGGCTGAAGTTGAAAACGCATTAAGTAAAACTATTAAAACATTTCGATCGGATCGAGGTGGAGAGTATATGGATTTGAAATTCCAAAACTATTTGATGGAA
TGTGGAATTGTATCTCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAAAGGAGAAATCGAACCTTGTTGGACATGGTTCGGTCTATGATGAGT
TACGCTCACTTACCTAATTCGTTTTGGGGTTATGCAGTGCAAACTGCAGTCTATATTTTGAATTGTGTTCCATCTAAAAGTGTTTCTGAAACACCTTTAAAATTA
TGGAATGGTCGTAAAGGTAGTTTACGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTGCTTGAGAATAACCCTAAGAAATTGGAACCTCGTTCAAAATTATGT
TTATTTGTAGGCTACCCCAAAGGAACTAGAGGTGGTTACTTCTATGATCCTAAAGATAATAAAGTGTTTGTATCGACAAATGCTACATTTTTAGAAGAGGACCAC
ATAAGGGAGCACAAACCGCGTAGTAAGATAGTATTAAATGAACTTTCCAAAGAAACTACTGAACCTTCAACAAGAGTTGTTGAAGAGCCTAGTGCATTAACAAGA
GTTGTTCATGTCGGCTCATCTACTAGGACACATCAACCTTAA
Protein sequence
MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFVLVEECPQVPTANATRTVREPYERWAKANEKARAYILASLSEVLAMKHESMLTAREIMDSLQEMFGQA
SYQIKHDALKYIYNARMNEGASVREHVLNMMVHFNMAEMNGAVIDEASQVSFILESLSESFLQFRSNAVMNKIAYTLTTILNELQTFESLMKIKGQKGEANVATS
TRKFHRGLTSGTKSMPSSSGNKKWKKKKGGQGNKANLAAAKTAKKAKAAKGICFHCNQEGHWKRNCPKYLAEKKKAKQGKYDLLVLETCLVENDDSAWIIDSGAT
NHVCSSFQGISSWRQLETGEMTMRVGTGHVISAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKLENNLYVLR
SLTSKALLNTEMFKTAITQNKRLKISPKENAHLWHLRLGHINLNRIERLVKNGLLSELKENSLPVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGPMNVKA
RGGFEYFITFTDDYSRYGYVYLMQHKSEALEKFKEYKAEVENALSKTIKTFRSDRGGEYMDLKFQNYLMECGIVSQLSAPGTPQQNGVSERRNRTLLDMVRSMMS
YAHLPNSFWGYAVQTAVYILNCVPSKSVSETPLKLWNGRKGSLRHFRIWGCPAHVLENNPKKLEPRSKLCLFVGYPKGTRGGYFYDPKDNKVFVSTNATFLEEDH
IREHKPRSKIVLNELSKETTEPSTRVVEEPSALTRVVHVGSSTRTHQP
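
The CDS and protein sequences above are related by translation. As a minimal sketch, assuming the standard nuclear genetic code and a reading frame that starts at the first base, the helper below translates a DNA string and stops at the first stop codon. The function name `translate_cds` is hypothetical; the short fragments in the example are the first 18 and last 12 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, built in TCAG order (first, second, third base).
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(_BASES, repeat=3), _AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE.get(dna[i:i + 3], "X")  # 'X' for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGACGAGTGCTACTTTG"  # first 18 bases of the CDS above
cds_end = "CATCAACCTTAA"          # last 12 bases of the CDS above
start_peptide = translate_cds(cds_start)
end_peptide = translate_cds(cds_end)
```

Passing the full CDS (with its line breaks) to `translate_cds` should give a string that can be compared against the listed protein sequence once the protein's line breaks are removed.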