; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0129401 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0129401
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr05:7328977..7330185
RNA-Seq ExpressionCmc05g0129401
SyntenyCmc05g0129401
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031826.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

KAA0062993.1 gag/pol protein [Cucumis melo var. makuwa]3.5e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.7e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

A0A5A7TWB9 Gag/pol protein1.7e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

A0A5A7TZD7 Gag/pol protein1.7e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

A0A5A7UGV2 Gag/pol protein1.7e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

A0A5A7V4M1 Gag/pol protein1.7e-21997.51Show/hide
Query:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR
        MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSG+KKWKKKKGGQGNKANLAAAKTTKKAK AKGICFHCNQEGHWKR
Subjt:  MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKR

Query:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
        NCPKYLAE KKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGT HVVSAIAVGGLRLCLQKSFLLLENVYVVPDL
Subjt:  NCPKYLAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDL

Query:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL
        KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAK ENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKEN HLWHLRLG INLNRIERLVKNGL
Subjt:  KRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGL

Query:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK
        LSELEENSL VCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCG MNVKARGGFEYFITFTDDYSRYGYVYLMQHKS+ALEKFKEYKAEVENALSKTIK
Subjt:  LSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIK

Query:  TF
        TF
Subjt:  TF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-2026.57Show/hide
Query:  KANLAAAKTTKKAKVAKG------ICFHCNQEGHWKRNCPKY--------LAETKKAKQGKYDLLVLETCLVEN----DDSAWIIDSGATNHVCSSFQGI
        K NL   + TK  K+ KG       C HC +EGH K++C  Y            K+ +      +      V N    D+  +++DSGA++H+ +     
Subjt:  KANLAAAKTTKKAKVAKG------ICFHCNQEGHWKRNCPKY--------LAETKKAKQGKYDLLVLETCLVEN----DDSAWIIDSGATNHVCSSFQGI

Query:  SSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEIC-SAKFENNLYVLRSLTS
        +   ++              + A   G +RL      + LE+V    +   NL+SVK L E   S+ F+ + V I KNG+ +  ++   NN+ V+     
Subjt:  SSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEIC-SAKFENNLYVLRSLTS

Query:  KALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELE-----ENSLLVCESCLEGKMTKRPF---TGKGHRAKEPLELVHSD
                      Q   +    K N  LWH R G I+  ++  + +  + S+       E S  +CE CL GK  + PF     K H  K PL +VHSD
Subjt:  KALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELE-----ENSLLVCESCLEGKMTKRPF---TGKGHRAKEPLELVHSD

Query:  LCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVE
        +CG +         YF+ F D ++ Y   YL+++KS     F+++ A+ E
Subjt:  LCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.7e-2525.73Show/hide
Query:  TTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRNCP---KY
        TT+L+   T E        K   +      K  +   +  +++ +    + +++     G       +K   K++V    C++CNQ GH+KR+CP   K 
Subjt:  TTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRNCP---KY

Query:  LAETKKAKQG--------KYDLLVL-----ETCL-VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSF---
          ET   K            D +VL     E C+ +   +S W++D+ A++H          +   + G  T+++G         +G   +C++ +    
Subjt:  LAETKKAKQG--------KYDLLVL-----ETCL-VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSF---

Query:  LLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDIN
        L+L++V  VPDL+ NLIS   L    Y   F   K  + K  + I        LY           N E     I Q +      + +V LWH R+G ++
Subjt:  LLLENVYVVPDLKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDIN

Query:  LNRIERLVKNGLLSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYK
           ++ L K  L+S  +  ++  C+ CL GK  +  F     R    L+LV+SD+CG M +++ GG +YF+TF DD SR  +VY+++ K +  + F+++ 
Subjt:  LNRIERLVKNGLLSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYK

Query:  AEVENALSKTIK
        A VE    + +K
Subjt:  AEVENALSKTIK

Q12491 Transposon Ty2-B Gag-Pol polyprotein9.7e-0721.01Show/hide
Query:  YTLTTLLNELQTFESLMKIKGQKGEANV-ATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRN--C
        Y   T +   Q F  +  I  +    N+   S  K H    + +++ P+++ +K   +      ++ N +  +  K   +A    F      H   +   
Subjt:  YTLTTLLNELQTFESLMKIKGQKGEANV-ATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRN--C

Query:  PKYLAETKKAKQGKYDLLVLETCLVENDDSA---WIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPD
         +YL++  +   G+       T  ++++D      +IDSGA+  +  S   +         E+ +    +  +   A+G L    Q            P+
Subjt:  PKYLAETKKAKQGKYDLLVLETCLVENDDSA---WIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPD

Query:  LKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNG
        +  +L+S+  L  Q+ +  F  N      +G  +       + Y    L+ K L+ + + K  I    + K   K    L H  LG  N   I++ +K  
Subjt:  LKRNLISVKCLLEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNG

Query:  LLSELEEN-------SLLVCESCLEGKMTKRPFTGKGHRAK-----EPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLM--QHKSKALEKFK
         ++ L+E+       S   C  CL GK TK     KG R K     EP + +H+D+ G ++   +    YFI+FTD+ +R+ +VY +  + +   L  F 
Subjt:  LLSELEEN-------SLLVCESCLEGKMTKRPFTGKGHRAK-----EPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLM--QHKSKALEKFK

Query:  EYKAEVENALSKTI
           A ++N  +  +
Subjt:  EYKAEVENALSKTI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE15.1e-1623.5Show/hide
Query:  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNK----ANLAAAKTTKKAKVAKGICFHCNQEGHWKRNC
        TLT +   L   ES  KI        +  +       +T+ T +  + + + ++  +     +K    ++        ++K   G C  C  +GH  + C
Subjt:  TLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNK----ANLAAAKTTKKAKVAKGICFHCNQEGHWKRNC

Query:  P--KYLAETKKAKQGKYDL--------LVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLE
           ++   +  ++Q             L L +    N+   W++DSGAT+H+ S F  + S  Q  TG   + V     +     G   L  +   L L 
Subjt:  P--KYLAETKKAKQGKYDL--------LVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLE

Query:  NVYVVPDLKRNLISVKCLLE------QSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGD
        N+  VP++ +NLISV  L        + +  +F V  +     GV +   K ++ LY     +S+ +    +F +          S K     WH RLG 
Subjt:  NVYVVPDLKRNLISVKCLLE------QSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGD

Query:  INLNRIERLVKNGLLSELE-ENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFK
           + +  ++ N  LS L   +  L C  CL  K  K PF+     +  PLE ++SD+     + +   + Y++ F D ++RY ++Y ++ KS+  E F 
Subjt:  INLNRIERLVKNGLLSELE-ENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFK

Query:  EYKAEVENALSKTIKTF
         +K  +EN     I TF
Subjt:  EYKAEVENALSKTIKTF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.0e-1924.87Show/hide
Query:  RKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRNCP---KYLAETKKAK--------QGKYDLLVLETCL
        R ++  +       PSSSGS+                     ++ K   G C  C+ +GH  + CP   ++ + T + +        Q + +L V     
Subjt:  RKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRNCP---KYLAETKKAK--------QGKYDLLVLETCL

Query:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLE------QSYSLTFN
          N    W++DSGAT+H+ S F  + S+ Q  TG   + +     +     G   L      L L  V  VP++ +NLISV  L        + +  +F 
Subjt:  VENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCLLE------QSYSLTFN

Query:  VNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELE-ENSLLVCESCLEGK
        V  +     GV +   K ++ LY     +S+A+    MF +  +         K     WH RLG  +L  +  ++ N  L  L   + LL C  C   K
Subjt:  VNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELE-ENSLLVCESCLEGK

Query:  MTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIKT
          K PF+     + +PLE ++SD+     + +   + Y++ F D ++RY ++Y ++ KS+  + F  +K+ VEN     I T
Subjt:  MTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIKT

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.5e-1038.37Show/hide
Query:  TQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNV
        T    L  + K+   LWH RL  ++   +E LVK G L   + +SL  CE C+ GK  +  F+   H  K PL+ VHSDL G  +V
Subjt:  TQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELEENSLLVCESCLEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAAGATTGCTTATACCCTTACCACCCTTCTCAACGAGCTACAGACTTTCGAGTCTCTGATGAAAATCAAGGGACAGAAGGGAGAGGCAAATGTTGCTACT
TCCACAAGAAAGTTCCATAGGGGTTCGACCTCTGGAACTAAGTCTATGCCTTCTTCATCTGGCAGTAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAA
GCTAACCTCGCTGCTGCTAAAACGACCAAGAAAGCCAAAGTTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTAT
TTGGCAGAAACGAAGAAGGCCAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAAAATGATGATTCAGCCTGGATAATAGATTCAGGTGCC
ACTAATCATGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGCGACAGTTGGAGACTGGAGAGATGACGATGCGAGTTGGAACTAGGCATGTCGTCTCAGCAATT
GCAGTGGGAGGGCTTCGACTTTGTTTACAGAAATCTTTTCTTTTATTAGAAAATGTATATGTTGTTCCAGATTTAAAAAGGAACTTGATTTCTGTAAAGTGCTTA
CTAGAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTTGTTCTGCAAAGTTTGAAAATAATCTTTATGTGTTA
AGATCATTAACATCTAAAGCCCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAAATTTCTCCAAAAGAAAATGTACATCTTTGG
CACCTAAGATTAGGGGACATAAATCTCAATAGGATTGAGAGGTTAGTAAAGAATGGACTTCTAAGTGAGTTAGAAGAAAATTCTTTACTTGTATGTGAGTCATGC
CTTGAAGGTAAGATGACCAAAAGACCTTTTACCGGAAAAGGTCATAGGGCCAAAGAACCTCTAGAACTTGTACATTCAGATCTATGTGGTCTTATGAATGTTAAA
GCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTAAAGCCCTTGAAAAGTTCAAG
GAATACAAGGCTGAAGTTGAAAACGCATTAAGTAAAACTATTAAAACATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAAGATTGCTTATACCCTTACCACCCTTCTCAACGAGCTACAGACTTTCGAGTCTCTGATGAAAATCAAGGGACAGAAGGGAGAGGCAAATGTTGCTACT
TCCACAAGAAAGTTCCATAGGGGTTCGACCTCTGGAACTAAGTCTATGCCTTCTTCATCTGGCAGTAAGAAGTGGAAGAAGAAGAAGGGTGGCCAAGGAAATAAA
GCTAACCTCGCTGCTGCTAAAACGACCAAGAAAGCCAAAGTTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTGGAAGAGAAACTGTCCCAAGTAT
TTGGCAGAAACGAAGAAGGCCAAACAAGGTAAATATGATTTACTAGTGCTAGAGACTTGTTTAGTGGAAAATGATGATTCAGCCTGGATAATAGATTCAGGTGCC
ACTAATCATGTTTGTTCTTCATTTCAGGGAATTAGTTCCTGGCGACAGTTGGAGACTGGAGAGATGACGATGCGAGTTGGAACTAGGCATGTCGTCTCAGCAATT
GCAGTGGGAGGGCTTCGACTTTGTTTACAGAAATCTTTTCTTTTATTAGAAAATGTATATGTTGTTCCAGATTTAAAAAGGAACTTGATTTCTGTAAAGTGCTTA
CTAGAACAATCTTACTCGTTAACTTTTAATGTAAATAAAGTGTTTATTTACAAAAATGGTGTTGAGATTTGTTCTGCAAAGTTTGAAAATAATCTTTATGTGTTA
AGATCATTAACATCTAAAGCCCTTCTTAATACTGAAATGTTCAAAACTGCAATAACTCAAAATAAAAGACTTAAAATTTCTCCAAAAGAAAATGTACATCTTTGG
CACCTAAGATTAGGGGACATAAATCTCAATAGGATTGAGAGGTTAGTAAAGAATGGACTTCTAAGTGAGTTAGAAGAAAATTCTTTACTTGTATGTGAGTCATGC
CTTGAAGGTAAGATGACCAAAAGACCTTTTACCGGAAAAGGTCATAGGGCCAAAGAACCTCTAGAACTTGTACATTCAGATCTATGTGGTCTTATGAATGTTAAA
GCAAGAGGAGGATTTGAATATTTCATCACTTTTACTGATGATTATTCAAGATATGGGTATGTTTATTTAATGCAACATAAGTCTAAAGCCCTTGAAAAGTTCAAG
GAATACAAGGCTGAAGTTGAAAACGCATTAAGTAAAACTATTAAAACATTTTGA
Protein sequenceShow/hide protein sequence
MNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFHRGSTSGTKSMPSSSGSKKWKKKKGGQGNKANLAAAKTTKKAKVAKGICFHCNQEGHWKRNCPKY
LAETKKAKQGKYDLLVLETCLVENDDSAWIIDSGATNHVCSSFQGISSWRQLETGEMTMRVGTRHVVSAIAVGGLRLCLQKSFLLLENVYVVPDLKRNLISVKCL
LEQSYSLTFNVNKVFIYKNGVEICSAKFENNLYVLRSLTSKALLNTEMFKTAITQNKRLKISPKENVHLWHLRLGDINLNRIERLVKNGLLSELEENSLLVCESC
LEGKMTKRPFTGKGHRAKEPLELVHSDLCGLMNVKARGGFEYFITFTDDYSRYGYVYLMQHKSKALEKFKEYKAEVENALSKTIKTF