; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc07g0204871 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc07g0204871
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr07:26005068..26006909
RNA-Seq ExpressionCmc07g0204871
SyntenyCmc07g0204871
Gene Ontology termsGO:0006952 - defense response (biological process)
GO:0007165 - signal transduction (biological process)
GO:0015074 - DNA integration (biological process)
GO:0055085 - transmembrane transport (biological process)
GO:0016020 - membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043531 - ADP binding (molecular function)
GO:0140359 - ABC-type transmembrane transporter activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025314 - Domain of unknown function DUF4219
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025843.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0099.35Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEE+QSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

KAA0064094.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0099.35Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEE+QSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

TYJ95793.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0099.51Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

TYK00004.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0099.35Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEE+QSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

TYK21117.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]0.0e+0099.51Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

TrEMBL top hitse value%identityAlignment
A0A5A7VA39 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0099.35Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEE+QSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

A0A5D3BL45 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0099.35Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEE+QSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

A0A5D3DBU0 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0099.51Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

A0A5D3DMJ1 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0099.51Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

A0A5D3E2V7 Retrovirus-related Pol polyprotein from transposon TNT 1-940.0e+0099.35Show/hide
Query:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN
        MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMT+RSAYEIWN
Subjt:  MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWN

Query:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
        YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA
Subjt:  YLKSEYEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQA

Query:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
        QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ
Subjt:  QEQRRAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQ

Query:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN
        QGVEAKIAYQ EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHI DVLFVPDINQN
Subjt:  QGVEAKIAYQ-EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQN

Query:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
        LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEE+QSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF
Subjt:  LLSVGQLIEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHF

Query:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
        GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK
Subjt:  GKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDK

Query:  FCEDSGIKHQLTAP
        FCEDSGIKHQLTAP
Subjt:  FCEDSGIKHQLTAP

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.6e-3524.15Show/hide
Query:  FDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSEYEGDERIKGMR
        FDG+ Y +W  R+ A +   D+ + V+       +P+    +  KA++        AK+ +   +S +      +  +A +I   L + Y   ER     
Subjt:  FDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSEYEGDERIKGMR

Query:  VLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQEGAVEGA
         L L +     K+    S+  +     ++ +++   G+  ++   +  +L+++P  ++  I+A+E   +   +TLA + N L  QE +         +  
Subjt:  VLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQEGAVEGA

Query:  LPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQQGVEAK------IAYQ
        + A  H N  NN  K    KN+++  +      +  K      C HC ++GH    C+              H   I  N N++   + +      IA+ 
Subjt:  LPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQQGVEAK------IAYQ

Query:  EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKD----LKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQL
         +E +   V         N  +++DSG ++H+ +D+ L+ D    + P  I   +   G++I    +G + + +      + DVLF  +   NL+SV +L
Subjt:  EEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKD----LKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQL

Query:  IEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLE-EEQSVFALKEDETQLWHKRVGHYHHQGLLQL-------TELALDFPKLSEEISSCKACH
         E G  + F+     I   +   +  VK  G   ++  +  +  S+ A  ++  +LWH+R GH     LL++        +  L+  +LS EI  C+ C 
Subjt:  IEKGFKVTFENEYCLIKDAANQDIFKVKMKGKSFSLNPLE-EEQSVFALKEDETQLWHKRVGHYHHQGLLQL-------TELALDFPKLSEEISSCKACH

Query:  FGKQNRKSFP--KSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAE
         GKQ R  F   K      + L ++H+DV GP    +L    Y++ F+D FT  C  + +K+KS+V  +F  F A+ E     K+  +  DNG+EY+S E
Subjt:  FGKQNRKSFP--KSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAE

Query:  FDKFCEDSGIKHQLTAP
          +FC   GI + LT P
Subjt:  FDKFCEDSGIKHQLTAP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-4726.66Show/hide
Query:  FDGDN-YQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSEYEGDERIKGM
        F+GDN +  W  RM   +    + + ++ D + P        A +           +A + +   +S  +   I+   +A  IW  L+S Y         
Subjt:  FDGDN-YQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSEYEGDERIKGM

Query:  RVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQEGAVEG
          L L ++     M E  +   +      +  Q+  LG   ++      +L S+P  ++   + + + K  T I L ++ +AL   E+ R          
Subjt:  RVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQEGAVEG

Query:  ALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVK-------KGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHE-----AVICRNNNQQQGV
          P    + +    + + ++      R S+ Y ++G +       K     C +CN+ GH    C   PN +  K    G +     A + +NN+     
Subjt:  ALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVK-------KGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHE-----AVICRNNNQQQGV

Query:  EAKIAYQEEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKG-TKHIHDVLFVPDINQNLLS
           + +  EEE+ + ++    G ES   W++D+  ++H T  ++LF      +   V++GN  Y  + G G I I +  G T  + DV  VPD+  NL+S
Subjt:  EAKIAYQEEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKG-TKHIHDVLFVPDINQNLLS

Query:  VGQLIEKGFKVTFENE--------YCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSC
           L   G++  F N+          + K  A   +++   +     LN  ++E SV         LWHKR+GH   +GL  L + +L        +  C
Subjt:  VGQLIEKGFKVTFENE--------YCLIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSC

Query:  KACHFGKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVS
          C FGKQ+R SF  SS R    L L+++DV GP    S+ G+ Y++ FIDD +R  W++ LK K +V  VF KF A VE E+G K++ +RSDNG EY S
Subjt:  KACHFGKQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVS

Query:  AEFDKFCEDSGIKHQLTAP
         EF+++C   GI+H+ T P
Subjt:  AEFDKFCEDSGIKHQLTAP

P25601 Putative transposon Ty5-1 protein YCL075W1.6e-0833Show/hide
Query:  EAKIAYQEEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSV
        + + A Q   +  L + +  V    +  W+ D+GCT+HM HD+ +F     ++      G G  I + G GT+ I    GT  +HDV +VPD+  NL+SV
Subjt:  EAKIAYQEEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.6e-3124.72Show/hide
Query:  NYQVWAVRMEAYMEALDIWEAVEEDYEI-PALPDNPTMAQIKAQKEKKTKKSK-AKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSEYEGDERIKGMRVL
        NY +W+ ++ A  +  ++   ++    + PA        ++     +  ++ K   + +  A+S ++   +    +A +IW  L+  Y         ++ 
Subjt:  NYQVWAVRMEAYMEALDIWEAVEEDYEI-PALPDNPTMAQIKAQKEKKTKKSK-AKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSEYEGDERIKGMRVL

Query:  NLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQEGAV--EGA
          ++++     K T++I +Y   L+   +Q+ LLG        VE++L ++PE+++  I  +   KD T  TL EI   L   E +        V    A
Subjt:  NLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQEGAV--EGA

Query:  LPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVIC-------RNNNQQQGVEAKIAY
            H      N      + N+   R ++  +K         P    +   HP     +    KC  C   GH A  C        + N QQ       +
Subjt:  LPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVIC-------RNNNQQQGVEAKIAY

Query:  QEEEEDQLFVATCFVGGE-SNESWLIDSGCTNHMTHDKELFKDLKP-TNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQLI
        Q         A   +G   S+ +WL+DSG T+H+T D       +P T    V + +G  I +   G+ ++++     ++H++L+VP+I++NL+SV +L 
Subjt:  QEEEEDQLFVATCFVGGE-SNESWLIDSGCTNHMTHDKELFKDLKP-TNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQLI

Query:  E-KGFKVTFENEYCLIKDA-ANQDIFKVKMKGKSFSLNPLEEEQ--SVFALKEDET--QLWHKRVGHYHHQGLLQ-LTELALDFPKLSEEISSCKACHFG
           G  V F      +KD      + + K K + +   P+   Q  S+FA    +     WH R+GH     L   ++  +L     S +  SC  C   
Subjt:  E-KGFKVTFENEYCLIKDA-ANQDIFKVKMKGKSFSLNPLEEEQ--SVFALKEDET--QLWHKRVGHYHHQGLLQ-LTELALDFPKLSEEISSCKACHFG

Query:  KQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLK--GSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFD
        K N+  F +S+  +T+ L+ I++DV     +P L      YY+ F+D FTR  W++ LK KS+V   F  FK  +EN    +I    SDNG E+V+    
Subjt:  KQNRKSFPKSSWRATQKLQLIHTDVAGPQRTPSLK--GSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFD

Query:  KFCEDSGIKHQLTAP
        ++    GI H LT+P
Subjt:  KFCEDSGIKHQLTAP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE25.2e-2826.72Show/hide
Query:  VRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQR-RAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQ
        +R +   +Q+ LLG        VE++L ++P+ ++  I  +   KD T  +L EI   L  +E +  A+     V        H N   N       +NQ
Subjt:  VRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQR-RAMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQ

Query:  ISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPN---AKCTKCNQMGHEAVIC-------RNNNQQQGVEAKIAYQEEEEDQLFVATCFVGGES
         +  ++  YN    +  S+ P S  ++  +      R+P     +C  C+  GH A  C          NQQQ       +Q         A   V    
Subjt:  ISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPN---AKCTKCNQMGHEAVIC-------RNNNQQQGVEAKIAYQEEEEDQLFVATCFVGGES

Query:  N-ESWLIDSGCTNHMTHDKELFKDLKP-TNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQLIEKG-FKVTFENEYCLIKDA
        N  +WL+DSG T+H+T D       +P T    V I +G  I +   G+ ++ +   +  ++ VL+VP+I++NL+SV +L       V F      +KD 
Subjt:  N-ESWLIDSGCTNHMTHDKELFKDLKP-TNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQLIEKG-FKVTFENEYCLIKDA

Query:  -ANQDIFKVKMKGKSFSLNPLEEEQ--SVFA--LKEDETQLWHKRVGHYHHQGLLQ-LTELALDFPKLSEEISSCKACHFGKQNRKSFPKSSWRATQKLQ
             + + K K + +   P+   Q  S+FA    +     WH R+GH     L   ++  +L     S ++ SC  C   K ++  F  S+  +++ L+
Subjt:  -ANQDIFKVKMKGKSFSLNPLEEEQ--SVFA--LKEDETQLWHKRVGHYHHQGLLQ-LTELALDFPKLSEEISSCKACHFGKQNRKSFPKSSWRATQKLQ

Query:  LIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDKFCEDSGIKHQLTAP
         I++DV       S+    YY+ F+D FTR  W++ LK KS+V   F  FK+ VEN    +I  + SDNG E+V      +    GI H  + P
Subjt:  LIHTDVAGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDKFCEDSGIKHQLTAP

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein5.1e-1034.29Show/hide
Query:  VEAKIAYQEEEEDQLFVATCFVGGESNES-WLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGD-----YISVKGKGTIAIASCKGTKHIHDVLFVPDI
        V AK  +    E    VA CF     +E+ WLI S  +NHMT   + F  L  +   KV+  +GD        V+G G +   + +G K I +VL+VP I
Subjt:  VEAKIAYQEEEEDQLFVATCFVGGESNES-WLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGD-----YISVKGKGTIAIASCKGTKHIHDVLFVPDI

Query:  NQNLLSVGQLIEKGFKVTFENEY-CLIKDAANQDIFKVKM
          N LSV QL   GF+V+ E    C + D     +F   M
Subjt:  NQNLLSVGQLIEKGFKVTFENEY-CLIKDAANQDIFKVKM

AT3G21000.1 Gag-Pol-related retrotransposon family protein1.3e-1019.9Show/hide
Query:  LIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPD-NPTMA------QIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLK--SE
        ++ D  +Y++WA   ++ +    +W+ V     +P  P  NP +A      ++   ++   K +KA   L ++++ ++F + ++  SA ++W+ L+  +E
Subjt:  LIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPD-NPTMA------QIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLK--SE

Query:  YEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRR
             R++ + +  L ++ E  KM + ES   Y  + L+I  ++        D  I + +  ++   F+   S LE   D+ ++T   ++          
Subjt:  YEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRR

Query:  AMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRES--STYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQQGV
                                   +++ ++ ST E+         +K  S   C  C K  H    C  R                    + +++  
Subjt:  AMRQEGAVEGALPAKHHENVRNNKKKKFFKKNQISTRES--STYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQQGV

Query:  EAKIAYQEEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKH-IHDVLFVPDINQNLLS
        E  + Y+ E    L   T       ++ W+I      +MT   + F  L  T    V   +G  + V+GKG + I   +G K  I +V+FVP +N+N+LS
Subjt:  EAKIAYQEEEEDQLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKH-IHDVLFVPDINQNLLS

Query:  VGQLIEKGFKVT
         G+++ K + ++
Subjt:  VGQLIEKGFKVT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGCCATAACAAGTTCTAGTTTCACTTCAATTTCTCCATTGATATTTGATGGAGACAACTACCAAGTTTGGGCAGTTCGTATGGAAGCTTATATGGAAGCT
TTGGATATTTGGGAAGCAGTGGAAGAGGATTATGAAATTCCTGCTCTTCCAGACAATCCTACCATGGCACAAATCAAAGCGCAAAAGGAGAAGAAGACAAAGAAA
TCCAAGGCAAAGGCATGTTTGTTTGCTGCAGTCTCATCTACTATCTTCACTAGAATTATGACTATGAGATCAGCATATGAGATATGGAATTATCTCAAGTCAGAA
TATGAGGGAGATGAAAGAATCAAAGGAATGCGTGTGCTAAACTTGATTCGAGAATTCGAGTTGCAGAAGATGAAGGAAACAGAATCAATTAAAGAGTATTCTGTT
AGGTTATTGGACATTGCAAACCAAATCAGATTACTTGGTTCTGTGTTCAAGGACTCAAGAATTGTAGAAAAAATTCTTGTATCAGTGCCTGAAAAATTTGAGGCA
TCTATTTCTGCCTTAGAAAATACCAAAGATTTGACTCAAATCACTCTTGCAGAGATACTCAATGCTTTGCAAGCGCAGGAACAAAGAAGAGCCATGAGGCAAGAA
GGTGCTGTTGAAGGAGCCTTGCCAGCGAAGCATCATGAGAACGTTAGGAACAATAAGAAGAAGAAGTTTTTCAAGAAGAATCAAATTTCTACTAGAGAATCTTCA
ACTTACAACAAAGCAGGAGTCAAGAAGGGATCCTATCCTCCCTGTTCACATTGCAATAAACAGGGTCATCCTCCTTTTAAATGCTGGAGGAGACCAAACGCCAAG
TGCACCAAATGTAACCAAATGGGGCATGAAGCTGTAATTTGCAGGAACAATAATCAACAACAAGGTGTAGAGGCCAAAATTGCTTATCAGGAGGAGGAGGAGGAT
CAATTGTTTGTGGCAACATGCTTCGTGGGTGGTGAGTCAAATGAAAGCTGGCTGATTGACAGTGGATGTACCAACCATATGACTCATGACAAGGAGTTGTTTAAA
GATCTGAAGCCAACAAATATCACCAAAGTCAGAATTGGCAATGGAGACTACATCTCAGTCAAAGGAAAGGGGACTATTGCGATTGCCAGTTGTAAAGGTACAAAA
CATATACATGATGTTCTTTTTGTACCTGACATTAACCAAAATTTGTTGAGTGTGGGTCAACTTATTGAAAAAGGTTTTAAAGTTACTTTTGAAAATGAATATTGC
TTGATTAAGGATGCAGCAAATCAAGATATTTTCAAAGTAAAAATGAAAGGAAAAAGTTTTTCACTTAATCCTTTAGAGGAGGAGCAATCTGTTTTTGCTCTCAAA
GAAGATGAAACACAGCTTTGGCACAAAAGAGTCGGTCACTATCATCATCAAGGGCTGCTACAACTAACGGAGTTGGCACTTGATTTTCCCAAGCTTAGTGAAGAA
ATCTCAAGTTGCAAAGCGTGTCATTTTGGAAAGCAAAATAGGAAGTCATTTCCCAAGTCATCTTGGAGAGCCACTCAAAAATTGCAGCTCATTCACACAGATGTT
GCAGGTCCTCAGAGAACACCTTCTTTAAAAGGCAGTCTCTATTATATTGCTTTTATTGATGACTTCACCAGAATGTGCTGGATTTTTTTCTTGAAGTTTAAATCA
GAGGTTGCTCATGTCTTTTGGAAGTTCAAGGCAAGAGTTGAAAATGAAAGTGGTTGCAAAATTCAAATGGTAAGGTCTGACAATGGGAAGGAGTATGTTTCGGCA
GAATTTGACAAGTTTTGTGAAGATTCAGGCATAAAACATCAACTTACAGCTCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGCCATAACAAGTTCTAGTTTCACTTCAATTTCTCCATTGATATTTGATGGAGACAACTACCAAGTTTGGGCAGTTCGTATGGAAGCTTATATGGAAGCT
TTGGATATTTGGGAAGCAGTGGAAGAGGATTATGAAATTCCTGCTCTTCCAGACAATCCTACCATGGCACAAATCAAAGCGCAAAAGGAGAAGAAGACAAAGAAA
TCCAAGGCAAAGGCATGTTTGTTTGCTGCAGTCTCATCTACTATCTTCACTAGAATTATGACTATGAGATCAGCATATGAGATATGGAATTATCTCAAGTCAGAA
TATGAGGGAGATGAAAGAATCAAAGGAATGCGTGTGCTAAACTTGATTCGAGAATTCGAGTTGCAGAAGATGAAGGAAACAGAATCAATTAAAGAGTATTCTGTT
AGGTTATTGGACATTGCAAACCAAATCAGATTACTTGGTTCTGTGTTCAAGGACTCAAGAATTGTAGAAAAAATTCTTGTATCAGTGCCTGAAAAATTTGAGGCA
TCTATTTCTGCCTTAGAAAATACCAAAGATTTGACTCAAATCACTCTTGCAGAGATACTCAATGCTTTGCAAGCGCAGGAACAAAGAAGAGCCATGAGGCAAGAA
GGTGCTGTTGAAGGAGCCTTGCCAGCGAAGCATCATGAGAACGTTAGGAACAATAAGAAGAAGAAGTTTTTCAAGAAGAATCAAATTTCTACTAGAGAATCTTCA
ACTTACAACAAAGCAGGAGTCAAGAAGGGATCCTATCCTCCCTGTTCACATTGCAATAAACAGGGTCATCCTCCTTTTAAATGCTGGAGGAGACCAAACGCCAAG
TGCACCAAATGTAACCAAATGGGGCATGAAGCTGTAATTTGCAGGAACAATAATCAACAACAAGGTGTAGAGGCCAAAATTGCTTATCAGGAGGAGGAGGAGGAT
CAATTGTTTGTGGCAACATGCTTCGTGGGTGGTGAGTCAAATGAAAGCTGGCTGATTGACAGTGGATGTACCAACCATATGACTCATGACAAGGAGTTGTTTAAA
GATCTGAAGCCAACAAATATCACCAAAGTCAGAATTGGCAATGGAGACTACATCTCAGTCAAAGGAAAGGGGACTATTGCGATTGCCAGTTGTAAAGGTACAAAA
CATATACATGATGTTCTTTTTGTACCTGACATTAACCAAAATTTGTTGAGTGTGGGTCAACTTATTGAAAAAGGTTTTAAAGTTACTTTTGAAAATGAATATTGC
TTGATTAAGGATGCAGCAAATCAAGATATTTTCAAAGTAAAAATGAAAGGAAAAAGTTTTTCACTTAATCCTTTAGAGGAGGAGCAATCTGTTTTTGCTCTCAAA
GAAGATGAAACACAGCTTTGGCACAAAAGAGTCGGTCACTATCATCATCAAGGGCTGCTACAACTAACGGAGTTGGCACTTGATTTTCCCAAGCTTAGTGAAGAA
ATCTCAAGTTGCAAAGCGTGTCATTTTGGAAAGCAAAATAGGAAGTCATTTCCCAAGTCATCTTGGAGAGCCACTCAAAAATTGCAGCTCATTCACACAGATGTT
GCAGGTCCTCAGAGAACACCTTCTTTAAAAGGCAGTCTCTATTATATTGCTTTTATTGATGACTTCACCAGAATGTGCTGGATTTTTTTCTTGAAGTTTAAATCA
GAGGTTGCTCATGTCTTTTGGAAGTTCAAGGCAAGAGTTGAAAATGAAAGTGGTTGCAAAATTCAAATGGTAAGGTCTGACAATGGGAAGGAGTATGTTTCGGCA
GAATTTGACAAGTTTTGTGAAGATTCAGGCATAAAACATCAACTTACAGCTCCTTAA
Protein sequenceShow/hide protein sequence
MDAITSSSFTSISPLIFDGDNYQVWAVRMEAYMEALDIWEAVEEDYEIPALPDNPTMAQIKAQKEKKTKKSKAKACLFAAVSSTIFTRIMTMRSAYEIWNYLKSE
YEGDERIKGMRVLNLIREFELQKMKETESIKEYSVRLLDIANQIRLLGSVFKDSRIVEKILVSVPEKFEASISALENTKDLTQITLAEILNALQAQEQRRAMRQE
GAVEGALPAKHHENVRNNKKKKFFKKNQISTRESSTYNKAGVKKGSYPPCSHCNKQGHPPFKCWRRPNAKCTKCNQMGHEAVICRNNNQQQGVEAKIAYQEEEED
QLFVATCFVGGESNESWLIDSGCTNHMTHDKELFKDLKPTNITKVRIGNGDYISVKGKGTIAIASCKGTKHIHDVLFVPDINQNLLSVGQLIEKGFKVTFENEYC
LIKDAANQDIFKVKMKGKSFSLNPLEEEQSVFALKEDETQLWHKRVGHYHHQGLLQLTELALDFPKLSEEISSCKACHFGKQNRKSFPKSSWRATQKLQLIHTDV
AGPQRTPSLKGSLYYIAFIDDFTRMCWIFFLKFKSEVAHVFWKFKARVENESGCKIQMVRSDNGKEYVSAEFDKFCEDSGIKHQLTAP