; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0256251 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0256251
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCMiso1.1chr09:21129193..21131928
RNA-Seq ExpressionCmc09g0256251
SyntenyCmc09g0256251
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043826.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0098.31Show/hide
Query:  MDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKE
        MDQSKSLEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKE
Subjt:  MDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKE

Query:  RSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQ
        RSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEIT+GYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQ
Subjt:  RSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQ

Query:  KVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAI
        KVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNV YVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKL+GTLRHGLYVLEGTTVSGSAAI
Subjt:  KVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAI

Query:  ASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRK
        ASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKH TKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRK
Subjt:  ASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRK

Query:  VWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
        VWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
Subjt:  VWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA

Query:  AQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVK
        AQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKC+FIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVK
Subjt:  AQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVK

Query:  EQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAY
        EQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAY
Subjt:  EQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAY

Query:  ALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKP
        ALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKP
Subjt:  ALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKP

KAA0047995.1 retrotransposon protein, putative, Ty1-copia sub-class [Cucumis melo var. makuwa]0.0e+0097.98Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLP+NI ESEKRDMDEMAY TILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSK LEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYG DSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
        SEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDG DSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
Subjt:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT

Query:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
        PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
Subjt:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE

Query:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
        GTTVSGSAAIASGKVT+MSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDY+HSDLWGPTKEVSMGGSRYF
Subjt:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF

Query:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
        SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
Subjt:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF

Query:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
        NETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDE AFIEESSSNNDLQNYQLTRDRVQRERHAPI
Subjt:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI

Query:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLV
        RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARL+
Subjt:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLV

KAA0050719.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0097.57Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        MASTRFEVSKFNGHGDF+LWRKKIRAILVQHKVAKILDEERLP+NI ESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
        SEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
Subjt:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT

Query:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
        PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
Subjt:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE

Query:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
        GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
Subjt:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF

Query:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
        SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
Subjt:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF

Query:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
        NETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
Subjt:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI

Query:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR
        RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVA+   ++ 
Subjt:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR

Query:  ELIF
         + F
Subjt:  ELIF

TYK13826.1 putative polyprotein [Cucumis melo var. makuwa]0.0e+0097.68Show/hide
Query:  MSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAET
        MSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAET
Subjt:  MSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAET

Query:  GYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRT
        GYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG T
Subjt:  GYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRT

Query:  IKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKG
        IKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKG
Subjt:  IKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKG

Query:  KHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHF
        KHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHF
Subjt:  KHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHF

Query:  TVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMF
        TVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMF
Subjt:  TVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMF

Query:  IGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEG
        IGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEG
Subjt:  IGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEG

Query:  AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSK
        AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSK
Subjt:  AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSK

Query:  WIYKIKPGTGGNSKPRYKARLVARATLRRRELIF
        WIYKIKPGTGGNSKPRYKARLVA+   ++  + F
Subjt:  WIYKIKPGTGGNSKPRYKARLVARATLRRRELIF

TYK25306.1 putative gag-pol polyprotein [Cucumis melo var. makuwa]0.0e+0097.79Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLP+NI ESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
        SEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
Subjt:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT

Query:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
        PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
Subjt:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE

Query:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
        GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
Subjt:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF

Query:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
        SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
Subjt:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF

Query:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
        NETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
Subjt:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI

Query:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR
        RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVA+   ++ 
Subjt:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR

Query:  ELIF
         + F
Subjt:  ELIF

TrEMBL top hitse value%identityAlignment
A0A5A7TP18 Putative gag-pol polyprotein0.0e+0098.31Show/hide
Query:  MDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKE
        MDQSKSLEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKE
Subjt:  MDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKE

Query:  RSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQ
        RSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEIT+GYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQ
Subjt:  RSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQ

Query:  KVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAI
        KVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNV YVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKL+GTLRHGLYVLEGTTVSGSAAI
Subjt:  KVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAI

Query:  ASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRK
        ASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKH TKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRK
Subjt:  ASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRK

Query:  VWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
        VWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA
Subjt:  VWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEA

Query:  AQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVK
        AQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKC+FIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVK
Subjt:  AQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVK

Query:  EQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAY
        EQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAY
Subjt:  EQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAY

Query:  ALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKP
        ALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKP
Subjt:  ALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKP

A0A5A7U2U7 Retrotransposon protein, putative, Ty1-copia sub-class0.0e+0097.98Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLP+NI ESEKRDMDEMAY TILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSK LEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYG DSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
        SEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDG DSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
Subjt:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT

Query:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
        PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
Subjt:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE

Query:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
        GTTVSGSAAIASGKVT+MSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDY+HSDLWGPTKEVSMGGSRYF
Subjt:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF

Query:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
        SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
Subjt:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF

Query:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
        NETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDE AFIEESSSNNDLQNYQLTRDRVQRERHAPI
Subjt:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI

Query:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLV
        RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARL+
Subjt:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLV

A0A5A7UB25 Putative gag-pol polyprotein0.0e+0097.57Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        MASTRFEVSKFNGHGDF+LWRKKIRAILVQHKVAKILDEERLP+NI ESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSL NKI
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
        SEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
Subjt:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT

Query:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
        PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
Subjt:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE

Query:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
        GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
Subjt:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF

Query:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
        SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
Subjt:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF

Query:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
        NETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
Subjt:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI

Query:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR
        RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVA+   ++ 
Subjt:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR

Query:  ELIF
         + F
Subjt:  ELIF

A0A5D3CTV2 Putative polyprotein0.0e+0097.68Show/hide
Query:  MSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAET
        MSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAET
Subjt:  MSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAET

Query:  GYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRT
        GYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG T
Subjt:  GYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRT

Query:  IKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKG
        IKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKG
Subjt:  IKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKG

Query:  KHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHF
        KHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHF
Subjt:  KHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHF

Query:  TVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMF
        TVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMF
Subjt:  TVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMF

Query:  IGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEG
        IGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEG
Subjt:  IGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEG

Query:  AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSK
        AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSK
Subjt:  AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSK

Query:  WIYKIKPGTGGNSKPRYKARLVARATLRRRELIF
        WIYKIKPGTGGNSKPRYKARLVA+   ++  + F
Subjt:  WIYKIKPGTGGNSKPRYKARLVARATLRRRELIF

A0A5D3DNU1 Putative gag-pol polyprotein0.0e+0097.79Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLP+NI ESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
        YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLN IGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGR

Query:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
        SEKKSWKGKERSF SKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT
Subjt:  SEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMT

Query:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
        PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSG TIKSENGVMKVTKGSLVKLRGTLRHGLYVLE
Subjt:  PHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLE

Query:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
        GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF
Subjt:  GTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYF

Query:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
        ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTG+KVKYLRTDNGLEFVNNKFN FCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA
Subjt:  ISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNA

Query:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
        SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF
Subjt:  SLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTF

Query:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
        NETEMPYCVKEQQKQQTGDHVVTEVRIASE+RPSIDLD+QPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI
Subjt:  NETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDRVQRERHAPI

Query:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR
        RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWK+AMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVA+   ++ 
Subjt:  RYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRR

Query:  ELIF
         + F
Subjt:  ELIF

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.3e-8726.86Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI
        M   +  +  F+G   +A+W+ +IRA+L +  V K++D   L  N ++   +  +  A STI+ YLSD  L       T  ++ + L+++Y  KSL +++
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKI

Query:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIK-YGRDSLTMSIVLDALKTRNLEIKKERKD--GELLMA
         ++++    K+    SL  +   F +++ +L   G K+ + ++   LL +LP  Y  +  AI+    ++LT++ V + L  + ++IK +  D   +++ A
Subjt:  YIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIK-YGRDSLTMSIVLDALKTRNLEIKKERKD--GELLMA

Query:  RGRSEKKSWKG---KERSFMSKS--KGKSR---KCFLCHKEGHFKKNC-----PLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRD
           +   ++K    K R    K   KG S+   KC  C +EGH KK+C      LN   + +  +      +  A +               V  V++  
Subjt:  RGRSEKKSWKG---KERSFMSKS--KGKSR---KCFLCHKEGHFKKNC-----PLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRD

Query:  IQD--AWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI-------LTNVRYVPKLKRNLISLGELDRSGRTIKSEN
        + D   +++DSG + H+       T+  +V     +        V   G    AT  G+VR+       L +V +  +   NL+S+  L  +G +I+ + 
Subjt:  IQD--AWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRI-------LTNVRYVPKLKRNLISLGELDRSGRTIKSEN

Query:  GVMKVTKGSL--VKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGG---VKNVEL--PFCEHCIMGKSTRVKFG
          + ++K  L  VK  G L +         ++  A   + K  +   LWH+R  H+S+  L  + ++ +      + N+EL    CE C+ GK  R+ F 
Subjt:  GVMKVTKGSL--VKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGG---VKNVEL--PFCEHCIMGKSTRVKFG

Query:  KGKHTT--KGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGI
        + K  T  K  L  VHSD+ GP   V++    YF+  +D F+     Y +K K + F  F ++  + E     KV YL  DNG E+++N+   FC  +GI
Subjt:  KGKHTT--KGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGI

Query:  TRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTAL--NLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKD--GKLN
        + H TV +TPQ NG++ER  RTI E+ R +++ A L   FWGEA  TA YLINR PS AL  + KTP E+W  K P L+HLRVFG T Y H+K+  GK +
Subjt:  TRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTAL--NLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKD--GKLN

Query:  KRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEM---------PYCVKEQQKQQTGDHVVTEVRIASELRP--SIDLDDQPPLVSEIEDTQ
         ++ K +F+GY     G+KLW  +    K I++RDV  +ET M            +K+ ++ +  +      +I     P  S + D+    +  ++D++
Subjt:  KRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEM---------PYCVKEQQKQQTGDHVVTEVRIASELRP--SIDLDDQPPLVSEIEDTQ

Query:  QSEFDGIQSQQERIL----------IDEGAFIEESSSNNDL----------------------------------------------QNYQLTRDRVQRE
        +SE     +   +I+           D   F+++S  +N                                                   ++   R +R 
Subjt:  QSEFDGIQSQQERIL----------IDEGAFIEESSSNNDL----------------------------------------------QNYQLTRDRVQRE

Query:  RHAP-IRYGYADLVAYALTCAADSIEAE-PLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVA
        +  P I Y   D     +   A +I  + P +F+E    D K  W+ A+  EL +   N TW++  +P N+ ++ S+W++ +K    GN   RYKARLVA
Subjt:  RHAP-IRYGYADLVAYALTCAADSIEAE-PLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVA

Query:  RATLRRREL
        R   ++ ++
Subjt:  RATLRRREL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.6e-15136.24Show/hide
Query:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILD-EERLPENIIESEKRDMDEMAYSTILLYLSDEVL-RLVDEATTTGELWKKLESLYLTKSLPN
        M+  ++EV+KFNG   F+ W++++R +L+Q  + K+LD + + P+ +   +  D+DE A S I L+LSD+V+  ++DE T  G +W +LESLY++K+L N
Subjt:  MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILD-EERLPENIIESEKRDMDEMAYSTILLYLSDEVL-RLVDEATTTGELWKKLESLYLTKSLPN

Query:  KIYIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLM--
        K+Y+K++ +   M +  +   +L+ F  ++  L  +G K+ +E++A++LLNSLP +Y  +   I +G+ ++ +  V  AL       KK    G+ L+  
Subjt:  KIYIKEKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLM--

Query:  ARGRSEKKSWKGKERS-----FMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIM
         RGRS ++S     RS       ++SK + R C+ C++ GHFK++CP  +  +  TS     D  N+A +    D+        E  M      +  W++
Subjt:  ARGRSEKKSWKGKERS-----FMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIM

Query:  DSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGT
        D+  + H TP RD    +   D G V +G+     + G G + I T+ G   +L +VR+VP L+ NLIS   LDR G      N   ++TKGSLV  +G 
Subjt:  DSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGT

Query:  LRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKE
         R  LY        G    A  +++    LWHKR+ H+SE+GLQ L+++ L+   K   +  C++C+ GK  RV F         ILD V+SD+ GP + 
Subjt:  LRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKE

Query:  VSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIME
         SMGG++YF++ IDD SRK+W+Y LK KD+ F  F ++   VE +TG+K+K LR+DNG E+ + +F  +C S GI    TV  TPQ NG+AER NRTI+E
Subjt:  VSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIME

Query:  RTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHV---KDGKLNKRALKCMFIGYPQGVKGYKLWCIEKG
        + R +L  A LP  FWGEA QTACYLINRSPS  L  + P+ VWT K  S  HL+VFGC A+AHV   +  KL+ +++ C+FIGY     GY+LW  +  
Subjt:  RTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHV---KDGKLNKRALKCMFIGYPQGVKGYKLWCIEKG

Query:  MNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLT
          K I SRDV F E+E+       +K + G  +   V I S        +     VSE     Q E  G   +Q   L DEG    E  +  + Q+  L 
Subjt:  MNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLT

Query:  RDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYK
        R   +R R    RY   + V         S + EP + +E +    K Q   AM+EE+ SL KN T+ LV  P  ++ ++ KW++K+K   G     RYK
Subjt:  RDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYK

Query:  ARLVARATLRRRELIF
        ARLV +   +++ + F
Subjt:  ARLVARATLRRRELIF

P93293 Uncharacterized mitochondrial protein AtMg003004.0e-2449.11Show/hide
Query:  GVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTK
        GV+KV KG    L+G     LY+L+G+  +G + +A     D + LWH RLAH+S+RG++ L ++G L   K   L FCE CI GK+ RV F  G+HTTK
Subjt:  GVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTK

Query:  GILDYVHSDLWG
          LDYVHSDLWG
Subjt:  GILDYVHSDLWG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-5824.39Show/hide
Query:  DFALWRKKIRAILVQHKVAKILD-EERLPENIIESEK-----------RDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIK
        ++ +W +++ A+   +++A  LD    +P   I ++            +  D++ YS +L  +S  V   V  ATT  ++W+ L  +Y   S  +   ++
Subjt:  DFALWRKKIRAILVQHKVAKILD-EERLPENIIESEK-----------RDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIK

Query:  EKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREV--KAAIKYGRDSLT---------------------MSIVLDALKT
         +   +    +K++++ +         L  +G+ M  + Q   +L +LPE Y+ V  + A K    +LT                     + I  +A+  
Subjt:  EKFFGYKMDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREV--KAAIKYGRDSLT---------------------MSIVLDALKT

Query:  RNLEIKKERKDG----ELLMARGRSEKKSWKGKERSF---MSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYE
        RN        +G            +  K W+    +F    ++SK    KC +C  +GH  K C         +   +     NS +    +    T ++
Subjt:  RNLEIKKERKDG----ELLMARGRSEKKSWKGKERSF---MSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYE

Query:  SAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGG-KVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGEL-DRSGRTI
            L +      + W++DSG T H+T   + L+  Q   GG  V++ D  T  +  TGS  ++T    +  L N+ YVP + +NLIS+  L + +G ++
Subjt:  SAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGG-KVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGEL-DRSGRTI

Query:  K--SENGVMKVTKGSLVKLRGTLRHGLY---VLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPF--CEHCIMGKSTR
        +    +  +K     +  L+G  +  LY   +     VS  A+  S K T  S  WH RL H +   L ++     L  V N    F  C  C++ KS +
Subjt:  K--SENGVMKVTKGSLVKLRGTLRHGLY---VLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPF--CEHCIMGKSTR

Query:  VKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSE
        V F +    +   L+Y++SD+W  +  +S    RY++  +D F+R  W+YPLKQK +    F+ +K  +EN+   ++    +DNG EFV      +    
Subjt:  VKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSE

Query:  GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVK---DGKL
        GI+   +  +TP+ NGL+ER +R I+E    LL++AS+P  +W  A   A YLINR P+  L L++P +   G +P+ + LRVFGC  Y  ++     KL
Subjt:  GITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVK---DGKL

Query:  NKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYC--------VKEQQKQQT---GDHVVTEVR---------------IASELRPSI
        + ++ +C+F+GY      Y   C+    ++  ISR V F+E   P+         V+EQ+++ +     H     R                     PS 
Subjt:  NKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYC--------VKEQQKQQT---GDHVVTEVR---------------IASELRPSI

Query:  DLDDQPPLVSEIEDTQQSEF-----------DGIQSQQERILIDEGAFIEESSSNNDLQN---YQLTRDR------------------------------
           +     S ++ +  S F           +G Q   +           +++S N+  N    QL +                                
Subjt:  DLDDQPPLVSEIEDTQQSEF-----------DGIQSQQERILIDEGAFIEESSSNNDLQN---YQLTRDR------------------------------

Query:  ----------VQRERHAPIR------YGYADLV----AYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQ-KLIQSK
                  V     AP+          A ++     Y+L  +  + E+EP T   AI +   ++W+NAM  E+ +   N TW LVP PP+   ++  +
Subjt:  ----------VQRERHAPIR------YGYADLV----AYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQ-KLIQSK

Query:  WIYKIKPGTGGNSKPRYKARLVARATLRR
        WI+  K  + G S  RYKARLVA+   +R
Subjt:  WIYKIKPGTGGNSKPRYKARLVARATLRR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.7e-6025.59Show/hide
Query:  DFALWRKKIRAILVQHKVAKILD-EERLPENIIESEK-----------RDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIK
        ++ +W +++ A+   +++A  LD    +P   I ++            R  D++ YS IL  +S  V   V  ATT  ++W+ L  +Y   S  +   ++
Subjt:  DFALWRKKIRAILVQHKVAKILD-EERLPENIIESEK-----------RDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIK

Query:  -------EKFFGYKMDQSKSLE---ENLDEFQKIVVD----------LNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNL
                   G  MD  + +E   ENL +  K V+D          L +I E++ +    ++ LNS       + A +   R++ T          RN 
Subjt:  -------EKFFGYKMDQSKSLE---ENLDEFQKIVVD----------LNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNL

Query:  EIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHR
          + + ++      R  S + S  G  RS   + K    +C +C  +GH  K CP     +++T++   T  +             T ++    L V+  
Subjt:  EIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGKSRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHR

Query:  DIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGG-KVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRT---IKSENGVMK
           + W++DSG T H+T   + L+  Q   GG  V++ D  T  +  TGS  + T    +  L  V YVP + +NLIS+  L  + R        +  +K
Subjt:  DIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGG-KVLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRT---IKSENGVMK

Query:  VTKGSLVKLRGTLRHGLY---VLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQA-LSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTK
             +  L+G  +  LY   +     VS  A+  S K T  S  WH RL H S   L + +S   L     + +L  C  C + KS +V F     T+ 
Subjt:  VTKGSLVKLRGTLRHGLY---VLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQA-LSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTK

Query:  GILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYT
          L+Y++SD+W  +  +S+   RY++  +D F+R  W+YPLKQK +    F+ +K  VEN+   ++  L +DNG EFV      +    GI+   +  +T
Subjt:  GILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGLEFVNNKFNHFCKSEGITRHFTVTYT

Query:  PQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKD---GKLNKRALKCMFIG
        P+ NGL+ER +R I+E    LL++AS+P  +W  A   A YLINR P+  L L++P +   G+ P+ E L+VFGC  Y  ++     KL  ++ +C F+G
Subjt:  PQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKD---GKLNKRALKCMFIG

Query:  YPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYC-----VKEQQKQQT-------------------------GDHVVTEVR------------IASE
        Y      Y   C+     +   SR V F+E   P+      V   Q+Q++                         G H+ T  R            ++S 
Subjt:  YPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYC-----VKEQQKQQT-------------------------GDHVVTEVR------------IASE

Query:  LRPSIDL----DDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDR-----------------------------------
          PS  +      +P   S       ++    Q+      I         S N+  QN  L +                                     
Subjt:  LRPSIDL----DDQPPLVSEIEDTQQSEFDGIQSQQERILIDEGAFIEESSSNNDLQNYQLTRDR-----------------------------------

Query:  ---VQRERHAPIR-YGYA-----------DLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLV-PKPPNQKLIQSKWIYKI
           +Q    AP+  +  A              +YA + AA+S   EP T  +A+  D   +W+ AM  E+ +   N TW LV P PP+  ++  +WI+  
Subjt:  ---VQRERHAPIR-YGYA-----------DLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLV-PKPPNQKLIQSKWIYKI

Query:  KPGTGGNSKPRYKARLVARATLRR
        K  + G S  RYKARLVA+   +R
Subjt:  KPGTGGNSKPRYKARLVARATLRR

Arabidopsis top hitse value%identityAlignment
AT3G21000.1 Gag-Pol-related retrotransposon family protein3.0e-0626.35Show/hide
Query:  KGKSRK-CFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGK
        K KS K C LC+K  H +++C      +    E  +   Y                E+   L     D  D WI+      +MTP+  + T   +     
Subjt:  KGKSRK-CFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGK

Query:  VLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGEL
        V   D     V+G G V+I   +G  + + NV +VP L RN++S G++
Subjt:  VLLGDNGTCDVKGTGSVQIATHDGMVRILTNVRYVPKLKRNLISLGEL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 89.2e-0837.23Show/hide
Query:  LTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRRELIFM
        L C A +   EP T+ EA        W  AM++E+ ++    TW +   PPN+K I  KW+YKIK  + G  + RYKARLVA+   ++  + F+
Subjt:  LTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRRELIFM

ATMG00300.1 Gag-Pol-related retrotransposon family protein2.8e-2549.11Show/hide
Query:  GVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTK
        GV+KV KG    L+G     LY+L+G+  +G + +A     D + LWH RLAH+S+RG++ L ++G L   K   L FCE CI GK+ RV F  G+HTTK
Subjt:  GVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQGLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTK

Query:  GILDYVHSDLWG
          LDYVHSDLWG
Subjt:  GILDYVHSDLWG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein6.3e-1748.78Show/hide
Query:  NRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALK
        NRTI+E+ R +L    LP  F  +AA TA ++IN+ PSTA+N   P EVW    P+  +LR FGC AY H  +GKL  RA K
Subjt:  NRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGKLNKRALK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.6e-0737.5Show/hide
Query:  YALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRRELIFM
        Y+LT    +I+ EP   +  I +     W  AM+EEL +L +N+TW LVP P NQ ++  KW++K K  + G +  R KARLVA+   +   + F+
Subjt:  YALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTGGNSKPRYKARLVARATLRRRELIFM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCGACACGATTTGAAGTATCTAAGTTTAATGGACATGGAGATTTTGCTCTTTGGAGAAAAAAGATTAGAGCTATTTTGGTTCAACATAAAGTAGCAAAGATCTT
AGATGAAGAGAGACTTCCAGAAAATATTATAGAAAGTGAAAAAAGAGATATGGATGAAATGGCCTATTCAACTATTCTACTGTATCTGTCAGATGAAGTTCTTAGGCTAG
TGGATGAGGCTACTACTACAGGGGAGTTGTGGAAAAAGCTAGAGAGCCTTTATTTGACAAAGTCATTGCCAAATAAAATATATATAAAGGAGAAGTTCTTTGGATATAAA
ATGGACCAAAGTAAAAGTTTAGAAGAGAATCTGGATGAATTTCAGAAGATTGTAGTTGATCTCAATAAAATTGGTGAAAAGATGTCGGATGAGAATCAAGCAGTGATTCT
TTTAAATTCACTACCAGAAACATATCGAGAGGTTAAGGCTGCTATTAAATATGGTCGGGATTCATTGACCATGAGTATAGTGTTGGATGCCTTGAAAACTAGAAATCTCG
AGATCAAGAAAGAACGCAAAGATGGCGAGTTACTAATGGCCAGAGGGAGGAGTGAGAAAAAGAGCTGGAAAGGTAAAGAGAGGAGTTTCATGTCAAAATCCAAGGGAAAA
TCTAGAAAGTGTTTCCTTTGTCATAAAGAAGGACACTTTAAGAAAAATTGCCCTTTGAATAAGAGCAGAGAAGCATCAACCAGTGAAGCGAATGTTACTGATGGGTATAA
TTCAGCAGAGATCACTGATGGGTATGATTCAGCAGAGACTGGGTATGAGTCTGCAGAGGTCTTGATGGTGTCTCACAGAGATATACAGGATGCTTGGATCATGGATTCAG
GGTGTACTTTTCATATGACCCCTCATCGGGATTTTCTGACAAACTTTCAGAAAGTTGATGGGGGAAAGGTCTTATTGGGTGACAATGGTACATGCGATGTAAAAGGAACT
GGTTCAGTGCAAATTGCAACACATGATGGGATGGTAAGAATACTTACTAATGTGCGGTATGTTCCAAAACTTAAACGTAATCTAATATCCCTTGGGGAATTAGATAGATC
AGGTCGTACCATAAAATCTGAAAATGGAGTTATGAAAGTTACCAAAGGTTCTCTAGTTAAACTGAGGGGAACTTTAAGACATGGTCTATATGTGTTGGAAGGTACTACAG
TTTCAGGCAGTGCTGCTATCGCGTCTGGTAAAGTTACAGATATGTCTATGTTATGGCATAAAAGGCTAGCTCATGTGAGTGAAAGAGGCTTACAAGCTCTATCCCAACAA
GGTTTGCTAGGAGGAGTTAAGAATGTTGAACTCCCATTTTGTGAACATTGTATAATGGGAAAGTCTACCAGAGTAAAGTTTGGGAAAGGGAAGCACACGACCAAAGGTAT
TTTGGATTATGTTCACTCAGATTTGTGGGGTCCTACGAAAGAGGTTTCTATGGGAGGTTCGAGATACTTTATCTCTATCATTGATGATTTCTCAAGAAAAGTATGGATTT
ATCCATTGAAACAAAAGGATGAAGCTTTTGGAAAATTCCTTGAATGGAAGAAGCAGGTTGAGAACCAAACAGGTAAGAAGGTTAAGTATCTGAGGACAGATAATGGTTTA
GAGTTTGTAAATAACAAATTCAACCATTTTTGCAAATCTGAGGGAATCACGAGGCACTTTACTGTTACGTACACTCCACAACAAAATGGTTTGGCTGAAAGGTTTAACAG
AACTATCATGGAACGTACAAGGTGTCTCTTGACTAATGCTTCTTTACCATTGAAATTTTGGGGAGAAGCTGCCCAAACAGCGTGTTATCTCATTAATAGGAGTCCTTCTA
CCGCTTTAAACTTAAAGACTCCTCAGGAGGTCTGGACAGGTAAGGCTCCAAGTTTAGAACATCTTAGAGTGTTTGGATGCACAGCTTATGCTCATGTTAAAGATGGAAAA
TTGAACAAGAGGGCACTAAAATGCATGTTTATTGGGTATCCTCAGGGTGTCAAAGGTTATAAACTTTGGTGCATTGAAAAAGGGATGAATAAATGCATTATCAGTAGAGA
TGTAACTTTTAATGAGACTGAAATGCCTTACTGTGTTAAAGAGCAGCAGAAACAACAGACGGGTGATCATGTTGTGACAGAGGTTAGAATTGCTTCAGAGCTACGACCAT
CAATTGACTTAGATGATCAGCCTCCACTGGTTTCAGAAATAGAGGATACACAGCAGTCTGAATTTGATGGTATTCAATCTCAACAGGAGAGGATTTTGATTGATGAGGGA
GCTTTTATTGAAGAAAGCTCAAGTAACAATGACCTACAGAATTATCAGCTTACCCGTGACAGAGTTCAGAGGGAAAGACATGCACCTATAAGGTATGGTTATGCTGACTT
AGTTGCTTATGCTCTCACTTGTGCAGCTGACAGTATTGAAGCAGAGCCTCTTACTTTTGAAGAGGCAATTGTATCTGATTCAAAGAAACAATGGAAGAATGCCATGGAAG
AAGAATTGTTCTCTTTGCATAAGAATCAGACATGGTCATTGGTTCCAAAGCCTCCTAATCAGAAACTTATTCAATCAAAATGGATTTACAAAATTAAGCCAGGTACAGGA
GGTAACAGTAAGCCTAGATATAAGGCTAGGTTGGTAGCCAGGGCTACACTCAGAAGGAGGGAGTTGATTTTCATGAGATTTTCTCTCCAGTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCGACACGATTTGAAGTATCTAAGTTTAATGGACATGGAGATTTTGCTCTTTGGAGAAAAAAGATTAGAGCTATTTTGGTTCAACATAAAGTAGCAAAGATCTT
AGATGAAGAGAGACTTCCAGAAAATATTATAGAAAGTGAAAAAAGAGATATGGATGAAATGGCCTATTCAACTATTCTACTGTATCTGTCAGATGAAGTTCTTAGGCTAG
TGGATGAGGCTACTACTACAGGGGAGTTGTGGAAAAAGCTAGAGAGCCTTTATTTGACAAAGTCATTGCCAAATAAAATATATATAAAGGAGAAGTTCTTTGGATATAAA
ATGGACCAAAGTAAAAGTTTAGAAGAGAATCTGGATGAATTTCAGAAGATTGTAGTTGATCTCAATAAAATTGGTGAAAAGATGTCGGATGAGAATCAAGCAGTGATTCT
TTTAAATTCACTACCAGAAACATATCGAGAGGTTAAGGCTGCTATTAAATATGGTCGGGATTCATTGACCATGAGTATAGTGTTGGATGCCTTGAAAACTAGAAATCTCG
AGATCAAGAAAGAACGCAAAGATGGCGAGTTACTAATGGCCAGAGGGAGGAGTGAGAAAAAGAGCTGGAAAGGTAAAGAGAGGAGTTTCATGTCAAAATCCAAGGGAAAA
TCTAGAAAGTGTTTCCTTTGTCATAAAGAAGGACACTTTAAGAAAAATTGCCCTTTGAATAAGAGCAGAGAAGCATCAACCAGTGAAGCGAATGTTACTGATGGGTATAA
TTCAGCAGAGATCACTGATGGGTATGATTCAGCAGAGACTGGGTATGAGTCTGCAGAGGTCTTGATGGTGTCTCACAGAGATATACAGGATGCTTGGATCATGGATTCAG
GGTGTACTTTTCATATGACCCCTCATCGGGATTTTCTGACAAACTTTCAGAAAGTTGATGGGGGAAAGGTCTTATTGGGTGACAATGGTACATGCGATGTAAAAGGAACT
GGTTCAGTGCAAATTGCAACACATGATGGGATGGTAAGAATACTTACTAATGTGCGGTATGTTCCAAAACTTAAACGTAATCTAATATCCCTTGGGGAATTAGATAGATC
AGGTCGTACCATAAAATCTGAAAATGGAGTTATGAAAGTTACCAAAGGTTCTCTAGTTAAACTGAGGGGAACTTTAAGACATGGTCTATATGTGTTGGAAGGTACTACAG
TTTCAGGCAGTGCTGCTATCGCGTCTGGTAAAGTTACAGATATGTCTATGTTATGGCATAAAAGGCTAGCTCATGTGAGTGAAAGAGGCTTACAAGCTCTATCCCAACAA
GGTTTGCTAGGAGGAGTTAAGAATGTTGAACTCCCATTTTGTGAACATTGTATAATGGGAAAGTCTACCAGAGTAAAGTTTGGGAAAGGGAAGCACACGACCAAAGGTAT
TTTGGATTATGTTCACTCAGATTTGTGGGGTCCTACGAAAGAGGTTTCTATGGGAGGTTCGAGATACTTTATCTCTATCATTGATGATTTCTCAAGAAAAGTATGGATTT
ATCCATTGAAACAAAAGGATGAAGCTTTTGGAAAATTCCTTGAATGGAAGAAGCAGGTTGAGAACCAAACAGGTAAGAAGGTTAAGTATCTGAGGACAGATAATGGTTTA
GAGTTTGTAAATAACAAATTCAACCATTTTTGCAAATCTGAGGGAATCACGAGGCACTTTACTGTTACGTACACTCCACAACAAAATGGTTTGGCTGAAAGGTTTAACAG
AACTATCATGGAACGTACAAGGTGTCTCTTGACTAATGCTTCTTTACCATTGAAATTTTGGGGAGAAGCTGCCCAAACAGCGTGTTATCTCATTAATAGGAGTCCTTCTA
CCGCTTTAAACTTAAAGACTCCTCAGGAGGTCTGGACAGGTAAGGCTCCAAGTTTAGAACATCTTAGAGTGTTTGGATGCACAGCTTATGCTCATGTTAAAGATGGAAAA
TTGAACAAGAGGGCACTAAAATGCATGTTTATTGGGTATCCTCAGGGTGTCAAAGGTTATAAACTTTGGTGCATTGAAAAAGGGATGAATAAATGCATTATCAGTAGAGA
TGTAACTTTTAATGAGACTGAAATGCCTTACTGTGTTAAAGAGCAGCAGAAACAACAGACGGGTGATCATGTTGTGACAGAGGTTAGAATTGCTTCAGAGCTACGACCAT
CAATTGACTTAGATGATCAGCCTCCACTGGTTTCAGAAATAGAGGATACACAGCAGTCTGAATTTGATGGTATTCAATCTCAACAGGAGAGGATTTTGATTGATGAGGGA
GCTTTTATTGAAGAAAGCTCAAGTAACAATGACCTACAGAATTATCAGCTTACCCGTGACAGAGTTCAGAGGGAAAGACATGCACCTATAAGGTATGGTTATGCTGACTT
AGTTGCTTATGCTCTCACTTGTGCAGCTGACAGTATTGAAGCAGAGCCTCTTACTTTTGAAGAGGCAATTGTATCTGATTCAAAGAAACAATGGAAGAATGCCATGGAAG
AAGAATTGTTCTCTTTGCATAAGAATCAGACATGGTCATTGGTTCCAAAGCCTCCTAATCAGAAACTTATTCAATCAAAATGGATTTACAAAATTAAGCCAGGTACAGGA
GGTAACAGTAAGCCTAGATATAAGGCTAGGTTGGTAGCCAGGGCTACACTCAGAAGGAGGGAGTTGATTTTCATGAGATTTTCTCTCCAGTGGTGA
Protein sequenceShow/hide protein sequence
MASTRFEVSKFNGHGDFALWRKKIRAILVQHKVAKILDEERLPENIIESEKRDMDEMAYSTILLYLSDEVLRLVDEATTTGELWKKLESLYLTKSLPNKIYIKEKFFGYK
MDQSKSLEENLDEFQKIVVDLNKIGEKMSDENQAVILLNSLPETYREVKAAIKYGRDSLTMSIVLDALKTRNLEIKKERKDGELLMARGRSEKKSWKGKERSFMSKSKGK
SRKCFLCHKEGHFKKNCPLNKSREASTSEANVTDGYNSAEITDGYDSAETGYESAEVLMVSHRDIQDAWIMDSGCTFHMTPHRDFLTNFQKVDGGKVLLGDNGTCDVKGT
GSVQIATHDGMVRILTNVRYVPKLKRNLISLGELDRSGRTIKSENGVMKVTKGSLVKLRGTLRHGLYVLEGTTVSGSAAIASGKVTDMSMLWHKRLAHVSERGLQALSQQ
GLLGGVKNVELPFCEHCIMGKSTRVKFGKGKHTTKGILDYVHSDLWGPTKEVSMGGSRYFISIIDDFSRKVWIYPLKQKDEAFGKFLEWKKQVENQTGKKVKYLRTDNGL
EFVNNKFNHFCKSEGITRHFTVTYTPQQNGLAERFNRTIMERTRCLLTNASLPLKFWGEAAQTACYLINRSPSTALNLKTPQEVWTGKAPSLEHLRVFGCTAYAHVKDGK
LNKRALKCMFIGYPQGVKGYKLWCIEKGMNKCIISRDVTFNETEMPYCVKEQQKQQTGDHVVTEVRIASELRPSIDLDDQPPLVSEIEDTQQSEFDGIQSQQERILIDEG
AFIEESSSNNDLQNYQLTRDRVQRERHAPIRYGYADLVAYALTCAADSIEAEPLTFEEAIVSDSKKQWKNAMEEELFSLHKNQTWSLVPKPPNQKLIQSKWIYKIKPGTG
GNSKPRYKARLVARATLRRRELIFMRFSLQW