; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0132041 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0132041
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr05:11923636..11924889
RNA-Seq ExpressionCmc05g0132041
SyntenyCmc05g0132041
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0031864.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.7e-21095.18Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRASKLLSQGTWSILASM+DT EVDVSLSLEPVV DYPDV SEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAPAVF+
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR      HVLYKARVSVDPAKI+ VTSWPRPSTVSEF IFLSLAGYYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPD S SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAT+VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

KAA0032794.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]5.6e-20692.39Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRA KLLSQGTWSIL+S+VDTREVDVSLS EPVVRDY DVF EELPGLPPHREIEFAIELEPGT+PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        P+GA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAP VFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHV+ KA VSVDPAKIEAVTSWPRPSTVSE R FL L GYYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TR GA FVWSKACEDSFQNLK+KLVTA V TVPDGS SF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELA +VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

KAA0036553.1 pol protein [Cucumis melo var. makuwa]5.6e-20692.64Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDVSLS EPVVRDYPDVF EELPGLPPHRE+EFAIELEP T+PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        P+GA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP IDDLFDQLQ ATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAPAVFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VSFLGHV+ KA VSVDPAKIEAVT W RPSTVSEFR FL LAGYYRRFVENFS IATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TR GA FVWSKACEDSFQNLKQKLVTA VLTVPDGS SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELA +VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

KAA0046108.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.1e-21396.45Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HY FI MSFGLTNAP VFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMNKVFREFLDTFVIVFIDDILIYSKTE EHEEHLR      HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLA YYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TR GASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLEL TMVFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

KAA0056681.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.0e-21196.19Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRASKLLSQGTWSILASMVDT EV+VSLSLEPVVRDYPDVFSEEL GLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
         FGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY H EFIVMSFGLTNAPAVFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR      HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGS SFVIYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAT+VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

TrEMBL top hitse value%identityAlignment
A0A5A7SL15 Ty3-gypsy retrotransposon protein8.2e-21195.18Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRASKLLSQGTWSILASM+DT EVDVSLSLEPVV DYPDV SEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQ+LLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAPAVF+
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR      HVLYKARVSVDPAKI+ VTSWPRPSTVSEF IFLSLAGYYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPD S SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELAT+VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

A0A5A7T0Y9 Reverse transcriptase2.7e-20692.64Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        +RASKLLSQGTW ILAS+VDTREVDVSLS EPVVRDYPDVF EELPGLPPHRE+EFAIELEP T+PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        P+GA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP IDDLFDQLQ ATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAPAVFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VSFLGHV+ KA VSVDPAKIEAVT W RPSTVSEFR FL LAGYYRRFVENFS IATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TR GA FVWSKACEDSFQNLKQKLVTA VLTVPDGS SFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELA +VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

A0A5A7TTB1 Ty3-gypsy retrotransposon protein1.0e-21396.45Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HY FI MSFGLTNAP VFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMNKVFREFLDTFVIVFIDDILIYSKTE EHEEHLR      HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLA YYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TR GASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLEL TMVFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

A0A5A7UT59 Ty3-gypsy retrotransposon protein9.6e-21296.19Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRASKLLSQGTWSILASMVDT EV+VSLSLEPVVRDYPDVFSEEL GLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
         FGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY H EFIVMSFGLTNAPAVFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR      HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGS SFVIYSDASKKGLGCVLMQQGKVV YASRQLKSHEQNYPTHDLELAT+VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

A0A5D3E456 Reverse transcriptase2.7e-20692.39Show/hide
Query:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
        MRA KLLSQGTWSIL+S+VDTREVDVSLS EPVVRDY DVF EELPGLPPHREIEFAIELEPGT+PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS
Subjt:  MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVS

Query:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM
        P+GA VLFVKKKDGSMRLCIDYRELNKVTVKNRYPLP+IDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRY HYEFIVMSFGLTNAP VFM
Subjt:  PFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFM

Query:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL
        DLMN+VFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHV+ KA VSVDPAKIEAVTSWPRPSTVSE R FL L GYYRRFVENFSRIATPLTQL
Subjt:  DLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQL

Query:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        TR GA FVWSKACEDSFQNLK+KLVTA V TVPDGS SF+IY DASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELA +VFALKIWRH
Subjt:  TRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.1e-6237.22Show/hide
Query:  YRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRI
        Y    A  +E++ Q+Q++L++G IR S SP+ + +  V KK+D S     R+ IDYR+LN++TV +R+P+P +D++  +L     F+ IDL  G+HQ+ +
Subjt:  YRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFV-KKKDGS----MRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRI

Query:  KDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMV-----------------------SFLG
            V KTAF +++ HYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V                       +FLG
Subjt:  KDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMV-----------------------SFLG

Query:  HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSKACEDS-FQNLKQKLVTALVLTVPDGSRSFVIY
        HVL    +  +P KIEA+  +P P+   E + FL L GYYR+F+ NF+ IA P+T+  +       +    DS F+ LK  +    +L VPD ++ F + 
Subjt:  HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSKACEDS-FQNLKQKLVTALVLTVPDGSRSFVIY

Query:  SDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        +DAS   LG VL Q G  ++Y SR L  HE NY T + EL  +V+A K +RH
Subjt:  SDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

P20825 Retrovirus-related Pol polyprotein from transposon 2973.6e-6234.92Show/hide
Query:  PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSG
        PI    Y +A     E++ Q+QE+L++G IR S SP+ +    V KK         R+ IDYR+LN++T+ +RYP+P +D++  +L     F+ IDL  G
Subjt:  PISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKKD-----GSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMV---------------------
        +HQ+ + +  + KTAF ++  HYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V                     
Subjt:  YHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMV---------------------

Query:  --SFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSK-ACEDSFQNLKQKLVTALVLTVPDGS
          +FLGH++    +  +P K++A+ S+P P+   E R FL L GYYR+F+ N++ IA P+T   +        K    ++F+ LK  ++   +L +PD  
Subjt:  --SFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSK-ACEDSFQNLKQKLVTALVLTVPDGS

Query:  RSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        + FV+ +DAS   LG VL Q G  +++ SR L  HE NY   + EL  +V+A K +RH
Subjt:  RSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.3e-5935.38Show/hide
Query:  YPDVFSEELPGLPP---HREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  ++  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  + V+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFSEELPGLPP---HREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLP+ID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +    YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMV-----------------------SFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWS
        +HL  V                        FLG+ +   +++    K  A+  +P P TV + + FL +  YYRRF+ N S+IA P+ QL     S  W+
Subjt:  EHLRMV-----------------------SFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWS

Query:  KACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        +  + + + LK  L  + VL   +   ++ + +DASK G+G VL +         VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+
Subjt:  KACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.0e-5631.53Show/hide
Query:  LEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGT---IPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKK-----DGSMRLCID
        L  ++ ++P +F   L G+     +E A++ E  T    PI    Y        E++ Q+ ELL  G IRPS SP+ + +  V KK     +   R+ +D
Subjt:  LEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGT---IPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKK-----DGSMRLCID

Query:  YRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDD
        ++ LN VT+ + YP+P I+     L  A  F+ +DL SG+HQ+ +K+ D+PKTAF +    YEF+ + FGL NAPA+F  +++ + RE +     V+IDD
Subjt:  YRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDD

Query:  ILIYSKTEAEHEEHLRM-----------------------VSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLT
        I+++S+    H ++LR+                       V FLG+++    +  DP K+ A++  P P++V E + FL +  YYR+F+++++++A PLT
Subjt:  ILIYSKTEAEHEEHLRM-----------------------VSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLT

Query:  QLTR-MGASFVWSKACE----------DSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQ----QGKVVAYASRQLKSHEQNYPTHDLELA
         LTR + A+   S++ +           SF +LK  L ++ +L  P  ++ F + +DAS   +G VL Q    + + +AY SR L   E+NY T + E+ 
Subjt:  QLTR-MGASFVWSKACE----------DSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQ----QGKVVAYASRQLKSHEQNYPTHDLELA

Query:  TMVFAL
         ++++L
Subjt:  TMVFAL

Q99315 Transposon Ty3-G Gag-Pol polyprotein8.2e-5935.38Show/hide
Query:  YPDVFSEELPGLPP---HREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNR
        Y ++   +LP  P    +  ++  IE++PG       PY +     +E+   +Q+LLD  FI PS SP  + V+ V KKDG+ RLC+DYR LNK T+ + 
Subjt:  YPDVFSEELPGLPP---HREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVKKKDGSMRLCIDYRELNKVTVKNR

Query:  YPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHE
        +PLP+ID+L  ++  A +F+ +DL SGYHQ+ ++  D  KTAF +    YE+ VM FGL NAP+ F   M   FR+    FV V++DDILI+S++  EH 
Subjt:  YPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFIDDILIYSKTEAEHE

Query:  EHLRMV-----------------------SFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWS
        +HL  V                        FLG+ +   +++    K  A+  +P P TV + + FL +  YYRRF+ N S+IA P+ QL     S  W+
Subjt:  EHLRMV-----------------------SFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWS

Query:  KACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH
        +  + +   LK  L  + VL   +   ++ + +DASK G+G VL +         VV Y S+ L+S ++NYP  +LEL  ++ AL  +R+
Subjt:  KACEDSFQNLKQKLVTALVLTVPDGSRSFVIYSDASKKGLGCVLMQQGK------VVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.7e-1742.31Show/hide
Query:  VSFLG--HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGS
        +++LG  H++    VS DPAK+EA+  WP P   +E R FL L GYYRRFV+N+ +I  PLT+L +   S  W++    +F+ LK  + T  VL +PD  
Subjt:  VSFLG--HVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSKACEDSFQNLKQKLVTALVLTVPDGS

Query:  RSFV
          FV
Subjt:  RSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGCCAGCAAACTGCTTAGTCAGGGTACTTGGAGTATCTTAGCGAGCATGGTGGATACTAGAGAGGTTGATGTATCCCTGTCATTAGAACCAGTGGTAAGGGACTA
TCCGGATGTTTTTTCTGAAGAGCTTCCAGGGTTACCTCCTCATAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCAGGTACGATTCCTATATCCAGAGCCCCATACAGAA
TGGCCCCAGCAGAATTGAAGGAACTGAAAGTGCAGCTACAGGAGTTGCTTGATAAAGGCTTCATTCGACCGAGTGTGTCACCTTTTGGTGCGGTAGTTTTATTTGTTAAG
AAGAAGGATGGATCGATGCGCCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAAGATCGACGATCTATTTGACCAGTTACA
GGGAGCTACAGTGTTCTCTAAGATTGATCTTCGATCAGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCAAAGACAGCCTTTCGTTCCAGATACAGACACTATG
AGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAACAAAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTATCCTTTCTAGGCCATGTGCTTTATAAGGCTAGGGTTTCTGTGGATCCAGCTAA
GATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGTTTCGTATCTTTCTAAGTTTAGCAGGTTATTACCGACGGTTTGTGGAGAACTTTTCTCGTATAG
CTACTCCTCTCACTCAGTTGACCAGGATGGGAGCTTCTTTCGTTTGGAGCAAGGCTTGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACTGCACTGGTTCTT
ACTGTACCTGATGGTTCCAGGAGTTTTGTGATTTACAGTGATGCTTCGAAGAAGGGTTTAGGTTGTGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCA
GTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTGGAATTGGCAACAATGGTTTTTGCACTGAAGATATGGAGGCATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGGGCCAGCAAACTGCTTAGTCAGGGTACTTGGAGTATCTTAGCGAGCATGGTGGATACTAGAGAGGTTGATGTATCCCTGTCATTAGAACCAGTGGTAAGGGACTA
TCCGGATGTTTTTTCTGAAGAGCTTCCAGGGTTACCTCCTCATAGAGAGATTGAGTTTGCCATAGAGCTGGAGCCAGGTACGATTCCTATATCCAGAGCCCCATACAGAA
TGGCCCCAGCAGAATTGAAGGAACTGAAAGTGCAGCTACAGGAGTTGCTTGATAAAGGCTTCATTCGACCGAGTGTGTCACCTTTTGGTGCGGTAGTTTTATTTGTTAAG
AAGAAGGATGGATCGATGCGCCTATGCATTGACTATAGGGAGTTGAATAAGGTAACCGTTAAGAACAGATATCCCTTGCCCAAGATCGACGATCTATTTGACCAGTTACA
GGGAGCTACAGTGTTCTCTAAGATTGATCTTCGATCAGGATATCATCAGCTGAGGATTAAGGATGGTGATGTACCAAAGACAGCCTTTCGTTCCAGATACAGACACTATG
AGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGACTTGATGAACAAAGTGTTTAGGGAGTTCCTAGACACTTTTGTGATCGTGTTTATTGAT
GATATTTTGATATATTCCAAGACAGAGGCCGAGCATGAGGAGCATTTACGTATGGTATCCTTTCTAGGCCATGTGCTTTATAAGGCTAGGGTTTCTGTGGATCCAGCTAA
GATAGAGGCAGTCACCAGTTGGCCCCGACCTTCCACAGTCAGTGAGTTTCGTATCTTTCTAAGTTTAGCAGGTTATTACCGACGGTTTGTGGAGAACTTTTCTCGTATAG
CTACTCCTCTCACTCAGTTGACCAGGATGGGAGCTTCTTTCGTTTGGAGCAAGGCTTGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACTGCACTGGTTCTT
ACTGTACCTGATGGTTCCAGGAGTTTTGTGATTTACAGTGATGCTTCGAAGAAGGGTTTAGGTTGTGTATTGATGCAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCA
GTTGAAGAGTCATGAGCAGAATTACCCTACACATGATTTGGAATTGGCAACAATGGTTTTTGCACTGAAGATATGGAGGCATTAG
Protein sequenceShow/hide protein sequence
MRASKLLSQGTWSILASMVDTREVDVSLSLEPVVRDYPDVFSEELPGLPPHREIEFAIELEPGTIPISRAPYRMAPAELKELKVQLQELLDKGFIRPSVSPFGAVVLFVK
KKDGSMRLCIDYRELNKVTVKNRYPLPKIDDLFDQLQGATVFSKIDLRSGYHQLRIKDGDVPKTAFRSRYRHYEFIVMSFGLTNAPAVFMDLMNKVFREFLDTFVIVFID
DILIYSKTEAEHEEHLRMVSFLGHVLYKARVSVDPAKIEAVTSWPRPSTVSEFRIFLSLAGYYRRFVENFSRIATPLTQLTRMGASFVWSKACEDSFQNLKQKLVTALVL
TVPDGSRSFVIYSDASKKGLGCVLMQQGKVVAYASRQLKSHEQNYPTHDLELATMVFALKIWRH