; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0071171 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0071171
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:17307555..17318168
RNA-Seq ExpressionCmc03g0071171
SyntenyCmc03g0071171
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037369.1 pol protein [Cucumis melo var. makuwa]3.7e-18983.04Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GLEVEPLGS+LSVSTPSG V+LSK++IKAC+V +ANHMLDVTLLVLDMQDFDVILGM+W+ ANHA+IDCF KEVVFNPPS  SFKF+  GIVC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLL+QGTWGILASVVD REPEVSLSS+ VVREYP+VFPDELP LPP REVDFAIELEP   PISRAPYRMAPAELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDLR GYHQLRIRD D PKTAFRSRYGHYEF+VMSFGLT APAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNR+FKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLYAKFSKCEFW +KV+FLGHVVSSEGVSV P KIE VT+WPRPSTVSEIRS 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

KAA0045265.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.8e-22098.48Show/hide
Query:  VEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVISALKA
        VEPLGSILSVST SGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFA+HASIDCFRKEVVFNP SRTSFKFKRAGIVCVPKVISALKA
Subjt:  VEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVISALKA

Query:  SKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWG
        SKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVF DELPRLPPIR VDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWG
Subjt:  SKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWG

Query:  APVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN
        APVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN
Subjt:  APVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN

Query:  RVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL
        RVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVG VKIEVVTSWPRPSTVSEIRSLL
Subjt:  RVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL

KAA0056047.1 reverse transcriptase [Cucumis melo var. makuwa]6.3e-18983.04Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GLEVEPL S+LSVSTPSG V+LSK+KIKAC+V +ANH+LDVTLLVLDMQDFDVILG +W+ ANHA+IDCF KEVVFNPPS TSFKF+RAGIVC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLLNQGTWGIL SVVDTREPEVSLSS+ VVREYP+VFPDELP LPP RE+DFAIELEP   PISRAPYRMAP ELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDL+ GYHQLRIRDSD PKTAFRSRY HYEFIVMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNRVFKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLY KFSKCEFW KKV+FLGHVVSSEGVS+ P KIE VT+WPRPSTVSEI+S 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

KAA0060745.1 pol protein [Cucumis melo var. makuwa]6.3e-18983.04Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GL+VEPLGS+LSVSTPSG V+LSK++IKAC+V +AN MLDVTLLVLDMQDFDVILGM+W+ ANHA+IDCF KEVVFNPPS  SFKF+ AG+VC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLL+QGTWGILASVVD REPEVSLSS+ VVREYP+VFPDELP LPP REVDFAIELEP   PISRAPYRMAPAELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDLR GYHQLRIRD D PKTAFRSRYGHYEF+VMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNRVFKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLYAKFSKCEFW +KV+FLGHVVSSEGVSV P KIE VT+WPRPSTVSEIRS 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

TYK01306.1 pol protein [Cucumis melo var. makuwa]1.3e-18983.54Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GLEVEPLGS+LSVSTPSG V+LSK++IKAC+V +ANHML+VTLLVLDMQDFDVILGM+W+ ANHA+IDCF KEVVFNPPS  SFKF+ AG+VC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLL+QGTWGILASVVD REPEVSLSS+ VVREYP+VFPDELP LPP REVDFAIELEP   PISRAPYRMAPAELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDLR GYHQLRIRDSD PKTAFR RYGHYEFIVMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNRVFKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLYAKFSKCEFW +KV+FLGHVVSSEGVSV P KIE VT+WPRPSTVSEIRS 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A5A7T7M0 Reverse transcriptase1.8e-18983.04Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GLEVEPLGS+LSVSTPSG V+LSK++IKAC+V +ANHMLDVTLLVLDMQDFDVILGM+W+ ANHA+IDCF KEVVFNPPS  SFKF+  GIVC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLL+QGTWGILASVVD REPEVSLSS+ VVREYP+VFPDELP LPP REVDFAIELEP   PISRAPYRMAPAELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDLR GYHQLRIRD D PKTAFRSRYGHYEF+VMSFGLT APAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNR+FKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLYAKFSKCEFW +KV+FLGHVVSSEGVSV P KIE VT+WPRPSTVSEIRS 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

A0A5A7TV58 Reverse transcriptase8.9e-22198.48Show/hide
Query:  VEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVISALKA
        VEPLGSILSVST SGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFA+HASIDCFRKEVVFNP SRTSFKFKRAGIVCVPKVISALKA
Subjt:  VEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVISALKA

Query:  SKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWG
        SKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVF DELPRLPPIR VDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWG
Subjt:  SKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWG

Query:  APVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN
        APVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN
Subjt:  APVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMN

Query:  RVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL
        RVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVG VKIEVVTSWPRPSTVSEIRSLL
Subjt:  RVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL

A0A5A7UN68 Reverse transcriptase3.1e-18983.04Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GLEVEPL S+LSVSTPSG V+LSK+KIKAC+V +ANH+LDVTLLVLDMQDFDVILG +W+ ANHA+IDCF KEVVFNPPS TSFKF+RAGIVC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLLNQGTWGIL SVVDTREPEVSLSS+ VVREYP+VFPDELP LPP RE+DFAIELEP   PISRAPYRMAP ELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDL+ GYHQLRIRDSD PKTAFRSRY HYEFIVMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNRVFKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLY KFSKCEFW KKV+FLGHVVSSEGVS+ P KIE VT+WPRPSTVSEI+S 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

A0A5A7V4E4 Reverse transcriptase3.1e-18983.04Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GL+VEPLGS+LSVSTPSG V+LSK++IKAC+V +AN MLDVTLLVLDMQDFDVILGM+W+ ANHA+IDCF KEVVFNPPS  SFKF+ AG+VC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLL+QGTWGILASVVD REPEVSLSS+ VVREYP+VFPDELP LPP REVDFAIELEP   PISRAPYRMAPAELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDLR GYHQLRIRD D PKTAFRSRYGHYEF+VMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNRVFKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLYAKFSKCEFW +KV+FLGHVVSSEGVSV P KIE VT+WPRPSTVSEIRS 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

A0A5D3BSV9 Reverse transcriptase6.2e-19083.54Show/hide
Query:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS
        +GLEVEPLGS+LSVSTPSG V+LSK++IKAC+V +ANHML+VTLLVLDMQDFDVILGM+W+ ANHA+IDCF KEVVFNPPS  SFKF+ AG+VC+PKVIS
Subjt:  MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVIS

Query:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV
        A+KASKLL+QGTWGILASVVD REPEVSLSS+ VVREYP+VFPDELP LPP REVDFAIELEP   PISRAPYRMAPAELKELK+QLQ+LLD GFIRPSV
Subjt:  ALKASKLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSV

Query:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF
        SPWGAPVLFVKKKDG MRLCIDYRELNKVTVKNRYPLP +       +   VFSKIDLR GYHQLRIRDSD PKTAFR RYGHYEFIVMSFGLTNAPAVF
Subjt:  SPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVF

Query:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL
        MDLMNRVFKDFLD F IVFIDDIL+YSKTE EHE+HLHQVL TLRANKLYAKFSKCEFW +KV+FLGHVVSSEGVSV P KIE VT+WPRPSTVSEIRS 
Subjt:  MDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSL

Query:  L
        L
Subjt:  L

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.1e-4132.38Show/hide
Query:  REPEVSLSSDSVVREYPNVFPD-ELPRLP-PIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFVKKKDGLMRLC
        +EPE+      + +E+ ++  +    +LP PI+ ++F +EL  + + +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G +R+ 
Subjt:  REPEVSLSSDSVVREYPNVFPD-ELPRLP-PIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFVKKKDGLMRLC

Query:  IDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFFIVFI
        +DY+ LNK    N YPLP +   L   +   +F+K+DL+  YH +R+R  D  K AFR   G +E++VM +G++ APA F   +N +  +  +   + ++
Subjt:  IDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFFIVFI

Query:  DDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL
        DDIL++SK+E EH  H+  VL  L+   L    +KCEF Q +V F+G+ +S +G +     I+ V  W +P    E+R  L
Subjt:  DDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL

P0CT41 Transposon Tf2-12 polyprotein2.1e-4132.38Show/hide
Query:  REPEVSLSSDSVVREYPNVFPD-ELPRLP-PIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFVKKKDGLMRLC
        +EPE+      + +E+ ++  +    +LP PI+ ++F +EL  + + +    Y + P +++ +  ++ + L  G IR S +    PV+FV KK+G +R+ 
Subjt:  REPEVSLSSDSVVREYPNVFPD-ELPRLP-PIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFVKKKDGLMRLC

Query:  IDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFFIVFI
        +DY+ LNK    N YPLP +   L   +   +F+K+DL+  YH +R+R  D  K AFR   G +E++VM +G++ APA F   +N +  +  +   + ++
Subjt:  IDYRELNKVTVKNRYPLPGLTTCLTSCK-ERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFFIVFI

Query:  DDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL
        DDIL++SK+E EH  H+  VL  L+   L    +KCEF Q +V F+G+ +S +G +     I+ V  W +P    E+R  L
Subjt:  DDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL

P20825 Retrovirus-related Pol polyprotein from transposon 2971.3e-4337.08Show/hide
Query:  PISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFVKKKD-----GLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSC-KERVFSKIDLRLG
        PI    Y +A     E++ Q+Q++L+ G IR S SP+ +P   V KK         R+ IDYR+LN++T+ +RYP+P +   L    K + F+ IDL  G
Subjt:  PISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFVKKKD-----GLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSC-KERVFSKIDLRLG

Query:  YHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQK
        +HQ+ + +    KTAF ++ GHYE++ M FGL NAPA F   MN + +  L+   +V++DDI+++S +  EH + +  V   L    L  +  KCEF +K
Subjt:  YHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQK

Query:  KVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL
        + +FLGH+V+ +G+   P+K++ + S+P P+   EIR+ L
Subjt:  KVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein3.8e-4334.08Show/hide
Query:  KASKLLNQGTWGILASVVDTREPEVSLSSDS---------VVREYPNVFPDELPRLPPIRE---VDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKL
        +AS L   G +  + S + + EP  +  S+          + ++Y  +  ++LP  P       V   IE++P        PY +     +E+   +QKL
Subjt:  KASKLLNQGTWGILASVVDTREPEVSLSSDS---------VVREYPNVFPDELPRLPPIRE---VDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKL

Query:  LDMGFIRPSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSC-KERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMS
        LD  FI PS SP  +PV+ V KKDG  RLC+DYR LNK T+ + +PLP +   L+     ++F+ +DL  GYHQ+ +   D  KTAF +  G YE+ VM 
Subjt:  LDMGFIRPSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSC-KERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMS

Query:  FGLTNAPAVFMDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPR
        FGL NAP+ F   M   F+D    F  V++DDIL++S++  EH  HL  VL  L+   L  K  KC+F  ++  FLG+ +  + ++    K   +  +P 
Subjt:  FGLTNAPAVFMDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPR

Query:  PSTVSEIRSLL
        P TV + +  L
Subjt:  PSTVSEIRSLL

Q99315 Transposon Ty3-G Gag-Pol polyprotein3.8e-4334.08Show/hide
Query:  KASKLLNQGTWGILASVVDTREPEVSLSSDS---------VVREYPNVFPDELPRLPPIRE---VDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKL
        +AS L   G +  + S + + EP  +  S+          + ++Y  +  ++LP  P       V   IE++P        PY +     +E+   +QKL
Subjt:  KASKLLNQGTWGILASVVDTREPEVSLSSDS---------VVREYPNVFPDELPRLPPIRE---VDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKL

Query:  LDMGFIRPSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSC-KERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMS
        LD  FI PS SP  +PV+ V KKDG  RLC+DYR LNK T+ + +PLP +   L+     ++F+ +DL  GYHQ+ +   D  KTAF +  G YE+ VM 
Subjt:  LDMGFIRPSVSPWGAPVLFVKKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSC-KERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMS

Query:  FGLTNAPAVFMDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPR
        FGL NAP+ F   M   F+D    F  V++DDIL++S++  EH  HL  VL  L+   L  K  KC+F  ++  FLG+ +  + ++    K   +  +P 
Subjt:  FGLTNAPAVFMDLMNRVFKDFLDMFFIVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPR

Query:  PSTVSEIRSLL
        P TV + +  L
Subjt:  PSTVSEIRSLL

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein2.1e-0939.13Show/hide
Query:  DHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLG--HVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL
        +HL  VL     ++ YA   KC F Q ++++LG  H++S EGVS  P K+E +  WP P   +E+R  L
Subjt:  DHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLG--HVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAGAAGTTGAACCGTTGGGTAGTATATTGTCGGTTTCTACCCCATCGGGGGCGGTTATGTTGTCTAAAGATAAAATAAAGGCATGTCAGGTAGGGGTA
GCGAACCATATGTTAGATGTGACCCTACTAGTGTTAGACATGCAGGATTTCGATGTAATACTAGGCATGGAATGGATGTTTGCTAACCATGCAAGCATAGACTGT
TTCCGTAAGGAAGTCGTTTTTAACCCTCCATCTAGGACTAGTTTCAAATTTAAGAGGGCAGGAATCGTATGTGTACCCAAGGTCATTTCAGCTTTGAAGGCCAGT
AAGCTGCTGAACCAGGGCACTTGGGGTATCTTGGCCAGCGTGGTGGACACTAGAGAGCCGGAGGTTTCCCTTTCGTCTGATTCGGTGGTGAGGGAGTACCCCAAC
GTTTTTCCTGATGAGCTTCCAAGACTCCCACCCATCAGGGAGGTTGACTTTGCCATTGAGCTAGAGCCAGACATTTTTCCTATATCTAGAGCTCCGTATAGAATG
GCCCCAGCTGAGCTGAAGGAACTAAAGATCCAGTTACAGAAGTTACTGGACATGGGCTTCATACGACCTAGCGTGTCACCTTGGGGAGCGCCAGTATTGTTTGTA
AAGAAGAAAGATGGATTGATGCGTCTTTGCATTGATTACAGAGAGTTAAATAAGGTGACAGTAAAAAACCGTTATCCTTTGCCAGGATTGACGACTTGTTTGACC
AGCTGCAAGGAGCGTGTCTTCTCTAAGATCGATTTACGTTTGGGATACCACCAGCTGAGGATCAGAGACAGTGATACTCCTAAGACTGCGTTCCGTTCCAGATAC
GGGCATTACGAGTTCATTGTGATGTCCTTTGGTTTGACTAATGCTCCCGCAGTATTTATGGACTTGATGAACAGAGTGTTTAAGGACTTCTTAGACATGTTTTTT
ATAGTCTTCATTGATGATATCTTGGTTTACTCCAAGACAGAGGTCGAGCATGAGGATCACTTGCATCAAGTTTTGGGGACTCTTCGAGCTAATAAGCTGTATGCC
AAGTTTTCCAAGTGTGAGTTCTGGCAGAAGAAGGTATCTTTTCTTGGACATGTGGTGTCCAGTGAGGGAGTTTCTGTAGGCCCAGTAAAGATTGAAGTTGTTACT
AGTTGGCCTCGACCGTCTACAGTCAGTGAGATCCGTAGCCTTCTGGTATAG
mRNA sequenceShow/hide mRNA sequence
TTTCTTTTCCCTCTTTTCTCTAAAATAATATTTTCTTTATTTTTTTCCCAAATTTCTTTCCAACCTTTCTCCATACCCACGATTTCTTTCCTTTCTTCTTCCCTT
CTCCACAAAATTTCCAAATCCTATTCCCTCAACTGCGTGCCCTAGTGACCACTGCCGCACATCGGTGACTGTAGCTCCCTCCACCACCGTAAGTTCGTGACCCGC
CTCTGCCGACGTGACTCGGTCCAAGTTGACATCTGCTCTCCCATTTTACTCTAAAAGTAAGACAAAGTATCCCACAATTTTTGCTCATCTTGACGATTCTAAGAA
CTCATCTAATCAGCAACTCAATCTCTCTATCCTCTGGTTCACATTCGGTATCTCTATCTTTCTCTTGAATGTGTGCTTTCTGTAGGAATTTGAAGATTTGGGAAC
AAGTAATTTTTTCTGTGTAACATTTCGTTGTAATGTGGTGTTTAGGGATTTCTGGGTTATTGGTGGTTTGCAAGTTCTTTACTATCTTGATGAAGCATTGAAGTG
GTGGTTGTTTTATTTGCCTATGATGGTGGATATATTATTCTGTTGTCTTTGCAATATTTACTTGACTATGATGGTGGATGTATTATTCTGCTAAGTAAAGTTTTG
ACAATTAATGTTCAAGAGGCATTTGAAGAAGGGGAAAAACTATACGGGGAGAAATTTTATCTATTTGGGTCTACATAACATAAGCAATTTAAAATTGGAGCCTAC
CTTTTGTTTTTCATCTCTCAGAGAATCTGAACAATTATATGCATATATTCTTTTATGGTTTTCTGTTATTGCAGCATAATTTGTTCCCTATTAAGGAACTCTATT
TCTAATTTTCTTGGTATTTTTTCTATGGCTAATGCTATGTTTACCTTAAAATTTTTCATATTTACTTTGATAGTTCATGATCTTTTTGTTCTTCGTGATTTTCAT
ATTTAATAGGAGCCTACTGTCTGACCCCCAAATTTTGATTCTCCTCTTTGACACCTGCCAATATGTAGATCTGTGCTCCAAGATACTCATTCTTTTCTATGCTAG
AAAATTATTGTTGTGCACTTCTTAATATCTTGTTGACTTGAGCTTCAGTTCTAGATTGTTGACGCTAATTTTTGGGTTTGTGATTATCTGATTTGTAATTTTATT
TGGAAATTCCACTGTTTACTCAATGATTCAATTCTATGAAGGATGAATTTCATTACAGTCGTTTGAAATTTGATATTTAATTAACTGGATTGCAACATCAGGCAT
GTAGATGTAGATGTATAATAATATCAGATTATGTGCCAATGCCATGTCCTTGTTGTTGAAGTATGTTGCCTTCTTTGTTTTTAGTTTATATAGTCTTTCATCACT
AGTAGCACAAATAGAATCTGGAATGTTTGTTCTTGTCACTGCTGCTGCTGGAGGGATTGGGCAGTTTGTTGTCTAGGTTTAATAATTATTGCCTATCTTTCTTGT
AGCAACTTCTTGTCTTACATATAATGTAATTTTCCTTCAAATGATGTGAAGAAATATCTTAAATGTGCGTAAATTGGAACTCTTTTGTTTCTACGAGAAAATTAC
TTCTAACTCTCCTCTATGTTCGGTCTGTTCGAACCATTTCAGTCGTTAGATAAATGCCATTAAAATTATGGTGTAAGGTTCAGAAGATGAAATGCATTATTTAGT
ACAAGAAATGTCATACATTCCAAGAAGCCTTTCAAATCTAGTCATTTAGTTCGGGAGGGGCCTATTTATAGTTTTTTATTTTCCAAGTCCTAAGGATGTTCTTCT
CGATCGTGCATCACACACAAGCAGGTAAATGACGGGACGGACATTCTCGCACCACTTCTCGTAAGCGAATCGTTCTCTACCATAGAGGCAGAGGCAGAGGTAAGA
ACAAGCGCAAGCTGGCGAGTGACCAGAAGTGATCGTGGCGAGCCACATGGATACGCTTGCTTTCGCTTATGTCAAAATTCGGATTTTAGTATTGCATTTTTAGTT
CATTCTTTATTGGTTAATTTATTTCTTTAAAACTATATAGGGCCCGAGTTAGGACTTTATTTTATACTTATTTCAGATTTTCCTTGTTTATTTTTTTATTTGAAA
ATTTGAGGTTTTTGTATTTTTCTATTTTTTTAAAACTTTACAAACTTATTTACCCTTTTTAATTAGTAATGGCTTCGACTTAGTATAAGGAGTTGGGTCGTTACA
GTTGGTATCAGAGCCTATGTCCTAGGTTCGGCAGACTGACTTACGATGTGAGTCTTTGTTTTGGTTTGTTTTACCCCTATGGCTATACGGTCCTTCGTCACTCGC
CAGGTATGTTTTAAGACTTTTCTAAATGTTATGGTTACAATTTTGTCTGAATAAAATGAGTGTAGTTAGTATAAGTGTTGTGGTTGTTTTGTGAAAAGTTCTTCT
AGTGAAACTTCAGGAAAAGATGTCGTCGCGTAGAGGTACACGTCGAGAGGGTGGTAGAGGAGGCAAAGGAGCTGGACGTATCCAGCTGGAGGAGCAACCTGCTGT
GCAGGCAGCCAACCCTGAAGCACCTGTCACCTAGGCGGATCTCGCCGCGATGGAGCAGCGATATCAGGACATGCTACAGGCTGCTTTAGCACCTTTTCTCGCTGC
CCATCAGACCCAGGCCACCCCTGTTTAGGCCCAGACCGTCCCTTCTCCAGCTCCTATGAGAGCTCAGCCCCGCCAGTTCAACTATCGACCGAGGCCAAACACCTG
AGAGACTTTAGAAAATATAACCCTAAAACTTTTTATGGGTCCATGGACAACCCTACCAAGGCCCAAATGTGGTTGACGTCCATAGAGACTATCTTTCGGTACATG
AAGTGCCCAGAGGACCAGAAGGTGCAGTGTTCAATTTTCTCCCTAGAGGATAGAGGCACCGCTTGGTGGGAGACTGCTGAGAGGATGTTGGGGGCGACGTCAGTA
AAATAACCTGGGAGTAGTTCAAGGAGAGCTTCTATGCTAAGTTCTTCTCTACCAACGTGAAGCACGCTAAGCAGCAAAAGTTCCTAAGCTTGGAGCAAGGTGACA
TGACCGTGGAGTAGTATGACGCCGAGTTTGACATGTTATCCTATTTTGCTCCCGATGTGGTAAGGGATGAGGCTTCCAGGATTGAGAAGTTCGTTAGAGGTCTCA
GACTAGACCTCCAGGGCATCGTTCGAGCCCTCCGGCCAGCCACTCATGCTGATGCACTACGCATAGCACTGGACTTGAGTCTGCACGAGAGAGCTGATCCGTCCA
GGGCTGCAGGTAAAACATTGAGGTCAGGAGGTGTCTTCCAGCGGCATCGTCAGGAGCTTGCTGCTGCAGGAAGAACATTGAGAGAGCTACCTGCTTGTGGGAGGT
GTGGAAGAGTTCATGGAGGTCGTTGCTTGATGGGGAGCGGAGTTTGCTTTAAGTGCAAGCAACCAGGGAATACCACTGACGTTTGTCCTCAGAAACTCATTGGGA
CTACCCTGCACTAGCCTCCCGCTTCACAGCAGGGAAGAGTTTTTGCCACGACTCGTCAGGAGGCCAAGCGAGCTGGTACTGTAGTGACAGGTACACTCCCAATCT
TGAGGCATTATGCACTTGTACTGTTTGACTCTGGGTCATCCCATTCTTTTATATCTTCAGTTTTTATTTGACAAATGGGTTTAGAAGTTGAACCGTTGGGTAGTA
TATTGTCGGTTTCTACCCCATCGGGGGCGGTTATGTTGTCTAAAGATAAAATAAAGGCATGTCAGGTAGGGGTAGCGAACCATATGTTAGATGTGACCCTACTAG
TGTTAGACATGCAGGATTTCGATGTAATACTAGGCATGGAATGGATGTTTGCTAACCATGCAAGCATAGACTGTTTCCGTAAGGAAGTCGTTTTTAACCCTCCAT
CTAGGACTAGTTTCAAATTTAAGAGGGCAGGAATCGTATGTGTACCCAAGGTCATTTCAGCTTTGAAGGCCAGTAAGCTGCTGAACCAGGGCACTTGGGGTATCT
TGGCCAGCGTGGTGGACACTAGAGAGCCGGAGGTTTCCCTTTCGTCTGATTCGGTGGTGAGGGAGTACCCCAACGTTTTTCCTGATGAGCTTCCAAGACTCCCAC
CCATCAGGGAGGTTGACTTTGCCATTGAGCTAGAGCCAGACATTTTTCCTATATCTAGAGCTCCGTATAGAATGGCCCCAGCTGAGCTGAAGGAACTAAAGATCC
AGTTACAGAAGTTACTGGACATGGGCTTCATACGACCTAGCGTGTCACCTTGGGGAGCGCCAGTATTGTTTGTAAAGAAGAAAGATGGATTGATGCGTCTTTGCA
TTGATTACAGAGAGTTAAATAAGGTGACAGTAAAAAACCGTTATCCTTTGCCAGGATTGACGACTTGTTTGACCAGCTGCAAGGAGCGTGTCTTCTCTAAGATCG
ATTTACGTTTGGGATACCACCAGCTGAGGATCAGAGACAGTGATACTCCTAAGACTGCGTTCCGTTCCAGATACGGGCATTACGAGTTCATTGTGATGTCCTTTG
GTTTGACTAATGCTCCCGCAGTATTTATGGACTTGATGAACAGAGTGTTTAAGGACTTCTTAGACATGTTTTTTATAGTCTTCATTGATGATATCTTGGTTTACT
CCAAGACAGAGGTCGAGCATGAGGATCACTTGCATCAAGTTTTGGGGACTCTTCGAGCTAATAAGCTGTATGCCAAGTTTTCCAAGTGTGAGTTCTGGCAGAAGA
AGGTATCTTTTCTTGGACATGTGGTGTCCAGTGAGGGAGTTTCTGTAGGCCCAGTAAAGATTGAAGTTGTTACTAGTTGGCCTCGACCGTCTACAGTCAGTGAGA
TCCGTAGCCTTCTGGTATAGCAGGTTATTATAAGAGGTTCGTGGAAGACTTTTCTCGTATAGCTAGCCTCTTGACTCAATTGACCAGGAAGGGGACTCCATTTGT
TTGGAACCCAGCTTGTGAGTCTAGTTTCCAGGAGCTCAAGTAGAAGCTTGTGTCTGCACCAGTCCTGACAGTACCACATGGATCTGGAAGTTTCGTGATCTACAG
TAATGCCTTAAAGAAAGGATTGGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTGCTTATGCCTCCCGTCAATTGAAGAGTCATGAGCAGAACTACCCTACCCA
TGACCTAGAGTTGGCAGCAGTGGTTTTTGCACTGAAGATATGGAGGCACTACCTGTACGGTGAGAAGATACAGATTTTCACTGACCAGAAGAGCCTAAAGGACTT
CTTCACCCGAAAGGAGTTGAACATGAGGTAGAGGAGGTGGCTTGAGTTAGTGAAAGACTATGATTGCGAGATTTTGTATCACCTAGGTAAGGCAAACATAGTAGC
TGACGCGTTGAGTAGGAAGGTTGTGCATTCAGCAGCGCTTATCACCAAGCAAGCTCCCTTACTAAGAGATTTCGAGAAAGCTGAGATTGCAGTCTCAGTAGGGGA
AGTGACTTCACAGTTGGCTCAGTTGACCGTGCAGTCAATCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTATTTGGTCGAGAAGCATTTATTAGT
AGAGGCAGGGCAAAGTGAGGGTTTCTCCGTATCCTCTGATGACGGACTTACTTTCGATGGACGTTTGTGGGTTTCAAAAGACAGTGCAGTCAAGATAGAGCTTTT
GACTGAGACTCACGGCTCCCCATTTACTATGCACCT
Protein sequenceShow/hide protein sequence
MGLEVEPLGSILSVSTPSGAVMLSKDKIKACQVGVANHMLDVTLLVLDMQDFDVILGMEWMFANHASIDCFRKEVVFNPPSRTSFKFKRAGIVCVPKVISALKAS
KLLNQGTWGILASVVDTREPEVSLSSDSVVREYPNVFPDELPRLPPIREVDFAIELEPDIFPISRAPYRMAPAELKELKIQLQKLLDMGFIRPSVSPWGAPVLFV
KKKDGLMRLCIDYRELNKVTVKNRYPLPGLTTCLTSCKERVFSKIDLRLGYHQLRIRDSDTPKTAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFKDFLDMFF
IVFIDDILVYSKTEVEHEDHLHQVLGTLRANKLYAKFSKCEFWQKKVSFLGHVVSSEGVSVGPVKIEVVTSWPRPSTVSEIRSLLV