CuGenDBv2

Cmc03g0074811 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc03g0074811
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr03:22245614..22246854
RNA-Seq Expression: Cmc03g0074811
Synteny: Cmc03g0074811
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
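
The Genome location field above packs a sequence ID and a coordinate range into one string. A minimal Python sketch of splitting it and computing the span length follows. The function name `parse_location` is hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start..end' string into (seqid, start, end, length)."""
    m = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    start, end = int(m["start"]), int(m["end"])
    # Assumption: 1-based, inclusive coordinates, so the length is end - start + 1.
    return m["seqid"], start, end, end - start + 1

seqid, start, end, length = parse_location("CMiso1.1chr03:22245614..22246854")
print(seqid, start, end, length)  # CMiso1.1chr03 22245614 22246854 1241
```

If the coordinates turn out to be 0-based or half-open, only the length expression needs to change.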


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026271.1 pol protein [Cucumis melo var. makuwa] (e value: 5.2e-196; % identity: 84.78)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQ+LLDKGFIRSSVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLR+VL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
        VSVDP KIEAVT W+RPS VSE+   L                      + ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK L
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA
        GCVLMQQG VVAYASRQLKSHEQN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADA
Subjt:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA

Query:  LSRKVSHLVAHITR
        LSRKVSH  A ITR
Subjt:  LSRKVSHLVAHITR

KAA0035602.1 pol protein [Cucumis melo var. makuwa] (e value: 8.9e-196; % identity: 88.52)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIR SVSPWGAPVLFVKKKDG MHLCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS Y QLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTE EHEEHLR+VL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE
        VSVDP KIEAVT W+RPS VSEL      ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK LGCVLMQQG VVAYASRQLKSHE
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE

Query:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR
        QN+PT DLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADALSRKVSH  A ITR
Subjt:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR

KAA0037244.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 4.0e-196; % identity: 85.02)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIR SVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVL+T R NKLYAKF KCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
        VSVDP KIEAVT W+RPS VSE+   L                      + ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK L
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA
        GCVLMQQG VVAYASRQLKSHEQN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADA
Subjt:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA

Query:  LSRKVSHLVAHITR
        LSRKVSH  A ITR
Subjt:  LSRKVSHLVAHITR

KAA0047209.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 2.4e-196; % identity: 88.78)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIRSSVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+ EHE+HLR+VL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE
        VSVDP KIE VT W+RPS VSEL      ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK LGCVLMQQG VVAYASRQLKSHE
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE

Query:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR
        QN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADALSRKVSH  A ITR
Subjt:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR

KAA0048687.1 pol protein [Cucumis melo var. makuwa] (e value: 2.4e-196; % identity: 85.02)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIR SVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLRMVL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
        VSVDP KIEAVT W+RPS VSE+   L                      + ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK L
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA
        GCVLMQQG VVAYASRQLKSHEQN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADA
Subjt:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA

Query:  LSRKVSHLVAHITR
        LSRKVSH  A ITR
Subjt:  LSRKVSHLVAHITR
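
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. A minimal Python sketch of tallying one midline segment follows. The helper name `midline_stats` is hypothetical, and the code assumes the midline has kept its trailing spaces, so that its length equals the number of alignment columns.

```python
def midline_stats(midline: str):
    """Return (identities, positives, columns) for a BLAST-style midline."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    positives = identities + midline.count("+")       # '+' marks conservative substitutions
    return identities, positives, len(midline)

# Last segment of the first GenBank hit (KAA0026271.1).
midline = "LSRKVSH  A ITR"
ident, pos, cols = midline_stats(midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.2f}%)")  # 11/14 identical (78.57%)
```

The figure covers this one segment only. The % identity reported for each hit is computed over the full alignment, so the two numbers are not expected to match.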

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SPZ2 Pol protein (e value: 2.5e-196; % identity: 84.78)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQ+LLDKGFIRSSVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLR+VL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
        VSVDP KIEAVT W+RPS VSE+   L                      + ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK L
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA
        GCVLMQQG VVAYASRQLKSHEQN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADA
Subjt:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA

Query:  LSRKVSHLVAHITR
        LSRKVSH  A ITR
Subjt:  LSRKVSHLVAHITR

A0A5A7SWF6 Reverse transcriptase (e value: 4.3e-196; % identity: 88.52)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIR SVSPWGAPVLFVKKKDG MHLCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS Y QLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
          FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDD+LIYSKTE EHEEHLR+VL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE
        VSVDP KIEAVT W+RPS VSEL      ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK LGCVLMQQG VVAYASRQLKSHE
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE

Query:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR
        QN+PT DLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADALSRKVSH  A ITR
Subjt:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR

A0A5A7T190 Reverse transcriptase (e value: 1.9e-196; % identity: 85.02)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIR SVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVL+T R NKLYAKF KCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
        VSVDP KIEAVT W+RPS VSE+   L                      + ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK L
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA
        GCVLMQQG VVAYASRQLKSHEQN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADA
Subjt:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA

Query:  LSRKVSHLVAHITR
        LSRKVSH  A ITR
Subjt:  LSRKVSHLVAHITR

A0A5A7U149 Reverse transcriptase (e value: 1.1e-196; % identity: 88.78)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIRSSVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAF SRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKT+ EHE+HLR+VL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE
        VSVDP KIE VT W+RPS VSEL      ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK LGCVLMQQG VVAYASRQLKSHE
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHE

Query:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR
        QN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADALSRKVSH  A ITR
Subjt:  QNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR

A0A5A7U330 Reverse transcriptase (e value: 1.1e-196; % identity: 85.02)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        MAPAELKELK+QLQELLDKGFIR SVSPWGAPVLFVKKKDG M LCIDYRELNKVTVKN+YPLPRI+DLFDQLQGATVFSKIDLRS YHQLRIKD DVPK
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
        TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTE EHEEHLRMVL+T R NKLYAKFSKCEFWLKQVSFLGHVVSKAG
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
        VSVDP KIEAVT W+RPS VSE+   L                      + ++G  FVW KACE+SFQNLKQKLVTAPVLTVPDGSGSFVIYSD SKK L
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA
        GCVLMQQG VVAYASRQLKSHEQN+PTHDLEL  VVF LKIWRHYLYGEKIQIFTDHKSLKYFFTQKEL MRQRRWLELVKDYDCEILYHP KANVVADA
Subjt:  GCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADA

Query:  LSRKVSHLVAHITR
        LSRKVSH  A ITR
Subjt:  LSRKVSHLVAHITR

SwissProt top hits (e value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 (e value: 5.3e-66; % identity: 33.25)
Query:  KELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGL-----MHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKT
        +E++ Q+Q++L++G IR+S SP+ +P+  V KK          + IDYR+LN++TV +++P+P ++++  +L     F+ IDL   +HQ+ +    V KT
Subjt:  KELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGL-----MHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKT

Query:  AFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGV
        AF +++GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+++S +  EH + L +V E      L  +  KCEF  ++ +FLGHV++  G+
Subjt:  AFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGV

Query:  SVDPTKIEAVTSWSRPSPVSELLLLL-----------------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSD
          +P KIEA+  +  P+   E+   L                             +D     +      + +F+ LK  +   P+L VPD +  F + +D
Subjt:  SVDPTKIEAVTSWSRPSPVSELLLLL-----------------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSD

Query:  TSKKDLGCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKA
         S   LG VL Q G+ ++Y SR L  HE N+ T + EL  +V+  K +RHYL G   +I +DH+ L + +  K+   +  RW   + ++D +I Y   K 
Subjt:  TSKKDLGCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKA

Query:  NVVADALSR
        N VADALSR
Subjt:  NVVADALSR

P0CT34 Transposon Tf2-1 polyprotein (e value: 7.9e-62; % identity: 33.33)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G + + +DY+ LNK    N YPLP I  L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E EH +H++ VL+  +   L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
         +     I+ V  W +P    EL   L                      + ++   + W     ++ +N+KQ LV+ PVL   D S   ++ +D S   +
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGN-----VVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYG--EKIQIFTDHKSLKYFFTQKELIMRQR--RWLELVKDYDCEILYHP
        G VL Q+ +      V Y S ++   + N+   D E+  ++  LK WRHYL    E  +I TDH++L    T +     +R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGN-----VVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYG--EKIQIFTDHKSLKYFFTQKELIMRQR--RWLELVKDYDCEILYHP

Query:  RKANVVADALSRKV
          AN +ADALSR V
Subjt:  RKANVVADALSRKV

P0CT41 Transposon Tf2-12 polyprotein (e value: 7.9e-62; % identity: 33.33)
Query:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK
        + P +++ +  ++ + L  G IR S +    PV+FV KK+G + + +DY+ LNK    N YPLP I  L  ++QG+T+F+K+DL+S YH +R++  D  K
Subjt:  MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPK

Query:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG
         AFR   G +E++VM +G++ APA F   +N +  E  ++ V+ ++DDILI+SK+E EH +H++ VL+  +   L    +KCEF   QV F+G+ +S+ G
Subjt:  TAFRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAG

Query:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL
         +     I+ V  W +P    EL   L                      + ++   + W     ++ +N+KQ LV+ PVL   D S   ++ +D S   +
Subjt:  VSVDPTKIEAVTSWSRPSPVSELLLLL----------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDL

Query:  GCVLMQQGN-----VVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYG--EKIQIFTDHKSLKYFFTQKELIMRQR--RWLELVKDYDCEILYHP
        G VL Q+ +      V Y S ++   + N+   D E+  ++  LK WRHYL    E  +I TDH++L    T +     +R  RW   ++D++ EI Y P
Subjt:  GCVLMQQGN-----VVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYG--EKIQIFTDHKSLKYFFTQKELIMRQR--RWLELVKDYDCEILYHP

Query:  RKANVVADALSRKV
          AN +ADALSR V
Subjt:  RKANVVADALSRKV

P20825 Retrovirus-related Pol polyprotein from transposon 297 (e value: 6.9e-66; % identity: 33.09)
Query:  ELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKD-----GLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKTA
        E++ Q+QE+L++G IR S SP+ +P   V KK          + IDYR+LN++T+ ++YP+P ++++  +L     F+ IDL   +HQ+ + +  + KTA
Subjt:  ELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKD-----GLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVS
        F ++ GHYE++ M FGL NAPA F   MN + R  L+   +V++DDI+I+S +  EH   +++V        L  +  KCEF  K+ +FLGH+V+  G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVS

Query:  VDPTKIEAVTSWSRPSPVSELLLLL-----------------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDT
         +P K++A+ S+  P+   E+   L                             +D +   ++      E+F+ LK  ++  P+L +PD    FV+ +D 
Subjt:  VDPTKIEAVTSWSRPSPVSELLLLL-----------------------------VDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDT

Query:  SKKDLGCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKAN
        S   LG VL Q G+ +++ SR L  HE N+   + EL  +V+  K +RHYL G +  I +DH+ L++    KE   +  RW   + +Y  +I Y   K N
Subjt:  SKKDLGCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKAN

Query:  VVADALSR
         VADALSR
Subjt:  VVADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus (e value: 7.1e-63; % identity: 33.57)
Query:  ELKMQLQELLDKGFIRSSVSPWGAPVLFVKKK-----DGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKTA
        E++ Q+ ELL  G IR S SP+ +P+  V KK     +    + +D++ LN VT+ + YP+P IN     L  A  F+ +DL S +HQ+ +K+SD+PKTA
Subjt:  ELKMQLQELLDKGFIRSSVSPWGAPVLFVKKK-----DGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKTA

Query:  FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVS
        F +  G YEF+ + FGL NAPA+F  +++ + RE +     V+IDDI+++S+    H ++LR+VL +     L     K  F   QV FLG++V+  G+ 
Subjt:  FRSRYGHYEFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVS

Query:  VDPTKIEAVTSWSRPSPVSELLLLL------------------------------VDQEGTSFVWIKACE---ESFQNLKQKLVTAPVLTVPDGSGSFVI
         DP K+ A++    P+ V EL   L                              +    +S V I   E   +SF +LK  L ++ +L  P  +  F +
Subjt:  VDPTKIEAVTSWSRPSPVSELLLLL------------------------------VDQEGTSFVWIKACE---ESFQNLKQKLVTAPVLTVPDGSGSFVI

Query:  YSDTSKKDLGCVLMQ----QGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCE
         +D S   +G VL Q    +   +AY SR L   E+N+ T + E+  +++ L   R YLYG   I+++TDH+ L +    +    + +RW   +++Y+CE
Subjt:  YSDTSKKDLGCVLMQ----QGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCE

Query:  ILYHPRKANVVADALSR
        ++Y P K+NVVADALSR
Subjt:  ILYHPRKANVVADALSR

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 1.1e-10; % identity: 33.08)
Query:  HLRMVLETHRANKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPTKIEAVTSWSRPSPVSELLLLL---------VDQEG------------TSFVWI
        HL MVL+    ++ YA   KC F   Q+++LG  H++S  GVS DP K+EA+  W  P   +EL   L         V   G             S  W 
Subjt:  HLRMVLETHRANKLYAKFSKCEFWLKQVSFLG--HVVSKAGVSVDPTKIEAVTSWSRPSPVSELLLLL---------VDQEG------------TSFVWI

Query:  KACEESFQNLKQKLVTAPVLTVPDGSGSFV
        +    +F+ LK  + T PVL +PD    FV
Subjt:  KACEESFQNLKQKLVTAPVLTVPDGSGSFV


Sequences
CDS sequence
ATGGCCCCAGCAGAGTTGAAAGAGCTGAAAATGCAGTTACAGGAGTTGCTTGATAAAGGCTTCATTCGGTCGAGTGTGTCACCTTGGGGTGCACCTGTTTTATTTGTTAA
AAAGAAGGATGGATTGATGCACTTATGTATTGACTACAGAGAGTTGAATAAGGTAACCGTTAAGAACAAATATCCCTTGCCCAGGATCAACGATCTGTTTGATCAATTAC
AGGGAGCTACAGTGTTCTCTAAGATCGACCTTCGGTCAAGATATCATCAGTTGAGGATTAAGGATAGTGATGTACCGAAGACAGCCTTTCGTTCCAGATATGGACACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGATTTGATGAACAGAGTGTTCAGGGAATTCCTAGACACTTTTGTGATCGTGTTCATTGA
CGATATTTTGATATATTCCAAGACAGAGGTAGAGCATGAGGAGCATTTACGCATGGTTCTAGAAACCCATCGAGCTAATAAATTGTATGCAAAGTTCTCAAAATGTGAGT
TTTGGTTGAAGCAGGTATCTTTTCTAGGCCATGTAGTTTCTAAAGCTGGTGTTTCTGTAGATCCAACTAAGATAGAGGCAGTCACTAGTTGGTCCCGACCTTCCCCAGTC
AGTGAGTTACTCCTCTTACTAGTTGACCAGGAAGGGACTTCTTTTGTTTGGATCAAGGCCTGTGAAGAAAGTTTTCAGAACCTTAAACAAAAACTCGTTACTGCACCGGT
TCTTACTGTACCTGATGGTTCTGGGAGTTTTGTGATTTACAGTGATACTTCTAAGAAAGATTTGGGTTGTGTTTTGATGCAGCAAGGTAACGTAGTCGCTTATGCTTCTC
GTCAGTTGAAGAGTCATGAGCAAAATTTCCCTACCCATGATTTAGAGTTGACAGTAGTAGTCTTTGAACTAAAGATTTGGAGGCATTACTTGTATGGTGAAAAGATACAA
ATCTTCACGGATCATAAAAGCTTGAAATACTTCTTCACTCAGAAGGAGTTGATTATGAGACAACGAAGGTGGCTTGAATTAGTAAAGGATTATGATTGCGAGATATTGTA
TCATCCAAGAAAGGCGAATGTGGTGGCTGATGCTCTTAGTAGAAAGGTATCACATTTAGTAGCACACATTACTCGATAG
mRNA sequence
ATGGCCCCAGCAGAGTTGAAAGAGCTGAAAATGCAGTTACAGGAGTTGCTTGATAAAGGCTTCATTCGGTCGAGTGTGTCACCTTGGGGTGCACCTGTTTTATTTGTTAA
AAAGAAGGATGGATTGATGCACTTATGTATTGACTACAGAGAGTTGAATAAGGTAACCGTTAAGAACAAATATCCCTTGCCCAGGATCAACGATCTGTTTGATCAATTAC
AGGGAGCTACAGTGTTCTCTAAGATCGACCTTCGGTCAAGATATCATCAGTTGAGGATTAAGGATAGTGATGTACCGAAGACAGCCTTTCGTTCCAGATATGGACACTAT
GAGTTTATTGTGATGTCTTTTGGTTTGACGAATGCTCCAGCAGTGTTTATGGATTTGATGAACAGAGTGTTCAGGGAATTCCTAGACACTTTTGTGATCGTGTTCATTGA
CGATATTTTGATATATTCCAAGACAGAGGTAGAGCATGAGGAGCATTTACGCATGGTTCTAGAAACCCATCGAGCTAATAAATTGTATGCAAAGTTCTCAAAATGTGAGT
TTTGGTTGAAGCAGGTATCTTTTCTAGGCCATGTAGTTTCTAAAGCTGGTGTTTCTGTAGATCCAACTAAGATAGAGGCAGTCACTAGTTGGTCCCGACCTTCCCCAGTC
AGTGAGTTACTCCTCTTACTAGTTGACCAGGAAGGGACTTCTTTTGTTTGGATCAAGGCCTGTGAAGAAAGTTTTCAGAACCTTAAACAAAAACTCGTTACTGCACCGGT
TCTTACTGTACCTGATGGTTCTGGGAGTTTTGTGATTTACAGTGATACTTCTAAGAAAGATTTGGGTTGTGTTTTGATGCAGCAAGGTAACGTAGTCGCTTATGCTTCTC
GTCAGTTGAAGAGTCATGAGCAAAATTTCCCTACCCATGATTTAGAGTTGACAGTAGTAGTCTTTGAACTAAAGATTTGGAGGCATTACTTGTATGGTGAAAAGATACAA
ATCTTCACGGATCATAAAAGCTTGAAATACTTCTTCACTCAGAAGGAGTTGATTATGAGACAACGAAGGTGGCTTGAATTAGTAAAGGATTATGATTGCGAGATATTGTA
TCATCCAAGAAAGGCGAATGTGGTGGCTGATGCTCTTAGTAGAAAGGTATCACATTTAGTAGCACACATTACTCGATAG
Protein sequence
MAPAELKELKMQLQELLDKGFIRSSVSPWGAPVLFVKKKDGLMHLCIDYRELNKVTVKNKYPLPRINDLFDQLQGATVFSKIDLRSRYHQLRIKDSDVPKTAFRSRYGHY
EFIVMSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEVEHEEHLRMVLETHRANKLYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPTKIEAVTSWSRPSPV
SELLLLLVDQEGTSFVWIKACEESFQNLKQKLVTAPVLTVPDGSGSFVIYSDTSKKDLGCVLMQQGNVVAYASRQLKSHEQNFPTHDLELTVVVFELKIWRHYLYGEKIQ
IFTDHKSLKYFFTQKELIMRQRRWLELVKDYDCEILYHPRKANVVADALSRKVSHLVAHITR
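
The protein sequence above should be the conceptual translation of the CDS. A minimal Python sketch for spot-checking that follows, assuming the standard genetic code (NCBI translation table 1). The page does not say which table was used, and the `translate` helper is written here for illustration rather than taken from any CuGenDB tool. The two test fragments are the first 30 and last 21 nucleotides of the CDS as listed.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGCCCCAGCAGAGTTGAAAGAGCTGAAA"))  # MAPAELKELK
print(translate("GTAGCACACATTACTCGATAG"))           # VAHITR
```

Both fragments agree with the start (MAPAELKELK) and end (VAHITR) of the listed protein. Passing the full CDS, with line breaks removed, to the same function would allow a complete comparison.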