; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0222881 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0222881
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr08:11947196..11948565
RNA-Seq ExpressionCmc08g0222881
SyntenyCmc08g0222881
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040547.1 pol protein [Cucumis melo var. makuwa]1.5e-21585.75Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEI+SFLGLAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELK KLVTAPVLTV DGSGNFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK  EQNYPTHDLEL AVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD LSRKVAHSA LITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVTSQLAQLSVQPTLRQ+II AQLNDPYLVEKRR+VETGQGEDFSISSDDGL FEGRLCVPED AV+TELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+FVSRCLV QQVKAPRQ P GLLQPLSVSGWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

KAA0045309.1 pol protein [Cucumis melo var. makuwa]1.5e-21585.53Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        +SFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN+LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEIRSFLGLAGYYRRFVED S IASPLTQLTRKGTPFVWSPACE SFQELK KLVTAPVLTV DGSGNFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK  EQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD+LSRKVAHSAALITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVT+QLAQLSVQPTLRQ+II AQLNDPYL EKRR+VETGQGEDFSISSDDGL FEGRLCVPED AVKTELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+FVSRCLV QQVKAPRQ PAGLLQPLSV GWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

KAA0045429.1 pol protein [Cucumis melo var. makuwa]6.5e-21685.53Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN+LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEIRSFLG AGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACE SFQELK KLVTAPVLTV DGSGNFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK  EQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD LSRKVAHSAALITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVT+QLAQLSVQPTLRQ+II AQLNDPYL EKRR+VET QGEDFSISSDDGL FEGRLCVPED AVKTELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+ VSRCLV QQVKAPRQ PAGLLQPLSV GWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

KAA0063098.1 pol protein [Cucumis melo var. makuwa]5.0e-21685.75Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN+LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEIRSFLGLAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACE SFQELK KLVTAPVLTV DGSGNFVIYSDASKK LGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK+ EQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD LSRKVAHSAALITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVT+QLAQLSVQPTLRQ+II AQLNDPYL EKRR+VETGQGEDFSISSDDGL FEGRLCVPED AVKTELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+FVSRCLV QQVKAPRQ PAGLLQPLSV GWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

TYK27796.1 pol protein [Cucumis melo var. makuwa]1.4e-23998.61Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSNKDRSGYQFARSSTVSEIRSFLGLAGYYRRFVED
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRL      KDRSGYQFARSSTVSEIRSFLGLAGYYRRFVED
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSNKDRSGYQFARSSTVSEIRSFLGLAGYYRRFVED

Query:  FSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFAL
        FSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFAL
Subjt:  FSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFAL

Query:  KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLA
        KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLA
Subjt:  KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLA

Query:  QLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAE
        QLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAE
Subjt:  QLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAE

Query:  FVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
        FVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
Subjt:  FVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

TrEMBL top hitse value%identityAlignment
A0A5A7TG62 Reverse transcriptase7.1e-21685.75Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEI+SFLGLAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACESSFQELK KLVTAPVLTV DGSGNFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK  EQNYPTHDLEL AVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD LSRKVAHSA LITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVTSQLAQLSVQPTLRQ+II AQLNDPYLVEKRR+VETGQGEDFSISSDDGL FEGRLCVPED AV+TELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+FVSRCLV QQVKAPRQ P GLLQPLSVSGWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

A0A5A7TVN9 Reverse transcriptase7.1e-21685.53Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        +SFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN+LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEIRSFLGLAGYYRRFVED S IASPLTQLTRKGTPFVWSPACE SFQELK KLVTAPVLTV DGSGNFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK  EQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD+LSRKVAHSAALITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVT+QLAQLSVQPTLRQ+II AQLNDPYL EKRR+VETGQGEDFSISSDDGL FEGRLCVPED AVKTELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+FVSRCLV QQVKAPRQ PAGLLQPLSV GWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

A0A5A7TW21 Reverse transcriptase3.2e-21685.53Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN+LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEIRSFLG AGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACE SFQELK KLVTAPVLTV DGSGNFVIYSDASKKGLGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK  EQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD LSRKVAHSAALITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVT+QLAQLSVQPTLRQ+II AQLNDPYL EKRR+VET QGEDFSISSDDGL FEGRLCVPED AVKTELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+ VSRCLV QQVKAPRQ PAGLLQPLSV GWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

A0A5A7V646 Reverse transcriptase2.4e-21685.75Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFI DILIYSKTEAEHEEHLHQVLETLRAN+LYAKFS                          K  +   +
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSN-------------------------KDRSGYQF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV
         R STVSEIRSFLGLAGYYRRFVEDFS IASPLTQLTRKGTPFVWSPACE SFQELK KLVTAPVLTV DGSGNFVIYSDASKK LGCVLMQQGKVVAY 
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYV

Query:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK
        SRQLK+ EQNYPTHDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVAD LSRKVAHSAALITK
Subjt:  SRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITK

Query:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH
        Q PLL+DFERAEI VSVGEVT+QLAQLSVQPTLRQ+II AQLNDPYL EKRR+VETGQGEDFSISSDDGL FEGRLCVPED AVKTELLTEAHSSPFTMH
Subjt:  QAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMH

Query:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
         GSTKMY+DLRS+YWWR MKREVA+FVSRCLV QQVKAPRQ PAGLLQPLSV GWK
Subjt:  LGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

A0A5D3DVL8 Pol protein7.0e-24098.61Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSNKDRSGYQFARSSTVSEIRSFLGLAGYYRRFVED
        MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRL      KDRSGYQFARSSTVSEIRSFLGLAGYYRRFVED
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSNKDRSGYQFARSSTVSEIRSFLGLAGYYRRFVED

Query:  FSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFAL
        FSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFAL
Subjt:  FSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFAL

Query:  KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLA
        KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLA
Subjt:  KIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLA

Query:  QLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAE
        QLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAE
Subjt:  QLSVQPTLRQRIIVAQLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAE

Query:  FVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
        FVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK
Subjt:  FVSRCLVYQQVKAPRQRPAGLLQPLSVSGWK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein3.7e-4428.21Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF
        M +G++ APA F   +N +  +  +S V+ ++ DILI+SK+E+EH +H+  VL+ L+      N+   +F  S     GY                  Q+
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----
         +     E+R FLG   Y R+F+   S +  PL  L +K   + W+P    + + +K  LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +AD LSR  
Subjt:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV

Query:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK
             ++ +  P+ KD E   I        + + Q+S+    + +++    ND  L+     E +R+ E  Q +D   I+S D      ++ +P D  + 
Subjt:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK

Query:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS
          ++ + H     +H G   +   +   + W+ +++++ E+V  C   Q  K+   +P G LQP+  S
Subjt:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS

P0CT35 Transposon Tf2-2 polyprotein3.7e-4428.21Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF
        M +G++ APA F   +N +  +  +S V+ ++ DILI+SK+E+EH +H+  VL+ L+      N+   +F  S     GY                  Q+
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----
         +     E+R FLG   Y R+F+   S +  PL  L +K   + W+P    + + +K  LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +AD LSR  
Subjt:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV

Query:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK
             ++ +  P+ KD E   I        + + Q+S+    + +++    ND  L+     E +R+ E  Q +D   I+S D      ++ +P D  + 
Subjt:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK

Query:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS
          ++ + H     +H G   +   +   + W+ +++++ E+V  C   Q  K+   +P G LQP+  S
Subjt:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS

P0CT36 Transposon Tf2-3 polyprotein3.7e-4428.21Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF
        M +G++ APA F   +N +  +  +S V+ ++ DILI+SK+E+EH +H+  VL+ L+      N+   +F  S     GY                  Q+
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----
         +     E+R FLG   Y R+F+   S +  PL  L +K   + W+P    + + +K  LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +AD LSR  
Subjt:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV

Query:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK
             ++ +  P+ KD E   I        + + Q+S+    + +++    ND  L+     E +R+ E  Q +D   I+S D      ++ +P D  + 
Subjt:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK

Query:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS
          ++ + H     +H G   +   +   + W+ +++++ E+V  C   Q  K+   +P G LQP+  S
Subjt:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS

P0CT37 Transposon Tf2-4 polyprotein3.7e-4428.21Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF
        M +G++ APA F   +N +  +  +S V+ ++ DILI+SK+E+EH +H+  VL+ L+      N+   +F  S     GY                  Q+
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----
         +     E+R FLG   Y R+F+   S +  PL  L +K   + W+P    + + +K  LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +AD LSR  
Subjt:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV

Query:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK
             ++ +  P+ KD E   I        + + Q+S+    + +++    ND  L+     E +R+ E  Q +D   I+S D      ++ +P D  + 
Subjt:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK

Query:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS
          ++ + H     +H G   +   +   + W+ +++++ E+V  C   Q  K+   +P G LQP+  S
Subjt:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS

P0CT41 Transposon Tf2-12 polyprotein3.7e-4428.21Show/hide
Query:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF
        M +G++ APA F   +N +  +  +S V+ ++ DILI+SK+E+EH +H+  VL+ L+      N+   +F  S     GY                  Q+
Subjt:  MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLR-----ANRLYAKF--SNKDRSGY------------------QF

Query:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----
         +     E+R FLG   Y R+F+   S +  PL  L +K   + W+P    + + +K  LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  ARSSTVSEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGK-----

Query:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   ++D++ EI Y PG AN +AD LSR  
Subjt:  VVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKV

Query:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK
             ++ +  P+ KD E   I        + + Q+S+    + +++    ND  L+     E +R+ E  Q +D   I+S D      ++ +P D  + 
Subjt:  AHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVAQLNDPYLV-----EKRRLVETGQGED-FSISSDDGLTFEGRLCVPEDIAVK

Query:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS
          ++ + H     +H G   +   +   + W+ +++++ E+V  C   Q  K+   +P G LQP+  S
Subjt:  TELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAGLLQPLSVS

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein8.8e-0943.75Show/hide
Query:  SEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLD
        +E+R FLGL GYYRRFV+++  I  PLT+L +K +   W+     +F+ LK  + T PVL + D
Subjt:  SEIRSFLGLAGYYRRFVEDFSCIASPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGTTTGACTAATGCTCCTGCGGTATTCATGGACTTGATGAATAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTTTTCATTGTTGACATC
TTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGCACTTGCACCAAGTTTTGGAGACTCTTCGAGCCAATAGACTGTATGCCAAGTTCTCCAACAAAGATCGA
AGCGGTTACCAATTCGCTCGATCGTCTACAGTTAGCGAGATTCGTAGTTTCCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCATGCATAGCC
AGCCCCTTGACCCAGTTGACCAGAAAGGGAACCCCTTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAGGAGCTTAAGTGGAAGCTAGTGACTGCACCAGTC
TTGACAGTGCTCGATGGGTCGGGAAACTTCGTGATCTACAGTGATGCCTCCAAAAAGGGACTGGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTGCTTATGTC
TCCCGTCAGTTGAAGAGTCGTGAGCAGAACTACCCTACCCACGATTTAGAACTGGCAGCAGTGGTCTTTGCACTGAAGATATGGAGGCACTACCTGTACGGTGAG
AAGATACAGATTTTCACTGACCATAAGAGCCTGAAGTACTTCTTCACCCAGAAGGAGTTGAACATGAGGCAGAGAAGGTGGCTTGAGTTAGTGAAAGACTATGAC
TGTGAGATTCTGTACCACCCAGGTAAAGCAAATGTAGTAGCTGACACCCTGAGTAGGAAGGTTGCACATTCAGCAGCACTGATCACCAAACAAGCCCCCTTACTC
AAAGATTTTGAGAGAGCCGAGATTGTAGTCTCAGTAGGAGAGGTTACCTCACAGTTGGCTCAGTTGTCAGTACAGCCGACCCTGAGACAGAGGATTATTGTCGCT
CAGTTAAATGATCCTTATTTGGTCGAGAAGCGTCGTTTGGTAGAGACAGGGCAAGGTGAGGACTTCTCCATATCTTCTGATGATGGCCTTACGTTTGAGGGACGC
TTGTGTGTGCCGGAAGACATTGCAGTCAAGACAGAGCTTTTGACTGAGGCTCACAGTTCTCCATTTACCATGCACCTTGGAAGTACGAAGATGTACAAGGACTTA
AGGAGTATCTATTGGTGGAGGAACATGAAGAGAGAGGTGGCAGAATTTGTCAGTAGGTGTTTAGTGTACCAGCAGGTGAAGGCACCTAGACAGCGACCAGCCGGG
TTGTTGCAACCTTTGAGTGTGTCAGGATGGAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGTTTGACTAATGCTCCTGCGGTATTCATGGACTTGATGAATAGGGTGTTTAAGGACTTCCTAGACTCGTTCGTCATAGTTTTCATTGTTGACATC
TTGATTTACTCCAAGACTGAGGCTGAGCATGAGGAGCACTTGCACCAAGTTTTGGAGACTCTTCGAGCCAATAGACTGTATGCCAAGTTCTCCAACAAAGATCGA
AGCGGTTACCAATTCGCTCGATCGTCTACAGTTAGCGAGATTCGTAGTTTCCTGGGCTTGGCAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCATGCATAGCC
AGCCCCTTGACCCAGTTGACCAGAAAGGGAACCCCTTTTGTTTGGAGCCCAGCTTGCGAGAGTAGCTTCCAGGAGCTTAAGTGGAAGCTAGTGACTGCACCAGTC
TTGACAGTGCTCGATGGGTCGGGAAACTTCGTGATCTACAGTGATGCCTCCAAAAAGGGACTGGGCTGTGTTCTGATGCAGCAAGGTAAGGTAGTTGCTTATGTC
TCCCGTCAGTTGAAGAGTCGTGAGCAGAACTACCCTACCCACGATTTAGAACTGGCAGCAGTGGTCTTTGCACTGAAGATATGGAGGCACTACCTGTACGGTGAG
AAGATACAGATTTTCACTGACCATAAGAGCCTGAAGTACTTCTTCACCCAGAAGGAGTTGAACATGAGGCAGAGAAGGTGGCTTGAGTTAGTGAAAGACTATGAC
TGTGAGATTCTGTACCACCCAGGTAAAGCAAATGTAGTAGCTGACACCCTGAGTAGGAAGGTTGCACATTCAGCAGCACTGATCACCAAACAAGCCCCCTTACTC
AAAGATTTTGAGAGAGCCGAGATTGTAGTCTCAGTAGGAGAGGTTACCTCACAGTTGGCTCAGTTGTCAGTACAGCCGACCCTGAGACAGAGGATTATTGTCGCT
CAGTTAAATGATCCTTATTTGGTCGAGAAGCGTCGTTTGGTAGAGACAGGGCAAGGTGAGGACTTCTCCATATCTTCTGATGATGGCCTTACGTTTGAGGGACGC
TTGTGTGTGCCGGAAGACATTGCAGTCAAGACAGAGCTTTTGACTGAGGCTCACAGTTCTCCATTTACCATGCACCTTGGAAGTACGAAGATGTACAAGGACTTA
AGGAGTATCTATTGGTGGAGGAACATGAAGAGAGAGGTGGCAGAATTTGTCAGTAGGTGTTTAGTGTACCAGCAGGTGAAGGCACCTAGACAGCGACCAGCCGGG
TTGTTGCAACCTTTGAGTGTGTCAGGATGGAAATGA
Protein sequenceShow/hide protein sequence
MSFGLTNAPAVFMDLMNRVFKDFLDSFVIVFIVDILIYSKTEAEHEEHLHQVLETLRANRLYAKFSNKDRSGYQFARSSTVSEIRSFLGLAGYYRRFVEDFSCIA
SPLTQLTRKGTPFVWSPACESSFQELKWKLVTAPVLTVLDGSGNFVIYSDASKKGLGCVLMQQGKVVAYVSRQLKSREQNYPTHDLELAAVVFALKIWRHYLYGE
KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPGKANVVADTLSRKVAHSAALITKQAPLLKDFERAEIVVSVGEVTSQLAQLSVQPTLRQRIIVA
QLNDPYLVEKRRLVETGQGEDFSISSDDGLTFEGRLCVPEDIAVKTELLTEAHSSPFTMHLGSTKMYKDLRSIYWWRNMKREVAEFVSRCLVYQQVKAPRQRPAG
LLQPLSVSGWK