; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc09g0247171 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc09g0247171
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr09:11551955..11553199
RNA-Seq ExpressionCmc09g0247171
SyntenyCmc09g0247171
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035890.1 pol protein [Cucumis melo var. makuwa]4.1e-20788.16Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP V MDLMNRVFKDFLD+FVIVFIDDILI SKI+AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGSG+FVIYSD+SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVT+Q AQL VQPTLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDD L F+GRL VPEDSAVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

KAA0063098.1 pol protein [Cucumis melo var. makuwa]1.0e-20587.44Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGSG+FVIYSD+SKK LGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK+HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEI VS+ EVT+Q AQL VQPTLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDD L F+GRL VPEDSAVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

KAA0063793.1 pol protein [Cucumis melo var. makuwa]1.7e-20587.44Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGSG+FVIYSD+SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVT+Q AQL VQPTLRQ+II AQLNDPYL EKR +VE  Q EDFSISSDD L F+GRL VPED+AVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

KAA0066951.1 pol protein [Cucumis melo var. makuwa]9.1e-20787.92Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGSG+FVIYSD+SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVT+Q AQL VQPTLRQ+II AQLNDPYLVEKR +VE GQ EDFSIS DD L F+GRL VPEDSAV+ ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

TYK08868.1 pol protein [Cucumis melo var. makuwa]7.7e-20688.16Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWLKKV+FLGHVVSSEGVS+D AK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTV+EIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSP CESSF ELKQKLV+APVLTVPDGSGSFVIYSD SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLKSHEQNY THDLELAAVVF LKIWRHYLYGE+IQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVTSQ AQL+VQPTLRQRIIVAQLNDPYLVEKR LVE  Q EDFSISS+D L F+GRL VPEDSAVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

TrEMBL top hitse value%identityAlignment
A0A5A7SXW6 Reverse transcriptase2.0e-20788.16Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP V MDLMNRVFKDFLD+FVIVFIDDILI SKI+AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGSG+FVIYSD+SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVT+Q AQL VQPTLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDD L F+GRL VPEDSAVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

A0A5A7V646 Reverse transcriptase4.9e-20687.44Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGSG+FVIYSD+SKK LGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK+HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEI VS+ EVT+Q AQL VQPTLRQ+II AQLNDPYL EKR +VE GQ EDFSISSDD L F+GRL VPEDSAVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

A0A5A7V6R2 Reverse transcriptase8.3e-20687.44Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACE SFQELKQKLV+APVLTVPDGSG+FVIYSD+SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVT+Q AQL VQPTLRQ+II AQLNDPYL EKR +VE  Q EDFSISSDD L F+GRL VPED+AVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

A0A5A7VMR4 Reverse transcriptase4.4e-20787.92Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWL+KV+FLGHVVSSEGVS+DPAK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTVSEIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSPACESSFQELKQKLV+APVLTVPDGSG+FVIYSD+SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLK HEQNY THDLELAAVVFALKIWRHYLYGEKIQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVT+Q AQL VQPTLRQ+II AQLNDPYLVEKR +VE GQ EDFSIS DD L F+GRL VPEDSAV+ ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

A0A5D3CCF0 Pol protein3.7e-20688.16Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        MSFGLTNAP VFMDLMNRVFKDFLD+FVIVFIDDILI SK +AEHE+HLH+VLETLRANKLYAKFS CEFWLKKV+FLGHVVSSEGVS+D AK+E +T+W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA
        PRPSTV+EIRSFLGL GYYRRFVEDFSRIASPL QLTRKGTPFVWSP CESSF ELKQKLV+APVLTVPDGSGSFVIYSD SKKGLGCVLMQQG+VVAYA
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYA

Query:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK
        SRQLKSHEQNY THDLELAAVVF LKIWRHYLYGE+IQI+TDHKSLKYFFTQKELNMRQRRWLELVK+YDCEILY PGKANVVA+ALSRKVAHSAALITK
Subjt:  SRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITK

Query:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH
        Q PLLRD ERAEIAVS+ EVTSQ AQL+VQPTLRQRIIVAQLNDPYLVEKR LVE  Q EDFSISS+D L F+GRL VPEDSAVK ELL EAHSSPF MH
Subjt:  QAPLLRDLERAEIAVSIEEVTSQFAQLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMH

Query:  PGSTKMYQDLRCVY
        PGSTKMYQDLR VY
Subjt:  PGSTKMYQDLRCVY

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.4e-5638.97Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        M FGL NAP  F   MN + +  L+   +V++DDI++ S    EH + L  V E L    L  +   CEF  ++ +FLGHV++ +G+  +P K+E I  +
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAY
        P P+   EI++FLGL GYYR+F+ +F+ IA P+ +  +K       +P  +S+F++LK  +   P+L VPD +  F + +D+S   LG VL Q G  ++Y
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAY

Query:  ASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSR
         SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   +  +D +I Y  GK N VA+ALSR
Subjt:  ASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSR

P0CT41 Transposon Tf2-12 polyprotein1.5e-4730.27Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        M +G++ AP  F   +N +  +  ++ V+ ++DDILI SK ++EH KH+  VL+ L+   L    + CEF   +V F+G+ +S +G +     ++ +  W
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQG-----R
         +P    E+R FLG V Y R+F+   S++  PL  L +K   + W+P    + + +KQ LVS PVL   D S   ++ +D+S   +G VL Q+       
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQG-----R

Query:  VVAYASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKV
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L    T +    N R  RW   +++++ EI Y PG AN +A+ALSR  
Subjt:  VVAYASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSLKYFFTQKE--LNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKV

Query:  AHSAALITKQAPLLRDLERAEI-AVSIEEVTSQFA-QLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELL
             ++ +  P+ +D E   I  V+   +T  F  Q++ + T   +++    N+   VE+ + ++ G      I+S D++       +P D+ +   ++
Subjt:  AHSAALITKQAPLLRDLERAEI-AVSIEEVTSQFA-QLIVQPTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELL

Query:  KEAHSSPFAMHPG
        K+ H     +HPG
Subjt:  KEAHSSPFAMHPG

P10401 Retrovirus-related Pol polyprotein from transposon gypsy2.5e-5035.31Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        + FGL NA ++F   ++ V ++ +     V++DD++I S+ +++H +H+  VL+ L    +        F+ + V +LG +VS +G   DP KV+ I  +
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTR-----------KGTPFVWSPACESSFQELKQKLVSAPV-LTVPDGSGSFVIYSDSSKKGLGC
        P P  V ++RSFLGL  YYR F++DF+ IA P+  + +           K  P  ++    ++FQ L+  L S  V L  PD    F + +D+S  G+G 
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTR-----------KGTPFVWSPACESSFQELKQKLVSAPV-LTVPDGSGSFVIYSDSSKKGLGC

Query:  VLMQQGRVVAYASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANAL
        VL Q+GR +   SR LK  EQNY T++ EL A+V+AL   +++LYG + I IFTDH+ L +    +  N + +RW   +  ++ ++ Y PGK N VA+AL
Subjt:  VLMQQGRVVAYASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANAL

Query:  SRK
        SR+
Subjt:  SRK

P20825 Retrovirus-related Pol polyprotein from transposon 2972.4e-5338.28Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        M FGL NAP  F   MN + +  L+   +V++DDI+I S    EH   +  V   L    L  +   CEF  K+ +FLGH+V+ +G+  +P KV+ I S+
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAY
        P P+   EIR+FLGL GYYR+F+ +++ IA P+    +K T           +F++LK  ++  P+L +PD    FV+ +D+S   LG VL Q G  +++
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPF-VWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAY

Query:  ASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSR
         SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++    KE   +  RW   +  Y  +I Y  GK N VA+ALSR
Subjt:  ASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus7.8e-5235.08Show/hide
Query:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW
        + FGL NAP +F  +++ + ++ +     V+IDDI++ S+    H K+L  VL +L    L        F   +V FLG++V+++G+  DP KV  I+  
Subjt:  MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSW

Query:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTR-----------KGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCV
        P P++V E++ FLG+  YYR+F++D++++A PL  LTR              P         SF +LK  L S+ +L  P  +  F + +D+S   +G V
Subjt:  PRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTR-----------KGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCV

Query:  LMQ----QGRVVAYASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVA
        L Q    + R +AY SR L   E+NY T + E+ A++++L   R YLYG   I+++TDH+ L +    +  N + +RW   ++ Y+CE++Y PGK+NVVA
Subjt:  LMQ----QGRVVAYASRQLKSHEQNYHTHDLELAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVA

Query:  NALSR
        +ALSR
Subjt:  NALSR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.4e-2440.46Show/hide
Query:  HLHRVLETLRANKLYAKFSNCEFWLKKVSFLG--HVVSSEGVSIDPAKVETITSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVW
        HL  VL+    ++ YA    C F   ++++LG  H++S EGVS DPAK+E +  WP P   +E+R FLGL GYYRRFV+++ +I  PL +L +K +   W
Subjt:  HLHRVLETLRANKLYAKFSNCEFWLKKVSFLG--HVVSSEGVSIDPAKVETITSWPRPSTVSEIRSFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVW

Query:  SPACESSFQELKQKLVSAPVLTVPDGSGSFV
        +     +F+ LK  + + PVL +PD    FV
Subjt:  SPACESSFQELKQKLVSAPVLTVPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGGTTGACTAATGCTCCTACGGTATTCATGGACTTGATGAACAGGGTGTTCAAAGATTTCTTAGACACGTTTGTCATAGTTTTCATTGATGACATCTTGAT
CTGCTCCAAGATCAAGGCTGAGCATGAGAAGCACTTGCACCGGGTTTTGGAGACTCTTCGAGCCAATAAGTTGTATGCCAAGTTCTCCAACTGTGAGTTCTGGCTGAAGA
AGGTATCTTTCCTTGGACATGTGGTGTCCAGTGAGGGAGTTTCTATAGACCCAGCAAAGGTTGAAACTATTACTAGTTGGCCTCGACCGTCGACAGTTAGCGAGATTCGT
AGTTTCCTGGGTTTGGTAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGATTCAGTTGACCAGGAAGGGGACCCCTTTTGTTTGGAGCCC
AGCTTGCGAGAGTAGCTTCCAGGAACTTAAGCAGAAGCTTGTGTCTGCACCAGTCCTGACAGTGCCAGATGGATCGGGGAGCTTTGTGATCTACAGTGATTCCTCCAAGA
AAGGACTGGGTTGCGTGCTGATGCAGCAGGGCAGAGTAGTTGCTTATGCCTCCCGCCAGTTGAAGAGTCACGAGCAGAACTATCATACCCATGACCTAGAGTTGGCAGCA
GTGGTTTTTGCACTGAAAATATGGAGACATTACCTGTACGGTGAGAAGATACAGATTTTCACTGACCATAAGAGCCTGAAATACTTCTTCACCCAGAAGGAGTTGAACAT
GAGGCAGAGGAGGTGGCTCGAGTTAGTGAAAAACTATGACTGCGAGATTTTGTATGACCCAGGTAAGGCAAACGTAGTAGCTAACGCATTGAGTAGGAAGGTTGCGCATT
CCGCAGCGCTTATCACCAAGCAAGCTCCCTTACTAAGAGATCTCGAGAGAGCTGAGATTGCAGTCTCAATAGAAGAAGTGACCTCACAGTTTGCCCAGTTGATCGTGCAG
CCGACCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTACTTGGTCGAGAAGCGTCTATTAGTAGAGGCAGGGCAAAGTGAGGATTTCTCCATATCCTCTGA
TGACAGACTTACCTTCGATGGACGTTTGAGCGTGCCAGAAGACAGTGCAGTCAAGGAAGAGCTTTTGAAGGAGGCTCACAGTTCTCCATTTGCTATGCACCCTGGAAGTA
CGAAGATGTACCAAGACTTGAGGTGTGTCTACTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGGTTGACTAATGCTCCTACGGTATTCATGGACTTGATGAACAGGGTGTTCAAAGATTTCTTAGACACGTTTGTCATAGTTTTCATTGATGACATCTTGAT
CTGCTCCAAGATCAAGGCTGAGCATGAGAAGCACTTGCACCGGGTTTTGGAGACTCTTCGAGCCAATAAGTTGTATGCCAAGTTCTCCAACTGTGAGTTCTGGCTGAAGA
AGGTATCTTTCCTTGGACATGTGGTGTCCAGTGAGGGAGTTTCTATAGACCCAGCAAAGGTTGAAACTATTACTAGTTGGCCTCGACCGTCGACAGTTAGCGAGATTCGT
AGTTTCCTGGGTTTGGTAGGTTACTACAGGAGGTTCGTGGAAGACTTCTCTCGTATAGCCAGTCCCTTGATTCAGTTGACCAGGAAGGGGACCCCTTTTGTTTGGAGCCC
AGCTTGCGAGAGTAGCTTCCAGGAACTTAAGCAGAAGCTTGTGTCTGCACCAGTCCTGACAGTGCCAGATGGATCGGGGAGCTTTGTGATCTACAGTGATTCCTCCAAGA
AAGGACTGGGTTGCGTGCTGATGCAGCAGGGCAGAGTAGTTGCTTATGCCTCCCGCCAGTTGAAGAGTCACGAGCAGAACTATCATACCCATGACCTAGAGTTGGCAGCA
GTGGTTTTTGCACTGAAAATATGGAGACATTACCTGTACGGTGAGAAGATACAGATTTTCACTGACCATAAGAGCCTGAAATACTTCTTCACCCAGAAGGAGTTGAACAT
GAGGCAGAGGAGGTGGCTCGAGTTAGTGAAAAACTATGACTGCGAGATTTTGTATGACCCAGGTAAGGCAAACGTAGTAGCTAACGCATTGAGTAGGAAGGTTGCGCATT
CCGCAGCGCTTATCACCAAGCAAGCTCCCTTACTAAGAGATCTCGAGAGAGCTGAGATTGCAGTCTCAATAGAAGAAGTGACCTCACAGTTTGCCCAGTTGATCGTGCAG
CCGACCTTGAGACAGAGGATTATTGTTGCTCAGCTAAATGATCCTTACTTGGTCGAGAAGCGTCTATTAGTAGAGGCAGGGCAAAGTGAGGATTTCTCCATATCCTCTGA
TGACAGACTTACCTTCGATGGACGTTTGAGCGTGCCAGAAGACAGTGCAGTCAAGGAAGAGCTTTTGAAGGAGGCTCACAGTTCTCCATTTGCTATGCACCCTGGAAGTA
CGAAGATGTACCAAGACTTGAGGTGTGTCTACTGA
Protein sequenceShow/hide protein sequence
MSFGLTNAPTVFMDLMNRVFKDFLDTFVIVFIDDILICSKIKAEHEKHLHRVLETLRANKLYAKFSNCEFWLKKVSFLGHVVSSEGVSIDPAKVETITSWPRPSTVSEIR
SFLGLVGYYRRFVEDFSRIASPLIQLTRKGTPFVWSPACESSFQELKQKLVSAPVLTVPDGSGSFVIYSDSSKKGLGCVLMQQGRVVAYASRQLKSHEQNYHTHDLELAA
VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELVKNYDCEILYDPGKANVVANALSRKVAHSAALITKQAPLLRDLERAEIAVSIEEVTSQFAQLIVQ
PTLRQRIIVAQLNDPYLVEKRLLVEAGQSEDFSISSDDRLTFDGRLSVPEDSAVKEELLKEAHSSPFAMHPGSTKMYQDLRCVY