CuGenDBv2

Pay0011577 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0011577
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase
Genome location: chr06:19430474..19435128
RNA-Seq Expression: Pay0011577
Synteny: Pay0011577
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025425.1 pol protein [Cucumis melo var. makuwa] (e value: 1.3e-241; % identity: 93.13)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFSLSSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        +CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        G ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRR DLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENLSY EQP+EVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

KAA0040699.1 pol protein [Cucumis melo var. makuwa] (e value: 4.4e-242; % identity: 93.35)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVP DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKR+VAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        RCLVCQQVKAPRQKPAGLLQPLSIP+WKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMG+RLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RFERR KLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFH+
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

KAA0047043.1 pol protein [Cucumis melo var. makuwa] (e value: 5.2e-243; % identity: 94.01)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQS NEAIQKIRSRMHTAQSRQKSYADVRR DLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENL YTEQPVEVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

KAA0051368.1 pol protein [Cucumis melo var. makuwa] (e value: 3.4e-242; % identity: 92.9)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        +CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+GDKVFLKVAPM+GV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+I  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

KAA0062141.1 pol protein [Cucumis melo var. makuwa] (e value: 2.2e-241; % identity: 93.13)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAE GQAVEFS+SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGS KMYQ+LKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        RCLVCQQVKAPRQKPAGLLQPLSI EWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFE LYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDEN SYTEQPVEVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SLT0 Pol protein (e value: 6.2e-242; % identity: 93.13)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFSLSSDGGLLFERRLCVPSDSAVKTELL+EAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        +CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        G ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRR DLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENLSY EQP+EVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

A0A5A7THF3 Reverse transcriptase (e value: 2.1e-242; % identity: 93.35)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVP DSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKR+VAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        RCLVCQQVKAPRQKPAGLLQPLSIP+WKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMG+RLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RFERR KLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFH+
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

A0A5A7TU00 Pol protein (e value: 2.5e-243; % identity: 94.01)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQS NEAIQKIRSRMHTAQSRQKSYADVRR DLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENL YTEQPVEVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

A0A5A7U7V9 Reverse transcriptase (e value: 1.6e-242; % identity: 92.9)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        +CLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG+CCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+GDKVFLKVAPM+GV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDENLSY EQPVEVLAREVKTLRNK+I  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

A0A5A7V8X5 Pol protein (e value: 1.1e-241; % identity: 93.13)
Query:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS
        +QPTLRQRIIDAQSNDPYLVEKRGLAE GQAVEFS+SSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGS KMYQ+LKRVYWWRNMKREVAEFVS
Subjt:  LQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS

Query:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------
        RCLVCQQVKAPRQKPAGLLQPLSI EWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG            
Subjt:  RCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHG------------

Query:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM
               GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFE LYGKCCRSPVCWGEVGEQRLM
Subjt:  -------GLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLM

Query:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
        GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV
Subjt:  GPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHV

Query:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK
        SMLRKYVPDPSHVVDYEPLEIDEN SYTEQPVEVLAREVKTLRNKEI  +K
Subjt:  SMLRKYVPDPSHVVDYEPLEIDENLSYTEQPVEVLAREVKTLRNKEILWLK

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 2.7e-48; % identity: 29.98)
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------
         CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G  +  +         
Subjt:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------

Query:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP
                     + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        
Subjt:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP

Query:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH
        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K    R    F  +  KL+P F GPF +L++ GP  Y L LP S+  +    FH
Subjt:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH

Query:  VSMLRKY
        VS L KY
Subjt:  VSMLRKY

P0CT35 Transposon Tf2-2 polyprotein (e value: 2.7e-48; % identity: 29.98)
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------
         CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G  +  +         
Subjt:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------

Query:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP
                     + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        
Subjt:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP

Query:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH
        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K    R    F  +  KL+P F GPF +L++ GP  Y L LP S+  +    FH
Subjt:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH

Query:  VSMLRKY
        VS L KY
Subjt:  VSMLRKY

P0CT36 Transposon Tf2-3 polyprotein (e value: 2.7e-48; % identity: 29.98)
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------
         CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G  +  +         
Subjt:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------

Query:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP
                     + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        
Subjt:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP

Query:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH
        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K    R    F  +  KL+P F GPF +L++ GP  Y L LP S+  +    FH
Subjt:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH

Query:  VSMLRKY
        VS L KY
Subjt:  VSMLRKY

P0CT41 Transposon Tf2-12 polyprotein (e value: 2.7e-48; % identity: 29.98)
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------
         CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G  +  +         
Subjt:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------

Query:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP
                     + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        
Subjt:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP

Query:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH
        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K    R    F  +  KL+P F GPF +L++ GP  Y L LP S+  +    FH
Subjt:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH

Query:  VSMLRKY
        VS L KY
Subjt:  VSMLRKY

Q9UR07 Transposon Tf2-11 polyprotein (e value: 2.7e-48; % identity: 29.98)
Query:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL
        + +++   +ND  L+    L    + VE ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C 
Subjt:  RQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSRCL

Query:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------
         CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G  +  +         
Subjt:  VCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQTAMGTR------

Query:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP
                     + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+        
Subjt:  -------------LDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEVGEQRLMGP

Query:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH
        E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K    R    F  +  KL+P F GPF +L++ GP  Y L LP S+  +    FH
Subjt:  ELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVVRF-ERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTV-HDVFH

Query:  VSMLRKY
        VS L KY
Subjt:  VSMLRKY

Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTGAAACGATCCTTGCCGCTCGTCAGGTACGCTCCCATGAAAGTATATATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACTTACAGCCGACTTTGAGGCAAAG
GATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAAGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTTTTGTTCG
AGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGAC
CTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTT
ATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGGCTGCCGAGGACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGG
ATAGGCTTACCAAGTCAGCACACTTTGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCCGAGATAGTGAGGTTGCATGGGGGTTTGCAG
ACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATT
GGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATCGGCATGGCACCATTTGAGGCCTTGTACGGCAAAT
GTTGTAGATCCCCGGTTTGCTGGGGTGAAGTGGGCGAGCAAAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGCATGCAT
ACCGCTCAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTTGAGTTTGAGGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCGTGCG
TTTTGAAAGGAGGGGAAAGCTGAGTCCCCGTTTTGTTGGGCCGTTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGACAG
TTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGGAGATTGATGAAAACTTGAGCTATACTGAACAA
CCTGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCTCTGGTTAAAGTCTTATGGCGGAATCACCGAGTAG
mRNA sequence
ATGGCTGAAACGATCCTTGCCGCTCGTCAGGTACGCTCCCATGAAAGTATATATAGAGGCAGTCACCGGTTGGACCCGACCTTCCACTTACAGCCGACTTTGAGGCAAAG
GATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAAGCAGGGCAAGCGGTTGAGTTCTCATTATCCTCTGATGGTGGACTTTTGTTCG
AGAGACGCCTCTGTGTGCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGAC
CTGAAGCGGGTTTATTGGTGGCGTAACATGAAGAGGGAGGTAGCAGAATTTGTTAGTAGATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTT
ATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAAAACGTGTCCATGGATTTCATTACAGGGCTGCCGAGGACTCTGAGGGGTTTTACAGTGATTTGGGTTGTGGTGG
ATAGGCTTACCAAGTCAGCACACTTTGTTCCGGGTAAATCCACCTATACTGCTAGTAAGTGGGCACAGTTGTACATGTCCGAGATAGTGAGGTTGCATGGGGGTTTGCAG
ACTGCTATGGGCACGAGGTTAGACTTTAGTACAGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGTCTGAACCAAGTTTTAGAGGATATGTTGCGAGCGTGTGCATT
GGAATTTCCAGGTAGCTGGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATCGGCATGGCACCATTTGAGGCCTTGTACGGCAAAT
GTTGTAGATCCCCGGTTTGCTGGGGTGAAGTGGGCGAGCAAAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATACAGAAGATTAGATCACGCATGCAT
ACCGCTCAGAGTAGGCAGAAGAGTTATGCAGATGTGAGGCGGAAGGATCTTGAGTTTGAGGTAGGGGACAAGGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCGTGCG
TTTTGAAAGGAGGGGAAAGCTGAGTCCCCGTTTTGTTGGGCCGTTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCACCATCACTCTCGACAG
TTCATGATGTGTTTCATGTTTCTATGTTGAGGAAGTACGTGCCAGATCCATCCCATGTAGTGGATTACGAGCCACTGGAGATTGATGAAAACTTGAGCTATACTGAACAA
CCTGTTGAGGTGCTTGCTAGAGAGGTGAAAACGTTGAGGAATAAAGAAATCCTCTGGTTAAAGTCTTATGGCGGAATCACCGAGTAG
Protein sequence
MAETILAARQVRSHESIYRGSHRLDPTFHLQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQD
LKRVYWWRNMKREVAEFVSRCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPRTLRGFTVIWVVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGGLQ
TAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDSHLHLMEFAYNNSYQATIGMAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMH
TAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVVRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDVFHVSMLRKYVPDPSHVVDYEPLEIDENLSYTEQ
PVEVLAREVKTLRNKEILWLKSYGGITE
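
Working with the genome location
The genome location given for this gene (chr06:19430474..19435128) can be split into chromosome, start and end for use in scripts. The following is a minimal sketch, not part of the database record: the names GenomeLocation and parse_location are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'chrom:start..end' location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive,
        # so the span is end - start + 1.
        return self.end - self.start + 1


# Matches strings such as 'chr06:19430474..19435128'.
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string into its three parts."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"not a chrom:start..end location: {text!r}")
    start = int(match["start"])
    end = int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomeLocation(match["chrom"], start, end)


loc = parse_location("chr06:19430474..19435128")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span covers the whole gene region on the chromosome, so it can be longer than the CDS sequence listed above.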