; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0094731 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0094731
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionPol protein
Genome locationCMiso1.1chr04:7678185..7679589
RNA-Seq ExpressionCmc04g0094731
SyntenyCmc04g0094731
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043669.1 pol protein [Cucumis melo var. makuwa]1.9e-24296.3Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRD+ERAEI    GAVT QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFP SWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYG+CCRSPVCW EVGEQRLMGPELVQSTNEA+QKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPM+GVLRFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

KAA0051368.1 pol protein [Cucumis melo var. makuwa]1.9e-24597.45Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+GDKVFLKVAPM+GVLRFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

KAA0060848.1 pol protein [Cucumis melo var. makuwa]2.5e-24296.76Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRL VPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI GLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYGKCCRSPVCW EVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFE+RGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSL+T HDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

KAA0062245.1 pol protein [Cucumis melo var. makuwa]3.9e-24396.76Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ AEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGF+VIWVVVDRLTKSAHFV GKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYGKCC+SPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGP+AYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

KAA0066456.1 pol protein [Cucumis melo var. makuwa]1.5e-24296.3Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLA+L VQPTLRQRIIDAQ NDPYLVEKRGL EAGQTAEFSLSSDGGLLFERRLCVPSDSAVK ELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTK+YQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

TrEMBL top hitse value%identityAlignment
A0A5A7TR61 Pol protein9.3e-24396.3Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRD+ERAEI    GAVT QLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFP SWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYG+CCRSPVCW EVGEQRLMGPELVQSTNEA+QKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPM+GVLRFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

A0A5A7U7V9 Reverse transcriptase9.0e-24697.45Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFE+GDKVFLKVAPM+GVLRFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

A0A5A7UZZ4 Pol protein1.2e-24296.76Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRL VPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFI GLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYGKCCRSPVCW EVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFE+RGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSL+T HDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

A0A5A7V8L8 Pol protein1.9e-24396.76Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ AEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTKMYQDLKRVYWW NMKREVAEFVSKCLVCQQVKAP QKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGF+VIWVVVDRLTKSAHFV GKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYGKCC+SPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGP+AYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

A0A5A7VJE2 Reverse transcriptase7.1e-24396.3Show/hide
Query:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP
        +PLHRDLERAEIA   GAVTMQLA+L VQPTLRQRIIDAQ NDPYLVEKRGL EAGQTAEFSLSSDGGLLFERRLCVPSDSAVK ELLSEAHSSPFSMHP
Subjt:  SPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSSPFSMHP

Query:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT
        GSTK+YQDLKRVYWW NMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLP TLRGFTVIWVVVDRLTKSAHFVPGKSTYTA+
Subjt:  GSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTAT

Query:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG
        KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWD+HLHLMEFAYNNSYQATIG
Subjt:  KWAQLYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIG

Query:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE
        MAPFEALYG+CCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGV+RFERRGKLSPRFVGPFE
Subjt:  MAPFEALYGKCCRSPVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFE

Query:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
        ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK
Subjt:  ILERIGPVAYRLALPPSLSTVHDVFHVSMLRK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein2.2e-5531.48Show/hide
Query:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV
        Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++
Subjt:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV

Query:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS
         E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++
Subjt:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV
        D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+
Subjt:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV

Query:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST
                E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  
Subjt:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST

Query:  V-HDVFHVSMLRK
        +    FHVS L K
Subjt:  V-HDVFHVSMLRK

P0CT35 Transposon Tf2-2 polyprotein2.2e-5531.48Show/hide
Query:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV
        Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++
Subjt:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV

Query:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS
         E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++
Subjt:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV
        D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+
Subjt:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV

Query:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST
                E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  
Subjt:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST

Query:  V-HDVFHVSMLRK
        +    FHVS L K
Subjt:  V-HDVFHVSMLRK

P0CT36 Transposon Tf2-3 polyprotein2.2e-5531.48Show/hide
Query:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV
        Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++
Subjt:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV

Query:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS
         E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++
Subjt:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV
        D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+
Subjt:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV

Query:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST
                E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  
Subjt:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST

Query:  V-HDVFHVSMLRK
        +    FHVS L K
Subjt:  V-HDVFHVSMLRK

P0CT41 Transposon Tf2-12 polyprotein2.2e-5531.48Show/hide
Query:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV
        Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++
Subjt:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV

Query:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS
         E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++
Subjt:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV
        D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+
Subjt:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV

Query:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST
                E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  
Subjt:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST

Query:  V-HDVFHVSMLRK
        +    FHVS L K
Subjt:  V-HDVFHVSMLRK

Q9UR07 Transposon Tf2-11 polyprotein2.2e-5531.48Show/hide
Query:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV
        Q+++    + +++   +ND  L+    L    +  E ++    GLL   +  + +P+D+ +   ++ + H     +HPG   +   + R + W  +++++
Subjt:  QLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERR--LCVPSDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWHNMKREV

Query:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS
         E+V  C  CQ  K+   KP G LQP+   E  WE++SMDFIT LP +  G+  ++VVVDR +K A  VP   + TA + A+++   ++   G P  I++
Subjt:  AEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQLYMSEIVRLHGVPVSIVS

Query:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV
        D D  FTS+ WK         + FS  + PQTDGQTER NQ +E +LR      P +W  H+ L++ +YNN+  +   M PFE ++      SP+   E+
Subjt:  DRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYG-KCCRSPVCWGEV

Query:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST
                E  Q T +  Q ++  ++T   + K Y D++ +++ EF+ GD V +K     G L   +  KL+P F GPF +L++ GP  Y L LP S+  
Subjt:  GEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDL-EFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLST

Query:  V-HDVFHVSMLRK
        +    FHVS L K
Subjt:  V-HDVFHVSMLRK

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCTTAGTAGGAAGGTGTCACATTCAGCAGCACTTATTACCCGGCAGCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAAGTGTCAGTGGGGCAGTTACTAT
GCAGTTAGCCCAGTTGACAGTACAGCCGACTTTGAGGCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAA
CGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAAAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGTTCC
CCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCATAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAAATGCCTGGT
GTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTAC
CGGGAACTCTGAGGGGATTTACAGTGATTTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTACTAAGTGGGCACAG
TTGTACATGTCTGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGCTTGCAGACTGCTATGGG
CACAAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGCCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAG
GTAGCTGGGACGCCCACTTACACTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATCGGCATGGCACCATTTGAGGCCCTGTACGGCAAATGTTGTAGATCC
CCGGTTTGCTGGGGTGAGGTAGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATTCAGAAGATTAGGTCACGCATGCATACCGCTCAGAG
TAGACAGAAGAGTTATGCAGATGTGAGGCGGAAGGACCTTGAGTTTGAGGTAGGGGATAAAGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTTGCGTTTTGAAAGGA
GGGGAAAATTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTG
TTTCACGTTTCTATGTTGAGGAAATTGATGAGAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCTTAGTAGGAAGGTGTCACATTCAGCAGCACTTATTACCCGGCAGCCCATTGCATCGGGATCTCGAGCGGGCTGAGATTGCAAGTGTCAGTGGGGCAGTTACTAT
GCAGTTAGCCCAGTTGACAGTACAGCCGACTTTGAGGCAGAGGATCATTGATGCTCAGAGTAACGATCCTTATCTGGTTGAGAAACGTGGCCTAGCAGAGGCAGGGCAAA
CGGCTGAGTTCTCGTTATCCTCTGATGGTGGACTGTTGTTTGAAAGACGCCTCTGTGTTCCGTCAGATAGTGCGGTTAAGACAGAATTATTATCTGAGGCGCACAGTTCC
CCATTTTCCATGCACCCAGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCATAACATGAAGAGGGAAGTAGCAGAATTTGTTAGTAAATGCCTGGT
GTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTGAGCATACCGGAATGGAAGTGGGAGAACGTGTCCATGGATTTCATTACAGGGCTAC
CGGGAACTCTGAGGGGATTTACAGTGATTTGGGTTGTGGTGGACAGACTTACTAAATCAGCGCACTTCGTTCCGGGTAAATCCACCTATACTGCTACTAAGTGGGCACAG
TTGTACATGTCTGAGATAGTGAGATTACATGGAGTGCCAGTGTCGATTGTTTCTGATAGAGATGCCCGTTTCACTTCCAAATTTTGGAAGGGCTTGCAGACTGCTATGGG
CACAAGGTTGGACTTTAGTACGGCTTTCCATCCACAGACTGACGGTCAGACTGAGCGCCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAG
GTAGCTGGGACGCCCACTTACACTTGATGGAATTTGCTTATAATAACAGTTATCAGGCTACTATCGGCATGGCACCATTTGAGGCCCTGTACGGCAAATGTTGTAGATCC
CCGGTTTGCTGGGGTGAGGTAGGTGAGCAGAGATTGATGGGTCCTGAGTTAGTTCAGTCTACTAACGAAGCGATTCAGAAGATTAGGTCACGCATGCATACCGCTCAGAG
TAGACAGAAGAGTTATGCAGATGTGAGGCGGAAGGACCTTGAGTTTGAGGTAGGGGATAAAGTGTTCTTAAAGGTAGCACCTATGAGAGGTGTCTTGCGTTTTGAAAGGA
GGGGAAAATTGAGTCCCCGTTTTGTTGGGCCATTTGAGATTCTGGAGCGGATTGGCCCTGTAGCTTATCGCTTGGCGTTGCCTCCATCACTCTCGACAGTCCATGATGTG
TTTCACGTTTCTATGTTGAGGAAATTGATGAGAACTTGA
Protein sequenceShow/hide protein sequence
MLLVGRCHIQQHLLPGSPLHRDLERAEIASVSGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQTAEFSLSSDGGLLFERRLCVPSDSAVKTELLSEAHSS
PFSMHPGSTKMYQDLKRVYWWHNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSIPEWKWENVSMDFITGLPGTLRGFTVIWVVVDRLTKSAHFVPGKSTYTATKWAQ
LYMSEIVRLHGVPVSIVSDRDARFTSKFWKGLQTAMGTRLDFSTAFHPQTDGQTERLNQVLEDMLRACALEFPGSWDAHLHLMEFAYNNSYQATIGMAPFEALYGKCCRS
PVCWGEVGEQRLMGPELVQSTNEAIQKIRSRMHTAQSRQKSYADVRRKDLEFEVGDKVFLKVAPMRGVLRFERRGKLSPRFVGPFEILERIGPVAYRLALPPSLSTVHDV
FHVSMLRKLMRT