; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc03g0072421 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc03g0072421
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr03:19619050..19620093
RNA-Seq ExpressionCmc03g0072421
SyntenyCmc03g0072421
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041588 - Integrase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047043.1 pol protein [Cucumis melo var. makuwa]2.4e-18494.38Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEI VSVGAVT+QLA LTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

KAA0051368.1 pol protein [Cucumis melo var. makuwa]6.3e-18594.67Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYG+CCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

KAA0051744.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-18494.97Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGL R LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF  AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

KAA0062835.1 pol protein [Cucumis melo var. makuwa]6.3e-18594.67Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTA+KWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLE+MLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

KAA0065438.1 pol protein [Cucumis melo var. makuwa]5.3e-18494.67Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSND YLVEKRGLAE GQAVEFSISSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATI MAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

TrEMBL top hitse value%identityAlignment
A0A5A7TU00 Pol protein1.2e-18494.38Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEI VSVGAVT+QLA LTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

A0A5A7U7V9 Reverse transcriptase3.0e-18594.67Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQ  EFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYG+CCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

A0A5A7U8T5 Reverse transcriptase1.2e-18494.97Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQA EFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGL R LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF  AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

A0A5A7VAA9 Pol protein3.0e-18594.67Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFS+SSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTA+KWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLE+MLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATIGMAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

A0A5A7VCT0 Pol protein2.6e-18494.67Show/hide
Query:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS
        VADALSRKVSHSAALIT+QAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSND YLVEKRGLAE GQAVEFSISSDGGLLF+RRLCVPSDS
Subjt:  VADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDS

Query:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV
        AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVS+CLVCQQVKAPRQKPAGLLQPLS+ EWKWENVSMD ITGLPR LRGFTVIWVVV
Subjt:  AVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVV

Query:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD
        DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVP+SI SDRDARFTSKFWKGLQTA+GTRLDF+ AFHPQT+GQTERLNQVLEDMLRACALEFPGS D
Subjt:  DRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSD

Query:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG
        SHLHLMEFAYNNS+QATI MAPFEALYGKCCRSPVCWG
Subjt:  SHLHLMEFAYNNSFQATIGMAPFEALYGKCCRSPVCWG

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.2e-4229.7Show/hide
Query:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP
        ++ADALSR       ++ +  P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P
Subjt:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP

Query:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW
        +D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD IT LP +  G+  ++
Subjt:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG
        VVVDR +K A  VP   + TA + A+++   ++   G P  I +D D  FTS+ WK         + F++ + PQT+GQTER NQ +E +LR      P 
Subjt:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG

Query:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY
        +   H+ L++ +YNN+  +   M PFE ++
Subjt:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY

P0CT35 Transposon Tf2-2 polyprotein1.2e-4229.7Show/hide
Query:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP
        ++ADALSR       ++ +  P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P
Subjt:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP

Query:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW
        +D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD IT LP +  G+  ++
Subjt:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG
        VVVDR +K A  VP   + TA + A+++   ++   G P  I +D D  FTS+ WK         + F++ + PQT+GQTER NQ +E +LR      P 
Subjt:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG

Query:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY
        +   H+ L++ +YNN+  +   M PFE ++
Subjt:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY

P0CT36 Transposon Tf2-3 polyprotein1.2e-4229.7Show/hide
Query:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP
        ++ADALSR       ++ +  P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P
Subjt:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP

Query:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW
        +D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD IT LP +  G+  ++
Subjt:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG
        VVVDR +K A  VP   + TA + A+++   ++   G P  I +D D  FTS+ WK         + F++ + PQT+GQTER NQ +E +LR      P 
Subjt:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG

Query:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY
        +   H+ L++ +YNN+  +   M PFE ++
Subjt:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY

P0CT41 Transposon Tf2-12 polyprotein1.2e-4229.7Show/hide
Query:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP
        ++ADALSR       ++ +  P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P
Subjt:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP

Query:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW
        +D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD IT LP +  G+  ++
Subjt:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG
        VVVDR +K A  VP   + TA + A+++   ++   G P  I +D D  FTS+ WK         + F++ + PQT+GQTER NQ +E +LR      P 
Subjt:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG

Query:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY
        +   H+ L++ +YNN+  +   M PFE ++
Subjt:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY

Q9UR07 Transposon Tf2-11 polyprotein1.2e-4229.7Show/hide
Query:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP
        ++ADALSR       ++ +  P+ +D E   I          + Q+++    + +++   +ND  L+    L    + VE +I    GLL   +  + +P
Subjt:  NVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRR--LCVP

Query:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW
        +D+ +   ++ + H     +HPG   +   + R + W+ +++++ E+V  C  CQ  K+   KP G LQP+   E  WE++SMD IT LP +  G+  ++
Subjt:  SDSAVKTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIW

Query:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG
        VVVDR +K A  VP   + TA + A+++   ++   G P  I +D D  FTS+ WK         + F++ + PQT+GQTER NQ +E +LR      P 
Subjt:  VVVDRLTKSAHFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPG

Query:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY
        +   H+ L++ +YNN+  +   M PFE ++
Subjt:  SSDSHLHLMEFAYNNSFQATIGMAPFEALY

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAATGTTGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCAACAGGCCCCATTACATCGAGATCTTGAGAGGGCTGAGATTGCA
GTGTCAGTAGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAACCAACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTTGAG
AAGCGTGGCCTAGCAGAGGCAGGGCAAGCTGTTGAGTTCTCCATATCCTCTGATGGTGGACTTTTGTTTCAGAGGCGTCTCTGTGTGCCATCAGATAGTGCGGTT
AAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAATATG
AAGAGAGAGGTGGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTAAGCGTACTGGAA
TGGAAGTGGGAAAACGTGTCGATGGATATCATTACAGGACTGCCAAGAAATCTGAGGGGTTTTACAGTGATTTGGGTTGTAGTTGACAGGCTTACCAAATCAGCG
CACTTCGTTCCGGGTAAATCCACCTATACCGCTAGTAAGTGGGCACAACTGTACATGTCTGAGATAGTGAGACTACATGGAGTGCCATTGTCGATTTTTTCTGAT
AGAGATGCTCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTTTGGGCACGAGGTTAGACTTTAATATAGCTTTCCATCCACAGACTGAAGGTCAGACT
GAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGCAGTTCGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAAC
AGTTTTCAAGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCTGTTTGCTGGGGTATGTGGGTGAGCAGAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAATGTTGCTGATGCTCTTAGTAGAAAGGTATCACATTCAGCAGCACTTATTACCCAACAGGCCCCATTACATCGAGATCTTGAGAGGGCTGAGATTGCA
GTGTCAGTAGGGGCAGTCACTATGCAGTTAGCCCAGTTGACGGTACAACCAACTTTGAGGCAAAGGATCATTGATGCTCAGAGTAACGATCCTTATTTGGTTGAG
AAGCGTGGCCTAGCAGAGGCAGGGCAAGCTGTTGAGTTCTCCATATCCTCTGATGGTGGACTTTTGTTTCAGAGGCGTCTCTGTGTGCCATCAGATAGTGCGGTT
AAAACAGAATTATTATCTGAGGCTCACAGTTCCCCATTTTCCATGCACCCGGGTAGTACGAAGATGTATCAGGACCTGAAGCGGGTTTATTGGTGGCGTAATATG
AAGAGAGAGGTGGCAGAATTTGTTAGTAAATGCTTGGTGTGTCAGCAGGTTAAGGCACCAAGGCAGAAACCAGCGGGTTTATTACAACCCTTAAGCGTACTGGAA
TGGAAGTGGGAAAACGTGTCGATGGATATCATTACAGGACTGCCAAGAAATCTGAGGGGTTTTACAGTGATTTGGGTTGTAGTTGACAGGCTTACCAAATCAGCG
CACTTCGTTCCGGGTAAATCCACCTATACCGCTAGTAAGTGGGCACAACTGTACATGTCTGAGATAGTGAGACTACATGGAGTGCCATTGTCGATTTTTTCTGAT
AGAGATGCTCGTTTCACTTCCAAATTCTGGAAGGGTTTGCAGACTGCTTTGGGCACGAGGTTAGACTTTAATATAGCTTTCCATCCACAGACTGAAGGTCAGACT
GAGCGTCTGAACCAGGTTTTAGAGGATATGTTGCGAGCGTGTGCATTGGAATTTCCAGGCAGTTCGGACTCCCACTTACATTTGATGGAATTTGCTTATAATAAC
AGTTTTCAAGCTACTATTGGCATGGCACCATTTGAGGCCTTGTACGGCAAATGTTGTAGATCCCCTGTTTGCTGGGGTATGTGGGTGAGCAGAGATTGA
Protein sequenceShow/hide protein sequence
MANVADALSRKVSHSAALITQQAPLHRDLERAEIAVSVGAVTMQLAQLTVQPTLRQRIIDAQSNDPYLVEKRGLAEAGQAVEFSISSDGGLLFQRRLCVPSDSAV
KTELLSEAHSSPFSMHPGSTKMYQDLKRVYWWRNMKREVAEFVSKCLVCQQVKAPRQKPAGLLQPLSVLEWKWENVSMDIITGLPRNLRGFTVIWVVVDRLTKSA
HFVPGKSTYTASKWAQLYMSEIVRLHGVPLSIFSDRDARFTSKFWKGLQTALGTRLDFNIAFHPQTEGQTERLNQVLEDMLRACALEFPGSSDSHLHLMEFAYNN
SFQATIGMAPFEALYGKCCRSPVCWGMWVSRD