; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G18860 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G18860
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:16376853..16378327
RNA-Seq ExpressionCSPI04G18860
SyntenyCSPI04G18860
Gene Ontology termsGO:0009987 - cellular process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032310.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-16060.53Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEE----LNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPIPV EE    LNGAT+F+KIDLKS YHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEE----LNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+L KFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD EK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        +A+  WP   N+RE                                   W K+ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQYQKW+SKLL YSFEVVYKPGLENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH
        LN ++    +D+E IK EV++DEKLK I   LN+ +E +  K+++K                               GHSGFLRTYKR+A+EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH

TYK02195.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.6e-16361.13Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPIPV E    ELNGAT+F+KIDLKSGYHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+LRKFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD EK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        +A+  WP   N+RE                                   W ++ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKPGLENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH
        LN ++    +D+E IK EV++DEKLK I+  LNE +E +  K+++K                               GHSGFLRTYKR+A+EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-15961.54Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLV+EML+SGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPV+E    ELNGA +F+KIDLK+GYHQIRM   DIEK AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEFMVMPFGLTNAPSTFQ+LMN +FKP+LR+FVLVFFDDILIYSKG+++H  H+  VLE+LR++ LYAN  KC FA+ R+ YL H IS  G+EVDPEK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
         A+K+WP  ANVRE                                   WT+ET+ AF +LK++MMTLPVLA+PDF++PFEIE+DASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            +S  L+ RDR +PVYERELMAVV AVQRWRPYLLG+KF VKTDQRSLKFLLEQRVIQPQYQ+WI+KLLGYSFEV+YKPGLENKAA ALSR+  T  
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE--------TEERKDGKY----------------------SMKQGHSGFLRTYKRMAAELH
        LNQLT   ++D+EVI+ EV +D  L+ I+  + E        T  +   K+                      S+  GHSGFLRTYKRMA EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE--------TEERKDGKY----------------------SMKQGHSGFLRTYKRMAAELH

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-15961.54Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLV+EML+SGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPV+E    ELNGA +F+KIDLK+GYHQIRM   DIEK AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEFMVMPFGLTNAPSTFQ+LMN +FKP+LR+FVLVFFDDILIYSKG+++H  H+  VLE+LR++ LYAN  KC FA+ R+ YL H IS  G+EVDPEK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
         A+K+WP  ANVRE                                   WT+ET+ AF +LK++MMTLPVLA+PDF++PFEIE+DASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            +S  L+ RDR +PVYERELMAVV AVQRWRPYLLG+KF VKTDQRSLKFLLEQRVIQPQYQ+WI+KLLGYSFEV+YKPGLENKAA ALSR+  T  
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE--------TEERKDGKY----------------------SMKQGHSGFLRTYKRMAAELH
        LNQLT   ++D+EVI+ EV +D  L+ I+  + E        T  +   K+                      S+  GHSGFLRTYKRMA EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE--------TEERKDGKY----------------------SMKQGHSGFLRTYKRMAAELH

TYK23779.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]4.0e-16260.61Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYR +NN T+PDKFPIPV E    ELNGAT+F+KIDLKSGYHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+LRKFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD EK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        + +  WP   N+RE                                   W ++ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKPGLENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ-------------------------------GHSGFLRTYKRMAAELH
        LN ++    +D+E IK EV++DEKLK I+  LNE +E +  K+++K                                GHSGFLRTYKR+A+EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ-------------------------------GHSGFLRTYKRMAAELH

TrEMBL top hitse value%identityAlignment
A0A5A7SNE3 Ty3/gypsy retrotransposon protein1.8e-16060.53Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEE----LNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPIPV EE    LNGAT+F+KIDLKS YHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEE----LNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+L KFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD EK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        +A+  WP   N+RE                                   W K+ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQYQKW+SKLL YSFEVVYKPGLENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH
        LN ++    +D+E IK EV++DEKLK I   LN+ +E +  K+++K                               GHSGFLRTYKR+A+EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH

A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein1.2e-15960.12Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPIPV E    ELNGAT+F+KIDLKSGYHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+LRKFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD +K+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        +A+  WP   N+RE                                   W ++ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+ A       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQ+VIQPQYQKW+SKLLGYSFEVVYKP LENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ-------------------------------GHSGFLRTYKRMAAEL
        LN ++    +D+E IK EV++DEKLK I+  LNE +E +  K+++K                                GHSGFLRTYKR+A  L
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ-------------------------------GHSGFLRTYKRMAAEL

A0A5D3BSP2 Ty3/gypsy retrotransposon protein1.8e-16361.13Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPIPV E    ELNGAT+F+KIDLKSGYHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+LRKFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD EK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        +A+  WP   N+RE                                   W ++ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKPGLENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH
        LN ++    +D+E IK EV++DEKLK I+  LNE +E +  K+++K                               GHSGFLRTYKR+A+EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ------------------------------GHSGFLRTYKRMAAELH

A0A5D3CXB1 Ty3/gypsy retrotransposon protein1.2e-15961.54Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLV+EML+SGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPV+E    ELNGA +F+KIDLK+GYHQIRM   DIEK AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEFMVMPFGLTNAPSTFQ+LMN +FKP+LR+FVLVFFDDILIYSKG+++H  H+  VLE+LR++ LYAN  KC FA+ R+ YL H IS  G+EVDPEK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
         A+K+WP  ANVRE                                   WT+ET+ AF +LK++MMTLPVLA+PDF++PFEIE+DASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            +S  L+ RDR +PVYERELMAVV AVQRWRPYLLG+KF VKTDQRSLKFLLEQRVIQPQYQ+WI+KLLGYSFEV+YKPGLENKAA ALSR+  T  
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE--------TEERKDGKY----------------------SMKQGHSGFLRTYKRMAAELH
        LNQLT   ++D+EVI+ EV +D  L+ I+  + E        T  +   K+                      S+  GHSGFLRTYKRMA EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE--------TEERKDGKY----------------------SMKQGHSGFLRTYKRMAAELH

A0A5D3DKH5 Ty3/gypsy retrotransposon protein1.9e-16260.61Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        MERLVEEMLASGIIRPS SP+SSPVLLV+KKDGSWRFCVDYR +NN T+PDKFPIPV E    ELNGAT+F+KIDLKSGYHQIRM   DI K AFRTHEG
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIE----ELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
         YEF+VMPFGLTNAP+TFQ+LMN+IF+P+LRKFVLVFFDDILIYSK  KDH+ H+  V   LRK+ L+ANKKKC+F Q +V+YL HIISG+GVEVD EK+
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------
        + +  WP   N+RE                                   W ++ +E+F +LK +MM+LP LALP+F++PFEIETDASG+G+GA       
Subjt:  EAIKKWPVLANVRE-----------------------------------WTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA-------

Query:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ
            YSHTL+MRDR +PVYERELMAVV++VQRWRPYLLG KF+VKTDQ+SLKFLLEQRVIQPQYQKW+SKLLGYSFEVVYKPGLENKAA ALSR P  +Q
Subjt:  ----YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRVPATVQ

Query:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ-------------------------------GHSGFLRTYKRMAAELH
        LN ++    +D+E IK EV++DEKLK I+  LNE +E +  K+++K                                GHSGFLRTYKR+A+EL+
Subjt:  LNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQ-------------------------------GHSGFLRTYKRMAAELH

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.9e-6436.07Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGS-----WRFCVDYRALNNVTVPDKFPIPVIEELNG----ATMFTKIDLKSGYHQIRMCTYDIEKIAF
        +E  +++ML  GIIR S SPY+SP+ +V KK  +     +R  +DYR LN +TV D+ PIP ++E+ G       FT IDL  G+HQI M    + K AF
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGS-----WRFCVDYRALNNVTVPDKFPIPVIEELNG----ATMFTKIDLKSGYHQIRMCTYDIEKIAF

Query:  RTHEGRYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEV
         T  G YE++ MPFGL NAP+TFQ  MN I +P L K  LV+ DDI+++S  L +HL  +  V E L K  L     KC F +    +L H+++ DG++ 
Subjt:  RTHEGRYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEV

Query:  DPEKVEAIKKWPVLANVRE------------------------WTK-------------ETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA
        +PEK+EAI+K+P+    +E                         TK             E   AF +LK  +   P+L +PDF+  F + TDAS   LGA
Subjt:  DPEKVEAIKKWPVLANVRE------------------------WTK-------------ETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA

Query:  -----------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALS
                    S TL   +      E+EL+A+V A + +R YLLG+ F + +D + L +L   +    +  +W  KL  + F++ Y  G EN  A ALS
Subjt:  -----------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALS

Query:  RV
        R+
Subjt:  RV

P20825 Retrovirus-related Pol polyprotein from transposon 2978.1e-6537.06Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKD-----GSWRFCVDYRALNNVTVPDKFPIPVIEELNG----ATMFTKIDLKSGYHQIRMCTYDIEKIAF
        +E  V+EML  G+IR S SPY+SP  +V KK        +R  +DYR LN +T+PD++PIP ++E+ G       FT IDL  G+HQI M    I K AF
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKD-----GSWRFCVDYRALNNVTVPDKFPIPVIEELNG----ATMFTKIDLKSGYHQIRMCTYDIEKIAF

Query:  RTHEGRYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEV
         T  G YE++ MPFGL NAP+TFQ  MN I +P L K  LV+ DDI+I+S  L +HLN ++ V   L    L     KC F +   ++L HI++ DG++ 
Subjt:  RTHEGRYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEV

Query:  DPEKVEAIKKWPV--------------------LANVREWTK------------ETQ-----EAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA
        +P KV+AI  +P+                    + N  +  K            +TQ     EAF +LK  ++  P+L LPDF   F + TDAS   LGA
Subjt:  DPEKVEAIKKWPV--------------------LANVREWTK------------ETQ-----EAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA

Query:  -----------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALS
                    S TL   +      E+EL+A+V A + +R YLLG++FL+ +D + L++L   +    + ++W  +L  Y F++ Y  G EN  A ALS
Subjt:  -----------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALS

Query:  RV
        R+
Subjt:  RV

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.8e-5733.71Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEEL----NGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        + ++V+++L +  I PS SP SSPV+LV KKDG++R CVDYR LN  T+ D FP+P I+ L      A +FT +DL SGYHQI M   D  K AF T  G
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEEL----NGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
        +YE+ VMPFGL NAPSTF   M   F+    +FV V+ DDILI+S+  ++H  H+  VLE L+   L   KKKC FA    ++L + I    +     K 
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVR----------------------------------EWTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA--------
         AI+ +P    V+                                  +WT++  +A  +LK ++   PVL   +    + + TDAS  G+GA        
Subjt:  EAIKKWPVLANVR----------------------------------EWTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA--------

Query:  ---------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRV
                 +S +L    +  P  E EL+ ++ A+  +R  L GK F ++TD  SL  L  +     + Q+W+  L  Y F + Y  G +N  A A+SR 
Subjt:  ---------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRV

Query:  PATVQLNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE
          T+      TS  ID E  K+    D     ++  + E
Subjt:  PATVQLNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus3.4e-6334.11Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKK-----DGSWRFCVDYRALNNVTVPDKFPIP----VIEELNGATMFTKIDLKSGYHQIRMCTYDIEKIAF
        +ER ++E+L  GIIRPS SPY+SP+ +V KK     +  +R  VD++ LN VT+PD +PIP     +  L  A  FT +DL SG+HQI M   DI K AF
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKK-----DGSWRFCVDYRALNNVTVPDKFPIP----VIEELNGATMFTKIDLKSGYHQIRMCTYDIEKIAF

Query:  RTHEGRYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEV
         T  G+YEF+ +PFGL NAP+ FQ +++ I + ++ K   V+ DDI+++S+    H  ++R VL  L K  L  N +K  F   +V++L +I++ DG++ 
Subjt:  RTHEGRYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEV

Query:  DPEKVEAIKKWPVLANVREWTK----------------------------------------------ETQ-EAFNRLKQSMMTLPVLALPDFSVPFEIE
        DP+KV AI + P   +V+E  +                                              ET  ++FN LK  + +  +LA P F+ PF + 
Subjt:  DPEKVEAIKKWPVLANVREWTK----------------------------------------------ETQ-EAFNRLKQSMMTLPVLALPDFSVPFEIE

Query:  TDASGYGLGA---------------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFL-VKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEV
        TDAS + +GA                S +L   +      E+E++A++ ++   R YL G   + V TD + L F L  R    + ++W +++  Y+ E+
Subjt:  TDASGYGLGA---------------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFL-VKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEV

Query:  VYKPGLENKAAYALSRVPATVQLNQLTT
        +YKPG  N  A ALSR+P   QLNQL+T
Subjt:  VYKPGLENKAAYALSRVPATVQLNQLTT

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.1e-5733.71Show/hide
Query:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEEL----NGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG
        + ++V+++L +  I PS SP SSPV+LV KKDG++R CVDYR LN  T+ D FP+P I+ L      A +FT +DL SGYHQI M   D  K AF T  G
Subjt:  MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEEL----NGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEG

Query:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV
        +YE+ VMPFGL NAPSTF   M   F+    +FV V+ DDILI+S+  ++H  H+  VLE L+   L   KKKC FA    ++L + I    +     K 
Subjt:  RYEFMVMPFGLTNAPSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKV

Query:  EAIKKWPVLANVR----------------------------------EWTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA--------
         AI+ +P    V+                                  +WT++  +A ++LK ++   PVL   +    + + TDAS  G+GA        
Subjt:  EAIKKWPVLANVR----------------------------------EWTKETQEAFNRLKQSMMTLPVLALPDFSVPFEIETDASGYGLGA--------

Query:  ---------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRV
                 +S +L    +  P  E EL+ ++ A+  +R  L GK F ++TD  SL  L  +     + Q+W+  L  Y F + Y  G +N  A A+SR 
Subjt:  ---------YSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVYKPGLENKAAYALSRV

Query:  PATVQLNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE
          T+      TS  ID E  K+    D     ++  + E
Subjt:  PATVQLNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNE

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein6.7e-0675.86Show/hide
Query:  EMLASGIIRPSTSPYSSPVLLVRKKDGSW
        EML + II+PS SPYSSPVLLV+KKDG W
Subjt:  EMLASGIIRPSTSPYSSPVLLVRKKDGSW

ATMG00860.1 DNA/RNA polymerases superfamily protein2.1e-1535.88Show/hide
Query:  LNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYL--EHIISGDGVEVDPEKVEAIKKWPVLANVRE-----------------------------------
        +NH+  VL++  ++  YAN+KKC+F Q ++ YL   HIISG+GV  DP K+EA+  WP   N  E                                   
Subjt:  LNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYL--EHIISGDGVEVDPEKVEAIKKWPVLANVRE-----------------------------------

Query:  WTKETQEAFNRLKQSMMTLPVLALPDFSVPF
        WT+    AF  LK ++ TLPVLALPD  +PF
Subjt:  WTKETQEAFNRLKQSMMTLPVLALPDFSVPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGATTAGTGGAAGAAATGTTAGCATCAGGGATAATAAGGCCGAGTACGAGCCCATATTCCAGTCCCGTATTGCTGGTGAGGAAAAAGGATGGAAGCTGGCGTTT
TTGTGTAGATTACAGGGCTCTAAATAATGTAACCGTACCTGATAAGTTTCCAATCCCAGTGATTGAGGAGTTGAATGGAGCTACAATGTTTACTAAGATTGATCTTAAAT
CAGGATACCATCAGATTAGGATGTGTACATATGACATTGAAAAGATAGCGTTCAGAACCCATGAGGGTCGCTATGAGTTTATGGTGATGCCGTTTGGGTTGACAAACGCA
CCCTCCACTTTTCAATCATTGATGAATACTATATTCAAACCATACCTTCGAAAATTTGTCTTAGTGTTTTTTGATGATATATTGATCTATAGTAAAGGGTTGAAAGATCA
CTTGAATCATATGAGAGCAGTATTGGAAGTGCTGAGGAAGAATGGATTATATGCGAATAAGAAGAAATGCAGCTTTGCTCAGTTTCGAGTAGATTACTTGGAACATATTA
TCTCAGGAGATGGAGTTGAAGTGGATCCTGAAAAGGTCGAGGCTATAAAGAAGTGGCCAGTTCTAGCTAACGTGAGGGAGTGGACAAAAGAAACTCAAGAAGCATTCAAC
AGGTTGAAGCAATCCATGATGACACTTCCGGTATTGGCTCTGCCTGATTTCAGTGTGCCATTTGAAATAGAAACTGATGCTTCTGGATATGGATTGGGAGCTTATAGCCA
TACGTTGGCAATGAGAGATAGGGTGAAACCTGTATATGAAAGGGAGTTAATGGCAGTAGTAATGGCTGTGCAAAGATGGCGTCCGTACTTATTGGGGAAGAAATTTTTAG
TCAAGACTGATCAACGATCCCTAAAGTTTCTTTTGGAACAAAGAGTAATTCAACCTCAGTATCAAAAATGGATATCCAAACTTCTTGGATACTCATTCGAAGTGGTTTAT
AAACCGGGATTGGAAAACAAAGCAGCATATGCATTATCTAGAGTACCAGCAACGGTACAATTGAATCAGTTAACAACTTCTAATGTGATTGATATAGAAGTGATCAAAGC
GGAGGTAGATCAAGATGAGAAGCTGAAGAATATTATGAAGAAATTAAATGAGACAGAAGAGAGGAAAGATGGTAAATATTCGATGAAACAGGGTCACTCGGGATTCTTGC
GGACATATAAGAGGATGGCTGCGGAGTTACATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGAGATTAGTGGAAGAAATGTTAGCATCAGGGATAATAAGGCCGAGTACGAGCCCATATTCCAGTCCCGTATTGCTGGTGAGGAAAAAGGATGGAAGCTGGCGTTT
TTGTGTAGATTACAGGGCTCTAAATAATGTAACCGTACCTGATAAGTTTCCAATCCCAGTGATTGAGGAGTTGAATGGAGCTACAATGTTTACTAAGATTGATCTTAAAT
CAGGATACCATCAGATTAGGATGTGTACATATGACATTGAAAAGATAGCGTTCAGAACCCATGAGGGTCGCTATGAGTTTATGGTGATGCCGTTTGGGTTGACAAACGCA
CCCTCCACTTTTCAATCATTGATGAATACTATATTCAAACCATACCTTCGAAAATTTGTCTTAGTGTTTTTTGATGATATATTGATCTATAGTAAAGGGTTGAAAGATCA
CTTGAATCATATGAGAGCAGTATTGGAAGTGCTGAGGAAGAATGGATTATATGCGAATAAGAAGAAATGCAGCTTTGCTCAGTTTCGAGTAGATTACTTGGAACATATTA
TCTCAGGAGATGGAGTTGAAGTGGATCCTGAAAAGGTCGAGGCTATAAAGAAGTGGCCAGTTCTAGCTAACGTGAGGGAGTGGACAAAAGAAACTCAAGAAGCATTCAAC
AGGTTGAAGCAATCCATGATGACACTTCCGGTATTGGCTCTGCCTGATTTCAGTGTGCCATTTGAAATAGAAACTGATGCTTCTGGATATGGATTGGGAGCTTATAGCCA
TACGTTGGCAATGAGAGATAGGGTGAAACCTGTATATGAAAGGGAGTTAATGGCAGTAGTAATGGCTGTGCAAAGATGGCGTCCGTACTTATTGGGGAAGAAATTTTTAG
TCAAGACTGATCAACGATCCCTAAAGTTTCTTTTGGAACAAAGAGTAATTCAACCTCAGTATCAAAAATGGATATCCAAACTTCTTGGATACTCATTCGAAGTGGTTTAT
AAACCGGGATTGGAAAACAAAGCAGCATATGCATTATCTAGAGTACCAGCAACGGTACAATTGAATCAGTTAACAACTTCTAATGTGATTGATATAGAAGTGATCAAAGC
GGAGGTAGATCAAGATGAGAAGCTGAAGAATATTATGAAGAAATTAAATGAGACAGAAGAGAGGAAAGATGGTAAATATTCGATGAAACAGGGTCACTCGGGATTCTTGC
GGACATATAAGAGGATGGCTGCGGAGTTACATTAG
Protein sequenceShow/hide protein sequence
MERLVEEMLASGIIRPSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIPVIEELNGATMFTKIDLKSGYHQIRMCTYDIEKIAFRTHEGRYEFMVMPFGLTNA
PSTFQSLMNTIFKPYLRKFVLVFFDDILIYSKGLKDHLNHMRAVLEVLRKNGLYANKKKCSFAQFRVDYLEHIISGDGVEVDPEKVEAIKKWPVLANVREWTKETQEAFN
RLKQSMMTLPVLALPDFSVPFEIETDASGYGLGAYSHTLAMRDRVKPVYERELMAVVMAVQRWRPYLLGKKFLVKTDQRSLKFLLEQRVIQPQYQKWISKLLGYSFEVVY
KPGLENKAAYALSRVPATVQLNQLTTSNVIDIEVIKAEVDQDEKLKNIMKKLNETEERKDGKYSMKQGHSGFLRTYKRMAAELH