CuGenDBv2

CSPI04G20730 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G20730
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: Chr4:18992295..18993560
RNA-Seq Expression: CSPI04G20730
Synteny: CSPI04G20730
Gene Ontology terms:
  GO:0048583 - regulation of response to stimulus (biological process)
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0016740 - transferase activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR043502 - DNA/RNA polymerase superfamily
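
The genome location above follows a `chromosome:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page), the span of the gene can be computed as below. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for illustration.

```python
import re


def parse_location(location):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(location):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


if __name__ == "__main__":
    chrom, start, end = parse_location("Chr4:18992295..18993560")
    print(chrom, start, end, span_length("Chr4:18992295..18993560"))
```

Under that assumption the interval spans 1266 bases, which is a multiple of three.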


Homology

GenBank top hits
BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis]; e-value: 4.6e-105; identity: 46.51%
Query:  KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSI
        +L+ +K  +K WNKE FG + S K+    +I  LD +E    L+    KERE+    + DL+ KE+  W Q+ K+ W R+G+ N+ FFH   S R+ ++ 
Subjt:  KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSI

Query:  LSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLV
        +  L    G  +V+E EI  EI++FF NLY + + + +  + LNW  +S++++  L+ PF E+E++  VF+ G  KSPGPDG +   ++  W+I+K DL+
Subjt:  LSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLV

Query:  RVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV
        +V  DFF  GIIN   NET+I LIPKKKE+ +VSDFRPISL+TSLYK++SKVL +RL++VL S I+  Q AFV+GRQILDA L A+E V+E     + G+
Subjt:  RVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV

Query:  LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFEN
        + K+DLEKAYD V+W F+D  +  KGFG R R WI GCL T NFS+++NGRPRGK  A RG+RQGDPL+PFLFT+V D  S ++    +     G    N
Subjt:  LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFEN

Query:  LSEDLTHLQYADDTL
           +++HLQ+ADDT+
Subjt:  LSEDLTHLQYADDTL

CAN75040.1 hypothetical protein VITISV_026478 [Vitis vinifera]; e-value: 2.7e-105; identity: 45.24%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
          + SL+S  G+TL   ++I +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+PGPDG T   Y++ W+++K D
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        L+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RPISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A RG+RQGDPL+PFLFT+V D  S ++    E    +GF  
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLLSS
              ++ LQ+ADDT+  S
Subjt:  ENLSEDLTHLQYADDTLLSS

PRQ36601.1 putative RNA-directed DNA polymerase [Rosa chinensis]; e-value: 3.5e-105; identity: 46.04%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K  LK W++ETFG I  +K+V+  +IN LD  E S+ +     +ERE  RG L +L ++E+  W Q++KL W +EG+ N+ FFH  V+ R+ +
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
        +++  L    G  +  E  I +EI+ F+ NLY ++    F  + L+W  +S++ +  LE PF E+EI+  VFE   +KSPGPDG +    +++W ++K +
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        ++ V  +F  NG++N+  NETYI LIPKK  + +V D+RPISLITSLYK+I+KVL  RL++VL   I+ +Q AF++GRQILDA+L A+E VDE   + ++
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD V+W+FLD AM+ KGFG R RKWI GCL + NFSI +NGRPRGK  A RG+RQGDPL+ FLFT+V D    L+    + R ++G   
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTL
             ++THLQ+ADDT+
Subjt:  ENLSEDLTHLQYADDTL

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]; e-value: 2.7e-105; identity: 45.24%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
          + SL+S  G+TL   ++I +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+PGPDG T   Y++ W+++K D
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        L+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RPISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A RG+RQGDPL+PFLFT+V D  S ++    E    +GF  
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLLSS
              ++ LQ+ADDT+  S
Subjt:  ENLSEDLTHLQYADDTLLSS

RVW97045.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]; e-value: 1.6e-105; identity: 45.48%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L D+++KE+  W QKS++ W++EG+ NS FFH   + RKS+
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
          + SL+S  G+TL   ++I +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+PGPDG T   Y++ W+++K D
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        L+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RPISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A RG+RQGDPL+PFLFT+V D  S ++    E    +GF  
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLLSS
              ++ LQ+ADDT+  S
Subjt:  ENLSEDLTHLQYADDTLLSS

TrEMBL top hits
A0A2P6QQZ3 Putative RNA-directed DNA polymerase; e-value: 1.7e-105; identity: 46.04%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K  LK W++ETFG I  +K+V+  +IN LD  E S+ +     +ERE  RG L +L ++E+  W Q++KL W +EG+ N+ FFH  V+ R+ +
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
        +++  L    G  +  E  I +EI+ F+ NLY ++    F  + L+W  +S++ +  LE PF E+EI+  VFE   +KSPGPDG +    +++W ++K +
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        ++ V  +F  NG++N+  NETYI LIPKK  + +V D+RPISLITSLYK+I+KVL  RL++VL   I+ +Q AF++GRQILDA+L A+E VDE   + ++
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD V+W+FLD AM+ KGFG R RKWI GCL + NFSI +NGRPRGK  A RG+RQGDPL+ FLFT+V D    L+    + R ++G   
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTL
             ++THLQ+ADDT+
Subjt:  ENLSEDLTHLQYADDTL

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein; e-value: 1.3e-105; identity: 45.24%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
          + SL+S  G+TL   ++I +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+PGPDG T   Y++ W+++K D
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        L+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RPISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A RG+RQGDPL+PFLFT+V D  S ++    E    +GF  
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLLSS
              ++ LQ+ADDT+  S
Subjt:  ENLSEDLTHLQYADDTLLSS

A0A438IK87 Transposon TX1 uncharacterized 149 kDa protein; e-value: 7.6e-106; identity: 45.48%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L D+++KE+  W QKS++ W++EG+ NS FFH   + RKS+
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
          + SL+S  G+TL   ++I +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+PGPDG T   Y++ W+++K D
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        L+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RPISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A RG+RQGDPL+PFLFT+V D  S ++    E    +GF  
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLLSS
              ++ LQ+ADDT+  S
Subjt:  ENLSEDLTHLQYADDTLLSS

A0A5E4GN72 PREDICTED: RNA-directed DNA polymerase (Fragment); e-value: 2.2e-105; identity: 46.51%
Query:  KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSI
        +L+ +K  +K WNKE FG + S K+    +I  LD +E    L+    KERE+    + DL+ KE+  W Q+ K+ W R+G+ N+ FFH   S R+ ++ 
Subjt:  KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSI

Query:  LSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLV
        +  L    G  +V+E EI  EI++FF NLY + + + +  + LNW  +S++++  L+ PF E+E++  VF+ G  KSPGPDG +   ++  W+I+K DL+
Subjt:  LSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLV

Query:  RVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV
        +V  DFF  GIIN   NET+I LIPKKKE+ +VSDFRPISL+TSLYK++SKVL +RL++VL S I+  Q AFV+GRQILDA L A+E V+E     + G+
Subjt:  RVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV

Query:  LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFEN
        + K+DLEKAYD V+W F+D  +  KGFG R R WI GCL T NFS+++NGRPRGK  A RG+RQGDPL+PFLFT+V D  S ++    +     G    N
Subjt:  LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFEN

Query:  LSEDLTHLQYADDTL
           +++HLQ+ADDT+
Subjt:  LSEDLTHLQYADDTL

A5BV95 Reverse transcriptase domain-containing protein; e-value: 1.3e-105; identity: 45.24%
Query:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK
        M KLK +K+ LK WN  TFG +  +K++++  ++ +D +E+   LN + V ER   R  L D+++KE+  W QKS++ W++EG+ NS FFH   + R+S+
Subjt:  MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSK

Query:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
          + SL+S  G+TL   ++I +EI++FF NLY   +   +  + ++W  +S +    L+ PFTE+E+R  VF++   K+PGPDG T   Y++ W+++K D
Subjt:  SILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
        L+RVF +F  NG+IN+  N T+I L+PKK ++ ++SD+RPISL+TSLYK+I+KVL  RL+KVL   I+DSQ AFVEGR ILDA+L A+E VDE    G +
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
        G++ K+D EKAYD VDW FLD  ++ KGF ++ R WI GCLS+++F+I+VNG  +G + A RG+RQGDPL+PFLFT+V D  S ++    E    +GF  
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLLSS
              ++ LQ+ADDT+  S
Subjt:  ENLSEDLTHLQYADDTLLSS

SwissProt top hits
O00370 LINE-1 retrotransposable element ORF2 protein; e-value: 1.7e-30; identity: 26.59%
Query:  VSARKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEF
        +  ++ K+ + ++ + +G       EI   I  ++ +LY  ++ +        D      L+ ++   L  P T  EI  ++  +   KSPGPDG T EF
Subjt:  VSARKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEF

Query:  YKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKK-KEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTAS
        Y++    L   L+++FQ   K GI+     E  I LIPK  ++  +  +FRPISL+    K+++K+L  R+++ +  +I+  Q+ F+ G Q    I  + 
Subjt:  YKKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKK-KEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTAS

Query:  EAVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIH
          +   +  + +  V++ +D EKA+DK+   F+   +   G      K I         +II+NG+       K G RQG PL+P LF IV +    L  
Subjt:  EAVDEWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIH

Query:  YCNEKRSLKGFHFENLSEDLTHLQYADDTLL
           +++ +KG       E++    +ADD ++
Subjt:  YCNEKRSLKGFHFENLSEDLTHLQYADDTLL

P08548 LINE-1 reverse transcriptase homolog; e-value: 7.0e-32; identity: 26.78%
Query:  KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSI
        K   L+A LK   +E    +    + L          EE S       KE    R  L ++  K     I KSK  +  +  +           ++ KS+
Subjt:  KLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSI

Query:  LSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILK
        +SS+ +   +      EI   +  ++  LY  +  +        +  +   LS ++  +L  P +  EI   +  +   KSPGPDG T EFY+     L 
Subjt:  LSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILK

Query:  SDLVRVFQDFFKNGIINRRCNETYIYLIPKK-KEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWS-L
          L+ +FQ+  K GI+     E  I LIPK  K+  R  ++RPISL+    K+++K+L  R+++ +  II+  Q+ F+ G Q    I  +   +   + L
Subjt:  SDLVRVFQDFFKNGIINRRCNETYIYLIPKK-KEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWS-L

Query:  RGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLK
        + +  ++L +D EKA+D +   F+   +K  G      K I    S    +II+NG        + G RQG PL+P LF IV +  +  I    E++++K
Subjt:  RGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLK

Query:  GFHFENLSEDLTHLQYADDTLL
        G H    SE++    +ADD ++
Subjt:  GFHFENLSEDLTHLQYADDTLL

P11369 LINE-1 retrotransposable element ORF2 protein; e-value: 2.5e-29; identity: 28.13%
Query:  KSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKS
        + K +++ + + +G      +EI + I SF+  LY T++ +        D      L+      L  P + KEI  V+  +   KSPGPDG + EFY+  
Subjt:  KSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISS----PFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKS

Query:  WNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPK-KKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVD
           L   L ++F      G +     E  I LIPK +K+  ++ +FRPISL+    K+++K+L  R+++ + +II+  Q+ F+ G Q    I  +   + 
Subjt:  WNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPK-KKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVD

Query:  EWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNE
          + L+ +  +++ LD EKA+DK+   F+   ++  G        I    S    +I VNG     I  K G RQG PL+P+LF IV +    L     +
Subjt:  EWS-LRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNE

Query:  KRSLKGFHFENLSEDLTHLQYADDTLL
        ++ +KG       E++     ADD ++
Subjt:  KRSLKGFHFENLSEDLTHLQYADDTLL

P14381 Transposon TX1 uncharacterized 149 kDa protein; e-value: 2.3e-38; identity: 27.27%
Query:  LKAILKSWNKETFGKIFSQKQVLIDKINYLD---SLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSIL
        LK + + + K   G+  ++ + L  ++  L+   S  E   L  E ++ +E    AL ++  ++ +    +S++  L + +  S FF+     + ++  +
Subjt:  LKAILKSWNKETFGKIFSQKQVLIDKINYLD---SLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSIL

Query:  SSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGL---SLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD
        + L + +G  L   + I D   SF+ NL+     SP  C+ L W GL   S +    LE P T  E+ + +  M   KSPG DGLT EF++  W+ L  D
Subjt:  SSLVSIEGKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGL---SLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSD

Query:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK
          RV  + FK G +   C    + L+PKK +   + ++RP+SL+++ YK+++K +  RLK VL  +I+  Q   V GR I D +    + +      G  
Subjt:  LVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRK

Query:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF
           L LD EKA+D+VD  +L   ++   FG +   ++    ++    + +N      +   RG+RQG PL+  L+++  +   CL+     ++ L G   
Subjt:  GVLLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHF

Query:  ENLSEDLTHLQYADDTLL
        +     +    YADD +L
Subjt:  ENLSEDLTHLQYADDTLL

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment); e-value: 2.1e-15; identity: 29.27%
Query:  SPGPDGLTGEFYKKSWNILKSDLVRVF-QDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEG
        +PG DGLT +       I ++ L R F Q     G +          LIPK  +    S++RPI++ ++L +++ ++L  RL+  +   ++ +Q  +   
Subjt:  SPGPDGLTGEFYKKSWNILKSDLVRVF-QDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEG

Query:  RQILDAILTASEAVDEW--SLRGRKGV--LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVN-GRPRGKIIAKRGIRQGDPLAP
           +D  L  S  +D +  S R ++    ++ LD+ KA+D V  S +  A++  G  +    +I G LS +  +I V  G    KI  +RG++QGDPL+P
Subjt:  RQILDAILTASEAVDEW--SLRGRKGV--LLKLDLEKAYDKVDWSFLDMAMKLKGFGKRCRKWIWGCLSTTNFSIIVN-GRPRGKIIAKRGIRQGDPLAP

Query:  FLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLL
        FLF  V D   C +      +S  G       E +  L +ADD LL
Subjt:  FLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLL

Arabidopsis top hits
AT1G43760.1 DNAse I-like superfamily protein; e-value: 2.2e-25; identity: 29.77%
Query:  EKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSL-----EESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSA
        E LK  K   K  N++ FG I  + +  +D +  + S       +S    E   +++ N   A L      +  + QKS++ WL++G+ N+ FFH  + A
Subjt:  EKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSL-----EESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSA

Query:  RKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYG------TRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFY
         ++K+++  L   +   +    ++ + I++++++L G      T  S   I DI  +R      S L  +P ++KEI   VF M   K+PGPD  T EF+
Subjt:  RKSKSILSSLVSIEGKTLVTEKEIVDEILSFFSNLYG------TRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFY

Query:  KKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVIS
         +SW ++K   +   ++FF+ G + +R N T I LIPK     ++S FRP+S  T +YK+I+
Subjt:  KKSWNILKSDLVRVFQDFFKNGIINRRCNETYIYLIPKKKEAARVSDFRPISLITSLYKVIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases; e-value: 8.6e-09; identity: 43.42%
Query:  RLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV----LLKLDLEKAYDKVDWSFLDMAMKLKGF
        RLK ++ ++I  +Q +F+ GR   D I+   EAV   S+R +KGV    LLKLDLEKAYD++ W +L+  +   GF
Subjt:  RLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGV----LLKLDLEKAYDKVDWSFLDMAMKLKGF

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase); e-value: 9.2e-11; identity: 42.65%
Query:  IVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDT
        I+NG P+G +   RG+RQGDPL+P+LF +  +  S L     E+  L G    N S  + HL +ADDT
Subjt:  IVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDT


Sequences

CDS sequence
ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGA
CTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGA
AGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAA
GGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTG
GAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATG
GCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAG
ACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAAC
AAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAAT
GGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGT
AAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGG
TGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGT
CAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

mRNA sequence
ATGGAAAAGCTAAAAGGACTGAAAGCCATCCTAAAGAGTTGGAATAAGGAGACTTTTGGTAAGATTTTCTCCCAAAAACAGGTGCTGATTGATAAGATTAACTATCTTGA
CTCACTTGAAGAGTCAAGTTGTCTCAACGAGGAAAATGTGAAGGAAAGAGAAAATTGTAGAGGGGCTCTGCTTGATTTGATTGTGAAAGAGCAAAAGTTGTGGATTCAGA
AGTCGAAGCTTCATTGGCTTAGAGAGGGGGAGGAGAACTCAAGCTTCTTCCACATATGGGTTTCGGCTCGTAAAAGTAAAAGTATTCTTTCTTCCTTGGTTAGTATCGAA
GGGAAGACTCTTGTCACAGAGAAGGAAATTGTGGATGAGATCCTTAGTTTCTTTTCAAATTTATATGGCACAAGGATCTCCTCGCCGTTTATTTGTGACATTCTTAATTG
GAGAGGCCTTAGCTTACAGGATTCGAGTTTACTTGAGGTTCCCTTTACCGAAAAAGAAATTAGAGAAGTTGTATTTGAGATGGGTTGTCTCAAGTCCCCTGGCCCTGATG
GCTTGACTGGAGAGTTTTATAAAAAGTCATGGAACATTTTGAAGTCCGACCTCGTAAGGGTGTTCCAAGATTTTTTTAAAAACGGAATTATTAACAGAAGATGTAATGAG
ACTTATATTTATCTCATCCCCAAAAAGAAAGAGGCGGCCCGTGTCAGTGACTTCAGACCCATTAGCTTAATTACCTCCTTGTATAAAGTTATCTCCAAGGTGCTTCCAAC
AAGACTTAAAAAAGTTCTTCCTTCGATAATTAATGATTCTCAAATGGCTTTTGTGGAAGGAAGGCAAATCCTTGATGCTATTCTAACTGCTTCTGAGGCTGTTGACGAAT
GGTCTTTAAGAGGCAGAAAAGGTGTTCTTTTAAAGCTCGATTTGGAGAAAGCTTATGATAAGGTGGATTGGTCTTTTCTTGATATGGCCATGAAACTTAAAGGCTTTGGT
AAGAGATGTAGGAAGTGGATATGGGGATGCTTGTCGACAACTAATTTTTCCATAATTGTCAACGGCAGGCCTAGAGGAAAGATTATTGCTAAAAGGGGCATTCGTCAAGG
TGATCCTCTTGCCCCTTTTCTTTTTACGATAGTGGGAGATGCTCCAAGTTGCCTTATTCACTACTGTAATGAGAAAAGGAGTTTAAAAGGCTTTCATTTTGAGAACCTGT
CAGAGGATTTAACCCATCTTCAGTATGCAGACGACACTCTTCTTTCTTCTTCCTAG

Protein sequence
MEKLKGLKAILKSWNKETFGKIFSQKQVLIDKINYLDSLEESSCLNEENVKERENCRGALLDLIVKEQKLWIQKSKLHWLREGEENSSFFHIWVSARKSKSILSSLVSIE
GKTLVTEKEIVDEILSFFSNLYGTRISSPFICDILNWRGLSLQDSSLLEVPFTEKEIREVVFEMGCLKSPGPDGLTGEFYKKSWNILKSDLVRVFQDFFKNGIINRRCNE
TYIYLIPKKKEAARVSDFRPISLITSLYKVISKVLPTRLKKVLPSIINDSQMAFVEGRQILDAILTASEAVDEWSLRGRKGVLLKLDLEKAYDKVDWSFLDMAMKLKGFG
KRCRKWIWGCLSTTNFSIIVNGRPRGKIIAKRGIRQGDPLAPFLFTIVGDAPSCLIHYCNEKRSLKGFHFENLSEDLTHLQYADDTLLSSS
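
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and an in-frame CDS whose length is a multiple of three; the function name `translate` and the table construction are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA coding sequence; stops appear as '*'."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Codons containing characters other than T, C, A, G become 'X'.
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


if __name__ == "__main__":
    # First 30 bases of the CDS shown on this page.
    print(translate("ATGGAAAAGCTAAAAGGACTGAAAGCCATC"))
```

Passing the full CDS (with line breaks joined) and stripping the trailing `*` should yield a string that can be compared directly against the protein sequence.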