; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G20950 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G20950
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:16516430..16518860
RNA-Seq ExpressionCSPI01G20950
SyntenyCSPI01G20950
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR000953 - Chromo/chromo shadow domain
IPR016197 - Chromo-like domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0035107.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.9e-22852.33Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYRI++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

KAA0038753.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.9e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

KAA0055700.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.4e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A++AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSR PP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AKMVA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

TYK08591.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.9e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

TYK24981.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]6.9e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

TrEMBL top hitse value%identityAlignment
A0A5A7T0J9 Ty3/gypsy retrotransposon protein3.3e-22852.33Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYRI++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

A0A5A7T725 Ty3/gypsy retrotransposon protein3.3e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

A0A5A7UKN8 Ty3/gypsy retrotransposon protein1.2e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A++AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSR PP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AKMVA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

A0A5D3C9P5 Ty3/gypsy retrotransposon protein3.3e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

A0A5D3DMY9 Ty3/gypsy retrotransposon protein3.3e-22852.21Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW
        MPFGLTNA  TFQ+LMN VF+ YLR+FVLVF DDIL+YS+ ++EH QH+E+VL  L+  +L                                  A+ +W
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKL----------------------------------AIKQW

Query:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS
        PTPTNVREVRGFLGLTGYYRRF+                G++KW+  A+ AF KL++AMM LP+L +PDFN PFE+++DASG GVGA+L Q ++P+A++S
Subjt:  PTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYS

Query:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA
         TL++RDRA+PVYEREL+AVVL VQRWRPYLLGR F VKTDQRSLKF LEQRV+QPQYQKW+AKLLGYSFEVVY+PGLENKAA+ALSRVPP VHL+Q+TA
Subjt:  HTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVPPTVHLNQLTA

Query:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------
        P ++D+++I+EE   D  LQ+I   ++   ++ +YTLQQG+L++KGRLVI   S+LIP I+HTYHDSV GGHSGFLRTYKRLTGEI              
Subjt:  PTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYKRLTGEI--------------

Query:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH
                          + P+EIP+ +W DISMDFIE L KS G++VIFVVVD  +KY HFL L+HPF AK+VA      E+F++ VV    Y   P+ 
Subjt:  ------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKH

Query:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG
         +   D+      W +     G ++ R+             + K I       C                W NT                   P LIYYG
Subjt:  SIPPPDR------WID---RGGQQICRN-------------LFKIIRRVTIPSC---------------LWKNT-------------------PALIYYG

Query:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA
        D ETPNS LD+QLK+RD+ LGALKEHL++AQE+MK  AD KRR VEF+EGD VFLK+RPYRQ SLRK+RNEKLSPK+FGPYR+++RIG VAY+LELPA A
Subjt:  DRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIGPVAYRLELPATA

Query:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI
         IHPVFH+SQLK+A G+    + L P++ AN EW   P+EV+ Y+KN     WE L+SWKGLP HEATWE+  D +  FPDFHLEDKV L+ E + RPPI
Subjt:  TIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKN-EKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPPI

Query:  TDQYSRRKNRKEKQEE
           Y R+  +K +  E
Subjt:  TDQYSRRKNRKEKQEE

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.9e-4724.86Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW
        MP+G++ A   FQ  +N++        V+ ++DDIL++S++  EH +H++ VL++L+   L I                                   QW
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW

Query:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R
          P N +E+R FLG   Y R+F+                   +KW     +A E +++ +++ P+L   DF+    ++TDAS   VGA+L Q        
Subjt:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R

Query:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP
        P+ +YS  ++       V ++E++A++ +++ WR YL      F + TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  A+ALSR+ 
Subjt:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP

Query:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--
                         +NQ++       +V+ E  N    L  + N  +R E  +N  L+ G+L   K ++++  ++ L   I+  YH+     H G  
Subjt:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--

Query:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH
              LR  T+K +  +I                       + PI    R WE +SMDFI  L +S G+  +FVVVD F+K A  +             
Subjt:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH

Query:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP
         FD +++A   +  E   +      S +W      Y    K S+P                           P+ W+D     QQ   N      ++T  
Subjt:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP

Query:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI
          + + +PAL      E P  +   DE  +E       +KEHL     KMK   DMK + + EF+ GD V +K    R  +    ++ KL+P + GP+ +
Subjt:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI

Query:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR
        +++ GP  Y L+LP +        FH+S L++
Subjt:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR

P0CT35 Transposon Tf2-2 polyprotein1.9e-4724.86Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW
        MP+G++ A   FQ  +N++        V+ ++DDIL++S++  EH +H++ VL++L+   L I                                   QW
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW

Query:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R
          P N +E+R FLG   Y R+F+                   +KW     +A E +++ +++ P+L   DF+    ++TDAS   VGA+L Q        
Subjt:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R

Query:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP
        P+ +YS  ++       V ++E++A++ +++ WR YL      F + TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  A+ALSR+ 
Subjt:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP

Query:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--
                         +NQ++       +V+ E  N    L  + N  +R E  +N  L+ G+L   K ++++  ++ L   I+  YH+     H G  
Subjt:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--

Query:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH
              LR  T+K +  +I                       + PI    R WE +SMDFI  L +S G+  +FVVVD F+K A  +             
Subjt:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH

Query:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP
         FD +++A   +  E   +      S +W      Y    K S+P                           P+ W+D     QQ   N      ++T  
Subjt:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP

Query:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI
          + + +PAL      E P  +   DE  +E       +KEHL     KMK   DMK + + EF+ GD V +K    R  +    ++ KL+P + GP+ +
Subjt:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI

Query:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR
        +++ GP  Y L+LP +        FH+S L++
Subjt:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR

P0CT36 Transposon Tf2-3 polyprotein1.9e-4724.86Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW
        MP+G++ A   FQ  +N++        V+ ++DDIL++S++  EH +H++ VL++L+   L I                                   QW
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW

Query:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R
          P N +E+R FLG   Y R+F+                   +KW     +A E +++ +++ P+L   DF+    ++TDAS   VGA+L Q        
Subjt:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R

Query:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP
        P+ +YS  ++       V ++E++A++ +++ WR YL      F + TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  A+ALSR+ 
Subjt:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP

Query:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--
                         +NQ++       +V+ E  N    L  + N  +R E  +N  L+ G+L   K ++++  ++ L   I+  YH+     H G  
Subjt:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--

Query:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH
              LR  T+K +  +I                       + PI    R WE +SMDFI  L +S G+  +FVVVD F+K A  +             
Subjt:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH

Query:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP
         FD +++A   +  E   +      S +W      Y    K S+P                           P+ W+D     QQ   N      ++T  
Subjt:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP

Query:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI
          + + +PAL      E P  +   DE  +E       +KEHL     KMK   DMK + + EF+ GD V +K    R  +    ++ KL+P + GP+ +
Subjt:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI

Query:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR
        +++ GP  Y L+LP +        FH+S L++
Subjt:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR

P0CT37 Transposon Tf2-4 polyprotein1.9e-4724.86Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW
        MP+G++ A   FQ  +N++        V+ ++DDIL++S++  EH +H++ VL++L+   L I                                   QW
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW

Query:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R
          P N +E+R FLG   Y R+F+                   +KW     +A E +++ +++ P+L   DF+    ++TDAS   VGA+L Q        
Subjt:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R

Query:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP
        P+ +YS  ++       V ++E++A++ +++ WR YL      F + TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  A+ALSR+ 
Subjt:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP

Query:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--
                         +NQ++       +V+ E  N    L  + N  +R E  +N  L+ G+L   K ++++  ++ L   I+  YH+     H G  
Subjt:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--

Query:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH
              LR  T+K +  +I                       + PI    R WE +SMDFI  L +S G+  +FVVVD F+K A  +             
Subjt:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH

Query:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP
         FD +++A   +  E   +      S +W      Y    K S+P                           P+ W+D     QQ   N      ++T  
Subjt:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP

Query:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI
          + + +PAL      E P  +   DE  +E       +KEHL     KMK   DMK + + EF+ GD V +K    R  +    ++ KL+P + GP+ +
Subjt:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI

Query:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR
        +++ GP  Y L+LP +        FH+S L++
Subjt:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR

P0CT41 Transposon Tf2-12 polyprotein1.9e-4724.86Show/hide
Query:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW
        MP+G++ A   FQ  +N++        V+ ++DDIL++S++  EH +H++ VL++L+   L I                                   QW
Subjt:  MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIK----------------------------------QW

Query:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R
          P N +E+R FLG   Y R+F+                   +KW     +A E +++ +++ P+L   DF+    ++TDAS   VGA+L Q        
Subjt:  PTPTNVREVRGFLGLTGYYRRFLGS-----------------FKWNEGAQEAFEKLQRAMMALPILALPDFNAPFEVDTDASGYGVGAMLMQNK-----R

Query:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP
        P+ +YS  ++       V ++E++A++ +++ WR YL      F + TD R+L  +   E      +  +W   L  ++FE+ Y+PG  N  A+ALSR+ 
Subjt:  PIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLG--RTFIVKTDQRSL--KFFLEQRVIQPQYQKWIAKLLGYSFEVVYKPGLENKAANALSRVP

Query:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--
                         +NQ++       +V+ E  N    L  + N  +R E  +N  L+ G+L   K ++++  ++ L   I+  YH+     H G  
Subjt:  PTVH-------------LNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGIL-RYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSG--

Query:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH
              LR  T+K +  +I                       + PI    R WE +SMDFI  L +S G+  +FVVVD F+K A  +             
Subjt:  -----FLR--TYKRLTGEI-----------------------IDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFL---------GLQH

Query:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP
         FD +++A   +  E   +      S +W      Y    K S+P                           P+ W+D     QQ   N      ++T  
Subjt:  PFDAKMVAELQDLSESFLER---VVSFSW------YEVEPKHSIP--------------------------PPDRWIDRGG--QQICRNLFKIIRRVTIP

Query:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI
          + + +PAL      E P  +   DE  +E       +KEHL     KMK   DMK + + EF+ GD V +K    R  +    ++ KL+P + GP+ +
Subjt:  SCLWKNTPALIYYGDRETP--NSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRV-EFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRI

Query:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR
        +++ GP  Y L+LP +        FH+S L++
Subjt:  VKRIGPVAYRLELPATA--TIHPVFHISQLKR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.2e-1246.84Show/hide
Query:  AIKQWPTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPF
        A+  WP P N  E+RGFLGLTGYYRRF+                 S KW E A  AF+ L+ A+  LP+LALPD   PF
Subjt:  AIKQWPTPTNVREVRGFLGLTGYYRRFL----------------GSFKWNEGAQEAFEKLQRAMMALPILALPDFNAPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGTTTGGACTCACAAACGCATCAGAAACTTTCCAATCACTAATGAATTCGGTTTTTAGATCGTACTTGAGGAAGTTCGTCTTGGTCTTCCTCGATGATATACTGGT
CTATAGTAGGAACTTAGAGGAACATTGTCAGCACATTGAATTAGTTCTGGAAGAATTGAGGAGGCATAAACTAGCAATCAAGCAATGGCCAACTCCAACAAATGTTCGGG
AAGTTAGAGGGTTTCTGGGGTTGACTGGTTACTACCGCCGTTTTCTGGGATCATTTAAATGGAATGAGGGAGCACAAGAAGCGTTTGAAAAGCTTCAACGAGCAATGATG
GCCTTGCCTATATTAGCTCTTCCAGATTTTAATGCACCATTCGAAGTAGATACAGATGCGTCAGGCTATGGGGTAGGGGCAATGCTAATGCAGAACAAGAGACCAATTGC
TTTTTATAGCCATACACTAGCCTTGCGAGACCGAGCCAAACCAGTATACGAGAGGGAATTAATGGCAGTAGTATTGACAGTCCAACGTTGGCGACCCTATTTGTTAGGAA
GGACATTCATAGTTAAGACAGATCAGCGATCACTTAAGTTCTTTCTGGAACAGAGAGTCATACAACCGCAATATCAGAAGTGGATTGCAAAATTGTTGGGTTATTCATTT
GAGGTGGTGTATAAACCGGGCTTGGAAAACAAGGCAGCAAATGCCCTTTCACGAGTACCACCAACTGTCCATCTTAACCAACTAACAGCCCCCACCTTGGTGGACATAAA
GGTAATCAGAGAGGAGGTTAACAAGGATGACTACTTGCAAGATATAATCAACAGGATTCAGAGGGAGGAGAAGGTAAAGAATTACACTCTGCAACAAGGAATACTGAGAT
ACAAAGGGAGATTAGTGATTGCGAAGAATTCTTCATTGATACCTATTATTATGCACACATATCATGACTCGGTCCTAGGAGGTCATTCCGGGTTCTTAAGAACGTATAAG
AGGCTGACAGGAGAGATTATTGACCCCATAGAGATACCAAATAGAGTATGGGAGGATATATCCATGGATTTTATTGAAAGACTAACTAAATCAATGGGGTTTGAAGTAAT
ATTCGTAGTGGTGGATTGCTTCAATAAATATGCTCACTTCCTCGGCCTTCAACATCCTTTTGACGCTAAGATGGTAGCTGAATTACAAGATCTTTCTGAGTCATTTTTGG
AAAGAGTTGTTTCGTTTAGCTGGTACGAAGTTGAACCGAAGCACAGCATACCACCCCCAGACAGATGGATAGACAGAGGTGGTCAACAGATCTGTAGAAATTTATTTAAG
ATCATTAGGCGTGTCACCATTCCAAGCTGTCTATGGAAGAACACCCCAGCCCTGATATATTATGGAGATCGTGAAACTCCCAACTCGGCTTTAGATGAGCAACTTAAGGA
AAGAGATGTAGCCTTGGGTGCTTTGAAGGAACATCTACGCATAGCCCAAGAAAAGATGAAGAGTTGTGCCGATATGAAGAGAAGACGTGTCGAATTTGAAGAAGGCGATA
AGGTGTTCCTAAAGATTAGGCCATACAGGCAGGTATCACTGCGGAAAAGGAGAAATGAGAAGTTGTCACCGAAGTATTTCGGGCCATATCGAATAGTGAAGAGGATTGGT
CCGGTTGCATATAGGCTGGAGTTACCGGCGACAGCAACAATTCATCCTGTGTTCCATATTTCACAGTTGAAAAGAGCCTTTGGGGAGAGTGCGAACAGCGAGGAGCTTTT
GCCATTCTTGACTGCAAATGATGAGTGGAAGGCTGTGCCTCAGGAGGTCTTCGATTATCAGAAAAACGAGAAAGGAGGATGGGAAGTCTTAATGAGTTGGAAGGGTCTAC
CGCATCATGAAGCAACATGGGAAAACTATGATGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAATTGGACCGGGAATGCAATGTTAGACCACCC
ATCACAGATCAATACAGTAGGAGAAAGAATAGGAAAGAGAAGCAGGAGGAGGAGTTAGTTATGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCGTTTGGACTCACAAACGCATCAGAAACTTTCCAATCACTAATGAATTCGGTTTTTAGATCGTACTTGAGGAAGTTCGTCTTGGTCTTCCTCGATGATATACTGGT
CTATAGTAGGAACTTAGAGGAACATTGTCAGCACATTGAATTAGTTCTGGAAGAATTGAGGAGGCATAAACTAGCAATCAAGCAATGGCCAACTCCAACAAATGTTCGGG
AAGTTAGAGGGTTTCTGGGGTTGACTGGTTACTACCGCCGTTTTCTGGGATCATTTAAATGGAATGAGGGAGCACAAGAAGCGTTTGAAAAGCTTCAACGAGCAATGATG
GCCTTGCCTATATTAGCTCTTCCAGATTTTAATGCACCATTCGAAGTAGATACAGATGCGTCAGGCTATGGGGTAGGGGCAATGCTAATGCAGAACAAGAGACCAATTGC
TTTTTATAGCCATACACTAGCCTTGCGAGACCGAGCCAAACCAGTATACGAGAGGGAATTAATGGCAGTAGTATTGACAGTCCAACGTTGGCGACCCTATTTGTTAGGAA
GGACATTCATAGTTAAGACAGATCAGCGATCACTTAAGTTCTTTCTGGAACAGAGAGTCATACAACCGCAATATCAGAAGTGGATTGCAAAATTGTTGGGTTATTCATTT
GAGGTGGTGTATAAACCGGGCTTGGAAAACAAGGCAGCAAATGCCCTTTCACGAGTACCACCAACTGTCCATCTTAACCAACTAACAGCCCCCACCTTGGTGGACATAAA
GGTAATCAGAGAGGAGGTTAACAAGGATGACTACTTGCAAGATATAATCAACAGGATTCAGAGGGAGGAGAAGGTAAAGAATTACACTCTGCAACAAGGAATACTGAGAT
ACAAAGGGAGATTAGTGATTGCGAAGAATTCTTCATTGATACCTATTATTATGCACACATATCATGACTCGGTCCTAGGAGGTCATTCCGGGTTCTTAAGAACGTATAAG
AGGCTGACAGGAGAGATTATTGACCCCATAGAGATACCAAATAGAGTATGGGAGGATATATCCATGGATTTTATTGAAAGACTAACTAAATCAATGGGGTTTGAAGTAAT
ATTCGTAGTGGTGGATTGCTTCAATAAATATGCTCACTTCCTCGGCCTTCAACATCCTTTTGACGCTAAGATGGTAGCTGAATTACAAGATCTTTCTGAGTCATTTTTGG
AAAGAGTTGTTTCGTTTAGCTGGTACGAAGTTGAACCGAAGCACAGCATACCACCCCCAGACAGATGGATAGACAGAGGTGGTCAACAGATCTGTAGAAATTTATTTAAG
ATCATTAGGCGTGTCACCATTCCAAGCTGTCTATGGAAGAACACCCCAGCCCTGATATATTATGGAGATCGTGAAACTCCCAACTCGGCTTTAGATGAGCAACTTAAGGA
AAGAGATGTAGCCTTGGGTGCTTTGAAGGAACATCTACGCATAGCCCAAGAAAAGATGAAGAGTTGTGCCGATATGAAGAGAAGACGTGTCGAATTTGAAGAAGGCGATA
AGGTGTTCCTAAAGATTAGGCCATACAGGCAGGTATCACTGCGGAAAAGGAGAAATGAGAAGTTGTCACCGAAGTATTTCGGGCCATATCGAATAGTGAAGAGGATTGGT
CCGGTTGCATATAGGCTGGAGTTACCGGCGACAGCAACAATTCATCCTGTGTTCCATATTTCACAGTTGAAAAGAGCCTTTGGGGAGAGTGCGAACAGCGAGGAGCTTTT
GCCATTCTTGACTGCAAATGATGAGTGGAAGGCTGTGCCTCAGGAGGTCTTCGATTATCAGAAAAACGAGAAAGGAGGATGGGAAGTCTTAATGAGTTGGAAGGGTCTAC
CGCATCATGAAGCAACATGGGAAAACTATGATGACTTTCAGCAATCCTTCCCCGATTTCCACCTTGAGGACAAGGTGAAATTGGACCGGGAATGCAATGTTAGACCACCC
ATCACAGATCAATACAGTAGGAGAAAGAATAGGAAAGAGAAGCAGGAGGAGGAGTTAGTTATGTAA
Protein sequenceShow/hide protein sequence
MPFGLTNASETFQSLMNSVFRSYLRKFVLVFLDDILVYSRNLEEHCQHIELVLEELRRHKLAIKQWPTPTNVREVRGFLGLTGYYRRFLGSFKWNEGAQEAFEKLQRAMM
ALPILALPDFNAPFEVDTDASGYGVGAMLMQNKRPIAFYSHTLALRDRAKPVYERELMAVVLTVQRWRPYLLGRTFIVKTDQRSLKFFLEQRVIQPQYQKWIAKLLGYSF
EVVYKPGLENKAANALSRVPPTVHLNQLTAPTLVDIKVIREEVNKDDYLQDIINRIQREEKVKNYTLQQGILRYKGRLVIAKNSSLIPIIMHTYHDSVLGGHSGFLRTYK
RLTGEIIDPIEIPNRVWEDISMDFIERLTKSMGFEVIFVVVDCFNKYAHFLGLQHPFDAKMVAELQDLSESFLERVVSFSWYEVEPKHSIPPPDRWIDRGGQQICRNLFK
IIRRVTIPSCLWKNTPALIYYGDRETPNSALDEQLKERDVALGALKEHLRIAQEKMKSCADMKRRRVEFEEGDKVFLKIRPYRQVSLRKRRNEKLSPKYFGPYRIVKRIG
PVAYRLELPATATIHPVFHISQLKRAFGESANSEELLPFLTANDEWKAVPQEVFDYQKNEKGGWEVLMSWKGLPHHEATWENYDDFQQSFPDFHLEDKVKLDRECNVRPP
ITDQYSRRKNRKEKQEEELVM