; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G08770 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G08770
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:6529770..6532840
RNA-Seq ExpressionCSPI04G08770
SyntenyCSPI04G08770
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR016197 - Chromo-like domain superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025132.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.2e-21147.67Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+K+D SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYANL KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELPS A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

KAA0049630.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.0e-21047.45Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WI KLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELP+ A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

KAA0049776.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.7e-21147.88Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELPS A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRN--------------HEQIQQ------------------LVPWKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N              HE I Q                  LV WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRN--------------HEQIQQ------------------LVPWKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

TYK15990.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.5e-21147.56Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELP+ A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

TYK23090.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-21147.67Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELPS A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

TrEMBL top hitse value%identityAlignment
A0A5A7SIV7 Ty3/gypsy retrotransposon protein5.8e-21247.67Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+K+D SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYANL KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELPS A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

A0A5A7U2S1 Ty3/gypsy retrotransposon protein4.9e-21147.45Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WI KLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELP+ A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

A0A5A7U6J3 Ty3/gypsy retrotransposon protein1.3e-21147.88Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELPS A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRN--------------HEQIQQ------------------LVPWKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N              HE I Q                  LV WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRN--------------HEQIQQ------------------LVPWKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

A0A5D3CXB1 Ty3/gypsy retrotransposon protein1.7e-21147.56Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELP+ A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

A0A5D3DI73 Ty3/gypsy retrotransposon protein7.5e-21247.67Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------
        ME+LVDEML +GIIRPSTSPYSSP+LLV+KKD SWRFCVDY ALNNVTVPDKFPI VV+EL DELN  ++FSKIDLK G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG---------------------

Query:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
                                                     +G+ EH QHLE+VLEILRE+ LYAN  KC FAK +I YLGH+IS+KGIEVDPEKI
Subjt:  ---------------------------------------------RGIVEHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR
        RA+KEWP P  V ++RGF+GLT YYR+FVQNYGSI+ PLTQL K+G       TE   ++ + A+MTLP+LAMPDFNLPF++E+DASGFGVGAVLVQ K+
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI-----TE---QRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKR

Query:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH
        P+A+FSK L  RDRARPVYEREL+ V                                       +WIAKLLGYSFEV+Y PGLENKAADALSR+ PT H
Subjt:  PIAFFSKTLCIRDRARPVYERELIVV---------------------------------------KWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIH

Query:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------
        +N ++APALLD+  IQ+E                           QG+LK+KGRLVLSK   L+PTI+HTYHDS                          
Subjt:  INHISAPALLDLAKIQEE---------------------------QGVLKYKGRLVLSKNLVLLPTILHTYHDS--------------------------

Query:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV
                     KNK+ AL+PAGLL PLEIP+ +WSDISMDFIEGLPKS G EVI VVVDRLSKYAHF+ +KHP+TA +VA  FVKE+V LHG+P++IV
Subjt:  -------------KNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIV

Query:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM
        SDRDKIF+SHFW EMFKL+GTKL+RSS+YHPQ   +                    + WS                                   +YG+M
Subjt:  SDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVK--------------------QGWS----------------------------------TKYGDM

Query:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI
        ETPNSTLD+QL++R                                          P        KRNEKL+  +FGP+KILERIG VAY+LELPS A+I
Subjt:  ETPNSTLDKQLKER----------------------------------------CDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASI

Query:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI
        HPVFHVSQLKK + N ++IQ L P                                WKGLPPHEATWE+    K QFP+F LEDKV LE ES+AR PI  
Subjt:  HPVFHVSQLKKPLRNHEQIQQLVP--------------------------------WKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFI

Query:  TYSRKKKGKDRITCGAGGEGK
        TY+R+K  KD+ T G G E K
Subjt:  TYSRKKKGKDRITCGAGGEGK

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.8e-4623.6Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG-------------------RG
        M   +++ L++GIIR S +  + P++ V KK+ + R  VDY  LN    P+ +P+ ++++LL ++  + +F+K+DLK+                    RG
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG-------------------RG

Query:  IV-----------------------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
        +                                                EH +H++ VL+ L+   L  N AKC F + ++ ++G+ ISEKG     E I
Subjt:  IV-----------------------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAG----------GITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQA
          + +W  P    ++R F+G   Y RKF+     + +PL  L K               E  KQ  +++ P+L   DF+    +ETDAS   VGAVL Q 
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAG----------GITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQA

Query:  K-----RPIAFFSKTLCIRDRARPVYERELIVV-------------------------------------------KWIAKLLGYSFEVVYNPGLENKAA
               P+ ++S  +        V ++E++ +                                           +W   L  ++FE+ Y PG  N  A
Subjt:  K-----RPIAFFSKTLCIRDRARPVYERELIVV-------------------------------------------KWIAKLLGYSFEVVYNPGLENKAA

Query:  DALSRMTPTIH----------INHISAPALLDLAKIQ---------------------EEQGVLKYKGRLVLSKNLVLLP-------TILHTYHDSK---
        DALSR+               IN ++  ++ D  K Q                      E+ +    G L+ SK+ +LLP       TI+  YH+     
Subjt:  DALSRMTPTIH----------INHISAPALLDLAKIQ---------------------EEQGVLKYKGRLVLSKNLVLLP-------TILHTYHDSK---

Query:  ------------------------------------NKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTA
                                            NK+    P G L P+      W  +SMDFI  LP+S+G   +FVVVDR SK A  +      TA
Subjt:  ------------------------------------NKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTA

Query:  SSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERA-------ST
           A  F + ++   G P+ I++D D IF S  W++        +  S  Y PQ           G  E  N T++K L+  C                +
Subjt:  SSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERA-------ST

Query:  KRNEKLTTNFFGPFKILERIGIVAYRLELPSLA
          N   +     PF+I+ R       LELPS +
Subjt:  KRNEKLTTNFFGPFKILERIGIVAYRLELPSLA

P0CT41 Transposon Tf2-12 polyprotein4.8e-4623.6Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG-------------------RG
        M   +++ L++GIIR S +  + P++ V KK+ + R  VDY  LN    P+ +P+ ++++LL ++  + +F+K+DLK+                    RG
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG-------------------RG

Query:  IV-----------------------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
        +                                                EH +H++ VL+ L+   L  N AKC F + ++ ++G+ ISEKG     E I
Subjt:  IV-----------------------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAG----------GITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQA
          + +W  P    ++R F+G   Y RKF+     + +PL  L K               E  KQ  +++ P+L   DF+    +ETDAS   VGAVL Q 
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAG----------GITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQA

Query:  K-----RPIAFFSKTLCIRDRARPVYERELIVV-------------------------------------------KWIAKLLGYSFEVVYNPGLENKAA
               P+ ++S  +        V ++E++ +                                           +W   L  ++FE+ Y PG  N  A
Subjt:  K-----RPIAFFSKTLCIRDRARPVYERELIVV-------------------------------------------KWIAKLLGYSFEVVYNPGLENKAA

Query:  DALSRMTPTIH----------INHISAPALLDLAKIQ---------------------EEQGVLKYKGRLVLSKNLVLLP-------TILHTYHDSK---
        DALSR+               IN ++  ++ D  K Q                      E+ +    G L+ SK+ +LLP       TI+  YH+     
Subjt:  DALSRMTPTIH----------INHISAPALLDLAKIQ---------------------EEQGVLKYKGRLVLSKNLVLLP-------TILHTYHDSK---

Query:  ------------------------------------NKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTA
                                            NK+    P G L P+      W  +SMDFI  LP+S+G   +FVVVDR SK A  +      TA
Subjt:  ------------------------------------NKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTA

Query:  SSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERA-------ST
           A  F + ++   G P+ I++D D IF S  W++        +  S  Y PQ           G  E  N T++K L+  C                +
Subjt:  SSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERA-------ST

Query:  KRNEKLTTNFFGPFKILERIGIVAYRLELPSLA
          N   +     PF+I+ R       LELPS +
Subjt:  KRNEKLTTNFFGPFKILERIGIVAYRLELPSLA

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.5e-4725.8Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTGR--------------------
        + K+V ++L    I PS SP SSP++LV KKD ++R CVDY  LN  T+ D FP+  +D LL  +    +F+ +DL +G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTGR--------------------

Query:  ---------GIV-----------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKIRA
                 G+V                                   EH +HL+ VLE L+   L     KC FA E+  +LG+ I  + I     K  A
Subjt:  ---------GIV-----------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKIRA

Query:  IKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPL-------TQLRKAGGITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKRP--
        I+++P P  V   + F+G+  YYR+F+ N   IA P+       +Q  +      ++ + A+   P+L   +    +++ TDAS  G+GAVL +      
Subjt:  IKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPL-------TQLRKAGGITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKRP--

Query:  ----IAFFSKTLCIRDRARPVYERELI---------------------------------------VVKWIAKLLGYSFEVVYNPGLENKAADALSRMTP
            + +FSK+L    +  P  E EL+                                       V +W+  L  Y F + Y  G +N  ADA+SR   
Subjt:  ----IAFFSKTLCIRDRARPVYERELI---------------------------------------VVKWIAKLLGYSFEVVYNPGLENKAADALSRMTP

Query:  TI-----------------HINHISAPALLDLAKI-------QEEQGVLKYKGRLVLSKNL----------------VLLP-----TILHTYHD------
        TI                   + + +  L+ + ++       ++      Y+ +L LS+                  +++P      ++  YHD      
Subjt:  TI-----------------HINHISAPALLDLAKI-------QEEQGVLKYKGRLVLSKNL----------------VLLP-----TILHTYHD------

Query:  --SKNKTLA-LTPA-------------------------------GLLSPLEIPNEVWSDISMDFIEGLPKSAGK-EVIFVVVDRLSKYAHFIAIKHPFT
              TLA ++P                                GLL PL I    W DISMDF+ GLP ++    +I VVVDR SK AHFIA +    
Subjt:  --SKNKTLA-LTPA-------------------------------GLLSPLEIPNEVWSDISMDFIEGLPKSAGK-EVIFVVVDRLSKYAHFIAIKHPFT

Query:  ASSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQ
        A+ +     + I   HG+P+TI SDRD    +  ++E+ K  G K   SSA HPQ
Subjt:  ASSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQ

Q99315 Transposon Ty3-G Gag-Pol polyprotein4.3e-4726.11Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTGR--------------------
        + K+V ++L    I PS SP SSP++LV KKD ++R CVDY  LN  T+ D FP+  +D LL  +    +F+ +DL +G                     
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTGR--------------------

Query:  ---------GIV-----------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKIRA
                 G+V                                   EH +HL+ VLE L+   L     KC FA E+  +LG+ I  + I     K  A
Subjt:  ---------GIV-----------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKIRA

Query:  IKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQL--RKAGGITEQRKQM-----AIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKRP--
        I+++P P  V   + F+G+  YYR+F+ N   IA P+      K+    +Q K +     A+   P+L   +    +++ TDAS  G+GAVL +      
Subjt:  IKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQL--RKAGGITEQRKQM-----AIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQAKRP--

Query:  ----IAFFSKTLCIRDRARPVYERELI---------------------------------------VVKWIAKLLGYSFEVVYNPGLENKAADALSRMTP
            + +FSK+L    +  P  E EL+                                       V +W+  L  Y F + Y  G +N  ADA+SR   
Subjt:  ----IAFFSKTLCIRDRARPVYERELI---------------------------------------VVKWIAKLLGYSFEVVYNPGLENKAADALSRMTP

Query:  TI-----------------HINHISAPALLDLAKI-------QEEQGVLKYKGRLVLSKNL----------------VLLP-----TILHTYHD------
        TI                   + + +  L+ + ++       ++      Y+ +L LS+                  +++P      ++  YHD      
Subjt:  TI-----------------HINHISAPALLDLAKI-------QEEQGVLKYKGRLVLSKNL----------------VLLP-----TILHTYHD------

Query:  --SKNKTLA-LTPA-------------------------------GLLSPLEIPNEVWSDISMDFIEGLPKSAGK-EVIFVVVDRLSKYAHFIAIKHPFT
              TLA ++P                                GLL PL I    W DISMDF+ GLP ++    +I VVVDR SK AHFIA +    
Subjt:  --SKNKTLA-LTPA-------------------------------GLLSPLEIPNEVWSDISMDFIEGLPKSAGK-EVIFVVVDRLSKYAHFIAIKHPFT

Query:  ASSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQ
        A+ +     + I   HG+P+TI SDRD    +  ++E+ K  G K   SSA HPQ
Subjt:  ASSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQ

Q9UR07 Transposon Tf2-11 polyprotein4.8e-4623.6Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG-------------------RG
        M   +++ L++GIIR S +  + P++ V KK+ + R  VDY  LN    P+ +P+ ++++LL ++  + +F+K+DLK+                    RG
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTG-------------------RG

Query:  IV-----------------------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI
        +                                                EH +H++ VL+ L+   L  N AKC F + ++ ++G+ ISEKG     E I
Subjt:  IV-----------------------------------------------EHAQHLEIVLEILRENALYANLAKCSFAKEKIGYLGHFISEKGIEVDPEKI

Query:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAG----------GITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQA
          + +W  P    ++R F+G   Y RKF+     + +PL  L K               E  KQ  +++ P+L   DF+    +ETDAS   VGAVL Q 
Subjt:  RAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAG----------GITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAVLVQA

Query:  K-----RPIAFFSKTLCIRDRARPVYERELIVV-------------------------------------------KWIAKLLGYSFEVVYNPGLENKAA
               P+ ++S  +        V ++E++ +                                           +W   L  ++FE+ Y PG  N  A
Subjt:  K-----RPIAFFSKTLCIRDRARPVYERELIVV-------------------------------------------KWIAKLLGYSFEVVYNPGLENKAA

Query:  DALSRMTPTIH----------INHISAPALLDLAKIQ---------------------EEQGVLKYKGRLVLSKNLVLLP-------TILHTYHDSK---
        DALSR+               IN ++  ++ D  K Q                      E+ +    G L+ SK+ +LLP       TI+  YH+     
Subjt:  DALSRMTPTIH----------INHISAPALLDLAKIQ---------------------EEQGVLKYKGRLVLSKNLVLLP-------TILHTYHDSK---

Query:  ------------------------------------NKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTA
                                            NK+    P G L P+      W  +SMDFI  LP+S+G   +FVVVDR SK A  +      TA
Subjt:  ------------------------------------NKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTA

Query:  SSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERA-------ST
           A  F + ++   G P+ I++D D IF S  W++        +  S  Y PQ           G  E  N T++K L+  C                +
Subjt:  SSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLSGTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERA-------ST

Query:  KRNEKLTTNFFGPFKILERIGIVAYRLELPSLA
          N   +     PF+I+ R       LELPS +
Subjt:  KRNEKLTTNFFGPFKILERIGIVAYRLELPSLA

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein1.2e-0457.14Show/hide
Query:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSW
        ++  + EML   II+PS SPYSSP+LLV+KKD  W
Subjt:  MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSW

ATMG00860.1 DNA/RNA polymerases superfamily protein1.5e-2645.74Show/hide
Query:  HLEIVLEILRENALYANLAKCSFAKEKIGYLG--HFISEKGIEVDPEKIRAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI--T
        HL +VL+I  ++  YAN  KC+F + +I YLG  H IS +G+  DP K+ A+  WP P    ++RGF+GLT YYR+FV+NYG I  PLT+L K   +  T
Subjt:  HLEIVLEILRENALYANLAKCSFAKEKIGYLG--HFISEKGIEVDPEKIRAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGI--T

Query:  E------QRKQMAIMTLPILAMPDFNLPF
        E      +  + A+ TLP+LA+PD  LPF
Subjt:  E------QRKQMAIMTLPILAMPDFNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGCTGGTTGATGAGATGTTAAGGACTGGAATTATAAGGCCAAGTACTAGCCCTTACTCCAGCCCAATTCTGTTAGTGAAAAAGAAGGATTGCAGCTGGAGGTT
CTGCGTGGATTATTGTGCCTTAAACAATGTGACAGTCCCTGATAAATTTCCAATTCTAGTGGTGGATGAATTATTGGATGAGCTGAACGAGACCAATATGTTTTCAAAGA
TAGATCTCAAAACTGGTAGGGGAATAGTAGAACACGCGCAACACTTAGAAATAGTATTGGAAATCCTACGGGAGAACGCATTGTATGCCAATCTGGCTAAATGCAGTTTT
GCCAAGGAAAAAATTGGCTATCTCGGTCACTTCATATCTGAGAAGGGAATTGAGGTGGATCCAGAGAAGATTAGAGCTATCAAGGAATGGCCAGTGCCTACTGGTGTGAT
AGATGTCAGGGGGTTCATAGGCCTTACCAGGTATTATAGGAAGTTTGTGCAGAATTATGGAAGCATTGCCAATCCATTGACACAATTACGGAAAGCAGGGGGTATCACTG
AACAAAGGAAGCAAATGGCTATAATGACACTTCCAATACTAGCAATGCCAGATTTTAACCTTCCATTTCAAGTAGAGACAGACGCATCTGGTTTCGGTGTGGGGGCAGTA
TTGGTTCAAGCCAAGAGACCCATTGCTTTCTTTAGTAAAACACTATGCATAAGGGATAGAGCTCGTCCGGTTTATGAAAGGGAACTAATAGTAGTGAAGTGGATAGCCAA
GCTTTTGGGATACTCGTTTGAGGTGGTTTACAACCCGGGGTTAGAAAATAAGGCAGCCGATGCCTTGTCCAGAATGACACCTACAATTCACATCAACCACATCTCCGCTC
CAGCATTGCTTGACTTAGCTAAGATCCAAGAAGAGCAGGGAGTGTTGAAATATAAAGGGAGACTGGTGTTGTCAAAGAACTTAGTGCTGCTGCCAACTATTCTGCACACA
TACCATGACTCCAAAAACAAGACCCTTGCTTTGACCCCTGCTGGATTGCTTTCACCTTTGGAAATTCCTAATGAAGTATGGTCGGATATCTCTATGGATTTTATAGAAGG
CCTACCGAAATCGGCTGGGAAAGAAGTGATTTTTGTGGTAGTGGATCGCTTGAGCAAATATGCTCACTTTATAGCCATTAAGCATCCCTTCACAGCCAGCTCTGTAGCAG
CAGAATTTGTTAAAGAGATAGTGGGACTCCACGGGTACCCCCAAACGATCGTGTCAGATAGAGATAAGATCTTTGTCAGTCATTTCTGGAGGGAAATGTTCAAGTTATCG
GGTACTAAGTTACATCGAAGTTCAGCTTATCATCCACAACCAACGGTCAAACAAGGGTGGTCAACAAAGTATGGAGATATGGAGACTCCTAACTCCACCTTAGACAAGCA
GTTGAAAGAGAGATGTGATCCTAGGGGCACTGAAAGAGCATCTACGAAAAGAAATGAGAAGCTGACAACCAATTTTTTTGGCCCCTTTAAAATCTTGGAGAGAATTGGAA
TCGTGGCCTATCGGCTGGAATTACCCAGTTTAGCCTCTATTCATCCTGTCTTCCATGTGTCCCAACTAAAGAAACCTCTCAGAAATCATGAGCAAATACAACAGCTAGTT
CCTTGGAAGGGGTTGCCACCTCATGAAGCGACTTGGGAGGACTGTCAATACTTCAAACAGCAATTCCCAGATTTTCAGCTTGAGGACAAGGTGGCTTTAGAAGTGGAAAG
TAATGCTAGGTCCCCGATATTCATCACGTATAGTAGGAAAAAGAAAGGGAAGGACAGAATTACATGTGGAGCTGGGGGAGAAGGAAAAGGGCAGGACCATCGTCTGAAGA
AAGCAGGGAGTTCAGGTTTTATTGTTCTTGGTTTTCGTTGTTCATTGCTGATTGTTGCTTTATTGTTATTAGAATTTCCTATTCTTAGTTTTATTTCACGTCAATTGTTA
AACATTGTGTTCTGGAGCTTTCTGTGGTATTTCGAACTCTATATTGTGACTGTTGGAAGTTTTATTGATTACTGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGCTGGTTGATGAGATGTTAAGGACTGGAATTATAAGGCCAAGTACTAGCCCTTACTCCAGCCCAATTCTGTTAGTGAAAAAGAAGGATTGCAGCTGGAGGTT
CTGCGTGGATTATTGTGCCTTAAACAATGTGACAGTCCCTGATAAATTTCCAATTCTAGTGGTGGATGAATTATTGGATGAGCTGAACGAGACCAATATGTTTTCAAAGA
TAGATCTCAAAACTGGTAGGGGAATAGTAGAACACGCGCAACACTTAGAAATAGTATTGGAAATCCTACGGGAGAACGCATTGTATGCCAATCTGGCTAAATGCAGTTTT
GCCAAGGAAAAAATTGGCTATCTCGGTCACTTCATATCTGAGAAGGGAATTGAGGTGGATCCAGAGAAGATTAGAGCTATCAAGGAATGGCCAGTGCCTACTGGTGTGAT
AGATGTCAGGGGGTTCATAGGCCTTACCAGGTATTATAGGAAGTTTGTGCAGAATTATGGAAGCATTGCCAATCCATTGACACAATTACGGAAAGCAGGGGGTATCACTG
AACAAAGGAAGCAAATGGCTATAATGACACTTCCAATACTAGCAATGCCAGATTTTAACCTTCCATTTCAAGTAGAGACAGACGCATCTGGTTTCGGTGTGGGGGCAGTA
TTGGTTCAAGCCAAGAGACCCATTGCTTTCTTTAGTAAAACACTATGCATAAGGGATAGAGCTCGTCCGGTTTATGAAAGGGAACTAATAGTAGTGAAGTGGATAGCCAA
GCTTTTGGGATACTCGTTTGAGGTGGTTTACAACCCGGGGTTAGAAAATAAGGCAGCCGATGCCTTGTCCAGAATGACACCTACAATTCACATCAACCACATCTCCGCTC
CAGCATTGCTTGACTTAGCTAAGATCCAAGAAGAGCAGGGAGTGTTGAAATATAAAGGGAGACTGGTGTTGTCAAAGAACTTAGTGCTGCTGCCAACTATTCTGCACACA
TACCATGACTCCAAAAACAAGACCCTTGCTTTGACCCCTGCTGGATTGCTTTCACCTTTGGAAATTCCTAATGAAGTATGGTCGGATATCTCTATGGATTTTATAGAAGG
CCTACCGAAATCGGCTGGGAAAGAAGTGATTTTTGTGGTAGTGGATCGCTTGAGCAAATATGCTCACTTTATAGCCATTAAGCATCCCTTCACAGCCAGCTCTGTAGCAG
CAGAATTTGTTAAAGAGATAGTGGGACTCCACGGGTACCCCCAAACGATCGTGTCAGATAGAGATAAGATCTTTGTCAGTCATTTCTGGAGGGAAATGTTCAAGTTATCG
GGTACTAAGTTACATCGAAGTTCAGCTTATCATCCACAACCAACGGTCAAACAAGGGTGGTCAACAAAGTATGGAGATATGGAGACTCCTAACTCCACCTTAGACAAGCA
GTTGAAAGAGAGATGTGATCCTAGGGGCACTGAAAGAGCATCTACGAAAAGAAATGAGAAGCTGACAACCAATTTTTTTGGCCCCTTTAAAATCTTGGAGAGAATTGGAA
TCGTGGCCTATCGGCTGGAATTACCCAGTTTAGCCTCTATTCATCCTGTCTTCCATGTGTCCCAACTAAAGAAACCTCTCAGAAATCATGAGCAAATACAACAGCTAGTT
CCTTGGAAGGGGTTGCCACCTCATGAAGCGACTTGGGAGGACTGTCAATACTTCAAACAGCAATTCCCAGATTTTCAGCTTGAGGACAAGGTGGCTTTAGAAGTGGAAAG
TAATGCTAGGTCCCCGATATTCATCACGTATAGTAGGAAAAAGAAAGGGAAGGACAGAATTACATGTGGAGCTGGGGGAGAAGGAAAAGGGCAGGACCATCGTCTGAAGA
AAGCAGGGAGTTCAGGTTTTATTGTTCTTGGTTTTCGTTGTTCATTGCTGATTGTTGCTTTATTGTTATTAGAATTTCCTATTCTTAGTTTTATTTCACGTCAATTGTTA
AACATTGTGTTCTGGAGCTTTCTGTGGTATTTCGAACTCTATATTGTGACTGTTGGAAGTTTTATTGATTACTGTTGA
Protein sequenceShow/hide protein sequence
MEKLVDEMLRTGIIRPSTSPYSSPILLVKKKDCSWRFCVDYCALNNVTVPDKFPILVVDELLDELNETNMFSKIDLKTGRGIVEHAQHLEIVLEILRENALYANLAKCSF
AKEKIGYLGHFISEKGIEVDPEKIRAIKEWPVPTGVIDVRGFIGLTRYYRKFVQNYGSIANPLTQLRKAGGITEQRKQMAIMTLPILAMPDFNLPFQVETDASGFGVGAV
LVQAKRPIAFFSKTLCIRDRARPVYERELIVVKWIAKLLGYSFEVVYNPGLENKAADALSRMTPTIHINHISAPALLDLAKIQEEQGVLKYKGRLVLSKNLVLLPTILHT
YHDSKNKTLALTPAGLLSPLEIPNEVWSDISMDFIEGLPKSAGKEVIFVVVDRLSKYAHFIAIKHPFTASSVAAEFVKEIVGLHGYPQTIVSDRDKIFVSHFWREMFKLS
GTKLHRSSAYHPQPTVKQGWSTKYGDMETPNSTLDKQLKERCDPRGTERASTKRNEKLTTNFFGPFKILERIGIVAYRLELPSLASIHPVFHVSQLKKPLRNHEQIQQLV
PWKGLPPHEATWEDCQYFKQQFPDFQLEDKVALEVESNARSPIFITYSRKKKGKDRITCGAGGEGKGQDHRLKKAGSSGFIVLGFRCSLLIVALLLLEFPILSFISRQLL
NIVFWSFLWYFELYIVTVGSFIDYC