; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021821 (gene) of Chayote v1 genome

Gene IDSed0021821
OrganismSechium edule (Chayote v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationLG11:7070995..7075687
RNA-Seq ExpressionSed0021821
SyntenySed0021821
Gene Ontology termsNA
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN81099.1 hypothetical protein VITISV_017741 [Vitis vinifera]2.4e-6135.91Show/hide
Query:  LSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEA
        L Q F  +  A+  + K+ LQ  KKGGST+ EY +KIK CVD L +VG  +  +DH+  IL GL  +Y+S V+ +  +   ++V++  ALLM HE+R+E 
Subjt:  LSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEA

Query:  KAMSIESVHPVANVHIQHSSPSFKDNN--------SHQQSNNGNQRGRGRSGN---------------NRGGRSNL---------------------NNR
           S++S     + H+  S+   K N         + Q S++G   G GR G+               N  GRSN                      N  
Subjt:  KAMSIESVHPVANVHIQHSSPSFKDNN--------SHQQSNNGNQRGRGRSGN---------------NRGGRSNL---------------------NNR

Query:  NKLQCYHCGCYGYTTNRCYYINDFSQQHPR------YSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLS
         K  C  CG  G+   +CYY  D + Q P+       SPRA             Y S   Q+  V     +P +    D NWYPD GA+NH+T N  NL 
Subjt:  NKLQCYHCGCYGYTTNRCYYINDFSQQHPR------YSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLS

Query:  VSSEYQGNNQVHMGNGACLATTHCGYGSIMS--SNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLN
         S+E+ G NQVH+GNG  L+  H G    +S  S++   LN LLHVP+ITKNL+SVS+FA+DN V+F+FH   C VKD+ +   L+ G + +GLY FD  
Subjt:  VSSEYQGNNQVHMGNGACLATTHCGYGSIMS--SNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLN

Query:  SHVQNPTSHVSNVVEYSLPYSNSVVSNSPSN---GSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSN-KSSFCDVCALGKNHALPFSKISYSLH
              +SH++     SL  S SVV++S S+    + +++T D+WH+RLGHPS +T ++++  C  ++ H +   S+FC  C LGK H  PFS      H
Subjt:  SHVQNPTSHVSNVVEYSLPYSNSVVSNSPSN---GSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSN-KSSFCDVCALGKNHALPFSKISYSLH

Query:  QTFT
         T+T
Subjt:  QTFT

KAA0048297.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.2e-8542.86Show/hide
Query:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA
        ++W  Q R         +   IL +MLH K+AK+IW  L  IF+ R LAQ M+ K+ L  IKKG   L EYF KI +CVD L ++ K +  +DHI+YILA
Subjt:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA

Query:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL
        GLG +Y SM+SVI+ +  S +VQ+ M+LL+T E++ E+K +S E+  P  N+  Q     + S+   N +   NN +   RG  GN R  R    NRNK 
Subjt:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL

Query:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN
        QC  C   GY+ +RC++         RY+PR+          +  Y +M    Q  A +A L       D NWYPD GATNHLT+++ NLS+ SEY G N
Subjt:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN

Query:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS
        Q++  NG+ L  TH G  S  SS    + F LN+LL VP+ITKNLISVSQFA+DN V+F+FHP+ C VKD  + Q LL+G L++GLY+F +     +   
Subjt:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS

Query:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS
        H SN    + P  N+VV  SN+P         LD+WHRRLGHP L   ++++ N + +     NK +FC+ CALGK+HALPFS
Subjt:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS

KZV26181.1 hypothetical protein F511_06348 [Dorcoceras hygrometricum]1.9e-7438.57Show/hide
Query:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT
        + +  S   +M+  +T+  +W  ++Q+F  R+ A+VM+ K  LQT+KKG  ++ +Y  K+K  +D+L A G  IP +D I++IL G+G EY+S+V  +T+
Subjt:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT

Query:  KICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQ-----RGRGRSGNNRGGRSNLNNRNKLQCYHCGCYGYTTN
        ++ S ++ +  ALL+ HE R+E         + +   H    S +     S +++ N +Q     RGRGR  N RGGR   +N  +  C  CG  G+   
Subjt:  KICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQ-----RGRGRSGNNRGGRSNLNNRNKLQCYHCGCYGYTTN

Query:  RCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGACLATT
         CYY  D       + P++   ++ T  Q     S  Y     A       +    +  WYPD GA++H+TN++GNLSVSSEY G ++V +GNGA L+ +
Subjt:  RCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGACLATT

Query:  HCGYGSI--MSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSN
        + G  ++    S+R F L +LLHVP ITKNLISVS+FA DN VYF+FHPS+CLVKD A++  LLRGTLHNGLYRF+L S +  P    + +     P   
Subjt:  HCGYGSI--MSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSN

Query:  SVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKISYSLHQTF
         V   SP    +  NTLD WH RLGHPS++T + ++ +C   +    N  SFC  C LGKNH LPF + + +    F
Subjt:  SVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKISYSLHQTF

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]1.5e-6335.27Show/hide
Query:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT
        S IGS+ L +++   +A ++W  +SQ FN ++ A+VM  KS +Q +KK G T+ +Y +K+K   D+L   G  I   DHI+ I+ GLG EY+S+++VI++
Subjt:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT

Query:  KICSYTVQDFMALLMTHETRLEAKAMSIE-SVHPVANVHIQHSSPSFKDN---NSHQQSNN---GNQRGRGRSGNNRG-GRSNLNNRNKLQCYHCGCYGY
        K  S ++Q   + L+ HE R+  K  S + SV+  +    +  S S+  N   +S  Q+ N   GNQ  RG   +NRG GR       K QC  C  +G+
Subjt:  KICSYTVQDFMALLMTHETRLEAKAMSIE-SVHPVANVHIQHSSPSFKDN---NSHQQSNN---GNQRGRGRSGNNRG-GRSNLNNRNKLQCYHCGCYGY

Query:  TTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPN--------------------WYPDPGATNHLTNNMGNL
        T +RC+Y  D     P +    P       P  +  G+       ++   N+    Y    N                    W+PD GATNH+T+++GNL
Subjt:  TTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPN--------------------WYPDPGATNHLTNNMGNL

Query:  SVSSEYQGNNQVHMGNGACLATTHCG---YGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFD
        +  +EY GN+++HMGNG  L  +H G   + S  S N+V  L ++L VP I KNL+SVSQFARDN+VYF+FHP  C VKD++++  LL+G LH GLY+F+
Subjt:  SVSSEYQGNNQVHMGNGACLATTHCG---YGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFD

Query:  LNSHVQNPTSHVS--------NVVEYSLPYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVK-NCMPSLLHCSNKSSFCDVCALGKNHALPF
        L+  +    S +S             SL ++++  S+ P   +   +  D+WH+RLGHP+      ++  N +P      + SS C  C LGK+H LPF
Subjt:  LNSHVQNPTSHVS--------NVVEYSLPYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVK-NCMPSLLHCSNKSSFCDVCALGKNHALPF

TYK10642.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]3.2e-8542.86Show/hide
Query:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA
        ++W  Q R         +   IL +MLH K+AK+IW  L  IF+ R LAQ M+ K+ L  IKKG   L EYF KI +CVD L ++ K +  +DHI+YILA
Subjt:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA

Query:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL
        GLG +Y SM+SVI+ +  S +VQ+ M+LL+T E++ E+K +S E+  P  N+  Q     + S+   N +   NN +   RG  GN R  R    NRNK 
Subjt:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL

Query:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN
        QC  C   GY+ +RC++         RY+PR+          +  Y +M    Q  A +A L       D NWYPD GATNHLT+++ NLS+ SEY G N
Subjt:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN

Query:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS
        Q++  NG+ L  TH G  S  SS    + F LN+LL VP+ITKNLISVSQFA+DN V+F+FHP+ C VKD  + Q LL+G L++GLY+F +     +   
Subjt:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS

Query:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS
        H SN    + P  N+VV  SN+P         LD+WHRRLGHP L   ++++ N + +     NK +FC+ CALGK+HALPFS
Subjt:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS

TrEMBL top hitse value%identityAlignment
A0A2Z7AWA7 Integrase catalytic domain-containing protein9.3e-7538.57Show/hide
Query:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT
        + +  S   +M+  +T+  +W  ++Q+F  R+ A+VM+ K  LQT+KKG  ++ +Y  K+K  +D+L A G  IP +D I++IL G+G EY+S+V  +T+
Subjt:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT

Query:  KICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQ-----RGRGRSGNNRGGRSNLNNRNKLQCYHCGCYGYTTN
        ++ S ++ +  ALL+ HE R+E         + +   H    S +     S +++ N +Q     RGRGR  N RGGR   +N  +  C  CG  G+   
Subjt:  KICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQ-----RGRGRSGNNRGGRSNLNNRNKLQCYHCGCYGYTTN

Query:  RCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGACLATT
         CYY  D       + P++   ++ T  Q     S  Y     A       +    +  WYPD GA++H+TN++GNLSVSSEY G ++V +GNGA L+ +
Subjt:  RCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGACLATT

Query:  HCGYGSI--MSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSN
        + G  ++    S+R F L +LLHVP ITKNLISVS+FA DN VYF+FHPS+CLVKD A++  LLRGTLHNGLYRF+L S +  P    + +     P   
Subjt:  HCGYGSI--MSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSN

Query:  SVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKISYSLHQTF
         V   SP    +  NTLD WH RLGHPS++T + ++ +C   +    N  SFC  C LGKNH LPF + + +    F
Subjt:  SVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKISYSLHQTF

A0A438FJP6 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-6435.27Show/hide
Query:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT
        S IGS+ L +++   +A ++W  +SQ FN ++ A+VM  KS +Q +KK G T+ +Y +K+K   D+L   G  I   DHI+ I+ GLG EY+S+++VI++
Subjt:  SKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITT

Query:  KICSYTVQDFMALLMTHETRLEAKAMSIE-SVHPVANVHIQHSSPSFKDN---NSHQQSNN---GNQRGRGRSGNNRG-GRSNLNNRNKLQCYHCGCYGY
        K  S ++Q   + L+ HE R+  K  S + SV+  +    +  S S+  N   +S  Q+ N   GNQ  RG   +NRG GR       K QC  C  +G+
Subjt:  KICSYTVQDFMALLMTHETRLEAKAMSIE-SVHPVANVHIQHSSPSFKDN---NSHQQSNN---GNQRGRGRSGNNRG-GRSNLNNRNKLQCYHCGCYGY

Query:  TTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPN--------------------WYPDPGATNHLTNNMGNL
        T +RC+Y  D     P +    P       P  +  G+       ++   N+    Y    N                    W+PD GATNH+T+++GNL
Subjt:  TTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPN--------------------WYPDPGATNHLTNNMGNL

Query:  SVSSEYQGNNQVHMGNGACLATTHCG---YGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFD
        +  +EY GN+++HMGNG  L  +H G   + S  S N+V  L ++L VP I KNL+SVSQFARDN+VYF+FHP  C VKD++++  LL+G LH GLY+F+
Subjt:  SVSSEYQGNNQVHMGNGACLATTHCG---YGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFD

Query:  LNSHVQNPTSHVS--------NVVEYSLPYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVK-NCMPSLLHCSNKSSFCDVCALGKNHALPF
        L+  +    S +S             SL ++++  S+ P   +   +  D+WH+RLGHP+      ++  N +P      + SS C  C LGK+H LPF
Subjt:  LNSHVQNPTSHVS--------NVVEYSLPYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVK-NCMPSLLHCSNKSSFCDVCALGKNHALPF

A0A5A7U233 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-8542.86Show/hide
Query:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA
        ++W  Q R         +   IL +MLH K+AK+IW  L  IF+ R LAQ M+ K+ L  IKKG   L EYF KI +CVD L ++ K +  +DHI+YILA
Subjt:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA

Query:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL
        GLG +Y SM+SVI+ +  S +VQ+ M+LL+T E++ E+K +S E+  P  N+  Q     + S+   N +   NN +   RG  GN R  R    NRNK 
Subjt:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL

Query:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN
        QC  C   GY+ +RC++         RY+PR+          +  Y +M    Q  A +A L       D NWYPD GATNHLT+++ NLS+ SEY G N
Subjt:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN

Query:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS
        Q++  NG+ L  TH G  S  SS    + F LN+LL VP+ITKNLISVSQFA+DN V+F+FHP+ C VKD  + Q LL+G L++GLY+F +     +   
Subjt:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS

Query:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS
        H SN    + P  N+VV  SN+P         LD+WHRRLGHP L   ++++ N + +     NK +FC+ CALGK+HALPFS
Subjt:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS

A0A5D3CH97 Retrovirus-related Pol polyprotein from transposon TNT 1-941.5e-8542.86Show/hide
Query:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA
        ++W  Q R         +   IL +MLH K+AK+IW  L  IF+ R LAQ M+ K+ L  IKKG   L EYF KI +CVD L ++ K +  +DHI+YILA
Subjt:  QIWPIQHR------RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILA

Query:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL
        GLG +Y SM+SVI+ +  S +VQ+ M+LL+T E++ E+K +S E+  P  N+  Q     + S+   N +   NN +   RG  GN R  R    NRNK 
Subjt:  GLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQ---HSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL

Query:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN
        QC  C   GY+ +RC++         RY+PR+          +  Y +M    Q  A +A L       D NWYPD GATNHLT+++ NLS+ SEY G N
Subjt:  QCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNN

Query:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS
        Q++  NG+ L  TH G  S  SS    + F LN+LL VP+ITKNLISVSQFA+DN V+F+FHP+ C VKD  + Q LL+G L++GLY+F +     +   
Subjt:  QVHMGNGACLATTHCGYGSIMSSN---RVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTS

Query:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS
        H SN    + P  N+VV  SN+P         LD+WHRRLGHP L   ++++ N + +     NK +FC+ CALGK+HALPFS
Subjt:  HVSNVVEYSLPYSNSVV--SNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS

A5BFT3 Integrase catalytic domain-containing protein1.2e-6135.91Show/hide
Query:  LSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEA
        L Q F  +  A+  + K+ LQ  KKGGST+ EY +KIK CVD L +VG  +  +DH+  IL GL  +Y+S V+ +  +   ++V++  ALLM HE+R+E 
Subjt:  LSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEA

Query:  KAMSIESVHPVANVHIQHSSPSFKDNN--------SHQQSNNGNQRGRGRSGN---------------NRGGRSNL---------------------NNR
           S++S     + H+  S+   K N         + Q S++G   G GR G+               N  GRSN                      N  
Subjt:  KAMSIESVHPVANVHIQHSSPSFKDNN--------SHQQSNNGNQRGRGRSGN---------------NRGGRSNL---------------------NNR

Query:  NKLQCYHCGCYGYTTNRCYYINDFSQQHPR------YSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLS
         K  C  CG  G+   +CYY  D + Q P+       SPRA             Y S   Q+  V     +P +    D NWYPD GA+NH+T N  NL 
Subjt:  NKLQCYHCGCYGYTTNRCYYINDFSQQHPR------YSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLS

Query:  VSSEYQGNNQVHMGNGACLATTHCGYGSIMS--SNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLN
         S+E+ G NQVH+GNG  L+  H G    +S  S++   LN LLHVP+ITKNL+SVS+FA+DN V+F+FH   C VKD+ +   L+ G + +GLY FD  
Subjt:  VSSEYQGNNQVHMGNGACLATTHCGYGSIMS--SNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLN

Query:  SHVQNPTSHVSNVVEYSLPYSNSVVSNSPSN---GSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSN-KSSFCDVCALGKNHALPFSKISYSLH
              +SH++     SL  S SVV++S S+    + +++T D+WH+RLGHPS +T ++++  C  ++ H +   S+FC  C LGK H  PFS      H
Subjt:  SHVQNPTSHVSNVVEYSLPYSNSVVSNSPSN---GSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSN-KSSFCDVCALGKNHALPFSKISYSLH

Query:  QTFT
         T+T
Subjt:  QTFT

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-0620.8Show/hide
Query:  RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKG-GSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVI
        R  +   ++  ++   TA+ IW  L  ++  + L   + LK  L  +    G+    + +     +  L  +G  I  ED  + +L  L   YD++ + I
Subjt:  RSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKG-GSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVI

Query:  TTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQRGRGRSGNN---RGGRSNLNNRNKLQ---CYHCGCYGY
             +  ++D  + L+ +E ++  K                       +N        G  R   RS NN    G R    NR+K +   CY+C   G+
Subjt:  TTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQRGRGRSGNN---RGGRSNLNNRNKLQ---CYHCGCYGY

Query:  TTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLT--NNMGNLSVSSEYQGNNQVHMGNGA
            C         +PR         +     + M  +    +  + +     H S G +  W  D  A++H T   ++    V+ ++     V MGN +
Subjt:  TTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLT--NNMGNLSVSSEYQGNNQVHMGNGA

Query:  CLATTHCGYGSIMSSNRV---FHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEY
           +   G G I     V     L D+ HVP +  NLIS     RD    +  +  + L K    +  + +G     LYR                    
Subjt:  CLATTHCGYGSIMSSNRV---FHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEY

Query:  SLPYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKIS
            +N+ +     N +    ++D+WH+R+GH S    Q + K  + S    +     CD C  GK H + F   S
Subjt:  SLPYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.1e-3527.54Show/hide
Query:  IGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKI
        I  S+   +    TA  IW  L +I+   +   V +L++ L+   KG  T+ +Y   +    D L  +GK +  ++ +  +L  L  EY  ++  I  K 
Subjt:  IGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKI

Query:  CSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSH--------QQSNNGNQRGRGRSGNNRGGRSNLNNRNKLQCYHCGCYGYTT
           T+ +    L+ HE+++   A+S  +V P+    + H + +  +NN++         ++NN N +   +S  N    +N +     +C  CG  G++ 
Subjt:  CSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSH--------QQSNNGNQRGRGRSGNNRGGRSNLNNRNKLQCYHCGCYGYTT

Query:  NRCYYINDF-----SQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNG
         RC  +  F     SQQ P  SP  P + +                      ANL   S     NW  D GAT+H+T++  NLS+   Y G + V + +G
Subjt:  NRCYYINDF-----SQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNG

Query:  ACLATTHCGYGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSL
        + +  +H G  S+ + +R  +L+++L+VP I KNLISV +    N V  +F P+   VKD  +   LL+G   + LY + + S               S 
Subjt:  ACLATTHCGYGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSL

Query:  PYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSK
        P S   +  SPS+ +    T   WH RLGHP+ S   S++ N   S+L+ S+K   C  C + K++ +PFS+
Subjt:  PYSNSVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.0e-2726.55Show/hide
Query:  IGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKI
        I  S+   +    TA  IW  L +I+   +   V +L+   +                    D L  +GK +  ++ +  +L  L  +Y  ++  I  K 
Subjt:  IGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKI

Query:  CSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNS---HQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL---QCYHCGCYGYTTNR
           ++ +    L+  E++L A   S E V   ANV    ++ + ++ N+   ++  NN N R      ++ G RS+ N + K    +C  C   G++  R
Subjt:  CSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNS---HQQSNNGNQRGRGRSGNNRGGRSNLNNRNKL---QCYHCGCYGYTTNR

Query:  CYYINDF---SQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGACLA
        C  ++ F   + Q    SP  P + +                      ANL   S     NW  D GAT+H+T++  NLS    Y G + V + +G+ + 
Subjt:  CYYINDF---SQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGACLA

Query:  TTHCGYGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSN
         TH G  S+ +S+R   LN +L+VP I KNLISV +    N V  +F P+   VKD  +   LL+G   + LY +        P +    V  ++ P S 
Subjt:  TTHCGYGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSN

Query:  SVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS
        +  S+              WH RLGHPSL+   S++ N    +L+ S+K   C  C + K+H +PFS
Subjt:  SVVSNSPSNGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFS

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)3.1e-0629.13Show/hide
Query:  VMLILRSQIWPIQHRRSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYIL
        V L L   + P Q + S + SS         T++DIW+ +   F     A+ ++L S L+T   G   +++Y+ K+KK  D L  V   +   + +MY+L
Subjt:  VMLILRSQIWPIQHRRSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYIL

Query:  AGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSF------KDNNSHQQSNNGNQ---RGRGRSGN---NRGG
         GL  ++D++++VI  +    +  D   +L   E RL+       ++ P    H+ HSS S           ++ Q + GNQ   RGRGR  N    RGG
Subjt:  AGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSF------KDNNSHQQSNNGNQ---RGRGRSGN---NRGG

Query:  RSNLNN
        R +  N
Subjt:  RSNLNN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)8.2e-0729.82Show/hide
Query:  TAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLM
        TA+D+W+ L  +F     A+ ++ ++ L+T      ++ EY  K+K   D+LT V   I     +M++L GL  +YD +++VI  K    +  +  ++L+
Subjt:  TAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVGKLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLM

Query:  THETRLEAKAMS--IESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQRGRGRS-GNNRGGRSN---LNNRN
          E+RL  K+ S    + HP  + ++  + P  ++    +  NN +  GRGRS   NRGG S+    NN N
Subjt:  THETRLEAKAMS--IESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQRGRGRS-GNNRGGRSN---LNNRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGTTCGAATCTCATATCGACCAAAGTTCTACTGCACCTCCAGAAACGATCATTGTACGAATCGGTGATGTTGATACTACGCAGTCAAATCTGGCCTATACAGCA
TAGAAGAAGCAAGATCGGCTCATCTATCCTTGAAAAAATGCTTCACTACAAAACTGCTAAAGATATCTGGGTGTGTTTATCTCAGATCTTCAATTTGAGAAATCTGGCCC
AAGTTATGAAGCTTAAATCAACGCTCCAAACTATCAAGAAGGGAGGTTCAACATTAAGTGAGTACTTCTCAAAGATTAAGAAATGTGTAGATGTCTTAACTGCAGTAGGT
AAGTTGATTCCTCTTGAGGATCATATTATGTATATTCTTGCCGGCCTTGGTCTAGAGTATGATTCTATGGTCTCTGTCATTACCACAAAAATCTGCTCTTATACAGTGCA
AGATTTCATGGCTTTGCTCATGACACATGAAACCAGACTTGAAGCTAAAGCTATGAGTATTGAATCTGTTCATCCAGTGGCTAATGTTCACATCCAACACTCGTCTCCTT
CATTTAAGGATAATAACTCTCATCAGCAGTCGAACAATGGAAATCAGAGAGGACGTGGTCGATCTGGCAATAACAGAGGAGGTCGTTCCAACTTGAATAATAGGAATAAA
CTTCAGTGCTATCATTGTGGTTGTTATGGCTATACAACAAATCGTTGTTACTACATAAATGACTTCTCACAACAACATCCACGGTATTCTCCTCGAGCACCAAAGGAAAA
TCAAGTTACAGTTCCTCAGTCCATGATGTATGGTTCAATGGGATATCAAATTCAACATGTGGCTCAGTTGGCAAATCTACCTCATGCTTCATATGGTCAGGACCCAAATT
GGTATCCAGACCCTGGAGCAACCAATCATCTTACCAATAACATGGGAAATCTCTCAGTAAGCTCAGAATATCAAGGAAACAATCAAGTTCATATGGGAAATGGTGCATGT
TTGGCTACCACACATTGTGGCTATGGTTCTATTATGTCTTCTAATAGAGTTTTTCATCTTAATGATCTATTACATGTTCCTACGATTACTAAGAATCTTATTAGTGTGAG
TCAATTTGCTCGGGATAATTCTGTTTATTTTAAATTTCATCCTTCTTATTGTCTTGTGAAAGATCGAGCATCTAATCAGGAGCTTCTTCGAGGGACTCTCCATAATGGCC
TCTACCGGTTTGATTTGAATTCTCATGTTCAAAATCCCACATCTCATGTTTCTAATGTTGTTGAATATTCTCTGCCTTATTCAAATTCTGTTGTTTCTAATTCCCCCTCT
AATGGGTCTATTATTAATAATACTCTTGATGTGTGGCATAGGCGCCTAGGCCATCCATCTCTATCTACCTTTCAATCTATTGTGAAGAATTGTATGCCCTCATTGCTACA
TTGTTCAAATAAATCAAGTTTTTGTGATGTTTGTGCTTTAGGGAAAAATCATGCTCTCCCTTTTTCCAAAATCTCTTACTCATTACACCAAACCTTTACAACTTGTTGTT
TTTTATGTGTGGGGACCTACTTATTCTTTATCTCAAAGGTGGTTTTCGTTATTATGTTTCATTTGTTGATGCTTTCTCAAGATATACATGGATATACATGTTATCGTCTA
AGTCTGAAGTTGATTTCATTCACTTTCGAAATCAAGTGGAAAAATTTCTAG
mRNA sequenceShow/hide mRNA sequence
AAGAGGATAAATGATACTTAAAGATTAGGCCAGCTGTCATTGTTCTAGAAGCTTAGTTTTCCCCTTTACTCTTTTCGTTGCTTTCTTTCTGTTGTGTGGCGTGGATCATT
TTCTAAATGGTTTTTATGTGATCTTTTCTTGTTGCTCAAGAGTATTTTCACCTCTCATGTACTCTATATATTCCTAACATTTTCTTCTTCATTTTGTATTTGATTGAATA
GAAAATGGAATAGATCAGAGGTTTTTCTCTTGAGGCAAATTTTGTATCAGAGCTCGACCTGTCTTGCGACGTCTCTGATAGCTTTTTTTTTTTTTCTTTCTCCTCTCTAA
CCGTTCCTTCTCTCATGGAATCCTCAGAAGTAAGCTTCGATCAAACGGTGAGTCCGATTTCATTGTCTTCTACGATTTTACCAGGTAGCAAAATTGCTATTGTTCAACTC
ACGAGTGAAAATTTCCTGTTGGAAATTTCAAGTTGAGTTTGCTCTTGAAGGCCATGGCCTGTTCGAATCTCATATCGACCAAAGTTCTACTGCACCTCCAGAAACGATCA
TTGTACGAATCGGTGATGTTGATACTACGCAGTCAAATCTGGCCTATACAGCATAGAAGAAGCAAGATCGGCTCATCTATCCTTGAAAAAATGCTTCACTACAAAACTGC
TAAAGATATCTGGGTGTGTTTATCTCAGATCTTCAATTTGAGAAATCTGGCCCAAGTTATGAAGCTTAAATCAACGCTCCAAACTATCAAGAAGGGAGGTTCAACATTAA
GTGAGTACTTCTCAAAGATTAAGAAATGTGTAGATGTCTTAACTGCAGTAGGTAAGTTGATTCCTCTTGAGGATCATATTATGTATATTCTTGCCGGCCTTGGTCTAGAG
TATGATTCTATGGTCTCTGTCATTACCACAAAAATCTGCTCTTATACAGTGCAAGATTTCATGGCTTTGCTCATGACACATGAAACCAGACTTGAAGCTAAAGCTATGAG
TATTGAATCTGTTCATCCAGTGGCTAATGTTCACATCCAACACTCGTCTCCTTCATTTAAGGATAATAACTCTCATCAGCAGTCGAACAATGGAAATCAGAGAGGACGTG
GTCGATCTGGCAATAACAGAGGAGGTCGTTCCAACTTGAATAATAGGAATAAACTTCAGTGCTATCATTGTGGTTGTTATGGCTATACAACAAATCGTTGTTACTACATA
AATGACTTCTCACAACAACATCCACGGTATTCTCCTCGAGCACCAAAGGAAAATCAAGTTACAGTTCCTCAGTCCATGATGTATGGTTCAATGGGATATCAAATTCAACA
TGTGGCTCAGTTGGCAAATCTACCTCATGCTTCATATGGTCAGGACCCAAATTGGTATCCAGACCCTGGAGCAACCAATCATCTTACCAATAACATGGGAAATCTCTCAG
TAAGCTCAGAATATCAAGGAAACAATCAAGTTCATATGGGAAATGGTGCATGTTTGGCTACCACACATTGTGGCTATGGTTCTATTATGTCTTCTAATAGAGTTTTTCAT
CTTAATGATCTATTACATGTTCCTACGATTACTAAGAATCTTATTAGTGTGAGTCAATTTGCTCGGGATAATTCTGTTTATTTTAAATTTCATCCTTCTTATTGTCTTGT
GAAAGATCGAGCATCTAATCAGGAGCTTCTTCGAGGGACTCTCCATAATGGCCTCTACCGGTTTGATTTGAATTCTCATGTTCAAAATCCCACATCTCATGTTTCTAATG
TTGTTGAATATTCTCTGCCTTATTCAAATTCTGTTGTTTCTAATTCCCCCTCTAATGGGTCTATTATTAATAATACTCTTGATGTGTGGCATAGGCGCCTAGGCCATCCA
TCTCTATCTACCTTTCAATCTATTGTGAAGAATTGTATGCCCTCATTGCTACATTGTTCAAATAAATCAAGTTTTTGTGATGTTTGTGCTTTAGGGAAAAATCATGCTCT
CCCTTTTTCCAAAATCTCTTACTCATTACACCAAACCTTTACAACTTGTTGTTTTTTATGTGTGGGGACCTACTTATTCTTTATCTCAAAGGTGGTTTTCGTTATTATGT
TTCATTTGTTGATGCTTTCTCAAGATATACATGGATATACATGTTATCGTCTAAGTCTGAAGTTGATTTCATTCACTTTCGAAATCAAGTGGAAAAATTTCTAGGAACAC
ATGTGTTAAGACTTCAAACAGATGGGGGAACAGAGTTCAAACCATTAAAATCTTACCTCAAACAACATGGCATAACTCATCGCATATCCTGTCCTTACACATCAAAACAA
AACGGTATTTTTGAGCGAAAACATAGACATGTTATTGATGTAGGACTCACTTTACTTGCTCAAGCCTCTATGCCCCTACGTTTTTTGGATGAAGCTTTCTCTACTACTAC
TTTTCTTATTAACAGGCTACCAACTTCTGTCCTAGATGGAGTGAGTCCTTTCAAGAAAATCTTTAACCAGGTACCTCAATACTCATCCTTTAAAGTTTTTGGCAGCAAAT
GTTTTCCTTGTTTGTGTCCATATAACAATCATAAGCTATCTTTTCGCTCAGAACGCTGTACTTTTATTGGGTATAGTTCGCTTCATAAAGGATACAAATGCGTAGTTAAG
GATGGTCATGTCTTCATATCTAGACATGTTCTGTTTGATGAACATTGTTTTCCATTTGCCTTTTCCAAAACCTTGTCAAATACTCCAATTGTTTCTATTGGCTCCATACT
TCACAATGTTATTCCTTTAGTTAAATCTGCTGAGCCTCTGGTAAGGAGTGATGCCTCCCTGAATCCTACTATTTCACCAACCTTACCTCTTGCCTCGGAGTCCCCTACAC
ATTCTATGTCATTGTGTGATAGTTCAACAAATGCACCTACTGTTCCTCACGAGTCTATAGGTTCAACATCTTTATCAAATTCAGGTGGTCAACCAGAAAACTCACAGGTA
GAGGTTGTTGCGCAGGTTATAAGGTCCATACCTCAAAACCAACATTCGATGATGACTCGTGCAAAAAGAGGCATTTTCAAGCCTAAAGTTCTGTTGAGTGAATATGTTGA
AAGGGAACCTCTAACGACTAAGATAGCTTTGAAGCATACTCATTGGAAACAAGCAATGTAAGAGGAATATAATGCTCTTCTAGTAAATAACACATGGACCTTAGTTCCAA
AACCATCCAACCAAAAGATCATTGGTTGCAAGTGGGTGTTTAAGATCAAAATAAATTCGGATGGATCCATATCACAATATAAGGCAAGACTTGTAGCAAAAGACTTTCAT
CAATCCCCTCAGGTTGATTATTTTGAAACTTTCAGTCCCGTGGTCAAACCGATACCATACGTGTTCCCTTAACACTAGCTTTAGCATATGGATGGTCTATCAGACAAATT
GATATTAACAATGTGTTTCTTCATGGTGTGTTGTCTGAGACAGTATATATGGAACAACCTGTCGGTTTCTATGAAGGTGATGGGAAATCAACAGTTTGTAAGTTTACGAA
AGCTTTATATGGTTTAAAGCAGGCACCGCGTGTGTGGTTTGATAGGTTGAATATGTTTCTTCACAAGGATGGTTTTGTTTCCTCTCGTGCAGATACATCCTTGCTGTTTA
AACATATTCGAAATTTCAATTGCTATGTGCTTATATATGTTGATGATATTTTAATCATGGGAAATTCTGATGCAGAGATCCAAAGCTTGATCAAACGGTTGAATGCTACT
TTCTCATTAAATGATCTTGGTCCACTTACTTATTTTCTTGGCATTGAGGTTTCTTATCCACAAACATGTTGCATGTTTCTTTCTCAAACGAAGTACATTAAAGATGTCCT
CAGCAAAACAAATATGTTGAATGCTAAGCCAATAGCCACTCCCATGGTCAGTGGTGCATTGCCATCTGCATATGGAGGTGTCTTGACCAAACCGCTTTCGGCGATGATGT
TTGTGCAGCTACGATTCAAGCTCCACGTTCGGGATTTCACCTCTCTAGGCTTGCGGGGGGTGTTAGGAAAGAGGTCCATTTGGCAAAGACTCGTTCATCCTAAGAAGATA
AATGGTACTTG
Protein sequenceShow/hide protein sequence
MACSNLISTKVLLHLQKRSLYESVMLILRSQIWPIQHRRSKIGSSILEKMLHYKTAKDIWVCLSQIFNLRNLAQVMKLKSTLQTIKKGGSTLSEYFSKIKKCVDVLTAVG
KLIPLEDHIMYILAGLGLEYDSMVSVITTKICSYTVQDFMALLMTHETRLEAKAMSIESVHPVANVHIQHSSPSFKDNNSHQQSNNGNQRGRGRSGNNRGGRSNLNNRNK
LQCYHCGCYGYTTNRCYYINDFSQQHPRYSPRAPKENQVTVPQSMMYGSMGYQIQHVAQLANLPHASYGQDPNWYPDPGATNHLTNNMGNLSVSSEYQGNNQVHMGNGAC
LATTHCGYGSIMSSNRVFHLNDLLHVPTITKNLISVSQFARDNSVYFKFHPSYCLVKDRASNQELLRGTLHNGLYRFDLNSHVQNPTSHVSNVVEYSLPYSNSVVSNSPS
NGSIINNTLDVWHRRLGHPSLSTFQSIVKNCMPSLLHCSNKSSFCDVCALGKNHALPFSKISYSLHQTFTTCCFLCVGTYLFFISKVVFVIMFHLLMLSQDIHGYTCYRL
SLKLISFTFEIKWKNF