; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0003227 (gene) of Chayote v1 genome

Gene IDSed0003227
OrganismSechium edule (Chayote v1)
DescriptionIntegrase catalytic domain-containing protein
Genome locationLG04:11534062..11540641
RNA-Seq ExpressionSed0003227
SyntenySed0003227
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa]4.5e-13147.51Show/hide
Query:  LDSQLNPSDSQLNPSDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWIL
        +++Q + + S+ +P  +  +  D QLNP+ +HHS   T+ +V QPL GA NY SW RA L+A++G+NK GF+ G I+KP +  L+ +W CNN I+ SWIL
Subjt:  LDSQLNPSDSQLNPSDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWIL

Query:  NSVSKEIAASIICTGTAKDVWDELAERFKR-----IFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGL
        NSVSKEIAASII  G+ K++WDEL +RFK+     I+QLRKE   + QG +TIE Y+TK+KT+WQ+L E+R   +CTCG LKPFI+HL SEY+M FLMGL
Subjt:  NSVSKEIAASIICTGTAKDVWDELAERFKR-----IFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGL

Query:  NETYAAVRTQILLMEPLPSINKAFSLIVQEERQRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRR-N
        N++YAAVR QILLM+PLPSIN  FSL++QEE+QR  G  +  +  +    N   +   ++++ R+  +  C++CG KGH  DKCY+ HG+PPG+K R  N
Subjt:  NETYAAVRTQILLMEPLPSINKAFSLIVQEERQRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRR-N

Query:  INENGSQNAGTNSVSQGSSS-------FLDNLDKNQCTQLIEMLNSRLQDEKKTAI--ASAVNHISGINSVSL--SKSPNSWILDSGASKHICFNRQSFT
                + TN+V+  +S+       F  +L+  Q +QL+ +LN+ LQ      I  A+A+ H SGI +++   ++S + WI+DSGAS+HIC ++  F 
Subjt:  INENGSQNAGTNSVSQGSSS-------FLDNLDKNQCTQLIEMLNSRLQDEKKTAI--ASAVNHISGINSVSL--SKSPNSWILDSGASKHICFNRQSFT

Query:  NLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILD-----SCI
        N    N+M ++LPN  R++V+ +GD+QIN  L L DVL V QF YNLISVSCLL + +I+LDF   CCI+QD     MIGKA C NGLY+L+     +CI
Subjt:  NLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILD-----SCI

Query:  SS--IACSVTVDIWHRRLGHLS
        ++     +++VD WH+RLGHLS
Subjt:  SS--IACSVTVDIWHRRLGHLS

XP_012833844.1 PREDICTED: uncharacterized protein LOC105954710 [Erythranthe guttata]5.4e-9239.25Show/hide
Query:  NPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIA--SWKCNNHIITSWILNSVSKEIAASIICTGTA
        +PLD   +P  +H S     ILV+Q L+  DNY SW RA  I+L  KNK GF++G+I +P  ++LI   +W  NN+I+ SWI+NSVSK+I  SI+ + ++
Subjt:  NPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIA--SWKCNNHIITSWILNSVSKEIAASIICTGTA

Query:  KDVWDELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDV---ECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLM
        K++WD+L  RF      RIFQLR++LAN+TQG+ ++  YFTK+K +W +L  +RP     +C CG  +    H N EYVM FLMGLN++ A+ R QILLM
Subjt:  KDVWDELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDV---ECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLM

Query:  EPLPSINKAFSLIVQEERQRIIGNS--SKEVITLMAYNNTKKNSVDN-----SNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRR-----NIN-
        +PLP I+K F+ I QEERQR + +S            N   K S++N       K+RE  +S CTHC  +GHT +KCY++HG+PP +K ++     ++N 
Subjt:  EPLPSINKAFSLIVQEERQRIIGNS--SKEVITLMAYNNTKKNSVDN-----SNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRR-----NIN-

Query:  --------ENGSQNAGTNSVSQGSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIAS------------AVNHISGINSVS--LSKSPNSWILDSGASKH
                ++ S +AG +  SQ    +L ++  +QC Q + M +S +  +++ + AS             V+ ++GI ++S   S S   WILDSGASKH
Subjt:  --------ENGSQNAGTNSVSQGSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIAS------------AVNHISGINSVS--LSKSPNSWILDSGASKH

Query:  ICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYIL
        IC ++Q F N+  +++  ++LP++  + V FVGDV+++  + L  V  VPQF +NLISVS  L +N   + FS    ++QD K F+MIG+      L +L
Subjt:  ICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYIL

Query:  D-----------SCISSIACS-VTVDIWHRRLGHL
        D           SC S   C+ V+  + H+RLGH+
Subjt:  D-----------SCISSIACS-VTVDIWHRRLGHL

XP_012857659.1 PREDICTED: uncharacterized protein LOC105976934 [Erythranthe guttata]1.7e-9038.65Show/hide
Query:  NPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEK--LIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDEL
        +P+ +H S     +LV+  L+  DNY +W RA +I+L  KNK GF++GSI KP  ++  L+ +W  NN I+ SWILN++S +I AS++ + +A D+W++L
Subjt:  NPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEK--LIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDEL

Query:  AERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDV---ECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSIN
          RF      RIFQLR+ELAN+TQ   ++  YFTK+K +W +L  FRP      C+CG +    +H + E+VM FLMGLN++ A+ R QILLM+PLP IN
Subjt:  AERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDV---ECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSIN

Query:  KAFSLIVQEERQRIIG-NSSKEVITLMAY-----------NNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRN----------
        K F+L+ QEER R +   SS +V   +A+              + N    +  +R+  K  CTHC   GHT +KCYR+HGFPPG++ R+           
Subjt:  KAFSLIVQEERQRIIG-NSSKEVITLMAY-----------NNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRN----------

Query:  -----------INENGSQNAGTNSVSQGSS-SFLDNLDKNQCTQLIEMLNSRLQDE------KKTAIASAVNHISGINSVSL-------SKSPNSWILDS
                   ++ + S N+G+ S S  SS +FLD +  +QC QL+  ++S L ++       K +     +HIS +  + L       S  P+ WILDS
Subjt:  -----------INENGSQNAGTNSVSQGSS-SFLDNLDKNQCTQLIEMLNSRLQDE------KKTAIASAVNHISGINSVSL-------SKSPNSWILDS

Query:  GASKHICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTN
        GAS+HIC N+  F N+ S+++  +VLP++  V V  +GDVQ+ + L+LH+V  VP+F +NL+SVS LL  +   + F      +QD+   Q IGK +   
Subjt:  GASKHICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTN

Query:  GLYILD----SCISSIACS-VTVDIWHRRLGHL
        GLY+LD    S I    C+ ++  +WH RLGH+
Subjt:  GLYILD----SCISSIACS-VTVDIWHRRLGHL

XP_022145891.1 uncharacterized protein LOC111015239 [Momordica charantia]2.9e-9351.1Show/hide
Query:  LDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVW
        ++ QLNP+L+HHS   T++LV Q L+GA NYNSW R+ LIAL+GKNK GF++G+I+KP N  L+A+WKCNN IITSWI+NSVSKEIAASII TG+AKD+W
Subjt:  LDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVW

Query:  DELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSIN
        DEL ERF+     RIFQLRKEL    QGT++IEAY+TK+KTVWQ+L ++RP ++CTC  LK   E   SEYVM FLMGLNE+YA +R QILLM+P+P +N
Subjt:  DELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSIN

Query:  KAFSLIVQEERQRIIG--NSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNI-------NENGSQNAG---
        K FSL++QEERQR IG  N     + +     +K+NS   +  RR+  +S CTHCG +GH  DKCY++HG+PPG+++           N NG+ ++    
Subjt:  KAFSLIVQEERQRIIG--NSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNI-------NENGSQNAG---

Query:  TNSVSQ----------------GSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISG
         N VS+                 S +F ++L+ +Q +QL+EML S LQ  K   I + +NH++G
Subjt:  TNSVSQ----------------GSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISG

XP_022856063.1 uncharacterized protein LOC111377235, partial [Olea europaea var. sylvestris]6.6e-9041.26Show/hide
Query:  LIALAGKNKEGFVNGSIEKPKN-EKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELAERFK-----RIFQLRKELANITQGTMTIEAYFT
        +I L  KNK GF++GSI KP N +  +++W  NN+I+ SWILNSVSKEI+AS+I + +A D+W +L ERF+     RIFQLR+EL N+TQG +++  YFT
Subjt:  LIALAGKNKEGFVNGSIEKPKN-EKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELAERFK-----RIFQLRKELANITQGTMTIEAYFT

Query:  KIKTVWQDLIEFRPDV---ECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFSLIVQEERQRIIG-----NSSKEVITLMAYN
        K+KT+W++L  +RP     +C+CG  K   EH   EYVM FLMGLN+T+A  R Q+LLM+P+PSINK FSL+ QEE QR I      +++ + +   A  
Subjt:  KIKTVWQDLIEFRPDV---ECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFSLIVQEERQRIIG-----NSSKEVITLMAYN

Query:  NTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNIN-----ENGSQNAGTNSVSQGSSS---------FLDNLDKNQCTQLIEML
        + KK S+   +K ++  K +CT+C   GH+ DKCY++HG+PPG+K ++  N     +N  ++   N V   + S         F+ +L+  Q  QL+ ML
Subjt:  NTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNIN-----ENGSQNAGTNSVSQGSSS---------FLDNLDKNQCTQLIEML

Query:  NSRLQDEK-KTAIASAVNHISG-INSVSLS---KSPNSWILDSGASKHICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQF
         + L   K  T      N +SG   S+ L+    SP  W++DSGA+ HICF++ +F ++  I +  + LPN+ R+ V FVG V+ ++ L+L DVL VP F
Subjt:  NSRLQDEK-KTAIASAVNHISG-INSVSLS---KSPNSWILDSGASKHICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQF

Query:  TYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYI-----LDS---------------CISSIACSVTVDIWHRRLGHLS
         +NL+SVS L+  ++I + F    C +QD  N +MIGK   + GLYI     LDS                +SS+   V  + WH RLGHLS
Subjt:  TYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYI-----LDS---------------CISSIACSVTVDIWHRRLGHLS

TrEMBL top hitse value%identityAlignment
A0A151U9A5 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-8937.05Show/hide
Query:  SDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKP-KNEKLIASWKCNNHIITSWILNSVSKEIAASIIC
        ++S   P D   NP+ +H S    + +V+QPL G DNYNSW RA L+AL  KNK GFV+G+I KP   +K   SW+ NN+I+ SW+LN +SK++ AS+I 
Subjt:  SDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKP-KNEKLIASWKCNNHIITSWILNSVSKEIAASIIC

Query:  TGTAKDVWDELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILL
        + +A  +W++L  RF+     R+FQLR++L  + QG++ I  YFTKIK +W++L E++P   CTCG +KP+I+H  SEY M+FLMGLNE Y+ +R QILL
Subjt:  TGTAKDVWDELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILL

Query:  MEPLPSINKAFSLIVQEERQRIIG--NSSKEVITLMAY--NNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNINENGSQNAG
        M+P+P I K FSL++QEE+Q+ +G   +S +  T  AY   N  K+  +N  K R      C HCG  GH  DKC+++HG+P   K        G+ N  
Subjt:  MEPLPSINKAFSLIVQEERQRIIG--NSSKEVITLMAY--NNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNINENGSQNAG

Query:  TNSVSQGSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISGINSVSLSKSPN-------SWILDSGASKHICFNRQSFTNLHSINDMSIVLPN
         N VS  S+        +Q  Q++ +L ++         ++ +     +N + LS  P+        WILDSGAS H+  +   F     I +  + LPN
Subjt:  TNSVSQGSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISGINSVSLSKSPN-------SWILDSGASKHICFNRQSFTNLHSINDMSIVLPN

Query:  NFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILDSCIS------------------
           + V  +G V ++  ++L DV+ VPQF YNL+S++CLL  N ++L F     ILQD  + +MIG  +   G+Y+L+  ++                  
Subjt:  NFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILDSCIS------------------

Query:  ----------SIACSVTVDIWHRRLGHLS
                  +  C+    IWH R GH++
Subjt:  ----------SIACSVTVDIWHRRLGHLS

A0A2Z7AT15 Cysteine-rich RLK (Receptor-like protein kinase) 81.6e-8937.43Show/hide
Query:  NPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLI-ASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELA
        +P+ +H+       LV+ PLIG+ NYN+W+RA ++AL  KNK GF++ SI++P++E L+  SW   N ++ SWILNSV++ IA S++   TA+++W +L 
Subjt:  NPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLI-ASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELA

Query:  ERF-----KRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFS
        ERF      RI+Q++K L+ + QG+M + +Y+TK++T+W +L +++P   CTCG+++ +  + N E VM FLMGLN++YA VR Q+L++EPLP+I K F+
Subjt:  ERF-----KRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFS

Query:  LIVQEERQRIIG-NSSKEVITLMAYNNTKKNSVDNSNKRRETIKS--------VCTHCGFKGHTTDKCYRIHGFPPG---FKSRRNINENGSQNAGTNSV
        L++QEERQR I  + SK  +      +   +S + +   R +  S        +C+HC F+ HT DKCY++HG+PPG   FKS+ +     +  A ++S 
Subjt:  LIVQEERQRIIG-NSSKEVITLMAYNNTKKNSVDNSNKRRETIKS--------VCTHCGFKGHTTDKCYRIHGFPPG---FKSRRNINENGSQNAGTNSV

Query:  SQGSSSFLDNLD---KNQCTQLIEMLNSRLQDEKK--------------TAIASAVNHISGINSVSLSKSPNSWILDSGASKHICFNRQSFTNLHSINDM
        +   +  +D+ D   ++QC QLIE L+S+LQ  +               T I SA +HI  I       +   WI+D+GA+ HIC +   F +  +I   
Subjt:  SQGSSSFLDNLD---KNQCTQLIEMLNSRLQDEKK--------------TAIASAVNHISGINSVSLSKSPNSWILDSGASKHICFNRQSFTNLHSINDM

Query:  SIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYIL---DSCISSIACSVTV-
         +VLPN   + V   G V + S L+L +VL VP F +NL+SVS L  +++ ++ F    C +QD    +MIG       LY+L   D  + S  C+  V 
Subjt:  SIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYIL---DSCISSIACSVTV-

Query:  --DIWHRRLGHLS
          ++WHRR+GH S
Subjt:  --DIWHRRLGHLS

A0A2Z7CMI0 Uncharacterized protein1.2e-8937.43Show/hide
Query:  NPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLI-ASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELA
        +P+ +H+       LV+ PLIG+ NYN+W+RA ++AL  KNK GF++ SI++P++E L+  SW   N ++ SWILNSV++ IA S++   TA+++W +L 
Subjt:  NPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLI-ASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELA

Query:  ERF-----KRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFS
        ERF      RI+Q++K L+ + QG+M + +Y+TK++T+W +L +++P   CTCG+++ +  + N E VM FLMGLN++YA VR Q+L++EPLP+I K F+
Subjt:  ERF-----KRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFS

Query:  LIVQEERQRIIG-NSSKEVITLMAYNNTKKNSVDNSNKRRETIKS--------VCTHCGFKGHTTDKCYRIHGFPPG---FKSRRNINENGSQNAGTNSV
        L++QEERQR I  + SK  +      +   +S + +   R +  S        +C+HC F+ HT DKCY++HG+PPG   FKS+ +     +  A ++S 
Subjt:  LIVQEERQRIIG-NSSKEVITLMAYNNTKKNSVDNSNKRRETIKS--------VCTHCGFKGHTTDKCYRIHGFPPG---FKSRRNINENGSQNAGTNSV

Query:  SQGSSSFLDNLD---KNQCTQLIEMLNSRLQDEKK--------------TAIASAVNHISGINSVSLSKSPNSWILDSGASKHICFNRQSFTNLHSINDM
        ++  +  +D+ D   ++QC QLIE L+S+LQ  +               T I SA +HI  I       +   WI+D+GA+ HIC +   F +  +I   
Subjt:  SQGSSSFLDNLD---KNQCTQLIEMLNSRLQDEKK--------------TAIASAVNHISGINSVSLSKSPNSWILDSGASKHICFNRQSFTNLHSINDM

Query:  SIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYIL---DSCISSIACSVTV-
         +VLPN   + V   G V + S L+L +VL VP F +NL+SVS L  +++ ++ F    C +QD    +MIG       LY+L   D  + S  C+  V 
Subjt:  SIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYIL---DSCISSIACSVTV-

Query:  --DIWHRRLGHLS
          ++WHRR+GH S
Subjt:  --DIWHRRLGHLS

A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 82.2e-13147.51Show/hide
Query:  LDSQLNPSDSQLNPSDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWIL
        +++Q + + S+ +P  +  +  D QLNP+ +HHS   T+ +V QPL GA NY SW RA L+A++G+NK GF+ G I+KP +  L+ +W CNN I+ SWIL
Subjt:  LDSQLNPSDSQLNPSDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWIL

Query:  NSVSKEIAASIICTGTAKDVWDELAERFKR-----IFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGL
        NSVSKEIAASII  G+ K++WDEL +RFK+     I+QLRKE   + QG +TIE Y+TK+KT+WQ+L E+R   +CTCG LKPFI+HL SEY+M FLMGL
Subjt:  NSVSKEIAASIICTGTAKDVWDELAERFKR-----IFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGL

Query:  NETYAAVRTQILLMEPLPSINKAFSLIVQEERQRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRR-N
        N++YAAVR QILLM+PLPSIN  FSL++QEE+QR  G  +  +  +    N   +   ++++ R+  +  C++CG KGH  DKCY+ HG+PPG+K R  N
Subjt:  NETYAAVRTQILLMEPLPSINKAFSLIVQEERQRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRR-N

Query:  INENGSQNAGTNSVSQGSSS-------FLDNLDKNQCTQLIEMLNSRLQDEKKTAI--ASAVNHISGINSVSL--SKSPNSWILDSGASKHICFNRQSFT
                + TN+V+  +S+       F  +L+  Q +QL+ +LN+ LQ      I  A+A+ H SGI +++   ++S + WI+DSGAS+HIC ++  F 
Subjt:  INENGSQNAGTNSVSQGSSS-------FLDNLDKNQCTQLIEMLNSRLQDEKKTAI--ASAVNHISGINSVSL--SKSPNSWILDSGASKHICFNRQSFT

Query:  NLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILD-----SCI
        N    N+M ++LPN  R++V+ +GD+QIN  L L DVL V QF YNLISVSCLL + +I+LDF   CCI+QD     MIGKA C NGLY+L+     +CI
Subjt:  NLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILD-----SCI

Query:  SS--IACSVTVDIWHRRLGHLS
        ++     +++VD WH+RLGHLS
Subjt:  SS--IACSVTVDIWHRRLGHLS

A0A6J1CXR2 uncharacterized protein LOC1110152391.4e-9351.1Show/hide
Query:  LDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVW
        ++ QLNP+L+HHS   T++LV Q L+GA NYNSW R+ LIAL+GKNK GF++G+I+KP N  L+A+WKCNN IITSWI+NSVSKEIAASII TG+AKD+W
Subjt:  LDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVW

Query:  DELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSIN
        DEL ERF+     RIFQLRKEL    QGT++IEAY+TK+KTVWQ+L ++RP ++CTC  LK   E   SEYVM FLMGLNE+YA +R QILLM+P+P +N
Subjt:  DELAERFK-----RIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSIN

Query:  KAFSLIVQEERQRIIG--NSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNI-------NENGSQNAG---
        K FSL++QEERQR IG  N     + +     +K+NS   +  RR+  +S CTHCG +GH  DKCY++HG+PPG+++           N NG+ ++    
Subjt:  KAFSLIVQEERQRIIG--NSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNI-------NENGSQNAG---

Query:  TNSVSQ----------------GSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISG
         N VS+                 S +F ++L+ +Q +QL+EML S LQ  K   I + +NH++G
Subjt:  TNSVSQ----------------GSSSFLDNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISG

SwissProt top hitse value%identityAlignment
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-1522.52Show/hide
Query:  NQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSI----------EKPKNEKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELAE-----R
        N   + + NY  W R       G    GF++GS             P+       WK  + +I S +L ++S  +  ++    TA  +W+ L +      
Subjt:  NQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSI----------EKPKNEKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELAE-----R

Query:  FKRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFSLIVQEER
        +  + QLR +L   T+GT TI+ Y   + T +  L      ++             + E V   L  L E Y  V  QI   +  P++ +    ++  E 
Subjt:  FKRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQILLMEPLPSINKAFSLIVQEER

Query:  QRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNINENGSQNAGTNSVSQGSSSFLDNLDKNQCTQL
         +I+  SS  VI + A   + +N+   +N             G + +  D     +   P  +S  N + N +Q+       Q     +      +C+QL
Subjt:  QRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNINENGSQNAGTNSVSQGSSSFLDNLDKNQCTQL

Query:  ---IEMLNSRLQDEKKTAIASAVNHISGINSVSLSKSPNSWILDSGASKHIC--FNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSR---LILHD
           +  +NS+      T      N   G        S N+W+LDSGA+ HI   FN  S    ++  D  +++ +   + +   G   ++++   L LH+
Subjt:  ---IEMLNSRLQDEKKTAIASAVNHISGINSVSLSKSPNSWILDSGASKHIC--FNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSR---LILHD

Query:  VLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILD-------SCISSIACSVTVDIWHRRLGH
        +L VP    NLISV  L  +N ++++F      ++D      + +    + LY          S  +S +   T   WH RLGH
Subjt:  VLLVPQFTYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILD-------SCISSIACSVTVDIWHRRLGH

Arabidopsis top hitse value%identityAlignment
AT1G21280.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162); Has 707 Blast hits to 705 proteins in 25 species: Archae - 0; Bacteria - 0; Metazoa - 4; Fungi - 0; Plants - 703; Viruses - 0; Other Eukaryotes - 0 (source: NCBI BLink).2.7e-2534.21Show/hide
Query:  DNYNSWKRAFLIALAGKNKEGFVNGSIEKPKN-EKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELAERF-----KRIFQLRKELANITQ
        DNY +WK  F   L    K GF++G++ KP     L   W+  N ++  W++NS++ ++  S++   TA  +W++L   F      +I+QLR+ LA + Q
Subjt:  DNYNSWKRAFLIALAGKNKEGFVNGSIEKPKN-EKLIASWKCNNHIITSWILNSVSKEIAASIICTGTAKDVWDELAERF-----KRIFQLRKELANITQ

Query:  GTMTIEAYFTKIKTVWQDLIEFRPDVECTCG-----ALKPFIEHLNSEYVMIFLMG--LNETYAAVRTQILLMEPLPSINKAFSLIVQEE
        G  ++E YF K+  VW +L E+ P  EC CG       K   E    E    FLMG  LN+ + AV T+I+  +P PS+++AF+++   E
Subjt:  GTMTIEAYFTKIKTVWQDLIEFRPDVECTCG-----ALKPFIEHLNSEYVMIFLMG--LNETYAAVRTQILLMEPLPSINKAFSLIVQEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGACAAATCAAACTCTAGCAATGTCGATGTACCAGATCTGGATTCGCAACTCAATCCCTCAGATTCGCAACTCAATCCCTCAGATTCGCAACTCAATCCTTTGGA
TCTGCAACTCAATCCATTTTTGATGCATCATTCGTACACTGCAACATCGATACTAGTCAATCAGCCCCTCATCGGAGCCGATAACTATAATTCATGGAAGCGAGCTTTCC
TCATAGCTCTTGCAGGGAAGAATAAGGAAGGATTCGTGAACGGATCCATTGAAAAACCGAAGAATGAGAAACTTATAGCATCCTGGAAATGTAACAATCACATAATCACT
TCTTGGATTCTGAACTCTGTTTCTAAAGAGATTGCAGCAAGTATAATTTGTACAGGCACTGCAAAAGATGTTTGGGACGAACTTGCTGAACGATTCAAACGCATCTTTCA
ACTAAGGAAAGAACTAGCGAATATCACTCAAGGAACAATGACGATAGAGGCTTACTTCACGAAGATCAAAACGGTATGGCAAGATCTCATTGAATTCCGTCCTGATGTTG
AATGTACTTGTGGTGCTCTCAAACCTTTCATCGAACACCTTAACAGCGAGTACGTTATGATATTTCTTATGGGTCTTAATGAGACCTATGCTGCTGTAAGAACACAGATT
CTTTTGATGGAGCCCTTACCTTCCATCAACAAAGCCTTTTCGCTTATTGTACAAGAAGAACGACAGAGAATTATTGGAAATTCCTCAAAGGAAGTCATCACCTTGATGGC
TTACAACAATACGAAGAAAAATAGTGTTGACAATTCAAACAAAAGGAGAGAGACGATCAAATCTGTCTGTACTCACTGTGGATTCAAAGGCCACACCACCGATAAGTGCT
ATAGGATTCATGGATTTCCTCCTGGGTTCAAAAGCCGAAGGAACATCAATGAGAATGGTTCTCAGAATGCAGGAACAAACTCAGTATCACAGGGTTCGTCTAGTTTTTTG
GACAATCTTGATAAGAATCAATGCACCCAGTTAATTGAAATGCTTAATTCTCGATTGCAAGATGAAAAGAAGACTGCCATAGCCTCTGCAGTAAACCACATTTCAGGTAT
AAACTCAGTTTCTTTATCAAAATCTCCAAATTCTTGGATATTAGATTCTGGTGCATCAAAGCATATCTGCTTCAATAGACAATCATTTACTAACCTTCATAGTATAAATG
ATATGAGTATAGTTCTACCTAATAATTTTCGTGTGAATGTTGAGTTTGTTGGAGATGTACAAATAAATTCAAGACTAATTCTACATGATGTTCTGCTGGTCCCTCAGTTT
ACCTATAACTTGATATCAGTCAGTTGTCTGTTAGCTTCCAATGATATAGCATTGGATTTTTCTGGAAAATGTTGCATTCTGCAGGACAAGAAGAATTTTCAGATGATTGG
CAAGGCTGATTGTACAAATGGCTTATACATCCTGGATTCTTGCATTAGTTCTATTGCTTGTTCAGTAACAGTTGATATTTGGCATAGAAGATTGGGACACTTATCATAA
mRNA sequenceShow/hide mRNA sequence
AAAAAAAGAAAGATTTCAAGATGGAGACTGAACCTCCTCCTCTTCCTCCCGTTCCTCTTCCTCACCCTCCTCTTCCTCATATTGTGGATATTGAGCCTCTTGCTCCTCGC
GTCCCTCCTCCTCCTGCGGCTTTCGAGATTGAACCTCCTCCTGCTGTAGAGAAACAAATTTGGCTTCAATTTCTTGATGTTATACTTGACATGGCAATCGTGATCATACA
AAAAGAGAACCCTTATGTCAAAGTACAAGACCTCCGATACTGCTCCATCTCGAGTATTCGATTTCAATTTGGGTATGGTTTTTTCTACTCGTATACAATAGTAAACTTAG
TTTCTTTTTATAAAAAAAATCGATGAAATAATTTTAATAGTTAAATTAAATGAACCCACTAAAATTAACAATCGTACATTTAAGGAAAAAAATTAAAATAATGTATGTGA
ACTTTTTTTTAGTTAGTATAATTAGACTGACATAATAAATGTTTTAATGATCAAAACGTATTTGAACTAATGAATTTGATTTTGTCGATATAAATTTATGAGAAAAACTT
GTCTCCTTAGCTAAGGAGGCAATGTTGGAAAAATTATAATTAGTTTTTTATTTAATTATAATTATTTATTAAGTGGTCCATGTTTTTAGGAAACTACCCCCACACTAGTT
TTGTAACTTCCCACTATTTTTTTTTATCCATTTATTTTCTGTTAATGGTAGAGAAAATAATCGATAGATTATTCAGAACACATTACAGAGAATTAAAGAGAGTTTTTACA
GAGAACTCTGTTGGATTTCTACTCTTGGTCCCCTATGCCCACTTTTACATTAATAAACCACACCATTTGAGTTAGGAAGAGTGAGGGTCTAGAAGAACTAAGAACTCCGC
GGCCGATGTGTTTATTGTAGTTCCAGGTCAAATTGTCATTCGTCGATTACCAAAAACAATGGTGGGAGGTATACGCTTTATTATTTTTAATTACGTTTCCGCTGCAGGTG
ATTAAATTTGTTAATTACATTTTATTATAGCAAAAGCTCAAATTGTTATTACAGTTTTCTTGTTAGCCCAGTTGTTAGCTTACCTGAGAGTTTGTTAAAAGAGTTTGTTA
AAACATCTCTCTATATAACGAGGATGTACTTTGCTAATTCAAATAACAAAAAATTCAAGATTGCATTTTACCCATTTTCATCTTATATGGTATCAGAGCAGCTTCCGAGC
ATGCTCTTCTCTTTCCTATGATTTTCATCCAAAACAATTAAGCCTCGAAGCTTTATTCGTCTTCAATGGCTGACAAATCAAACTCTAGCAATGTCGATGTACCAGATCTG
GATTCGCAACTCAATCCCTCAGATTCGCAACTCAATCCCTCAGATTCGCAACTCAATCCTTTGGATCTGCAACTCAATCCATTTTTGATGCATCATTCGTACACTGCAAC
ATCGATACTAGTCAATCAGCCCCTCATCGGAGCCGATAACTATAATTCATGGAAGCGAGCTTTCCTCATAGCTCTTGCAGGGAAGAATAAGGAAGGATTCGTGAACGGAT
CCATTGAAAAACCGAAGAATGAGAAACTTATAGCATCCTGGAAATGTAACAATCACATAATCACTTCTTGGATTCTGAACTCTGTTTCTAAAGAGATTGCAGCAAGTATA
ATTTGTACAGGCACTGCAAAAGATGTTTGGGACGAACTTGCTGAACGATTCAAACGCATCTTTCAACTAAGGAAAGAACTAGCGAATATCACTCAAGGAACAATGACGAT
AGAGGCTTACTTCACGAAGATCAAAACGGTATGGCAAGATCTCATTGAATTCCGTCCTGATGTTGAATGTACTTGTGGTGCTCTCAAACCTTTCATCGAACACCTTAACA
GCGAGTACGTTATGATATTTCTTATGGGTCTTAATGAGACCTATGCTGCTGTAAGAACACAGATTCTTTTGATGGAGCCCTTACCTTCCATCAACAAAGCCTTTTCGCTT
ATTGTACAAGAAGAACGACAGAGAATTATTGGAAATTCCTCAAAGGAAGTCATCACCTTGATGGCTTACAACAATACGAAGAAAAATAGTGTTGACAATTCAAACAAAAG
GAGAGAGACGATCAAATCTGTCTGTACTCACTGTGGATTCAAAGGCCACACCACCGATAAGTGCTATAGGATTCATGGATTTCCTCCTGGGTTCAAAAGCCGAAGGAACA
TCAATGAGAATGGTTCTCAGAATGCAGGAACAAACTCAGTATCACAGGGTTCGTCTAGTTTTTTGGACAATCTTGATAAGAATCAATGCACCCAGTTAATTGAAATGCTT
AATTCTCGATTGCAAGATGAAAAGAAGACTGCCATAGCCTCTGCAGTAAACCACATTTCAGGTATAAACTCAGTTTCTTTATCAAAATCTCCAAATTCTTGGATATTAGA
TTCTGGTGCATCAAAGCATATCTGCTTCAATAGACAATCATTTACTAACCTTCATAGTATAAATGATATGAGTATAGTTCTACCTAATAATTTTCGTGTGAATGTTGAGT
TTGTTGGAGATGTACAAATAAATTCAAGACTAATTCTACATGATGTTCTGCTGGTCCCTCAGTTTACCTATAACTTGATATCAGTCAGTTGTCTGTTAGCTTCCAATGAT
ATAGCATTGGATTTTTCTGGAAAATGTTGCATTCTGCAGGACAAGAAGAATTTTCAGATGATTGGCAAGGCTGATTGTACAAATGGCTTATACATCCTGGATTCTTGCAT
TAGTTCTATTGCTTGTTCAGTAACAGTTGATATTTGGCATAGAAGATTGGGACACTTATCATAAAAAAGGCTCGAGTTAATGAAGGACCAATTACAATATTCAGAACATC
CTTCTAAGCATTGTGATATCTGTCCCCTAGCTAAACAAAAAAGGCTATGTTTTCCCTTTAACAACAATGTAGCAAAGAATGTTTTCGATTTAATACATTGTGATATATGG
GGACCCTTTAAAGTTCCAACATATAAAGGATATAAATACTTCCTTACCATTGTAGATGATTGTTCTCGATACACTTAGGTTTTTCTTATGACCTCTAAGAGTGATGCTCT
TCATATTGTGCCTAACTTTTTTAAGCTAGTTGAGACACAATTTTCTAAGAAAATAAAATTGTTTAGATCTGACAATGCTCATGAGCTCAAGTTTACAGAATTTTTTGCTT
CAGTTGGAACGCTGCATCAATTTTCTTGTGTAGAAACTCCTCAACAAAATTCTGTTGTAGAAAGAAAACACCAACATCTTTTAAATGTTGCTAGGTCTCTATATTTCCAA
TCCCAAGTTCCTTTAAGATTTTGAGGAGATTGCATCCTTATAGCAACCTACATTGTTAATAGAATCCCTATGCCTCTTCTAAACAATGAAACTCCATATTCTATCCTGCA
TCAACAAGAGGCAAACTATTCTGATCTTAAGTCCTTCGGATGTTTATGTTATGCTTCCACTTTAAAAGCAAATAGAGGTAAGTTTGATGCAAGAGCCAGCAGATGTGCTT
TCATTGGCTATCCTCCAGAGGTAAAAGGATACAAAGTGTATGACATGAAAACCAAGCAGATCTTTATTTCAAGGGATGTAGTTTTTTTTTTAAAACTCTTATCCTTTTCA
TGTCTTAAGCACAGAAGAAAAAGACTCAGTAGATGATATCTTTGCAGAAACTATCCTTCCTTTACCAATAATCTCTCCTGAGGTCCAGGGACAATACATAAATATACAAG
GATTACCCACTCTTAATCAAGGTTTTCCTACTGCTGATGCCACTATTGATGAGATAAACACAGTGAATCAAAATGATGACTGCAACATTGATTGCTCACAAGAAACTGAC
ATTTTTGCTCATGAACATTCTGATCAGCAAGCCTTAAATGAGAACATTGATTCCTTGTCTCTGCAAAACAACCTTTGTGAAGTCCCTAATATAGATCAGTCTGGCGTTCA
AAGTGGAGAACAACCAGAGACAACAAACATGAATCAATTTGAGATGCACGAAGTTGAACAGATTGTTGGGAATCCTCATGAGTCTTCTTCAAGTAGTAATCATGATGGGT
ATCATGAAAATCTGGAACAAGTCTTAATTCAACCAACCATAGATCCTGCTGTATCACAACTTAGAAGGTCTTCAAGAACTCATAAACCTCCTGGTTTTCTCCAAGATTAT
CACTGTAACATGCTGCTTCACAAGAATAATCAATCTACTGGTCAGTACTCCATACATAAATTTTTGTGTTATGACAAATTGAGACCAGCCTATCAAAACTACATATTGAA
TGTCTCAATGATTTTTGAACCTTCATATTATCATCAAGCTATAAAATTCCAGCATTGGAAAAGTGCAATGGATGAAGAGATAGCTGCCTTGGAGAAAAATAATACTTGGA
CTATTGTCCCCTTACCTTCGGGACATCACTCAATAAGTTGCAAATGGGTATATTGAGTAAAATACAAACCTGATGGCTCTGTTGACCGCTTTAAGGCTCGACTTGTAGCA
AGAGGTTTTAATCAGCAAGAAGGAGTTGATTTCCTAGAAACATTCTCTCCCGTTGCAAAAATTGTAACAGTTCGCTTATTCATTGCTATATCTGCCTCTTTCGGGTGGAA
TATTTTTCAAATGGATGTCAACAATGCCTTTTTAAATGGAGACTTGTTTGAAGAAGTATATATGACTCTTCCTATTGGATACTATCCTCAGAGAAATGATTCAACAACCA
CACCTATGGTTTGTAAGCTTCAAAAATCTATTTATGGACTTAAACAAGCTTCAAGACAATGGTTTCACAAGTTTTCTTGTGTTTTATTGTCCTCTGGTTTTTCCCAATCG
AAGGCAGATTATTCACTCTTCACAAAGGGAAGCAAAGAGACATTTGTTGCATTGCTTGTATATGTGGATGACATTCTTATAACTGGCCCATGTTCAGCTGAAGTAAACAA
GGTAAAAATGATGTTAAAACATCATTTTTCATTGAAAGATCTTGGAAGAGCAGCCTATTTTCTAGGCCTCGAATTATCTCGAACATCTCAAGGAATTTATATCACACAAA
GAAAGTATTGTCTCCAGATATTGGAAGATCATGGTTTGCTTGCCTCTAAACCAGTTTCTCAACCAATGACTCCAAATATCAAACTGGCAGCTACAACAGGAGAATTACTT
AGCAGTGTTGAAACAACCAACTTCAGAAGGATTATTGGCAGATTGATTTATCTGCAAATTTCCAGGCCTGATATCACCTATCCAGTTCACAAATTAAGTCAATTTCTAGC
AAAACCTACCACTGAGCACATGAAGGCAGCAGTTTACTTGTTAAGATATCTAAAAGGCACACCTGGACAAGGAATTTTACTACATGCTCACACTGATTTCCACTTAAAAG
CTTTTGTCGACGCTGACTGGGGAGCATGCCTCGATTCACGACGGTCTACAACAGGCTTTTGTATATTTCTTGGAAAATCCATGATATCGTGGAAATCAAAGAAGCAATCT
ACAGTATCTAGATCCTCAGCAGAAGCTGAATATCGAGCTTTAGCAGGAATAACAAGTGAAGTTACTTGGTTATGTAGTCTTATGAAAGATCTACTCATCACACCAAGACA
TCCCATAGTCATTTTCTGTGATAATCTAGCAGCTATCTCAATAGCTTCAAATCCAACCTTTCATGAAAGAACAAAACATATAGAGATTGATTGTCATTTTGTAAGGGAAA
AGATAGAGAAAGGCCTTATTAAGCTGCTTCCAATCCGTTCAAAATCTCAGTTAGATGACATGTTC
Protein sequenceShow/hide protein sequence
MADKSNSSNVDVPDLDSQLNPSDSQLNPSDSQLNPLDLQLNPFLMHHSYTATSILVNQPLIGADNYNSWKRAFLIALAGKNKEGFVNGSIEKPKNEKLIASWKCNNHIIT
SWILNSVSKEIAASIICTGTAKDVWDELAERFKRIFQLRKELANITQGTMTIEAYFTKIKTVWQDLIEFRPDVECTCGALKPFIEHLNSEYVMIFLMGLNETYAAVRTQI
LLMEPLPSINKAFSLIVQEERQRIIGNSSKEVITLMAYNNTKKNSVDNSNKRRETIKSVCTHCGFKGHTTDKCYRIHGFPPGFKSRRNINENGSQNAGTNSVSQGSSSFL
DNLDKNQCTQLIEMLNSRLQDEKKTAIASAVNHISGINSVSLSKSPNSWILDSGASKHICFNRQSFTNLHSINDMSIVLPNNFRVNVEFVGDVQINSRLILHDVLLVPQF
TYNLISVSCLLASNDIALDFSGKCCILQDKKNFQMIGKADCTNGLYILDSCISSIACSVTVDIWHRRLGHLS