; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011437 (gene) of Snake gourd v1 genome

Gene IDTan0011437
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG08:50554562..50557499
RNA-Seq ExpressionTan0011437
SyntenyTan0011437
Gene Ontology termsNA
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]5.0e-17449.78Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+Y++QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
        RS + ILQS + KFRTF+ TL + +ILPFKDEP  L++PP+KY HIDQ QW +FVNARLSEEWE                                L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA
        PS RA LW +ARKGKNNEYFDD T++C  RI +ELA  +KG+DILT ++  +    RVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK +
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA

Query:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF
               K KE+++  EEI       +EG PCHLA+ S DN++ VGT++ ++ Q  T+HGVPLG +NVRV+VD+++ E A +PIP+RGE+E+L+Q +G F
Subjt:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF

Query:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA
        VAWPR LVI ++ K ++SS   ++   +       +K+TD HV+IKLLNRY ML MQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ 
Subjt:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA

Query:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM
        +I              FL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++
Subjt:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM

Query:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV
        WQAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++WADF+G  V
Subjt:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]9.4e-17349.78Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+Y++QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
        RS + ILQS + KFRTF+ TL + +ILPFKDEP  L++PP+KY HIDQ QW +FVNARLSEEWE                                L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA
        PS RA LW +ARKGKNNEYFDD T++C  RI +ELA  +KG+DILT ++  +    RVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK +
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA

Query:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF
               K KE+++  EEI       +EG PCHLA+ S DN++ VGT++ ++ Q  T+HGVPLG +NVRV+VD+++ E A +PIP+RGE+E+L+Q +G F
Subjt:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF

Query:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA
        VAWPR LVI ++ K ++SS   ++   +       +K+TD HV+IKLLNRY ML MQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ 
Subjt:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA

Query:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM
        +I              FL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++
Subjt:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM

Query:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV
        WQAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++WADF+G  V
Subjt:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]1.1e-17651.52Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+Y++QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
        RS + ILQS + KFRTF+ TL + +ILPFKDEP  L++PP+KY HIDQ QW +FVNARLSEEWE                                L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA
        PS RA LW +ARKGKNNEYFDD T++C  RI +ELA  +KG+DILT ++  +    RVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK +
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA

Query:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF
               K KE+++  EEI       +EG PCHLA+ S DN++ VGT++ ++ Q  T+HGVPLG +NVRV+VD+++ E A +PIP+RGE+E+L+Q +G F
Subjt:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF

Query:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA
        VAWPR LVI ++ K ++SS   ++   +       +K+TD HV+IKLLNRY ML MQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ 
Subjt:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA

Query:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPR
        +I              FL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G            SL++WQAKHS+ +YR++  WK +KCP 
Subjt:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPR

Query:  QPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV
        Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++WADF+G  V
Subjt:  QPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]4.2e-17349.43Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+Y++ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
         S + ILQS + KFRTF+ TL Q++ILP+KDEPS L++PP+KYSHID+ QWE+FV ARLSEEWE                               +L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS-------
        P  RATLW +ARK KNNEY D  T++C  RI +ELA   KG+DILT ++     + R+RG+G  V P+ ++N+ + K K  +ES N+     S       
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS-------

Query:  ---------------------KKSASIGSN------HPKDKEVIDEVEEILEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIV
                             +K    G N       PK K V+ + EEILEG PCHLAIGS DN++ VGTM+ SDAQ  +I+ +PLG +NVR +VD+++
Subjt:  ---------------------KKSASIGSN------HPKDKEVIDEVEEILEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIV

Query:  GEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKL
        GED  LPIP + ++++L Q +GNFVAWPR LVI  K KK  S    KS        + S+KYTD HVTIKLLNRYAM  MQ DD IQ+ LSE + G+EK 
Subjt:  GEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKL

Query:  IYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---------------
        IYL RDDI+ YCGM EIGYSCI+A+I              F++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNTG               
Subjt:  IYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---------------

Query:  ---------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFI
                             SL+ WQAKHSL QYR+ I WK +KCPRQ G++ECGYYVQKYIREIV NS+  I NLFNT+ AY+Q+EID +R++WA+F+
Subjt:  ---------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFI

Query:  GRFV
         RFV
Subjt:  GRFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]3.2e-17349.5Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+Y++ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
         S + ILQS + KFRTF+ TL Q++ILP+KDEPS L++PP+KYSHID+ QWE+FV ARLSEEWE                               +L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS-------
        P  RATLW +ARK KNNEY D  T++C  RI +ELA   KG+DILT ++     + R+RG+G  V P+ ++N+ + K K  +ES N+     S       
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS-------

Query:  ---------------------KKSASIGSN------HPKDKEVIDEVEEILEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIV
                             +K    G N       PK K V+ + EEILEG PCHLAIGS DN++ VGTM+ SDAQ  +I+ +PLG +NVR +VD+++
Subjt:  ---------------------KKSASIGSN------HPKDKEVIDEVEEILEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIV

Query:  GEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKL
        GED  LPIP + ++++L Q +GNFVAWPR LVI  K KK  S    KS        + S+KYTD HVTIKLLNRYAM  MQ DD IQ+ LSE + G+EK 
Subjt:  GEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKL

Query:  IYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---------------
        IYL RDDI+ YCGM EIGYSCI+A+I              F++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNTG               
Subjt:  IYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---------------

Query:  --------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIG
                            SL+ WQAKHSL QYR+ I WK +KCPRQ G++ECGYYVQKYIREIV NS+  I NLFNT+ AY+Q+EID +R++WA+F+ 
Subjt:  --------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIG

Query:  RFV
        RFV
Subjt:  RFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X17.3e-17148.12Show/hide
Query:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+Y+++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFV+D  S
Subjt:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS

Query:  NNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAEDPS
         + ILQS + KFR+F+ TL Q +ILP+KDEPS L++PP+KYSHID+ QWE+FV ARLSEEWE                               +L+ DP 
Subjt:  NNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK--------
         RATLW +ARK KNN  FDD T++C+ RI +ELA   KG+DILT ++     + R+RG+G  V P+ + N+ R   K S++S +K     S+        
Subjt:  TRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK--------

Query:  -----------------------------KSASIGSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPL
                                     K    G   PK K V+ E EE LE            G PCHLAIGS DNV+ VG M+ SD Q  TIHG+PL
Subjt:  -----------------------------KSASIGSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPL

Query:  GDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQ
        G EN+RV VD+ + ED  LPIP++G++E+L+Q +GNFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM  MQ +D IQ
Subjt:  GDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQ

Query:  VTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG--
        ++LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ +I              F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNTG  
Subjt:  VTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG--

Query:  ----------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQE
                                          SL+ WQ +HS   YRS I WK +KCPR  GS+ECGYYVQKY+RE+V N++  I NLFNT  AY QE
Subjt:  ----------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQE

Query:  EIDEIRVQWADFIGRFV
        EID +RV+WA+F+ RFV
Subjt:  EIDEIRVQWADFIGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein7.3e-17148.12Show/hide
Query:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+Y+++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFV+D  S
Subjt:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS

Query:  NNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAEDPS
         + ILQS + KFR+F+ TL Q +ILP+KDEPS L++PP+KYSHID+ QWE+FV ARLSEEWE                               +L+ DP 
Subjt:  NNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK--------
         RATLW +ARK KNN  FDD T++C+ RI +ELA   KG+DILT ++     + R+RG+G  V P+ + N+ R   K S++S +K     S+        
Subjt:  TRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK--------

Query:  -----------------------------KSASIGSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPL
                                     K    G   PK K V+ E EE LE            G PCHLAIGS DNV+ VG M+ SD Q  TIHG+PL
Subjt:  -----------------------------KSASIGSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPL

Query:  GDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQ
        G EN+RV VD+ + ED  LPIP++G++E+L+Q +GNFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM  MQ +D IQ
Subjt:  GDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQ

Query:  VTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG--
        ++LSEH+FG+EK IYL RDDI+ YCGM EIGYSCI+ +I              F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNTG  
Subjt:  VTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVAFI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG--

Query:  ----------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQE
                                          SL+ WQ +HS   YRS I WK +KCPR  GS+ECGYYVQKY+RE+V N++  I NLFNT  AY QE
Subjt:  ----------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQE

Query:  EIDEIRVQWADFIGRFV
        EID +RV+WA+F+ RFV
Subjt:  EIDEIRVQWADFIGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.4e-17449.78Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+Y++QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
        RS + ILQS + KFRTF+ TL + +ILPFKDEP  L++PP+KY HIDQ QW +FVNARLSEEWE                                L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA
        PS RA LW +ARKGKNNEYFDD T++C  RI +ELA  +KG+DILT ++  +    RVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK +
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA

Query:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF
               K KE+++  EEI       +EG PCHLA+ S DN++ VGT++ ++ Q  T+HGVPLG +NVRV+VD+++ E A +PIP+RGE+E+L+Q +G F
Subjt:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF

Query:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA
        VAWPR LVI ++ K ++SS   ++   +       +K+TD HV+IKLLNRY ML MQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ 
Subjt:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA

Query:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM
        +I              FL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++
Subjt:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM

Query:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV
        WQAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++WADF+G  V
Subjt:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X35.2e-17751.52Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+Y++QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
        RS + ILQS + KFRTF+ TL + +ILPFKDEP  L++PP+KY HIDQ QW +FVNARLSEEWE                                L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA
        PS RA LW +ARKGKNNEYFDD T++C  RI +ELA  +KG+DILT ++  +    RVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK +
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA

Query:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF
               K KE+++  EEI       +EG PCHLA+ S DN++ VGT++ ++ Q  T+HGVPLG +NVRV+VD+++ E A +PIP+RGE+E+L+Q +G F
Subjt:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF

Query:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA
        VAWPR LVI ++ K ++SS   ++   +       +K+TD HV+IKLLNRY ML MQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ 
Subjt:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA

Query:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPR
        +I              FL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G            SL++WQAKHS+ +YR++  WK +KCP 
Subjt:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPR

Query:  QPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV
        Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++WADF+G  V
Subjt:  QPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X24.5e-17349.78Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+Y++QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED
        RS + ILQS + KFRTF+ TL + +ILPFKDEP  L++PP+KY HIDQ QW +FVNARLSEEWE                                L+ D
Subjt:  RSNNAILQSVANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------KLAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA
        PS RA LW +ARKGKNNEYFDD T++C  RI +ELA  +KG+DILT ++  +    RVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK +
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIRHARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSA

Query:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF
               K KE+++  EEI       +EG PCHLA+ S DN++ VGT++ ++ Q  T+HGVPLG +NVRV+VD+++ E A +PIP+RGE+E+L+Q +G F
Subjt:  SIGSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRVIVDMIVGEDAPLPIPIRGEVESLSQFMGNF

Query:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA
        VAWPR LVI ++ K ++SS   ++   +       +K+TD HV+IKLLNRY ML MQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+EIGYSCI+ 
Subjt:  VAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLHRDDILHYCGMVEIGYSCIVA

Query:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM
        +I              FL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++
Subjt:  FI--------------FLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRM

Query:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV
        WQAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++WADF+G  V
Subjt:  WQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGTTTAGTCATCCAATACAGCAATCAAGGGCAGAGTGTTGGGGATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAACTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGATCGACCCTCGGTCCAATAATGCTATTCTTCAATCA
GTGGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAACACATACTTCCATTTAAGGATGAGCCGTCCTTGTTGAAGCATCCTCCACAAAAGTATTCACATAT
TGATCAAAACCAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGAAATTAGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCACGAAAAG
GAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCATTTGTCGAATCGTAAACGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCAGAAGCATTAGG
CACGCCAGACCACAGAGGCGTGTTAGAGGATTAGGTATGTCTGTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAATCAAAATCAAGCAAAGAGTCTGGCAACAA
AATGTCGTGCTCACCTTCCAAAAAGTCTGCAAGCATAGGCAGTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATC
TAGCAATAGGATCAAAGGATAATGTGATTACTGTAGGCACAATGTACACGTCTGACGCTCAATTTTTCACAATCCATGGAGTTCCCTTAGGAGATGAAAATGTTAGAGTG
ATTGTGGACATGATCGTAGGTGAAGATGCTCCACTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATTTATGGGAAATTTTGTGGCATGGCCTCGTGACCT
TGTCATTTTTAATAAGGGAAAAAAGGTGGCTTCTTCAGCAAAACATAAGTCGGATGTTTCTGTACATGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCACGTGA
CTATTAAACTTCTGAATCGTTATGCAATGTTATTGATGCAAGAAGATGATACGATTCAAGTCACGTTGAGTGAGCACATGTTCGGGGAGGAAAAGTTAATTTATTTACAT
CGCGATGATATCCTGCATTACTGTGGGATGGTGGAGATAGGGTACTCCTGCATAGTCGCATTCATTTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAG
TCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATCCTTGAGAATGTGGCAAG
CCAAGCACTCACTTCCACAATATCGTTCTTCCATCACTTGGAAACTTGTAAAGTGCCCCCGTCAACCGGGTTCTGTAGAGTGTGGGTACTATGTACAGAAGTATATACGA
GAAATCGTATATAATTCTAGTATCCCTATAATGAACCTGTTTAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTACAATGGGCGGATTTTATTGG
CAGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGTGACGATGAAGTGAACGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGTTTAGTCATCCAATACAGCAATCAAGGGCAGAGTGTTGGGGATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAACTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGATCGACCCTCGGTCCAATAATGCTATTCTTCAATCA
GTGGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAACACATACTTCCATTTAAGGATGAGCCGTCCTTGTTGAAGCATCCTCCACAAAAGTATTCACATAT
TGATCAAAACCAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGAAATTAGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCACGAAAAG
GAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCATTTGTCGAATCGTAAACGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCAGAAGCATTAGG
CACGCCAGACCACAGAGGCGTGTTAGAGGATTAGGTATGTCTGTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAATCAAAATCAAGCAAAGAGTCTGGCAACAA
AATGTCGTGCTCACCTTCCAAAAAGTCTGCAAGCATAGGCAGTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATC
TAGCAATAGGATCAAAGGATAATGTGATTACTGTAGGCACAATGTACACGTCTGACGCTCAATTTTTCACAATCCATGGAGTTCCCTTAGGAGATGAAAATGTTAGAGTG
ATTGTGGACATGATCGTAGGTGAAGATGCTCCACTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATTTATGGGAAATTTTGTGGCATGGCCTCGTGACCT
TGTCATTTTTAATAAGGGAAAAAAGGTGGCTTCTTCAGCAAAACATAAGTCGGATGTTTCTGTACATGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCACGTGA
CTATTAAACTTCTGAATCGTTATGCAATGTTATTGATGCAAGAAGATGATACGATTCAAGTCACGTTGAGTGAGCACATGTTCGGGGAGGAAAAGTTAATTTATTTACAT
CGCGATGATATCCTGCATTACTGTGGGATGGTGGAGATAGGGTACTCCTGCATAGTCGCATTCATTTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAG
TCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATCCTTGAGAATGTGGCAAG
CCAAGCACTCACTTCCACAATATCGTTCTTCCATCACTTGGAAACTTGTAAAGTGCCCCCGTCAACCGGGTTCTGTAGAGTGTGGGTACTATGTACAGAAGTATATACGA
GAAATCGTATATAATTCTAGTATCCCTATAATGAACCTGTTTAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTACAATGGGCGGATTTTATTGG
CAGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MSGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYSNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRSNNAILQS
VANKFRTFRYTLYQKHILPFKDEPSLLKHPPQKYSHIDQNQWEAFVNARLSEEWEKLAEDPSTRATLWIQARKGKNNEYFDDETKQCICRIVNELAMKNKGKDILTRSIR
HARPQRRVRGLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSKKSASIGSNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVITVGTMYTSDAQFFTIHGVPLGDENVRV
IVDMIVGEDAPLPIPIRGEVESLSQFMGNFVAWPRDLVIFNKGKKVASSAKHKSDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLLMQEDDTIQVTLSEHMFGEEKLIYLH
RDDILHYCGMVEIGYSCIVAFIFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTGSLRMWQAKHSLPQYRSSITWKLVKCPRQPGSVECGYYVQKYIR
EIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWADFIGRFV