; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002761 (gene) of Snake gourd v1 genome

Gene IDTan0002761
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG06:39197517..39200398
RNA-Seq ExpressionTan0002761
SyntenyTan0002761
Gene Ontology termsNA
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]4.7e-18051.62Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEEWE                               +L+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++C  RIDELA  +KG+DILTEAL          G+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS

Query:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV
              K KE++N  EEI       +EG PCHL + S DN+V VGT++ ++ Q P VHGV LG +NVRV+VD+++ E A +PIP+ GE+E+L+Q++  FV
Subjt:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +F +EK IYL R+DI+ YC M+EIGYSCI TY
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY

Query:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW
        IAYLW V +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++W
Subjt:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW

Query:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV
        QAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEID++R++WADFVG  V
Subjt:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]8.8e-17951.62Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEEWE                               +L+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++C  RIDELA  +KG+DILTEAL          G+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS

Query:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV
              K KE++N  EEI       +EG PCHL + S DN+V VGT++ ++ Q P VHGV LG +NVRV+VD+++ E A +PIP+ GE+E+L+Q++  FV
Subjt:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +F +EK IYL R+DI+ YC M+EIGYSCI TY
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY

Query:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW
        IAYLW V +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++W
Subjt:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW

Query:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV
        QAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEID++R++WADFVG  V
Subjt:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]1.0e-18253.44Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEEWE                               +L+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++C  RIDELA  +KG+DILTEAL          G+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS

Query:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV
              K KE++N  EEI       +EG PCHL + S DN+V VGT++ ++ Q P VHGV LG +NVRV+VD+++ E A +PIP+ GE+E+L+Q++  FV
Subjt:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +F +EK IYL R+DI+ YC M+EIGYSCI TY
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY

Query:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPHQ
        IAYLW V +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G            SL++WQAKHS+ +YR++  WK +KCP Q
Subjt:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPHQ

Query:  PGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV
         GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEID++R++WADFVG  V
Subjt:  PGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]3.0e-17951.21Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
         SK+ ILQSA+ KFRTF+ TL Q++ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEEWE                               EL+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS--------
        P  RATLW +ARK KNNEY D  T++C  RIDELA   KG+DILTEAL          G+G  V P+ ++N+ + K K  +ES N+     S        
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS--------

Query:  --------------------KKSASIGSN------NPKDKEVINEVEEILEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVG
                            +K    G N       PK K V+ + EEILEG PCHL IGS DN+V VGTM+ SDAQ P ++ + LG +NVR +VD+++G
Subjt:  --------------------KKSASIGSN------NPKDKEVINEVEEILEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVG

Query:  EDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLI
        ED  LPIP   ++++L Q++ NFVAWPR LVI  K KK  SP   KS        + S+KYTD HVTIKLLNRYAM SMQ DD IQ+ LSE +  +EK I
Subjt:  EDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLI

Query:  YLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG----------------
        YL RDDI+ YCGM EIGYSCI  YIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNTG                
Subjt:  YLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG----------------

Query:  --------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVG
                            SL+ WQAKHSL QYR+ I WK +KCP Q G++ECGYYVQKYIREIV NS+  I NLFNT+ AY+Q+EID +R++WA+FV 
Subjt:  --------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVG

Query:  RFV
        RFV
Subjt:  RFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]2.3e-17951.28Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
         SK+ ILQSA+ KFRTF+ TL Q++ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEEWE                               EL+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS--------
        P  RATLW +ARK KNNEY D  T++C  RIDELA   KG+DILTEAL          G+G  V P+ ++N+ + K K  +ES N+     S        
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPS--------

Query:  --------------------KKSASIGSN------NPKDKEVINEVEEILEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVG
                            +K    G N       PK K V+ + EEILEG PCHL IGS DN+V VGTM+ SDAQ P ++ + LG +NVR +VD+++G
Subjt:  --------------------KKSASIGSN------NPKDKEVINEVEEILEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVG

Query:  EDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLI
        ED  LPIP   ++++L Q++ NFVAWPR LVI  K KK  SP   KS        + S+KYTD HVTIKLLNRYAM SMQ DD IQ+ LSE +  +EK I
Subjt:  EDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLI

Query:  YLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG----------------
        YL RDDI+ YCGM EIGYSCI  YIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNTG                
Subjt:  YLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG----------------

Query:  -------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGR
                           SL+ WQAKHSL QYR+ I WK +KCP Q G++ECGYYVQKYIREIV NS+  I NLFNT+ AY+Q+EID +R++WA+FV R
Subjt:  -------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGR

Query:  FV
        FV
Subjt:  FV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X14.4e-17649.72Show/hide
Query:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDTRS
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFV+D  S
Subjt:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDTRS

Query:  KNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAEDPS
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEEWE                               EL+ DP 
Subjt:  KNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK---------
         RATLW +ARK KNN  FDD T++CV RIDELA   KG+DILTEAL          G+G  V P+ + N+ R   K S++S +K     S+         
Subjt:  TRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK---------

Query:  ----------------------------KSASIGSNNPKDKEVINEVEEILE------------GTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLG
                                    K    G   PK K V+ E EE LE            G PCHL IGS DNVV VG M+ SD Q P +HG+ LG
Subjt:  ----------------------------KSASIGSNNPKDKEVINEVEEILE------------GTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLG

Query:  DENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV
         EN+RV VD+ + ED  LPIP+ G++E+L+Q++ NFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ+
Subjt:  DENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV

Query:  TLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---
        +LSEH+F +EK IYL RDDI+ YCGM EIGYSCI TYIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNTG   
Subjt:  TLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---

Query:  ---------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEE
                                         SL+ WQ +HS   YRS I WK +KCP   GS+ECGYYVQKY+RE+V N++  I NLFNT  AY QEE
Subjt:  ---------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEE

Query:  IDDIRVQWADFVGRFV
        ID +RV+WA+FV RFV
Subjt:  IDDIRVQWADFVGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein4.4e-17649.72Show/hide
Query:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDTRS
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFV+D  S
Subjt:  SSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDTRS

Query:  KNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAEDPS
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEEWE                               EL+ DP 
Subjt:  KNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK---------
         RATLW +ARK KNN  FDD T++CV RIDELA   KG+DILTEAL          G+G  V P+ + N+ R   K S++S +K     S+         
Subjt:  TRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSK---------

Query:  ----------------------------KSASIGSNNPKDKEVINEVEEILE------------GTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLG
                                    K    G   PK K V+ E EE LE            G PCHL IGS DNVV VG M+ SD Q P +HG+ LG
Subjt:  ----------------------------KSASIGSNNPKDKEVINEVEEILE------------GTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLG

Query:  DENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV
         EN+RV VD+ + ED  LPIP+ G++E+L+Q++ NFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ+
Subjt:  DENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV

Query:  TLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---
        +LSEH+F +EK IYL RDDI+ YCGM EIGYSCI TYIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNTG   
Subjt:  TLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG---

Query:  ---------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEE
                                         SL+ WQ +HS   YRS I WK +KCP   GS+ECGYYVQKY+RE+V N++  I NLFNT  AY QEE
Subjt:  ---------------------------------SLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEE

Query:  IDDIRVQWADFVGRFV
        ID +RV+WA+FV RFV
Subjt:  IDDIRVQWADFVGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.3e-18051.62Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEEWE                               +L+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++C  RIDELA  +KG+DILTEAL          G+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS

Query:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV
              K KE++N  EEI       +EG PCHL + S DN+V VGT++ ++ Q P VHGV LG +NVRV+VD+++ E A +PIP+ GE+E+L+Q++  FV
Subjt:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +F +EK IYL R+DI+ YC M+EIGYSCI TY
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY

Query:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW
        IAYLW V +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++W
Subjt:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW

Query:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV
        QAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEID++R++WADFVG  V
Subjt:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X34.9e-18353.44Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEEWE                               +L+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++C  RIDELA  +KG+DILTEAL          G+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS

Query:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV
              K KE++N  EEI       +EG PCHL + S DN+V VGT++ ++ Q P VHGV LG +NVRV+VD+++ E A +PIP+ GE+E+L+Q++  FV
Subjt:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +F +EK IYL R+DI+ YC M+EIGYSCI TY
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY

Query:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPHQ
        IAYLW V +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G            SL++WQAKHS+ +YR++  WK +KCP Q
Subjt:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG------------SLRMWQAKHSLPQYRSSITWKLVKCPHQ

Query:  PGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV
         GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEID++R++WADFVG  V
Subjt:  PGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X24.3e-17951.62Show/hide
Query:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDT

Query:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEEWE                               +L+ D
Subjt:  RSKNAILQSATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++C  RIDELA  +KG+DILTEAL          G+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEAL----------GLGMSVKPSTYFNIPRVKSKSSKESGNKMS---CSPSKKSAS

Query:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV
              K KE++N  EEI       +EG PCHL + S DN+V VGT++ ++ Q P VHGV LG +NVRV+VD+++ E A +PIP+ GE+E+L+Q++  FV
Subjt:  IGSNNPKDKEVINEVEEI-------LEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAPLPIPIWGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +F +EK IYL R+DI+ YC M+EIGYSCI TY
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMVEIGYSCIETY

Query:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW
        IAYLW V +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +G                                   SL++W
Subjt:  IAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTG-----------------------------------SLRMW

Query:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV
        QAKHS+ +YR++  WK +KCP Q GSVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEID++R++WADFVG  V
Subjt:  QAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGTGACGATGAAGTGAATGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGGGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGTGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGATCGACACTCGGTCCAAAAATGCTATTCTTCAATCA
GCGACAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAACACATACTTCCATTTAAGGATGAGTCGTCCTTGTTGAAGCATCCTCCACAAAAGTATTCACATAT
TGATCAAAACCAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCACGAAAAG
GAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCGTTGGTCGAATCGACGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGATTA
GGTATGTCTGTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAATCAAAATCAAGTAAAGAGTCTGGCAACAAAATGTCGTGCTCACCTTCCAAAAAGTCTGCAAG
CATAGGCAGTAATAATCCAAAAGACAAGGAGGTCATTAATGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAGTAATAGGATCAAAGGATAATGTGGTTGTTG
TAGGCACAATGTACACGTCTGACGCTCAATTTCCCATAGTCCATGGAGTTCTCTTAGGAGATGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCA
CTACCAATTCCTATATGGGGAGAAGTAGAGTCCCTGAGTCAATCTATGGAAAATTTTGTGGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGAAAAAAGGTGGCTTC
TCCTGCAAAACATAAGTCGGATGTTTTTGTACGTGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCACGTGACTATTAAACTTCTGAATCGTTATGCAATGTTAT
CGATGCAAGAAGATGATACGATTCAAGTCACGTTGAGTGAGCACATGTTCGAGGAGGAAAAGTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTG
GAGATAGGGTACTCCTGCATAGAAACATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGTCAAGTTCTTGCTAGTTGACCAAATAACCATTTCTAATTT
TGTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATCCTTGAGAA
TGTGGCAAGCCAAGCACTCACTTCCACAATATCGTTCATCCATCACTTGGAAACTTGTAAAGTGCCCCCATCAACCGGGTTCTGTAGAGTGCGGGTACTATGTACAGAAG
TATATACGAGAAATCGTATATAATTCTAGTATCCCTATAATGAACCTGTTTAACACAAAAACTGCTTATAAACAAGAAGAAATCGACGACATTCGAGTACAATGGGCGGA
TTTTGTTGGCAGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGTGACGATGAAGTGAATGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGGGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGTGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGATCGACACTCGGTCCAAAAATGCTATTCTTCAATCA
GCGACAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAACACATACTTCCATTTAAGGATGAGTCGTCCTTGTTGAAGCATCCTCCACAAAAGTATTCACATAT
TGATCAAAACCAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCACGAAAAG
GAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCGTTGGTCGAATCGACGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGATTA
GGTATGTCTGTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAATCAAAATCAAGTAAAGAGTCTGGCAACAAAATGTCGTGCTCACCTTCCAAAAAGTCTGCAAG
CATAGGCAGTAATAATCCAAAAGACAAGGAGGTCATTAATGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAGTAATAGGATCAAAGGATAATGTGGTTGTTG
TAGGCACAATGTACACGTCTGACGCTCAATTTCCCATAGTCCATGGAGTTCTCTTAGGAGATGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCA
CTACCAATTCCTATATGGGGAGAAGTAGAGTCCCTGAGTCAATCTATGGAAAATTTTGTGGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGAAAAAAGGTGGCTTC
TCCTGCAAAACATAAGTCGGATGTTTTTGTACGTGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCACGTGACTATTAAACTTCTGAATCGTTATGCAATGTTAT
CGATGCAAGAAGATGATACGATTCAAGTCACGTTGAGTGAGCACATGTTCGAGGAGGAAAAGTTAATTTATTTACATCGCGATGATATCCTGCATTACTGTGGGATGGTG
GAGATAGGGTACTCCTGCATAGAAACATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGTCAAGTTCTTGCTAGTTGACCAAATAACCATTTCTAATTT
TGTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTGGATCCTTGAGAA
TGTGGCAAGCCAAGCACTCACTTCCACAATATCGTTCATCCATCACTTGGAAACTTGTAAAGTGCCCCCATCAACCGGGTTCTGTAGAGTGCGGGTACTATGTACAGAAG
TATATACGAGAAATCGTATATAATTCTAGTATCCCTATAATGAACCTGTTTAACACAAAAACTGCTTATAAACAAGAAGAAATCGACGACATTCGAGTACAATGGGCGGA
TTTTGTTGGCAGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MSGSSDDEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDTRSKNAILQS
ATNKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEEWEELAEDPSTRATLWIQARKGKNNEYFDDETKQCVGRIDELAMKNKGKDILTEALGL
GMSVKPSTYFNIPRVKSKSSKESGNKMSCSPSKKSASIGSNNPKDKEVINEVEEILEGTPCHLVIGSKDNVVVVGTMYTSDAQFPIVHGVLLGDENVRVVVDMIVGEDAP
LPIPIWGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVRVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSEHMFEEEKLIYLHRDDILHYCGMV
EIGYSCIETYIAYLWTVCDYEIIVKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTGSLRMWQAKHSLPQYRSSITWKLVKCPHQPGSVECGYYVQK
YIREIVYNSSIPIMNLFNTKTAYKQEEIDDIRVQWADFVGRFV