CuGenDBv2

Tan0003553 (gene) of Snake gourd v1 genome

Gene ID: Tan0003553
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Transposase
Genome location: LG11:36801641..36804542
RNA-Seq Expression: Tan0003553
Synteny: Tan0003553
Gene Ontology terms: NA
InterPro domains: IPR004264 - Transposase, Tnp1/En/Spm-like; IPR038765 - Papain-like cysteine peptidase superfamily
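The genome location in this record is written as a sequence name followed by a start..end range. A minimal sketch of how such a string could be split and its span computed is given here. The function names `parse_location` and `span_length` are hypothetical, and the span assumes 1-based inclusive coordinates, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG11:36801641..36804542")
print(seqid, start, end, span_length(start, end))
# prints: LG11 36801641 36804542 2902
```

Under that assumption the gene spans 2902 bp on LG11. If the coordinates were 0-based or half-open instead, the span would differ by one.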


Homology
GenBank top hits (e value, % identity, alignment)
XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]; e value: 1.6e-172; % identity: 48.88
Query:  SSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRS
        SS D+ NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ V +NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFVVD  S
Subjt:  SSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRS

Query:  KNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI----------
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L++PP+KYSHID+K+WE+FV ARLSEEWEV                      Y  L           
Subjt:  KNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI----------

Query:  --------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-------
                A K+KNN  FD+AT++CV RIDELA   KG DILTEALGTPEHRG +RG+G  V+P+ + N+ RG  K S++S +K  T  S  +       
Subjt:  --------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-------

Query:  ----------------------------KSASIGTNHPKDKEVIDEVKEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG
                                    K    G   PK K V+ E +E LE            G PCHLAIGS DNVVAVG M+ SD Q PT+HG+PLG
Subjt:  ----------------------------KSASIGTNHPKDKEVIDEVKEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG

Query:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV
         EN+RV VD+ + ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ+
Subjt:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV

Query:  KLSEHMFGEEKLIYLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----
         LSEH+FG+EK IYL RDDI+ YCGM EI Y         LW VC++EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNT    
Subjt:  KLSEHMFGEEKLIYLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----

Query:  --------------------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREE
                                        +SL+ WQ +HS   YRS I WK +KCPR   S+EC Y+VQKY+RE+V N++  I NLFNT  AY +EE
Subjt:  --------------------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREE

Query:  IDEIRVQWADFVGRFV
        ID +RV+WA+FV RFV
Subjt:  IDEIRVQWADFVGRFV

XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]; e value: 5.1e-171; % identity: 50.15
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ + +NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ++W +FVNARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS
                  A K KNNEYFD+AT++C  RIDELA  +KG DILTEALGT EH G VRG+G  V+PS YFN+ +GKSK+ +   NK +T+ S PSKK + 
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS

Query:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV
              K KE+++  +EI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++  FV
Subjt:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+E         
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------

Query:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMW
        I YLW V + EI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++W
Subjt:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMW

Query:  QAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV
        QAKHS+ +YR+   WK +KCP Q  SVEC Y+VQKYIREIV N+S  I N+FNTK AY++EEIDE+R++WADFVG  V
Subjt:  QAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]; e value: 1.1e-173; % identity: 51.91
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ + +NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ++W +FVNARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS
                  A K KNNEYFD+AT++C  RIDELA  +KG DILTEALGT EH G VRG+G  V+PS YFN+ +GKSK+ +   NK +T+ S PSKK + 
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS

Query:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV
              K KE+++  +EI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++  FV
Subjt:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+E         
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------

Query:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQ
        I YLW V + EI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +             SL++WQAKHS+ +YR+   WK +KCP Q
Subjt:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQ

Query:  PSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV
          SVEC Y+VQKYIREIV N+S  I N+FNTK AY++EEIDE+R++WADFVG  V
Subjt:  PSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]; e value: 4.1e-176; % identity: 50.5
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ V +NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFVVD 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
         SK+ ILQSA+ KFRTF+ TL Q++ILP+KDE S L++PP+KYSHID+K+WE+FV ARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-----
                  A K+KNNEY D AT++C  RIDELA   KG DILTEALGTPEHRG +RG+G  V+P+ ++N+ +GK K  +ES N+  T  S  K     
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-----

Query:  ---------------------KSASIGTN------HPKDKEVIDEVKEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG
                             K    G N       PK K V+ + +EILEG PCHLAIGS DN+VAVGTM+ SDAQ P+++ +PLG +NVR +VD+++G
Subjt:  ---------------------KSASIGTN------HPKDKEVIDEVKEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG

Query:  EDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLI
        ED  LPIP + ++++L Q++ NFVAWPR LVI  K KK  SP   KS        + S+KYTD HVTIKLLNRYAM SMQ DD IQ+ LSE + G+EK I
Subjt:  EDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------
        YL RDDI+ YCGM EI Y         LW  CD+EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNT                 
Subjt:  YLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------

Query:  -------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVG
                            SL+ WQAKHSL QYR+ I WK +KCPRQ  ++EC Y+VQKYIREIV NS+  I NLFNT+ AY+++EID +R++WA+FV 
Subjt:  -------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVG

Query:  RFV
        RFV
Subjt:  RFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]; e value: 3.1e-176; % identity: 50.57
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ V +NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFVVD 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
         SK+ ILQSA+ KFRTF+ TL Q++ILP+KDE S L++PP+KYSHID+K+WE+FV ARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-----
                  A K+KNNEY D AT++C  RIDELA   KG DILTEALGTPEHRG +RG+G  V+P+ ++N+ +GK K  +ES N+  T  S  K     
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-----

Query:  ---------------------KSASIGTN------HPKDKEVIDEVKEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG
                             K    G N       PK K V+ + +EILEG PCHLAIGS DN+VAVGTM+ SDAQ P+++ +PLG +NVR +VD+++G
Subjt:  ---------------------KSASIGTN------HPKDKEVIDEVKEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG

Query:  EDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLI
        ED  LPIP + ++++L Q++ NFVAWPR LVI  K KK  SP   KS        + S+KYTD HVTIKLLNRYAM SMQ DD IQ+ LSE + G+EK I
Subjt:  EDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLI

Query:  YLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------
        YL RDDI+ YCGM EI Y         LW  CD+EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNT                 
Subjt:  YLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------

Query:  ------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGR
                           SL+ WQAKHSL QYR+ I WK +KCPRQ  ++EC Y+VQKYIREIV NS+  I NLFNT+ AY+++EID +R++WA+FV R
Subjt:  ------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGR

Query:  FV
        FV
Subjt:  FV

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1; e value: 7.7e-173; % identity: 48.88
Query:  SSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRS
        SS D+ NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ V +NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFVVD  S
Subjt:  SSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRS

Query:  KNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI----------
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L++PP+KYSHID+K+WE+FV ARLSEEWEV                      Y  L           
Subjt:  KNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI----------

Query:  --------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-------
                A K+KNN  FD+AT++CV RIDELA   KG DILTEALGTPEHRG +RG+G  V+P+ + N+ RG  K S++S +K  T  S  +       
Subjt:  --------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-------

Query:  ----------------------------KSASIGTNHPKDKEVIDEVKEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG
                                    K    G   PK K V+ E +E LE            G PCHLAIGS DNVVAVG M+ SD Q PT+HG+PLG
Subjt:  ----------------------------KSASIGTNHPKDKEVIDEVKEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG

Query:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV
         EN+RV VD+ + ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ+
Subjt:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV

Query:  KLSEHMFGEEKLIYLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----
         LSEH+FG+EK IYL RDDI+ YCGM EI Y         LW VC++EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNT    
Subjt:  KLSEHMFGEEKLIYLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----

Query:  --------------------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREE
                                        +SL+ WQ +HS   YRS I WK +KCPR   S+EC Y+VQKY+RE+V N++  I NLFNT  AY +EE
Subjt:  --------------------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREE

Query:  IDEIRVQWADFVGRFV
        ID +RV+WA+FV RFV
Subjt:  IDEIRVQWADFVGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein; e value: 7.7e-173; % identity: 48.88
Query:  SSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRS
        SS D+ NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ V +NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFVVD  S
Subjt:  SSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRS

Query:  KNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI----------
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L++PP+KYSHID+K+WE+FV ARLSEEWEV                      Y  L           
Subjt:  KNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI----------

Query:  --------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-------
                A K+KNN  FD+AT++CV RIDELA   KG DILTEALGTPEHRG +RG+G  V+P+ + N+ RG  K S++S +K  T  S  +       
Subjt:  --------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSK-------

Query:  ----------------------------KSASIGTNHPKDKEVIDEVKEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG
                                    K    G   PK K V+ E +E LE            G PCHLAIGS DNVVAVG M+ SD Q PT+HG+PLG
Subjt:  ----------------------------KSASIGTNHPKDKEVIDEVKEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG

Query:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV
         EN+RV VD+ + ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K KK  S    +S       T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ+
Subjt:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQV

Query:  KLSEHMFGEEKLIYLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----
         LSEH+FG+EK IYL RDDI+ YCGM EI Y         LW VC++EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNT    
Subjt:  KLSEHMFGEEKLIYLHRDDILHYCGMVEIEY---------LWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----

Query:  --------------------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREE
                                        +SL+ WQ +HS   YRS I WK +KCPR   S+EC Y+VQKY+RE+V N++  I NLFNT  AY +EE
Subjt:  --------------------------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREE

Query:  IDEIRVQWADFVGRFV
        ID +RV+WA+FV RFV
Subjt:  IDEIRVQWADFVGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1; e value: 2.5e-171; % identity: 50.15
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ + +NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ++W +FVNARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS
                  A K KNNEYFD+AT++C  RIDELA  +KG DILTEALGT EH G VRG+G  V+PS YFN+ +GKSK+ +   NK +T+ S PSKK + 
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS

Query:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV
              K KE+++  +EI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++  FV
Subjt:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+E         
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------

Query:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMW
        I YLW V + EI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++W
Subjt:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMW

Query:  QAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV
        QAKHS+ +YR+   WK +KCP Q  SVEC Y+VQKYIREIV N+S  I N+FNTK AY++EEIDE+R++WADFVG  V
Subjt:  QAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3; e value: 5.3e-174; % identity: 51.91
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ + +NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ++W +FVNARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS
                  A K KNNEYFD+AT++C  RIDELA  +KG DILTEALGT EH G VRG+G  V+PS YFN+ +GKSK+ +   NK +T+ S PSKK + 
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS

Query:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV
              K KE+++  +EI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++  FV
Subjt:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+E         
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------

Query:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQ
        I YLW V + EI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +             SL++WQAKHS+ +YR+   WK +KCP Q
Subjt:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRSAITWKLLKCPRQ

Query:  PSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV
          SVEC Y+VQKYIREIV N+S  I N+FNTK AY++EEIDE+R++WADFVG  V
Subjt:  PSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2; e value: 4.7e-170; % identity: 50.15
Query:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP
        S SS D+ +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ + +NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L++PP+KY HIDQ++W +FVNARLSEEWE                       Y  L         
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEV---------------------KYIILI--------

Query:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS
                  A K KNNEYFD+AT++C  RIDELA  +KG DILTEALGT EH G VRG+G  V+PS YFN+ +GKSK+ +   NK +T+ S PSKK + 
Subjt:  ----------AFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCS-PSKKSAS

Query:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV
              K KE+++  +EI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q++  FV
Subjt:  IGTNHPKDKEVIDEVKEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------
        AWPR LVI ++ K ++S    ++        +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL R+DI+ YC M+E         
Subjt:  AWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHYCGMVE---------

Query:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMW
        I YLW V + EI  KFL+VD  TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++W
Subjt:  IEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMW

Query:  QAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV
        QAKHS+ +YR+   WK +KCP Q  SVEC Y+VQKYIREIV N+S  I N+FNTK AY++EEIDE+R++WADFVG  V
Subjt:  QAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREIVYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGTAGATCAAGTGACGATAAAGTGAATGTAGCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGGGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTAGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTTTGTAGAGATGTCATTTGTGGTAGACCCTCGGTCCAAGAATTGTATTCTTCAATCA
GCAGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAGCACATACTTCCATTTAAGGATGAGTCGTCCTTGTTGAAACATCCTCCACAAAAGTATTCCCATAT
TGATCAAAAAGAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGGTAAAATACATTATACTTATTGCTTTTAAAAAAAAAAATAATGAATACTTCGATG
AAGCCACCAAACAATGTGTTGGTCGAATTGATGAACTAGCTATGAAGAATAAAGGTAATGACATATTGACCGAAGCATTAGGCACGCCAGAACACAGAGGGCATGTTAGA
GGAATAGGTATGTCTGTCAATCCATCAACATACTTTAACATTCCTCGAGGGAAATCAAAATCAAGCAAAGAGTCTGGCAACAAAATGTCGACTGATTGCTCACCTTCCAA
AAAGTCTGCAAGCATAGGCACTAATCATCCAAAAGACAAGGAGGTTATTGACGAGGTGAAAGAAATTTTAGAGGGAACTCCATGCCATCTAGCAATAGGGTCAAAGGATA
ATGTGGTTGCTGTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTCGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGT
GAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGGAAAATTTTGTGGCATGGCCTCGGGACCTTGTCATTTTTAATAAGGGAAA
AAAGGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTTTGTACATGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCATGTGACTATTAAGCTTCTGAATCGTT
ATGCAATGTTATCGATGCAAGAAGATGATACGATTCAAGTAAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCGCGATGATATCCTGCATTAC
TGTGGGATGGTGGAGATAGAGTATCTTTGGACTGTATGTGACAATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGA
AACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTAGATCCTTGAGAATGTGGCAAGCCAAGC
ACTCACTTCCACAATATCGTTCTGCCATCACTTGGAAACTTTTAAAGTGCCCCCGTCAACCGAGTTCTGTAGAGTGCAGGTACCATGTACAGAAGTATATACGAGAAATC
GTATATAATTCTAGTATCCCTATAATGAACCTTTTTAACACAAAAACTGCATATAAACGAGAAGAAATCGACGAGATTCGAGTACAATGGGCAGATTTTGTTGGCAGATT
TGTGTAA
mRNA sequence
ATGAGTAGATCAAGTGACGATAAAGTGAATGTAGCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGGGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTAGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTTTGTAGAGATGTCATTTGTGGTAGACCCTCGGTCCAAGAATTGTATTCTTCAATCA
GCAGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAGCACATACTTCCATTTAAGGATGAGTCGTCCTTGTTGAAACATCCTCCACAAAAGTATTCCCATAT
TGATCAAAAAGAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGGTAAAATACATTATACTTATTGCTTTTAAAAAAAAAAATAATGAATACTTCGATG
AAGCCACCAAACAATGTGTTGGTCGAATTGATGAACTAGCTATGAAGAATAAAGGTAATGACATATTGACCGAAGCATTAGGCACGCCAGAACACAGAGGGCATGTTAGA
GGAATAGGTATGTCTGTCAATCCATCAACATACTTTAACATTCCTCGAGGGAAATCAAAATCAAGCAAAGAGTCTGGCAACAAAATGTCGACTGATTGCTCACCTTCCAA
AAAGTCTGCAAGCATAGGCACTAATCATCCAAAAGACAAGGAGGTTATTGACGAGGTGAAAGAAATTTTAGAGGGAACTCCATGCCATCTAGCAATAGGGTCAAAGGATA
ATGTGGTTGCTGTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTCGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGT
GAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGGAAAATTTTGTGGCATGGCCTCGGGACCTTGTCATTTTTAATAAGGGAAA
AAAGGTGGCTTCTCCAGCAAAACATAAGTCGGATGTTTTTGTACATGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCATGTGACTATTAAGCTTCTGAATCGTT
ATGCAATGTTATCGATGCAAGAAGATGATACGATTCAAGTAAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCGCGATGATATCCTGCATTAC
TGTGGGATGGTGGAGATAGAGTATCTTTGGACTGTATGTGACAATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGA
AACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCATATAATACTAGATCCTTGAGAATGTGGCAAGCCAAGC
ACTCACTTCCACAATATCGTTCTGCCATCACTTGGAAACTTTTAAAGTGCCCCCGTCAACCGAGTTCTGTAGAGTGCAGGTACCATGTACAGAAGTATATACGAGAAATC
GTATATAATTCTAGTATCCCTATAATGAACCTTTTTAACACAAAAACTGCATATAAACGAGAAGAAATCGACGAGATTCGAGTACAATGGGCAGATTTTGTTGGCAGATT
TGTGTAA
Protein sequence
MSRSSDDKVNVAIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSVSDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDFVEMSFVVDPRSKNCILQS
AANKFRTFRYTLYQKHILPFKDESSLLKHPPQKYSHIDQKEWEAFVNARLSEEWEVKYIILIAFKKKNNEYFDEATKQCVGRIDELAMKNKGNDILTEALGTPEHRGHVR
GIGMSVNPSTYFNIPRGKSKSSKESGNKMSTDCSPSKKSASIGTNHPKDKEVIDEVKEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG
EDAPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKVASPAKHKSDVFVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHRDDILHY
CGMVEIEYLWTVCDNEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTRSLRMWQAKHSLPQYRSAITWKLLKCPRQPSSVECRYHVQKYIREI
VYNSSIPIMNLFNTKTAYKREEIDEIRVQWADFVGRFV
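
The CDS and protein sequences listed above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies. The helper names `translate` and `matches_protein` are hypothetical and not part of CuGenDB. The two example inputs are the first 36 and the last 15 nucleotides of the CDS in this record.

```python
from itertools import product

# Standard genetic code, laid out in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into one-letter amino acids ('*' marks a stop codon)."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def matches_protein(cds: str, protein: str) -> bool:
    """Check that the translated CDS, minus any trailing stop, equals the protein."""
    return translate(cds).rstrip("*") == "".join(protein.split())


print(translate("ATGAGTAGATCAAGTGACGATAAAGTGAATGTAGCG"))  # MSRSSDDKVNVA
print(translate("GGCAGATTTGTGTAA"))                       # GRFV*
```

Passing the full CDS and protein text to `matches_protein` would run the same comparison over the whole record. Both functions strip whitespace, so the line-wrapped sequences can be pasted in as they appear.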