; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0006821 (gene) of Snake gourd v1 genome

Gene IDTan0006821
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG02:44312843..44315707
RNA-Seq ExpressionTan0006821
SyntenyTan0006821
Gene Ontology termsNA
InterPro domainsIPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]4.3e-17551.67Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE +V I  EV+   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L+ PP+KY HIDQ++W +FVNARLSEEWE                               +L  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS
        PS RA LW +ARKGKNNEYFD+AT++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V+PS YFN+ +GKSK+ +   NK +T  S PSK  + 
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS

Query:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV
              K KE+++             EG PCHLA+ S DN+V VGT++ ++ Q PTVHGVPL V+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FV
Subjt:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV

Query:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K    S   +T    +K+TD HV+I LLNRY MLSMQ +DT+++ LS+ +FG+EK IYL ++DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY
         +YEI  KFL+VD  TIS +VKSQE R  NLANR                                                    SL++WQAKHS+ +Y
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY

Query:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
        R+   WK +K P Q GSVECGYYVQKYIREIV N+S  I N FNTK AY+QEEIDE+R++
Subjt:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]8.1e-17451.67Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE +V I  EV+   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L+ PP+KY HIDQ++W +FVNARLSEEWE                               +L  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS
        PS RA LW +ARKGKNNEYFD+AT++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V+PS YFN+ +GKSK+ +   NK +T  S PSK  + 
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS

Query:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV
              K KE+++             EG PCHLA+ S DN+V VGT++ ++ Q PTVHGVPL V+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FV
Subjt:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV

Query:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K    S   +T    +K+TD HV+I LLNRY MLSMQ +DT+++ LS+ +FG+EK IYL ++DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY
         +YEI  KFL+VD  TIS +VKSQE R  NLANR                                                    SL++WQAKHS+ +Y
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY

Query:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
        R+   WK +K P Q GSVECGYYVQKYIREIV N+S  I N FNTK AY+QEEIDE+R++
Subjt:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]9.2e-17853.53Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE +V I  EV+   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L+ PP+KY HIDQ++W +FVNARLSEEWE                               +L  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS
        PS RA LW +ARKGKNNEYFD+AT++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V+PS YFN+ +GKSK+ +   NK +T  S PSK  + 
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS

Query:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV
              K KE+++             EG PCHLA+ S DN+V VGT++ ++ Q PTVHGVPL V+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FV
Subjt:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV

Query:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K    S   +T    +K+TD HV+I LLNRY MLSMQ +DT+++ LS+ +FG+EK IYL ++DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYY
         +YEI  KFL+VD  TIS +VKSQE R  NLANR                             SL++WQAKHS+ +YR+   WK +K P Q GSVECGYY
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYY

Query:  VQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
        VQKYIREIV N+S  I N FNTK AY+QEEIDE+R++
Subjt:  VQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]1.4e-17350.51Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE NV I+ EV+ T RRG T M  L  +R  GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFVVD 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
         SK+ ILQSA+ KFRTF+ TL Q++ILP+KDE S L+ PP+KYSHID+K+WE+FV ARLSEEWE                               EL  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCSPSK----M
        P  RATLW +ARK KNNEY D AT++CA RIDELA   KG+DILTEALGTPEHRGR+RG+G  V+P+ ++N+ +GK K  +ES N+  T  S  K     
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCSPSK----M

Query:  SASIDSNHP----------------------------------KDKEVIDEGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVG
        S   D                                      KD E I EG PCHLAIGS DN+V VGTM+ SDAQ P+++ +PL  +NVR +VD+++G
Subjt:  SASIDSNHP----------------------------------KDKEVIDEGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVG

Query:  EDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKSDVSVHVRTL-HSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDI
        ED  LPIP + ++++L Q++GNFVAWPR LVI  K KK+      +++  S+KYTD HVTI LLNRYAM SMQ DD IQ+ LSE + G+EK IYL +DDI
Subjt:  EDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKSDVSVHVRTL-HSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDI

Query:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR------------------------------------------
        + YCGM EIGYSCI+AYIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NR                                          
Subjt:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR------------------------------------------

Query:  -----------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
                   SL+ WQAKHSL QYR+ I WK +K PRQ G++ECGYYVQKYIREIV NS+  I N FNT+ AY+Q+EID +R++
Subjt:  -----------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]1.1e-17350.58Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE NV I+ EV+ T RRG T M  L  +R  GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFVVD 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
         SK+ ILQSA+ KFRTF+ TL Q++ILP+KDE S L+ PP+KYSHID+K+WE+FV ARLSEEWE                               EL  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCSPSK----M
        P  RATLW +ARK KNNEY D AT++CA RIDELA   KG+DILTEALGTPEHRGR+RG+G  V+P+ ++N+ +GK K  +ES N+  T  S  K     
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCSPSK----M

Query:  SASIDSNHP----------------------------------KDKEVIDEGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVG
        S   D                                      KD E I EG PCHLAIGS DN+V VGTM+ SDAQ P+++ +PL  +NVR +VD+++G
Subjt:  SASIDSNHP----------------------------------KDKEVIDEGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVG

Query:  EDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKSDVSVHVRTL-HSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDI
        ED  LPIP + ++++L Q++GNFVAWPR LVI  K KK+      +++  S+KYTD HVTI LLNRYAM SMQ DD IQ+ LSE + G+EK IYL +DDI
Subjt:  EDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKSDVSVHVRTL-HSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDI

Query:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR------------------------------------------
        + YCGM EIGYSCI+AYIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NR                                          
Subjt:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR------------------------------------------

Query:  ----------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
                  SL+ WQAKHSL QYR+ I WK +K PRQ G++ECGYYVQKYIREIV NS+  I N FNT+ AY+Q+EID +R++
Subjt:  ----------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X14.5e-17048.85Show/hide
Query:  SSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDPRS
        SS DE NV I+ EV+ T RRG T M  L  +R  GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFVVD  S
Subjt:  SSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDPRS

Query:  KNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDEPS
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L+ PP+KYSHID+K+WE+FV ARLSEEWE                               EL  +P 
Subjt:  KNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDEPS

Query:  TRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS----------
         RATLW +ARK KNN  FD+AT++C  RIDELA   KG+DILTEALGTPEHRGR+RG+G  V+P+ + N+ RG  K S++S +K  T  S          
Subjt:  TRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS----------

Query:  -PSKMSASIDSNH------------------------PKDKEVIDE------------------GTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLR
          ++   S D N                         PK K V+ E                  G PCHLAIGS DNVV VG M+ SD Q PT+HG+PL 
Subjt:  -PSKMSASIDSNH------------------------PKDKEVIDE------------------GTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLR

Query:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKS-DVSVHVRTLHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMF
         EN+RV VD+ + ED  LPIP++G++E+L+Q++GNFVAWPR LVI  K KK+  ++    T  S+KYTD HVTI LLNRYAM +MQ +D IQ+ LSEH+F
Subjt:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKS-DVSVHVRTLHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMF

Query:  GEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------
        G+EK IYL +DDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NR                             
Subjt:  GEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------

Query:  ------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
                                SL+ WQ +HS   YRS I WK +K PR  GS+ECGYYVQKY+RE+V N++  I N FNT  AY QEEID +RV+
Subjt:  ------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

A0A5D3CYL9 ULP_PROTEASE domain-containing protein4.5e-17048.85Show/hide
Query:  SSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDPRS
        SS DE NV I+ EV+ T RRG T M  L  +R  GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFVVD  S
Subjt:  SSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDPRS

Query:  KNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDEPS
        K+ ILQSA+ KFR+F+ TL Q +ILP+KDE S L+ PP+KYSHID+K+WE+FV ARLSEEWE                               EL  +P 
Subjt:  KNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDEPS

Query:  TRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS----------
         RATLW +ARK KNN  FD+AT++C  RIDELA   KG+DILTEALGTPEHRGR+RG+G  V+P+ + N+ RG  K S++S +K  T  S          
Subjt:  TRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS----------

Query:  -PSKMSASIDSNH------------------------PKDKEVIDE------------------GTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLR
          ++   S D N                         PK K V+ E                  G PCHLAIGS DNVV VG M+ SD Q PT+HG+PL 
Subjt:  -PSKMSASIDSNH------------------------PKDKEVIDE------------------GTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLR

Query:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKS-DVSVHVRTLHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMF
         EN+RV VD+ + ED  LPIP++G++E+L+Q++GNFVAWPR LVI  K KK+  ++    T  S+KYTD HVTI LLNRYAM +MQ +D IQ+ LSEH+F
Subjt:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKS-DVSVHVRTLHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMF

Query:  GEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------
        G+EK IYL +DDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NR                             
Subjt:  GEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------

Query:  ------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
                                SL+ WQ +HS   YRS I WK +K PR  GS+ECGYYVQKY+RE+V N++  I N FNT  AY QEEID +RV+
Subjt:  ------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X12.1e-17551.67Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE +V I  EV+   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L+ PP+KY HIDQ++W +FVNARLSEEWE                               +L  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS
        PS RA LW +ARKGKNNEYFD+AT++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V+PS YFN+ +GKSK+ +   NK +T  S PSK  + 
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS

Query:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV
              K KE+++             EG PCHLA+ S DN+V VGT++ ++ Q PTVHGVPL V+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FV
Subjt:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV

Query:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K    S   +T    +K+TD HV+I LLNRY MLSMQ +DT+++ LS+ +FG+EK IYL ++DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY
         +YEI  KFL+VD  TIS +VKSQE R  NLANR                                                    SL++WQAKHS+ +Y
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY

Query:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
        R+   WK +K P Q GSVECGYYVQKYIREIV N+S  I N FNTK AY+QEEIDE+R++
Subjt:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

A0A6J1C398 uncharacterized protein LOC111007859 isoform X34.5e-17853.53Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE +V I  EV+   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L+ PP+KY HIDQ++W +FVNARLSEEWE                               +L  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS
        PS RA LW +ARKGKNNEYFD+AT++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V+PS YFN+ +GKSK+ +   NK +T  S PSK  + 
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS

Query:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV
              K KE+++             EG PCHLA+ S DN+V VGT++ ++ Q PTVHGVPL V+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FV
Subjt:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV

Query:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K    S   +T    +K+TD HV+I LLNRY MLSMQ +DT+++ LS+ +FG+EK IYL ++DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYY
         +YEI  KFL+VD  TIS +VKSQE R  NLANR                             SL++WQAKHS+ +YR+   WK +K P Q GSVECGYY
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR-----------------------------SLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYY

Query:  VQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
        VQKYIREIV N+S  I N FNTK AY+QEEIDE+R++
Subjt:  VQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X23.9e-17451.67Show/hide
Query:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP
        S SS DE +V I  EV+   RRG TTM  L  +R  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDP

Query:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE
        RSK+ ILQSA+ KFRTF+ TL + +ILPFKDE   L+ PP+KY HIDQ++W +FVNARLSEEWE                               +L  +
Subjt:  RSKNCILQSAANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWE-------------------------------ELGDE

Query:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS
        PS RA LW +ARKGKNNEYFD+AT++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V+PS YFN+ +GKSK+ +   NK +T  S PSK  + 
Subjt:  PSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCS-PSKMSAS

Query:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV
              K KE+++             EG PCHLA+ S DN+V VGT++ ++ Q PTVHGVPL V+NVRV+VD+++ E A +PIP+RGE+E+L+Q++G FV
Subjt:  IDSNHPKDKEVID-------------EGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSMGNFV

Query:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K    S   +T    +K+TD HV+I LLNRY MLSMQ +DT+++ LS+ +FG+EK IYL ++DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGKKSDVSVHVRT-LHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY
         +YEI  KFL+VD  TIS +VKSQE R  NLANR                                                    SL++WQAKHS+ +Y
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANR----------------------------------------------------SLRMWQAKHSLPQY

Query:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ
        R+   WK +K P Q GSVECGYYVQKYIREIV N+S  I N FNTK AY+QEEIDE+R++
Subjt:  RSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKTAYKQEEIDEIRVQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGCGACGATGAAGTGAATGTATCGATCAAAATAGAGGTTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAA
AGGGGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAGGAATTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGGTAGACCCTCGGTCCAAGAATTGTATTCTTCAATCA
GCGGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAGCACATACTTCCATTTAAGGATGAGTTGTCCTTGTTGAAACAACCTCCACAAAAGTATTCCCATAT
TGATCAAAAAGAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGGAATTGGGGGATGAGCCTTCCACTCGTGCGACCTTATGGATACAAGCACGAAAAG
GAAAAAATAATGAATACTTCGATGAAGCCACCAAACAATGTGCTGGTCGAATCGATGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGCACG
CCAGAACACAGAGGGCGTGTTAGAGGAATAGGTATGTCTGTCAATCCATCAACATACTTTAACATTCCTCGAGGGAAATCAAAATCAAGCAAAGAGTCTGGCAACAAAAT
GTCGACTAATTGCTCACCTTCCAAAATGTCTGCAAGCATAGACAGTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGGAACTCCATGCCATCTAGCAATAGGATCAA
AGGATAATGTGGTTGTTGTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAAGAGTCGAAAATGTTAGAGTGGTAGTGGACATGATC
GTAGGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGGGAAATTTTGTGGCATGGCCTCGGGACCTTGTCATTTTTAATAA
GGGGAAAAAGTCGGATGTTTCTGTACATGTGCGCACTTTACATTCTACAAAGTACACAGATGCTCATGTGACTATTATGCTTCTGAATCGTTATGCAATGTTATCGATGC
AAGAAGATGATACGATTCAAGTCAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCAAGATGATATCCTGCATTACTGTGGGATGGTGGAGATA
GGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAA
AAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGATCCTTGAGAATGTGGCAAGCCAAGCACTCACTTCCACAATATCGTTCTGCCATCACTTGGAAACTTGTAAAGA
GCCCTCGTCAACCGGGTTCTGTAGAGTGCGGGTACTATGTACAAAAATATATACGAGAAATCGTATATAATTCTAGTATTCCTATAATGAACCATTTTAACACAAAAACT
GCATATAAACAAGAAGAAATAGACGAGATTCGAGTACAATAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGCGACGATGAAGTGAATGTATCGATCAAAATAGAGGTTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAA
AGGGGAACGCTTGGTCATCCAATACAACAATCAAGGCCAAAGTGTTGGTGATAATGCAAACCAAATGCAAAGTTATATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAAGTTCCCCAGGAATTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGGTAGACCCTCGGTCCAAGAATTGTATTCTTCAATCA
GCGGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAGCACATACTTCCATTTAAGGATGAGTTGTCCTTGTTGAAACAACCTCCACAAAAGTATTCCCATAT
TGATCAAAAAGAATGGGAAGCATTTGTGAATGCTAGATTATCGGAAGAATGGGAGGAATTGGGGGATGAGCCTTCCACTCGTGCGACCTTATGGATACAAGCACGAAAAG
GAAAAAATAATGAATACTTCGATGAAGCCACCAAACAATGTGCTGGTCGAATCGATGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGCACG
CCAGAACACAGAGGGCGTGTTAGAGGAATAGGTATGTCTGTCAATCCATCAACATACTTTAACATTCCTCGAGGGAAATCAAAATCAAGCAAAGAGTCTGGCAACAAAAT
GTCGACTAATTGCTCACCTTCCAAAATGTCTGCAAGCATAGACAGTAATCATCCAAAAGACAAGGAGGTCATTGACGAGGGAACTCCATGCCATCTAGCAATAGGATCAA
AGGATAATGTGGTTGTTGTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAAGAGTCGAAAATGTTAGAGTGGTAGTGGACATGATC
GTAGGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGGGAAATTTTGTGGCATGGCCTCGGGACCTTGTCATTTTTAATAA
GGGGAAAAAGTCGGATGTTTCTGTACATGTGCGCACTTTACATTCTACAAAGTACACAGATGCTCATGTGACTATTATGCTTCTGAATCGTTATGCAATGTTATCGATGC
AAGAAGATGATACGATTCAAGTCAAGTTGAGCGAGCACATGTTCGGGGAGGAGAAGTTAATTTATTTACATCAAGATGATATCCTGCATTACTGTGGGATGGTGGAGATA
GGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAA
AAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGATCCTTGAGAATGTGGCAAGCCAAGCACTCACTTCCACAATATCGTTCTGCCATCACTTGGAAACTTGTAAAGA
GCCCTCGTCAACCGGGTTCTGTAGAGTGCGGGTACTATGTACAAAAATATATACGAGAAATCGTATATAATTCTAGTATTCCTATAATGAACCATTTTAACACAAAAACT
GCATATAAACAAGAAGAAATAGACGAGATTCGAGTACAATAG
Protein sequenceShow/hide protein sequence
MSGSSDDEVNVSIKIEVRHTNRRGLTTMRGLARVRTKGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVVDPRSKNCILQS
AANKFRTFRYTLYQKHILPFKDELSLLKQPPQKYSHIDQKEWEAFVNARLSEEWEELGDEPSTRATLWIQARKGKNNEYFDEATKQCAGRIDELAMKNKGKDILTEALGT
PEHRGRVRGIGMSVNPSTYFNIPRGKSKSSKESGNKMSTNCSPSKMSASIDSNHPKDKEVIDEGTPCHLAIGSKDNVVVVGTMYTSDAQFPTVHGVPLRVENVRVVVDMI
VGEDAPLPIPIRGEVESLSQSMGNFVAWPRDLVIFNKGKKSDVSVHVRTLHSTKYTDAHVTIMLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHQDDILHYCGMVEI
GYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRSLRMWQAKHSLPQYRSAITWKLVKSPRQPGSVECGYYVQKYIREIVYNSSIPIMNHFNTKT
AYKQEEIDEIRVQ