; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002595 (gene) of Snake gourd v1 genome

Gene IDTan0002595
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTransposase
Genome locationLG06:20020703..20023601
RNA-Seq ExpressionTan0002595
SyntenyTan0002595
Gene Ontology termsNA
InterPro domainsIPR004264 - Transposase, Tnp1/En/Spm-like
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.5e-18052.16Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ IG+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +IL FKDEP  L++PP+KY HIDQ +W +FVNARLS+EWE                               +L+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS

Query:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV
              K KE+++  EEI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E   +PIP+RGE+E+L+Q++  FV
Subjt:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K                        KLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL  +DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP
         +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVNL+  Q V IPY +G                                   SL++WQAKHS+ 
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP

Query:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
        +YR+   WK +KCP Q   VEC YYVQKYIREIV N+S  I ++FNTK AY+QEEIDE+++EWADFVG  V
Subjt:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]2.9e-17952.16Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ IG+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +IL FKDEP  L++PP+KY HIDQ +W +FVNARLS+EWE                               +L+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS

Query:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV
              K KE+++  EEI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E   +PIP+RGE+E+L+Q++  FV
Subjt:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K                        KLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL  +DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP
         +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVNL+  Q V IPY +G                                   SL++WQAKHS+ 
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP

Query:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
        +YR+   WK +KCP Q   VEC YYVQKYIREIV N+S  I ++FNTK AY+QEEIDE+++EWADFVG  V
Subjt:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]3.3e-18354.01Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ IG+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +IL FKDEP  L++PP+KY HIDQ +W +FVNARLS+EWE                               +L+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS

Query:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV
              K KE+++  EEI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E   +PIP+RGE+E+L+Q++  FV
Subjt:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K                        KLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL  +DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECR
         +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVNL+  Q V IPY +G            SL++WQAKHS+ +YR+   WK +KCP Q   VEC 
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECR

Query:  YYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
        YYVQKYIREIV N+S  I ++FNTK AY+QEEIDE+++EWADFVG  V
Subjt:  YYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]5.5e-17850.43Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ +G+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
         SK+ ILQSA+ KFRTF+ TL Q++IL +KDEPS L++PP+KYSHID+ +WE+FV ARLS+EWE                               EL+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK-------
        P  RATLW +ARK KNNEY D  T++CA RIDELA   KG+DILTEALGTPEHRGR+RG+G  V P+ ++N+ + K K  +ES N+     S+       
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK-------

Query:  ---------------------------KSASICSNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG
                                   ++       PK K V+ + EEILEG PCHLAIGS DN+VAVGTM+ SDAQ P+++ +PLG +NVR +VD+++G
Subjt:  ---------------------------KSASICSNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG

Query:  EDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDI
        ED  LPIP + ++++L Q++ NFVAWPR LVI  K K                        KLLNRYAM SMQ DD IQ+ LSE + G+EK IYL  DDI
Subjt:  EDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDI

Query:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------
        + YCGM EIGYSCI+AYIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+LD  Q V IPYNTG                       
Subjt:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------

Query:  -------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
                     SL+ WQAKHSL QYR+ I WK +KCPRQ   +EC YYVQKYIREIV NS+  I +LFNT+ AY+Q+EID +++EWA+FV RFV
Subjt:  -------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]4.2e-17850.5Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ +G+NA +MQS+IGVCVRQQIPL+YK+WK VPQELKD IFD ++MSFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
         SK+ ILQSA+ KFRTF+ TL Q++IL +KDEPS L++PP+KYSHID+ +WE+FV ARLS+EWE                               EL+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK-------
        P  RATLW +ARK KNNEY D  T++CA RIDELA   KG+DILTEALGTPEHRGR+RG+G  V P+ ++N+ + K K  +ES N+     S+       
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK-------

Query:  ---------------------------KSASICSNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG
                                   ++       PK K V+ + EEILEG PCHLAIGS DN+VAVGTM+ SDAQ P+++ +PLG +NVR +VD+++G
Subjt:  ---------------------------KSASICSNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG

Query:  EDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDI
        ED  LPIP + ++++L Q++ NFVAWPR LVI  K K                        KLLNRYAM SMQ DD IQ+ LSE + G+EK IYL  DDI
Subjt:  EDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDI

Query:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------
        + YCGM EIGYSCI+AYIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+LD  Q V IPYNTG                       
Subjt:  LHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------

Query:  ------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
                    SL+ WQAKHSL QYR+ I WK +KCPRQ   +EC YYVQKYIREIV NS+  I +LFNT+ AY+Q+EID +++EWA+FV RFV
Subjt:  ------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X12.0e-17348.8Show/hide
Query:  SSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS
        SS +E NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ +G+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFV+D  S
Subjt:  SSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS

Query:  KNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAEDPS
        K+ ILQSA+ KFR+F+ TL Q +IL +KDEPS L++PP+KYSHID+ +WE+FV ARLS+EWE                               EL+ DP 
Subjt:  KNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK---------
         RATLW +ARK KNN  FDD T++C  RIDELA   KG+DILTEALGTPEHRGR+RG+G  V P+ + N+ R   K S++S +K     S+         
Subjt:  TRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK---------

Query:  ----------------------------KSASICSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG
                                    K        PK K V+ E EE LE            G PCHLAIGS DNVVAVG M+ SD Q PT+HG+PLG
Subjt:  ----------------------------KSASICSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG

Query:  VENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMF
         EN+RV VD+ + ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K K                        KLLNRYAM +MQ +D IQ+ LSEH+F
Subjt:  VENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMF

Query:  GEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG----------
        G+EK IYL  DDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM NLD  Q V IPYNTG          
Subjt:  GEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG----------

Query:  --------------------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVE
                                  SL+ WQ +HS   YRS I WK +KCPR    +EC YYVQKY+RE+V N++  I +LFNT  AY QEEID ++VE
Subjt:  --------------------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVE

Query:  WADFVGRFV
        WA+FV RFV
Subjt:  WADFVGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein2.0e-17348.8Show/hide
Query:  SSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS
        SS +E NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ +G+NA +MQS+IGVCVRQQIP++Y +WKEVPQELKD IFD ++MSFV+D  S
Subjt:  SSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRS

Query:  KNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAEDPS
        K+ ILQSA+ KFR+F+ TL Q +IL +KDEPS L++PP+KYSHID+ +WE+FV ARLS+EWE                               EL+ DP 
Subjt:  KNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAEDPS

Query:  TRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK---------
         RATLW +ARK KNN  FDD T++C  RIDELA   KG+DILTEALGTPEHRGR+RG+G  V P+ + N+ R   K S++S +K     S+         
Subjt:  TRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSK---------

Query:  ----------------------------KSASICSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG
                                    K        PK K V+ E EE LE            G PCHLAIGS DNVVAVG M+ SD Q PT+HG+PLG
Subjt:  ----------------------------KSASICSNHPKDKEVIDEVEEILE------------GTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLG

Query:  VENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMF
         EN+RV VD+ + ED  LPIP++G++E+L+Q++ NFVAWPR LVI  K K                        KLLNRYAM +MQ +D IQ+ LSEH+F
Subjt:  VENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMF

Query:  GEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG----------
        G+EK IYL  DDI+ YCGM EIGYSCI+ YIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM NLD  Q V IPYNTG          
Subjt:  GEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG----------

Query:  --------------------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVE
                                  SL+ WQ +HS   YRS I WK +KCPR    +EC YYVQKY+RE+V N++  I +LFNT  AY QEEID ++VE
Subjt:  --------------------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVE

Query:  WADFVGRFV
        WA+FV RFV
Subjt:  WADFVGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X17.5e-18152.16Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ IG+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +IL FKDEP  L++PP+KY HIDQ +W +FVNARLS+EWE                               +L+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS

Query:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV
              K KE+++  EEI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E   +PIP+RGE+E+L+Q++  FV
Subjt:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K                        KLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL  +DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP
         +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVNL+  Q V IPY +G                                   SL++WQAKHS+ 
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP

Query:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
        +YR+   WK +KCP Q   VEC YYVQKYIREIV N+S  I ++FNTK AY+QEEIDE+++EWADFVG  V
Subjt:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X31.6e-18354.01Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ IG+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +IL FKDEP  L++PP+KY HIDQ +W +FVNARLS+EWE                               +L+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS

Query:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV
              K KE+++  EEI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E   +PIP+RGE+E+L+Q++  FV
Subjt:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K                        KLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL  +DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECR
         +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVNL+  Q V IPY +G            SL++WQAKHS+ +YR+   WK +KCP Q   VEC 
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG------------SLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECR

Query:  YYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
        YYVQKYIREIV N+S  I ++FNTK AY+QEEIDE+++EWADFVG  V
Subjt:  YYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X21.4e-17952.16Show/hide
Query:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP
        S SS +E +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ IG+NA +MQS+IGVCVRQ+IP++Y  WKEVPQELKDKIF+ VE SFV+D 
Subjt:  SGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDP

Query:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED
        RSK+ ILQSA+ KFRTF+ TL + +IL FKDEP  L++PP+KY HIDQ +W +FVNARLS+EWE                               +L+ D
Subjt:  RSKNAILQSAANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWE-------------------------------ELAED

Query:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS
        PS RA LW +ARKGKNNEYFDD T++CA RIDELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + KSK+ +   NK +    +PSKK + 
Subjt:  PSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGTPEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMS---CSPSKKSAS

Query:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV
              K KE+++  EEI       +EG PCHLA+ S DN+VAVGT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E   +PIP+RGE+E+L+Q++  FV
Subjt:  ICSNHPKDKEVIDEVEEI-------LEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDGPLPIPIRGEVESLSQSMENFV

Query:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV
        AWPR LVI ++ K                        KLLNRY MLSMQ +DT+++ LS+ +FG+EK IYL  +DI+ YC M+EIGYSCI+ YIAYLW V
Subjt:  AWPRDLVIFNKGK------------------------KLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTV

Query:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP
         +YEI  KFL+VD  TIS +VKSQE R  NLANRLEMVNL+  Q V IPY +G                                   SL++WQAKHS+ 
Subjt:  CDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTG-----------------------------------SLRMWQAKHSLP

Query:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV
        +YR+   WK +KCP Q   VEC YYVQKYIREIV N+S  I ++FNTK AY+QEEIDE+++EWADFVG  V
Subjt:  QYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKTAYKQEEIDEIQVEWADFVGRFV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGTGACGAAGAAGTGAATGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGGGAACGCTTGGTCATCCAATATAACAATCAAGGCCAAAGTATTGGTGATAATGCAAACCAAATGCAAAGTTACATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAGGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGATCGACCCTCGATCCAAAAATGCTATTCTTCAATCA
GCGGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAACACATACTTTCATTTAAGGATGAGCCGTCCTTGTTGAAGCATCCTCCACAAAAGTATTCACATAT
TGATCAAAACGAATGGGAAGCATTTGTGAATGCTAGATTATCGAAAGAATGGGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCACGAAAAG
GAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCGCTGGTCGAATCGACGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGCACG
CCAGAACACAGAGGGCGTGTTAGAGGATTGGGTATGTCTGTCAAACCATCAACATATTTTAACATTCCTCGAACGAAATCAAAATCAAGCAAAGAGTCTGGCAACAAAAT
GTCATGCTCACCTTCCAAAAAGTCTGCAAGCATATGCAGTAATCATCCAAAAGATAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAG
CAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTCGAAAATGTTAGAGTGGTA
GTGGACATGATCGTAGGTGAAGATGGTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGGAAAATTTTGTGGCATGGCCTCGGGACCTTGT
CATTTTTAATAAGGGGAAAAAGCTTCTGAATCGTTATGCAATGTTATCGATGCAAGAAGATGATACGATTCAAGTCAAGTTGAGTGAGCACATGTTCGGGGAGGAGAAGT
TAATTTATTTACATTGCGATGATATCCTGCATTACTGTGGGATGGTGGAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAA
ATAATCGCCAAGTTCTTGTTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGA
CTTGAATCAACAAGTTTTCATCCCATATAATACTGGATCCTTGAGAATGTGGCAAGCCAAGCACTCACTTCCACAATATCGTTCTGCCATCACTTGGAAACTTGTAAAGT
GCCCACGTCAACCGAGTTTTGTAGAGTGCAGGTACTATGTACAGAAGTATATACGAGAAATCGTATATAATTCTAGTATCCCTATAATGAGTCTTTTTAACACAAAAACT
GCATATAAACAAGAAGAAATCGACGAGATTCAAGTAGAATGGGCGGATTTTGTTGGCAGATTTGTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGTGACGAAGAAGTGAATGTATCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCGTGGTCTGGCACGCGTAAGGACTAC
AGGGGAACGCTTGGTCATCCAATATAACAATCAAGGCCAAAGTATTGGTGATAATGCAAACCAAATGCAAAGTTACATAGGAGTTTGCGTTAGGCAACAAATTCCATTAA
GTTACAAGACTTGGAAAGAGGTTCCCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGATGTCATTTGTGATCGACCCTCGATCCAAAAATGCTATTCTTCAATCA
GCGGCAAATAAATTTCGAACATTTCGATACACGTTGTATCAAAAACACATACTTTCATTTAAGGATGAGCCGTCCTTGTTGAAGCATCCTCCACAAAAGTATTCACATAT
TGATCAAAACGAATGGGAAGCATTTGTGAATGCTAGATTATCGAAAGAATGGGAGGAATTGGCGGAAGATCCTTCCACTCGTGCCACCTTATGGATACAGGCACGAAAAG
GAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCGCTGGTCGAATCGACGAACTAGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCATTAGGCACG
CCAGAACACAGAGGGCGTGTTAGAGGATTGGGTATGTCTGTCAAACCATCAACATATTTTAACATTCCTCGAACGAAATCAAAATCAAGCAAAGAGTCTGGCAACAAAAT
GTCATGCTCACCTTCCAAAAAGTCTGCAAGCATATGCAGTAATCATCCAAAAGATAAGGAGGTCATTGACGAGGTGGAAGAAATTTTAGAGGGAACTCCATGCCATCTAG
CAATAGGATCAAAGGATAATGTGGTTGCTGTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTCGAAAATGTTAGAGTGGTA
GTGGACATGATCGTAGGTGAAGATGGTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCTCTGAGTCAATCAATGGAAAATTTTGTGGCATGGCCTCGGGACCTTGT
CATTTTTAATAAGGGGAAAAAGCTTCTGAATCGTTATGCAATGTTATCGATGCAAGAAGATGATACGATTCAAGTCAAGTTGAGTGAGCACATGTTCGGGGAGGAGAAGT
TAATTTATTTACATTGCGATGATATCCTGCATTACTGTGGGATGGTGGAGATAGGGTACTCCTGCATAGTCGCATACATTGCGTATCTTTGGACTGTATGTGACTATGAA
ATAATCGCCAAGTTCTTGTTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTATAAATCTGGCTAACAGGTTAGAAATGGTTAATTTGGA
CTTGAATCAACAAGTTTTCATCCCATATAATACTGGATCCTTGAGAATGTGGCAAGCCAAGCACTCACTTCCACAATATCGTTCTGCCATCACTTGGAAACTTGTAAAGT
GCCCACGTCAACCGAGTTTTGTAGAGTGCAGGTACTATGTACAGAAGTATATACGAGAAATCGTATATAATTCTAGTATCCCTATAATGAGTCTTTTTAACACAAAAACT
GCATATAAACAAGAAGAAATCGACGAGATTCAAGTAGAATGGGCGGATTTTGTTGGCAGATTTGTGTAA
Protein sequenceShow/hide protein sequence
MSGSSDEEVNVSIQMEARHTNRRGLTTMRGLARVRTTGERLVIQYNNQGQSIGDNANQMQSYIGVCVRQQIPLSYKTWKEVPQELKDKIFDSVEMSFVIDPRSKNAILQS
AANKFRTFRYTLYQKHILSFKDEPSLLKHPPQKYSHIDQNEWEAFVNARLSKEWEELAEDPSTRATLWIQARKGKNNEYFDDETKQCAGRIDELAMKNKGKDILTEALGT
PEHRGRVRGLGMSVKPSTYFNIPRTKSKSSKESGNKMSCSPSKKSASICSNHPKDKEVIDEVEEILEGTPCHLAIGSKDNVVAVGTMYTSDAQFPTVHGVPLGVENVRVV
VDMIVGEDGPLPIPIRGEVESLSQSMENFVAWPRDLVIFNKGKKLLNRYAMLSMQEDDTIQVKLSEHMFGEEKLIYLHCDDILHYCGMVEIGYSCIVAYIAYLWTVCDYE
IIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLNQQVFIPYNTGSLRMWQAKHSLPQYRSAITWKLVKCPRQPSFVECRYYVQKYIREIVYNSSIPIMSLFNTKT
AYKQEEIDEIQVEWADFVGRFV