CuGenDBv2

Tan0010025 (gene) of Snake gourd v1 genome

Gene ID: Tan0010025
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Transposase
Genome location: LG02:58617875..58620733
RNA-Seq Expression: Tan0010025
Synteny: Tan0010025
Gene Ontology terms: NA
InterPro domains: IPR038765 - Papain-like cysteine peptidase superfamily
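
The genome location above follows a seqid:start..end pattern. A minimal Python sketch for splitting it into its parts is given here. It assumes 1-based, inclusive coordinates, which this page does not state explicitly, and the helper names parse_location and span_length are hypothetical.

```python
import re

# Pattern for strings such as "LG02:58617875..58620733".
LOCATION_RE = re.compile(r"([^:]+):(\d+)\.\.(\d+)")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG02:58617875..58620733")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive assumption the gene spans 2859 bp on LG02. If the coordinates were 0-based or half-open, the length would differ by one.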


Homology
GenBank top hits    e value    % identity    Alignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]    1.2e-183    53.5
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E +   RRG TTMH L CIR  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY  WKEVPQELKDKIF  +E SFV+D RSK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEE E                               +L+ D S RA LW +ARKG
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID
        KNNEYFDD T++CA RI +ELA  +KG+D+LTEALGT EH G VRG+G  V PS YFN+ + K K+ +   NK +    +PSKK +       K KE ++
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID

Query:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK
          EEI       +EG PCHLA+ S DN+VA+G ++ ++ Q  TVHGVPLGV+NVRV+VD+++ E+A +PIP+RGE+E+L+Q+I  FVAWPR LVI ++ K
Subjt:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK

Query:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD
            S    T +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LSK +F +EK IYL R+DI+ YC M+EIGYSCI+TYIAYLW V +YEI  KFL+VD
Subjt:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD

Query:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC
          TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++WQAKHS+ +YR    WK +KC
Subjt:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC

Query:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        P Q  SVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++W DFVG  V
Subjt:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]    2.2e-182    53.5
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E +   RRG TTMH L CIR  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY  WKEVPQELKDKIF  +E SFV+D RSK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEE E                               +L+ D S RA LW +ARKG
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID
        KNNEYFDD T++CA RI +ELA  +KG+D+LTEALGT EH G VRG+G  V PS YFN+ + K K+ +   NK +    +PSKK +       K KE ++
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID

Query:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK
          EEI       +EG PCHLA+ S DN+VA+G ++ ++ Q  TVHGVPLGV+NVRV+VD+++ E+A +PIP+RGE+E+L+Q+I  FVAWPR LVI ++ K
Subjt:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK

Query:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD
            S    T +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LSK +F +EK IYL R+DI+ YC M+EIGYSCI+TYIAYLW V +YEI  KFL+VD
Subjt:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD

Query:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC
          TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++WQAKHS+ +YR    WK +KC
Subjt:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC

Query:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        P Q  SVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++W DFVG  V
Subjt:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]    2.5e-186    55.43
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E +   RRG TTMH L CIR  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY  WKEVPQELKDKIF  +E SFV+D RSK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEE E                               +L+ D S RA LW +ARKG
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID
        KNNEYFDD T++CA RI +ELA  +KG+D+LTEALGT EH G VRG+G  V PS YFN+ + K K+ +   NK +    +PSKK +       K KE ++
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID

Query:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK
          EEI       +EG PCHLA+ S DN+VA+G ++ ++ Q  TVHGVPLGV+NVRV+VD+++ E+A +PIP+RGE+E+L+Q+I  FVAWPR LVI ++ K
Subjt:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK

Query:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD
            S    T +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LSK +F +EK IYL R+DI+ YC M+EIGYSCI+TYIAYLW V +YEI  KFL+VD
Subjt:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD

Query:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYN
          TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +             SL++WQAKHS+ +YR    WK +KCP Q  SVECGYYVQKYIREIV N
Subjt:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYN

Query:  SSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        +S  I N+FNTK AY+QEEIDE+R++W DFVG  V
Subjt:  SSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]    2.1e-177    51.54
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E + T RRG T M  L  IR +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPLTYK+WK VPQELKD IF  I+MSFV+D  SK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL Q++ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEE E                               EL+ D   RATLW +ARK 
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPS---------------------
        KNNEY D  T++CA RI +ELA   KG+D+LTEALGTPEHRG +RG+G  V P+ ++N+ + K K  +ES N+     S                     
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPS---------------------

Query:  -------KKSASIGSN------HPKDKEGIDDVEEILEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEV
               +K    G N       PK K  + D EEILEG PCHLAIGS DN+VA+G M+ SDAQ  +++ +PLG +NVR +VD+++GE   LPIP + ++
Subjt:  -------KKSASIGSN------HPKDKEGIDDVEEILEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEV

Query:  ESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSC
        ++L Q+I NFVAWPR LVI  K KK P  +     + S+KYTD HVTIKLLNRYAM SMQ DD IQ+ LS+ +  +EK IYL RDDI+ YCGM EIGYSC
Subjt:  ESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSC

Query:  IVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------------------------------R
        I+ YIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNT                                     
Subjt:  IVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------------------------------R

Query:  SLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        SL+ WQAKHSL QYR  I WK +KCP Q  ++ECGYYVQKYIREIV NS+  I NLFNT+ AY+Q+EID +R++W +FV RFV
Subjt:  SLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]    1.6e-177    51.61
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E + T RRG T M  L  IR +GER  I+YN+ GQ VG+NA +MQS+IGVCVRQQIPLTYK+WK VPQELKD IF  I+MSFV+D  SK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL Q++ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEE E                               EL+ D   RATLW +ARK 
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPS---------------------
        KNNEY D  T++CA RI +ELA   KG+D+LTEALGTPEHRG +RG+G  V P+ ++N+ + K K  +ES N+     S                     
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPS---------------------

Query:  -------KKSASIGSN------HPKDKEGIDDVEEILEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEV
               +K    G N       PK K  + D EEILEG PCHLAIGS DN+VA+G M+ SDAQ  +++ +PLG +NVR +VD+++GE   LPIP + ++
Subjt:  -------KKSASIGSN------HPKDKEGIDDVEEILEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEV

Query:  ESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSC
        ++L Q+I NFVAWPR LVI  K KK P  +     + S+KYTD HVTIKLLNRYAM SMQ DD IQ+ LS+ +  +EK IYL RDDI+ YCGM EIGYSC
Subjt:  ESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSC

Query:  IVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RS
        I+ YIA LW  CD EI  KF++VDQ TIS+ VK QE R  NL NRLEMV+  LDQ V IPYNT                                    S
Subjt:  IVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RS

Query:  LRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        L+ WQAKHSL QYR  I WK +KCP Q  ++ECGYYVQKYIREIV NS+  I NLFNT+ AY+Q+EID +R++W +FV RFV
Subjt:  LRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

TrEMBL top hits    e value    % identity    Alignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X1    7.4e-176    49.86
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E + T RRG T M  L  IR +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WKEVPQELKD IF  I+MSFV+D  SK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        R+F+ TL Q +ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEE E                               EL+ D   RATLW +ARK 
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPSK--------------------
        KNN  FDD T++C  RI +ELA   KG+D+LTEALGTPEHRG +RG+G  V P+ + N+ R   K S++S +K     S+                    
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPSK--------------------

Query:  -----------------KSASIGSNHPKDKEGIDDVEEILE------------GTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMI
                         K    G   PK K  + + EE LE            G PCHLAIGS DNVVA+G M+ SD Q  T+HG+PLG EN+RV VD+ 
Subjt:  -----------------KSASIGSNHPKDKEGIDDVEEILE------------GTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMI

Query:  VGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRD
        + E   LPIP++G++E+L+Q+I NFVAWPR LVI  K KK P ++    T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ++LS+H+F +EK IYL RD
Subjt:  VGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRD

Query:  DILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----------------------
        DI+ YCGM EIGYSCI+TYIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNT                      
Subjt:  DILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----------------------

Query:  --------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
                      +SL+ WQ +HS   YR  I WK +KCP    S+ECGYYVQKY+RE+V N++  I NLFNT  AY QEEID +RV+W +FV RFV
Subjt:  --------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

A0A5D3CYL9 ULP_PROTEASE domain-containing protein    7.4e-176    49.86
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E + T RRG T M  L  IR +GER  I+YN++GQ VG+NA +MQS+IGVCVRQQIP+TY +WKEVPQELKD IF  I+MSFV+D  SK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        R+F+ TL Q +ILP+KDE S L++PP+KYSHID+ QWE+FV ARLSEE E                               EL+ D   RATLW +ARK 
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPSK--------------------
        KNN  FDD T++C  RI +ELA   KG+D+LTEALGTPEHRG +RG+G  V P+ + N+ R   K S++S +K     S+                    
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMSCSPSK--------------------

Query:  -----------------KSASIGSNHPKDKEGIDDVEEILE------------GTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMI
                         K    G   PK K  + + EE LE            G PCHLAIGS DNVVA+G M+ SD Q  T+HG+PLG EN+RV VD+ 
Subjt:  -----------------KSASIGSNHPKDKEGIDDVEEILE------------GTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMI

Query:  VGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRD
        + E   LPIP++G++E+L+Q+I NFVAWPR LVI  K KK P ++    T+ S+KYTD HVTIKLLNRYAM +MQ +D IQ++LS+H+F +EK IYL RD
Subjt:  VGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGKK-PDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRD

Query:  DILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----------------------
        DI+ YCGM EIGYSCI+TYIA LW VC+ EI  +F+LVDQ TIS+ +KSQE R  NL NRLEM N  LDQ V IPYNT                      
Subjt:  DILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT----------------------

Query:  --------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
                      +SL+ WQ +HS   YR  I WK +KCP    S+ECGYYVQKY+RE+V N++  I NLFNT  AY QEEID +RV+W +FV RFV
Subjt:  --------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X1    5.6e-184    53.5
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E +   RRG TTMH L CIR  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY  WKEVPQELKDKIF  +E SFV+D RSK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEE E                               +L+ D S RA LW +ARKG
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID
        KNNEYFDD T++CA RI +ELA  +KG+D+LTEALGT EH G VRG+G  V PS YFN+ + K K+ +   NK +    +PSKK +       K KE ++
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID

Query:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK
          EEI       +EG PCHLA+ S DN+VA+G ++ ++ Q  TVHGVPLGV+NVRV+VD+++ E+A +PIP+RGE+E+L+Q+I  FVAWPR LVI ++ K
Subjt:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK

Query:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD
            S    T +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LSK +F +EK IYL R+DI+ YC M+EIGYSCI+TYIAYLW V +YEI  KFL+VD
Subjt:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD

Query:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC
          TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++WQAKHS+ +YR    WK +KC
Subjt:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC

Query:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        P Q  SVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++W DFVG  V
Subjt:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

A0A6J1C398 uncharacterized protein LOC111007859 isoform X3    1.2e-186    55.43
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E +   RRG TTMH L CIR  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY  WKEVPQELKDKIF  +E SFV+D RSK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEE E                               +L+ D S RA LW +ARKG
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID
        KNNEYFDD T++CA RI +ELA  +KG+D+LTEALGT EH G VRG+G  V PS YFN+ + K K+ +   NK +    +PSKK +       K KE ++
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID

Query:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK
          EEI       +EG PCHLA+ S DN+VA+G ++ ++ Q  TVHGVPLGV+NVRV+VD+++ E+A +PIP+RGE+E+L+Q+I  FVAWPR LVI ++ K
Subjt:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK

Query:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD
            S    T +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LSK +F +EK IYL R+DI+ YC M+EIGYSCI+TYIAYLW V +YEI  KFL+VD
Subjt:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD

Query:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYN
          TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +             SL++WQAKHS+ +YR    WK +KCP Q  SVECGYYVQKYIREIV N
Subjt:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT------------RSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYN

Query:  SSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        +S  I N+FNTK AY+QEEIDE+R++W DFVG  V
Subjt:  SSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X2    1.1e-182    53.5
Query:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF
        E +   RRG TTMH L CIR  G+R  I+YN+QGQ +G+NA +MQS+IGVCVRQ+IP+TY  WKEVPQELKDKIF  +E SFV+D RSK+ ILQSA+ KF
Subjt:  EARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKF

Query:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG
        RTF+ TL + +ILPFKDE   L++PP+KY HIDQ QW +FVNARLSEE E                               +L+ D S RA LW +ARKG
Subjt:  RTFRHTLYQKHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELE-------------------------------ELADDTSTRATLWLQARKG

Query:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID
        KNNEYFDD T++CA RI +ELA  +KG+D+LTEALGT EH G VRG+G  V PS YFN+ + K K+ +   NK +    +PSKK +       K KE ++
Subjt:  KNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMSVKPSTYFNIPRVKEKSSKESDNKMS---CSPSKKSASIGSNHPKDKEGID

Query:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK
          EEI       +EG PCHLA+ S DN+VA+G ++ ++ Q  TVHGVPLGV+NVRV+VD+++ E+A +PIP+RGE+E+L+Q+I  FVAWPR LVI ++ K
Subjt:  DVEEI-------LEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPIPIRGEVESLSQSIRNFVAWPRDLVIFNKGK

Query:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD
            S    T +  +K+TD HV+IKLLNRY MLSMQ +DT+++ LSK +F +EK IYL R+DI+ YC M+EIGYSCI+TYIAYLW V +YEI  KFL+VD
Subjt:  KPDVSVHVPT-SHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYIAYLWTVCDYEIIAKFLLVD

Query:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC
          TIS +VKSQE R  NLANRLEMVN  L+Q V IPY +                                    SL++WQAKHS+ +YR    WK +KC
Subjt:  QITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNT-----------------------------------RSLRMWQAKHSLPQYRFAITWKLVKC

Query:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
        P Q  SVECGYYVQKYIREIV N+S  I N+FNTK AY+QEEIDE+R++W DFVG  V
Subjt:  PHQPSSVECGYYVQKYIREIVYNSSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
No hits found
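
In the alignment blocks above, the unlabeled middle line marks each column: a letter where the two residues are identical, "+" where the substitution scores as similar, and a space otherwise. The Python sketch below tallies one block in this way. The function name tally_block is hypothetical, and the example uses the final block of the XP_022136079.1 alignment. A single block will not reproduce the % identity column, which covers the whole alignment as computed by the search tool.

```python
def tally_block(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identical, similar, columns) for one alignment block.

    `query` is the residue string of the Query line and `midline` is the
    unlabeled line beneath it, both with the 8-character label stripped.
    """
    # Trailing spaces are often lost when a page is copied, so pad the midline.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    similar = midline.count("+")                      # "+" = conservative change
    return identical, similar, len(query)


query = "SSIPIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV"
midline = "+S  I N+FNTK AY+QEEIDE+R++W DFVG  V"

identical, similar, columns = tally_block(query, midline)
print(f"{identical}/{columns} identical, {similar} similar "
      f"({100 * identical / columns:.1f}% identity in this block)")
```

Summing the three counts over every block of a hit gives a per-hit estimate. It may still differ slightly from the reported value, depending on how the tool treats gap columns.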

Sequences
CDS sequence
ATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCATGGTCTGGCATGCATAAGGACTACAGGAGAACGCTTAGTCATCCAATACAACAATCAAGGCCAGAG
TGTTGGTGATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAACTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATA
AAATTTTTTATTCTATAGAGATGTCATTTGTGATCGACCCTCGGTCCAAAAATAGTATTCTTCAATCAGCGGCAAATAAATTTCGAACATTTCGACACACGTTGTATCAA
AAACACATACTTCCATTTAAGGATGAGTCGTCCTTGTTGAAACATCCCCCACAAAAGTATTCACATATTGATCAAAATCAATGGGAGGCATTTGTGAATGCTAGATTATC
GGAAGAATTGGAGGAATTGGCGGATGATACTTCCACTCGTGCCACCTTATGGTTACAGGCACGAAAAGGAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCG
CTGGTCGAATCGTAAACGAACTAGCTATGAAGAATAAAGGTAAAGACGTATTGACCGAAGCATTAGGCACGCCAGAACACAGAGGGGGTGTTAGAGGAATAGGTATGTCT
GTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAGGAAAAATCAAGCAAAGAGTCTGACAACAAAATGTCGTGCTCACCTTCCAAAAAGTCTGCAAGCATAGGCAG
TAATCATCCAAAAGACAAGGAGGGCATTGACGATGTGGAAGAAATTTTAGAGGGAACTCCATGTCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTATAGGCATAA
TGTACACGTCTGACGCTCAATTTTCCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAACATGCTCCATTACCAATT
CCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCTATAAGAAATTTTGTAGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAAGCCGGATGTTTCTGTACA
TGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCACGTGACTATTAAACTTCTGAATCGTTATGCAATGTTATCGATGCAAGAAGATGATACGATTCAAGTCACAT
TGAGCAAGCACATGTTCAGGGAGGAAAAGTTAATTTATTTACATCGCGATGATATCCTACATTATTGTGGGATGGTGGAGATAGGGTACTCCTGCATAGTCACATACATT
GCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTATAAATCT
GGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCTTATAATACTAGATCCTTGAGAATGTGGCAAGCCAAGCACTCACTTCCACAATATC
GTTTTGCCATCACTTGGAAACTTGTAAAATGCCCCCATCAACCGAGTTCTGTGGAGTGCGGGTACTATGTACAGAAGTATATACGAGAAATCGTATATAATTCTAGTATC
CCTATAATGAACCTTTTCAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTGCAATGGACGGATTTTGTTGGCAGATTTGTGTAA
mRNA sequence
ATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGCATGGTCTGGCATGCATAAGGACTACAGGAGAACGCTTAGTCATCCAATACAACAATCAAGGCCAGAG
TGTTGGTGATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAACTTACAAGACTTGGAAAGAAGTTCCCCAAGAATTGAAAGATA
AAATTTTTTATTCTATAGAGATGTCATTTGTGATCGACCCTCGGTCCAAAAATAGTATTCTTCAATCAGCGGCAAATAAATTTCGAACATTTCGACACACGTTGTATCAA
AAACACATACTTCCATTTAAGGATGAGTCGTCCTTGTTGAAACATCCCCCACAAAAGTATTCACATATTGATCAAAATCAATGGGAGGCATTTGTGAATGCTAGATTATC
GGAAGAATTGGAGGAATTGGCGGATGATACTTCCACTCGTGCCACCTTATGGTTACAGGCACGAAAAGGAAAAAATAATGAATACTTCGATGATGAAACCAAACAATGCG
CTGGTCGAATCGTAAACGAACTAGCTATGAAGAATAAAGGTAAAGACGTATTGACCGAAGCATTAGGCACGCCAGAACACAGAGGGGGTGTTAGAGGAATAGGTATGTCT
GTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAGGAAAAATCAAGCAAAGAGTCTGACAACAAAATGTCGTGCTCACCTTCCAAAAAGTCTGCAAGCATAGGCAG
TAATCATCCAAAAGACAAGGAGGGCATTGACGATGTGGAAGAAATTTTAGAGGGAACTCCATGTCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTATAGGCATAA
TGTACACGTCTGACGCTCAATTTTCCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATGTTAGAGTGGTAGTGGACATGATCGTAGGTGAACATGCTCCATTACCAATT
CCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCTATAAGAAATTTTGTAGCATGGCCTCGTGACCTTGTCATTTTTAATAAGGGGAAAAAGCCGGATGTTTCTGTACA
TGTGCCTACTTCACATTCTACAAAGTACACAGATGCTCACGTGACTATTAAACTTCTGAATCGTTATGCAATGTTATCGATGCAAGAAGATGATACGATTCAAGTCACAT
TGAGCAAGCACATGTTCAGGGAGGAAAAGTTAATTTATTTACATCGCGATGATATCCTACATTATTGTGGGATGGTGGAGATAGGGTACTCCTGCATAGTCACATACATT
GCGTATCTTTGGACTGTATGTGACTATGAAATAATCGCCAAGTTCTTGCTAGTTGATCAAATAACCATTTCTAATTTTGTTAAAAGTCAAGAAACACGTTGTATAAATCT
GGCTAACAGGTTAGAAATGGTTAATTTGGACTTGGATCAACAAGTTTTCATCCCTTATAATACTAGATCCTTGAGAATGTGGCAAGCCAAGCACTCACTTCCACAATATC
GTTTTGCCATCACTTGGAAACTTGTAAAATGCCCCCATCAACCGAGTTCTGTGGAGTGCGGGTACTATGTACAGAAGTATATACGAGAAATCGTATATAATTCTAGTATC
CCTATAATGAACCTTTTCAACACAAAAACTGCATATAAACAAGAAGAAATCGACGAGATTCGAGTGCAATGGACGGATTTTGTTGGCAGATTTGTGTAA
Protein sequence
MEARHTNRRGLTTMHGLACIRTTGERLVIQYNNQGQSVGDNANQMQSYIGVCVRQQIPLTYKTWKEVPQELKDKIFYSIEMSFVIDPRSKNSILQSAANKFRTFRHTLYQ
KHILPFKDESSLLKHPPQKYSHIDQNQWEAFVNARLSEELEELADDTSTRATLWLQARKGKNNEYFDDETKQCAGRIVNELAMKNKGKDVLTEALGTPEHRGGVRGIGMS
VKPSTYFNIPRVKEKSSKESDNKMSCSPSKKSASIGSNHPKDKEGIDDVEEILEGTPCHLAIGSKDNVVAIGIMYTSDAQFSTVHGVPLGVENVRVVVDMIVGEHAPLPI
PIRGEVESLSQSIRNFVAWPRDLVIFNKGKKPDVSVHVPTSHSTKYTDAHVTIKLLNRYAMLSMQEDDTIQVTLSKHMFREEKLIYLHRDDILHYCGMVEIGYSCIVTYI
AYLWTVCDYEIIAKFLLVDQITISNFVKSQETRCINLANRLEMVNLDLDQQVFIPYNTRSLRMWQAKHSLPQYRFAITWKLVKCPHQPSSVECGYYVQKYIREIVYNSSI
PIMNLFNTKTAYKQEEIDEIRVQWTDFVGRFV
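
The CDS and protein sequences above can be checked against each other by translating the CDS. The Python sketch below does this with the standard genetic code. It assumes the standard nuclear code applies to this gene, and the helper name translate is hypothetical. Only the first and last few codons of the CDS are used here, to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon.

    Codons containing characters other than A, C, G, T become 'X', and any
    trailing partial codon is ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First ten and last five codons of the CDS listed above.
print(translate("ATGGAGGCTAGGCATACTAATCGACGTGGT"))  # start of the protein
print(translate("GGCAGATTTGTGTAA"))                 # end of the protein plus stop
```

To check the full record, join the CDS lines into one string, translate it, and compare the result with the protein lines joined the same way. The translation should end in a single "*" for the TAA stop codon.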