; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007562 (gene) of Snake gourd v1 genome

Gene IDTan0007562
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG02:38300757..38302348
RNA-Seq ExpressionTan0007562
SyntenyTan0007562
Gene Ontology termsNA
InterPro domainsIPR004264 - Transposase, Tnp1/En/Spm-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022136076.1 uncharacterized protein LOC111007859 isoform X1 [Momordica charantia]1.1e-5035.91Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS
                                       DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK   
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS

Query:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV
              K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FV
Subjt:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV

Query:  A
        A
Subjt:  A

XP_022136077.1 uncharacterized protein LOC111007859 isoform X2 [Momordica charantia]8.7e-5136Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFASI
                                      DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK    
Subjt:  ------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFASI

Query:  DSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA
             K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FVA
Subjt:  DSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA

XP_022136079.1 uncharacterized protein LOC111007859 isoform X3 [Momordica charantia]1.1e-5035.91Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS
                                       DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK   
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS

Query:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV
              K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FV
Subjt:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV

Query:  A
        A
Subjt:  A

XP_022136080.1 uncharacterized protein LOC111007859 isoform X4 [Momordica charantia]1.1e-5035.91Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS
                                       DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK   
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS

Query:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV
              K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FV
Subjt:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV

Query:  A
        A
Subjt:  A

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]4.8e-4933.41Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN+ GQ VG NA +MQS+IGVCVRQQIPLTYK+WK V QELKD IFD ++       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMSCSPSKKFASIDS
                                       DELA   KG+DILTEALGTPEHRGR+RG+G  V P+ ++N+ + K K  + S N+     S+       
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMSCSPSKKFASIDS

Query:  NH----------------------------------PKDKEVIDDVEEILEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG
        +                                   PK K V+ D EEILE  PCHLAIGS DN+VA+GTM+ SDAQ P+++ +PLG +NVR +VD+++G
Subjt:  NH----------------------------------PKDKEVIDDVEEILEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVG

Query:  EDAPLPIPIRGEVESLSQSIGNFVA
        ED  LPIP + ++++L Q+IGNFVA
Subjt:  EDAPLPIPIRGEVESLSQSIGNFVA

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X15.7e-4833.33Show/hide
Query:  SSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE---------
        SS DE NV I+ E + T RRG T M  L  +R +GER  I+YN++GQ VG NA +MQS+IGVCVRQQIP+TY +WKEV QELKD IFD ++         
Subjt:  SSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE---------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNK-------------MSC
                                     DELA   KG+DILTEALGTPEHRGR+RG+G  V P+ + N+ R   K S+ S +K              S 
Subjt:  -----------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNK-------------MSC

Query:  SPSKKFASIDSNH------------------------PKDKEVIDDVEEILE------------ETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLG
        + ++   S D N                         PK K V+ + EE LE              PCHLAIGS DNVVA+G M+ SD Q PT+HG+PLG
Subjt:  SPSKKFASIDSNH------------------------PKDKEVIDDVEEILE------------ETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLG

Query:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA
         EN+RV VD+ + ED  LPIP++G++E+L+Q+IGNFVA
Subjt:  VENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA

A0A6J1C2H7 uncharacterized protein LOC111007859 isoform X15.5e-5135.91Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS
                                       DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK   
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS

Query:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV
              K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FV
Subjt:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV

Query:  A
        A
Subjt:  A

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X45.5e-5135.91Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS
                                       DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK   
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS

Query:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV
              K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FV
Subjt:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV

Query:  A
        A
Subjt:  A

A0A6J1C398 uncharacterized protein LOC111007859 isoform X35.5e-5135.91Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS
                                       DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK   
Subjt:  -------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFAS

Query:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV
              K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FV
Subjt:  IDSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFV

Query:  A
        A
Subjt:  A

A0A6J1C4J7 uncharacterized protein LOC111007859 isoform X24.2e-5136Show/hide
Query:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------
        S SS DE +V I  E +   RRG TTM  L  +R  G+R  I+YN+QGQ +G NA +MQS+IGVCVRQ+IP+TY  WKEV QELKDKIF+ VE       
Subjt:  SGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVE-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFASI
                                      DELA  +KG+DILTEALGT EH GRVRG+G  V PS YFN+ + K K+ +   NK +    +PSKK    
Subjt:  ------------------------------DELAMKNKGKDILTEALGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMS---CSPSKKFASI

Query:  DSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA
             K KE+++  EEI       +E  PCHLA+ S DN+VA+GT++ ++ Q PTVHGVPLGV+NVRV+VD+++ E A +PIP+RGE+E+L+Q+IG FVA
Subjt:  DSNHPKDKEVIDDVEEI-------LEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENVRVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATCAAGTGACGATGAAGTGAACGTTGCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGTGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGCTTAGTCATCCAATACAACAATCAAGGCCAGAGTGTTGGTTATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAA
CTTACAAGACTTGGAAAGAAGTTCTCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGACGAACTGGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCA
TTAGGCACGCCAGAACACAGAGGGCGTGTTAGAGGAATAGGTATGTCTGTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAGGAAAAATCAAGCAAAGGGTCTGG
CAATAAAATGTCGTGCTCACCTTCCAAAAAGTTTGCAAGTATAGACAGTAATCATCCAAAAGACAAGGAGGTCATTGACGATGTGGAAGAAATTTTAGAGGAAACTCCAT
GTCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTCTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATGTT
AGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCTATAGGAAATTTTGTGGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATCAAGTGACGATGAAGTGAACGTTGCGATCCAAATGGAGGCTAGGCATACTAATCGACGTGGTCTCACTACTATGTGTGGTCTGGCACGCGTAAGGACTAC
AGGAGAACGCTTAGTCATCCAATACAACAATCAAGGCCAGAGTGTTGGTTATAATGCAAACCAAATGCAAAGTTATATTGGAGTTTGCGTTAGGCAACAAATTCCATTAA
CTTACAAGACTTGGAAAGAAGTTCTCCAAGAATTGAAAGATAAAATTTTTGATTCTGTAGAGGACGAACTGGCTATGAAGAATAAAGGTAAAGACATATTGACCGAAGCA
TTAGGCACGCCAGAACACAGAGGGCGTGTTAGAGGAATAGGTATGTCTGTCAAACCATCAACATACTTTAACATTCCTCGAGTGAAGGAAAAATCAAGCAAAGGGTCTGG
CAATAAAATGTCGTGCTCACCTTCCAAAAAGTTTGCAAGTATAGACAGTAATCATCCAAAAGACAAGGAGGTCATTGACGATGTGGAAGAAATTTTAGAGGAAACTCCAT
GTCATCTAGCAATAGGATCAAAGGATAATGTGGTTGCTCTAGGCACAATGTACACGTCTGACGCTCAATTTCCCACAGTCCATGGAGTTCCCTTAGGAGTTGAAAATGTT
AGAGTGGTAGTGGACATGATCGTAGGTGAAGATGCTCCATTACCAATTCCTATACGGGGAGAAGTAGAGTCCCTGAGTCAATCTATAGGAAATTTTGTGGCATGA
Protein sequenceShow/hide protein sequence
MSGSSDDEVNVAIQMEARHTNRRGLTTMCGLARVRTTGERLVIQYNNQGQSVGYNANQMQSYIGVCVRQQIPLTYKTWKEVLQELKDKIFDSVEDELAMKNKGKDILTEA
LGTPEHRGRVRGIGMSVKPSTYFNIPRVKEKSSKGSGNKMSCSPSKKFASIDSNHPKDKEVIDDVEEILEETPCHLAIGSKDNVVALGTMYTSDAQFPTVHGVPLGVENV
RVVVDMIVGEDAPLPIPIRGEVESLSQSIGNFVA