; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008930 (gene) of Snake gourd v1 genome

Gene IDTan0008930
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPlant basic secretory protein (BSP) family protein
Genome locationLG02:13293873..13294619
RNA-Seq ExpressionTan0008930
SyntenyTan0008930
Gene Ontology termsNA
InterPro domainsIPR007541 - Uncharacterised protein family, basic secretory protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571941.1 hypothetical protein SDJN03_28669, partial [Cucurbita argyrosperma subsp. sororia]1.6e-9173.78Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++N +IFFLF  L LLQ VSAVEY+VTN A  TPGGIRFDNE+G +YS+Q+L AATDFIWNIF Q    DRK+VQK+SLFIDR+ D VAFA N +IHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        +           EI+GVLY EMTHIWQWNGNPSAP GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFVAELNR L+NGY+
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKTEYGN
        ADYF+QLLGKPVDQLW+DYK  +GN
Subjt:  ADYFVQLLGKPVDQLWSDYKTEYGN

XP_022148820.1 uncharacterized protein LOC111017379 [Momordica charantia]1.8e-9576.89Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++NN IFFL   LALLQ VSAVEY+VTN A  TPGG+RFDNE+GA+YS Q LVAATDFIWNIF+Q+TAADRK+V K+SLFID D D VAFA N EIHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        N           EI+GVLY EMTHIWQWNGNPSAP GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFV+ELNRRLRNGY 
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKTEYGN
        ADYFVQLLGK VDQLW+DYK  +GN
Subjt:  ADYFVQLLGKPVDQLWSDYKTEYGN

XP_022952790.1 uncharacterized protein LOC111455381 [Cucurbita moschata]2.1e-9173.13Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++N +IFFLF  L LLQ VSAVEY+VTN A  T GGIRFDNE+G +YSQQ+L AATDFIWNIF Q++ ADRK+VQK+SLFID++ D VAFA N EIHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  -------------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNG
                     N  +GVLY EMTHIWQWNGNPSAP GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFVAELNRRL+NG
Subjt:  -------------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNG

Query:  YTADYFVQLLGKPVDQLWSDYKTEYGN
        Y+ADYF+QLLGKPVD+LW+DYK  +GN
Subjt:  YTADYFVQLLGKPVDQLWSDYKTEYGN

XP_022972382.1 uncharacterized protein LOC111470952 [Cucurbita maxima]4.3e-9274.22Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++N +IFFLF+ L LLQ VSAVEY VTN A  TPGGIRFDNE+GA+YS+Q+L AATDFIW IF Q++ ADRK+VQK+SLFIDRD D VAFA N EIHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        +           EI+GVLY EMTHIWQWNGNPS+P GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFVAELNR L+NGY+
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKTEYGN
        ADYF+QLLGKPVDQLW+DYK  + N
Subjt:  ADYFVQLLGKPVDQLWSDYKTEYGN

XP_038888417.1 uncharacterized protein LOC120078264 [Benincasa hispida]1.6e-9168.29Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++NNLIFFL   LALLQ++ AVEY+VTN A  TPGG RFDN +GANYS+Q L AAT  IWNIF Q+TAADRKDVQKISLFID++ + VAF  N EIH+GA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        N           EI+GVLY EMT+IWQW+GN  AP GLIEGIADY+R++SGYIPG WV PGGG+RW +GYDVTARFL+ LEG+RSG VA+LNRRLRNGY+
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKT--EYGNIDNKCNNFVKIKANGVFAL
        ADYF QLLGKPVD+LW+ YKT  +YG ++NKCN F +I+ NGVFAL
Subjt:  ADYFVQLLGKPVDQLWSDYKT--EYGNIDNKCNNFVKIKANGVFAL

TrEMBL top hitse value%identityAlignment
A0A0A0K4M0 NtPRp27-like protein6.9e-8870.04Show/hide
Query:  MASNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHV
        MA +N + FFLF+ L LLQ VSAV+++VTN A  TPGG+RFDNE+G + S+Q++VA+TDFIWNIF+Q++ A+RK+V K+ LFI  D D VAFA N EIHV
Subjt:  MASNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHV

Query:  GA-----------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNG
         A            EI+GVLY EMTHIWQWNG P+AP GLIEGIADYVR++SGYIPGHWVGPGGGS W QGYDVTARFL+ LEGLRSGFVAELNRRLRNG
Subjt:  GA-----------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNG

Query:  YTADYFVQLLGKPVDQLWSDYKTEYGN
        Y+A+YFVQLLGKPVDQLW+DYK  YGN
Subjt:  YTADYFVQLLGKPVDQLWSDYKTEYGN

A0A6J1D406 uncharacterized protein LOC1110173799.0e-9676.89Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++NN IFFL   LALLQ VSAVEY+VTN A  TPGG+RFDNE+GA+YS Q LVAATDFIWNIF+Q+TAADRK+V K+SLFID D D VAFA N EIHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        N           EI+GVLY EMTHIWQWNGNPSAP GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFV+ELNRRLRNGY 
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKTEYGN
        ADYFVQLLGK VDQLW+DYK  +GN
Subjt:  ADYFVQLLGKPVDQLWSDYKTEYGN

A0A6J1D681 uncharacterized protein LOC1110173063.2e-8567.56Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++NNLIFFL   LALLQ VSAVEY++T  AG TPGG+RF+NE+GA YS+Q L AAT+FIWNIF Q+TAADRK+VQ ++LFIDRD + VAFA N +IH+ A
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        +           EI+G+LY EMTH+WQWNGNPSAP  +IEGIADYVR++SGYIPGHWV PGGG  W +GYD TARF E L+G RSGFVAELNRRLR GY+
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKTEYGN
         DYFVQLLGKPV+QLW++YK  +GN
Subjt:  ADYFVQLLGKPVDQLWSDYKTEYGN

A0A6J1GMQ9 uncharacterized protein LOC1114553811.0e-9173.13Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++N +IFFLF  L LLQ VSAVEY+VTN A  T GGIRFDNE+G +YSQQ+L AATDFIWNIF Q++ ADRK+VQK+SLFID++ D VAFA N EIHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  -------------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNG
                     N  +GVLY EMTHIWQWNGNPSAP GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFVAELNRRL+NG
Subjt:  -------------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNG

Query:  YTADYFVQLLGKPVDQLWSDYKTEYGN
        Y+ADYF+QLLGKPVD+LW+DYK  +GN
Subjt:  YTADYFVQLLGKPVDQLWSDYKTEYGN

A0A6J1I4N3 uncharacterized protein LOC1114709522.1e-9274.22Show/hide
Query:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA
        ++N +IFFLF+ L LLQ VSAVEY VTN A  TPGGIRFDNE+GA+YS+Q+L AATDFIW IF Q++ ADRK+VQK+SLFIDRD D VAFA N EIHVGA
Subjt:  SNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA

Query:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT
        +           EI+GVLY EMTHIWQWNGNPS+P GLIEGIADYVR++SGYIPGHWVGPGGGSRW QGYDVTARFL+ LEGLRSGFVAELNR L+NGY+
Subjt:  N-----------EISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYT

Query:  ADYFVQLLGKPVDQLWSDYKTEYGN
        ADYF+QLLGKPVDQLW+DYK  + N
Subjt:  ADYFVQLLGKPVDQLWSDYKTEYGN

SwissProt top hitse value%identityAlignment
C0HJG8 Basic secretory protease (Fragments)9.8e-0747.46Show/hide
Query:  WFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYFVQLLGKPVDQLWSDYKTEYG
        W +GYDVTARFL+ L  L +GFVAEL                  K V+QLWS+YK  YG
Subjt:  WFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYFVQLLGKPVDQLWSDYKTEYG

Arabidopsis top hitse value%identityAlignment
AT2G15130.1 Plant basic secretory protein (BSP) family protein2.2e-6250.91Show/hide
Query:  IFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEV-GANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGANEI-
        IF + + +  +  VSAV++SV +  G +PGG RF NE+ G +Y +Q L  ATDF W +F+QT  +DRKDV KI+LF++ + + +A++   EIH  A  + 
Subjt:  IFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEV-GANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGANEI-

Query:  ----------SGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYF
                  +GV+Y E+ H WQWNG   AP GLIEGIADYVR+++GY+  HWV PGGG RW QGYDVTARFLE    LR+GFVAELN+++R+ Y   +F
Subjt:  ----------SGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYF

Query:  VQLLGKPVDQLWSDYKTEYG
        V LLGK V+QLW +YK  YG
Subjt:  VQLLGKPVDQLWSDYKTEYG

AT2G15130.2 Plant basic secretory protein (BSP) family protein7.4e-4254.74Show/hide
Query:  VAFAMNGEIHVGANEI-----------SGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGF
        +A++   EIH  A  +           +GV+Y E+ H WQWNG   AP GLIEGIADYVR+++GY+  HWV PGGG RW QGYDVTARFLE    LR+GF
Subjt:  VAFAMNGEIHVGANEI-----------SGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGF

Query:  VAELNRRLRNGYTADYFVQLLGKPVDQLWSDYKTEYG
        VAELN+++R+ Y   +FV LLGK V+QLW +YK  YG
Subjt:  VAELNRRLRNGYTADYFVQLLGKPVDQLWSDYKTEYG

AT2G15170.1 Plant basic secretory protein (BSP) family protein6.1e-1241.49Show/hide
Query:  FLFACLAL-LQTVSAVEYSVTNGAGRTPGGIRFDNEV-GANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMN--GEIH
        FL  CL L +  V+AV++ V +  G +PGG +F +E+ G +Y +Q + +ATDF W +F+QT   DRK +  I+LFI+ + + VA+  N   EIH
Subjt:  FLFACLAL-LQTVSAVEYSVTNGAGRTPGGIRFDNEV-GANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMN--GEIH

AT2G15220.1 Plant basic secretory protein (BSP) family protein3.1e-6452.27Show/hide
Query:  IFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGA-NYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA----
        IFF+ + + ++  V+AV+YSV + +G + GG RF  E+G  +Y  Q L +ATDF+W +F+QT  +DRK V KI+LF++   D VA+    EIH       
Subjt:  IFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGA-NYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGA----

Query:  -------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYF
                E +GV+Y E+ H WQWNG   AP GLIEGIADYVR+++GY P HWVGPG G RW QGYDVTARFL+   GLR+GFVAELN+++RNGY+  +F
Subjt:  -------NEISGVLYREMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYF

Query:  VQLLGKPVDQLWSDYKTEYG
        V LLGK V+QLW +YK +YG
Subjt:  VQLLGKPVDQLWSDYKTEYG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCAACAACAACTTAATCTTCTTCCTCTTCGCCTGTCTCGCTCTCCTGCAAACCGTTTCCGCCGTCGAGTATTCCGTCACCAACGGCGCCGGCAGAACTCCCGG
CGGCATCCGCTTCGACAACGAAGTCGGAGCGAATTATAGCCAGCAGATGCTGGTAGCTGCCACTGATTTTATCTGGAACATTTTCGAGCAAACCACCGCCGCCGACCGGA
AAGACGTGCAGAAGATCAGCCTGTTCATCGACAGAGACTGCGATAGAGTGGCGTTCGCCATGAACGGCGAAATTCACGTCGGCGCAAACGAGATTTCAGGGGTGCTGTAC
CGCGAAATGACCCATATCTGGCAGTGGAACGGGAACCCATCGGCGCCGGAGGGGCTGATTGAAGGGATCGCCGATTACGTACGGATGAGGTCTGGGTATATACCGGGGCA
CTGGGTGGGGCCCGGCGGCGGGAGTAGGTGGTTCCAAGGGTACGACGTGACGGCGAGATTTTTGGAAAATTTAGAGGGGCTGAGGAGTGGGTTCGTGGCGGAGCTGAACC
GGAGGCTCAGAAATGGCTACACTGCCGATTACTTCGTGCAGCTGCTGGGGAAGCCGGTGGATCAGCTTTGGTCGGATTACAAGACTGAGTATGGAAATATAGATAATAAA
TGCAATAATTTTGTTAAAATAAAAGCAAATGGGGTGTTTGCTCTACAGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCAACAACAACTTAATCTTCTTCCTCTTCGCCTGTCTCGCTCTCCTGCAAACCGTTTCCGCCGTCGAGTATTCCGTCACCAACGGCGCCGGCAGAACTCCCGG
CGGCATCCGCTTCGACAACGAAGTCGGAGCGAATTATAGCCAGCAGATGCTGGTAGCTGCCACTGATTTTATCTGGAACATTTTCGAGCAAACCACCGCCGCCGACCGGA
AAGACGTGCAGAAGATCAGCCTGTTCATCGACAGAGACTGCGATAGAGTGGCGTTCGCCATGAACGGCGAAATTCACGTCGGCGCAAACGAGATTTCAGGGGTGCTGTAC
CGCGAAATGACCCATATCTGGCAGTGGAACGGGAACCCATCGGCGCCGGAGGGGCTGATTGAAGGGATCGCCGATTACGTACGGATGAGGTCTGGGTATATACCGGGGCA
CTGGGTGGGGCCCGGCGGCGGGAGTAGGTGGTTCCAAGGGTACGACGTGACGGCGAGATTTTTGGAAAATTTAGAGGGGCTGAGGAGTGGGTTCGTGGCGGAGCTGAACC
GGAGGCTCAGAAATGGCTACACTGCCGATTACTTCGTGCAGCTGCTGGGGAAGCCGGTGGATCAGCTTTGGTCGGATTACAAGACTGAGTATGGAAATATAGATAATAAA
TGCAATAATTTTGTTAAAATAAAAGCAAATGGGGTGTTTGCTCTACAGCTTTAA
Protein sequenceShow/hide protein sequence
MASNNNLIFFLFACLALLQTVSAVEYSVTNGAGRTPGGIRFDNEVGANYSQQMLVAATDFIWNIFEQTTAADRKDVQKISLFIDRDCDRVAFAMNGEIHVGANEISGVLY
REMTHIWQWNGNPSAPEGLIEGIADYVRMRSGYIPGHWVGPGGGSRWFQGYDVTARFLENLEGLRSGFVAELNRRLRNGYTADYFVQLLGKPVDQLWSDYKTEYGNIDNK
CNNFVKIKANGVFALQL