; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025055 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025055
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionCoiled-coil domain-containing protein 21, putative isoform 2
Genome locationtig00003412:955540..991791
RNA-Seq ExpressionSgr025055
SyntenySgr025055
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008455465.1 PREDICTED: uncharacterized protein LOC103495622 [Cucumis melo]1.3e-11485.54Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLAGKEAAYFIQESKHAVGRLAQKNT QKFPA+ PPS HS ADGE QADILPEVLRHSLPSKIFRE+S +S+GS STSKWVLPS+PNYRSVSSDALN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAFL+LPQVTFGPKRWELPQ ENS+L STANDLR DKHTP+NPEKLKAAAEGLA+VGKAFAVATALVFGGATLIFGFTMSKLD+NN++EIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EPKMEMIR+Q VPFK WADD+SKKWHVER+ D KEK LIKELSK LGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

XP_022135886.1 uncharacterized protein LOC111007726 [Momordica charantia]3.2e-12191.16Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLAGKEAAYFIQESKHA+GRL QKNTTQKFPAQPPPS HSPADGE QADILPEVLRHSLPSKIFREQS DSNGSL TSKWVLPS+PN RSVSSDALN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAFLTLPQVTFGPKRWELPQ ENSVL STANDLR DKHT INPEKLKAAAEGLA+VGKAFA+ATALVFGGATLIFGFT+SKLDLNNSNEIQTK RNL+
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EPKMEMIREQLVPFK WA+DMSKKWHVER+KD KEK LIKELSKTLGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

XP_022969227.1 uncharacterized protein LOC111468292 [Cucurbita maxima]9.4e-11385.94Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLA KEAAYFIQESKHAVGRLA     QKFPAQPPPS +S  DGESQADILPEVLRHSLPSKI+REQS DS+GSLSTSKWVLPSNPNYRSVSSD+LN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAF++LPQV+FGPKRWELPQ  NSVL STANDLR DKHTPINPEKLKAAAEGLA++GKAFAVATALVFGGATLIFG TMSKLDLNN+NEIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EP+M+MIREQLVPFK WAD MSKKWHVEREKDTKEK LIKELSKTLGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

XP_023554586.1 uncharacterized protein LOC111811789 [Cucurbita pepo subsp. pepo]2.3e-11183.94Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLA KEAAYFIQESKHAVGRLA     QKFPAQPPPS HS  DGESQAD+LPEVLRHSLPSKI+REQS DS+GSLSTSKW+LPSNPNYRSVSSD+LN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAF++LPQV+ GPKRWELPQ  NSVL STANDLR DKHTPINPEKLKAAAEGLA++GKAFAVATALVFGGATLIFG TMSKLDLNN+NEIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EP+M+MIREQLVPFK WAD MSKKWHVEREKD KEK L+KELSKTLGA+
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

XP_038887935.1 uncharacterized protein LOC120077905 [Benincasa hispida]8.2e-11786.75Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLA KEAAYFIQESKHAVGRLAQKN  QKFPAQ PPS HSP DGESQADILPEVLRHSLPSKIFRE+S DSNGS STSKWVLPSNPNYRSVS D+LN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAFL+LPQVTFGPKRWELPQ ENSVL STANDLR DKHTP+NPEKLKAAAEGLA+VGKAFAVATALVFGGATLIFG+T+SKLDLNN++EIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EPKMEMIREQL+PFK WADDMSKKWHVER++D KEK LIKELSK LGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

TrEMBL top hitse value%identityAlignment
A0A0A0K240 Uncharacterized protein1.9e-11183.53Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLAGKEAAYFIQESKHAVGRLAQKNT  KFPAQ PPS HS   GE QADILPEVLRHSLPS IFRE+S +S+GS STSKWVLPS+PNYRSVSS++LN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLR FL+LPQVTFGPKRWELPQ ENS+L STANDLR DKHTPINPEKLKAAAEGLA+VGKAFA ATALVFGGATLIFGFT+SKLD+NN++EIQTK + ++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EPKMEMIR+QLVPFK WADDMSKKWHVER+ D KEK LIKELSK LGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

A0A1S3C0Z0 uncharacterized protein LOC1034956226.3e-11585.54Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLAGKEAAYFIQESKHAVGRLAQKNT QKFPA+ PPS HS ADGE QADILPEVLRHSLPSKIFRE+S +S+GS STSKWVLPS+PNYRSVSSDALN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAFL+LPQVTFGPKRWELPQ ENS+L STANDLR DKHTP+NPEKLKAAAEGLA+VGKAFAVATALVFGGATLIFGFTMSKLD+NN++EIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EPKMEMIR+Q VPFK WADD+SKKWHVER+ D KEK LIKELSK LGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

A0A6J1C1Z9 uncharacterized protein LOC1110077261.6e-12191.16Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLAGKEAAYFIQESKHA+GRL QKNTTQKFPAQPPPS HSPADGE QADILPEVLRHSLPSKIFREQS DSNGSL TSKWVLPS+PN RSVSSDALN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAFLTLPQVTFGPKRWELPQ ENSVL STANDLR DKHT INPEKLKAAAEGLA+VGKAFA+ATALVFGGATLIFGFT+SKLDLNNSNEIQTK RNL+
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EPKMEMIREQLVPFK WA+DMSKKWHVER+KD KEK LIKELSKTLGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

A0A6J1GLK0 uncharacterized protein LOC1114550891.6e-11083.94Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLA KEAAYFIQESKHAVGRLA     QKFPAQPPPS HS  +GESQADILPEVLRHSLPS  +REQS DS+GSLSTSKWVLPSNPNYRSVSSD+LN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAF++LPQV+ GPKRWELPQ  NSVL STANDLR DKHTPINPEKLKAAAEGLA++GKAFAVATALVFGGATLIFG TMSKLDLNN+NEIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EP+M+MIREQLVPFK WAD MSKKWHVEREKD KEK LIKELSKTLGA+
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

A0A6J1I0D6 uncharacterized protein LOC1114682924.5e-11385.94Show/hide
Query:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN
        LAGRLA KEAAYFIQESKHAVGRLA     QKFPAQPPPS +S  DGESQADILPEVLRHSLPSKI+REQS DS+GSLSTSKWVLPSNPNYRSVSSD+LN
Subjt:  LAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYRSVSSDALN

Query:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV
        PLRAF++LPQV+FGPKRWELPQ  NSVL STANDLR DKHTPINPEKLKAAAEGLA++GKAFAVATALVFGGATLIFG TMSKLDLNN+NEIQTK ++++
Subjt:  PLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVRNLV

Query:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK
        EP+M+MIREQLVPFK WAD MSKKWHVEREKDTKEK LIKELSKTLGAK
Subjt:  EPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05060.1 unknown protein7.0e-7457.48Show/hide
Query:  AGRLAGKEAAYFIQESKHAVGRLAQKN--TTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYR-SVSSDA
        AGRLAGKEAAYF QESKHAV RLA+K+  T +K P+ PP  P      E Q D+LPE+LRHSLPSKI+  +  D +     SKW L S+PN   S+S D 
Subjt:  AGRLAGKEAAYFIQESKHAVGRLAQKN--TTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKWVLPSNPNYR-SVSSDA

Query:  LNPLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKH-TPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVR
        LNPLR +++LPQVTFG +RW+LP+ ENSVL STAN+LR D++ TP+NPEKLKAA EGL ++GKAFA AT ++FG ATL+FG   SKLD+ N+++I+TK +
Subjt:  LNPLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKH-TPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQTKVR

Query:  NLVEPKMEMIREQLVPFKAWADDMSKKWHVERE--KDTKEKLLIKELSKTLGAK
        +L +PK+E ++EQ+ P + WA++MSKKWH+E E     KEK ++KELSK LG K
Subjt:  NLVEPKMEMIREQLVPFKAWADDMSKKWHVERE--KDTKEKLLIKELSKTLGAK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGACGGCTCGCTCGCGTTCAAGACTGATTGCTCGCTCGTGCGAGCCGCAGTGTCGGAGCAGGCGGTCCATGAGTGCAGCATGGGCGATCACTGCCACGTTCCTGAG
TCGGCACGGGTCAAGGGCGCTATTAGCGGCTGAGAAGGTGCGCGAAAGGAGATTCGGCCGCAGGCCAGAGCCAACGGAGAACCGGACCCGCCATTTTCGAGCTTTAAGAG
GTGTTGGCGCTTTCAGGCGACTCTCGCAGCTTTATGGGTTTCTTCCGGCGATTCTGAACTCAAACGGCGAGTTAGGATTTGTTATCTCTTCTATGTATTTTTATATATTT
TTTATTTTTTTTTTTATATTTTCTTTCTTTCTTTCTTTCTTTTTTTATTGCTCTCGGTATGGTTGGGAGAAGTATGAGCTTTTGGCCGGAAGATTGGCCGGGAAAGAGGC
GGCATACTTCATTCAGGAATCCAAACATGCCGTCGGCCGGCTTGCCCAGAAAAACACGACCCAGAAGTTTCCGGCGCAACCACCGCCTTCTCCCCATTCTCCGGCGGACG
GTGAATCGCAAGCCGACATTCTTCCCGAGGTCTTAAGGCACTCTCTGCCCTCTAAAATCTTTAGGGAGCAATCTGTTGACTCTAATGGCTCCCTCTCTACTTCGAAATGG
GTCCTTCCTTCAAACCCCAATTATCGCTCTGTCTCTTCAGATGCTCTCAACCCTCTTAGGGCTTTCCTCACCCTTCCCCAAGTTACCTTCGGCCCCAAAAGGTGGGAATT
GCCTCAATATGAAAACTCAGTATTGGGCTCAACAGCTAATGACTTGCGGATCGACAAGCATACTCCGATTAACCCTGAGAAGTTGAAGGCTGCTGCTGAAGGCCTTGCTT
ATGTTGGAAAGGCATTTGCTGTAGCTACTGCGCTTGTCTTTGGTGGTGCCACCTTGATCTTTGGATTCACCATGTCCAAGCTAGATCTCAACAATTCCAATGAGATCCAA
ACAAAAGTAAGAAACTTGGTTGAACCAAAGATGGAGATGATTAGAGAGCAACTGGTTCCCTTCAAAGCCTGGGCTGACGATATGTCAAAGAAATGGCATGTGGAAAGAGA
AAAAGATACCAAAGAGAAGCTCCTTATAAAGGAACTCTCAAAGACTTTGGGTGCCAAGATTCTCGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGACGGCTCGCTCGCGTTCAAGACTGATTGCTCGCTCGTGCGAGCCGCAGTGTCGGAGCAGGCGGTCCATGAGTGCAGCATGGGCGATCACTGCCACGTTCCTGAG
TCGGCACGGGTCAAGGGCGCTATTAGCGGCTGAGAAGGTGCGCGAAAGGAGATTCGGCCGCAGGCCAGAGCCAACGGAGAACCGGACCCGCCATTTTCGAGCTTTAAGAG
GTGTTGGCGCTTTCAGGCGACTCTCGCAGCTTTATGGGTTTCTTCCGGCGATTCTGAACTCAAACGGCGAGTTAGGATTTGTTATCTCTTCTATGTATTTTTATATATTT
TTTATTTTTTTTTTTATATTTTCTTTCTTTCTTTCTTTCTTTTTTTATTGCTCTCGGTATGGTTGGGAGAAGTATGAGCTTTTGGCCGGAAGATTGGCCGGGAAAGAGGC
GGCATACTTCATTCAGGAATCCAAACATGCCGTCGGCCGGCTTGCCCAGAAAAACACGACCCAGAAGTTTCCGGCGCAACCACCGCCTTCTCCCCATTCTCCGGCGGACG
GTGAATCGCAAGCCGACATTCTTCCCGAGGTCTTAAGGCACTCTCTGCCCTCTAAAATCTTTAGGGAGCAATCTGTTGACTCTAATGGCTCCCTCTCTACTTCGAAATGG
GTCCTTCCTTCAAACCCCAATTATCGCTCTGTCTCTTCAGATGCTCTCAACCCTCTTAGGGCTTTCCTCACCCTTCCCCAAGTTACCTTCGGCCCCAAAAGGTGGGAATT
GCCTCAATATGAAAACTCAGTATTGGGCTCAACAGCTAATGACTTGCGGATCGACAAGCATACTCCGATTAACCCTGAGAAGTTGAAGGCTGCTGCTGAAGGCCTTGCTT
ATGTTGGAAAGGCATTTGCTGTAGCTACTGCGCTTGTCTTTGGTGGTGCCACCTTGATCTTTGGATTCACCATGTCCAAGCTAGATCTCAACAATTCCAATGAGATCCAA
ACAAAAGTAAGAAACTTGGTTGAACCAAAGATGGAGATGATTAGAGAGCAACTGGTTCCCTTCAAAGCCTGGGCTGACGATATGTCAAAGAAATGGCATGTGGAAAGAGA
AAAAGATACCAAAGAGAAGCTCCTTATAAAGGAACTCTCAAAGACTTTGGGTGCCAAGATTCTCGACTGA
Protein sequenceShow/hide protein sequence
MTTARSRSRLIARSCEPQCRSRRSMSAAWAITATFLSRHGSRALLAAEKVRERRFGRRPEPTENRTRHFRALRGVGAFRRLSQLYGFLPAILNSNGELGFVISSMYFYIF
FIFFFIFSFFLSFFFYCSRYGWEKYELLAGRLAGKEAAYFIQESKHAVGRLAQKNTTQKFPAQPPPSPHSPADGESQADILPEVLRHSLPSKIFREQSVDSNGSLSTSKW
VLPSNPNYRSVSSDALNPLRAFLTLPQVTFGPKRWELPQYENSVLGSTANDLRIDKHTPINPEKLKAAAEGLAYVGKAFAVATALVFGGATLIFGFTMSKLDLNNSNEIQ
TKVRNLVEPKMEMIREQLVPFKAWADDMSKKWHVEREKDTKEKLLIKELSKTLGAKILD