; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr003494 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr003494
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionnitrate regulatory gene2 protein-like
Genome locationtig00002207:390..1547
RNA-Seq ExpressionSgr003494
SyntenySgr003494
Gene Ontology termsNA
InterPro domainsIPR006867 - Domain of unknown function DUF632


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577285.1 Protein ALTERED PHOSPHATE STARVATION RESPONSE 1, partial [Cucurbita argyrosperma subsp. sororia]3.4e-10163.02Show/hide
Query:  SPESFFWKSRWEAIDSEVGVLILENYASDLDLMKWVVEIFGENGWYELQNREHEALRFVRNETVHQAGYRSRVVSVNVGHRARQDPSHSSYPSPCPSQTA
        S +SFFWKSRWE +DSE    I   YA     + +V           L+N      R+   E + ++   +    ++      + PSH SYPSPCPSQTA
Subjt:  SPESFFWKSRWEAIDSEVGVLILENYASDLDLMKWVVEIFGENGWYELQNREHEALRFVRNETVHQAGYRSRVVSVNVGHRARQDPSHSSYPSPCPSQTA

Query:  EASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQ
        +ASESPL   ESPI+PPI TISYMVAGGSTPLTVKV+P+SHS+VYEE   +   PPPPPP H+ GSSWDYFDTNDEI+SF FL TGGMDVN E+ERMWKQ
Subjt:  EASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQ

Query:  FKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQEDPSEFITHRAKDFLSS
        FKG M DA    QEG +KPE  QKACENG HL+SS  VEER SEMA R+DKELN  S S  +LLEQSGSRG +KLEKSLCTEQEDPSEFITHRAKDFLSS
Subjt:  FKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQEDPSEFITHRAKDFLSS

Query:  IKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        IK+I++RFQRAS+SGRE+SRMLE +KIRV YLE NG +
Subjt:  IKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

KAG6600640.1 Protein ALTERED PHOSPHATE STARVATION RESPONSE 1, partial [Cucurbita argyrosperma subsp. sororia]1.8e-9979.77Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF
        PSHSSYPSPCPSQTA+ SESPL   ESPISPPIATISYMVAG + PLTVKVRP+SHS+ Y EESVAS + P   PPPP HESGSSWDYFDT+DE ESFRF
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF

Query:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT
        +GT GGMDVNFEDERMWKQFKGEMADAK   QE TAKPE E    ENG H++ SG VEE+N EMA +EDKELN  S SS +LLEQSGSRG+I+LEKSLCT
Subjt:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT

Query:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        EQEDPSEFITHRAKDFLSSIK+IEHRFQRASESGREISRMLEA+KIRVGYLEANG +
Subjt:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

KAG7031277.1 hypothetical protein SDJN02_05317, partial [Cucurbita argyrosperma subsp. argyrosperma]6.4e-10079.77Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF
        PSHSSYPSPCPSQTA+ SESPL   ESPISPPIATISYMVAG + PLTVKVRP+SHS+ Y EES+AS + P   PPPP HESGSSWDYFDT+DEIESFRF
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF

Query:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT
        +GT GGMDVNFEDERMWKQFKGEMADAK   QE TAKPE E    ENG H++ SG VEE+N EMA +EDKELN  S SS +LLEQSGSRG+I+LEKSLCT
Subjt:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT

Query:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        EQEDPSEFITHRAKDFLSSIK+IEHRFQRASESGREISRMLEA+KIRVGYLEANG +
Subjt:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

XP_022943262.1 nitrate regulatory gene2 protein-like [Cucurbita moschata]1.4e-9979.77Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF
        PSHSSYPSPCPSQTA+ SESPL   ESPISPPIATISYMVAG + PLTVKV+P+SHS+ Y EESVAS + P   PPPP HESGSSWDYFDT+DEIESFRF
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF

Query:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT
        +GT GGMDVNFEDERMWKQFKGEMADAK   QE TAKPE E    ENG H++ SG VEE+N EMA +EDKELN  S SS +LLEQSGSRG+I+LEKSLCT
Subjt:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT

Query:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        EQEDPSEFITHRAKDFLSSIK+IEHRFQRASESGREISRMLEA+KIRVGYLEANG +
Subjt:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

XP_038878051.1 protein ALTERED PHOSPHATE STARVATION RESPONSE 1 [Benincasa hispida]2.8e-10380.24Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGT
        PSHSSYPSPCPSQTA+ASESPL   ESPISPPIATISYMVAGG TPLTVK+RP+SH+FVYEE  V+   PPPPPP HESG SWDYFDTNDEIESFRFLGT
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGT

Query:  GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQED
        GGMDVNFEDERMWKQFKGEM DAK    EGT+KPEA QKACENG HL+S+  VEERN EMA REDKE++    S+ ++LEQSGSRGA++LEK LCTEQED
Subjt:  GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQED

Query:  PSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        PSEFITHRAKDFLSSIK+I++RFQRASESGREISRMLEA+KIRVGYLE NG +
Subjt:  PSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

TrEMBL top hitse value%identityAlignment
A0A0A0KUA2 Uncharacterized protein5.1e-9576.17Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASH--YPPPPPPHHESGSSWDYFDTNDEIESFRFL
        PSHSSYPSPCPS TA+ASESPL   ESPISPPIATISYMVAGG TPLTVKVRP++HSFVYEE    S    PPPPPP HESG SWDYFDTNDEIESFRFL
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASH--YPPPPPPHHESGSSWDYFDTNDEIESFRFL

Query:  GTGGMDVNFEDERMWKQFKGEMAD--AKVLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTE
        GTGGMDV+FEDERMWKQFKGEM D       EGT+K EA QKA +NG +L+S   VEERN EM  REDKE+N  S S+ ++LEQS SRG ++LEK LCTE
Subjt:  GTGGMDVNFEDERMWKQFKGEMAD--AKVLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTE

Query:  QEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        QEDPSEFITHRAKDFLSSIK+I++RFQRASESGREISRMLEA+KIRVGYLE NG +
Subjt:  QEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

A0A6J1EU29 nitrate regulatory gene2 protein-like7.8e-9676.77Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPP-HHESGSSWDYFDTNDEIESFRFLG
        PSH SYPSPCPSQTA+ASESPL   ESPI+PPI TISYMVAGGSTPLTVKV+P+SHS+VY EESVAS  PPPPPP  H+ GSSWDYFDTNDEI+SF FL 
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPP-HHESGSSWDYFDTNDEIESFRFLG

Query:  TGGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQE
        TGGMDVN E+ERMWKQFKG M DA    QEG +KPE  QKACENG HL+SS  VEER SEMA R+DKELN  S S  +LLEQSGSRG +K+EKSLCTEQE
Subjt:  TGGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQE

Query:  DPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        DPSEFITHRAKDFLSSIK+I++RFQRAS+SGRE+SRMLE +KIRV YLE NG +
Subjt:  DPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

A0A6J1FSJ9 nitrate regulatory gene2 protein-like6.9e-10079.77Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF
        PSHSSYPSPCPSQTA+ SESPL   ESPISPPIATISYMVAG + PLTVKV+P+SHS+ Y EESVAS + P   PPPP HESGSSWDYFDT+DEIESFRF
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF

Query:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT
        +GT GGMDVNFEDERMWKQFKGEMADAK   QE TAKPE E    ENG H++ SG VEE+N EMA +EDKELN  S SS +LLEQSGSRG+I+LEKSLCT
Subjt:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT

Query:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        EQEDPSEFITHRAKDFLSSIK+IEHRFQRASESGREISRMLEA+KIRVGYLEANG +
Subjt:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

A0A6J1IQA5 nitrate regulatory gene2 protein-like4.4e-9978.99Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF
        PSHSSYPSPCPSQTA+ SESPL   ESPISPPIATISYMVAG + PLTVKVRP+SHS+ Y EESVAS + P   PPPP HESGSSWDYFDT+DEIESFRF
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPP---PPPPHHESGSSWDYFDTNDEIESFRF

Query:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT
        +GT GGMDVNFEDERMWKQFKGEMADAK   QE TAKPE +    ENG H++ SG VEE+N EMA +E KELN  S SS +LLEQ+GSRG+I+LEKSLCT
Subjt:  LGT-GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCT

Query:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        EQEDPSEFITHRAKDFLSSIK+IEHRFQRASESGREISRMLEA+KIRVGYLEANG +
Subjt:  EQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

A0A6J1J4U2 nitrate regulatory gene2 protein-like8.7e-9574.7Show/hide
Query:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGT
        PSH SYPSPCPSQTA+ASESPL   E PI+PPI TISYMVAGGS PLTVKV+P+SHS VYEE   +   PPPPPP H+ GSSWDYFDTNDEI+SF FLGT
Subjt:  PSHSSYPSPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGT

Query:  GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQED
        GGM+VN E+ERMWKQFKG M DA    QEG +KPE  QKACENG HL+SS  +EER SEMA R+DKELN  S S  +LLEQSGSRG +KLEKSLCTEQED
Subjt:  GGMDVNFEDERMWKQFKGEMADAK-VLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMS-SSSILLEQSGSRGAIKLEKSLCTEQED

Query:  PSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL
        PSEFITHRAKDFLSSIK+I++RFQRAS+SGRE+SRMLE +KIRV YLE NG +
Subjt:  PSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G39790.1 Protein of unknown function (DUF630 and DUF632)1.7e-2638.22Show/hide
Query:  QDPSH-SSYPSPCPSQTAEASESPL-HNGE-SPISPPIATISYM-VAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPP-PHHESGSSWDYFDTNDEIE
        + PSH SSYP        ++ +SPL HN   +P   P+  +SYM     S+ +T  + P S      E ++ +  PPPPP P     SSWDYFDT D+ +
Subjt:  QDPSH-SSYPSPCPSQTAEASESPL-HNGE-SPISPPIATISYM-VAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPP-PHHESGSSWDYFDTNDEIE

Query:  SFRFLGTGGMDVNFEDERMWKQFKGEMADAKVLQEGTAKPEAEQKACENGSHL--NSSGVVEERNSEMASREDKELNLMSSSSILLEQSGSRGAIKLEKS
        SFRF+G        + E           DA V+  G  K  ++    ++GS    +SS   ++R     S ED +                         
Subjt:  SFRFLGTGGMDVNFEDERMWKQFKGEMADAKVLQEGTAKPEAEQKACENGSHL--NSSGVVEERNSEMASREDKELNLMSSSSILLEQSGSRGAIKLEKS

Query:  LCTEQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGK
           E+EDPSEFITHRAKDF+SS+KDIEH+F RASESGRE+SRMLE +KIRVG+ +  GK
Subjt:  LCTEQEDPSEFITHRAKDFLSSIKDIEHRFQRASESGREISRMLEASKIRVGYLEANGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTTCTTCTCACTTTTAAGAGACATCCATCTCGATGCTCTGAAATGCTCGTCGCCGGAAAGTTTCTTCTGGAAATCGAGGTGGGAAGCCATTGATTCCGAAGTTGG
TGTTCTGATACTCGAGAATTATGCCTCCGACCTCGATTTAATGAAGTGGGTGGTTGAGATTTTTGGCGAAAATGGGTGGTACGAACTCCAAAATCGAGAACATGAAGCCC
TACGCTTTGTAAGGAACGAAACGGTTCATCAAGCAGGCTATCGATCGAGAGTCGTCTCTGTCAACGTCGGCCACCGAGCTCGACAAGACCCGTCGCATTCCTCTTATCCG
TCGCCGTGCCCGTCGCAGACCGCCGAAGCTTCGGAGTCTCCATTGCACAATGGAGAAAGCCCCATTTCGCCGCCGATAGCTACTATAAGTTACATGGTCGCCGGAGGTAG
TACCCCTCTGACTGTCAAGGTCCGACCGAATAGCCATAGTTTTGTCTATGAAGAAGAATCAGTTGCTTCCCATTACCCCCCGCCTCCACCGCCGCATCATGAGTCGGGAT
CTTCTTGGGATTACTTCGATACCAATGACGAAATCGAGAGCTTCAGGTTTCTGGGAACTGGTGGGATGGATGTGAACTTTGAAGATGAGAGAATGTGGAAACAATTTAAA
GGAGAAATGGCAGATGCCAAAGTTCTCCAGGAAGGAACTGCAAAACCAGAAGCAGAGCAAAAAGCTTGTGAAAATGGTAGCCATTTGAATTCCTCAGGGGTTGTTGAAGA
AAGAAATTCAGAGATGGCAAGCCGGGAAGACAAAGAACTTAATTTGATGAGTTCATCGAGCATATTGCTTGAACAATCCGGTTCGAGGGGGGCGATTAAGTTGGAGAAAA
GTTTATGTACTGAACAGGAAGATCCTTCAGAGTTTATTACTCATAGAGCAAAAGATTTTCTTTCCAGCATTAAGGACATAGAGCATCGTTTTCAGAGAGCTTCAGAATCT
GGGAGGGAGATCTCTAGAATGCTTGAGGCTAGTAAAATCAGAGTTGGATACCTTGAAGCAAATGGTAAATTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCTTCTTCTCACTTTTAAGAGACATCCATCTCGATGCTCTGAAATGCTCGTCGCCGGAAAGTTTCTTCTGGAAATCGAGGTGGGAAGCCATTGATTCCGAAGTTGG
TGTTCTGATACTCGAGAATTATGCCTCCGACCTCGATTTAATGAAGTGGGTGGTTGAGATTTTTGGCGAAAATGGGTGGTACGAACTCCAAAATCGAGAACATGAAGCCC
TACGCTTTGTAAGGAACGAAACGGTTCATCAAGCAGGCTATCGATCGAGAGTCGTCTCTGTCAACGTCGGCCACCGAGCTCGACAAGACCCGTCGCATTCCTCTTATCCG
TCGCCGTGCCCGTCGCAGACCGCCGAAGCTTCGGAGTCTCCATTGCACAATGGAGAAAGCCCCATTTCGCCGCCGATAGCTACTATAAGTTACATGGTCGCCGGAGGTAG
TACCCCTCTGACTGTCAAGGTCCGACCGAATAGCCATAGTTTTGTCTATGAAGAAGAATCAGTTGCTTCCCATTACCCCCCGCCTCCACCGCCGCATCATGAGTCGGGAT
CTTCTTGGGATTACTTCGATACCAATGACGAAATCGAGAGCTTCAGGTTTCTGGGAACTGGTGGGATGGATGTGAACTTTGAAGATGAGAGAATGTGGAAACAATTTAAA
GGAGAAATGGCAGATGCCAAAGTTCTCCAGGAAGGAACTGCAAAACCAGAAGCAGAGCAAAAAGCTTGTGAAAATGGTAGCCATTTGAATTCCTCAGGGGTTGTTGAAGA
AAGAAATTCAGAGATGGCAAGCCGGGAAGACAAAGAACTTAATTTGATGAGTTCATCGAGCATATTGCTTGAACAATCCGGTTCGAGGGGGGCGATTAAGTTGGAGAAAA
GTTTATGTACTGAACAGGAAGATCCTTCAGAGTTTATTACTCATAGAGCAAAAGATTTTCTTTCCAGCATTAAGGACATAGAGCATCGTTTTCAGAGAGCTTCAGAATCT
GGGAGGGAGATCTCTAGAATGCTTGAGGCTAGTAAAATCAGAGTTGGATACCTTGAAGCAAATGGTAAATTGTAG
Protein sequenceShow/hide protein sequence
MFFFSLLRDIHLDALKCSSPESFFWKSRWEAIDSEVGVLILENYASDLDLMKWVVEIFGENGWYELQNREHEALRFVRNETVHQAGYRSRVVSVNVGHRARQDPSHSSYP
SPCPSQTAEASESPLHNGESPISPPIATISYMVAGGSTPLTVKVRPNSHSFVYEEESVASHYPPPPPPHHESGSSWDYFDTNDEIESFRFLGTGGMDVNFEDERMWKQFK
GEMADAKVLQEGTAKPEAEQKACENGSHLNSSGVVEERNSEMASREDKELNLMSSSSILLEQSGSRGAIKLEKSLCTEQEDPSEFITHRAKDFLSSIKDIEHRFQRASES
GREISRMLEASKIRVGYLEANGKL