CuGenDBv2

Sgr019889 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019889
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Ammonium transporter 1 member 2
Genome location: tig00153424:838840..849241
RNA-Seq Expression: Sgr019889
Synteny: Sgr019889
Gene Ontology terms:
GO:0072488 - ammonium transmembrane transport (biological process)
GO:0000178 - exosome (RNase complex) (cellular component)
GO:0005737 - cytoplasm (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0008519 - ammonium transmembrane transporter activity (molecular function)
InterPro domains: IPR029472 - Retrotransposon Copia-like, N-terminal
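
The genome location field uses a `contig:start..end` notation. The following is a minimal sketch of how it could be parsed, assuming the coordinates are 1-based and inclusive (the page does not state the convention). The helper names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not confirmed by the source).
    return end - start + 1


contig, start, end = parse_location("tig00153424:838840..849241")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 10402 bp on contig tig00153424.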


Homology
GenBank top hits (e value, % identity, alignment)
XP_022988170.1 uncharacterized protein LOC111485488 isoform X1 [Cucurbita maxima] | e value: 2.4e-117 | % identity: 80.2
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPLFS Y    SHPI F+PS A+K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAKLDSTNYVLWKFQISLILKAHKLFSY
        K  EQPKNETRM KFG+LQVELT DKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK D      W    S I+  H  F +
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAKLDSTNYVLWKFQISLILKAHKLFSY

XP_022988171.1 uncharacterized protein LOC111485488 isoform X2 [Cucurbita maxima] | e value: 4.1e-117 | % identity: 85.45
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPLFS Y    SHPI F+PS A+K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPKNETRM KFG+LQVELT DKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

XP_023516677.1 uncharacterized protein LOC111780489 [Cucurbita pepo subsp. pepo] | e value: 5.4e-117 | % identity: 85.45
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPLFS Y    SHPI F+PS A+K    KPLTL FALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPKNETRM KFG+LQVELTADKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

XP_038879157.1 uncharacterized protein LOC120071143 isoform X1 [Benincasa hispida] | e value: 3.4e-119 | % identity: 79.4
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+P+T LS HS PLFS   H+  +PISFRP  AKK  P KPLTLSFALAESDSPKSLE DPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SH LVSKFPTLVQSLT+NYKSGFGKRLISAGRRFQSMGQYGQGELQKIAK+M TTGKLLSA+S  
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK--------LDSTNYVLWKF-QISLILKAHKLF
        K  EQPKNETRM KFG+LQVELTADKANIGAAI FVFGVISW+LGQG+QSIPESSLQYANDNALLLAK        +  ++ VL  F  + LIL A +L 
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK--------LDSTNYVLWKF-QISLILKAHKLF

Query:  S
        S
Subjt:  S

XP_038879158.1 uncharacterized protein LOC120071143 isoform X2 [Benincasa hispida] | e value: 2.6e-119 | % identity: 85.82
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+P+T LS HS PLFS   H+  +PISFRP  AKK  P KPLTLSFALAESDSPKSLE DPQ+LLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SH LVSKFPTLVQSLT+NYKSGFGKRLISAGRRFQSMGQYGQGELQKIAK+M TTGKLLSA+S  
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPKNETRM KFG+LQVELTADKANIGAAI FVFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

TrEMBL top hits (e value, % identity, alignment)
A0A6J1H959 uncharacterized protein LOC111461699 isoform X2 | e value: 2.4e-115 | % identity: 84.7
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPL    S   SHPI F+PS A K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPK+ETRM KFG+LQVELTADKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

A0A6J1H9C7 uncharacterized protein LOC111461699 isoform X1 | e value: 2.4e-115 | % identity: 84.7
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPL    S   SHPI F+PS A K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPK+ETRM KFG+LQVELTADKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

A0A6J1HAV0 uncharacterized protein LOC111461699 isoform X3 | e value: 2.4e-115 | % identity: 84.7
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPL    S   SHPI F+PS A K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPK+ETRM KFG+LQVELTADKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

A0A6J1JGH5 uncharacterized protein LOC111485488 isoform X2 | e value: 2.0e-117 | % identity: 85.45
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPLFS Y    SHPI F+PS A+K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
        K  EQPKNETRM KFG+LQVELT DKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK

A0A6J1JIU7 uncharacterized protein LOC111485488 isoform X1 | e value: 1.2e-117 | % identity: 80.2
Query:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
        MASLH LL+PLT LS HSAPLFS Y    SHPI F+PS A+K    KPLTLSFALAESDS KSLE DPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA
Subjt:  MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDA

Query:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP
        AFDLSNGPV+DECGQ+MGEILLNL+RAWEVADTS+SHTLVSK P+LVQSLTENYKSG GKRLISAGRRFQSMGQYGQGELQKIAK M TTGKLLSA+S P
Subjt:  AFDLSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVP

Query:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAKLDSTNYVLWKFQISLILKAHKLFSY
        K  EQPKNETRM KFG+LQVELT DKANIGAAI  VFGVISW+LGQG+QSIPESSLQYANDNALLLAK D      W    S I+  H  F +
Subjt:  KADEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAKLDSTNYVLWKFQISLILKAHKLFSY

SwissProt top hits (e value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 2.2e-04 | % identity: 26.09
Query:  NDNALLLAKLDSTNYVLWKFQISLILKAHKLFSYINGSIVTPNLILQFGDAPQLNPTFEEWYAKDQALIMLTNATLSPPACAYVVGCATSQQ
        N N   + KL STNY++W  Q+  +   ++L  +++GS   P   +    AP++NP +  W  +D+ +       +S      V    T+ Q
Subjt:  NDNALLLAKLDSTNYVLWKFQISLILKAHKLFSYINGSIVTPNLILQFGDAPQLNPTFEEWYAKDQALIMLTNATLSPPACAYVVGCATSQQ

Arabidopsis top hits (e value, % identity, alignment)
AT5G37360.1 unknown protein | e value: 4.4e-77 | % identity: 60.53
Query:  LLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESD---PQVLLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD
        LL+PL SLS  S   FS  S   S   S +P+ +K+ +  K LTL FAL ESDS K LE +    + LL +L+  FDL  DYF++LP DLRLDLNDAAFD
Subjt:  LLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESD---PQVLLQELADSFDLSRDYFEKLPRDLRLDLNDAAFD

Query:  LSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSA-TSVPKA
        LSNGPV+DECGQ++GE LLNL+RAWE ADTS+S +LV K P L   LT+  +S FGKRLISAG+RFQ MGQY +GELQKIAK M TTG +LSA TS    
Subjt:  LSNGPVMDECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSA-TSVPKA

Query:  DEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK
          + K+ TRM KFG+LQV +T +KA  GAAIAF++G++SW++ QGIQSIPE+SLQYANDNALL+ K
Subjt:  DEQPKNETRMLKFGDLQVELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAK


Sequences
CDS sequence
ATGGCTTCTCTTCATCTTCTTCTTCGACCTCTCACCTCTCTCTCTCCCCATTCTGCCCCTCTCTTTTCCCACTATTCTCATGCCATCTCTCACCCAATCAGTTTCAGGCC
CTCCATTGCTAAGAAGTCCCACCCATTCAAACCCCTCACTCTTTCATTTGCTCTCGCCGAATCGGACTCTCCCAAATCCTTGGAATCCGACCCTCAAGTTCTCCTTCAAG
AACTAGCCGACAGTTTTGATCTCTCACGAGATTACTTTGAAAAACTTCCTCGTGATCTTCGTCTTGATCTCAACGATGCTGCTTTTGATCTTTCGAATGGACCCGTCATG
GATGAGTGTGGTCAAGACATGGGAGAAATATTGCTAAATCTCGCTCGGGCATGGGAAGTAGCTGACACCTCTTCTTCACATACCTTAGTAAGCAAGTTCCCCACGTTGGT
GCAATCTTTGACAGAGAATTACAAGTCAGGATTTGGCAAGCGTTTAATATCTGCCGGAAGACGGTTCCAGTCGATGGGACAGTATGGTCAGGGTGAATTACAGAAGATTG
CCAAAGTAATGACTACAACTGGAAAGCTTCTGTCTGCAACCTCTGTTCCTAAAGCAGATGAGCAGCCTAAGAATGAAACCAGAATGCTAAAGTTTGGAGACCTTCAAGTT
GAACTGACCGCTGATAAGGCGAACATCGGTGCAGCAATAGCTTTCGTTTTTGGAGTAATTTCATGGGAACTGGGTCAGGGCATCCAGAGCATTCCTGAGAGTTCTCTGCA
GTATGCAAATGACAATGCTTTACTTCTTGCAAAGTTAGATTCCACCAATTATGTTCTCTGGAAGTTTCAGATCTCTTTGATCTTGAAGGCACACAAGCTCTTTAGCTATA
TTAATGGTTCCATTGTGACCCCTAATCTAATTCTTCAATTTGGTGATGCTCCTCAGCTTAATCCAACATTTGAGGAGTGGTATGCTAAGGATCAAGCTCTTATTATGTTG
ACTAATGCGACTCTATCTCCTCCAGCTTGTGCATATGTTGTTGGATGTGCTACCTCACAGCAAGAGAATCTTGTCATTTATACTATTAATGGCCTTCTGTCGACCTTCAA
TGTCTTCAAGACTACCTTACGAACACGTTCTCAGGCGCTATCCTCTGAGGATATTCATGTTCTTATGAATTTTGAGGAGAGTGCTCTTGGGAAGCAATCCAAGGCTAATG
ATCCGAATTTAATGAATCCTACACTTACGATGCTTGCTAATTTCAATAATAGAGGACGCGACAGTGGTAAAGGTAGAAATGGTGGTGGCTGA
mRNA sequence
ATGGCTTCTCTTCATCTTCTTCTTCGACCTCTCACCTCTCTCTCTCCCCATTCTGCCCCTCTCTTTTCCCACTATTCTCATGCCATCTCTCACCCAATCAGTTTCAGGCC
CTCCATTGCTAAGAAGTCCCACCCATTCAAACCCCTCACTCTTTCATTTGCTCTCGCCGAATCGGACTCTCCCAAATCCTTGGAATCCGACCCTCAAGTTCTCCTTCAAG
AACTAGCCGACAGTTTTGATCTCTCACGAGATTACTTTGAAAAACTTCCTCGTGATCTTCGTCTTGATCTCAACGATGCTGCTTTTGATCTTTCGAATGGACCCGTCATG
GATGAGTGTGGTCAAGACATGGGAGAAATATTGCTAAATCTCGCTCGGGCATGGGAAGTAGCTGACACCTCTTCTTCACATACCTTAGTAAGCAAGTTCCCCACGTTGGT
GCAATCTTTGACAGAGAATTACAAGTCAGGATTTGGCAAGCGTTTAATATCTGCCGGAAGACGGTTCCAGTCGATGGGACAGTATGGTCAGGGTGAATTACAGAAGATTG
CCAAAGTAATGACTACAACTGGAAAGCTTCTGTCTGCAACCTCTGTTCCTAAAGCAGATGAGCAGCCTAAGAATGAAACCAGAATGCTAAAGTTTGGAGACCTTCAAGTT
GAACTGACCGCTGATAAGGCGAACATCGGTGCAGCAATAGCTTTCGTTTTTGGAGTAATTTCATGGGAACTGGGTCAGGGCATCCAGAGCATTCCTGAGAGTTCTCTGCA
GTATGCAAATGACAATGCTTTACTTCTTGCAAAGTTAGATTCCACCAATTATGTTCTCTGGAAGTTTCAGATCTCTTTGATCTTGAAGGCACACAAGCTCTTTAGCTATA
TTAATGGTTCCATTGTGACCCCTAATCTAATTCTTCAATTTGGTGATGCTCCTCAGCTTAATCCAACATTTGAGGAGTGGTATGCTAAGGATCAAGCTCTTATTATGTTG
ACTAATGCGACTCTATCTCCTCCAGCTTGTGCATATGTTGTTGGATGTGCTACCTCACAGCAAGAGAATCTTGTCATTTATACTATTAATGGCCTTCTGTCGACCTTCAA
TGTCTTCAAGACTACCTTACGAACACGTTCTCAGGCGCTATCCTCTGAGGATATTCATGTTCTTATGAATTTTGAGGAGAGTGCTCTTGGGAAGCAATCCAAGGCTAATG
ATCCGAATTTAATGAATCCTACACTTACGATGCTTGCTAATTTCAATAATAGAGGACGCGACAGTGGTAAAGGTAGAAATGGTGGTGGCTGA
Protein sequence
MASLHLLLRPLTSLSPHSAPLFSHYSHAISHPISFRPSIAKKSHPFKPLTLSFALAESDSPKSLESDPQVLLQELADSFDLSRDYFEKLPRDLRLDLNDAAFDLSNGPVM
DECGQDMGEILLNLARAWEVADTSSSHTLVSKFPTLVQSLTENYKSGFGKRLISAGRRFQSMGQYGQGELQKIAKVMTTTGKLLSATSVPKADEQPKNETRMLKFGDLQV
ELTADKANIGAAIAFVFGVISWELGQGIQSIPESSLQYANDNALLLAKLDSTNYVLWKFQISLILKAHKLFSYINGSIVTPNLILQFGDAPQLNPTFEEWYAKDQALIML
TNATLSPPACAYVVGCATSQQENLVIYTINGLLSTFNVFKTTLRTRSQALSSEDIHVLMNFEESALGKQSKANDPNLMNPTLTMLANFNNRGRDSGKGRNGGG
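
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Raises KeyError for codons with ambiguous bases such as N.
    """
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 12 codons of the CDS listed above.
print(translate("ATGGCTTCTCTTCATCTTCTTCTTCGACCTCTCACC"))
```

The 12 codons used here translate to MASLHLLLRPLT, the first 12 residues of the protein sequence. Passing the full CDS, with its line breaks removed, would allow the same comparison over the entire record.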