CuGenDBv2

Sgr022181 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr022181
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Arginine and glutamate-rich protein 1
Genome location: tig00153941:504773..509722
RNA-Seq Expression: Sgr022181
Synteny: Sgr022181
Gene Ontology terms: NA
InterPro domains: IPR033371 - Arginine and glutamate-rich protein 1
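The genome location field in this record has the shape `contig:start..end`. As a minimal sketch, assuming that shape and assuming the coordinates are 1-based and inclusive (the record does not state the convention), the field could be split and the gene span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of CuGenDB.

```python
import re


def parse_location(location: str):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


if __name__ == "__main__":
    contig, start, end = parse_location("tig00153941:504773..509722")
    print(contig, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, only `span_length` would need to change.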


Homology
GenBank top hits    e value    % identity    Alignment
KAG6571723.1 Transmembrane emp24 domain-containing protein p24delta3, partial [Cucurbita argyrosperma subsp. sororia]    4.0e-78    59.44
Query:  VLFFHLWHPNRSDDGEGGAKEGASNLLLNPSEESFNGIRTSPLLLNPFQDFNLDCNSTNYDSWVFAALH------ANTFVDISVGLLPESSVRKLTPRIQ
        ++   LWHPNRSDD E G++E          EE +N   +  L+      F++   +++  + +F  +H         FVD  V LLPESSVR       
Subjt:  VLFFHLWHPNRSDDGEGGAKEGASNLLLNPSEESFNGIRTSPLLLNPFQDFNLDCNSTNYDSWVFAALH------ANTFVDISVGLLPESSVRKLTPRIQ

Query:  FLDVYRPLAVEIMLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPLSKK--
                           I IPGEVEET AVH I   AEEKAA FPQDVIE+VHEH D IEVVLQLQG+ RS+ +EVPHHL +IDLLV ALDPLSKK  
Subjt:  FLDVYRPLAVEIMLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPLSKK--

Query:  ----------RQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEEN
                  RQQE EAKLL EET  RVEEAIR +VEEGLNS ++KQEINR+LEEGR+ L EEVT QLEKEKEAAL+EARRKEEQARKE E+ E+M+EE+
Subjt:  ----------RQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEEN

Query:  RRRVEEAQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        RRRVEEAQR+EALERQKREEERYRELEELQRQKEEAIKR KQ+EEEQRL QMK+L
Subjt:  RRRVEEAQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

RYR35552.1 hypothetical protein Ahy_A10g050697 isoform B [Arachis hypogaea]    4.0e-46    59.04
Query:  LDVYR-----PLAVEI-MLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPL
        L+ YR     P+ V I  L+L  GI    E  E + + H  LIA EK A F Q+  E V + +  IEV LQ +      +  V  +L     L      L
Subjt:  LDVYR-----PLAVEI-MLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPL

Query:  SK-KRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE
        S+ +RQQEAE KL+ EETA RVEEAIR +VEE LNSEE++ EI RRLEEGRK LN EVT QLEKEKEAA+IEA+RKEEQARKE EELE++LEEN+R+ EE
Subjt:  SK-KRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE

Query:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        AQR+EALE+Q+REEERYRELEELQRQKEEA++R KQEEE++RLNQ+KLL
Subjt:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

XP_022157832.1 uncharacterized protein At1g10890 isoform X1 [Momordica charantia]    6.4e-44    83.67
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQEA+AKLL EETA RVEEAI  KVEE L+SEEIKQ+I+R+LEEGRK LNEEVT QLEKEKEAAL+EARRKEEQ+R+E EELE+M+EE+RRRVEE+Q
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALERQKREEERYRELEELQRQKEEAIKR KQEEEEQ+LNQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

XP_022157834.1 uncharacterized protein At1g10890 isoform X2 [Momordica charantia]    6.4e-44    83.67
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQEA+AKLL EETA RVEEAI  KVEE L+SEEIKQ+I+R+LEEGRK LNEEVT QLEKEKEAAL+EARRKEEQ+R+E EELE+M+EE+RRRVEE+Q
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALERQKREEERYRELEELQRQKEEAIKR KQEEEEQ+LNQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

XP_038888588.1 uncharacterized protein At1g10890 isoform X3 [Benincasa hispida]    8.4e-44    82.99
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQE EAKLL EETA RVEEAIR +VE+ LNS ++K EI+++LEEGRK LNEEVT QLEKEKEAAL+EARRKEEQARKE EE+E+M+EENRRRVEEAQ
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALERQKREEERYRELEELQRQKEEAIKR KQEEEEQR+NQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

TrEMBL top hits    e value    % identity    Alignment
A0A445BA41 Uncharacterized protein    2.0e-46    59.04
Query:  LDVYR-----PLAVEI-MLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPL
        L+ YR     P+ V I  L+L  GI    E  E + + H  LIA EK A F Q+  E V + +  IEV LQ +      +  V  +L     L      L
Subjt:  LDVYR-----PLAVEI-MLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPL

Query:  SK-KRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE
        S+ +RQQEAE KL+ EETA RVEEAIR +VEE LNSEE++ EI RRLEEGRK LN EVT QLEKEKEAA+IEA+RKEEQARKE EELE++LEEN+R+ EE
Subjt:  SK-KRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE

Query:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        AQR+EALE+Q+REEERYRELEELQRQKEEA++R KQEEE++RLNQ+KLL
Subjt:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

A0A5D2GXN6 Uncharacterized protein    3.4e-43    82.31
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQEAE KL+ EETA RVEEAI  KVEE LNSEEIKQEI +RLEEGR+ LN+EV +QLEKEKEAAL+EAR+KEEQARKE EELEKMLEENR+RVEEAQ
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALE+Q+REEERYRELEELQRQKEEA+KR KQ+EEE+RLNQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

A0A6J1AFR7 uncharacterized protein At1g10890    1.5e-43    82.31
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQEAE KL+ EET  RVEEAI+ KVEE LNSEE+KQEI RRLEEGR+ LN+EV  QLEKEKEAAL+EARRKEEQARKE EELEKMLEENR+RVEEAQ
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALE+Q+REEERYRELEELQRQKEEA+KR KQ+EEE+RLNQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

A0A6J1DUF6 uncharacterized protein At1g10890 isoform X1    3.1e-44    83.67
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQEA+AKLL EETA RVEEAI  KVEE L+SEEIKQ+I+R+LEEGRK LNEEVT QLEKEKEAAL+EARRKEEQ+R+E EELE+M+EE+RRRVEE+Q
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALERQKREEERYRELEELQRQKEEAIKR KQEEEEQ+LNQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

A0A6J1DXP6 uncharacterized protein At1g10890 isoform X2    3.1e-44    83.67
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQQEA+AKLL EETA RVEEAI  KVEE L+SEEIKQ+I+R+LEEGRK LNEEVT QLEKEKEAAL+EARRKEEQ+R+E EELE+M+EE+RRRVEE+Q
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        R+EALERQKREEERYRELEELQRQKEEAIKR KQEEEEQ+LNQMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

SwissProt top hits    e value    % identity    Alignment
P0CB26 Uncharacterized protein At1g10890    5.1e-36    71.43
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ
        K+RQ+EAE KL+ EET  RVEEAIR KVEE L SE+IK EI   LEEGRK LNEEV  QLE+EKEA+LIEA+ KEE+ ++E EE E++ EEN +RVEEAQ
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQ

Query:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
        RKEA+ERQ++EEERYRELEELQRQKEEA++R K EEEE+RL QMKLL
Subjt:  RKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

Q2TA42 Arginine and glutamate-rich protein 1    2.7e-08    41.26
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGL--NSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE
        K RQQE E KL+ EETA RVEE +  +VEE L    +EI++E+ RR+EE ++ + +++  +LE++++A L   + +EE+ R + EELE++LEEN R++ E
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGL--NSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE

Query:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRL
        AQ K A E+ +  EE+ +  EE  + ++E   R +Q++EEQ++
Subjt:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRL

Q5ZL35 Arginine and glutamate-rich protein 1    1.2e-08    41.96
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGL--NSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE
        K RQQE E KL+ EETA RVEE +  +VEE L    +EI++E+ RR+EE ++ + +++  +LE++++A L   + +EE+ R + EELE++LEEN R++ E
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGL--NSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE

Query:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRL
        AQ K A E+ K  EE+ +  EE  + ++E   R +Q++EEQ++
Subjt:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRL

Q9NWB6 Arginine and glutamate-rich protein 1    2.7e-08    41.26
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGL--NSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE
        K RQQE E KL+ EETA RVEE +  +VEE L    +EI++E+ RR+EE ++ + +++  +LE++++A L   + +EE+ R + EELE++LEEN R++ E
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGL--NSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE

Query:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRL
        AQ K A E+ +  EE+ +  EE  + ++E   R +Q++EEQ++
Subjt:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRL

Q9VL63 UPF0430 protein CG31712    1.2e-08    41.73
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNS--EEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE
        ++R +E E K + EE A R+E  ++ +VEE L    +EI+QE+NRR+E  +  +  E+ ++LE+ +E    E RR+EE  +++ EELE++L EN R++EE
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNS--EEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEE

Query:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEE
        AQRK A ER    EE+    EE QR ++E  KR K+E++
Subjt:  AQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEE

Arabidopsis top hits    e value    % identity    Alignment
AT1G10890.1 unknown protein    5.8e-35    67.74
Query:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKE--------EQARKENEELEKMLEEN
        K+RQ+EAE KL+ EET  RVEEAIR KVEE L SE+IK EI   LEEGRK LNEEV  QLE+EKEA+LIEA+ KE        E+ ++E EE E++ EEN
Subjt:  KKRQQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKE--------EQARKENEELEKMLEEN

Query:  RRRVEEAQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL
         +RVEEAQRKEA+ERQ++EEERYRELEELQRQKEEA++R K EEEE+RL QMKLL
Subjt:  RRRVEEAQRKEALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKLL

AT5G13340.1 unknown protein    4.3e-30    65.03
Query:  QQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQRKE
        Q EAE K L EETA R+EEA+R  VEE + +EE+K+EI RR +E  + +  +V +QL+KEKEAAL EARRKEEQAR+E EEL+KMLEEN RRVEE+QR+E
Subjt:  QQEAEAKLL-EETANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQRKE

Query:  ALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKL
        A+E Q++EEERYRELE LQRQKEEA +R K EEEE+  N  KL
Subjt:  ALERQKREEERYRELEELQRQKEEAIKRTKQEEEEQRLNQMKL


Sequences
CDS sequence
GGCGAAAAGGAATTAACGTATGCTACCAGACAACGGGTTGAAGTAGAAGTTTTGTTCTTCCACCTTTGGCATCCCAATCGGAGCGACGACGGAGAGGGAGGAGCGAAAGA
AGGCGCATCAAACCTTCTCCTGAATCCATCGGAGGAGAGCTTCAACGGCATACGCACGTCGCCACTTCTCCTCAATCCATTTCAAGACTTCAATCTGGATTGCAACTCCA
CTAACTACGATTCTTGGGTTTTTGCTGCTTTACATGCCAACACCTTTGTGGATATTTCCGTCGGTCTGCTTCCTGAATCAAGTGTCCGGAAGTTGACTCCTCGAATCCAA
TTTCTTGATGTTTATCGACCTCTTGCTGTAGAAATTATGTTGATGCTAACTAAAGGCATAGGTATACCAGGAGAAGTCGAAGAGACAGAAGCCGTTCACCATATTCTTCT
TATAGCAGAAGAAAAAGCCGCTCAATTTCCCCAAGACGTAATAGAAGTCGTTCACGAACACCAAGACGACATAGAAGTCGTTCTCCAACTTCAAGGAATTACAAGAAGCA
GAGACGAAGAAGTTCCTCATCATCTCCGCGTCATAGATCTTCTAGTTCAAGCCTTGGATCCCTTGAGCAAAAAGCGACAACAGGAGGCAGAGGCGAAGTTGCTTGAAGAA
ACAGCGAATAGAGTGGAGGAAGCAATTCGTATGAAAGTTGAAGAGGGCCTGAACTCTGAGGAGATAAAGCAAGAAATAAACCGGAGGTTGGAGGAGGGACGAAAATGGCT
TAACGAGGAAGTGACAGTTCAACTTGAGAAGGAAAAGGAAGCTGCCCTTATTGAGGCCAGACGAAAGGAGGAACAAGCTCGAAAAGAGAACGAAGAGCTGGAAAAAATGC
TTGAGGAGAACCGGAGGAGAGTAGAAGAAGCTCAGAGAAAGGAAGCTTTAGAGAGACAAAAGAGAGAGGAGGAAAGATATAGGGAACTAGAAGAGCTACAAAGGCAAAAA
GAAGAGGCTATTAAGAGGACAAAACAGGAAGAGGAGGAACAACGACTTAACCAGATGAAACTGTTGG
mRNA sequence
GGCGAAAAGGAATTAACGTATGCTACCAGACAACGGGTTGAAGTAGAAGTTTTGTTCTTCCACCTTTGGCATCCCAATCGGAGCGACGACGGAGAGGGAGGAGCGAAAGA
AGGCGCATCAAACCTTCTCCTGAATCCATCGGAGGAGAGCTTCAACGGCATACGCACGTCGCCACTTCTCCTCAATCCATTTCAAGACTTCAATCTGGATTGCAACTCCA
CTAACTACGATTCTTGGGTTTTTGCTGCTTTACATGCCAACACCTTTGTGGATATTTCCGTCGGTCTGCTTCCTGAATCAAGTGTCCGGAAGTTGACTCCTCGAATCCAA
TTTCTTGATGTTTATCGACCTCTTGCTGTAGAAATTATGTTGATGCTAACTAAAGGCATAGGTATACCAGGAGAAGTCGAAGAGACAGAAGCCGTTCACCATATTCTTCT
TATAGCAGAAGAAAAAGCCGCTCAATTTCCCCAAGACGTAATAGAAGTCGTTCACGAACACCAAGACGACATAGAAGTCGTTCTCCAACTTCAAGGAATTACAAGAAGCA
GAGACGAAGAAGTTCCTCATCATCTCCGCGTCATAGATCTTCTAGTTCAAGCCTTGGATCCCTTGAGCAAAAAGCGACAACAGGAGGCAGAGGCGAAGTTGCTTGAAGAA
ACAGCGAATAGAGTGGAGGAAGCAATTCGTATGAAAGTTGAAGAGGGCCTGAACTCTGAGGAGATAAAGCAAGAAATAAACCGGAGGTTGGAGGAGGGACGAAAATGGCT
TAACGAGGAAGTGACAGTTCAACTTGAGAAGGAAAAGGAAGCTGCCCTTATTGAGGCCAGACGAAAGGAGGAACAAGCTCGAAAAGAGAACGAAGAGCTGGAAAAAATGC
TTGAGGAGAACCGGAGGAGAGTAGAAGAAGCTCAGAGAAAGGAAGCTTTAGAGAGACAAAAGAGAGAGGAGGAAAGATATAGGGAACTAGAAGAGCTACAAAGGCAAAAA
GAAGAGGCTATTAAGAGGACAAAACAGGAAGAGGAGGAACAACGACTTAACCAGATGAAACTGTTGG
Protein sequence
GEKELTYATRQRVEVEVLFFHLWHPNRSDDGEGGAKEGASNLLLNPSEESFNGIRTSPLLLNPFQDFNLDCNSTNYDSWVFAALHANTFVDISVGLLPESSVRKLTPRIQ
FLDVYRPLAVEIMLMLTKGIGIPGEVEETEAVHHILLIAEEKAAQFPQDVIEVVHEHQDDIEVVLQLQGITRSRDEEVPHHLRVIDLLVQALDPLSKKRQQEAEAKLLEE
TANRVEEAIRMKVEEGLNSEEIKQEINRRLEEGRKWLNEEVTVQLEKEKEAALIEARRKEEQARKENEELEKMLEENRRRVEEAQRKEALERQKREEERYRELEELQRQK
EEAIKRTKQEEEEQRLNQMKLLX