; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg022727 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg022727
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationscaffold2:8942699..8946812
RNA-Seq ExpressionSpg022727
SyntenySpg022727
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588080.1 hypothetical protein SDJN03_16645, partial [Cucurbita argyrosperma subsp. sororia]2.2e-11085.11Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE
        +S+PSTP+ELRF+RPPP DQ+LVHK++LEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIK+RC G  Q+CTCIVTV KE
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE

Query:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL
        QKHIKRTVIKSVVATLD+SLRHL HGETFPGER KS +CSINKEIPNKYAY++NLCVSKAARRQGVASNMLKFAVETA S+GIEQ+YVHVHRNN P + L
Subjt:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL

Query:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        Y+KIGFEVVE+AS QLLE+QTYLLC+NTRKLNNA+
Subjt:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

KAG7021966.1 hypothetical protein SDJN02_15694 [Cucurbita argyrosperma subsp. argyrosperma]2.2e-11085.11Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE
        +S+PSTP+ELRF+RPPP DQ+LVHK++LEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIK+RC G  Q+CTCIVTV KE
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE

Query:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL
        QKHIKRTVIKSVVATLD+SLRHL HGETFPGER KS +CSINKEIPNKYAY++NLCVSKAARRQGVASNMLKFAVETA S+GIEQ+YVHVHRNN P + L
Subjt:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL

Query:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        Y+KIGFEVVE+AS QLLE+QTYLLC+NTRKLNNA+
Subjt:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

XP_022147601.1 uncharacterized protein LOC111016486 [Momordica charantia]3.4e-11186.02Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK
        +S+PS   +L+F+RPPPADQDL+HKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIKKRC GQ  QTCTC VTVRK
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK

Query:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA
        EQ+HIKRTVIKSVVATLD+SLRHL HGETFPGER KS LCSINKEIPNKYAYIANLCV+KAARRQG+ASNMLKFAVETA S+GIEQ+YVHVHRNN P QA
Subjt:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA

Query:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        LY+KIGFEVVE AS QLLE+Q YLLC+NT+KLNNAH
Subjt:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

XP_022967924.1 uncharacterized protein LOC111467291 [Cucurbita maxima]1.3e-11085.53Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE
        +S+PSTP+ELRF+RPPP DQDLVHK+RLEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIK+RC G  Q+CTCIVTV KE
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE

Query:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL
        QKHIKRTVIKSVVATLD+SLRHL HGETFPGER KS +CSINKEIPNKYAY++NLCVSKAARRQGVASNMLKFAVETA S+GIEQ+YVHVHRNN P + L
Subjt:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL

Query:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        Y+KIGFEVVE+AS QLLE+QTYLLC+NTRKL+NA+
Subjt:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

XP_023531510.1 uncharacterized protein LOC111793724 [Cucurbita pepo subsp. pepo]2.6e-11185.96Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE
        +S+PSTP+ELRF+RPPP DQDLVHK+RLEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIK+RC G  Q+CTCIVTV KE
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE

Query:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL
        QKHIKRTVIKSVVATLD+SLRHL HGETFPGER KS +CSINKEIPNKYAY++NLCVSKAARRQGVASNMLKFAVETA S+GIEQ+YVHVHRNN P + L
Subjt:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL

Query:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        Y+KIGFEVVE+AS QLLE+QTYLLC+NTRKLNNA+
Subjt:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

TrEMBL top hitse value%identityAlignment
A0A5A7UL26 Putative Acyl-CoA N-acyltransferases (NAT) superfamily protein1.6e-10682.2Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK
        +S+P   + L+FDRPPP D+DLVH+RRLEFGQFVAREAV+DEELWTAAWLRAESHWENR N+RYVDSFKRKFAEQEFNAIKK+C GQ  QTCTCIVTVRK
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK

Query:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA
        EQKHIKRTVIKSVVATLD+ LRHL HGE+FPGER KS +CSINKEIPNKYAYI+NLCV KAARRQG+A NMLKFAV TA S GI+Q+YVHVHRNN P QA
Subjt:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA

Query:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        LYQKIGFEVVE+AS QL+E+QTYLLC+NT KLNNAH
Subjt:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

A0A5D3CIF7 Putative Acyl-CoA N-acyltransferases (NAT) superfamily protein1.6e-10682.2Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK
        +S+P   + L+FDRPPP D+DLVH+RRLEFGQFVAREAV+DEELWTAAWLRAESHWENR N+RYVDSFKRKFAEQEFNAIKK+C GQ  QTCTCIVTVRK
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK

Query:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA
        EQKHIKRTVIKSVVATLD+ LRHL HGE+FPGER KS +CSINKEIPNKYAYI+NLCV KAARRQG+A NMLKFAV TA S GI+Q+YVHVHRNN P QA
Subjt:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA

Query:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        LYQKIGFEVVE+AS QL+E+QTYLLC+NT KLNNAH
Subjt:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

A0A6J1D2V4 uncharacterized protein LOC1110164861.6e-11186.02Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK
        +S+PS   +L+F+RPPPADQDL+HKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIKKRC GQ  QTCTC VTVRK
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTVRK

Query:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA
        EQ+HIKRTVIKSVVATLD+SLRHL HGETFPGER KS LCSINKEIPNKYAYIANLCV+KAARRQG+ASNMLKFAVETA S+GIEQ+YVHVHRNN P QA
Subjt:  EQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQA

Query:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        LY+KIGFEVVE AS QLLE+Q YLLC+NT+KLNNAH
Subjt:  LYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

A0A6J1EZR4 uncharacterized protein LOC1114410201.1e-11085.11Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE
        +S+PSTP+ELRF+RPPP DQ+LVHK++LEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIK+RC G  Q+CTCIVTV KE
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE

Query:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL
        QKHIKRTVIKSVVATLD+SLRHL HGETFPGER KS +CSINKEIPNKYAY++NLCVSKAARRQGVASNMLKFAVETA S+GIEQ+YVHVHRNN P + L
Subjt:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL

Query:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        Y+KIGFEVVE+AS QLLE+QTYLLC+NTRKLNNA+
Subjt:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

A0A6J1HY42 uncharacterized protein LOC1114672916.2e-11185.53Show/hide
Query:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE
        +S+PSTP+ELRF+RPPP DQDLVHK+RLEFGQFVAREAVLDEELWTAAWLRAESHWENR NERYVDSFKRKFAEQEFNAIK+RC G  Q+CTCIVTV KE
Subjt:  ESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCG-QQTCTCIVTVRKE

Query:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL
        QKHIKRTVIKSVVATLD+SLRHL HGETFPGER KS +CSINKEIPNKYAY++NLCVSKAARRQGVASNMLKFAVETA S+GIEQ+YVHVHRNN P + L
Subjt:  QKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQAL

Query:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH
        Y+KIGFEVVE+AS QLLE+QTYLLC+NTRKL+NA+
Subjt:  YQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNNAH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G06025.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.9e-8164.91Show/hide
Query:  PDESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTV
        P +   S P  LRFDR  P + +  H+ R EFG+FVAREA+LDEE WTAAWLRAESHWE+R NERYVD++KRKFAEQEFNAIK+RC G   Q C+CIV V
Subjt:  PDESQPSTPSELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQ--QTCTCIVTV

Query:  RKEQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPL-CSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLP
        +KE+KHIKR+VIKSVV TLD+S+R+   GETFPGE+ KS L CSIN+E  N+Y YIANLCV+K+ARRQG+A NML+FAVE+A  +G+EQ+YVHVH+NN  
Subjt:  RKEQKHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPL-CSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLP

Query:  GQALYQKIGFEVVEMASPQLLEDQTYLL
         Q LYQK GF++VE    + L+D TYLL
Subjt:  GQALYQKIGFEVVEMASPQLLEDQTYLL

AT4G28030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.2e-1230.34Show/hide
Query:  GPDESQPSTPS-ELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQQ----TCTC
        G   S  S PS +LRF   P A    +    ++   FV  E+V ++ELW AA LR  +  E  P+   +   +R  AE+EF A+K+R  G++       C
Subjt:  GPDESQPSTPS-ELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQQ----TCTC

Query:  I-VTVRKEQ------------KHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAIS
        I  T+   Q            K       + VV +LD     L      P E   +    I  +     AY++N+CV+K   R GV   ++  +   A  
Subjt:  I-VTVRKEQ------------KHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAIS

Query:  NGIEQIYVHVHRNNLPGQALYQKIGFEVVEMASP
         GI  +YVHV  +N   ++LY K GFE  E A P
Subjt:  NGIEQIYVHVHRNNLPGQALYQKIGFEVVEMASP

AT4G28030.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein8.2e-0727.78Show/hide
Query:  GPDESQPSTPS-ELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQQ----TCTC
        G   S  S PS +LRF   P A    +    ++   FV  E+V ++ELW AA LR  +  E  P+   +   +R  AE+EF A+K+R  G++       C
Subjt:  GPDESQPSTPS-ELRFDRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQQ----TCTC

Query:  I-VTVRKEQ------------KHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAIS
        I  T+   Q            K       + VV +LD     L      P E   +    I  +     AY++N+CV+K   R GV   ++  +   A  
Subjt:  I-VTVRKEQ------------KHIKRTVIKSVVATLDVSLRHLKHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAIS

Query:  NGIEQIYVHVHRNNLPGQALYQKIGFEVVEMASP
                    +N   ++LY K GFE  E A P
Subjt:  NGIEQIYVHVHRNNLPGQALYQKIGFEVVEMASP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GGGATTGTAGTTAATTGCCCGTTCCGGCTCCGTCTTCCGTCCTGTTTGGAGGTTGAATTTGTTCGATGGAAGCTTCGGTTTCAATGTCGGCATTCTCGATCTGCAGGACG
GAATTTTTGGGATCCGTCCAAGACGGACGCCGGAATCACCTCAAATTTCATAGAAACGTCGCCTCGTGGACTATGTGAGTTTGAGTCTGTTCCTCTTCTTCGGTTCGATT
TTGGTATCGATTTTTCCTACTTTCTGATTGTTACAGAATCGTATAATTCGATTTCCGGCGGGAGCGGGCCGGATGAGTCGCAACCATCCACACCAAGCGAGTTACGGTTC
GACCGACCGCCGCCGGCGGATCAAGATTTAGTTCACAAAAGAAGATTAGAGTTCGGTCAATTCGTTGCGAGGGAGGCTGTGCTTGATGAAGAATTGTGGACAGCAGCATG
GCTTCGGGCTGAAAGTCATTGGGAGAATCGACCAAATGAACGGTATGTTGACAGCTTCAAAAGGAAATTTGCAGAACAGGAGTTCAATGCCATCAAAAAAAGATGCTGTG
GGCAACAGACTTGCACATGCATTGTCACAGTAAGGAAGGAGCAGAAGCATATAAAACGTACTGTGATTAAAAGTGTAGTAGCCACTCTGGATGTGAGCTTGCGGCATTTG
AAGCACGGCGAGACATTTCCTGGGGAAAGAGGGAAGAGTCCATTATGCAGCATCAACAAAGAGATACCAAATAAATATGCATACATTGCAAACCTATGTGTATCGAAAGC
AGCACGTCGTCAGGGTGTTGCTAGCAATATGTTGAAGTTTGCAGTTGAAACGGCAATATCCAATGGTATTGAACAGATATACGTGCATGTACATAGAAACAATTTGCCCG
GCCAAGCATTGTACCAAAAGATAGGCTTCGAGGTGGTCGAAATGGCAAGCCCACAGTTGTTGGAAGATCAAACTTACCTACTATGTATGAACACACGGAAGCTTAACAAT
GCACATTGA
mRNA sequenceShow/hide mRNA sequence
GGGATTGTAGTTAATTGCCCGTTCCGGCTCCGTCTTCCGTCCTGTTTGGAGGTTGAATTTGTTCGATGGAAGCTTCGGTTTCAATGTCGGCATTCTCGATCTGCAGGACG
GAATTTTTGGGATCCGTCCAAGACGGACGCCGGAATCACCTCAAATTTCATAGAAACGTCGCCTCGTGGACTATGTGAGTTTGAGTCTGTTCCTCTTCTTCGGTTCGATT
TTGGTATCGATTTTTCCTACTTTCTGATTGTTACAGAATCGTATAATTCGATTTCCGGCGGGAGCGGGCCGGATGAGTCGCAACCATCCACACCAAGCGAGTTACGGTTC
GACCGACCGCCGCCGGCGGATCAAGATTTAGTTCACAAAAGAAGATTAGAGTTCGGTCAATTCGTTGCGAGGGAGGCTGTGCTTGATGAAGAATTGTGGACAGCAGCATG
GCTTCGGGCTGAAAGTCATTGGGAGAATCGACCAAATGAACGGTATGTTGACAGCTTCAAAAGGAAATTTGCAGAACAGGAGTTCAATGCCATCAAAAAAAGATGCTGTG
GGCAACAGACTTGCACATGCATTGTCACAGTAAGGAAGGAGCAGAAGCATATAAAACGTACTGTGATTAAAAGTGTAGTAGCCACTCTGGATGTGAGCTTGCGGCATTTG
AAGCACGGCGAGACATTTCCTGGGGAAAGAGGGAAGAGTCCATTATGCAGCATCAACAAAGAGATACCAAATAAATATGCATACATTGCAAACCTATGTGTATCGAAAGC
AGCACGTCGTCAGGGTGTTGCTAGCAATATGTTGAAGTTTGCAGTTGAAACGGCAATATCCAATGGTATTGAACAGATATACGTGCATGTACATAGAAACAATTTGCCCG
GCCAAGCATTGTACCAAAAGATAGGCTTCGAGGTGGTCGAAATGGCAAGCCCACAGTTGTTGGAAGATCAAACTTACCTACTATGTATGAACACACGGAAGCTTAACAAT
GCACATTGA
Protein sequenceShow/hide protein sequence
GIVVNCPFRLRLPSCLEVEFVRWKLRFQCRHSRSAGRNFWDPSKTDAGITSNFIETSPRGLCEFESVPLLRFDFGIDFSYFLIVTESYNSISGGSGPDESQPSTPSELRF
DRPPPADQDLVHKRRLEFGQFVAREAVLDEELWTAAWLRAESHWENRPNERYVDSFKRKFAEQEFNAIKKRCCGQQTCTCIVTVRKEQKHIKRTVIKSVVATLDVSLRHL
KHGETFPGERGKSPLCSINKEIPNKYAYIANLCVSKAARRQGVASNMLKFAVETAISNGIEQIYVHVHRNNLPGQALYQKIGFEVVEMASPQLLEDQTYLLCMNTRKLNN
AH