CuGenDBv2

MC04g1493 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC04g1493
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: N-acetyltransferase domain-containing protein
Genome location: MC04:22791165..22792737
RNA-Seq Expression: MC04g1493
Synteny: MC04g1493
Gene Ontology terms:
GO:0006474 - N-terminal protein amino acid acetylation (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domains:
IPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase
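The genome location above is written as seqid:start..end. A minimal sketch of parsing that string and computing the span it covers, assuming the coordinates are 1-based and inclusive (the page does not state this). `Location` and `parse_location` are hypothetical names introduced for illustration, not part of any CuGenDB tooling:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints count.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match.group(1), start, end)


loc = parse_location("MC04:22791165..22792737")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the span is 1573 bases, which is longer than the CDS listed further down the page.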


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7016290.1 hypothetical protein SDJN02_21396, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 1.54e-161; identity 88.8%
Query:  RNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRIC
        RNF  N+S FAHGVSLL  PK   GAGVCKASQVFDLFPTV+PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAML  LSVPNGCRR+C
Subjt:  RNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRIC

Query:  LVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIA
        LVAVIGGS ND FLIGP+DFKIGGFDGKVSLNKGYVAGILT+DTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE EARNWGCRAIA
Subjt:  LVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIA

Query:  LHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        LHCDT+NPGATKLYRGQG+K IKVPEGANWP+PKTSPDI YSFMMKLLKN
Subjt:  LHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

XP_004149920.2 uncharacterized protein LOC101207861 [Cucumis sativus]; e-value 7.90e-164; identity 89.29%
Query:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
        SS NF  ++S  AHGVSLL  PK  SGA VCKASQVFDLFPT++PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAMLSGLSVPNGCRR
Subjt:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR

Query:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
        ICLVAVIGGS ND FLIGP+DFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE EARNWGCRA
Subjt:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA

Query:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        IALHCDTNNPGATKLY+GQG+K IKVPEGANWPQPKTSPDI YSFMMKLLKN
Subjt:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

XP_008456480.1 PREDICTED: uncharacterized protein LOC103496423 [Cucumis melo]; e-value 1.42e-164; identity 89.68%
Query:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
        SS NF  ++S  AHGVSLL  PK  SGA VCKASQVFDLFPT++PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAMLSGLSVPNGCRR
Subjt:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR

Query:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
        ICLVAVIGGS ND FLIGP+DFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE EARNWGCRA
Subjt:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA

Query:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        IALHCDTNNPGATKLYRGQG+K IKVPEGANWPQPKTSPDI YSFMMKLLKN
Subjt:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

XP_022133755.1 uncharacterized protein LOC111006255 [Momordica charantia]; e-value 2.85e-189; identity 100%
Query:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
        SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
Subjt:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR

Query:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
        ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
Subjt:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA

Query:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA
        IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA
Subjt:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA

XP_038886660.1 uncharacterized protein LOC120076811 [Benincasa hispida]; e-value 2.56e-161; identity 91.36%
Query:  SCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGG
        S  AHGVSLL  PK  SGA VCKASQVFDLFPTV+PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAMLSGLSVPNGC+RICLVAVIGG
Subjt:  SCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGG

Query:  SVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNN
        S ND FLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLIVKAETEA+NWGCRAIALHCD NN
Subjt:  SVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNN

Query:  PGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        PGATKLYRGQG+K IKVPEGANWPQPKTSPDI YSFMMK+LKN
Subjt:  PGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KG42 N-acetyltransferase domain-containing protein; e-value 3.82e-164; identity 89.29%
Query:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
        SS NF  ++S  AHGVSLL  PK  SGA VCKASQVFDLFPT++PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAMLSGLSVPNGCRR
Subjt:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR

Query:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
        ICLVAVIGGS ND FLIGP+DFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE EARNWGCRA
Subjt:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA

Query:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        IALHCDTNNPGATKLY+GQG+K IKVPEGANWPQPKTSPDI YSFMMKLLKN
Subjt:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

A0A1S3C3C1 uncharacterized protein LOC103496423; e-value 6.86e-165; identity 89.68%
Query:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
        SS NF  ++S  AHGVSLL  PK  SGA VCKASQVFDLFPT++PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAMLSGLSVPNGCRR
Subjt:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR

Query:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
        ICLVAVIGGS ND FLIGP+DFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE EARNWGCRA
Subjt:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA

Query:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        IALHCDTNNPGATKLYRGQG+K IKVPEGANWPQPKTSPDI YSFMMKLLKN
Subjt:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

A0A6J1BW50 uncharacterized protein LOC111006255; e-value 1.38e-189; identity 100%
Query:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
        SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR
Subjt:  SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRR

Query:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
        ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA
Subjt:  ICLVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRA

Query:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA
        IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA
Subjt:  IALHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA

A0A6J1FJP1 uncharacterized protein LOC111444858; e-value 6.98e-160; identity 87.6%
Query:  RNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRIC
        R+F  N+S FAHGVSLL  PK   GA VCKASQVFDLFPTV+PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAML  LSVPNGCRR+C
Subjt:  RNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRIC

Query:  LVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIA
        LVAVIGGS ND FLIGP+DFKIGGFDGKVSLNKGYVAGILT+DTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE EARNW CRAIA
Subjt:  LVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIA

Query:  LHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        LHCDT+NPGATKLYRGQG+K IKVPEGANWP+PKTSPDI YSFMMKLLKN
Subjt:  LHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

A0A6J1JY34 uncharacterized protein LOC111488925; e-value 1.25e-160; identity 87.6%
Query:  RNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRIC
        RNF  N+S FAHGVSLL  PK   GA VCKASQVFDLFPTV+PE+TVREARIEDCWEVAETHCSSFFP YSFPLDFVLRVDRLVAML  LSVPNGCRR+C
Subjt:  RNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRIC

Query:  LVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIA
        LVAVIGGS +D FLIGP+DFKIGGFDGKVSLNKGYV GILT+DTVADFLPRKGP+RQRRTGIAY+SNVAVRERFRRKGIAKKLI+KAE+EARNWGCRAIA
Subjt:  LVAVIGGSVNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIA

Query:  LHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN
        LHCDT+NPGATKLYRGQG+K IKVPEGANWP+PKTSPDI YSFMMKLLKN
Subjt:  LHCDTNNPGATKLYRGQGYKCIKVPEGANWPQPKTSPDINYSFMMKLLKN

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT2G39000.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein; e-value 7.3e-104; identity 76.11%
Query:  AGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGSVNDEFLIGPEDFKIGGF
        +G C ASQ+ DLFP VSPE+ VREAR+EDCWEVAETHCSSFFPGYSFPLD VLRVDRL+AM+ G S+P GC+R CLVAVIG SV++    G +DFKIG F
Subjt:  AGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGSVNDEFLIGPEDFKIGGF

Query:  DGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNNPGATKLYRGQGYKCIKVP
        D K+SLNKGYVAGILTVDTVAD+LPRKGPLRQRRTGIAYVSNVAVRE FRRKGIAK+LI KAE  A+NWGCRAI LHCD NN GATKLY+ QG++ IK+P
Subjt:  DGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNNPGATKLYRGQGYKCIKVP

Query:  EGANWPQPKTSPDINYSFMMKLLKNH
        EGA WPQPKTSPD  ++FMMKL+ N+
Subjt:  EGANWPQPKTSPDINYSFMMKLLKNH

AT2G39000.2 Acyl-CoA N-acyltransferases (NAT) superfamily protein; e-value 9.3e-59; identity 76.69%
Query:  AGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGSVNDEFLIGPEDFKIGGF
        +G C ASQ+ DLFP VSPE+ VREAR+EDCWEVAETHCSSFFPGYSFPLD VLRVDRL+AM+ G S+P GC+R CLVAVIG SV++    G +DFKIG F
Subjt:  AGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGSVNDEFLIGPEDFKIGGF

Query:  DGKVSLNKGYVAGILTVDTVADFLPRKGPLRQR
        D K+SLNKGYVAGILTVDTVAD+LPRKGPLRQR
Subjt:  DGKVSLNKGYVAGILTVDTVADFLPRKGPLRQR

AT2G39000.3 Acyl-CoA N-acyltransferases (NAT) superfamily protein; e-value 9.5e-104; identity 76.44%
Query:  GVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGSVNDEFLIGPEDFKIGGFD
        G C ASQ+ DLFP VSPE+ VREAR+EDCWEVAETHCSSFFPGYSFPLD VLRVDRL+AM+ G S+P GC+R CLVAVIG SV++    G +DFKIG FD
Subjt:  GVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGSVNDEFLIGPEDFKIGGFD

Query:  GKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNNPGATKLYRGQGYKCIKVPE
         K+SLNKGYVAGILTVDTVAD+LPRKGPLRQRRTGIAYVSNVAVRE FRRKGIAK+LI KAE  A+NWGCRAI LHCD NN GATKLY+ QG++ IK+PE
Subjt:  GKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNNPGATKLYRGQGYKCIKVPE

Query:  GANWPQPKTSPDINYSFMMKLLKNH
        GA WPQPKTSPD  ++FMMKL+ N+
Subjt:  GANWPQPKTSPDINYSFMMKLLKNH


Sequences
CDS sequence
TCTTCCAGAAACTTCACTCACAACAGATCTTGCTTTGCTCATGGCGTCTCTTTACTTCCTCTCCCCAAATTGGACTCAGGTGCTGGAGTTTGTAAAGCTAGTCAAGTTTT
TGACTTATTTCCAACTGTATCTCCTGAGGTAACGGTTCGAGAGGCAAGAATAGAGGACTGTTGGGAAGTTGCAGAGACTCATTGCAGCTCCTTCTTCCCGGGATACTCCT
TCCCTTTGGATTTTGTGCTGAGGGTTGATAGGCTTGTAGCAATGTTATCTGGATTGTCTGTTCCAAATGGTTGCAGGAGGATTTGTTTGGTTGCTGTGATTGGTGGCTCA
GTGAATGATGAATTCCTTATTGGACCTGAAGATTTTAAGATTGGGGGATTTGATGGCAAGGTTAGTCTCAACAAGGGCTATGTTGCTGGAATCTTGACCGTCGATACCGT
CGCCGATTTCCTACCGAGAAAAGGACCGTTGCGGCAACGGAGGACGGGGATTGCATACGTATCAAATGTAGCAGTTCGTGAGCGGTTCCGACGCAAGGGAATAGCCAAAA
AGCTAATAGTTAAGGCAGAGACTGAAGCTAGGAACTGGGGGTGCCGGGCGATCGCATTGCATTGTGATACAAATAACCCAGGGGCTACAAAGCTGTACAGAGGTCAGGGT
TACAAATGCATCAAAGTACCAGAAGGAGCAAACTGGCCTCAGCCAAAGACCTCTCCAGACATCAACTACAGCTTCATGATGAAGCTTCTGAAGAACCATCCTGCC
mRNA sequence
TCTTCCAGAAACTTCACTCACAACAGATCTTGCTTTGCTCATGGCGTCTCTTTACTTCCTCTCCCCAAATTGGACTCAGGTGCTGGAGTTTGTAAAGCTAGTCAAGTTTT
TGACTTATTTCCAACTGTATCTCCTGAGGTAACGGTTCGAGAGGCAAGAATAGAGGACTGTTGGGAAGTTGCAGAGACTCATTGCAGCTCCTTCTTCCCGGGATACTCCT
TCCCTTTGGATTTTGTGCTGAGGGTTGATAGGCTTGTAGCAATGTTATCTGGATTGTCTGTTCCAAATGGTTGCAGGAGGATTTGTTTGGTTGCTGTGATTGGTGGCTCA
GTGAATGATGAATTCCTTATTGGACCTGAAGATTTTAAGATTGGGGGATTTGATGGCAAGGTTAGTCTCAACAAGGGCTATGTTGCTGGAATCTTGACCGTCGATACCGT
CGCCGATTTCCTACCGAGAAAAGGACCGTTGCGGCAACGGAGGACGGGGATTGCATACGTATCAAATGTAGCAGTTCGTGAGCGGTTCCGACGCAAGGGAATAGCCAAAA
AGCTAATAGTTAAGGCAGAGACTGAAGCTAGGAACTGGGGGTGCCGGGCGATCGCATTGCATTGTGATACAAATAACCCAGGGGCTACAAAGCTGTACAGAGGTCAGGGT
TACAAATGCATCAAAGTACCAGAAGGAGCAAACTGGCCTCAGCCAAAGACCTCTCCAGACATCAACTACAGCTTCATGATGAAGCTTCTGAAGAACCATCCTGCC
Protein sequence
SSRNFTHNRSCFAHGVSLLPLPKLDSGAGVCKASQVFDLFPTVSPEVTVREARIEDCWEVAETHCSSFFPGYSFPLDFVLRVDRLVAMLSGLSVPNGCRRICLVAVIGGS
VNDEFLIGPEDFKIGGFDGKVSLNKGYVAGILTVDTVADFLPRKGPLRQRRTGIAYVSNVAVRERFRRKGIAKKLIVKAETEARNWGCRAIALHCDTNNPGATKLYRGQG
YKCIKVPEGANWPQPKTSPDINYSFMMKLLKNHPA
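
The protein sequence above can be checked against the CDS by translating the CDS in frame from its first base. A minimal sketch of that translation, assuming the standard genetic code (NCBI translation table 1), which the page does not name explicitly. `translate` is a hypothetical helper written for illustration, and the fragment used is the final 24 bases of the CDS listed above:

```python
from itertools import product

# Standard genetic code (ASSUMED: NCBI translation table 1).
# Codons are enumerated with bases in T, C, A, G order, so "TTT" is first
# and "GGG" is last; the amino-acid string follows that same order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0.

    Unrecognised codons (e.g. containing N) become 'X'; trailing bases that
    do not fill a complete codon are ignored.
    """
    seq = cds.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Final 24 bases of the CDS shown above.
tail = "AAGCTTCTGAAGAACCATCCTGCC"
print(translate(tail))  # KLLKNHPA
```

The output matches the last eight residues of the protein sequence. Translating the full CDS the same way and comparing it with the full protein would be the complete check.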