; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0516 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0516
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationMC05:3915527..3918253
RNA-Seq ExpressionMC05g0516
SyntenyMC05g0516
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138097.1 uncharacterized protein LOC111009349 [Momordica charantia]2.87e-126100Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE
        VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE

XP_022138172.1 uncharacterized protein LOC111009408 [Momordica charantia]4.26e-10689.71Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        MAVTLRPLDLTDIDDFMVWASDEK AR CSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAI V+ NSAARDRC  ELGYVLGS FWGKGIAT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDEL
        VKLVAERIFEERP LER+EALV VENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLST L SK  ++L
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDEL

XP_022955368.1 uncharacterized protein LOC111457417 [Cucurbita moschata]6.20e-10287.5Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        M +TLRPLDLTDIDDFMVWASDEKAARSCSWEPY DKSDA+K+I D+VL HPW+RAICVDGRPVGAISVTAN AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ
        VKLVAERIF ERPELER+EALVAVENLASQRV+EKAGF REGVLRKYGV+KG+TRD+VMFSLLSTDL+
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ

XP_022980780.1 uncharacterized protein LOC111480066 [Cucurbita maxima]1.03e-10086.9Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        M +TLRPLDLTDIDDFM WASDEKAARSCSWEPY DK DA+K+I D+VL HPWYRAICVDGRPVGAISVTAN AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ
        VKLVAERIF ERPELER+EALV VENLASQRVVEKAGF REGVLRKYGV+KG+TRD+VMFSLLSTDL+
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ

XP_023527199.1 uncharacterized protein LOC111790509 [Cucurbita pepo subsp. pepo]8.43e-10086.9Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        M +TLRPLDLTDIDDFMVWASDEKAARSCSWEPY D SDA+K+I D+VL HPW+RAICVDGRPVGAISVTAN AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ
        VKLVAERIF ERPELER+EALV VENLASQRVVEKAGF REGVLRKYGV+KG+ RD+VMFSLLSTDLQ
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ

TrEMBL top hitse value%identityAlignment
A0A5D3D131 Putative N-acetyltransferase p20-like7.14e-9280.47Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        M +TLRPLDLTDIDDFM WA+DEKAAR CSWEPY DKS+A+KFI D+VL HP+YRAICVDGRPVGAISV +N+AARD+C GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQS
        VKLVAERIF E PELER+EALV VEN ASQRV+EKAGF REGVLRKYGVLKG  RD+VMFS L TD  S
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQS

A0A6J1CA30 uncharacterized protein LOC1110093491.39e-126100Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE
        VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE

A0A6J1CAC0 uncharacterized protein LOC1110094082.06e-10689.71Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        MAVTLRPLDLTDIDDFMVWASDEK AR CSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAI V+ NSAARDRC  ELGYVLGS FWGKGIAT A
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDEL
        VKLVAERIFEERP LER+EALV VENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLST L SK  ++L
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDEL

A0A6J1GTR6 uncharacterized protein LOC1114574173.00e-10287.5Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        M +TLRPLDLTDIDDFMVWASDEKAARSCSWEPY DKSDA+K+I D+VL HPW+RAICVDGRPVGAISVTAN AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ
        VKLVAERIF ERPELER+EALVAVENLASQRV+EKAGF REGVLRKYGV+KG+TRD+VMFSLLSTDL+
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ

A0A6J1IXH7 uncharacterized protein LOC1114800664.97e-10186.9Show/hide
Query:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        M +TLRPLDLTDIDDFM WASDEKAARSCSWEPY DK DA+K+I D+VL HPWYRAICVDGRPVGAISVTAN AARDRC GELGYVLGS FWGKGI TAA
Subjt:  MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ
        VKLVAERIF ERPELER+EALV VENLASQRVVEKAGF REGVLRKYGV+KG+TRD+VMFSLLSTDL+
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQ

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase5.6e-1045.24Show/hide
Query:  LGYVLGSNFWGKGIATAAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD
        +GY L     GKGI T AV+LV +  F E  +L R+EA V   NL S RV+EKAGFH+EG+ RK   + G   D  + ++L+ D
Subjt:  LGYVLGSNFWGKGIATAAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD

O34569 Uncharacterized N-acetyltransferase YoaA4.6e-0434.15Show/hide
Query:  ELGYVLGSNFWGKGIATAAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLL
        E+GY +    W  G A+  +  V    F     L R+ A+V  +N AS R++ K GF +EGVLR+Y    G   D  ++S++
Subjt:  ELGYVLGSNFWGKGIATAAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLL

P05332 Uncharacterized N-acetyltransferase p201.8e-1332.74Show/hide
Query:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKS---DALKFIKDKVLPHPWYRAICVDGRPVGAISVTA-NSAARDRCGGELGYVLGSNFWGKGIAT
        +TLR ++L D D    + SD +  +  +  P+TD S   D ++ I D  L     R   +       I     N   ++    E+GY LG N WGKG A+
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKS---DALKFIKDKVLPHPWYRAICVDGRPVGAISVTA-NSAARDRCGGELGYVLGSNFWGKGIAT

Query:  AAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD
         AV+ + +  F     L R+EA V  EN  S +++    F +EG+LR Y   KG+  D  MFSLL  +
Subjt:  AAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD

P96579 Putative ribosomal N-acetyltransferase YdaF9.8e-0727.27Show/hide
Query:  LRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYR----------AICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGK
        L P D   + + ++    +   R   W  + +   +    ++ ++P  W R           +  DG   G IS+  ++  +     E+GY +   F GK
Subjt:  LRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYR----------AICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGK

Query:  GIATAAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSK
        GI TAA + +    FEE  EL RV    AV N  S+ V E+ GF  EG  R    + G   D V +SLL  + + +
Subjt:  GIATAAVKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSK

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.3e-5056.97Show/hide
Query:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICV-DGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAAV
        ++LRP+ L+D+DD+MVWA+D K AR C+WEP T + +A+K+I D+VL HPW RAIC+ D RP+G I +     A D    E+GYVL   +WGKG AT AV
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICV-DGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAAV

Query:  KLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD
        +LV   +FEE PE+ER+EALV V+N+ SQRV+EK GF REGV+RK+  +KG  RD VMFS LSTD
Subjt:  KLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein2.7e-5257.58Show/hide
Query:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDG-RPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAAV
        + LRP+ L+D+DDFMVWA+D    R C+WEPYT +  A+ ++ D +LPHPW RAIC+D  RP+G+ISVT      D   GE+GYVLGS +WGKGIAT AV
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDG-RPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAAV

Query:  KLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD
        +LVA  IF+E+PE++R+EALV V+N+ SQ+V+EK GF +EGV+RK+  LKG  RD VMFS L +D
Subjt:  KLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein6.1e-3643.37Show/hide
Query:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICV--DGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA
        + LRP +L+D +D   WA D+   R   W+      +A + I +K +PHPW R+I +  DG  +G +SV  +S    RC  +L Y +   FWG+GIATAA
Subjt:  VTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICV--DGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAA

Query:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD
        V++  E+  E+ PE+ R++A+V VEN ASQRV+EKAGF +EG+L KYG  KG  RD  ++S +  D
Subjt:  VKLVAERIFEERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGTGACTCTCCGGCCGCTGGATCTGACCGACATCGACGATTTCATGGTGTGGGCGTCGGACGAGAAGGCGGCTCGATCCTGCTCGTGGGAGCCATACACGGACAA
ATCAGACGCCCTAAAGTTCATCAAAGACAAAGTCCTGCCGCACCCGTGGTACCGGGCGATCTGCGTCGACGGCCGGCCGGTCGGAGCGATATCGGTGACGGCGAATTCGG
CGGCCAGGGACCGGTGCGGAGGCGAGCTAGGGTACGTATTGGGCTCCAATTTTTGGGGGAAAGGGATCGCGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTCGAG
GAGCGGCCGGAATTGGAGCGGGTGGAGGCGCTGGTGGCTGTGGAGAATTTGGCGTCTCAGAGAGTGGTGGAGAAGGCCGGTTTTCACAGAGAAGGTGTTCTCAGAAAGTA
TGGAGTATTGAAGGGCAAAACCAGGGATTTTGTCATGTTCAGTCTTCTTTCTACTGATCTTCAATCCAAAATTATTGATGAACTGTACTGCGTCAAAATTTCTGAATGA
mRNA sequenceShow/hide mRNA sequence
ATTTTGCAATTCTATTTATTTATTACTTTATAAATAAATAAAAAACTACAGCTGTGCTAAACTTATTGGCGAAGTTTAGAATTCTATAAAAGACATTTAAACCAAAATAT
ATCAACTAAGAAAACTATTAAATGATACCCAGGTTTTGTATACCTAAATTGGTCACTGGATTTGATCGGTTCTTAGAAAACTGAAAATGAAAAAATCCCGAATTTCGCCC
TTACAAAATCCATATTTCCTCAACAGCCCCTCTCTGCAAAACCCCACTTTTTCCAGAACCCTCTGGGATCCATTATTCTCGACTTCCACCAACGCCTGAACCCTAACAAC
CTCCGGGAACTCTTTCAACGCCTCCGTAACGGCCGCCCTCAGCGCCGCCGTGGCAATCCCTCGTCCCCAATGCTCCGCCGCCACGGCATAGCTGATATGTGCTCTGCATC
TCTCCTCCGTTTCCGGCCTGACCGAAACGTACCCGACGGAGCGGCCGTCCAAGCATATGGACCGCCGCCATGGATGGGGAATCGCGACCTTCTCTAGGTAAGTTATTGCT
TCTTCTTTGGAGGTAATTGCGTCCCATCGGAGGTAGCGGGTTACTCTGTCGTCGCCGGCCCACCTGAGAAAGTCGTTGGCGTCGGAAAGCTTGAACGCCCGGACGGAAAT
TCTTGATGACTCCATTGATCAGAAAGAGGGCCGGTTGTTCCTGAGAGTTGGGGCTGGTCAGCTACGGCGGCCATCAATTTATAGGCAAAATTCCCCTTTTACCCTTTATT
TTAGCTGGGTCAATTTTTATCTTAAAATTCAGGGTGAAATTACAAGTTTTCAGTGTCTAATAATTTTTTATTTTTATTTTTGTGTTTAATAAAATTGTCAATTTAAAAAA
GGATATAAGTTCTCAGACTTTCAACTCTAAAACATATGTATAAACTTAAAATTGCCTTATAAATTAGAGATTTATCCTGTTATATATTTTCCAAAGCTTGAAAATCTATT
TAAATACGAACATATTGGATAGTTTTTACAAATTACAAACTTATTAAACACAAGTTTATAATTTTATTTTTATGGGCTAAACTGACGATTTAATCAAAATTTTCATTAAT
TTAAAAGTAAAAAAAAAAGTTCCATCTTGCTTGACCATTACCAGGTATCATGACCTCTCCTGGAAGATAAAACGTTAAAATTTCTACTCCCACCTTTTTTTTAAACTAAA
AACATAAATAATTAAAATTATTTTTAATCTTTGAAATTTTAAAATATCTATTTCAGACTAAATTCTATTAGTATCATTTTTTTTTTCATTTTTAGATTTACATAAATGAC
CGCAAAACACAAAAACCTTATTTTACTTAAAAATAAATCATATAATTTGGATTGAAATTTTTTTAACAAAACAAATAACATGACTAAAAATACTAGACTTGGATTTTTCT
TTTCAAATTACAACAAATATGGAGGAGGAGAACAAACTCAAAAATATCATAATTCAATTAATTAAAGTTTATATATATACACACTAAAAATAAGTCGAAAGTTCTCCATG
TGGTGTTAAATTTAATAAAAATAAAAATAAATTGCTGACGTGCGAGACTTTGAGTCCACCCCAAGCAATGCAATTTTGACTCATCAAATATGAAAGTTCGTCGCAGTTTC
TCCTCTCCTCTCTATAAATCCCCCATTGAAATTTCCAAACTCCGCAGAGGAAGAGAAAAAGATCCGCAGCCATGGCAGTGACTCTCCGGCCGCTGGATCTGACCGACATC
GACGATTTCATGGTGTGGGCGTCGGACGAGAAGGCGGCTCGATCCTGCTCGTGGGAGCCATACACGGACAAATCAGACGCCCTAAAGTTCATCAAAGACAAAGTCCTGCC
GCACCCGTGGTACCGGGCGATCTGCGTCGACGGCCGGCCGGTCGGAGCGATATCGGTGACGGCGAATTCGGCGGCCAGGGACCGGTGCGGAGGCGAGCTAGGGTACGTAT
TGGGCTCCAATTTTTGGGGGAAAGGGATCGCGACGGCGGCGGTGAAATTGGTGGCGGAGAGGATTTTCGAGGAGCGGCCGGAATTGGAGCGGGTGGAGGCGCTGGTGGCT
GTGGAGAATTTGGCGTCTCAGAGAGTGGTGGAGAAGGCCGGTTTTCACAGAGAAGGTGTTCTCAGAAAGTATGGAGTATTGAAGGGCAAAACCAGGGATTTTGTCATGTT
CAGTCTTCTTTCTACTGATCTTCAATCCAAAATTATTGATGAACTGTACTGCGTCAAAATTTCTGAATGATTTTAATTTGAGTTTCGAATTCAGTAGTTTTTACAGATTT
GTGTGCAAGGGAGAAGGGCCGCCATTGTTGTAGTGTGTAAGCTTAAAATCGATATGAATTTTTTCTCTCTCTAAAAGGTCCATCAAGTCTTTACTCATTGGAGAGGAGCT
ATGTATCTGTGGATGTACTCATAATTTCATTGGATTAAGTAAGATTCAATTAAGCTTAAAAGAAAATGTCTTTTTTGGAAGTTTCATTGAGATGATAGAGACGTAATTAA
AATTAAGATTGCCATTATAGTTCAAATACATGGCATATAAATTATTATCCCAAAAAAAATATACATGGTATATAATTTAGTAATTTAATCGTTTAAGTGGTAAATGTGTC
ACAGTTATCAAAATGGTGTTTGACACATCATTTTAATAAGCTTGAATTTGTATGATTGCTTTGTAACATAAAACTTTGGAATACAAT
Protein sequenceShow/hide protein sequence
MAVTLRPLDLTDIDDFMVWASDEKAARSCSWEPYTDKSDALKFIKDKVLPHPWYRAICVDGRPVGAISVTANSAARDRCGGELGYVLGSNFWGKGIATAAVKLVAERIFE
ERPELERVEALVAVENLASQRVVEKAGFHREGVLRKYGVLKGKTRDFVMFSLLSTDLQSKIIDELYCVKISE