; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035647 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035647
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationchr3:26372678..26373408
RNA-Seq ExpressionLag0035647
SyntenyLag0035647
Gene Ontology termsGO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135660.1 uncharacterized protein LOC101218036 [Cucumis sativus]1.5e-7781.29Show/hide
Query:  SSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKG
        +  MEL ++TLRPF+LSDVDDFM+WAGDD+VMKFIRWN  TSKEQA  FIRDVCIPHPWRRSIC+D +SVGFVSVYPWS  DRCKADVGYAVAR++WG+G
Subjt:  SSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKG

Query:  IATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        IAT+ALR+A+P+VF+ FPDVVRLQAFV  ENRASQRVVEKVGFQKEG LRKYCYIKGEI DLIVYS LSSD
Subjt:  IATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

XP_008450752.1 PREDICTED: uncharacterized N-acetyltransferase p20-like [Cucumis melo]1.2e-7776.6Show/hide
Query:  GKGARL--SISSST-FSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRC
        GKG  L   ISS+T  + +ME+ ++TLRPFQLSDVDDFM+WAGDD+VMK IRWN  TSKEQA  FIRDVCIPHPWRRSIC+D +SVGFVSVYPWS  DRC
Subjt:  GKGARL--SISSST-FSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRC

Query:  KADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDV
        KADVGYAVAR++WG+GI T+AL+IA+PQVF+ FP+VVRLQAFV +ENRASQRVVEKVGFQKEG LRKYCYIKGE+KDL VYS LSSD+
Subjt:  KADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDV

XP_015968252.1 uncharacterized protein LOC107491853 [Arachis duranensis]4.1e-6770Show/hide
Query:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        SN++L +++LRPF+LSDVDDF+LWAGDDQV + +RW T  S+E+ALAFIRDVCIPHPWRRSIC+DD+S+GFVSVYPWS  DRCKAD+GYAVA D+WG+GI
Subjt:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        ATKA+++A+PQVF     +VRLQAF S +N+ASQRV+EK GF KEG LRKY Y KG IKD IV+S LS+D
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

XP_016207519.1 uncharacterized protein LOC107647998 [Arachis ipaensis]2.4e-6770Show/hide
Query:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        SN++L +++LRPF+LSDVDDF+LWAGDDQV + +RW T  S+E+ALAFIRDVCIPHPWRRSIC+DD+S+GFVSVYPWS  DRCKAD+GYAVA D+WG+GI
Subjt:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        ATKA+++A+PQVF     +VRLQAF S EN+ASQRV+EK GF KEG LRKY Y KG IKD +V+S LS+D
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

XP_031249418.1 uncharacterized protein LOC116107261 [Pistacia vera]5.4e-6767.65Show/hide
Query:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        + ++L  +TLRPF+L+DVDDFM+WAGDDQVMKFIRWNT TSKE+AL +I+DVC PHPWRRSICIDD S+GFVS++P S  DRC+AD+GYA+A  +WGKGI
Subjt:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        AT+A++IA+ +VF+ FPDV+RLQA+V VEN+ASQRV+EK GF +EG LRKY YIKG+++DL +YS L+ D
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

TrEMBL top hitse value%identityAlignment
A0A0A0LW78 N-acetyltransferase domain-containing protein7.3e-7881.29Show/hide
Query:  SSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKG
        +  MEL ++TLRPF+LSDVDDFM+WAGDD+VMKFIRWN  TSKEQA  FIRDVCIPHPWRRSIC+D +SVGFVSVYPWS  DRCKADVGYAVAR++WG+G
Subjt:  SSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKG

Query:  IATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        IAT+ALR+A+P+VF+ FPDVVRLQAFV  ENRASQRVVEKVGFQKEG LRKYCYIKGEI DLIVYS LSSD
Subjt:  IATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

A0A1S3BPY9 uncharacterized N-acetyltransferase p20-like5.6e-7876.6Show/hide
Query:  GKGARL--SISSST-FSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRC
        GKG  L   ISS+T  + +ME+ ++TLRPFQLSDVDDFM+WAGDD+VMK IRWN  TSKEQA  FIRDVCIPHPWRRSIC+D +SVGFVSVYPWS  DRC
Subjt:  GKGARL--SISSST-FSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRC

Query:  KADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDV
        KADVGYAVAR++WG+GI T+AL+IA+PQVF+ FP+VVRLQAFV +ENRASQRVVEKVGFQKEG LRKYCYIKGE+KDL VYS LSSD+
Subjt:  KADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDV

A0A444YK79 N-acetyltransferase domain-containing protein1.2e-6770Show/hide
Query:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        SN++L +++LRPF+LSDVDDF+LWAGDDQV + +RW T  S+E+ALAFIRDVCIPHPWRRSIC+DD+S+GFVSVYPWS  DRCKAD+GYAVA D+WG+GI
Subjt:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        ATKA+++A+PQVF     +VRLQAF S EN+ASQRV+EK GF KEG LRKY Y KG IKD +V+S LS+D
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

A0A5D3CG01 Putative N-acetyltransferase p20-like5.6e-7876.6Show/hide
Query:  GKGARL--SISSST-FSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRC
        GKG  L   ISS+T  + +ME+ ++TLRPFQLSDVDDFM+WAGDD+VMK IRWN  TSKEQA  FIRDVCIPHPWRRSIC+D +SVGFVSVYPWS  DRC
Subjt:  GKGARL--SISSST-FSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRC

Query:  KADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDV
        KADVGYAVAR++WG+GI T+AL+IA+PQVF+ FP+VVRLQAFV +ENRASQRVVEKVGFQKEG LRKYCYIKGE+KDL VYS LSSD+
Subjt:  KADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDV

A0A6P4DMD4 uncharacterized protein LOC1074918532.0e-6770Show/hide
Query:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        SN++L +++LRPF+LSDVDDF+LWAGDDQV + +RW T  S+E+ALAFIRDVCIPHPWRRSIC+DD+S+GFVSVYPWS  DRCKAD+GYAVA D+WG+GI
Subjt:  SNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        ATKA+++A+PQVF     +VRLQAF S +N+ASQRV+EK GF KEG LRKY Y KG IKD IV+S LS+D
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

SwissProt top hitse value%identityAlignment
O31633 Putative [ribosomal protein S5]-alanine N-acetyltransferase8.2e-1037.74Show/hide
Query:  DDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVY
        DD+ +G VS++         A +GY + + H GKGI T+A+R+ +   F     + R++A V   N  S RV+EK GF KEG  RK   I G  +D  V 
Subjt:  DDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVY

Query:  SILSSD
        +IL+ D
Subjt:  SILSSD

O34569 Uncharacterized N-acetyltransferase YoaA2.1e-1328.07Show/hide
Query:  MELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSI------CIDDQSVGFVSVYPWSDADRCKADVGYAVARDHW
        +E  +L LR     D +       +D+V ++     + S EQA++ I+     +  +R I          + +G +  +  +   R +A++GY +  +HW
Subjt:  MELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSI------CIDDQSVGFVSVYPWSDADRCKADVGYAVARDHW

Query:  GKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSIL
          G A++ +   +   F +   + R+ A V  +N AS R++ K+GFQKEG LR+Y Y  G   D  VYSI+
Subjt:  GKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSIL

P05332 Uncharacterized N-acetyltransferase p201.3e-1531.76Show/hide
Query:  QLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQA---LAFIRDVCIPHPWRR-SICI--DDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        +LTLR  +L D D    +  D +V K++     T   QA   +  I D+ +     R SI +   D+ +G    +   D +  +A++GY + R+HWGKG 
Subjt:  QLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQA---LAFIRDVCIPHPWRR-SICI--DDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        A++A++  I   F S  ++ R++A V  EN  S +++  + FQKEG LR Y   KG + D+ ++S+L  +
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

P49855 Uncharacterized protein YkkB1.5e-0625Show/hide
Query:  QLTLRPFQLSDVDDFMLWAG---DDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATK
        Q TL     +D D    ++G     Q  +++ WN    K   ++          W        + +G   + P    ++   ++GY  AR HWG G A +
Subjt:  QLTLRPFQLSDVDDFMLWAG---DDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATK

Query:  ALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKY
        A R  +   F       ++ A +   N+AS RV EK+G     T+RK+
Subjt:  ALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKY

P96579 Putative ribosomal N-acetyltransferase YdaF1.3e-1029.45Show/hide
Query:  RWNTLTSKEQALAFIRDVCIPHPWRR----------SICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQA
        +W        +    R+  IP  WRR           +  D    G +S++     +R KA++GY +A++  GKGI T A R  I   F+   ++ R+  
Subjt:  RWNTLTSKEQALAFIRDVCIPHPWRR----------SICIDDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQA

Query:  FVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
          +V N  S+ V E++GF +EG  R   Y+ G   DL+ YS+L  +
Subjt:  FVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

Arabidopsis top hitse value%identityAlignment
AT2G32020.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein3.1e-4443.02Show/hide
Query:  SISSSTFSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICI-DDQSVGFVSVYPWSDADRCKADVGYAV
        S  ++   S+   G+++LRP  LSDVDD+M+WA D +V +F  W   TS+++A+ +I D  + HPW R+IC+ DD+ +G++ +      D  + ++GY +
Subjt:  SISSSTFSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICI-DDQSVGFVSVYPWSDADRCKADVGYAV

Query:  ARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        AR +WGKG AT+A+R+   +VF+ FP++ RL+A V V+N  SQRV+EKVGF +EG +RK+  IKG ++D +++S LS+D
Subjt:  ARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

AT2G32030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein4.7e-4546.67Show/hide
Query:  QLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICID-DQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATKAL
        ++ LRP  LSDVDDFM+WA D  V +F  W   TS+E A+A++ D  +PHPW R+IC+D D+ +G +SV P    D  + ++GY +   +WGKGIAT+A+
Subjt:  QLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICID-DQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGIATKAL

Query:  RIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        R+   ++F+  P++ RL+A V V+N  SQ+V+EKVGF KEG +RK+ Y+KG ++D++++S L SD
Subjt:  RIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD

AT3G22560.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein8.3e-5053.53Show/hide
Query:  MELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICI--DDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI
        ME  ++ LRPF LSD +D   WAGDD V +++RW+++ S E+A   I +  IPHPWRRSI +  D  S+G+VSV P S   RC+AD+ YAVA++ WG+GI
Subjt:  MELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICI--DDQSVGFVSVYPWSDADRCKADVGYAVARDHWGKGI

Query:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD
        AT A+R+A+ Q  + FP+VVRLQA V VEN+ASQRV+EK GF+KEG L KY + KG I+D+ +YS +  D
Subjt:  ATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATCAGGAAGAGCCATGGCCTAGACTTGTGACTTCCATTGCACTTGGGAAGGGTGCGCGTCTAAGCATTTCGTCGAGCACCTTCTCTTCAAATATGGAACTCGG
TCAACTCACTCTCCGCCCATTTCAACTCTCCGATGTCGATGATTTCATGCTCTGGGCTGGAGACGATCAAGTCATGAAGTTCATTAGATGGAACACTCTCACTTCCAAGG
AACAGGCCCTGGCTTTCATTAGAGATGTTTGTATTCCCCATCCTTGGCGCCGCTCCATCTGCATCGACGACCAATCTGTCGGATTCGTCTCGGTTTACCCGTGGTCGGAT
GCCGACCGGTGCAAGGCGGATGTCGGATATGCCGTGGCTAGAGATCACTGGGGTAAAGGAATTGCAACCAAGGCGTTGAGAATTGCCATTCCTCAGGTGTTCCAAAGCTT
CCCCGATGTGGTGAGATTGCAGGCCTTTGTGTCTGTGGAGAATAGGGCGTCTCAGAGAGTTGTTGAGAAAGTTGGGTTTCAGAAGGAGGGGACTTTGAGGAAATATTGCT
ACATCAAGGGGGAGATCAAGGATTTGATTGTTTATAGCATTTTGTCCTCTGATGTCAAAGAAACTGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATCAGGAAGAGCCATGGCCTAGACTTGTGACTTCCATTGCACTTGGGAAGGGTGCGCGTCTAAGCATTTCGTCGAGCACCTTCTCTTCAAATATGGAACTCGG
TCAACTCACTCTCCGCCCATTTCAACTCTCCGATGTCGATGATTTCATGCTCTGGGCTGGAGACGATCAAGTCATGAAGTTCATTAGATGGAACACTCTCACTTCCAAGG
AACAGGCCCTGGCTTTCATTAGAGATGTTTGTATTCCCCATCCTTGGCGCCGCTCCATCTGCATCGACGACCAATCTGTCGGATTCGTCTCGGTTTACCCGTGGTCGGAT
GCCGACCGGTGCAAGGCGGATGTCGGATATGCCGTGGCTAGAGATCACTGGGGTAAAGGAATTGCAACCAAGGCGTTGAGAATTGCCATTCCTCAGGTGTTCCAAAGCTT
CCCCGATGTGGTGAGATTGCAGGCCTTTGTGTCTGTGGAGAATAGGGCGTCTCAGAGAGTTGTTGAGAAAGTTGGGTTTCAGAAGGAGGGGACTTTGAGGAAATATTGCT
ACATCAAGGGGGAGATCAAGGATTTGATTGTTTATAGCATTTTGTCCTCTGATGTCAAAGAAACTGGATGA
Protein sequenceShow/hide protein sequence
MADQEEPWPRLVTSIALGKGARLSISSSTFSSNMELGQLTLRPFQLSDVDDFMLWAGDDQVMKFIRWNTLTSKEQALAFIRDVCIPHPWRRSICIDDQSVGFVSVYPWSD
ADRCKADVGYAVARDHWGKGIATKALRIAIPQVFQSFPDVVRLQAFVSVENRASQRVVEKVGFQKEGTLRKYCYIKGEIKDLIVYSILSSDVKETG