; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021018 (gene) of Snake gourd v1 genome

Gene IDTan0021018
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionN-acetyltransferase domain-containing protein
Genome locationLG01:79291564..79302105
RNA-Seq ExpressionTan0021018
SyntenyTan0021018
Gene Ontology termsGO:0005737 - cytoplasm (cellular component)
GO:0008080 - N-acetyltransferase activity (molecular function)
InterPro domainsIPR000182 - GNAT domain
IPR016181 - Acyl-CoA N-acyltransferase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592197.1 hypothetical protein SDJN03_14543, partial [Cucurbita argyrosperma subsp. sororia]1.9e-10977.74Show/hide
Query:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AV A          I RNGGI+ R AVVV+CS DY SPIAAA    IG S+EIE  N N YLA EFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLK+YP DRYACLVAE ESES ++EYNFVGVVDVTVAGDLKV+RLLPAG KEYLFV+GIAV 
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        QTARRRKVAT LLKGCDMLAK+WGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKSSWIGRKRCVTM+K L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

XP_004139678.1 uncharacterized protein LOC101214390 isoform X2 [Cucumis sativus]3.2e-10976.64Show/hide
Query:  MVHLLPNPLQVSS---------AVP-----AAIWR-NGGIRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+V S         AVP       +WR  GGI+ RSAVVV+CS DY SPI AAA   E+L+G+S+EI+    NEYLA EFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVP-----AAIWR-NGGIRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DL+ VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLKNYP DRYACLVAE ESE  EEEYNFVGVVDVTVAGDLK+KRLLP G KEYLFV+GIAV+
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        Q ARRRKVAT LLKGCDML K+WGFKFLALSAYEDDYGARNLYSKAGYQV+  DPLWKS+WIGRKRCVTMIK L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

XP_008461923.1 PREDICTED: uncharacterized protein LOC103500412 isoform X1 [Cucumis melo]9.2e-10977.37Show/hide
Query:  MVHLLPNPLQVSS---------AVPA-----AIWRNGG-IRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AVP      A+WR GG I+ RSAVVV+CS DY SPI AA    E+LI +S+E+     NEYLAREFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA-----AIWRNGG-IRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLKNYP++RYACLVAE ESE  EEEYNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV+
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        Q ARRRKVAT LLKGCDML K+WGFKFLALSAYEDDYGARNLYSKAGYQV+  DPLWKS+WIGRKRCVTMIK L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

XP_022936794.1 uncharacterized protein LOC111443273 [Cucurbita moschata]4.1e-10977.37Show/hide
Query:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AV A          I RNGGI+ R AVVV+CS DY SPIAAA    IG S+EIE  N N YLA EFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLK+YP DRYACLVAE ESE+ ++EYNFVGVVDVTVAGDLKV+RLLPAG KEYLFV+GIAV 
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        QTARRRKVAT LLKGCDMLAK+WGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKSSWIGRKRCVTM+K L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

XP_038897179.1 uncharacterized protein LOC120085320 [Benincasa hispida]1.4e-10977.37Show/hide
Query:  MVHLLPNPLQVS---------SAVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VS         +AV A         AIWRNGGI+ RSAVV++CS DY SP    AE+ IG+ +EI   N NEYLAREFGW VRKLI+EED
Subjt:  MVHLLPNPLQVS---------SAVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FN FF+QFFQ EVLSALIYRLKNYP DRYACLVAE ESES ++EYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFV+GIAV+
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        QTARRRKVAT LLKGCDML K+WGFKFLALSAYEDDYGARNLYSKAGYQV+  DPLWKS+WIGRKRCVTMIK L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

TrEMBL top hitse value%identityAlignment
A0A0A0K826 N-acetyltransferase domain-containing protein1.5e-10976.64Show/hide
Query:  MVHLLPNPLQVSS---------AVP-----AAIWR-NGGIRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+V S         AVP       +WR  GGI+ RSAVVV+CS DY SPI AAA   E+L+G+S+EI+    NEYLA EFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVP-----AAIWR-NGGIRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DL+ VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLKNYP DRYACLVAE ESE  EEEYNFVGVVDVTVAGDLK+KRLLP G KEYLFV+GIAV+
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        Q ARRRKVAT LLKGCDML K+WGFKFLALSAYEDDYGARNLYSKAGYQV+  DPLWKS+WIGRKRCVTMIK L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

A0A1S3CFP1 uncharacterized protein LOC103500412 isoform X14.5e-10977.37Show/hide
Query:  MVHLLPNPLQVSS---------AVPA-----AIWRNGG-IRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AVP      A+WR GG I+ RSAVVV+CS DY SPI AA    E+LI +S+E+     NEYLAREFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA-----AIWRNGG-IRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLKNYP++RYACLVAE ESE  EEEYNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV+
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        Q ARRRKVAT LLKGCDML K+WGFKFLALSAYEDDYGARNLYSKAGYQV+  DPLWKS+WIGRKRCVTMIK L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

A0A1S4E3H1 uncharacterized protein LOC103500412 isoform X24.2e-10777.01Show/hide
Query:  MVHLLPNPLQVSS---------AVPA-----AIWRNGG-IRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AVP      A+WR GG I+ RSAVVV+CS DY SPI AA    E+LI +S+E+     NEYLAREFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA-----AIWRNGG-IRARSAVVVQCSRDYLSPIAAAA---EDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLKNYP++RYACLVAE ESE  EEEYNFVGVVDVTVAGDLKVKRLLP G KEYLFV+GIAV+
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        Q A RRKVAT LLKGCDML K+WGFKFLALSAYEDDYGARNLYSKAGYQV+  DPLWKS+WIGRKRCVTMIK L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

A0A6J1F9A7 uncharacterized protein LOC1114432732.0e-10977.37Show/hide
Query:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AV A          I RNGGI+ R AVVV+CS DY SPIAAA    IG S+EIE  N N YLA EFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLK+YP DRYACLVAE ESE+ ++EYNFVGVVDVTVAGDLKV+RLLPAG KEYLFV+GIAV 
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        QTARRRKVAT LLKGCDMLAK+WGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKSSWIGRKRCVTM+K L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

A0A6J1IJZ4 uncharacterized protein LOC1114769674.5e-10977.01Show/hide
Query:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED
        MVHLLPNPL+VSS         AV A          I RNGGI+ R AVVV+CS DY SPI AA    IG S+EIE  N N YLA EFGWKVRKLI+EED
Subjt:  MVHLLPNPLQVSS---------AVPA---------AIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEED

Query:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS
        DLR VARIQAEAFHEPVL FNHFF+QFFQ EVLSALIYRLKNYP DRYACLVAE ESE  ++EYNFVGVVDVTVAGDLKV+RLLPAG KEYLFV+GIAV 
Subjt:  DLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVS

Query:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL
        QTARRRKVAT LLKGCDMLA++WGFKFLALSAYEDDYGARNLYSKAGYQV   DPLWKSSWIGRKRCVTM+K L
Subjt:  QTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G72030.1 Acyl-CoA N-acyltransferases (NAT) superfamily protein1.8e-5754.72Show/hide
Query:  AAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQ-EEDDLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWE
        A+ E L  +++        +YL  + GW VR+L + +ED++R V+ +QAEAFH P+  F+ FF+ FFQ EVLSAL+Y+LKN P DRYACLVAE  SE+  
Subjt:  AAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQ-EEDDLRVVARIQAEAFHEPVLFFNHFFYQFFQVEVLSALIYRLKNYPSDRYACLVAEAESESWE

Query:  -EEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVSQTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSS
            + VGVVDVT   +  V R  P G +EYL+VSG+AVS++ RR+K+A+TLLK CD+L  +WGFK LAL AYEDD  ARNLYS AGY V   DPLW S+
Subjt:  -EEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVSQTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYGARNLYSKAGYQVFSADPLWKSS

Query:  WIGRKRCVTMIK
        WIGRKR V M K
Subjt:  WIGRKRCVTMIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCATTTGCTTCCAAATCCCCTCCAAGTTTCCTCGGCGGTTCCGGCGGCGATCTGGAGAAATGGAGGAATTAGGGCCAGATCGGCGGTGGTTGTGCAGTGCAGTAG
GGATTATTTGAGTCCGATTGCGGCGGCGGCAGAAGATTTGATCGGAATTTCTAAAGAAATTGAAACTAGAAATGGAAATGAGTATTTGGCGAGGGAATTTGGATGGAAAG
TGAGGAAATTGATTCAAGAAGAAGATGATTTGAGAGTGGTGGCAAGAATTCAAGCCGAAGCTTTTCATGAACCTGTTCTCTTTTTCAACCATTTTTTCTACCAGTTCTTC
CAGGTAGAAGTGCTGTCGGCGTTGATTTACAGATTGAAAAATTACCCTTCGGACAGGTATGCCTGTTTGGTTGCGGAGGCCGAAAGTGAAAGTTGGGAAGAAGAATACAA
TTTTGTGGGAGTGGTGGACGTCACGGTGGCCGGAGATTTGAAAGTAAAGCGCCTCCTTCCCGCCGGCGAAAAGGAGTATCTCTTTGTTTCTGGAATCGCTGTCTCACAAA
CTGCCAGAAGGCGCAAAGTAGCAACCACACTACTGAAGGGTTGTGACATGCTTGCGAAGATTTGGGGGTTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGG
GCTCGAAATTTGTATAGTAAAGCAGGGTATCAGGTTTTTTCTGCTGATCCTCTTTGGAAATCTTCTTGGATTGGGAGAAAGCGTTGTGTTACTATGATTAAACACCTCTA
A
mRNA sequenceShow/hide mRNA sequence
CAAAACCTTATCTCTCCACTCTCCCAATGCATCGCCTCAAAAAAATTTAACCCAAAAATTACAAATTTATAATAATAAGGGTTTGAAATTCCCTCAAGAAACGCGACTCT
TGACCGCGGTTTCAACGACGACCGACTACCGCCAAGTCCCTTCCAAAGAATGGTCCATTTGCTTCCAAATCCCCTCCAAGTTTCCTCGGCGGTTCCGGCGGCGATCTGGA
GAAATGGAGGAATTAGGGCCAGATCGGCGGTGGTTGTGCAGTGCAGTAGGGATTATTTGAGTCCGATTGCGGCGGCGGCAGAAGATTTGATCGGAATTTCTAAAGAAATT
GAAACTAGAAATGGAAATGAGTATTTGGCGAGGGAATTTGGATGGAAAGTGAGGAAATTGATTCAAGAAGAAGATGATTTGAGAGTGGTGGCAAGAATTCAAGCCGAAGC
TTTTCATGAACCTGTTCTCTTTTTCAACCATTTTTTCTACCAGTTCTTCCAGGTAGAAGTGCTGTCGGCGTTGATTTACAGATTGAAAAATTACCCTTCGGACAGGTATG
CCTGTTTGGTTGCGGAGGCCGAAAGTGAAAGTTGGGAAGAAGAATACAATTTTGTGGGAGTGGTGGACGTCACGGTGGCCGGAGATTTGAAAGTAAAGCGCCTCCTTCCC
GCCGGCGAAAAGGAGTATCTCTTTGTTTCTGGAATCGCTGTCTCACAAACTGCCAGAAGGCGCAAAGTAGCAACCACACTACTGAAGGGTTGTGACATGCTTGCGAAGAT
TTGGGGGTTCAAGTTTTTGGCATTAAGTGCATATGAAGATGATTATGGGGCTCGAAATTTGTATAGTAAAGCAGGGTATCAGGTTTTTTCTGCTGATCCTCTTTGGAAAT
CTTCTTGGATTGGGAGAAAGCGTTGTGTTACTATGATTAAACACCTCTAACTTCCTTTCTTATCTTTGTTTTTTAAAACCCGACACGTGTATAAAAAATATGTTTCAAAT
GATCAAAATTAATTTTAACCATCTCAAAATTACTTTCAAATATACTCATACACAGTTAACTTCAACTACCTGTTCACATAGTTTGTGAAGTTATAAACAAACAAAACACA
TAAGGACTAAAATAAAATTTACCTAGGGTTTGGATCTAAATAATAACGTTTGAAAATTTCTACCTCCAGTATGCAATTAGGGTAAAATTTGGGTTAAACATTAAAAATAT
AGTTATACTTTTATCAAGAATAAAAGTTTC
Protein sequenceShow/hide protein sequence
MVHLLPNPLQVSSAVPAAIWRNGGIRARSAVVVQCSRDYLSPIAAAAEDLIGISKEIETRNGNEYLAREFGWKVRKLIQEEDDLRVVARIQAEAFHEPVLFFNHFFYQFF
QVEVLSALIYRLKNYPSDRYACLVAEAESESWEEEYNFVGVVDVTVAGDLKVKRLLPAGEKEYLFVSGIAVSQTARRRKVATTLLKGCDMLAKIWGFKFLALSAYEDDYG
ARNLYSKAGYQVFSADPLWKSSWIGRKRCVTMIKHL